CN116555210A - Glycosyltransferases and their use in the preparation of rebaudioside E - Google Patents
Glycosyltransferases and their use in the preparation of rebaudioside E Download PDFInfo
- Publication number
- CN116555210A CN116555210A CN202210114711.6A CN202210114711A CN116555210A CN 116555210 A CN116555210 A CN 116555210A CN 202210114711 A CN202210114711 A CN 202210114711A CN 116555210 A CN116555210 A CN 116555210A
- Authority
- CN
- China
- Prior art keywords
- amino acid
- leu
- acid residue
- glu
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108700023372 Glycosyltransferases Proteins 0.000 title claims abstract description 61
- RLLCWNUIHGPAJY-RYBZXKSASA-N Rebaudioside E Natural products O=C(O[C@H]1[C@H](O[C@H]2[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O2)[C@@H](O)[C@@H](O)[C@H](CO)O1)[C@]1(C)[C@@H]2[C@@](C)([C@@H]3[C@@]4(CC(=C)[C@@](O[C@@H]5[C@@H](O[C@@H]6[C@@H](O)[C@H](O)[C@@H](O)[C@H](CO)O6)[C@H](O)[C@@H](O)[C@H](CO)O5)(C4)CC3)CC2)CCC1 RLLCWNUIHGPAJY-RYBZXKSASA-N 0.000 title claims abstract description 23
- RLLCWNUIHGPAJY-SFUUMPFESA-N rebaudioside E Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RLLCWNUIHGPAJY-SFUUMPFESA-N 0.000 title claims abstract description 22
- 238000002360 preparation method Methods 0.000 title claims abstract description 9
- 102000045442 glycosyltransferase activity proteins Human genes 0.000 title claims abstract description 6
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 title claims abstract description 6
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 137
- 102000051366 Glycosyltransferases Human genes 0.000 claims abstract description 55
- 108090000790 Enzymes Proteins 0.000 claims abstract description 28
- 102000004190 Enzymes Human genes 0.000 claims abstract description 27
- 238000000034 method Methods 0.000 claims abstract description 23
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 22
- HSCJRCZFDFQWRP-JZMIEXBBSA-N UDP-alpha-D-glucose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OP(O)(=O)OP(O)(=O)OC[C@@H]1[C@@H](O)[C@@H](O)[C@H](N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-JZMIEXBBSA-N 0.000 claims abstract description 13
- HSCJRCZFDFQWRP-UHFFFAOYSA-N Uridindiphosphoglukose Natural products OC1C(O)C(O)C(CO)OC1OP(O)(=O)OP(O)(=O)OCC1C(O)C(O)C(N2C(NC(=O)C=C2)=O)O1 HSCJRCZFDFQWRP-UHFFFAOYSA-N 0.000 claims abstract description 13
- 239000000348 glycosyl donor Substances 0.000 claims abstract description 9
- 238000012217 deletion Methods 0.000 claims abstract description 3
- 230000037430 deletion Effects 0.000 claims abstract description 3
- 108010043934 Sucrose synthase Proteins 0.000 claims description 36
- 235000019202 steviosides Nutrition 0.000 claims description 35
- UEDUENGHJMELGK-HYDKPPNVSA-N Stevioside Chemical group O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O UEDUENGHJMELGK-HYDKPPNVSA-N 0.000 claims description 26
- 229940013618 stevioside Drugs 0.000 claims description 26
- OHHNJQXIOPOJSC-UHFFFAOYSA-N stevioside Natural products CC1(CCCC2(C)C3(C)CCC4(CC3(CCC12C)CC4=C)OC5OC(CO)C(O)C(O)C5OC6OC(CO)C(O)C(O)C6O)C(=O)OC7OC(CO)C(O)C(O)C7O OHHNJQXIOPOJSC-UHFFFAOYSA-N 0.000 claims description 26
- 238000006243 chemical reaction Methods 0.000 claims description 19
- XTWYTFMLZFPYCI-KQYNXXCUSA-N 5'-adenylphosphoric acid Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O XTWYTFMLZFPYCI-KQYNXXCUSA-N 0.000 claims description 17
- XTWYTFMLZFPYCI-UHFFFAOYSA-N Adenosine diphosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(=O)OP(O)(O)=O)C(O)C1O XTWYTFMLZFPYCI-UHFFFAOYSA-N 0.000 claims description 17
- 229930006000 Sucrose Natural products 0.000 claims description 17
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 17
- 239000005720 sucrose Substances 0.000 claims description 17
- XCCTYIAWTASOJW-XVFCMESISA-N Uridine-5'-Diphosphate Chemical compound O[C@@H]1[C@H](O)[C@@H](COP(O)(=O)OP(O)(O)=O)O[C@H]1N1C(=O)NC(=O)C=C1 XCCTYIAWTASOJW-XVFCMESISA-N 0.000 claims description 10
- RPYRMTHVSUWHSV-CUZJHZIBSA-N rebaudioside D Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O RPYRMTHVSUWHSV-CUZJHZIBSA-N 0.000 claims description 10
- 125000003147 glycosyl group Chemical group 0.000 claims description 8
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 claims description 7
- 150000001413 amino acids Chemical class 0.000 claims description 7
- 230000015572 biosynthetic process Effects 0.000 claims description 7
- 239000008103 glucose Substances 0.000 claims description 7
- 108020004707 nucleic acids Proteins 0.000 claims description 6
- 102000039446 nucleic acids Human genes 0.000 claims description 6
- 150000007523 nucleic acids Chemical class 0.000 claims description 6
- 238000003786 synthesis reaction Methods 0.000 claims description 6
- 239000002773 nucleotide Substances 0.000 claims description 5
- 125000003729 nucleotide group Chemical group 0.000 claims description 5
- 238000012546 transfer Methods 0.000 claims description 5
- 238000012258 culturing Methods 0.000 claims description 4
- 239000013604 expression vector Substances 0.000 claims description 4
- 239000000937 glycosyl acceptor Substances 0.000 claims description 4
- 238000003259 recombinant expression Methods 0.000 claims description 4
- HELXLJCILKEWJH-UHFFFAOYSA-N entered according to Sigma 01432 Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC(C1OC2C(C(O)C(O)C(CO)O2)O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O HELXLJCILKEWJH-UHFFFAOYSA-N 0.000 claims description 3
- HELXLJCILKEWJH-NCGAPWICSA-N rebaudioside A Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(=O)O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O HELXLJCILKEWJH-NCGAPWICSA-N 0.000 claims description 3
- 239000001512 FEMA 4601 Substances 0.000 claims description 2
- 108010093096 Immobilized Enzymes Proteins 0.000 claims description 2
- HELXLJCILKEWJH-SEAGSNCFSA-N Rebaudioside A Natural products O=C(O[C@H]1[C@@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1)[C@@]1(C)[C@@H]2[C@](C)([C@H]3[C@@]4(CC(=C)[C@@](O[C@H]5[C@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@@H](O[C@H]6[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O6)[C@H](O)[C@@H](CO)O5)(C4)CC3)CC2)CCC1 HELXLJCILKEWJH-SEAGSNCFSA-N 0.000 claims description 2
- 238000000354 decomposition reaction Methods 0.000 claims description 2
- 235000019203 rebaudioside A Nutrition 0.000 claims description 2
- 238000004519 manufacturing process Methods 0.000 abstract description 7
- WFPZSXYXPSUOPY-ROYWQJLOSA-N ADP alpha-D-glucoside Chemical compound C([C@H]1O[C@H]([C@@H]([C@@H]1O)O)N1C=2N=CN=C(C=2N=C1)N)OP(O)(=O)OP(O)(=O)O[C@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O WFPZSXYXPSUOPY-ROYWQJLOSA-N 0.000 abstract description 6
- 230000003197 catalytic effect Effects 0.000 abstract description 6
- 101710096830 DNA-3-methyladenine glycosylase Proteins 0.000 abstract description 4
- 102100039128 DNA-3-methyladenine glycosylase Human genes 0.000 abstract description 4
- 238000009776 industrial production Methods 0.000 abstract description 4
- 230000002194 synthesizing effect Effects 0.000 abstract description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 33
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 26
- 108010050848 glycylleucine Proteins 0.000 description 20
- 108010034529 leucyl-lysine Proteins 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 13
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 12
- 239000000243 solution Substances 0.000 description 12
- 241000880493 Leptailurus serval Species 0.000 description 11
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 11
- 108010056582 methionylglutamic acid Proteins 0.000 description 11
- 108010070643 prolylglutamic acid Proteins 0.000 description 11
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 10
- 108010090461 DFG peptide Proteins 0.000 description 10
- QHGBCRCMBCWMBJ-UHFFFAOYSA-N Ile-Glu-Ala-Lys Natural products CCC(C)C(N)C(=O)NC(CCC(O)=O)C(=O)NC(C)C(=O)NC(C(O)=O)CCCCN QHGBCRCMBCWMBJ-UHFFFAOYSA-N 0.000 description 10
- 108010013835 arginine glutamate Proteins 0.000 description 10
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 10
- 239000013612 plasmid Substances 0.000 description 10
- 108010015796 prolylisoleucine Proteins 0.000 description 10
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 9
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 9
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 9
- 239000004383 Steviol glycoside Substances 0.000 description 9
- 108010081551 glycylphenylalanine Proteins 0.000 description 9
- 235000019411 steviol glycoside Nutrition 0.000 description 9
- 229930182488 steviol glycoside Natural products 0.000 description 9
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 8
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 8
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 8
- 108010028295 histidylhistidine Proteins 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- GZWOBWMOMPFPCD-CIUDSAMLSA-N Glu-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N GZWOBWMOMPFPCD-CIUDSAMLSA-N 0.000 description 7
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 7
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 7
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 7
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 7
- RXWPLVRJQNWXRQ-IHRRRGAJSA-N Met-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CNC=N1 RXWPLVRJQNWXRQ-IHRRRGAJSA-N 0.000 description 7
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 7
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 7
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 7
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 7
- WDIWOIRFNMLNKO-ULQDDVLXSA-N Val-Leu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WDIWOIRFNMLNKO-ULQDDVLXSA-N 0.000 description 7
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 7
- 108010081404 acein-2 Proteins 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 108010078144 glutaminyl-glycine Proteins 0.000 description 7
- 238000004128 high performance liquid chromatography Methods 0.000 description 7
- 108010048818 seryl-histidine Proteins 0.000 description 7
- 150000008144 steviol glycosides Chemical class 0.000 description 7
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 6
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 6
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 6
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 6
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 6
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 6
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 6
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 6
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 6
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 6
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 6
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 6
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 6
- KIJLEFNHWSXHRU-NUMRIWBASA-N Asp-Gln-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KIJLEFNHWSXHRU-NUMRIWBASA-N 0.000 description 6
- 241000588724 Escherichia coli Species 0.000 description 6
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 6
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 6
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 6
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 6
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 6
- XEKAJTCACGEBOK-KKUMJFAQSA-N Glu-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XEKAJTCACGEBOK-KKUMJFAQSA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 6
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 6
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 6
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 6
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 6
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 6
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 6
- OHOXVDFVRDGFND-YUMQZZPRSA-N His-Cys-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O OHOXVDFVRDGFND-YUMQZZPRSA-N 0.000 description 6
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 6
- YEKYGQZUBCRNGH-DCAQKATOSA-N His-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CN=CN2)N)C(=O)N[C@@H](CO)C(=O)O YEKYGQZUBCRNGH-DCAQKATOSA-N 0.000 description 6
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 6
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 6
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 6
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 6
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 6
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 6
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 6
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 6
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 6
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 6
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 6
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 6
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 6
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 6
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 6
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 6
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 6
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 6
- URBJRJKWSUFCKS-AVGNSLFASA-N Lys-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N URBJRJKWSUFCKS-AVGNSLFASA-N 0.000 description 6
- JYVCOTWSRGFABJ-DCAQKATOSA-N Lys-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N JYVCOTWSRGFABJ-DCAQKATOSA-N 0.000 description 6
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 6
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 6
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 6
- SXJGROGVINAYSH-AVGNSLFASA-N Phe-Gln-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SXJGROGVINAYSH-AVGNSLFASA-N 0.000 description 6
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 6
- NRKNYPRRWXVELC-NQCBNZPSSA-N Phe-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=CC=C3)N NRKNYPRRWXVELC-NQCBNZPSSA-N 0.000 description 6
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 6
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 6
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 6
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 6
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 6
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 6
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 6
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 6
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 6
- WFAUDCSNCWJJAA-KXNHARMFSA-N Thr-Lys-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(O)=O WFAUDCSNCWJJAA-KXNHARMFSA-N 0.000 description 6
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 6
- UKINEYBQXPMOJO-UBHSHLNASA-N Trp-Asn-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N UKINEYBQXPMOJO-UBHSHLNASA-N 0.000 description 6
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 6
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 6
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 6
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 6
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 6
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 6
- 108010044940 alanylglutamine Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 108010054813 diprotin B Proteins 0.000 description 6
- 108010064235 lysylglycine Proteins 0.000 description 6
- 108090000623 proteins and genes Proteins 0.000 description 6
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 6
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 5
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 5
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 5
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 5
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 5
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 5
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 5
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 5
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 5
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 5
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 5
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 5
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 5
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 5
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 5
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 5
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 5
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 5
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 5
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 5
- SBMNPABNWKXNBJ-BQBZGAKWSA-N Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CO SBMNPABNWKXNBJ-BQBZGAKWSA-N 0.000 description 5
- OGZRZMJASKKMJZ-XIRDDKMYSA-N Trp-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N OGZRZMJASKKMJZ-XIRDDKMYSA-N 0.000 description 5
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 5
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 5
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 5
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 5
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 108010085325 histidylproline Proteins 0.000 description 5
- 239000007788 liquid Substances 0.000 description 5
- 108010026333 seryl-proline Proteins 0.000 description 5
- -1 steviol glycoside compounds Chemical class 0.000 description 5
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 4
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 4
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 4
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 4
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 4
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 4
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 4
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 4
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 4
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 4
- UXIPUCUHQBIQOS-SRVKXCTJSA-N Asp-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UXIPUCUHQBIQOS-SRVKXCTJSA-N 0.000 description 4
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 4
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 4
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 4
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 4
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 4
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 4
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 4
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 4
- CBWKURKPYSLMJV-SOUVJXGZSA-N Glu-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CBWKURKPYSLMJV-SOUVJXGZSA-N 0.000 description 4
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 4
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 4
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- JKSMZVCGQWVTBW-STQMWFEESA-N Gly-Trp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O JKSMZVCGQWVTBW-STQMWFEESA-N 0.000 description 4
- ORERHHPZDDEMSC-VGDYDELISA-N His-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ORERHHPZDDEMSC-VGDYDELISA-N 0.000 description 4
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 4
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 4
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 4
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 4
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 4
- MGUTVMBNOMJLKC-VKOGCVSHSA-N Ile-Trp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C(C)C)C(=O)O)N MGUTVMBNOMJLKC-VKOGCVSHSA-N 0.000 description 4
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 4
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 4
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- KVOFSTUWVSQMDK-KKUMJFAQSA-N Leu-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 KVOFSTUWVSQMDK-KKUMJFAQSA-N 0.000 description 4
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 4
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 4
- NHRINZSPIUXYQZ-DCAQKATOSA-N Leu-Met-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)O)N NHRINZSPIUXYQZ-DCAQKATOSA-N 0.000 description 4
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 4
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 4
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 4
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 4
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 4
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 4
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 4
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 4
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 4
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 4
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 4
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 4
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 4
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 4
- UCTIUWKCVNGEFH-OBJOEFQTSA-N Pro-Val-Gly-Pro Chemical compound N([C@@H](C(C)C)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 UCTIUWKCVNGEFH-OBJOEFQTSA-N 0.000 description 4
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 4
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 4
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 4
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 4
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 4
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 4
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 4
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 4
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 4
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 4
- QGFPYRPIUXBYGR-YDHLFZDLSA-N Val-Asn-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N QGFPYRPIUXBYGR-YDHLFZDLSA-N 0.000 description 4
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 4
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 108010049041 glutamylalanine Proteins 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010009298 lysylglutamic acid Proteins 0.000 description 4
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 4
- 239000013600 plasmid vector Substances 0.000 description 4
- 108010077112 prolyl-proline Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 3
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 3
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 3
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 3
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- ZWNFOZNJYNDNGM-UBHSHLNASA-N Cys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N ZWNFOZNJYNDNGM-UBHSHLNASA-N 0.000 description 3
- YGNPTRVNRUKVLA-DCAQKATOSA-N Gln-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N YGNPTRVNRUKVLA-DCAQKATOSA-N 0.000 description 3
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 3
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 3
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 3
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 3
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 3
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 3
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 3
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 3
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 3
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 3
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 3
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 3
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 3
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 3
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 3
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 3
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 3
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 3
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 3
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 3
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 3
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 235000019658 bitter taste Nutrition 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 3
- 229930027917 kanamycin Natural products 0.000 description 3
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 3
- 229960000318 kanamycin Drugs 0.000 description 3
- 229930182823 kanamycin A Natural products 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- DEWWPUNXRNGMQN-LPEHRKFASA-N Ala-Met-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N DEWWPUNXRNGMQN-LPEHRKFASA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 2
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 2
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 2
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 2
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- KZEUVLLVULIPNX-GUBZILKMSA-N Gln-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N KZEUVLLVULIPNX-GUBZILKMSA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 2
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- NPMSEUWUMOSEFM-CIUDSAMLSA-N Glu-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N NPMSEUWUMOSEFM-CIUDSAMLSA-N 0.000 description 2
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 2
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 2
- OMNVOTCFQQLEQU-CIUDSAMLSA-N His-Asn-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMNVOTCFQQLEQU-CIUDSAMLSA-N 0.000 description 2
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 2
- ZXIGYKICRDFISM-DJFWLOJKSA-N Ile-His-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZXIGYKICRDFISM-DJFWLOJKSA-N 0.000 description 2
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 2
- SVZFKLBRCYCIIY-CYDGBPFRSA-N Ile-Pro-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SVZFKLBRCYCIIY-CYDGBPFRSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- QLESZRANMSYLCZ-CYDGBPFRSA-N Met-Pro-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QLESZRANMSYLCZ-CYDGBPFRSA-N 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000544066 Stevia Species 0.000 description 2
- 244000228451 Stevia rebaudiana Species 0.000 description 2
- 235000006092 Stevia rebaudiana Nutrition 0.000 description 2
- QFVOYBUQQBFCRH-UHFFFAOYSA-N Steviol Natural products C1CC2(C3)CC(=C)C3(O)CCC2C2(C)C1C(C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-UHFFFAOYSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 2
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- 108010005233 alanylglutamic acid Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 238000007036 catalytic synthesis reaction Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 229930182470 glycoside Natural products 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 238000000265 homogenisation Methods 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 230000014759 maintenance of location Effects 0.000 description 2
- 239000002184 metal Substances 0.000 description 2
- QSRAJVGDWKFOGU-WBXIDTKBSA-N rebaudioside c Chemical compound O[C@@H]1[C@H](O)[C@@H](O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]1(CC[C@H]2[C@@]3(C)[C@@H]([C@](CCC3)(C)C(=O)O[C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)CC3)C(=C)C[C@]23C1 QSRAJVGDWKFOGU-WBXIDTKBSA-N 0.000 description 2
- QFVOYBUQQBFCRH-VQSWZGCSSA-N steviol Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)CC1)C[C@H]2[C@@]2(C)[C@H]1[C@](C)(C(O)=O)CCC2 QFVOYBUQQBFCRH-VQSWZGCSSA-N 0.000 description 2
- 229940032084 steviol Drugs 0.000 description 2
- 235000000346 sugar Nutrition 0.000 description 2
- 235000019640 taste Nutrition 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- TWCMVXMQHSVIOJ-UHFFFAOYSA-N Aglycone of yadanzioside D Natural products COC(=O)C12OCC34C(CC5C(=CC(O)C(O)C5(C)C3C(O)C1O)C)OC(=O)C(OC(=O)C)C24 TWCMVXMQHSVIOJ-UHFFFAOYSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 1
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 1
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 1
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- XHFXZQHTLJVZBN-FXQIFTODSA-N Asn-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N XHFXZQHTLJVZBN-FXQIFTODSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 1
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 1
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- NAAAPCLFJPURAM-HJGDQZAQSA-N Asp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O NAAAPCLFJPURAM-HJGDQZAQSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- PLMKQQMDOMTZGG-UHFFFAOYSA-N Astrantiagenin E-methylester Natural products CC12CCC(O)C(C)(CO)C1CCC1(C)C2CC=C2C3CC(C)(C)CCC3(C(=O)OC)CCC21C PLMKQQMDOMTZGG-UHFFFAOYSA-N 0.000 description 1
- 241000954177 Bangana ariza Species 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- SDWZYDDNSMPBRM-AVGNSLFASA-N Cys-Gln-Phe Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SDWZYDDNSMPBRM-AVGNSLFASA-N 0.000 description 1
- 229930186291 Dulcoside Natural products 0.000 description 1
- 239000001776 FEMA 4720 Substances 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- 239000005715 Fructose Substances 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WIMVKDYAKRAUCG-IHRRRGAJSA-N Gln-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O WIMVKDYAKRAUCG-IHRRRGAJSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- ZOTGXWMKUFSKEU-QXEWZRGKSA-N Gly-Ile-Met Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O ZOTGXWMKUFSKEU-QXEWZRGKSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 1
- YXBRCTXAEYSCHS-XVYDVKMFSA-N His-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YXBRCTXAEYSCHS-XVYDVKMFSA-N 0.000 description 1
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 1
- FYVHHKMHFPMBBG-GUBZILKMSA-N His-Gln-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FYVHHKMHFPMBBG-GUBZILKMSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 1
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 1
- PBJOQLUVSGXRSW-YTQUADARSA-N His-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N)C(=O)O PBJOQLUVSGXRSW-YTQUADARSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 1
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- OAQJOXZPGHTJNA-NGTWOADLSA-N Ile-Trp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N OAQJOXZPGHTJNA-NGTWOADLSA-N 0.000 description 1
- 241000235058 Komagataella pastoris Species 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- IEIHKHYMBIYQTH-YESZJQIVSA-N Lys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCCN)N)C(=O)O IEIHKHYMBIYQTH-YESZJQIVSA-N 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- CHLJXFMOQGYDNH-SZMVWBNQSA-N Met-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 CHLJXFMOQGYDNH-SZMVWBNQSA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- LNXGEYIEEUZGGH-JYJNAYRXSA-N Met-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCSC)CC1=CC=CC=C1 LNXGEYIEEUZGGH-JYJNAYRXSA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 1
- KIEPQOIQHFKQLK-PCBIJLKTSA-N Phe-Asn-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KIEPQOIQHFKQLK-PCBIJLKTSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 1
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 1
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- GSPPWVHVBBSPSY-FHWLQOOXSA-N Pro-His-Trp Chemical compound OC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@H](Cc1cnc[nH]1)NC(=O)[C@@H]1CCCN1 GSPPWVHVBBSPSY-FHWLQOOXSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- GIPHUOWOTCAJSR-UHFFFAOYSA-N Rebaudioside A. Natural products C1CC2C3(C)CCCC(C)(C(=O)OC4C(C(O)C(O)C(CO)O4)O)C3CCC2(C2)CC(=C)C21OC1OC(CO)C(O)C(O)C1OC(C1O)OC(CO)C(O)C1OC1OC(CO)C(O)C(O)C1O GIPHUOWOTCAJSR-UHFFFAOYSA-N 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- OMHUCGDTACNQEX-OSHKXICASA-N Steviolbioside Natural products O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O OMHUCGDTACNQEX-OSHKXICASA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 1
- IWAVRIPRTCJAQO-HSHDSVGOSA-N Thr-Pro-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IWAVRIPRTCJAQO-HSHDSVGOSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 1
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- HXNVJPQADLRHGR-JBACZVJFSA-N Trp-Glu-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N HXNVJPQADLRHGR-JBACZVJFSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- NLWCSMOXNKBRLC-WDSOQIARSA-N Trp-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLWCSMOXNKBRLC-WDSOQIARSA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- AOLHUMAVONBBEZ-STQMWFEESA-N Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AOLHUMAVONBBEZ-STQMWFEESA-N 0.000 description 1
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- IPEODUGTOQPXKA-DXOBOGAASA-N [(2r,3s,4r,5r)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methyl phosphono hydrogen phosphate;(2r,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanal Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C=O.C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(=O)OP(O)(O)=O)[C@@H](O)[C@H]1O IPEODUGTOQPXKA-DXOBOGAASA-N 0.000 description 1
- 239000000370 acceptor Substances 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010070944 alanylhistidine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010038633 aspartylglutamate Proteins 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 238000010523 cascade reaction Methods 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- JLPRGBMUVNVSKP-AHUXISJXSA-M chembl2368336 Chemical compound [Na+].O([C@H]1[C@@H](O)[C@H](O)[C@H](CO)O[C@H]1O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C([O-])=O)[C@@H]1O[C@@H](CO)[C@@H](O)[C@H](O)[C@@H]1O JLPRGBMUVNVSKP-AHUXISJXSA-M 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000010612 desalination reaction Methods 0.000 description 1
- 230000001066 destructive effect Effects 0.000 description 1
- 150000002016 disaccharides Chemical class 0.000 description 1
- 239000000386 donor Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 150000002338 glycosides Chemical class 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010018006 histidylserine Proteins 0.000 description 1
- PFOARMALXZGCHY-UHFFFAOYSA-N homoegonol Natural products C1=C(OC)C(OC)=CC=C1C1=CC2=CC(CCCO)=CC(OC)=C2O1 PFOARMALXZGCHY-UHFFFAOYSA-N 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 150000002772 monosaccharides Chemical class 0.000 description 1
- 235000021096 natural sweeteners Nutrition 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 239000008055 phosphate buffer solution Substances 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000001953 recrystallisation Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 108010029895 rubimetide Proteins 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000006276 transfer reaction Methods 0.000 description 1
- DRSKVOAJKLUMCL-MMUIXFKXSA-N u2n4xkx7hp Chemical compound O([C@H]1[C@H](O)[C@@H](CO)O[C@H]([C@@H]1O[C@H]1[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)O[C@]12C(=C)C[C@@]3(C1)CC[C@@H]1[C@@](C)(CCC[C@]1([C@@H]3CC2)C)C(O)=O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1O DRSKVOAJKLUMCL-MMUIXFKXSA-N 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 239000002351 wastewater Substances 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1051—Hexosyltransferases (2.4.1)
- C12N9/1062—Sucrose synthase (2.4.1.13)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P33/00—Preparation of steroids
- C12P33/20—Preparation of steroids containing heterocyclic rings
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/01—Hexosyltransferases (2.4.1)
- C12Y204/01013—Sucrose synthase (2.4.1.13)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P20/00—Technologies relating to chemical industry
- Y02P20/50—Improvements relating to the production of bulk chemicals
- Y02P20/55—Design of synthesis routes, e.g. reducing the use of auxiliary or protecting groups
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Genetics & Genomics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
The invention discloses glycosyltransferases and their use in the preparation of rebaudioside E. The amino acid sequence of the glycosyltransferase of the invention comprises at least the amino acid residue differences compared to SEQ ID NO. 4 selected from the group consisting of: (1) Amino acid residues 429, 433, 435, 446, 448 and 449; (2) deletion of amino acid residues 1 to 8; amino acid residues at positions 9, 10, 11, 12, 16, 22, 23, 26, 27, 31, 42, 46, 49, 54, 56, 58, 64, 65, 70, 73, 96 and 106. The glycosyltransferase with good catalytic effect and good stability is screened out, the process condition for synthesizing the Reb E by an enzyme method is optimized, and the industrial production of the Reb E is facilitated; solves the problem of high price of glycosyl donor UDPG/ADPG, and reduces the production cost.
Description
Technical Field
The invention belongs to the field of biosynthesis, and particularly relates to glycosyltransferase and application thereof in preparing rebaudioside E.
Background
Stevioside (Steviol glycosides, also known as steviol glycoside) is a natural sweetener extracted from leaves of stevia rebaudiana Bertoni of Compositae, and is 10% -20% of dry weight of leaves, and is a mixture of various glycosides. Stevioside has the advantages of pure nature (from pure natural plant stevia), high sweetness, low calorie, economy in use, good stability, high safety and the like, but has the defect of bitter taste after the use, and limits the application of stevioside in the fields of food, beverage and the like. The intrinsic reason for the bitter taste after steviol glycoside is that the intrinsic molecular structure of steviol glycoside causes that the more the number of the sugar groups connected on R1 and R2 groups in steviol glycoside, the better the taste.
Steviol glycosides (steviol glycoside compounds) have the following structural formula:
the compounds corresponding to the substituents are shown in Table 1.
TABLE 1 steviol glycosides isolated from stevia rebaudiana
The steviol glycoside compounds have a common aglycone: steviol (Steviol), differing in the number and type of glycosyl groups attached at the C-13 and C-19 positions, mainly includes Stevioside (Stevioside), rebaudioside a (rebaudiosid a, reba a, RA), rebaudioside B, rebaudioside C, rebaudioside D, rebaudioside E (rebaudiosid E, RE), dulcoside, steviolbioside, etc.
The taste of the rebaudioside E is free from bitter taste, the sweetness is similar to that of sucrose, the structure of the rebaudioside E is more than that of stevioside on a C19 side chain on a framework, but the content of the rebaudioside E in dry leaves of the stevia is very small (much less than 1 percent), and the rebaudioside E is directly separated from the stevioside by a conventional physical means, so that the difficulty is high and the yield is very low. In addition, the process for enriching the rebaudioside E is complicated, multiple column passes, desalination, decoloration and recrystallization are needed after extraction, and a large amount of wastewater is generated in the production process, so that the method has high production cost and is not suitable for industrial mass production.
The bioconversion method mainly uses glycosyltransferase to catalyze glycosyl to transfer from an activated donor molecule to an acceptor molecule, so as to generate various glycoside compounds. Common glycosyl donors include monosaccharides, disaccharides, polysaccharides, phosphate sugars, uridine diphosphate-glucose, and the like. The glycosyl donor for preparing rebaudioside E by the bioconversion method is generally uridine diphosphate-glucose (UDP-glucose) or adenosine diphosphate-glucose (ADP-glucose), but the glycosyl donor is expensive, and the catalytic activity of glycosyltransferase (such as beta-1, 2-glycosyltransferase) is relatively low, so that the production cost is high, and the production efficiency is low.
Disclosure of Invention
The invention aims to overcome the defect of low enzyme activity when the existing glycosyltransferase is applied to biocatalysis preparation of Reb E, and provides the glycosyltransferase and the application thereof in preparation of rebaudioside E. The glycosyltransferase has good catalytic effect and stability, and can be used for synthesizing Reb E by combining with sucrose synthase (SUS) to realize cascade reaction; meanwhile, the regeneration of UDPG/ADPG is realized through sucrose and UDP/ADP, so that the problem of high price of glycosyl donor UDPG/ADPG is solved; and further optimizes the process conditions for synthesizing the Reb E by enzyme catalysis, provides more choices for optimizing the process conditions for realizing large-scale industrial production, and is favorable for realizing industrial production.
The invention solves the technical problems by the following technical proposal:
in a first aspect the present invention provides a glycosyltransferase having an amino acid sequence which comprises at least the amino acid residue differences compared to SEQ ID NO. 4 selected from the group consisting of:
(1) Having one or more of the following amino acid residue differences:
the amino acid residue at position 429 is D;
the amino acid residue at position 433 is D;
the amino acid residue at position 435 is V;
the amino acid residue at position 446 is S;
the amino acid residue at position 448 is K; and
the 449 amino acid residue is S;
(2) Having one or more of the following amino acid residue differences:
deletion of amino acid residues at positions 1 to 8;
the amino acid residue at position 9 is M;
the amino acid residue at position 10 is A;
the amino acid residue at position 11 is T;
the amino acid residue at position 12 is N;
the amino acid residue at position 16 is L;
the amino acid residue at position 22 is A;
the amino acid residue at position 23 is Y;
the amino acid residue at position 26 is I;
the amino acid residue at position 27 is S;
the amino acid residue at position 31 is N;
the amino acid residue at position 42 is L;
the amino acid residue at position 46 is C;
the amino acid residue at position 49 is R;
the amino acid residue at position 54 is S;
amino acid residue at position 56 is I;
the amino acid residue at position 58 is K;
the amino acid residue at position 64 is A;
the amino acid residue at position 65 is D;
the amino acid residue at position 70 is I;
the amino acid residue at position 73 is Q;
the amino acid residue at position 96 is P; and
the amino acid residue at position 106 is K.
In some embodiments of the invention, in (1), the amino acid sequence of the glycosyltransferase further comprises one or more amino acid residue differences compared to SEQ ID NO. 4 selected from the group consisting of:
the amino acid residue at position 399 is E;
the amino acid residue at position 400 is A;
the amino acid residue at position 403 is S;
the amino acid residue at position 405 is V;
the amino acid residue at position 406 is T;
the amino acid residue at position 408 is E;
the amino acid residue at position 419 is E;
the 422 th amino acid residue is K;
the amino acid residue at position 423 is N;
the amino acid residue at position 425 is K;
the amino acid residue at position 426 is S; and
the amino acid residue at position 427 is I.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase further comprises one or more amino acid residue differences compared to SEQ ID NO. 4 selected from the group consisting of:
the amino acid residue at position 373 is K;
amino acid residue at position 375 is M;
the amino acid residue at position 385 is V;
the amino acid residue at position 388 is D;
the amino acid residue at position 391 is K;
the amino acid residue at position 392 is I; and
the amino acid residue at position 395 is G.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase further comprises amino acid residue differences at one or more residue positions selected from the group consisting of:
amino acid residue 309 is E;
amino acid residue at position 315 is I;
the amino acid residue at position 317 is E;
the amino acid residue at position 324 is K;
the amino acid residue at position 325 is F;
the amino acid residue at position 326 is A;
the amino acid residue at position 329 is P;
amino acid residue at position 330 is R;
amino acid residue at position 364 is I;
the amino acid residue at position 365 is H; and
the amino acid residue at position 366 is N.
In some embodiments of the invention, the glycosyltransferase does not comprise one or more amino acid differences from positions 210 to 257 as compared to SEQ ID NO. 4.
In some embodiments of the invention, the glycosyltransferase does not comprise one or more amino acid residue differences from position 259 to position 306 as compared to SEQ ID NO. 4.
In some embodiments of the invention, the glycosyltransferase does not comprise one or more amino acid residue differences from positions 111 to 202 as compared to SEQ ID NO. 4.
In some embodiments of the invention, the glycosyltransferase does not comprise one or more amino acid residue differences from position 259 to position 306 and from position 111 to position 202 as compared to SEQ ID NO. 4.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase is shown as SEQ ID NO. 32.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase is shown in SEQ ID NO. 38.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase is shown as SEQ ID NO. 30.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase is shown as SEQ ID NO. 28.
In some embodiments of the invention, the amino acid sequence of the glycosyltransferase is shown as SEQ ID NO. 36.
In a second aspect the invention provides an isolated nucleic acid encoding a glycosyltransferase according to the first aspect.
In a third aspect the present invention provides a recombinant expression vector comprising a nucleic acid as described in the second aspect.
In a fourth aspect the present invention provides a transformant comprising a nucleic acid as described in the second aspect or a recombinant expression vector as described in the third aspect.
In a fifth aspect the present invention provides a method of preparing a glycosyltransferase according to the first aspect, the method comprising culturing a transformant according to the fourth aspect under conditions suitable for expression of the glycosyltransferase.
A sixth aspect of the invention provides a method of preparing rebaudioside E, the method comprising: glycosyltransferases transfer a glycosyl group on an activated glycosyl donor to a glycosyl acceptor;
wherein the glycosyltransferase is as described in the first aspect; the glycosyl acceptor is stevioside; the glycosyl donor is uridine diphosphate glucose and/or adenosine diphosphate glucose.
In some embodiments of the invention, the uridine diphosphate glucose and/or adenosine diphosphate glucose are produced by the destructive synthesis of sucrose.
In the invention, the decomposition and synthesis of sucrose means: in the presence of uridine diphosphate and/or adenosine diphosphate, a molecule of sucrose is decomposed by sucrose synthase to produce a molecule of fructose and a molecule of uridine diphosphate glucose and/or adenosine diphosphate glucose.
In some embodiments of the invention, the amino acid sequence of the sucrose synthase is as shown in SEQ ID NO. 24; the nucleotide sequence encoding the sucrose synthase is preferably as shown in SEQ ID NO. 23.
In some embodiments of the invention, the glycosyltransferase and the sucrose synthase are used in the form of a crude enzyme solution, a pure enzyme, an immobilized enzyme, or a cell expressing the glycosyltransferase and the sucrose synthase.
In the present invention, the host cell of the cell expressing the glycosyltransferase and the sucrose synthase may be conventional in the art, e.g., e.coli; the person skilled in the art can culture the cells and obtain glycosyltransferases and sucrose synthases by conventional means.
In some embodiments of the invention, the mass ratio of cells expressing the glycosyltransferase to stevioside is 3 (9-30), preferably 3:20.
In some embodiments of the invention, the mass ratio of the cell expressing the sucrose synthase to sucrose is 3 (150-300), preferably 3:200.
In some embodiments of the invention, the mass ratio of sucrose to stevioside is (0.5-3): 1, preferably 2:1.
In some embodiments of the invention, the sucrose to glucose uridine diphosphate or glucose adenosine diphosphate mass ratio is (500-3000): 1, preferably 2000:1.
In some embodiments of the invention, the method uses a reaction system with stevioside concentration of 50-250 g/L, pH of 5-8 and reaction temperature of 20-90 ℃.
In some embodiments of the invention, the reaction system comprises 1.5mL of glycosyltransferase, 0.3mL of sucrose synthase, 2g of sucrose, 1g of stevioside, 1mg of uridine diphosphate or adenosine diphosphate, pH 5.5, and reaction temperature of 60℃per 10mL of reaction system.
In a seventh aspect the present invention provides an enzyme combination comprising a glycosyltransferase according to the first aspect and a sucrose synthase having an amino acid sequence as shown in SEQ ID NO. 24.
In some embodiments of the invention, the sucrose synthase and the glycosyltransferase are used in a mass ratio of 1 (3-10), preferably in a mass ratio of 1:5.
In some embodiments of the invention, the nucleotide sequence encoding the sucrose synthase is shown in SEQ ID NO. 23.
An eighth aspect of the invention provides the use of a glycosyltransferase as described in the first aspect or an enzyme combination as described in the seventh aspect for the preparation of rebaudioside D or rebaudioside E.
In some embodiments of the invention, the rebaudioside D is produced by rebaudioside a.
In some embodiments of the invention, the rebaudioside E is produced by stevioside.
On the basis of conforming to the common knowledge in the field, the above preferred conditions can be arbitrarily combined to obtain the preferred examples of the invention.
The reagents and materials used in the present invention are commercially available.
The invention has the positive progress effects that:
the glycosyltransferase with good catalytic effect and good stability is screened out, the process condition for synthesizing the Reb E by an enzyme method is optimized, and the industrial production of the Reb E is facilitated; solves the problem of high price of glycosyl donor UDPG/ADPG, and reduces the production cost.
Drawings
FIG. 1 shows a synthetic route for preparing rebaudioside E from stevioside.
FIG. 2 shows a graph cut of the results of the HPLC detection method of the present invention, with a retention time of 12.761min for stevioside control.
FIG. 3 shows a profile screenshot of the results using the HPLC detection method of the present invention with a rebaudioside E control retention time of 11.757min.
FIG. 4 is a plot of the HPLC plot of the experimental results of the Enz.5 catalytic synthesis of Reb E in example 5.
Detailed Description
The invention is further illustrated by means of the following examples, which are not intended to limit the scope of the invention. The experimental methods, in which specific conditions are not noted in the following examples, were selected according to conventional methods and conditions, or according to the commercial specifications.
The experimental methods in the invention are all conventional methods unless otherwise specified, and specific reference is made to the "molecular cloning Experimental guidelines" by J.Sam Broker et al for gene cloning operations.
Amino acid shorthand symbols in the invention are conventional in the art unless otherwise specified, and amino acids corresponding to specific shorthand symbols are shown in table 2.
TABLE 2 amino acid alphabet
The codons corresponding to the amino acids are also conventional in the art, and the correspondence of specific amino acids to codons is shown in table 3.
TABLE 3 amino acid codon table
The route of the invention is schematically shown in figure 1.
KOD Mix enzyme was purchased from TOYOBO CO. LTD.DpnI enzyme was purchased from Yingwei Jiegui (Shanghai) trade Co., ltd; competent cells of E.coli Trans10 and E.coli BL21 (DE 3) were purchased from Beijing Ding Guo Changchun Biotechnology Limited. The reaction substrate stevioside was purchased from pichia pastoris (purity 95%). Sucrose was purchased from biological engineering (Shanghai) Inc. Reb E controls were purchased from shanghai source leaf biotechnology limited.
Conversion HPLC detection method: chromatographic column: ZORBAXEclipse plus C18 (4.6 mm. Times.150 mm,3.5 μm). Mobile phase: the aqueous 0.1% TFA solution was mobile phase A and the acetonitrile 0.1% TFA solution was mobile phase B, and the gradient elution was performed as shown in Table 4 below. Detection wavelength: 210nm; flow rate: 1mL/min; sample injection volume: 20. Mu.L; column temperature: 35 ℃. As shown in fig. 2, stevioside peak time: 12.761min; as shown in fig. 3, reb E peak time: 11.757min.
TABLE 4 gradient elution
Time(min) | A% | B% |
0.00 | 90 | 10 |
15.00 | 60 | 40 |
20.00 | 0 | 100 |
24.00 | 0 | 100 |
24.10 | 90 | 10 |
32.00 | 90 | 10 |
EXAMPLE 1 construction of library of beta-1, 2-glycosyltransferase mutants
The beta-1, 2-glycosyltransferase (beta-1, 2-GT enzyme) gene with the number of Enz.1 shown in SEQ ID NO. 1 is totally synthesized and is connected to a pET28a plasmid vector to obtain a recombinant plasmid pET28a-Enz.1, and a gene synthesis company is a biological engineering (Shanghai) stock company (Shanghai Songjiang region Min road 698). The amino acid sequence of Enz.1 is shown as SEQ ID NO. 2.
The beta-1, 2-glycosyltransferase (beta-1, 2-GT enzyme) enzyme gene with the number of Enz.2 shown in SEQ ID NO. 3 is totally synthesized and is connected to a pET28a plasmid vector to obtain a recombinant plasmid pET28a-Enz.2, and a gene synthesis company is a biological engineering (Shanghai) stock company (Shanghai Songjiang region Min road 698). The amino acid sequence of Enz.2 is shown in SEQ ID NO. 4.
PCR amplification was performed using pET28a-Enz.1 plasmid as a template, and the primer sequences Enz.X-F (X=3 to 10) and Km-R in Table 5, respectively, to obtain fragment one; PCR was performed using pET28a-Enz.2 plasmid as a template, and the primer sequences Enz.X-R (X=3 to 10) and Km-F in Table 5 were used to obtain fragment two. Fragments one and two were recombined using a norvirally homologous recombinase (Exnase II,5 xce II) and ligated into pET28a plasmid vector. After ligation, transformed into E.coli Trans10 competent cells, plated on LB medium containing 50. Mu.g/mL of kananamycin, and cultured overnight at 37 ℃; and (3) picking single colonies to an LB test tube (Km resistance), culturing for 8-10 hours, extracting plasmids, carrying out sequencing conversion and sequencing verification, and obtaining recombinant plasmids pET28 a-Enz.3-pET 28a-Enz.10 of each mutant.
TABLE 5 primer sequence listing
In the table: f is a forward primer, and R is a reverse primer.
The PCR amplification reaction system is as follows:
KOD Mix:25μL
ddH 2 O:20μL
primer: 2 mu L2
And (3) a template: 1 mu L
The amplification procedure was as follows:
(1)98℃3min
(2)98℃10s
(3)55℃5s
(4)68℃5s/kbp
(5)68℃5min
(6) Heat preservation at 12 DEG C
(2) And (4) circulating 34 times.
Example 2 preparation of beta-1, 2-glycosyltransferase
1. Protein expression:
the recombinant plasmids (pET 28 a-Enz.1-pET 28 a-Enz.10) with correct sequencing described in example 1 were transformed into competent cells of the host E.coli BL21 (DE 3) respectively, and genetically engineered strains containing the recombinant plasmids were obtained. Single colonies were individually picked and inoculated into 5mL LB liquid medium containing 50. Mu.g/mL kanamycin, and shake-cultured at 37℃for 4 hours. Transfer to 50mL fresh TB liquid medium also containing 50. Mu.g/mL kanamycin at 2% (v/v) inoculum size, shake culture to OD at 37 ℃ 600 When the concentration reaches about 0.8, IPTG (isopropyl-. Beta. -D-thiogalactoside) is added to the final concentration of 0.1mM, and the culture is induced at 25℃for 20 hours. After the completion of the culture, the culture broth was centrifuged at 4000rpm for 20 minutes, and the supernatant was discarded to collect the cells. Preserving at-20 ℃ for standby.
2. Obtaining crude enzyme liquid:
50mM Phosphate Buffer (PBS) having pH of 5.5 was prepared, and the cells obtained above were suspended at a ratio of 1:10 (M/V, g/mL), and homogenized by a high-pressure homogenizer (550 Mbar homogenization for 1.5 min); and (3) respectively centrifuging the homogenized enzyme solutions at 12000rpm for 2min to obtain crude enzyme solutions of the beta-1, 2-glycosyltransferase.
EXAMPLE 3 preparation of sucrose synthase SUS
The sucrose synthase (SUS) gene shown in SEQ ID NO. 23 was synthesized and ligated to the pET28a plasmid vector to obtain recombinant plasmid pET28a-SUS. The gene synthesis company is biological engineering (Shanghai) stock limited company (Shanghai city, songjiang region Min Ji Lu 698).
Plasmid pET28a-SUS is transformed into host E.coli BL21 (DE 3) competent cells to obtain the engineering strain containing sucrose synthase gene. Single colonies were picked and inoculated into 5mL LB liquid medium containing 50. Mu.g/mL kanamycinShake culturing for 4 hr at 37deg.C. Transfer to 50mL fresh TB liquid medium also containing 50. Mu.g/mL kanamycin at 2% (v/v) inoculum size, shake culture to OD at 37 ℃ 600 When about 0.8 was reached, IPTG was added to a final concentration of 0.1mM and the culture was induced at 25℃for 20 hours. After the completion of the culture, the culture broth was centrifuged at 4000rpm for 20 minutes, and the supernatant was discarded to collect the cells. Preserving at-20 ℃ for standby.
50mM phosphate buffer solution with pH of 5.5 is prepared, the bacterial cells obtained above are suspended according to the ratio of 1:10 (M/V, g/mL), high-pressure homogenization is carried out (550 Mbar, 1.5 min), and then the sucrose synthase crude enzyme solution is obtained after centrifugation at 12000rpm for 2 min.
EXAMPLE 4 screening of beta-1, 2-glycosyltransferase mutants
In a 1mL reaction system, 150. Mu.L of the crude enzyme solution of the beta-1, 2-glycosyltransferase prepared in example 2, 100g/L of Stevioside (STV) and 0.1g/L of UDP/ADP are added, 200g/L of sucrose and 30. Mu.L of the crude enzyme solution of the sucrose synthase are added, and finally 50mM of PBS with pH of 6.0 is added to a final volume of 1mL. The prepared reaction system was placed in a metal bath, reacted at 60℃and 600rpm for 30 minutes, 10. Mu.L of the reaction solution was added to 990. Mu.L of hydrochloric acid having pH2-3, vortexed, centrifuged at 13000rpm for 10 minutes, and the supernatant was analyzed for the concentration of Reb E by HPLC. The experimental results obtained using the HPLC detection method of the present invention are shown in Table 6.
TABLE 6 screening of beta-1, 2-glycosyltransferase mutants
Enzyme numbering | Nucleotide sequence | Amino acid sequence | Reb E%(ADP) | Reb E%(UDP) |
Enz.1 (control) | 1 | 2 | 2.327 | 5.501 |
Enz.2 (control) | 3 | 4 | 70.84 | 33.209 |
Enz.3 | 25 | 26 | 45.26 | 33.232 |
Enz.4 | 27 | 28 | 66.242 | 35.340 |
Enz.5 | 29 | 30 | 71.663 | 42.059 |
Enz.6 | 31 | 32 | 71.465 | 35.349 |
Enz.7 | 33 | 34 | 4.032 | 33.098 |
Enz.8 | 35 | 36 | 9.232 | 44.386 |
Enz.9 | 37 | 38 | 16.527 | 44.190 |
Enz.10 | 39 | 40 | 0.848 | 2.217 |
From the preliminary screening results in table 6, it can be seen that: when ADP is used as a sucrose synthase catalytic substrate, the activity of Enz.5 is the highest, and Enz.6 times; when UDP is used as a substrate for catalyzing sucrose synthase, the activity of Enz.8 is best, enz.9 times, the activity of Enz.5 is close to that of Enz.8 and Enz.9, and the catalysis effect of Enz.4 is better than that of the enzyme of a control group. From experimental data, the activity of Enz.5 for catalyzing ADP as a substrate or UDP as a substrate is relatively high, and when ADP is used as a sucrose synthase for catalyzing the substrate, the yield of the Enz.5 for catalyzing and generating the Reb E is far higher than that when UDP is used as the substrate. Thus, ADP is used as a catalytic substrate of sucrose synthase, enz.5 catalyzes glycosyl transfer reaction to prepare rebaudioside E.
EXAMPLE 5 enzyme Enz.5 catalytic Synthesis of Reb E
In a 10mL reaction system, 1.5mL of the crude enzyme solution of beta-1, 2-glycosyltransferase Enz.5 prepared by the method described in example 2, 0.3mL of the crude enzyme solution of sucrose synthase prepared by the method described in example 3, 100g/L stevioside, 200g/L sucrose and 0.1g/L ADP were added, and finally 50mM PBS with pH 5.5 was added to a final volume of 10mL. The prepared reaction system was placed in a metal bath, reacted at 60℃and 600rpm for 7 hours, 10. Mu.L of the reaction solution was added to 990. Mu.L of hydrochloric acid having pH2-3, vortexed, centrifuged at 13000rpm for 10 minutes, and the supernatant was analyzed for the concentration of Reb E by HPLC, as shown in FIG. 4, and the concentration of Reb E was 68.62%.
SEQUENCE LISTING
<110> chess Ke Lai Biotechnology (Shanghai) stock Co., ltd
<120> glycosyltransferase and its use in preparing rebaudioside E
<130> P210110234C
<160> 40
<170> PatentIn version 3.5
<210> 1
<211> 1329
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.1
<400> 1
atggcgacca acctgcgtgt tctgatgttc ccgtggctgg cgtacggcca catcagcccg 60
ttcctgaaca tcgcgaaaca gctggcggat cgtggtttcc tgatctatct gtgctccacc 120
cgcatcaacc tggaatctat catcaagaaa atcccggaaa aatacgcgga ttctatccat 180
ctgatcgaac ttcagctgcc ggagctgccg gaactgccgc cgcactatca caccactaac 240
ggtctgccgc cgcatctgaa cccgaccctg cacaaagcgc tgaaaatgtc taaaccgaac 300
ttcagccgca tcttgcagaa cctgaaaccg gacctgctga tctacgatgt gctccagccg 360
tgggcggaac acgtggcgaa cgaacagggc atcccggctg gcaaactgct ggtttcttgc 420
gcggcggttt tctcctactt tttctctttc cgtaaaaatc cgggcgttga atttccgttc 480
ccggcgatcc acctgccgga agtggaaaaa gttaaaatcc gtgaaatcct ggctaaagaa 540
ccggaagaag gcggccgtct ggacgaaggc aacaaacaga tgatgctgat gtgcacttct 600
cgtaccattg aagctaaata cattgattac tgcaccgaac tgtgcaactg gaaagttgtt 660
ccggttggtc cgccgttcca ggatctgatc actaacgatg cggataacaa agaactgatc 720
gattggctgg gcaccaaacc ggaaaactcc accgtgttcg ttagcttcgg ctccgaatac 780
ttcctgagca aagaagatat ggaagaaatt gctttcgctc tggaagcatc taacgttaac 840
ttcatctggg ttgtgcgttt cccgaaaggc gaagaacgta acctggaaga tgcactgccg 900
gaaggcttcc tggaacgtat tggtgaacgt ggtcgcgttc tggacaaatt cgcgccgcag 960
ccgcgcatcc tgaaccaccc gagcaccggc ggtttcatct ctcactgcgg ttggaacagc 1020
gttatggaaa gcatcgactt cggtgtgccg atcatcgcga tgccgatcca caacgatcag 1080
ccgatcaacg ctaaactgat ggttgaactg ggcgttgcgg ttgaaatcgt tcgtgatgat 1140
gatggtaaaa tccaccgcgg cgaaatcgcg gaagcactga aaagcgttgt gaccggtgaa 1200
accggcgaaa tcctgcgtgc gaaagttcgt gaaatcagca aaaacctgaa atccatccgt 1260
gacgaagaaa tggacgcggt tgctgaagaa ctgatccagc tgtgccgtaa ctctaacaaa 1320
agcaaataa 1329
<210> 2
<211> 442
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.1
<400> 2
Met Ala Thr Asn Leu Arg Val Leu Met Phe Pro Trp Leu Ala Tyr Gly
1 5 10 15
His Ile Ser Pro Phe Leu Asn Ile Ala Lys Gln Leu Ala Asp Arg Gly
20 25 30
Phe Leu Ile Tyr Leu Cys Ser Thr Arg Ile Asn Leu Glu Ser Ile Ile
35 40 45
Lys Lys Ile Pro Glu Lys Tyr Ala Asp Ser Ile His Leu Ile Glu Leu
50 55 60
Gln Leu Pro Glu Leu Pro Glu Leu Pro Pro His Tyr His Thr Thr Asn
65 70 75 80
Gly Leu Pro Pro His Leu Asn Pro Thr Leu His Lys Ala Leu Lys Met
85 90 95
Ser Lys Pro Asn Phe Ser Arg Ile Leu Gln Asn Leu Lys Pro Asp Leu
100 105 110
Leu Ile Tyr Asp Val Leu Gln Pro Trp Ala Glu His Val Ala Asn Glu
115 120 125
Gln Gly Ile Pro Ala Gly Lys Leu Leu Val Ser Cys Ala Ala Val Phe
130 135 140
Ser Tyr Phe Phe Ser Phe Arg Lys Asn Pro Gly Val Glu Phe Pro Phe
145 150 155 160
Pro Ala Ile His Leu Pro Glu Val Glu Lys Val Lys Ile Arg Glu Ile
165 170 175
Leu Ala Lys Glu Pro Glu Glu Gly Gly Arg Leu Asp Glu Gly Asn Lys
180 185 190
Gln Met Met Leu Met Cys Thr Ser Arg Thr Ile Glu Ala Lys Tyr Ile
195 200 205
Asp Tyr Cys Thr Glu Leu Cys Asn Trp Lys Val Val Pro Val Gly Pro
210 215 220
Pro Phe Gln Asp Leu Ile Thr Asn Asp Ala Asp Asn Lys Glu Leu Ile
225 230 235 240
Asp Trp Leu Gly Thr Lys Pro Glu Asn Ser Thr Val Phe Val Ser Phe
245 250 255
Gly Ser Glu Tyr Phe Leu Ser Lys Glu Asp Met Glu Glu Ile Ala Phe
260 265 270
Ala Leu Glu Ala Ser Asn Val Asn Phe Ile Trp Val Val Arg Phe Pro
275 280 285
Lys Gly Glu Glu Arg Asn Leu Glu Asp Ala Leu Pro Glu Gly Phe Leu
290 295 300
Glu Arg Ile Gly Glu Arg Gly Arg Val Leu Asp Lys Phe Ala Pro Gln
305 310 315 320
Pro Arg Ile Leu Asn His Pro Ser Thr Gly Gly Phe Ile Ser His Cys
325 330 335
Gly Trp Asn Ser Val Met Glu Ser Ile Asp Phe Gly Val Pro Ile Ile
340 345 350
Ala Met Pro Ile His Asn Asp Gln Pro Ile Asn Ala Lys Leu Met Val
355 360 365
Glu Leu Gly Val Ala Val Glu Ile Val Arg Asp Asp Asp Gly Lys Ile
370 375 380
His Arg Gly Glu Ile Ala Glu Ala Leu Lys Ser Val Val Thr Gly Glu
385 390 395 400
Thr Gly Glu Ile Leu Arg Ala Lys Val Arg Glu Ile Ser Lys Asn Leu
405 410 415
Lys Ser Ile Arg Asp Glu Glu Met Asp Ala Val Ala Glu Glu Leu Ile
420 425 430
Gln Leu Cys Arg Asn Ser Asn Lys Ser Lys
435 440
<210> 3
<211> 1350
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.2
<400> 3
atgcaccatc atcatgaagg cgtgagcgac cagaccctga gagtaacgat gtttccgtgg 60
cttgggctgg gtcatgttaa cccgtttttg cgtatcgcta aacaactggc cgatcgtggt 120
ttcgttatct atttagttag taccgctatt aacctcgaaa tgatcaaaaa gagaatcccg 180
gagaaataca gtaatagcat ccatctggtt gagctgcgcc tgccagaatt accggaactg 240
ccaccacatt accatactac caacggttta ccaccgcatc tgaacaaaac cctgcacaag 300
gcactgaaga tgagcgctcc caactttagc aagatccttc aaaatattaa gccggacctg 360
gtcctttacg attttctggt tccgtgggca gaaaaagtcg cgcttgaaca gggcatcccg 420
gctgttccat tgctaaccag tggtgcggca ctgttcagct actttttcaa cttcctgaag 480
cgaccgggtg aagagtttcc gtttgaggca atccgcctgt cgaagcgaga acaggataag 540
atgcgcgaga tgtttggaac agagccgcct gaagaagatt ttttagcgcc ggcccaggcc 600
ggtatcatgc tgatgtgcac gagccgcgta attgaggcta agtacctgga ctattgtacc 660
gaactgacca atgtaaaagt tgttccggtt ggtccgccgt ttcaggatcc gctgaccgaa 720
gatattgacg accccgaact gatggattgg ttagatacca aacccgaaca tagtgttgtc 780
tatgtgtcgt ttggcagcga agcgttcctg agccgtgaag atatggaaga agtcgcgttc 840
ggcctggagc tgagcggcgt gaactttatc tgggttgcac gctttccgaa aggcgaagaa 900
cagcgtctgg aagacgttct gccaaaaggc ttcctggaac gcgttggtga tcgtggtcgc 960
gttctggacc atctggtgcc gcaggcccat attctgaacc atccgagcac gggtggcttc 1020
atctctcatt gcggttggaa cagcgtcatg gaaagcattg atttcggcgt tccgatcatt 1080
gcgatgccga tgcagtggga tcagccgatt aacgcgagac tgcttgtgga attaggcgtg 1140
gcagtggaga tcccgcgtga tgaagatggc cgggtccacc gcgccgaaat tgcccgtgtc 1200
ctgaaagatg tgatttcggg cccgactggt gagatactgc gcgcgaaagt acgcgacatt 1260
agcgcacgcc tgagagcgag acgcgaggag gaaatgaacg cagcggcgga agaactgata 1320
cagctgtgtc gcaaccgcaa cgcctacaag 1350
<210> 4
<211> 450
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.2
<400> 4
Met His His His His Glu Gly Val Ser Asp Gln Thr Leu Arg Val Thr
1 5 10 15
Met Phe Pro Trp Leu Gly Leu Gly His Val Asn Pro Phe Leu Arg Ile
20 25 30
Ala Lys Gln Leu Ala Asp Arg Gly Phe Val Ile Tyr Leu Val Ser Thr
35 40 45
Ala Ile Asn Leu Glu Met Ile Lys Lys Arg Ile Pro Glu Lys Tyr Ser
50 55 60
Asn Ser Ile His Leu Val Glu Leu Arg Leu Pro Glu Leu Pro Glu Leu
65 70 75 80
Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn Lys
85 90 95
Thr Leu His Lys Ala Leu Lys Met Ser Ala Pro Asn Phe Ser Lys Ile
100 105 110
Leu Gln Asn Ile Lys Pro Asp Leu Val Leu Tyr Asp Phe Leu Val Pro
115 120 125
Trp Ala Glu Lys Val Ala Leu Glu Gln Gly Ile Pro Ala Val Pro Leu
130 135 140
Leu Thr Ser Gly Ala Ala Leu Phe Ser Tyr Phe Phe Asn Phe Leu Lys
145 150 155 160
Arg Pro Gly Glu Glu Phe Pro Phe Glu Ala Ile Arg Leu Ser Lys Arg
165 170 175
Glu Gln Asp Lys Met Arg Glu Met Phe Gly Thr Glu Pro Pro Glu Glu
180 185 190
Asp Phe Leu Ala Pro Ala Gln Ala Gly Ile Met Leu Met Cys Thr Ser
195 200 205
Arg Val Ile Glu Ala Lys Tyr Leu Asp Tyr Cys Thr Glu Leu Thr Asn
210 215 220
Val Lys Val Val Pro Val Gly Pro Pro Phe Gln Asp Pro Leu Thr Glu
225 230 235 240
Asp Ile Asp Asp Pro Glu Leu Met Asp Trp Leu Asp Thr Lys Pro Glu
245 250 255
His Ser Val Val Tyr Val Ser Phe Gly Ser Glu Ala Phe Leu Ser Arg
260 265 270
Glu Asp Met Glu Glu Val Ala Phe Gly Leu Glu Leu Ser Gly Val Asn
275 280 285
Phe Ile Trp Val Ala Arg Phe Pro Lys Gly Glu Glu Gln Arg Leu Glu
290 295 300
Asp Val Leu Pro Lys Gly Phe Leu Glu Arg Val Gly Asp Arg Gly Arg
305 310 315 320
Val Leu Asp His Leu Val Pro Gln Ala His Ile Leu Asn His Pro Ser
325 330 335
Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn Ser Val Met Glu Ser
340 345 350
Ile Asp Phe Gly Val Pro Ile Ile Ala Met Pro Met Gln Trp Asp Gln
355 360 365
Pro Ile Asn Ala Arg Leu Leu Val Glu Leu Gly Val Ala Val Glu Ile
370 375 380
Pro Arg Asp Glu Asp Gly Arg Val His Arg Ala Glu Ile Ala Arg Val
385 390 395 400
Leu Lys Asp Val Ile Ser Gly Pro Thr Gly Glu Ile Leu Arg Ala Lys
405 410 415
Val Arg Asp Ile Ser Ala Arg Leu Arg Ala Arg Arg Glu Glu Glu Met
420 425 430
Asn Ala Ala Ala Glu Glu Leu Ile Gln Leu Cys Arg Asn Arg Asn Ala
435 440 445
Tyr Lys
450
<210> 5
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.3-F
<400> 5
gagcacgggt ggtttcatct ctcactgcg 29
<210> 6
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.3-R
<400> 6
agatgaaacc acccgtgctc ggatggttc 29
<210> 7
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.4-F
<400> 7
gtgggatcag ccgatcaacg cta 23
<210> 8
<211> 23
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.4-R
<400> 8
gttgatcggc tgatcccact gca 23
<210> 9
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.5-F
<400> 9
gaaattgccg aagcactgaa aagcgttg 28
<210> 10
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.5-R
<400> 10
ttcagtgctt cggcaatttc ggcgcggtgg 30
<210> 11
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.6-F
<400> 11
gagacgcgac gaagaaatgg acgcgg 26
<210> 12
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.6-R
<400> 12
ccatttcttc gtcgcgtctc gctctcaggc 30
<210> 13
<211> 24
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.7-F
<400> 13
ctcccaactt cagccgcatc ttgc 24
<210> 14
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.7-R
<400> 14
gatgcggctg aagttgggag cgctcatct 29
<210> 15
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.8-F
<400> 15
gttctgccag aaggcttcct ggaacgta 28
<210> 16
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.8-R
<400> 16
ggaagccttc tggcagaacg tcttccaga 29
<210> 17
<211> 30
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.9-F
<400> 17
ctaaaccgaa ctttagcaag atccttcaaa 30
<210> 18
<211> 26
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.9-R
<400> 18
cttgctaaag ttcggtttag acattt 26
<210> 19
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.10-F
<400> 19
gaaaactccg ttgtctatgt gtcgtttg 28
<210> 20
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.10-R
<400> 20
catagacaac ggagttttcc ggtttggtg 29
<210> 21
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Km-F
<400> 21
gcccgacatt atcgcgagc 19
<210> 22
<211> 19
<212> DNA
<213> Artificial Sequence
<220>
<223> Km-R
<400> 22
gggtataaat gggctcgcg 19
<210> 23
<211> 2415
<212> DNA
<213> Artificial Sequence
<220>
<223> SUS
<400> 23
atgcaccatc atcatcatca tggcggtagc ggcatgattg aagtactgcg ccaacagctg 60
ctggatagcc cgcgttcatg gcgtgcattc ctgcgtcatt tagtcgcatc tcagcgtgac 120
tcatggctac ataccgattt acagcacgcg tgcaagacgt ttcgtgaaca gcctccggaa 180
ggctatcctg aagatattgg ttggctggca gattttattg cgcattgcca ggaagcgatc 240
ttccgggatc cgtggatggt ttttgcgtgg cgtctacgtc caggtgtttg ggagtatgtg 300
cgcatacatg tagaacagct ggcggtggag gagctgagca ctgatgaata tctgcaagcc 360
aaagaacaac ttgttggctt aggtgcagaa ggtgaagctg ttctgacggt ggatttcgaa 420
gattttcgtc cggtgagcca gcgtttaaaa gacgagagca ccattggtga tggtcttacc 480
catctgaatc gtcatttagc aggtcgcatc tggactgatt tagcagcagg tcgtagtgct 540
attctggaat ttctgggcct gcatcgtctg gataaccaga atctgatgct gagcaacggc 600
aataccgatt ttgactcttt acgtcaaacc gtacaatatc tgggcacctt accaagagaa 660
actccgtggg cagagtttcg tgaagacatg cgtcgtcgtg gttttgaacc cggttggggc 720
aacaccgcgg gccgtgttcg cgaaaccatg cgtctgctga tggatctgct tgactctccg 780
agcccagctg ccctggagag cttcctggat cgcatcccga tgattagcaa cgttctgatc 840
gtgagcattc acggatggtt tgcgcaggac aaggttctgg gtcgtccgga cactggtggt 900
caggtcgtgt atattctgga tcaggcccgt gcactggaac gcgaaatgcg taaccgcctg 960
cgccaacagg gtgttgatgt ggagccgcgc attttgattg cgacccgttt aatcccggaa 1020
agtgatggca cgacttgtga ccagcgtctg gagcctgtcc atggtgccga gaatgtgcag 1080
attctgcgcg ttccgtttcg ctatgaggat ggtcgtattc acccgcattg gatctcacgc 1140
ttcaaggttt ggccgtatct tgaacgctat gcaagggatc tggaacgcga agttaaggcc 1200
gaattaggta gtcgtccaga tctgatcatc ggcaactata gcgacggtgg gctggttgca 1260
accatcctgt cagaaaaatt aggtgttacg cagtgcaaca ttgcacatgc cctggagaaa 1320
agcaagtacc cggggtccga tctgcattgg ccgctgtatg aacaggacca tcactttgcg 1380
tgtcagttta ccgcggatct gatcgcgatg aatgcagcag acatcatcgt gacgagcaca 1440
taccaggaaa ttgcaggtaa tgaccgcgag gttggtcaat atgaatctca ccaggactat 1500
actttaccgg gcttgtatcg tgtcgagaat ggtattgacg tgttcgatag caagtttaac 1560
attgtgagtc cgggcgcaga tccgagtacg tattttagct atgcccgtca tgaagaacgc 1620
ttctcgtcgc tgtggccaga aatcgaaagt ctgctgtttg gccgcgaacc aggtccggat 1680
attcgtggtg ttctcgaaga tcctcagaaa ccgattattc tgtcggtggc ccgtatggat 1740
cgcatcaaga acctgagcgg tctggccgaa ctgtatggtc ggagtgcgcg cttacgtagc 1800
ctggccaatt tggtgatcat cggtggtcat gttgatgtac aggccagtat ggatgcagaa 1860
gaacgcgaag aaatccgtcg tatgcacgag atcatggacc gctaccagct ggatggtcag 1920
atgcgttggg tgggatcgca tctggataaa cgcgtcgtgg gcgaattgta tcgtgtagtg 1980
gcggatggac gtggcgtttt tgtgcaacca gccctgtttg aggcgttcgg cctgaccgtg 2040
attgaggcaa tgagcagtgg cctgccagtg tttgcgaccc gccacggtgg tccgctggaa 2100
atcatcgaag acggcgttag cggcttccat attgatccca acgaccctga agcggtagca 2160
gaaaaactgg ccgacttcct ggaagcagcg cgtgaacgtc cgaagtattg ggaggaaatt 2220
agccaggcgg ctcttgcgcg cgtcagcgaa cgttacacgt gggagcgcta tgcggaacgc 2280
ttgatgacca tcgcgcgttg cttcggcttt tggcgcttcg ttctgtcacg cgaatcacag 2340
gtcatggaac gctatctgca aatgttccgc cacctgcaat ggcgcccgct ggctcatgcc 2400
gtaccgatgg agtaa 2415
<210> 24
<211> 804
<212> PRT
<213> Artificial Sequence
<220>
<223> SUS
<400> 24
Met His His His His His His Gly Gly Ser Gly Met Ile Glu Val Leu
1 5 10 15
Arg Gln Gln Leu Leu Asp Ser Pro Arg Ser Trp Arg Ala Phe Leu Arg
20 25 30
His Leu Val Ala Ser Gln Arg Asp Ser Trp Leu His Thr Asp Leu Gln
35 40 45
His Ala Cys Lys Thr Phe Arg Glu Gln Pro Pro Glu Gly Tyr Pro Glu
50 55 60
Asp Ile Gly Trp Leu Ala Asp Phe Ile Ala His Cys Gln Glu Ala Ile
65 70 75 80
Phe Arg Asp Pro Trp Met Val Phe Ala Trp Arg Leu Arg Pro Gly Val
85 90 95
Trp Glu Tyr Val Arg Ile His Val Glu Gln Leu Ala Val Glu Glu Leu
100 105 110
Ser Thr Asp Glu Tyr Leu Gln Ala Lys Glu Gln Leu Val Gly Leu Gly
115 120 125
Ala Glu Gly Glu Ala Val Leu Thr Val Asp Phe Glu Asp Phe Arg Pro
130 135 140
Val Ser Gln Arg Leu Lys Asp Glu Ser Thr Ile Gly Asp Gly Leu Thr
145 150 155 160
His Leu Asn Arg His Leu Ala Gly Arg Ile Trp Thr Asp Leu Ala Ala
165 170 175
Gly Arg Ser Ala Ile Leu Glu Phe Leu Gly Leu His Arg Leu Asp Asn
180 185 190
Gln Asn Leu Met Leu Ser Asn Gly Asn Thr Asp Phe Asp Ser Leu Arg
195 200 205
Gln Thr Val Gln Tyr Leu Gly Thr Leu Pro Arg Glu Thr Pro Trp Ala
210 215 220
Glu Phe Arg Glu Asp Met Arg Arg Arg Gly Phe Glu Pro Gly Trp Gly
225 230 235 240
Asn Thr Ala Gly Arg Val Arg Glu Thr Met Arg Leu Leu Met Asp Leu
245 250 255
Leu Asp Ser Pro Ser Pro Ala Ala Leu Glu Ser Phe Leu Asp Arg Ile
260 265 270
Pro Met Ile Ser Asn Val Leu Ile Val Ser Ile His Gly Trp Phe Ala
275 280 285
Gln Asp Lys Val Leu Gly Arg Pro Asp Thr Gly Gly Gln Val Val Tyr
290 295 300
Ile Leu Asp Gln Ala Arg Ala Leu Glu Arg Glu Met Arg Asn Arg Leu
305 310 315 320
Arg Gln Gln Gly Val Asp Val Glu Pro Arg Ile Leu Ile Ala Thr Arg
325 330 335
Leu Ile Pro Glu Ser Asp Gly Thr Thr Cys Asp Gln Arg Leu Glu Pro
340 345 350
Val His Gly Ala Glu Asn Val Gln Ile Leu Arg Val Pro Phe Arg Tyr
355 360 365
Glu Asp Gly Arg Ile His Pro His Trp Ile Ser Arg Phe Lys Val Trp
370 375 380
Pro Tyr Leu Glu Arg Tyr Ala Arg Asp Leu Glu Arg Glu Val Lys Ala
385 390 395 400
Glu Leu Gly Ser Arg Pro Asp Leu Ile Ile Gly Asn Tyr Ser Asp Gly
405 410 415
Gly Leu Val Ala Thr Ile Leu Ser Glu Lys Leu Gly Val Thr Gln Cys
420 425 430
Asn Ile Ala His Ala Leu Glu Lys Ser Lys Tyr Pro Gly Ser Asp Leu
435 440 445
His Trp Pro Leu Tyr Glu Gln Asp His His Phe Ala Cys Gln Phe Thr
450 455 460
Ala Asp Leu Ile Ala Met Asn Ala Ala Asp Ile Ile Val Thr Ser Thr
465 470 475 480
Tyr Gln Glu Ile Ala Gly Asn Asp Arg Glu Val Gly Gln Tyr Glu Ser
485 490 495
His Gln Asp Tyr Thr Leu Pro Gly Leu Tyr Arg Val Glu Asn Gly Ile
500 505 510
Asp Val Phe Asp Ser Lys Phe Asn Ile Val Ser Pro Gly Ala Asp Pro
515 520 525
Ser Thr Tyr Phe Ser Tyr Ala Arg His Glu Glu Arg Phe Ser Ser Leu
530 535 540
Trp Pro Glu Ile Glu Ser Leu Leu Phe Gly Arg Glu Pro Gly Pro Asp
545 550 555 560
Ile Arg Gly Val Leu Glu Asp Pro Gln Lys Pro Ile Ile Leu Ser Val
565 570 575
Ala Arg Met Asp Arg Ile Lys Asn Leu Ser Gly Leu Ala Glu Leu Tyr
580 585 590
Gly Arg Ser Ala Arg Leu Arg Ser Leu Ala Asn Leu Val Ile Ile Gly
595 600 605
Gly His Val Asp Val Gln Ala Ser Met Asp Ala Glu Glu Arg Glu Glu
610 615 620
Ile Arg Arg Met His Glu Ile Met Asp Arg Tyr Gln Leu Asp Gly Gln
625 630 635 640
Met Arg Trp Val Gly Ser His Leu Asp Lys Arg Val Val Gly Glu Leu
645 650 655
Tyr Arg Val Val Ala Asp Gly Arg Gly Val Phe Val Gln Pro Ala Leu
660 665 670
Phe Glu Ala Phe Gly Leu Thr Val Ile Glu Ala Met Ser Ser Gly Leu
675 680 685
Pro Val Phe Ala Thr Arg His Gly Gly Pro Leu Glu Ile Ile Glu Asp
690 695 700
Gly Val Ser Gly Phe His Ile Asp Pro Asn Asp Pro Glu Ala Val Ala
705 710 715 720
Glu Lys Leu Ala Asp Phe Leu Glu Ala Ala Arg Glu Arg Pro Lys Tyr
725 730 735
Trp Glu Glu Ile Ser Gln Ala Ala Leu Ala Arg Val Ser Glu Arg Tyr
740 745 750
Thr Trp Glu Arg Tyr Ala Glu Arg Leu Met Thr Ile Ala Arg Cys Phe
755 760 765
Gly Phe Trp Arg Phe Val Leu Ser Arg Glu Ser Gln Val Met Glu Arg
770 775 780
Tyr Leu Gln Met Phe Arg His Leu Gln Trp Arg Pro Leu Ala His Ala
785 790 795 800
Val Pro Met Glu
<210> 25
<211> 1329
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.3
<400> 25
atggcgacca acctgcgtgt tctgatgttc ccgtggctgg cgtacggcca catcagcccg 60
ttcctgaaca tcgcgaaaca gctggcggat cgtggtttcc tgatctatct gtgctccacc 120
cgcatcaacc tggaatctat catcaagaaa atcccggaaa aatacgcgga ttctatccat 180
ctgatcgaac ttcagctgcc ggagctgccg gaactgccgc cgcactatca caccactaac 240
ggtctgccgc cgcatctgaa cccgaccctg cacaaagcgc tgaaaatgtc taaaccgaac 300
ttcagccgca tcttgcagaa cctgaaaccg gacctgctga tctacgatgt gctccagccg 360
tgggcggaac acgtggcgaa cgaacagggc atcccggctg gcaaactgct ggtttcttgc 420
gcggcggttt tctcctactt tttctctttc cgtaaaaatc cgggcgttga atttccgttc 480
ccggcgatcc acctgccgga agtggaaaaa gttaaaatcc gtgaaatcct ggctaaagaa 540
ccggaagaag gcggccgtct ggacgaaggc aacaaacaga tgatgctgat gtgcacttct 600
cgtaccattg aagctaaata cattgattac tgcaccgaac tgtgcaactg gaaagttgtt 660
ccggttggtc cgccgttcca ggatctgatc actaacgatg cggataacaa agaactgatc 720
gattggctgg gcaccaaacc ggaaaactcc accgtgttcg ttagcttcgg ctccgaatac 780
ttcctgagca aagaagatat ggaagaaatt gctttcgctc tggaagcatc taacgttaac 840
ttcatctggg ttgtgcgttt cccgaaaggc gaagaacgta acctggaaga tgcactgccg 900
aaaggcttcc tggaacgcgt tggtgatcgt ggtcgcgttc tggaccatct ggtgccgcag 960
gcccatattc tgaaccatcc gagcacgggt ggtttcatct ctcactgcgg ttggaacagc 1020
gttatggaaa gcatcgactt cggtgtgccg atcatcgcga tgccgatcca caacgatcag 1080
ccgatcaacg ctaaactgat ggttgaactg ggcgttgcgg ttgaaatcgt tcgtgatgat 1140
gatggtaaaa tccaccgcgg cgaaatcgcg gaagcactga aaagcgttgt gaccggtgaa 1200
accggcgaaa tcctgcgtgc gaaagttcgt gaaatcagca aaaacctgaa atccatccgt 1260
gacgaagaaa tggacgcggt tgctgaagaa ctgatccagc tgtgccgtaa ctctaacaaa 1320
agcaaataa 1329
<210> 26
<211> 442
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.3
<400> 26
Met Ala Thr Asn Leu Arg Val Leu Met Phe Pro Trp Leu Ala Tyr Gly
1 5 10 15
His Ile Ser Pro Phe Leu Asn Ile Ala Lys Gln Leu Ala Asp Arg Gly
20 25 30
Phe Leu Ile Tyr Leu Cys Ser Thr Arg Ile Asn Leu Glu Ser Ile Ile
35 40 45
Lys Lys Ile Pro Glu Lys Tyr Ala Asp Ser Ile His Leu Ile Glu Leu
50 55 60
Gln Leu Pro Glu Leu Pro Glu Leu Pro Pro His Tyr His Thr Thr Asn
65 70 75 80
Gly Leu Pro Pro His Leu Asn Pro Thr Leu His Lys Ala Leu Lys Met
85 90 95
Ser Lys Pro Asn Phe Ser Arg Ile Leu Gln Asn Leu Lys Pro Asp Leu
100 105 110
Leu Ile Tyr Asp Val Leu Gln Pro Trp Ala Glu His Val Ala Asn Glu
115 120 125
Gln Gly Ile Pro Ala Gly Lys Leu Leu Val Ser Cys Ala Ala Val Phe
130 135 140
Ser Tyr Phe Phe Ser Phe Arg Lys Asn Pro Gly Val Glu Phe Pro Phe
145 150 155 160
Pro Ala Ile His Leu Pro Glu Val Glu Lys Val Lys Ile Arg Glu Ile
165 170 175
Leu Ala Lys Glu Pro Glu Glu Gly Gly Arg Leu Asp Glu Gly Asn Lys
180 185 190
Gln Met Met Leu Met Cys Thr Ser Arg Thr Ile Glu Ala Lys Tyr Ile
195 200 205
Asp Tyr Cys Thr Glu Leu Cys Asn Trp Lys Val Val Pro Val Gly Pro
210 215 220
Pro Phe Gln Asp Leu Ile Thr Asn Asp Ala Asp Asn Lys Glu Leu Ile
225 230 235 240
Asp Trp Leu Gly Thr Lys Pro Glu Asn Ser Thr Val Phe Val Ser Phe
245 250 255
Gly Ser Glu Tyr Phe Leu Ser Lys Glu Asp Met Glu Glu Ile Ala Phe
260 265 270
Ala Leu Glu Ala Ser Asn Val Asn Phe Ile Trp Val Val Arg Phe Pro
275 280 285
Lys Gly Glu Glu Arg Asn Leu Glu Asp Ala Leu Pro Lys Gly Phe Leu
290 295 300
Glu Arg Val Gly Asp Arg Gly Arg Val Leu Asp His Leu Val Pro Gln
305 310 315 320
Ala His Ile Leu Asn His Pro Ser Thr Gly Gly Phe Ile Ser His Cys
325 330 335
Gly Trp Asn Ser Val Met Glu Ser Ile Asp Phe Gly Val Pro Ile Ile
340 345 350
Ala Met Pro Ile His Asn Asp Gln Pro Ile Asn Ala Lys Leu Met Val
355 360 365
Glu Leu Gly Val Ala Val Glu Ile Val Arg Asp Asp Asp Gly Lys Ile
370 375 380
His Arg Gly Glu Ile Ala Glu Ala Leu Lys Ser Val Val Thr Gly Glu
385 390 395 400
Thr Gly Glu Ile Leu Arg Ala Lys Val Arg Glu Ile Ser Lys Asn Leu
405 410 415
Lys Ser Ile Arg Asp Glu Glu Met Asp Ala Val Ala Glu Glu Leu Ile
420 425 430
Gln Leu Cys Arg Asn Ser Asn Lys Ser Lys
435 440
<210> 27
<211> 1353
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.4
<400> 27
atgcaccatc atcatgaagg cgtgagcgac cagaccctga gagtaacgat gtttccgtgg 60
cttgggctgg gtcatgttaa cccgtttttg cgtatcgcta aacaactggc cgatcgtggt 120
ttcgttatct atttagttag taccgctatt aacctcgaaa tgatcaaaaa gagaatcccg 180
gagaaataca gtaatagcat ccatctggtt gagctgcgcc tgccagaatt accggaactg 240
ccaccacatt accatactac caacggttta ccaccgcatc tgaacaaaac cctgcacaag 300
gcactgaaga tgagcgctcc caactttagc aagatccttc aaaatattaa gccggacctg 360
gtcctttacg attttctggt tccgtgggca gaaaaagtcg cgcttgaaca gggcatcccg 420
gctgttccat tgctaaccag tggtgcggca ctgttcagct actttttcaa cttcctgaag 480
cgaccgggtg aagagtttcc gtttgaggca atccgcctgt cgaagcgaga acaggataag 540
atgcgcgaga tgtttggaac agagccgcct gaagaagatt ttttagcgcc ggcccaggcc 600
ggtatcatgc tgatgtgcac gagccgcgta attgaggcta agtacctgga ctattgtacc 660
gaactgacca atgtaaaagt tgttccggtt ggtccgccgt ttcaggatcc gctgaccgaa 720
gatattgacg accccgaact gatggattgg ttagatacca aacccgaaca tagtgttgtc 780
tatgtgtcgt ttggcagcga agcgttcctg agccgtgaag atatggaaga agtcgcgttc 840
ggcctggagc tgagcggcgt gaactttatc tgggttgcac gctttccgaa aggcgaagaa 900
cagcgtctgg aagacgttct gccaaaaggc ttcctggaac gcgttggtga tcgtggtcgc 960
gttctggacc atctggtgcc gcaggcccat attctgaacc atccgagcac gggtggcttc 1020
atctctcatt gcggttggaa cagcgtcatg gaaagcattg atttcggcgt tccgatcatt 1080
gcgatgccga tgcagtggga tcagccgatc aacgctaaac tgatggttga actgggcgtt 1140
gcggttgaaa tcgttcgtga tgatgatggt aaaatccacc gcggcgaaat cgcggaagca 1200
ctgaaaagcg ttgtgaccgg tgaaaccggc gaaatcctgc gtgcgaaagt tcgtgaaatc 1260
agcaaaaacc tgaaatccat ccgtgacgaa gaaatggacg cggttgctga agaactgatc 1320
cagctgtgcc gtaactctaa caaaagcaaa taa 1353
<210> 28
<211> 450
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.4
<400> 28
Met His His His His Glu Gly Val Ser Asp Gln Thr Leu Arg Val Thr
1 5 10 15
Met Phe Pro Trp Leu Gly Leu Gly His Val Asn Pro Phe Leu Arg Ile
20 25 30
Ala Lys Gln Leu Ala Asp Arg Gly Phe Val Ile Tyr Leu Val Ser Thr
35 40 45
Ala Ile Asn Leu Glu Met Ile Lys Lys Arg Ile Pro Glu Lys Tyr Ser
50 55 60
Asn Ser Ile His Leu Val Glu Leu Arg Leu Pro Glu Leu Pro Glu Leu
65 70 75 80
Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn Lys
85 90 95
Thr Leu His Lys Ala Leu Lys Met Ser Ala Pro Asn Phe Ser Lys Ile
100 105 110
Leu Gln Asn Ile Lys Pro Asp Leu Val Leu Tyr Asp Phe Leu Val Pro
115 120 125
Trp Ala Glu Lys Val Ala Leu Glu Gln Gly Ile Pro Ala Val Pro Leu
130 135 140
Leu Thr Ser Gly Ala Ala Leu Phe Ser Tyr Phe Phe Asn Phe Leu Lys
145 150 155 160
Arg Pro Gly Glu Glu Phe Pro Phe Glu Ala Ile Arg Leu Ser Lys Arg
165 170 175
Glu Gln Asp Lys Met Arg Glu Met Phe Gly Thr Glu Pro Pro Glu Glu
180 185 190
Asp Phe Leu Ala Pro Ala Gln Ala Gly Ile Met Leu Met Cys Thr Ser
195 200 205
Arg Val Ile Glu Ala Lys Tyr Leu Asp Tyr Cys Thr Glu Leu Thr Asn
210 215 220
Val Lys Val Val Pro Val Gly Pro Pro Phe Gln Asp Pro Leu Thr Glu
225 230 235 240
Asp Ile Asp Asp Pro Glu Leu Met Asp Trp Leu Asp Thr Lys Pro Glu
245 250 255
His Ser Val Val Tyr Val Ser Phe Gly Ser Glu Ala Phe Leu Ser Arg
260 265 270
Glu Asp Met Glu Glu Val Ala Phe Gly Leu Glu Leu Ser Gly Val Asn
275 280 285
Phe Ile Trp Val Ala Arg Phe Pro Lys Gly Glu Glu Gln Arg Leu Glu
290 295 300
Asp Val Leu Pro Lys Gly Phe Leu Glu Arg Val Gly Asp Arg Gly Arg
305 310 315 320
Val Leu Asp His Leu Val Pro Gln Ala His Ile Leu Asn His Pro Ser
325 330 335
Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn Ser Val Met Glu Ser
340 345 350
Ile Asp Phe Gly Val Pro Ile Ile Ala Met Pro Met Gln Trp Asp Gln
355 360 365
Pro Ile Asn Ala Lys Leu Met Val Glu Leu Gly Val Ala Val Glu Ile
370 375 380
Val Arg Asp Asp Asp Gly Lys Ile His Arg Gly Glu Ile Ala Glu Ala
385 390 395 400
Leu Lys Ser Val Val Thr Gly Glu Thr Gly Glu Ile Leu Arg Ala Lys
405 410 415
Val Arg Glu Ile Ser Lys Asn Leu Lys Ser Ile Arg Asp Glu Glu Met
420 425 430
Asp Ala Val Ala Glu Glu Leu Ile Gln Leu Cys Arg Asn Ser Asn Lys
435 440 445
Ser Lys
450
<210> 29
<211> 1353
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.5
<400> 29
atgcaccatc atcatgaagg cgtgagcgac cagaccctga gagtaacgat gtttccgtgg 60
cttgggctgg gtcatgttaa cccgtttttg cgtatcgcta aacaactggc cgatcgtggt 120
ttcgttatct atttagttag taccgctatt aacctcgaaa tgatcaaaaa gagaatcccg 180
gagaaataca gtaatagcat ccatctggtt gagctgcgcc tgccagaatt accggaactg 240
ccaccacatt accatactac caacggttta ccaccgcatc tgaacaaaac cctgcacaag 300
gcactgaaga tgagcgctcc caactttagc aagatccttc aaaatattaa gccggacctg 360
gtcctttacg attttctggt tccgtgggca gaaaaagtcg cgcttgaaca gggcatcccg 420
gctgttccat tgctaaccag tggtgcggca ctgttcagct actttttcaa cttcctgaag 480
cgaccgggtg aagagtttcc gtttgaggca atccgcctgt cgaagcgaga acaggataag 540
atgcgcgaga tgtttggaac agagccgcct gaagaagatt ttttagcgcc ggcccaggcc 600
ggtatcatgc tgatgtgcac gagccgcgta attgaggcta agtacctgga ctattgtacc 660
gaactgacca atgtaaaagt tgttccggtt ggtccgccgt ttcaggatcc gctgaccgaa 720
gatattgacg accccgaact gatggattgg ttagatacca aacccgaaca tagtgttgtc 780
tatgtgtcgt ttggcagcga agcgttcctg agccgtgaag atatggaaga agtcgcgttc 840
ggcctggagc tgagcggcgt gaactttatc tgggttgcac gctttccgaa aggcgaagaa 900
cagcgtctgg aagacgttct gccaaaaggc ttcctggaac gcgttggtga tcgtggtcgc 960
gttctggacc atctggtgcc gcaggcccat attctgaacc atccgagcac gggtggcttc 1020
atctctcatt gcggttggaa cagcgtcatg gaaagcattg atttcggcgt tccgatcatt 1080
gcgatgccga tgcagtggga tcagccgatt aacgcgagac tgcttgtgga attaggcgtg 1140
gcagtggaga tcccgcgtga tgaagatggc cgggtccacc gcgccgaaat tgccgaagca 1200
ctgaaaagcg ttgtgaccgg tgaaaccggc gaaatcctgc gtgcgaaagt tcgtgaaatc 1260
agcaaaaacc tgaaatccat ccgtgacgaa gaaatggacg cggttgctga agaactgatc 1320
cagctgtgcc gtaactctaa caaaagcaaa taa 1353
<210> 30
<211> 450
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.5
<400> 30
Met His His His His Glu Gly Val Ser Asp Gln Thr Leu Arg Val Thr
1 5 10 15
Met Phe Pro Trp Leu Gly Leu Gly His Val Asn Pro Phe Leu Arg Ile
20 25 30
Ala Lys Gln Leu Ala Asp Arg Gly Phe Val Ile Tyr Leu Val Ser Thr
35 40 45
Ala Ile Asn Leu Glu Met Ile Lys Lys Arg Ile Pro Glu Lys Tyr Ser
50 55 60
Asn Ser Ile His Leu Val Glu Leu Arg Leu Pro Glu Leu Pro Glu Leu
65 70 75 80
Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn Lys
85 90 95
Thr Leu His Lys Ala Leu Lys Met Ser Ala Pro Asn Phe Ser Lys Ile
100 105 110
Leu Gln Asn Ile Lys Pro Asp Leu Val Leu Tyr Asp Phe Leu Val Pro
115 120 125
Trp Ala Glu Lys Val Ala Leu Glu Gln Gly Ile Pro Ala Val Pro Leu
130 135 140
Leu Thr Ser Gly Ala Ala Leu Phe Ser Tyr Phe Phe Asn Phe Leu Lys
145 150 155 160
Arg Pro Gly Glu Glu Phe Pro Phe Glu Ala Ile Arg Leu Ser Lys Arg
165 170 175
Glu Gln Asp Lys Met Arg Glu Met Phe Gly Thr Glu Pro Pro Glu Glu
180 185 190
Asp Phe Leu Ala Pro Ala Gln Ala Gly Ile Met Leu Met Cys Thr Ser
195 200 205
Arg Val Ile Glu Ala Lys Tyr Leu Asp Tyr Cys Thr Glu Leu Thr Asn
210 215 220
Val Lys Val Val Pro Val Gly Pro Pro Phe Gln Asp Pro Leu Thr Glu
225 230 235 240
Asp Ile Asp Asp Pro Glu Leu Met Asp Trp Leu Asp Thr Lys Pro Glu
245 250 255
His Ser Val Val Tyr Val Ser Phe Gly Ser Glu Ala Phe Leu Ser Arg
260 265 270
Glu Asp Met Glu Glu Val Ala Phe Gly Leu Glu Leu Ser Gly Val Asn
275 280 285
Phe Ile Trp Val Ala Arg Phe Pro Lys Gly Glu Glu Gln Arg Leu Glu
290 295 300
Asp Val Leu Pro Lys Gly Phe Leu Glu Arg Val Gly Asp Arg Gly Arg
305 310 315 320
Val Leu Asp His Leu Val Pro Gln Ala His Ile Leu Asn His Pro Ser
325 330 335
Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn Ser Val Met Glu Ser
340 345 350
Ile Asp Phe Gly Val Pro Ile Ile Ala Met Pro Met Gln Trp Asp Gln
355 360 365
Pro Ile Asn Ala Arg Leu Leu Val Glu Leu Gly Val Ala Val Glu Ile
370 375 380
Pro Arg Asp Glu Asp Gly Arg Val His Arg Ala Glu Ile Ala Glu Ala
385 390 395 400
Leu Lys Ser Val Val Thr Gly Glu Thr Gly Glu Ile Leu Arg Ala Lys
405 410 415
Val Arg Glu Ile Ser Lys Asn Leu Lys Ser Ile Arg Asp Glu Glu Met
420 425 430
Asp Ala Val Ala Glu Glu Leu Ile Gln Leu Cys Arg Asn Ser Asn Lys
435 440 445
Ser Lys
450
<210> 31
<211> 1353
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.6
<400> 31
atgcaccatc atcatgaagg cgtgagcgac cagaccctga gagtaacgat gtttccgtgg 60
cttgggctgg gtcatgttaa cccgtttttg cgtatcgcta aacaactggc cgatcgtggt 120
ttcgttatct atttagttag taccgctatt aacctcgaaa tgatcaaaaa gagaatcccg 180
gagaaataca gtaatagcat ccatctggtt gagctgcgcc tgccagaatt accggaactg 240
ccaccacatt accatactac caacggttta ccaccgcatc tgaacaaaac cctgcacaag 300
gcactgaaga tgagcgctcc caactttagc aagatccttc aaaatattaa gccggacctg 360
gtcctttacg attttctggt tccgtgggca gaaaaagtcg cgcttgaaca gggcatcccg 420
gctgttccat tgctaaccag tggtgcggca ctgttcagct actttttcaa cttcctgaag 480
cgaccgggtg aagagtttcc gtttgaggca atccgcctgt cgaagcgaga acaggataag 540
atgcgcgaga tgtttggaac agagccgcct gaagaagatt ttttagcgcc ggcccaggcc 600
ggtatcatgc tgatgtgcac gagccgcgta attgaggcta agtacctgga ctattgtacc 660
gaactgacca atgtaaaagt tgttccggtt ggtccgccgt ttcaggatcc gctgaccgaa 720
gatattgacg accccgaact gatggattgg ttagatacca aacccgaaca tagtgttgtc 780
tatgtgtcgt ttggcagcga agcgttcctg agccgtgaag atatggaaga agtcgcgttc 840
ggcctggagc tgagcggcgt gaactttatc tgggttgcac gctttccgaa aggcgaagaa 900
cagcgtctgg aagacgttct gccaaaaggc ttcctggaac gcgttggtga tcgtggtcgc 960
gttctggacc atctggtgcc gcaggcccat attctgaacc atccgagcac gggtggcttc 1020
atctctcatt gcggttggaa cagcgtcatg gaaagcattg atttcggcgt tccgatcatt 1080
gcgatgccga tgcagtggga tcagccgatt aacgcgagac tgcttgtgga attaggcgtg 1140
gcagtggaga tcccgcgtga tgaagatggc cgggtccacc gcgccgaaat tgcccgtgtc 1200
ctgaaagatg tgatttcggg cccgactggt gagatactgc gcgcgaaagt acgcgacatt 1260
agcgcacgcc tgagagcgag acgcgacgaa gaaatggacg cggttgctga agaactgatc 1320
cagctgtgcc gtaactctaa caaaagcaaa taa 1353
<210> 32
<211> 450
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.6
<400> 32
Met His His His His Glu Gly Val Ser Asp Gln Thr Leu Arg Val Thr
1 5 10 15
Met Phe Pro Trp Leu Gly Leu Gly His Val Asn Pro Phe Leu Arg Ile
20 25 30
Ala Lys Gln Leu Ala Asp Arg Gly Phe Val Ile Tyr Leu Val Ser Thr
35 40 45
Ala Ile Asn Leu Glu Met Ile Lys Lys Arg Ile Pro Glu Lys Tyr Ser
50 55 60
Asn Ser Ile His Leu Val Glu Leu Arg Leu Pro Glu Leu Pro Glu Leu
65 70 75 80
Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn Lys
85 90 95
Thr Leu His Lys Ala Leu Lys Met Ser Ala Pro Asn Phe Ser Lys Ile
100 105 110
Leu Gln Asn Ile Lys Pro Asp Leu Val Leu Tyr Asp Phe Leu Val Pro
115 120 125
Trp Ala Glu Lys Val Ala Leu Glu Gln Gly Ile Pro Ala Val Pro Leu
130 135 140
Leu Thr Ser Gly Ala Ala Leu Phe Ser Tyr Phe Phe Asn Phe Leu Lys
145 150 155 160
Arg Pro Gly Glu Glu Phe Pro Phe Glu Ala Ile Arg Leu Ser Lys Arg
165 170 175
Glu Gln Asp Lys Met Arg Glu Met Phe Gly Thr Glu Pro Pro Glu Glu
180 185 190
Asp Phe Leu Ala Pro Ala Gln Ala Gly Ile Met Leu Met Cys Thr Ser
195 200 205
Arg Val Ile Glu Ala Lys Tyr Leu Asp Tyr Cys Thr Glu Leu Thr Asn
210 215 220
Val Lys Val Val Pro Val Gly Pro Pro Phe Gln Asp Pro Leu Thr Glu
225 230 235 240
Asp Ile Asp Asp Pro Glu Leu Met Asp Trp Leu Asp Thr Lys Pro Glu
245 250 255
His Ser Val Val Tyr Val Ser Phe Gly Ser Glu Ala Phe Leu Ser Arg
260 265 270
Glu Asp Met Glu Glu Val Ala Phe Gly Leu Glu Leu Ser Gly Val Asn
275 280 285
Phe Ile Trp Val Ala Arg Phe Pro Lys Gly Glu Glu Gln Arg Leu Glu
290 295 300
Asp Val Leu Pro Lys Gly Phe Leu Glu Arg Val Gly Asp Arg Gly Arg
305 310 315 320
Val Leu Asp His Leu Val Pro Gln Ala His Ile Leu Asn His Pro Ser
325 330 335
Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn Ser Val Met Glu Ser
340 345 350
Ile Asp Phe Gly Val Pro Ile Ile Ala Met Pro Met Gln Trp Asp Gln
355 360 365
Pro Ile Asn Ala Arg Leu Leu Val Glu Leu Gly Val Ala Val Glu Ile
370 375 380
Pro Arg Asp Glu Asp Gly Arg Val His Arg Ala Glu Ile Ala Arg Val
385 390 395 400
Leu Lys Asp Val Ile Ser Gly Pro Thr Gly Glu Ile Leu Arg Ala Lys
405 410 415
Val Arg Asp Ile Ser Ala Arg Leu Arg Ala Arg Arg Asp Glu Glu Met
420 425 430
Asp Ala Val Ala Glu Glu Leu Ile Gln Leu Cys Arg Asn Ser Asn Lys
435 440 445
Ser Lys
450
<210> 33
<211> 1353
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.7
<400> 33
atgcaccatc atcatgaagg cgtgagcgac cagaccctga gagtaacgat gtttccgtgg 60
cttgggctgg gtcatgttaa cccgtttttg cgtatcgcta aacaactggc cgatcgtggt 120
ttcgttatct atttagttag taccgctatt aacctcgaaa tgatcaaaaa gagaatcccg 180
gagaaataca gtaatagcat ccatctggtt gagctgcgcc tgccagaatt accggaactg 240
ccaccacatt accatactac caacggttta ccaccgcatc tgaacaaaac cctgcacaag 300
gcactgaaga tgagcgctcc caactttagc aagatccttc aaaatattaa gccggacctg 360
gtcctttacg attttctggt tccgtgggca gaaaaagtcg cgcttgaaca gggcatcccg 420
gctgttccat tgctaaccag tggtgcggca ctgttcagct actttttcaa cttcctgaag 480
cgaccgggtg aagagtttcc gtttgaggca atccgcctgt cgaagcgaga acaggataag 540
atgcgcgaga tgtttggaac agagccgcct gaagaagatt ttttagcgcc ggcccaggcc 600
ggtatcatgc tgatgtgcac gagccgtacc attgaagcta aatacattga ttactgcacc 660
gaactgtgca actggaaagt tgttccggtt ggtccgccgt tccaggatct gatcactaac 720
gatgcggata acaaagaact gatcgattgg ctgggcacca aaccggaaaa ctccaccgtg 780
ttcgttagct tcggctccga atacttcctg agcaaagaag atatggaaga aattgctttc 840
gctctggaag catctaacgt taacttcatc tgggttgtgc gtttcccgaa aggcgaagaa 900
cgtaacctgg aagatgcact gccggaaggc ttcctggaac gtattggtga acgtggtcgc 960
gttctggaca aattcgcgcc gcagccgcgc atcctgaacc acccgagcac cggcggtttc 1020
atctctcact gcggttggaa cagcgttatg gaaagcatcg acttcggtgt gccgatcatc 1080
gcgatgccga tccacaacga tcagccgatc aacgctaaac tgatggttga actgggcgtt 1140
gcggttgaaa tcgttcgtga tgatgatggt aaaatccacc gcggcgaaat cgcggaagca 1200
ctgaaaagcg ttgtgaccgg tgaaaccggc gaaatcctgc gtgcgaaagt tcgtgaaatc 1260
agcaaaaacc tgaaatccat ccgtgacgaa gaaatggacg cggttgctga agaactgatc 1320
cagctgtgcc gtaactctaa caaaagcaaa taa 1353
<210> 34
<211> 450
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.7
<400> 34
Met His His His His Glu Gly Val Ser Asp Gln Thr Leu Arg Val Thr
1 5 10 15
Met Phe Pro Trp Leu Gly Leu Gly His Val Asn Pro Phe Leu Arg Ile
20 25 30
Ala Lys Gln Leu Ala Asp Arg Gly Phe Val Ile Tyr Leu Val Ser Thr
35 40 45
Ala Ile Asn Leu Glu Met Ile Lys Lys Arg Ile Pro Glu Lys Tyr Ser
50 55 60
Asn Ser Ile His Leu Val Glu Leu Arg Leu Pro Glu Leu Pro Glu Leu
65 70 75 80
Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn Lys
85 90 95
Thr Leu His Lys Ala Leu Lys Met Ser Ala Pro Asn Phe Ser Lys Ile
100 105 110
Leu Gln Asn Ile Lys Pro Asp Leu Val Leu Tyr Asp Phe Leu Val Pro
115 120 125
Trp Ala Glu Lys Val Ala Leu Glu Gln Gly Ile Pro Ala Val Pro Leu
130 135 140
Leu Thr Ser Gly Ala Ala Leu Phe Ser Tyr Phe Phe Asn Phe Leu Lys
145 150 155 160
Arg Pro Gly Glu Glu Phe Pro Phe Glu Ala Ile Arg Leu Ser Lys Arg
165 170 175
Glu Gln Asp Lys Met Arg Glu Met Phe Gly Thr Glu Pro Pro Glu Glu
180 185 190
Asp Phe Leu Ala Pro Ala Gln Ala Gly Ile Met Leu Met Cys Thr Ser
195 200 205
Arg Thr Ile Glu Ala Lys Tyr Ile Asp Tyr Cys Thr Glu Leu Cys Asn
210 215 220
Trp Lys Val Val Pro Val Gly Pro Pro Phe Gln Asp Leu Ile Thr Asn
225 230 235 240
Asp Ala Asp Asn Lys Glu Leu Ile Asp Trp Leu Gly Thr Lys Pro Glu
245 250 255
Asn Ser Thr Val Phe Val Ser Phe Gly Ser Glu Tyr Phe Leu Ser Lys
260 265 270
Glu Asp Met Glu Glu Ile Ala Phe Ala Leu Glu Ala Ser Asn Val Asn
275 280 285
Phe Ile Trp Val Val Arg Phe Pro Lys Gly Glu Glu Arg Asn Leu Glu
290 295 300
Asp Ala Leu Pro Glu Gly Phe Leu Glu Arg Ile Gly Glu Arg Gly Arg
305 310 315 320
Val Leu Asp Lys Phe Ala Pro Gln Pro Arg Ile Leu Asn His Pro Ser
325 330 335
Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn Ser Val Met Glu Ser
340 345 350
Ile Asp Phe Gly Val Pro Ile Ile Ala Met Pro Ile His Asn Asp Gln
355 360 365
Pro Ile Asn Ala Lys Leu Met Val Glu Leu Gly Val Ala Val Glu Ile
370 375 380
Val Arg Asp Asp Asp Gly Lys Ile His Arg Gly Glu Ile Ala Glu Ala
385 390 395 400
Leu Lys Ser Val Val Thr Gly Glu Thr Gly Glu Ile Leu Arg Ala Lys
405 410 415
Val Arg Glu Ile Ser Lys Asn Leu Lys Ser Ile Arg Asp Glu Glu Met
420 425 430
Asp Ala Val Ala Glu Glu Leu Ile Gln Leu Cys Arg Asn Ser Asn Lys
435 440 445
Ser Lys
450
<210> 35
<211> 1353
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.8
<400> 35
atgcaccatc atcatgaagg cgtgagcgac cagaccctga gagtaacgat gtttccgtgg 60
cttgggctgg gtcatgttaa cccgtttttg cgtatcgcta aacaactggc cgatcgtggt 120
ttcgttatct atttagttag taccgctatt aacctcgaaa tgatcaaaaa gagaatcccg 180
gagaaataca gtaatagcat ccatctggtt gagctgcgcc tgccagaatt accggaactg 240
ccaccacatt accatactac caacggttta ccaccgcatc tgaacaaaac cctgcacaag 300
gcactgaaga tgagcgctcc caactttagc aagatccttc aaaatattaa gccggacctg 360
gtcctttacg attttctggt tccgtgggca gaaaaagtcg cgcttgaaca gggcatcccg 420
gctgttccat tgctaaccag tggtgcggca ctgttcagct actttttcaa cttcctgaag 480
cgaccgggtg aagagtttcc gtttgaggca atccgcctgt cgaagcgaga acaggataag 540
atgcgcgaga tgtttggaac agagccgcct gaagaagatt ttttagcgcc ggcccaggcc 600
ggtatcatgc tgatgtgcac gagccgcgta attgaggcta agtacctgga ctattgtacc 660
gaactgacca atgtaaaagt tgttccggtt ggtccgccgt ttcaggatcc gctgaccgaa 720
gatattgacg accccgaact gatggattgg ttagatacca aacccgaaca tagtgttgtc 780
tatgtgtcgt ttggcagcga agcgttcctg agccgtgaag atatggaaga agtcgcgttc 840
ggcctggagc tgagcggcgt gaactttatc tgggttgcac gctttccgaa aggcgaagaa 900
cagcgtctgg aagacgttct gccagaaggc ttcctggaac gtattggtga acgtggtcgc 960
gttctggaca aattcgcgcc gcagccgcgc atcctgaacc acccgagcac cggcggtttc 1020
atctctcact gcggttggaa cagcgttatg gaaagcatcg acttcggtgt gccgatcatc 1080
gcgatgccga tccacaacga tcagccgatc aacgctaaac tgatggttga actgggcgtt 1140
gcggttgaaa tcgttcgtga tgatgatggt aaaatccacc gcggcgaaat cgcggaagca 1200
ctgaaaagcg ttgtgaccgg tgaaaccggc gaaatcctgc gtgcgaaagt tcgtgaaatc 1260
agcaaaaacc tgaaatccat ccgtgacgaa gaaatggacg cggttgctga agaactgatc 1320
cagctgtgcc gtaactctaa caaaagcaaa taa 1353
<210> 36
<211> 450
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.8
<400> 36
Met His His His His Glu Gly Val Ser Asp Gln Thr Leu Arg Val Thr
1 5 10 15
Met Phe Pro Trp Leu Gly Leu Gly His Val Asn Pro Phe Leu Arg Ile
20 25 30
Ala Lys Gln Leu Ala Asp Arg Gly Phe Val Ile Tyr Leu Val Ser Thr
35 40 45
Ala Ile Asn Leu Glu Met Ile Lys Lys Arg Ile Pro Glu Lys Tyr Ser
50 55 60
Asn Ser Ile His Leu Val Glu Leu Arg Leu Pro Glu Leu Pro Glu Leu
65 70 75 80
Pro Pro His Tyr His Thr Thr Asn Gly Leu Pro Pro His Leu Asn Lys
85 90 95
Thr Leu His Lys Ala Leu Lys Met Ser Ala Pro Asn Phe Ser Lys Ile
100 105 110
Leu Gln Asn Ile Lys Pro Asp Leu Val Leu Tyr Asp Phe Leu Val Pro
115 120 125
Trp Ala Glu Lys Val Ala Leu Glu Gln Gly Ile Pro Ala Val Pro Leu
130 135 140
Leu Thr Ser Gly Ala Ala Leu Phe Ser Tyr Phe Phe Asn Phe Leu Lys
145 150 155 160
Arg Pro Gly Glu Glu Phe Pro Phe Glu Ala Ile Arg Leu Ser Lys Arg
165 170 175
Glu Gln Asp Lys Met Arg Glu Met Phe Gly Thr Glu Pro Pro Glu Glu
180 185 190
Asp Phe Leu Ala Pro Ala Gln Ala Gly Ile Met Leu Met Cys Thr Ser
195 200 205
Arg Val Ile Glu Ala Lys Tyr Leu Asp Tyr Cys Thr Glu Leu Thr Asn
210 215 220
Val Lys Val Val Pro Val Gly Pro Pro Phe Gln Asp Pro Leu Thr Glu
225 230 235 240
Asp Ile Asp Asp Pro Glu Leu Met Asp Trp Leu Asp Thr Lys Pro Glu
245 250 255
His Ser Val Val Tyr Val Ser Phe Gly Ser Glu Ala Phe Leu Ser Arg
260 265 270
Glu Asp Met Glu Glu Val Ala Phe Gly Leu Glu Leu Ser Gly Val Asn
275 280 285
Phe Ile Trp Val Ala Arg Phe Pro Lys Gly Glu Glu Gln Arg Leu Glu
290 295 300
Asp Val Leu Pro Glu Gly Phe Leu Glu Arg Ile Gly Glu Arg Gly Arg
305 310 315 320
Val Leu Asp Lys Phe Ala Pro Gln Pro Arg Ile Leu Asn His Pro Ser
325 330 335
Thr Gly Gly Phe Ile Ser His Cys Gly Trp Asn Ser Val Met Glu Ser
340 345 350
Ile Asp Phe Gly Val Pro Ile Ile Ala Met Pro Ile His Asn Asp Gln
355 360 365
Pro Ile Asn Ala Lys Leu Met Val Glu Leu Gly Val Ala Val Glu Ile
370 375 380
Val Arg Asp Asp Asp Gly Lys Ile His Arg Gly Glu Ile Ala Glu Ala
385 390 395 400
Leu Lys Ser Val Val Thr Gly Glu Thr Gly Glu Ile Leu Arg Ala Lys
405 410 415
Val Arg Glu Ile Ser Lys Asn Leu Lys Ser Ile Arg Asp Glu Glu Met
420 425 430
Asp Ala Val Ala Glu Glu Leu Ile Gln Leu Cys Arg Asn Ser Asn Lys
435 440 445
Ser Lys
450
<210> 37
<211> 1329
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.9
<400> 37
atggcgacca acctgcgtgt tctgatgttc ccgtggctgg cgtacggcca catcagcccg 60
ttcctgaaca tcgcgaaaca gctggcggat cgtggtttcc tgatctatct gtgctccacc 120
cgcatcaacc tggaatctat catcaagaaa atcccggaaa aatacgcgga ttctatccat 180
ctgatcgaac ttcagctgcc ggagctgccg gaactgccgc cgcactatca caccactaac 240
ggtctgccgc cgcatctgaa cccgaccctg cacaaagcgc tgaaaatgtc taaaccgaac 300
tttagcaaga tccttcaaaa tattaagccg gacctggtcc tttacgattt tctggttccg 360
tgggcagaaa aagtcgcgct tgaacagggc atcccggctg ttccattgct aaccagtggt 420
gcggcactgt tcagctactt tttcaacttc ctgaagcgac cgggtgaaga gtttccgttt 480
gaggcaatcc gcctgtcgaa gcgagaacag gataagatgc gcgagatgtt tggaacagag 540
ccgcctgaag aagatttttt agcgccggcc caggccggta tcatgctgat gtgcacgagc 600
cgcgtaattg aggctaagta cctggactat tgtaccgaac tgaccaatgt aaaagttgtt 660
ccggttggtc cgccgtttca ggatccgctg accgaagata ttgacgaccc cgaactgatg 720
gattggttag ataccaaacc cgaacatagt gttgtctatg tgtcgtttgg cagcgaagcg 780
ttcctgagcc gtgaagatat ggaagaagtc gcgttcggcc tggagctgag cggcgtgaac 840
tttatctggg ttgcacgctt tccgaaaggc gaagaacagc gtctggaaga cgttctgcca 900
aaaggcttcc tggaacgcgt tggtgatcgt ggtcgcgttc tggaccatct ggtgccgcag 960
gcccatattc tgaaccatcc gagcacgggt ggcttcatct ctcattgcgg ttggaacagc 1020
gtcatggaaa gcattgattt cggcgttccg atcattgcga tgccgatgca gtgggatcag 1080
ccgattaacg cgagactgct tgtggaatta ggcgtggcag tggagatccc gcgtgatgaa 1140
gatggccggg tccaccgcgc cgaaattgcc cgtgtcctga aagatgtgat ttcgggcccg 1200
actggtgaga tactgcgcgc gaaagtacgc gacattagcg cacgcctgag agcgagacgc 1260
gaggaggaaa tgaacgcagc ggcggaagaa ctgatacagc tgtgtcgcaa ccgcaacgcc 1320
tacaagtaa 1329
<210> 38
<211> 442
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.9
<400> 38
Met Ala Thr Asn Leu Arg Val Leu Met Phe Pro Trp Leu Ala Tyr Gly
1 5 10 15
His Ile Ser Pro Phe Leu Asn Ile Ala Lys Gln Leu Ala Asp Arg Gly
20 25 30
Phe Leu Ile Tyr Leu Cys Ser Thr Arg Ile Asn Leu Glu Ser Ile Ile
35 40 45
Lys Lys Ile Pro Glu Lys Tyr Ala Asp Ser Ile His Leu Ile Glu Leu
50 55 60
Gln Leu Pro Glu Leu Pro Glu Leu Pro Pro His Tyr His Thr Thr Asn
65 70 75 80
Gly Leu Pro Pro His Leu Asn Pro Thr Leu His Lys Ala Leu Lys Met
85 90 95
Ser Lys Pro Asn Phe Ser Lys Ile Leu Gln Asn Ile Lys Pro Asp Leu
100 105 110
Val Leu Tyr Asp Phe Leu Val Pro Trp Ala Glu Lys Val Ala Leu Glu
115 120 125
Gln Gly Ile Pro Ala Val Pro Leu Leu Thr Ser Gly Ala Ala Leu Phe
130 135 140
Ser Tyr Phe Phe Asn Phe Leu Lys Arg Pro Gly Glu Glu Phe Pro Phe
145 150 155 160
Glu Ala Ile Arg Leu Ser Lys Arg Glu Gln Asp Lys Met Arg Glu Met
165 170 175
Phe Gly Thr Glu Pro Pro Glu Glu Asp Phe Leu Ala Pro Ala Gln Ala
180 185 190
Gly Ile Met Leu Met Cys Thr Ser Arg Val Ile Glu Ala Lys Tyr Leu
195 200 205
Asp Tyr Cys Thr Glu Leu Thr Asn Val Lys Val Val Pro Val Gly Pro
210 215 220
Pro Phe Gln Asp Pro Leu Thr Glu Asp Ile Asp Asp Pro Glu Leu Met
225 230 235 240
Asp Trp Leu Asp Thr Lys Pro Glu His Ser Val Val Tyr Val Ser Phe
245 250 255
Gly Ser Glu Ala Phe Leu Ser Arg Glu Asp Met Glu Glu Val Ala Phe
260 265 270
Gly Leu Glu Leu Ser Gly Val Asn Phe Ile Trp Val Ala Arg Phe Pro
275 280 285
Lys Gly Glu Glu Gln Arg Leu Glu Asp Val Leu Pro Lys Gly Phe Leu
290 295 300
Glu Arg Val Gly Asp Arg Gly Arg Val Leu Asp His Leu Val Pro Gln
305 310 315 320
Ala His Ile Leu Asn His Pro Ser Thr Gly Gly Phe Ile Ser His Cys
325 330 335
Gly Trp Asn Ser Val Met Glu Ser Ile Asp Phe Gly Val Pro Ile Ile
340 345 350
Ala Met Pro Met Gln Trp Asp Gln Pro Ile Asn Ala Arg Leu Leu Val
355 360 365
Glu Leu Gly Val Ala Val Glu Ile Pro Arg Asp Glu Asp Gly Arg Val
370 375 380
His Arg Ala Glu Ile Ala Arg Val Leu Lys Asp Val Ile Ser Gly Pro
385 390 395 400
Thr Gly Glu Ile Leu Arg Ala Lys Val Arg Asp Ile Ser Ala Arg Leu
405 410 415
Arg Ala Arg Arg Glu Glu Glu Met Asn Ala Ala Ala Glu Glu Leu Ile
420 425 430
Gln Leu Cys Arg Asn Arg Asn Ala Tyr Lys
435 440
<210> 39
<211> 1329
<212> DNA
<213> Artificial Sequence
<220>
<223> Enz.10
<400> 39
atggcgacca acctgcgtgt tctgatgttc ccgtggctgg cgtacggcca catcagcccg 60
ttcctgaaca tcgcgaaaca gctggcggat cgtggtttcc tgatctatct gtgctccacc 120
cgcatcaacc tggaatctat catcaagaaa atcccggaaa aatacgcgga ttctatccat 180
ctgatcgaac ttcagctgcc ggagctgccg gaactgccgc cgcactatca caccactaac 240
ggtctgccgc cgcatctgaa cccgaccctg cacaaagcgc tgaaaatgtc taaaccgaac 300
ttcagccgca tcttgcagaa cctgaaaccg gacctgctga tctacgatgt gctccagccg 360
tgggcggaac acgtggcgaa cgaacagggc atcccggctg gcaaactgct ggtttcttgc 420
gcggcggttt tctcctactt tttctctttc cgtaaaaatc cgggcgttga atttccgttc 480
ccggcgatcc acctgccgga agtggaaaaa gttaaaatcc gtgaaatcct ggctaaagaa 540
ccggaagaag gcggccgtct ggacgaaggc aacaaacaga tgatgctgat gtgcacttct 600
cgtaccattg aagctaaata cattgattac tgcaccgaac tgtgcaactg gaaagttgtt 660
ccggttggtc cgccgttcca ggatctgatc actaacgatg cggataacaa agaactgatc 720
gattggctgg gcaccaaacc ggaaaactcc gttgtctatg tgtcgtttgg cagcgaagcg 780
ttcctgagcc gtgaagatat ggaagaagtc gcgttcggcc tggagctgag cggcgtgaac 840
tttatctggg ttgcacgctt tccgaaaggc gaagaacagc gtctggaaga cgttctgcca 900
aaaggcttcc tggaacgcgt tggtgatcgt ggtcgcgttc tggaccatct ggtgccgcag 960
gcccatattc tgaaccatcc gagcacgggt ggcttcatct ctcattgcgg ttggaacagc 1020
gtcatggaaa gcattgattt cggcgttccg atcattgcga tgccgatgca gtgggatcag 1080
ccgattaacg cgagactgct tgtggaatta ggcgtggcag tggagatccc gcgtgatgaa 1140
gatggccggg tccaccgcgc cgaaattgcc cgtgtcctga aagatgtgat ttcgggcccg 1200
actggtgaga tactgcgcgc gaaagtacgc gacattagcg cacgcctgag agcgagacgc 1260
gaggaggaaa tgaacgcagc ggcggaagaa ctgatacagc tgtgtcgcaa ccgcaacgcc 1320
tacaagtaa 1329
<210> 40
<211> 442
<212> PRT
<213> Artificial Sequence
<220>
<223> Enz.10
<400> 40
Met Ala Thr Asn Leu Arg Val Leu Met Phe Pro Trp Leu Ala Tyr Gly
1 5 10 15
His Ile Ser Pro Phe Leu Asn Ile Ala Lys Gln Leu Ala Asp Arg Gly
20 25 30
Phe Leu Ile Tyr Leu Cys Ser Thr Arg Ile Asn Leu Glu Ser Ile Ile
35 40 45
Lys Lys Ile Pro Glu Lys Tyr Ala Asp Ser Ile His Leu Ile Glu Leu
50 55 60
Gln Leu Pro Glu Leu Pro Glu Leu Pro Pro His Tyr His Thr Thr Asn
65 70 75 80
Gly Leu Pro Pro His Leu Asn Pro Thr Leu His Lys Ala Leu Lys Met
85 90 95
Ser Lys Pro Asn Phe Ser Arg Ile Leu Gln Asn Leu Lys Pro Asp Leu
100 105 110
Leu Ile Tyr Asp Val Leu Gln Pro Trp Ala Glu His Val Ala Asn Glu
115 120 125
Gln Gly Ile Pro Ala Gly Lys Leu Leu Val Ser Cys Ala Ala Val Phe
130 135 140
Ser Tyr Phe Phe Ser Phe Arg Lys Asn Pro Gly Val Glu Phe Pro Phe
145 150 155 160
Pro Ala Ile His Leu Pro Glu Val Glu Lys Val Lys Ile Arg Glu Ile
165 170 175
Leu Ala Lys Glu Pro Glu Glu Gly Gly Arg Leu Asp Glu Gly Asn Lys
180 185 190
Gln Met Met Leu Met Cys Thr Ser Arg Thr Ile Glu Ala Lys Tyr Ile
195 200 205
Asp Tyr Cys Thr Glu Leu Cys Asn Trp Lys Val Val Pro Val Gly Pro
210 215 220
Pro Phe Gln Asp Leu Ile Thr Asn Asp Ala Asp Asn Lys Glu Leu Ile
225 230 235 240
Asp Trp Leu Gly Thr Lys Pro Glu Asn Ser Val Val Tyr Val Ser Phe
245 250 255
Gly Ser Glu Ala Phe Leu Ser Arg Glu Asp Met Glu Glu Val Ala Phe
260 265 270
Gly Leu Glu Leu Ser Gly Val Asn Phe Ile Trp Val Ala Arg Phe Pro
275 280 285
Lys Gly Glu Glu Gln Arg Leu Glu Asp Val Leu Pro Lys Gly Phe Leu
290 295 300
Glu Arg Val Gly Asp Arg Gly Arg Val Leu Asp His Leu Val Pro Gln
305 310 315 320
Ala His Ile Leu Asn His Pro Ser Thr Gly Gly Phe Ile Ser His Cys
325 330 335
Gly Trp Asn Ser Val Met Glu Ser Ile Asp Phe Gly Val Pro Ile Ile
340 345 350
Ala Met Pro Met Gln Trp Asp Gln Pro Ile Asn Ala Arg Leu Leu Val
355 360 365
Glu Leu Gly Val Ala Val Glu Ile Pro Arg Asp Glu Asp Gly Arg Val
370 375 380
His Arg Ala Glu Ile Ala Arg Val Leu Lys Asp Val Ile Ser Gly Pro
385 390 395 400
Thr Gly Glu Ile Leu Arg Ala Lys Val Arg Asp Ile Ser Ala Arg Leu
405 410 415
Arg Ala Arg Arg Glu Glu Glu Met Asn Ala Ala Ala Glu Glu Leu Ile
420 425 430
Gln Leu Cys Arg Asn Arg Asn Ala Tyr Lys
435 440
Claims (11)
1. Glycosyltransferase, characterized in that its amino acid sequence comprises at least the amino acid residue differences compared to SEQ ID No. 4 selected from the group consisting of:
(1) Having one or more of the following amino acid residue differences:
the amino acid residue at position 429 is D;
the amino acid residue at position 433 is D;
the amino acid residue at position 435 is V;
the amino acid residue at position 446 is S;
the amino acid residue at position 448 is K; and
the 449 amino acid residue is S;
(2) Having one or more of the following amino acid residue differences:
deletion of amino acid residues at positions 1 to 8;
the amino acid residue at position 9 is M;
the amino acid residue at position 10 is A;
the amino acid residue at position 11 is T;
the amino acid residue at position 12 is N;
the amino acid residue at position 16 is L;
the amino acid residue at position 22 is A;
the amino acid residue at position 23 is Y;
the amino acid residue at position 26 is I;
the amino acid residue at position 27 is S;
the amino acid residue at position 31 is N;
the amino acid residue at position 42 is L;
the amino acid residue at position 46 is C;
the amino acid residue at position 49 is R;
the amino acid residue at position 54 is S;
amino acid residue at position 56 is I;
the amino acid residue at position 58 is K;
the amino acid residue at position 64 is A;
the amino acid residue at position 65 is D;
the amino acid residue at position 70 is I;
the amino acid residue at position 73 is Q;
the amino acid residue at position 96 is P; and
the amino acid residue at position 106 is K.
2. The glycosyltransferase of claim 1, wherein the glycosyltransferase comprises,
(1) The amino acid sequence of the glycosyltransferase further comprises one or more amino acid residue differences compared to SEQ ID No. 4 selected from the group consisting of:
the amino acid residue at position 399 is E;
the amino acid residue at position 400 is A;
the amino acid residue at position 403 is S;
the amino acid residue at position 405 is V;
the amino acid residue at position 406 is T;
the amino acid residue at position 408 is E;
the amino acid residue at position 419 is E;
the 422 th amino acid residue is K;
the amino acid residue at position 423 is N;
the amino acid residue at position 425 is K;
the amino acid residue at position 426 is S; and
the amino acid residue at position 427 is I;
preferably, the amino acid sequence of the glycosyltransferase further comprises one or more amino acid residue differences compared to SEQ ID NO. 4 selected from the group consisting of:
the amino acid residue at position 373 is K;
amino acid residue at position 375 is M;
the amino acid residue at position 385 is V;
the amino acid residue at position 388 is D;
the amino acid residue at position 391 is K;
the amino acid residue at position 392 is I; and
the amino acid residue at position 395 is G;
more preferably, the amino acid sequence of the glycosyltransferase further comprises amino acid residue differences at one or more residue positions selected from the following positions compared to SEQ ID NO: 4:
amino acid residue 309 is E;
amino acid residue at position 315 is I;
the amino acid residue at position 317 is E;
the amino acid residue at position 324 is K;
the amino acid residue at position 325 is F;
the amino acid residue at position 326 is A;
the amino acid residue at position 329 is P;
amino acid residue at position 330 is R;
amino acid residue at position 364 is I;
the amino acid residue at position 365 is H; and
the amino acid residue at position 366 is N.
3. The glycosyltransferase of claim 1 or 2, wherein the glycosyltransferase does not comprise one or more amino acid differences from positions 210 to 257 as compared to SEQ ID No. 4;
preferably, the amino acid sequence of the glycosyltransferase is shown as SEQ ID NO. 32; or, as shown in SEQ ID NO. 38; or as shown in SEQ ID NO. 30; or, as shown in SEQ ID NO. 28; or, as shown in SEQ ID NO. 36.
4. An isolated nucleic acid encoding the glycosyltransferase of any one of claims 1-3.
5. A recombinant expression vector comprising the nucleic acid of claim 4.
6. A transformant comprising the nucleic acid of claim 4 or the recombinant expression vector of claim 5.
7. A method of preparing the glycosyltransferase of any one of claims 1-3, comprising culturing the transformant of claim 6 under conditions suitable for expression of the glycosyltransferase.
8. A method of preparing rebaudioside E, the method comprising: glycosyltransferases transfer a glycosyl group on an activated glycosyl donor to a glycosyl acceptor;
wherein the glycosyltransferase is as claimed in any one of claims 1 to 3; the glycosyl acceptor is stevioside; the glycosyl donor is uridine diphosphate glucose and/or adenosine diphosphate glucose;
preferably, the uridine diphosphate glucose and/or adenosine diphosphate glucose are produced by the decomposition synthesis of sucrose; the amino acid sequence of the sucrose synthase is shown as SEQ ID NO. 24, and the nucleotide sequence for encoding the sucrose synthase is preferably shown as SEQ ID NO. 23;
more preferably, the glycosyltransferase and the sucrose synthase are used in the form of a crude enzyme solution, a pure enzyme, an immobilized enzyme or a cell expressing the glycosyltransferase and the sucrose synthase;
even more preferably, the mass ratio of cells expressing the glycosyltransferase to stevioside is 3 (9-30), preferably 3:20; the mass ratio of the cell expressing the sucrose synthase to sucrose is 3 (150-300), preferably 3:200; the mass ratio of the sucrose to the stevioside is (0.5-3) 1, preferably 2:1; the mass ratio of the sucrose to the uridine diphosphate glucose or the adenosine diphosphate glucose is (500-3000) 1, preferably 2000:1.
9. The method according to claim 8, wherein the stevioside concentration in the reaction system used in the method is 50-250 g/L, the pH is 5-8, and the reaction temperature is 20-90 ℃;
preferably, each 10mL of the reaction system comprises: 1.5mL of glycosyltransferase, 0.3mL of sucrose synthase, 2g of sucrose, 1g of stevioside, 1mg of uridine diphosphate or adenosine diphosphate, pH 5.5, and reaction temperature of 60 ℃.
10. An enzyme combination comprising a glycosyltransferase according to any one of claims 1 to 3 and a sucrose synthase having an amino acid sequence as shown in SEQ ID No. 24;
preferably, the nucleotide sequence of the sucrose synthase is shown as SEQ ID NO. 23; and/or the sucrose synthase and the glycosyltransferase are used in a mass ratio of 1 (3-10), preferably in a mass ratio of 1:5.
11. Use of the glycosyltransferase of any one of claims 1-3 or the enzyme combination of claim 10 in the preparation of rebaudioside D or rebaudioside E;
preferably, the rebaudioside D is prepared by rebaudioside a; the rebaudioside E is prepared from stevioside.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210114711.6A CN116555210A (en) | 2022-01-30 | 2022-01-30 | Glycosyltransferases and their use in the preparation of rebaudioside E |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210114711.6A CN116555210A (en) | 2022-01-30 | 2022-01-30 | Glycosyltransferases and their use in the preparation of rebaudioside E |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116555210A true CN116555210A (en) | 2023-08-08 |
Family
ID=87490390
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210114711.6A Pending CN116555210A (en) | 2022-01-30 | 2022-01-30 | Glycosyltransferases and their use in the preparation of rebaudioside E |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116555210A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109750072A (en) * | 2019-01-31 | 2019-05-14 | 南京工业大学 | Method for preparing rebaudioside E by enzyme method |
CN112375750A (en) * | 2020-12-02 | 2021-02-19 | 南京工业大学 | Glycosyltransferase mutant and method for catalytically synthesizing rebaudioside A by using same |
CN112805295A (en) * | 2018-07-30 | 2021-05-14 | 科德克希思公司 | Engineering glycosyltransferases and methods of glycosylation of steviol glycosides |
-
2022
- 2022-01-30 CN CN202210114711.6A patent/CN116555210A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112805295A (en) * | 2018-07-30 | 2021-05-14 | 科德克希思公司 | Engineering glycosyltransferases and methods of glycosylation of steviol glycosides |
CN109750072A (en) * | 2019-01-31 | 2019-05-14 | 南京工业大学 | Method for preparing rebaudioside E by enzyme method |
CN112375750A (en) * | 2020-12-02 | 2021-02-19 | 南京工业大学 | Glycosyltransferase mutant and method for catalytically synthesizing rebaudioside A by using same |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109750072B (en) | Method for preparing rebaudioside E by enzyme method | |
CN112080480B (en) | Glycosyltransferase mutants and uses thereof | |
CN111518782B (en) | Glycosyltransferase UGTZJ1 mutant and application thereof | |
US10472660B2 (en) | Method for preparing rebaudioside A from stevioside | |
EP4365285A1 (en) | Glycosyltransferase mutant and method for catalytic synthesis of rebaudioside m by means of using same | |
WO2023087518A1 (en) | Method for efficient biosynthesis of rebaudioside d using glycosyltransferase | |
CN115678867B (en) | Sucrose synthase and application thereof | |
EP4349989A1 (en) | Glycosyltransferase and application thereof | |
CN115449514B (en) | Beta-1, 2-glycosyltransferase and application thereof | |
CN115418358B (en) | Glycosyltransferase and application thereof | |
CN116555210A (en) | Glycosyltransferases and their use in the preparation of rebaudioside E | |
Kang et al. | Preparative synthesis of dTDP‐l‐rhamnose through combined enzymatic pathways | |
CN115725528B (en) | Glycosyltransferase and application thereof | |
CN116656641A (en) | Caffeic acid O-methyltransferase mutant and application thereof | |
CN116355874A (en) | Glycosyltransferase mutant and application thereof in preparation of quercetin-3-O rhamnoside | |
CN111019918B (en) | Glycosyltransferase mutant and application thereof | |
CN115478060B (en) | Glycosyltransferase and application thereof | |
CN110892068A (en) | UDP-glycosyltransferase | |
CN115404226A (en) | Sucrose synthase and application thereof in catalytic glycosylation reaction | |
CN106929525B (en) | Genetically engineered bacterium and application thereof in preparation of rebaudioside A | |
US20120315673A1 (en) | Microorganisms having enhanced sucrose mutase activity | |
CN118755690B (en) | Sucrose synthase and application thereof in biosynthesis of stevioside | |
US6841368B1 (en) | Enzymatic production of difructose dianhydride IV from sucrose and relevant enzymes and genes coding for them | |
CN116334162A (en) | Preparation method and application of rebaudioside I | |
CN113881649B (en) | Glycosyltransferase OsUGT91C1 mutant and application thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant after: Yikelai Biotechnology (Group) Co.,Ltd. Address before: Room 3114, Building B, 555 Dongchuan Road, Minhang District, Shanghai, 200241 Applicant before: Ecolab Biotechnology (Shanghai) Co.,Ltd. |
|
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |