CN115478041A - 一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用 - Google Patents
一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用 Download PDFInfo
- Publication number
- CN115478041A CN115478041A CN202110596118.5A CN202110596118A CN115478041A CN 115478041 A CN115478041 A CN 115478041A CN 202110596118 A CN202110596118 A CN 202110596118A CN 115478041 A CN115478041 A CN 115478041A
- Authority
- CN
- China
- Prior art keywords
- dehydrogenase
- gly
- gene
- ala
- leu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- RBYJWCRKFLGNDB-UHFFFAOYSA-N 5-methylpyrazine-2-carboxylic acid Chemical compound CC1=CN=C(C(O)=O)C=N1 RBYJWCRKFLGNDB-UHFFFAOYSA-N 0.000 title claims abstract description 45
- 241000894006 Bacteria Species 0.000 title claims abstract description 25
- 238000010276 construction Methods 0.000 title claims abstract description 25
- 230000002194 synthesizing effect Effects 0.000 title claims abstract description 13
- HUMNYLRZRPPJDN-UHFFFAOYSA-N benzaldehyde Chemical compound O=CC1=CC=CC=C1 HUMNYLRZRPPJDN-UHFFFAOYSA-N 0.000 claims abstract description 110
- 108010034306 xylene monooxygenase Proteins 0.000 claims abstract description 63
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims abstract description 60
- 108010043325 Aryl-alcohol dehydrogenase Proteins 0.000 claims abstract description 59
- 108010043075 L-threonine 3-dehydrogenase Proteins 0.000 claims abstract description 59
- QNGNSVIICDLXHT-UHFFFAOYSA-N para-ethylbenzaldehyde Natural products CCC1=CC=C(C=O)C=C1 QNGNSVIICDLXHT-UHFFFAOYSA-N 0.000 claims abstract description 56
- 101710088194 Dehydrogenase Proteins 0.000 claims abstract description 55
- 238000006243 chemical reaction Methods 0.000 claims abstract description 32
- 239000004473 Threonine Substances 0.000 claims abstract description 30
- 229960002898 threonine Drugs 0.000 claims abstract description 30
- 238000000855 fermentation Methods 0.000 claims abstract description 27
- 230000004151 fermentation Effects 0.000 claims abstract description 27
- 239000001963 growth medium Substances 0.000 claims abstract description 12
- 239000013604 expression vector Substances 0.000 claims description 68
- 238000010367 cloning Methods 0.000 claims description 43
- 108090000623 proteins and genes Proteins 0.000 claims description 33
- 239000013613 expression plasmid Substances 0.000 claims description 30
- 238000000034 method Methods 0.000 claims description 21
- 241000588724 Escherichia coli Species 0.000 claims description 20
- 238000003259 recombinant expression Methods 0.000 claims description 19
- 150000001413 amino acids Chemical group 0.000 claims description 11
- 241000589517 Pseudomonas aeruginosa Species 0.000 claims description 10
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 claims description 8
- 230000035484 reaction time Effects 0.000 claims description 6
- HUMNYLRZRPPJDN-KWCOIAHCSA-N benzaldehyde Chemical group O=[11CH]C1=CC=CC=C1 HUMNYLRZRPPJDN-KWCOIAHCSA-N 0.000 claims description 2
- 239000000758 substrate Substances 0.000 abstract description 36
- 230000015572 biosynthetic process Effects 0.000 abstract description 10
- 238000004519 manufacturing process Methods 0.000 abstract description 10
- 238000003786 synthesis reaction Methods 0.000 abstract description 10
- 108091008146 restriction endonucleases Proteins 0.000 description 50
- 239000012634 fragment Substances 0.000 description 49
- 239000013612 plasmid Substances 0.000 description 48
- 101150058720 tdh gene Proteins 0.000 description 39
- 101100071188 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) tdh1 gene Proteins 0.000 description 37
- 101100071190 Vibrio parahaemolyticus serotype O3:K6 (strain RIMD 2210633) tdh2 gene Proteins 0.000 description 37
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 32
- 102000004190 Enzymes Human genes 0.000 description 29
- 108090000790 Enzymes Proteins 0.000 description 29
- 230000003321 amplification Effects 0.000 description 29
- 238000003199 nucleic acid amplification method Methods 0.000 description 29
- 108010002747 Pfu DNA polymerase Proteins 0.000 description 28
- 230000000977 initiatory effect Effects 0.000 description 28
- 239000013598 vector Substances 0.000 description 27
- 238000012163 sequencing technique Methods 0.000 description 25
- 238000005520 cutting process Methods 0.000 description 23
- 102000012410 DNA Ligases Human genes 0.000 description 21
- 108010061982 DNA Ligases Proteins 0.000 description 21
- 239000000047 product Substances 0.000 description 19
- 239000002609 medium Substances 0.000 description 18
- LCZUOKDVTBMCMX-UHFFFAOYSA-N 2,5-Dimethylpyrazine Chemical compound CC1=CN=C(C)C=N1 LCZUOKDVTBMCMX-UHFFFAOYSA-N 0.000 description 15
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 15
- 239000000411 inducer Substances 0.000 description 14
- 230000006698 induction Effects 0.000 description 14
- 210000004027 cell Anatomy 0.000 description 12
- 238000004128 high performance liquid chromatography Methods 0.000 description 12
- 235000011187 glycerol Nutrition 0.000 description 11
- 239000001888 Peptone Substances 0.000 description 9
- 108010080698 Peptones Proteins 0.000 description 9
- 229940041514 candida albicans extract Drugs 0.000 description 9
- 235000019319 peptone Nutrition 0.000 description 9
- 238000012545 processing Methods 0.000 description 9
- 230000001502 supplementing effect Effects 0.000 description 9
- 239000012138 yeast extract Substances 0.000 description 9
- 238000003776 cleavage reaction Methods 0.000 description 8
- 238000012258 culturing Methods 0.000 description 8
- 108010050848 glycylleucine Proteins 0.000 description 8
- 229930027917 kanamycin Natural products 0.000 description 8
- 229960000318 kanamycin Drugs 0.000 description 8
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 8
- 229930182823 kanamycin A Natural products 0.000 description 8
- 239000002773 nucleotide Substances 0.000 description 8
- 125000003729 nucleotide group Chemical group 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 238000012216 screening Methods 0.000 description 8
- 101100460704 Aspergillus sp. (strain MF297-2) notI gene Proteins 0.000 description 7
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 7
- 229960000723 ampicillin Drugs 0.000 description 7
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 7
- 229910052760 oxygen Inorganic materials 0.000 description 7
- 239000001301 oxygen Substances 0.000 description 7
- 239000001934 2,5-dimethylpyrazine Substances 0.000 description 6
- NKTOLZVEWDHZMU-UHFFFAOYSA-N 2,5-xylenol Chemical compound CC1=CC=C(C)C(O)=C1 NKTOLZVEWDHZMU-UHFFFAOYSA-N 0.000 description 6
- 239000000306 component Substances 0.000 description 6
- 238000001816 cooling Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 239000000843 powder Substances 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000003814 drug Substances 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 3
- 108010007843 NADH oxidase Proteins 0.000 description 3
- 239000002518 antifoaming agent Substances 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000012526 feed medium Substances 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 108010064235 lysylglycine Proteins 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- 102000005369 Aldehyde Dehydrogenase Human genes 0.000 description 2
- 108020002663 Aldehyde Dehydrogenase Proteins 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 241000276408 Bacillus subtilis subsp. subtilis str. 168 Species 0.000 description 2
- OOLCSQQPSLIETN-JYJNAYRXSA-N Gln-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)O OOLCSQQPSLIETN-JYJNAYRXSA-N 0.000 description 2
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 2
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 2
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- 102100030764 Inactive L-threonine 3-dehydrogenase, mitochondrial Human genes 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 2
- RSTWKJFWBKFOFC-JYJNAYRXSA-N Pro-Trp-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O RSTWKJFWBKFOFC-JYJNAYRXSA-N 0.000 description 2
- 241000589776 Pseudomonas putida Species 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 230000002210 biocatalytic effect Effects 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 238000006555 catalytic reaction Methods 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000003912 environmental pollution Methods 0.000 description 2
- 238000010353 genetic engineering Methods 0.000 description 2
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 2
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 2
- 108010010147 glycylglutamine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 239000002054 inoculum Substances 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000012533 medium component Substances 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 230000003647 oxidation Effects 0.000 description 2
- 238000007254 oxidation reaction Methods 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010051242 phenylalanylserine Proteins 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- VDZOOKBUILJEDG-UHFFFAOYSA-M tetrabutylammonium hydroxide Chemical compound [OH-].CCCC[N+](CCCC)(CCCC)CCCC VDZOOKBUILJEDG-UHFFFAOYSA-M 0.000 description 2
- 108010061238 threonyl-glycine Proteins 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- SAUCHDKDCUROAO-UHFFFAOYSA-N 2-amino-3-oxobutanoic acid Chemical compound CC(=O)C(N)C(O)=O SAUCHDKDCUROAO-UHFFFAOYSA-N 0.000 description 1
- DJQOOSBJCLSSEY-UHFFFAOYSA-N Acipimox Chemical compound CC1=CN=C(C(O)=O)C=[N+]1[O-] DJQOOSBJCLSSEY-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 1
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 1
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- WEZNQZHACPSMEF-QEJZJMRPSA-N Ala-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 WEZNQZHACPSMEF-QEJZJMRPSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- RWWPBOUMKFBHAL-FXQIFTODSA-N Arg-Asn-Cys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O RWWPBOUMKFBHAL-FXQIFTODSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 1
- PCQXGEUALSFGIA-WDSOQIARSA-N Arg-His-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O PCQXGEUALSFGIA-WDSOQIARSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- SXNJBDYEBOUYOJ-DCAQKATOSA-N Asn-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N SXNJBDYEBOUYOJ-DCAQKATOSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- VCJCPARXDBEGNE-GUBZILKMSA-N Asn-Pro-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 VCJCPARXDBEGNE-GUBZILKMSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 1
- YRBGRUOSJROZEI-NHCYSSNCSA-N Asp-His-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O YRBGRUOSJROZEI-NHCYSSNCSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- PYXXJFRXIYAESU-PCBIJLKTSA-N Asp-Ile-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PYXXJFRXIYAESU-PCBIJLKTSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- BWJZSLQJNBSUPM-FXQIFTODSA-N Asp-Pro-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O BWJZSLQJNBSUPM-FXQIFTODSA-N 0.000 description 1
- XUVTWGPERWIERB-IHRRRGAJSA-N Asp-Pro-Phe Chemical compound N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O XUVTWGPERWIERB-IHRRRGAJSA-N 0.000 description 1
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- HQZGVYJBRSISDT-BQBZGAKWSA-N Cys-Gly-Arg Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQZGVYJBRSISDT-BQBZGAKWSA-N 0.000 description 1
- RWAZRMXTVSIVJR-YUMQZZPRSA-N Cys-Gly-His Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC1=CNC=N1)C(O)=O RWAZRMXTVSIVJR-YUMQZZPRSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 101100323111 Escherichia coli (strain K12) tynA gene Proteins 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- LKUWAWGNJYJODH-KBIXCLLPSA-N Gln-Ala-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKUWAWGNJYJODH-KBIXCLLPSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- JILRMFFFCHUUTJ-ACZMJKKPSA-N Gln-Ser-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O JILRMFFFCHUUTJ-ACZMJKKPSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 1
- QZQYITIKPAUDGN-GVXVVHGQSA-N Gln-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N QZQYITIKPAUDGN-GVXVVHGQSA-N 0.000 description 1
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- DXMOIVCNJIJQSC-QEJZJMRPSA-N Glu-Trp-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N DXMOIVCNJIJQSC-QEJZJMRPSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- SABZDFAAOJATBR-QWRGUYRKSA-N Gly-Cys-Phe Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SABZDFAAOJATBR-QWRGUYRKSA-N 0.000 description 1
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 1
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 1
- GMIWMPUGTFQFHK-KCTSRDHCSA-N His-Ala-Trp Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O GMIWMPUGTFQFHK-KCTSRDHCSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- SYIPVNMWBZXKMU-HJPIBITLSA-N His-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N SYIPVNMWBZXKMU-HJPIBITLSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- KHUFDBQXGLEIHC-BZSNNMDCSA-N His-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 KHUFDBQXGLEIHC-BZSNNMDCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- IPYVXYDYLHVWHU-GMOBBJLQSA-N Ile-Asn-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N IPYVXYDYLHVWHU-GMOBBJLQSA-N 0.000 description 1
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- URWXDJAEEGBADB-TUBUOCAGSA-N Ile-His-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N URWXDJAEEGBADB-TUBUOCAGSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- IALVDKNUFSTICJ-GMOBBJLQSA-N Ile-Met-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IALVDKNUFSTICJ-GMOBBJLQSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- CRYJOCSSSACEAA-VKOGCVSHSA-N Ile-Trp-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CRYJOCSSSACEAA-VKOGCVSHSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 1
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 1
- CUXRXAIAVYLVFD-ULQDDVLXSA-N Leu-Arg-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUXRXAIAVYLVFD-ULQDDVLXSA-N 0.000 description 1
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 1
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- QONKWXNJRRNTBV-AVGNSLFASA-N Leu-Pro-Met Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N QONKWXNJRRNTBV-AVGNSLFASA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- GHOIOYHDDKXIDX-SZMVWBNQSA-N Lys-Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 GHOIOYHDDKXIDX-SZMVWBNQSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 1
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 1
- BOJYMMBYBNOOGG-DCAQKATOSA-N Lys-Pro-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BOJYMMBYBNOOGG-DCAQKATOSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- 101150051213 MAOA gene Proteins 0.000 description 1
- CTVJSFRHUOSCQQ-DCAQKATOSA-N Met-Arg-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTVJSFRHUOSCQQ-DCAQKATOSA-N 0.000 description 1
- BQVJARUIXRXDKN-DCAQKATOSA-N Met-Asn-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BQVJARUIXRXDKN-DCAQKATOSA-N 0.000 description 1
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 1
- CHQWUYSNAOABIP-ZPFDUUQYSA-N Met-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N CHQWUYSNAOABIP-ZPFDUUQYSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- JZNGSNMTXAHMSV-AVGNSLFASA-N Met-His-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JZNGSNMTXAHMSV-AVGNSLFASA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 1
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- VEKRTVRZDMUOQN-AVGNSLFASA-N Met-Val-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 VEKRTVRZDMUOQN-AVGNSLFASA-N 0.000 description 1
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 1
- 101100046676 Neosartorya fumigata (strain ATCC MYA-4609 / Af293 / CBS 101355 / FGSC A1100) tpcA gene Proteins 0.000 description 1
- 102000004316 Oxidoreductases Human genes 0.000 description 1
- 108090000854 Oxidoreductases Proteins 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 1
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 1
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- KIQUCMUULDXTAZ-HJOGWXRNSA-N Phe-Tyr-Tyr Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O KIQUCMUULDXTAZ-HJOGWXRNSA-N 0.000 description 1
- 108030000925 Primary-amine oxidases Proteins 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- SRBFGSGDNNQABI-FHWLQOOXSA-N Pro-Leu-Trp Chemical compound N([C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 SRBFGSGDNNQABI-FHWLQOOXSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 241000831652 Salinivibrio sharmensis Species 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 1
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- KAJRRNHOVMZYBL-IRIUXVKKSA-N Thr-Tyr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAJRRNHOVMZYBL-IRIUXVKKSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- ZKVANNIVSDOQMG-HKUYNNGSSA-N Trp-Tyr-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)NCC(=O)O)N ZKVANNIVSDOQMG-HKUYNNGSSA-N 0.000 description 1
- YCQKQFKXBPJXRY-PMVMPFDFSA-N Trp-Tyr-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCCCN)C(=O)O)N YCQKQFKXBPJXRY-PMVMPFDFSA-N 0.000 description 1
- FFWCYWZIVFIUDM-OYDLWJJNSA-N Trp-Val-Trp Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O FFWCYWZIVFIUDM-OYDLWJJNSA-N 0.000 description 1
- NOXKHHXSHQFSGJ-FQPOAREZSA-N Tyr-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NOXKHHXSHQFSGJ-FQPOAREZSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- IUQDEKCCHWRHRW-IHPCNDPISA-N Tyr-Asn-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IUQDEKCCHWRHRW-IHPCNDPISA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- XQYHLZNPOTXRMQ-KKUMJFAQSA-N Tyr-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XQYHLZNPOTXRMQ-KKUMJFAQSA-N 0.000 description 1
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 1
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 1
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- GMOLURHJBLOBFW-ONGXEEELSA-N Val-Gly-His Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMOLURHJBLOBFW-ONGXEEELSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 1
- 230000001133 acceleration Effects 0.000 description 1
- 229960003526 acipimox Drugs 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 230000003276 anti-hypertensive effect Effects 0.000 description 1
- 230000002365 anti-tubercular Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000036983 biotransformation Effects 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 239000006285 cell suspension Substances 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 239000013530 defoamer Substances 0.000 description 1
- 108010033011 des-Arg- enterostatin Proteins 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 238000002848 electrochemical method Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 238000012215 gene cloning Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- ZJJXGWJIGJFDTL-UHFFFAOYSA-N glipizide Chemical compound C1=NC(C)=CN=C1C(=O)NCCC1=CC=C(S(=O)(=O)NC(=O)NC2CCCCC2)C=C1 ZJJXGWJIGJFDTL-UHFFFAOYSA-N 0.000 description 1
- 229960001381 glipizide Drugs 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 238000005469 granulation Methods 0.000 description 1
- 230000003179 granulation Effects 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 230000002218 hypoglycaemic effect Effects 0.000 description 1
- 239000012535 impurity Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 229910017053 inorganic salt Inorganic materials 0.000 description 1
- 239000002085 irritant Substances 0.000 description 1
- 231100000021 irritant Toxicity 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- CRBOSZMVDHYLJE-UHFFFAOYSA-N methyl 5-methylpyrazine-2-carboxylate Chemical compound COC(=O)C1=CN=C(C)C=N1 CRBOSZMVDHYLJE-UHFFFAOYSA-N 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000005817 monooxygenase reaction Methods 0.000 description 1
- 238000007040 multi-step synthesis reaction Methods 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 239000007800 oxidant agent Substances 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 235000018102 proteins Nutrition 0.000 description 1
- 102000004169 proteins and genes Human genes 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 125000003373 pyrazinyl group Chemical group 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0008—Oxidoreductases (1.) acting on the aldehyde or oxo group of donors (1.2)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0071—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14)
- C12N9/0073—Oxidoreductases (1.) acting on paired donors with incorporation of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen 1.14.13
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/10—Nitrogen as only ring hetero atom
- C12P17/12—Nitrogen as only ring hetero atom containing a six-membered hetero ring
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y101/00—Oxidoreductases acting on the CH-OH group of donors (1.1)
- C12Y101/01—Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
- C12Y101/01103—L-Threonine 3-dehydrogenase (1.1.1.103)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y101/00—Oxidoreductases acting on the CH-OH group of donors (1.1)
- C12Y101/01—Oxidoreductases acting on the CH-OH group of donors (1.1) with NAD+ or NADP+ as acceptor (1.1.1)
- C12Y101/01166—Hydroxycyclohexanecarboxylate dehydrogenase (1.1.1.166)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y102/00—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2)
- C12Y102/01—Oxidoreductases acting on the aldehyde or oxo group of donors (1.2) with NAD+ or NADP+ as acceptor (1.2.1)
- C12Y102/01007—Benzaldehyde dehydrogenase (NADP+) (1.2.1.7)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y114/00—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14)
- C12Y114/13—Oxidoreductases acting on paired donors, with incorporation or reduction of molecular oxygen (1.14) with NADH or NADPH as one donor, and incorporation of one atom of oxygen (1.14.13)
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
本发明属于生物合成技术领域,具体涉及一种高效合成5‑甲基吡嗪‑2‑羧酸工程菌的构建及应用。本发明所述重组工程菌表达了苏氨酸脱氢酶(TDH)、二甲苯单加氧酶(XMO)、苯甲醇脱氢酶(BADH)和苯甲醛脱氢酶(BZDH)。并利用本发明所述的重组工程菌为出发菌株,在含有L‑苏氨酸的培养基中进行发酵生产得到5‑甲基吡嗪‑2‑羧酸。本发明实现了以L‑苏氨酸为底物一步高效合成5‑甲基吡嗪‑2‑羧酸,转化率高达85%以上,本发明的方法工艺简单,生产周期短,底物转化率高,产物易于从反应液中分离纯化,工艺污染小,能耗低,易于放大生产。
Description
技术领域
本发明属于生物合成技术领域,具体涉及一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用。
背景技术
5-甲基吡嗪-2-羧酸(5-Methylpyrazine-2-carboxylic acid,MPCA)是一种米白色固体结晶,CAS号为5521-55-1,分子式为C6H6N2O2,分子量为138.12,熔点166-169℃,具有刺激性气味,暴露在空气中会缓慢氧化。5-甲基吡嗪-2-羧酸在医药工业领域有着广泛的用途,主要用于合成降血糖药物格列吡嗪、新型抗高血压药物阿昔莫司及抗结核药物5-甲基吡嗪-2-羧酸甲酯等药物。
目前其合成主要采用化学合成法,可分为分子间环合法、吡嗪侧链多步合成法、直接氧化法和电化学法。但化学合成法存在反应条件要求高、使用大量氧化剂、生产成本高、对环境污染较大的问题。生物转化法则具有合成工艺简单,底物选择性好,催化效率高,杂质少,对环境污染小的优点,极具发展前景。
瑞士Lonza公司已经实现了利用含二甲苯单加氧酶的菌株在批次补料反应器中制备5-甲基吡嗪-2羧酸。郑裕国等,化学与生物工程,2012,29(9):19-25,筛选得到一株含二甲苯单加氧酶的恶臭假单胞菌,并利用该菌株进行5-甲基吡嗪-2-羧酸的生物制备,经过补料发酵产率可达到75.6%,产物浓度达到20.41g/L,周期长达22天。
CN106434434A公开了一株具有高区域选择性的催化单加氧反应的产二甲苯单加氧酶的菌株HW-1,并利用该菌株发酵制备5-甲基吡嗪-2-羧酸,经过摇瓶发酵并适时进行补料,产物累积浓度可达到34.19g/L,产率81.4%。
CN107312806A公开了一种酶法生产5-甲基吡嗪-2-羧酸的方法,所述的酶为醛脱氢酶或醛脱氢酶的N端或/和C端连接标签得到的融合蛋白,以5-甲基-2-吡嗪醛为反应底物,在上述酶的催化下反应12h,得到产物5-甲基-2-吡嗪羧酸17.02mM。
CN107974428A公开了一种利用重组大肠杆菌转化生产5-甲基吡嗪-2-羧酸的方法,该方法主要是以来源于Pseudomonasputida ATCC 33015所携带的内源性制粒pWWO为模板,构建重组质粒,利用异丙基硫代半乳糖苷(IPTG)诱导的重组大肠杆菌表达二甲苯单加氧酶以及苯甲醇脱氢酶,然后全细胞转化底物2,5-二甲基吡嗪,产量为2.5g/L,摩尔转化率为96.2%。该方法转化用时较短,且摩尔转化率较高,但是该方法产量相对较低。
现有技术也存在以L-苏氨酸为底物催化合成2,5-二甲基吡嗪,例如CN111411067A公开了一种高产2,5-二甲基吡嗪的大肠杆菌重组菌。所述的重组菌以大肠杆菌K-12为宿主,过表达了L-苏氨酸脱氢酶,并异源表达了NADH氧化酶和氨基丙酮氧化酶,同时敲除了重组菌的2-氨基-3-酮丁酸CoA连接酶基因kb1和伯胺氧化酶基因tynA。曹艳丽等,以L-苏氨酸为发酵底物的2,5-二甲基吡嗪高产菌株构建,食品与发酵工业,2020,46(1):1-10.构建了一种以L-苏氨酸为发酵底物的2,5-二甲基吡嗪的生产菌株。通过利用Bacillussubtilis168(B.subtilis 168)外源表达不同微生物种属来源的L-苏氨酸脱氢酶(L-threonine dehydrogenase,TDH),并比较其利用L-苏氨酸为底物合成2,5-DMP的产量,挑选出2,5-DMP高产菌种,在此基础上进一步外源表达NADH氧化酶(NADH oxidase,NOX),以促进辅因子再生。构建了一株高产2,5-DMP的基因工程菌株B.subtilis168/pMA0911-tdh(Ec)-nox。该菌株以5.83g/L的L-苏氨酸为底物,发酵24h后2,5-DMP的产量高达616.04mg/L。
综上,现有技术多是以2,5-二甲基吡嗪为起始底物,由二甲苯单加氧酶、苯甲醇脱氢酶、苯甲醛脱氢酶三步催化后生成5-甲基吡嗪-2-羧酸。各自存在着操作不稳定,产能较低,生产成本高、周期长等缺陷,因此生产工艺简单、绿色环保且高效的5-甲基吡嗪-2-羧酸的生产方法亟待开发。
发明内容
为了解决现有技术的缺陷,本发明的目的在于提供一种高效合成5-甲基吡嗪-2-羧酸工程菌,该菌可以L-苏氨酸为底物发酵生产5-甲基吡嗪-2-羧酸,实现了L-苏氨酸一步转化为5-甲基吡嗪-2-羧酸。
本发明第一方面,提供一种生物催化合成5-甲基吡嗪-2-羧酸的重组工程菌,所述重组工程菌表达了苏氨酸脱氢酶(TDH)、二甲苯单加氧酶(XMO)、苯甲醇脱氢酶(BADH)和苯甲醛脱氢酶(BZDH)。
所述的苏氨酸脱氢酶为Escherichia coli来源的苏氨酸脱氢酶,其氨基酸序列如SEQ ID NO:1所示;核苷酸序列如SEQ ID NO:2所示。
所述的二甲苯单加氧酶为Pseudomonas aeruginosa strain来源的二甲苯单氧合酶,其氨基酸序列如SEQ ID NO:3所示;核苷酸序列如SEQ ID NO:4所示。
所述的苯甲醇脱氢酶为Pseudomonas aeruginosa strain来源的苯甲醇脱氢酶,其氨基酸序列如SEQ ID NO:5所示;核苷酸序列如SEQ ID NO:6所示。
所述的苯甲醛脱氢酶为Pseudomonas aeruginosa strain来源的苯甲醛脱氢酶,其氨基酸序列如SEQ ID NO:7所示;核苷酸序列如SEQ ID NO:8所示。
本发明的第二方面,提供所述重组工程菌的构建方法。
一种生物催化合成5-甲基吡嗪-2-羧酸的重组工程菌的构建方法,包括将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆到表达载体上构建得到重组表达质粒,然后将构建的表达质粒转入宿主细胞中进行表达。
优选地,所述重组工程菌的表达载体选自pRSFDuet-1、pET-28a、pETDuet-1、pCDFDuet-1中的一种或几种;在一个实施方案中,所述载体为pRSFDuet-1;在另一种实施方案中,所述载体为pET-28a及pETDuet-1;在另一种实施方案中,所述载体为pET-28a及pCDFDuet-1;在另一种实施方案中,所述载体为pET-28a、pETDuet-1及pCDFDuet-1;在另一种实施方案中,所述载体为pETDuet-1;在另一种实施方案中,所述载体为pETDuet-1和pET-28a。
所述的表达是将相应酶的基因连接到表达载体的重组表达质粒上,然后将表达质粒转入宿主细胞进行表达。
优选地,所述重组工程菌是以大肠杆菌BL21(DE3)为宿主构建得到的。
在一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pRSFDuet-1上得到重组表达质粒pRSFDuet-1-tdhxmobadhbzdh,具体操作是将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo克隆在pRSFDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆在pRSFDuet-1第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh和二甲苯单氧合酶xmo基因序列、苯甲醇脱氢酶badh和苯甲醛脱氢酶bzdh的基因序列,测序后分两步利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pRSFDuet-1进行连接,构建表达载体pRSFDuet-1-tdhxmobadhbzdh;然后将表达质粒pRSFDuet-1-tdhxmobadhbzdh转入大肠杆菌BL21(DE3)中进行表达。
在一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh连接到表达载体pET28a上得到重组表达质粒pET28a-tdh,二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pETDuet-1上得到重组表达质粒pETDuet-1-xmobadhbzdh,具体操作是将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo克隆在pETDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh、二甲苯单氧合酶xmo基因序列、苯甲醇脱氢酶badh和苯甲醛脱氢酶bzdh的基因序列,测序后分别利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a、pETDuet-1进行连接,构建表达载体pET28a-tdh和pETDuet-1-xmobadhbzdh;然后将表达质粒pET28a-tdh和pETDuet-1-xmobadhbzdh转入大肠杆菌BL21(DE3)中进行表达。
在一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh连接到表达载体pET28a上得到重组表达质粒pET28a-tdh,二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pCDFDuet-1上得到重组表达质粒pCDFDuet-1-xmobadhbzdh,具体操作是将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo和苯甲醇脱氢酶基因badh克隆到pCDFDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,苯甲醛脱氢酶基因bzdh克隆在pCDFDuet-1的第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh、二甲苯单氧合酶xmo基因序列和苯甲醇脱氢酶badh、苯甲醛脱氢酶bzdh的基因序列,测序后分别利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a、pCDFDuet-1进行连接,构建表达载体pET28a-tdh和pCDFDuet-1-xmobadhbzdh;然后将表达质粒pET28a-tdh和pCDFDuet-1-xmobadhbzdh转入大肠杆菌BL21(DE3)中进行表达。
在一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh连接到表达载体pET28a上得到重组表达质粒pET28a-tdh,将二甲苯单氧合酶基因xmo连接到表达载体pETDuet-1上得到重组表达质粒pETDuet-1-xmo,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pCDFDuet-1上得到重组表达质粒pCDFDuet-1-badhbzdh,具体操作是将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo克隆在pETDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,将苯甲醇脱氢酶基因badh克隆到pCDFDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,苯甲醛脱氢酶基因bzdh克隆在pCDFDuet-1第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh、二甲苯单氧合酶xmo基因序列、苯甲醇脱氢酶badh、苯甲醛脱氢酶bzdh的基因序列,测序后分别利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a、pETDuet-1、pCDFDuet-1进行连接,构建表达载体pET28a-tdh、pETDuet-1-xmo和pCDFDuet-1-badhbzdh。然后将表达质粒pET28a-tdh、pETDuet-1-xmo和pCDFDuet-1-badhbzdh转入大肠杆菌BL21(DE3)中进行表达。
在一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh连接到表达载体pET28a上得到重组表达质粒pET28a-tdh,将二甲苯单氧合酶基因xmo连接到表达载体pCDFDuet-1上得到重组表达质粒pCDFDuet-1-xmo,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pETDuet-1上得到重组表达质粒pETDuet-1-badhbzdh,具体操作是将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo克隆在pCDFDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,将苯甲醇脱氢酶基因badh克隆到pETDuet-1第一个多克隆位点的SacI和NotI的酶切位点,苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh、二甲苯单氧合酶xmo基因序列、苯甲醇脱氢酶badh、苯甲醛脱氢酶bzdh的基因序列,测序后分别利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a、pETDuet-1、pCDFDuet-1进行连接,构建表达载体pET28a-tdh、pCDFDuet-1-xmo和pETDuet-1-badhbzdh。然后将表达质粒pET28a-tdh、pCDFDuet-1-xmo和pETDuet-1-badhbzdh转入大肠杆菌BL21(DE3)中进行表达。
在一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pETDuet-1上得到重组表达质粒pETDuet-1-tdhxmobadhbzdh,具体操作是将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo克隆在pETDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh和二甲苯单氧合酶xmo基因序列、苯甲醇脱氢酶badh和苯甲醛脱氢酶bzdh的基因序列,测序后分别利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pETDuet-1进行连接,构建表达载体pETDuet-1-tdhxmobadhbzdh。然后将表达质粒pETDuet-1-tdhxmobadhbzdh转入大肠杆菌BL21(DE3)中进行表达。
在另一种实施方式中,所述表达,是将苏氨酸脱氢酶基因tdh连接到表达载体pET28a上得到重组表达质粒pET28a-tdh,二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接到表达载体pETDuet-1上得到重组表达质粒pETDuet-1-xmobadhbzdh,具体操作是将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo和苯甲醇脱氢酶基因badh克隆到pETDuet-1第一个多克隆位点的酶切位点SacI和NotI之间,将苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的酶切位点NdeI和XhoI之间。分别在引物的引发下,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh、二甲苯单氧合酶xmo基因序列和苯甲醇脱氢酶badh的连续基因、苯甲醛脱氢酶bzdh的基因序列,测序后分别利用SacI和NotI、NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a和pETDuet-1进行连接,构建表达载体pET28a-tdh和pETDuet-1-xmobadhbzdh。然后将表达质粒pET28a-tdh和pETDuet-1-xmobadhbzdh转入大肠杆菌BL21(DE3)中进行表达。
本发明的第三方面,提供一种合成5-甲基吡嗪-2-羧酸的方法,所述方法是利用L-苏氨酸为底物,通过苏氨酸脱氢酶,二甲苯单加氧酶、苯甲醇脱氢酶、苯甲醛脱氢酶为催化剂催化反应制备5-甲基吡嗪-2-羧酸。
一种合成5-甲基吡嗪-2-羧酸的方法,利用本发明所述的重组工程菌为出发菌株,在含有L-苏氨酸的培养基中进行发酵生产得到5-甲基吡嗪-2-羧酸。
优选地,所述的发酵是以L-苏氨酸为唯一底物进行发酵生产。
发酵所用的培养基包括碳源、氮源、无机盐等;本发明不特别限定发酵培养基,只要有利于菌种生长即可,本领域技术人员可选择市售培养基或根据需要自行配置。在一个实施方式中,所述的发酵过程所用的培养基为蛋白胨1.2%、酵母浸粉2.4%、K2HPO417mM、KH2PO472mM、甘油0.5%,消泡剂SAG6300.001%。
优选地,底物L-苏氨酸的浓度为6-12g/L,优选为8-12g/L,进一步优选为10g/L。
优选地,所述的反应时间为8-12h,优选为8h。
优选地,所述的反应温度为22-30℃,优选为28℃。
优选地,所述的反应体系pH为6.5-7.5,优选pH为7.0~7.2。
下面将进一步详述一种高效合成5-甲基吡嗪-2-羧酸的方法,将表达苏氨酸脱氢酶、二甲苯单加氧酶、苯甲醇脱氢酶和苯甲醛脱氢酶的重组工程菌接种逐级扩大培养,按发酵液体积1~10%的接种量,并接种至装有培养基的发酵罐中,37±0.5℃下培养,控制pH6.5~7.5,待OD600达到12或以上时,加入终浓度为20-200μg/ml的诱导剂IPTG,22~30℃下培养3~5h,流加底物L-苏氨酸2~5h至终浓度为6-12g/L,继续发酵培养8~12h,反应结束。
更进一步地,所述加入诱导剂IPTG的终浓度为20~28μg/ml,优选24μg/ml。
更进一步地,所述的诱导温度为22~30℃,优选为28℃。
与现有技术相比,本发明取得了如下的有益效果:
本发明通过构建同时表达苏氨酸脱氢酶、二甲苯单加氧酶、苯甲醇脱氢酶和苯甲醛脱氢酶的重组工程菌,实现了以L-苏氨酸为底物一步高效合成5-甲基吡嗪-2-羧酸,摩尔转化率高达85%以上,本发明的方法工艺简单,生产周期短,底物转化率高,产物易于从反应液中分离纯化,工艺污染小,能耗低,易于放大生产。
附图说明
图1:表达载体pRSFDuet-1-tdhxmobadhbzdh质粒构建图;
图2:表达载体pET28a-tdh质粒构建图;
图3:表达载体pETDuet-1-xmobadhbzdh质粒构建图;
图4:表达载体pCDFDuet-1-xmobadhbzdh质粒构建图;
图5:表达载体pETDuet-1-xmo质粒构建图;
图6:表达载体pCDFDuet-1-badhbzdh质粒构建图;
图7:表达载体pCDFDuet-1-xmo质粒构建图;
图8:表达载体pETDuet-1-badhbzdh质粒构建图;
图9:表达载体pETDuet-1-tdhxmobadhbzdh质粒构建图;
图10:表达载体pETDuet-1-xmobadhbzdh-1质粒构建图;
图11:不同构建方式的菌种对底物转化率的影响;
图12:不同诱导剂浓度对底物转化率的影响;
图13:不同诱导温度对底物转化率的影响;
图14:不同诱导时间对底物转化率的影响;
图15:不同底物浓度对底物转化率的影响;
图16:反应时间对产物浓度的影响;
具体实施方式
下面结合具体实施例来对本发明的技术方案进行描述,应理解的是,以下实施例仅用于说明本发明而非用于限制本发明的保护范围。
以下实施例中所用到的试剂、材料等如无特殊说明,均可通过商业途径获得。实验中未介绍到的详细的实验条件、参数等均为本领域常规技术,基因克隆操作可参考J.萨姆布鲁克等编写的《分子克隆实验指南》。
本发明实施例中基因工程操作所用的一步克隆试剂盒均购自Vazyme,南京维诺赞生物科技有限公司;质粒提取试剂盒、DNA回收纯化试剂盒购自Axygen杭州有限公司;E.coli BL21(DE3)、质粒等购自上海生工;DNAmarker、FastPfu DNA聚合酶、低分子量标准蛋白、琼脂糖电泳试剂、引物合成与基因测序工作由上海生工有限公司完成。以上试剂使用方法参考商品说明书。
本发明实施例采用高效液相色谱法对5-甲基吡嗪-2-羧酸进行定量检测,具体条件如下:
色谱柱:YMC-Triart C18,5μm,4.6×250mm;
柱温:25℃;
流动相:甲醇-0.01mol/L四丁基氢氧化铵溶液(体积比15:85)(磷酸调至pH6.0);
流速:1.0ml/min;
检测波长:264nm;
进样量:20μl。
本发明实施例所涉及的基因:
苏氨酸脱氢酶为Escherichia coli来源的苏氨酸脱氢酶,其氨基酸序列如SEQ IDNO:1所示;核苷酸序列如SEQ ID NO:2所示;
二甲苯单加氧酶为Pseudomonas aeruginosa strain来源的二甲苯单氧合酶,其序列如SEQ ID NO:3所示;核苷酸序列如SEQ ID NO:4所示;
苯甲醇脱氢酶为Pseudomonas aeruginosa strain来源的苯甲醇脱氢酶,其氨基酸序列如SEQ ID NO:5;核苷酸序列如SEQ ID NO:6所示;
苯甲醛脱氢酶为Pseudomonas aeruginosa strain来源的苯甲醛脱氢酶,其氨基酸序列如SEQ ID NO:7;核苷酸序列如SEQ ID NO:8所示;
相关引物说明:
引物1:GAGCTCATGAAAGCGTTATCCAAACTGA;
引物2:GCGGCCGCTCAAATGCTAGCCACCCG;
引物3:CATATGATGGAAATCAAAGCAGCAAT;
引物4:CTCGAGTCAAAATGGGTAATTAGCTG。
引物5:GCGGCCGCTTAATCCCAGCTCAGAATAA;
引物6:GAGCTCATGGACACGCTTCGTTATTA;
引物7:GCGGCCGCTCAACCAATCCGGAGTACCG;
引物8:CATATGATGCGGGAAACAAAAGAGCA;
引物9:GAGCTCATGGAAATCAAAGCAGCAAT。
实施例1重组大肠杆菌的构建
(1)感受态细胞的制备
从-80℃冰箱中获得甘油管保藏的E.coli BL21(DE3)菌株,在无抗LB平板上划线,37℃培养10h,获取单菌落;挑取LB平板的单菌落,接种至含5ml的LB培养基的试管中,37℃、180rpm培养9h;从试管中取200μl菌液,接种到50μl的LB培养基中,37℃、180rpm培养OD600至0.4-0.6;将菌液在冰上预冷,取菌液至灭菌的离心管中,冰上放置10min,4℃、5000rpm离心10min;将上清液倒出,注意防止染菌,用预冷的0.1mol/L的CaCl2水溶液重悬沉淀细胞,并在冰上放置30min;4℃、5000rpm离心10min,弃上清,用预冷的含15%甘油的0.1mol/L的CaCl2水溶液重悬沉淀细胞,取100μl重悬细胞分装至灭菌的1.5ml离心管中,保藏于-80℃冰箱,备用。
(2)以筛选到的Escherichia coli、Pseudomonas aeruginosa strain菌株基因组为模板,通过PCR获得苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh,并通过一步克隆技术将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh连接在一起,形成完整的基因序列并构建到pET28a质粒上(位于SacI和NotI酶切位点之间),获得质粒pET28a-tdhxmobadhbzdh。
(3)质粒构建
质粒构建1:将苏氨酸脱氢酶基因tdh和二甲苯单氧合酶基因xmo完整序列克隆在pRSFDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆在pRSFDuet-1第二个多克隆位点的NdeI和XhoI酶切位点之间。在引物(1和2)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh和二甲苯单氧合酶xmo的连续基因序列,测序后利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(3和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苯甲醇脱氢酶badh和苯甲醛脱氢酶bzdh的连续基因序列,测序后NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将两片段同用相同的限制性内切酶处理的载体pRSFDuet-1进行连接,构建得到表达载体pRSFDuet-1-tdhxmobadhbzdh,如图1所示。
质粒构建2:将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo克隆在pETDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的NdeI和XhoI酶切位点之间。在引物(1和5)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh基因序列,测序后用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,测序后利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a进行连接,构建得到表达载体pET28a-tdh。在引物(6和2)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得二甲苯单氧合酶xmo基因序列,测序后利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(3和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苯甲醇脱氢酶badh和苯甲醛脱氢酶bzdh的连续基因序列,测序后利用NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将两片段同用相同的限制性内切酶处理的载体pETDuet-1进行连接,构建得到表达载体pETDuet-1-xmobadhbzdh。表达载体pET28a-tdh如图2所示,表达载体pETDuet-1-xmobadhbzdh如图3所示。
质粒构建3:将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh克隆到pCDFDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,苯甲醛脱氢酶基因bzdh克隆在pCDFDuet-1第二个多克隆位点的NdeI和XhoI的酶切位点之间。在引物(1和5)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh,测序后用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,测序后利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a进行连接,构建得到表达载体pET28a-tdh。在引物(6和7)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得二甲苯单氧合酶xmo和苯甲醇脱氢酶badh基因连续序列,测序后利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(8和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苯甲醛脱氢酶bzdh的基因序列,测序后分别利用NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pCDFDuet-1进行连接,构建得到表达载体pCDFDuet-1-xmobadhbzdh。表达载体pET28a-tdh如图2所示,表达载体pCDFDuet-1-xmobadhbzdh如图4所示。
质粒构建4:将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo克隆在pETDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,将苯甲醇脱氢酶基因badh克隆到pCDFDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,苯甲醛脱氢酶基因bzdh克隆在pCDFDuet-1第二个多克隆位点的NdeI和XhoI酶切位点之间。在引物(1和5)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh基因,测序后用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,再利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a进行连接,构建得到表达载体pET28a-tdh。在引物(6和2)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真PfuDNA聚合酶进行扩增,获得二甲苯单氧合酶xmo基因序列,利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,再利用T4DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pETDuet-1进行连接,构建得到表达载体pETDuet-1-xmo。在引物(9和7)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真PfuDNA聚合酶进行扩增,获得苯甲醇脱氢酶badh基因序列,利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(8和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真PfuDNA聚合酶进行扩增,获得苯甲醛脱氢酶bzdh基因序列,利用NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,再利用T4 DNA连接酶(TaKaRa)将两片段同用相同的限制性内切酶处理的载体pCDFDuet-1进行连接,构建得到表达载体pCDFDuet-1-badhbzdh。表达载体pET28a-tdh如图2所示,表达载体pETDuet-1-xmo如图5所示,表达载体pCDFDuet-1-badhbzdh如图6所示。
质粒构建5:将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo克隆在pCDFDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,将苯甲醇脱氢酶基因badh克隆到pETDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的NdeI和XhoI酶切位点之间。在引物(1和5)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh,测序后用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,再利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a进行连接,构建得到表达载体pET28a-tdh。在引物(6和2)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真PfuDNA聚合酶进行扩增,获得二甲苯单氧合酶xmo基因序列,利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,再利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pCDFDuet-1进行连接,构建得到表达载体pCDFDuet-1-xmo。在引物(9和7)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真PfuDNA聚合酶进行扩增,获得苯甲醇脱氢酶badh基因序列,利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(8和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真PfuDNA聚合酶进行扩增,获得苯甲醛脱氢酶bzdh基因序列,利用NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,再利用T4 DNA连接酶(TaKaRa)将两片段同用相同的限制性内切酶处理的载体pETDuet-1进行连接,构建得到表达载体pETDuet-1-badhbzdh。表达载体pET28a-tdh如图2所示,表达载体pCDFDuet-1-xmo如图7所示,表达载体pETDuet-1-badhbzdh如图8所示。
质粒构建6:将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo克隆在pETDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,将苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的NdeI和XhoI酶切位点之间。在引物(1和2)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh和二甲苯单氧合酶xmo的连续基因序列,测序后利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(3和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,苯甲醇脱氢酶badh和苯甲醛脱氢酶bzdh的连续基因序列,测序后NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4 DNA连接酶(TaKaRa)将两片段同用相同的限制性内切酶处理的载体pETDuet-1进行连接,构建得到表达载体pETDuet-1-tdhxmobadhbzdh,表达载体pETDuet-1-tdhxmobadhbzdh如图9所示。
质粒构建7:将苏氨酸脱氢酶基因tdh克隆到pET28a的酶切位点SacI和NotI之间,将二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh克隆到pETDuet-1第一个多克隆位点的SacI和NotI酶切位点之间,苯甲醛脱氢酶基因bzdh克隆在pETDuet-1第二个多克隆位点的NdeI和XhoI酶切位点之间。在引物(1和5)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得苏氨酸脱氢酶tdh,测序后用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理,测序后利用T4 DNA连接酶(TaKaRa)将该片段同用相同的限制性内切酶处理的载体pET28a进行连接,构建得到表达载体pET28a-tdh。在引物(6和7)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,获得二甲苯单氧合酶xmo和苯甲醇脱氢酶badh连续基因序列,测序后利用SacI和NotI限制性内切酶(TaKaRa)对扩增片段进行处理;在引物(8和4)的引发下,以pET28a-tdhxmobadhbzdh质粒为模板,利用高保真Pfu DNA聚合酶进行扩增,和苯甲醛脱氢酶bzdh的基因序列,测序后分别利用NdeI和XhoI限制性内切酶(TaKaRa)对扩增片段进行处理,并分别利用T4DNA连接酶(TaKaRa)将两片段同用相同的限制性内切酶处理的载体pETDuet-1进行连接,构建得到表达载体pETDuet-1-xmobadhbzdh-1。表达载体pET28a-tdh如图2所示,表达载体pETDuet-1-xmobadhbzdh-1如图10所示。
(4)共表达tdh、xmo、badh和bzdh的重组大肠杆菌构建
首先将储藏于-80℃的大肠杆菌BL21(DE3)(Invitrogen)感受态细胞在0℃冰浴10min,然后在超净台内分别加入5μl的酶连产物体系,0℃冰浴30min,42℃水浴中热击90s,0℃冰浴2min,加入600μl的LB培养基,在37℃、200rpm摇床培养1h;分别涂布于含有50μg/ml含相应抗性(pET28a为卡那霉素抗性,pRSFduet-1为卡那霉素抗性,pCDFduet-1为链霉素抗性,pETduet-1为氨苄青霉素抗性)的LB平板,37℃下培养8-12h,每个平板随机挑取10个克隆抽提质粒进行测序鉴定,分别进行筛选,各获得1株含有步骤(3)中每种表达重组质粒的重组大肠杆菌。由步骤(3)中质粒构建1-7所得重组大肠杆菌分别依次编号为1-7号菌株。
(5)菌种库的筛选
种子瓶:将上述7株重组大肠杆菌菌种分别接种至种子瓶(培养基组分:蛋白胨1.2%、酵母浸粉2.4%、K2HPO417mM、KH2PO472mM、甘油0.5%)中,在37℃下培养12h。
筛选体系:将上述种子瓶分别接种1%至发酵瓶(培养基组分:蛋白胨1.2%、酵母浸粉2.4%、K2HPO417mM、KH2PO472mM、甘油0.5%,L-苏氨酸0.6%)中,在37℃下培养,pH控制7.0,待OD600达到12时加入诱导剂IPTG,IPTG终浓度为24μg/ml,降温至28℃诱导培养10h,反应结束,HPLC检测产物5-甲基吡嗪-2羧酸的含量,检测结果如图11所示。从图11结果可知,7号菌株BL21(DE3)-AXMS7摩尔转化率最高。
实施例2重组大肠杆菌BL21(DE3)-AXMS7发酵和催化条件的建立
发酵培养基组分:蛋白胨1.2%、酵母浸粉2.4%、K2HPO417mM、KH2PO472mM、甘油0.5%,消泡剂SAG6300.001%。
补料培养基成分:蛋白胨1.5%,酵母浸粉1%,甘油3%。
底物L-苏氨酸溶液:浓度500g/L。
将重组大肠杆菌BL21(DE3)-AXMS7接种至同时含有50μg/ml卡那霉素和氨苄霉素抗性的30ml的一级种子瓶培养基中,37℃,220rpm下培养8h。再以体积浓度1%接种量接种至同时含有50μg/ml卡那霉素和氨苄霉素抗性的150ml的二级种子瓶培养基中,于37℃,220rpm下培养10h,以3%接种量接种至含有15L培养基的发酵罐中。
(1)诱导剂浓度的筛选
在发酵罐培养过程中,在37℃、pH7.0下培养,待OD600达到12时,降温到28℃,加入诱导剂IPTG,IPTG终浓度为20、22、24、26、28μg/ml,溶氧DO迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,在28℃下诱导培养3h后,开始流加底物L-苏氨酸至终浓度为10g/L,流加速度为100ml/h,流加完毕后继续28℃发酵培养8h,反应结束,HPLC检测产物5-甲基吡嗪-2羧酸的含量,结果如图12所示。从图12可以看出,诱导剂浓度为20、22、24、26、28μg/ml,产物转化率相当,诱导剂浓度为24μg/ml时产物转化率略高。
(2)诱导温度的筛选
在发酵罐培养过程中,在37℃、pH7.0下培养,待OD600达到12时,分别降温至22、25、28、30℃下,加入终浓度为24μg/ml的诱导剂IPTG,溶氧迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,分别培养3h后,开始流加底物L-苏氨酸至终浓度为10g/L,流加速度为100ml/h,继续分别在22、25、28、30℃下培养8h后结束,HPLC检测产物5-甲基吡嗪-2羧酸的含量,结果如图13所示。从图13可以看出,诱导温度为28℃时,产物转化率较高。
(3)诱导时间的筛选
在发酵罐培养过程中,37℃、pH7.2条件下培养,待OD600达到12时,降温至28℃加入终浓度为24μg/ml的诱导剂IPTG,溶氧迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,分别在诱导3、4、5、6小时后开始流加底物L-苏氨酸至终浓度为10g/L,流加速度为100ml/h,继续在28℃下发酵培养8h后反应结束HPLC检测产物5-甲基吡嗪-2-羧酸的含量,结果如图14所示。从图14可以看出,诱导时间对转化率的影响较小。
(4)底物浓度的筛选
在发酵罐培养过程中,37℃、pH7.2条件下培养,待OD600达到12时,降温至28℃加入终浓度为24μg/ml的诱导剂IPTG,溶氧迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,分别在诱导3小时后开始流加底物,流加速度为100ml/h,分别流加底物L-苏氨酸终浓度为4、6、8、10、12g/L,继续在28℃下发酵培养8h后反应结束,HPLC检测产物5-甲基吡嗪-2-羧酸的含量,结果如图15所示。从图15可以看出,随着底物浓度的增大,转化率有下降趋势,为了高浓度的产物5-甲基吡嗪-2羧酸,优选底物L-苏氨酸的浓度为10g/L。
(5)反应时间的筛选
在发酵罐培养过程中,37℃、pH7.2条件下培养,待OD600达到12时,降温至28℃加入终浓度为24μg/ml的诱导剂IPTG,溶氧迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,在诱导3小时后开始流加底物L-苏氨酸至终浓度为10g/L,流加速度为100ml/h,继续在28℃下发酵培养12h,每小时取样通过HPLC检测产物5-甲基吡嗪-2-羧酸的含量,结果如图16所示。从图16可知,随着反应时间的延长,产物浓度逐渐增大,但达到8h以后产物浓度增加极其缓慢,因此,优选反应时间为8h。
实施例3 5-甲基吡嗪-2羧酸的合成
发酵培养基组分:蛋白胨1.2%、酵母浸粉2.4%、K2HPO417mM、KH2PO472mM、甘油0.5%,消泡剂SAG6300.001%。
补料培养基成分:蛋白胨1.5%,酵母浸粉1%,甘油3%。
底物L-苏氨酸溶液:浓度500g/L。
将重组大肠杆菌BL21(DE3)-AXMS7接种至同时含有50μg/ml卡那霉素和氨苄霉素抗性的30ml的一级种子瓶培养基中,37℃,220rpm下培养8h。再以体积浓度1%接种量接种至同时含有50μg/ml卡那霉素和氨苄霉素抗性的150ml的二级种子瓶培养基中,于37℃,220rpm下培养10h,以3%接种量接种至含有15L培养基的发酵罐中。待OD600达到12时,降温至28℃加入终浓度为24μg/ml的诱导剂IPTG,溶氧迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,在诱导3小时后开始流加底物L-苏氨酸至终浓度为10g/L,流加速度为100ml/h,继续在28℃下发酵培养8h后反应结束,HPLC检测产物5-甲基吡嗪-2-羧酸的含量,达到9.97g/L,转化率为85.99%。
实施例4 5-甲基吡嗪-2羧酸的合成
发酵培养基组分调整:蛋白胨1.0%、酵母浸粉2.5%、K2HPO417mM、KH2PO472mM、甘油0.8%,消泡剂SAG6300.001%。
补料培养基成分:蛋白胨1.5%,酵母浸粉1%,甘油3%。
底物L-苏氨酸溶液:浓度500g/L。
将重组大肠杆菌BL21(DE3)-AXMS7接种至同时含有50μg/ml卡那霉素和氨苄霉素抗性的30ml的一级种子瓶培养基中,37℃,220rpm下培养8h。再以体积浓度1%接种量接种至同时含有50μg/ml卡那霉素和氨苄霉素抗性的150ml的二级种子瓶培养基中,于37℃,220rpm下培养10h,以3%接种量接种至含有15L培养基的发酵罐中。待OD600达到12时,降温至28℃加入终浓度为24μg/ml的诱导剂IPTG,溶氧迅速上升时补加补料培养基,通过补料维持DO在30%~40%之间,在诱导3小时后开始流加底物L-苏氨酸至终浓度为10g/L,流加速度为150ml/h,继续在28℃下发酵培养8h后反应结束,HPLC检测产物5-甲基吡嗪-2-羧酸的含量,达到9.86g/L,转化率为85.04%。
序列表
<110> 山东新时代药业有限公司
<120> 一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用
<130> 2021
<160> 8
<170> SIPOSequenceListing 1.0
<210> 1
<211> 341
<212> PRT
<213> 未知(Unknown)
<400> 1
Met Lys Ala Leu Ser Lys Leu Lys Ala Glu Glu Gly Ile Trp Met Thr
1 5 10 15
Asp Val Pro Val Pro Glu Leu Gly His Asn Asp Leu Leu Ile Lys Ile
20 25 30
Arg Lys Thr Ala Ile Cys Gly Thr Asp Val His Ile Tyr Asn Trp Asp
35 40 45
Glu Trp Ser Gln Lys Thr Ile Pro Val Pro Met Val Val Gly His Glu
50 55 60
Tyr Val Gly Glu Val Val Gly Ile Gly Gln Glu Val Lys Gly Phe Lys
65 70 75 80
Ile Gly Asp Arg Val Ser Gly Glu Gly His Ile Thr Cys Gly His Cys
85 90 95
Arg Asn Cys Arg Gly Gly Arg Thr His Leu Cys Arg Asn Thr Ile Gly
100 105 110
Val Gly Val Asn Arg Pro Gly Cys Phe Ala Glu Tyr Leu Val Ile Pro
115 120 125
Ala Phe Asn Ala Phe Lys Ile Pro Asp Asn Ile Ser Asp Asp Leu Ala
130 135 140
Ala Ile Phe Asp Pro Phe Gly Asn Ala Val His Thr Ala Leu Ser Phe
145 150 155 160
Asp Leu Val Gly Glu Asp Val Leu Val Ser Gly Ala Gly Pro Ile Gly
165 170 175
Ile Met Ala Ala Ala Val Ala Lys His Val Gly Ala Arg Asn Val Val
180 185 190
Ile Thr Asp Val Asn Glu Tyr Arg Leu Glu Leu Ala Arg Lys Met Gly
195 200 205
Ile Thr Arg Ala Val Asn Val Ala Lys Glu Asn Leu Asn Asp Val Met
210 215 220
Ala Glu Leu Gly Met Thr Glu Gly Phe Asp Val Gly Leu Glu Met Ser
225 230 235 240
Gly Ala Pro Pro Ala Phe Arg Thr Met Leu Asp Thr Met Asn His Gly
245 250 255
Gly Arg Ile Ala Met Leu Gly Ile Pro Pro Ser Asp Met Ser Ile Asp
260 265 270
Trp Thr Lys Val Ile Phe Lys Gly Leu Phe Ile Lys Gly Ile Tyr Gly
275 280 285
Arg Glu Met Phe Glu Thr Trp Tyr Lys Met Ala Ala Leu Ile Gln Ser
290 295 300
Gly Leu Asp Leu Ser Pro Ile Ile Thr His Arg Phe Ser Ile Asp Asp
305 310 315 320
Phe Gln Lys Gly Phe Asp Ala Met Arg Ser Gly Gln Ser Gly Lys Val
325 330 335
Ile Leu Ser Trp Asp
340
<210> 2
<211> 1026
<212> DNA
<213> 未知(Unknown)
<400> 2
atgaaagcgt tatccaaact gaaagcggaa gagggcatct ggatgaccga cgttcctgta 60
ccggaactcg ggcataacga tctgctgatt aaaatccgta aaacagccat ctgcgggact 120
gacgttcaca tctataactg ggatgagtgg tcgcaaaaaa ccatcccggt gccgatggtc 180
gtgggccatg aatatgtcgg tgaagtggta ggtattggtc aggaagtgaa aggcttcaag 240
atcggcgatc gcgtttctgg cgaaggccat atcacctgtg gtcattgccg caactgtcgt 300
ggtggtcgta cccatttgtg ccgcaacacg ataggcgttg gtgttaatcg cccgggctgc 360
tttgccgaat atctggtgat cccggcattc aacgccttca aaatccccga caatatttcc 420
gatgacttag ccgcaatttt tgatcccttc ggtaacgccg tgcataccgc gctgtcgttt 480
gatctggtgg gcgaagatgt gctggtttct ggtgcaggcc cgattggtat tatggcagcg 540
gcggtggcga aacacgttgg tgcacgcaat gtggtgatca ctgatgttaa cgaataccgc 600
cttgagctgg cgcgtaaaat gggtatcacc cgtgcggtta acgtcgccaa agaaaatctc 660
aatgacgtga tggcggagtt aggcatgacc gaaggttttg atgtcggtct ggaaatgtcc 720
ggtgcgccgc cagcgtttcg taccatgctt gacaccatga atcacggcgg ccgtattgcg 780
atgctgggta ttccgccgtc tgatatgtct atcgactgga ccaaagtgat ctttaaaggc 840
ttgttcatta aaggtattta cggtcgtgag atgtttgaaa cctggtacaa gatggcggcg 900
ctgattcagt ctggcctcga tctttcgccg atcattaccc atcgtttctc tatcgatgat 960
ttccagaagg gctttgacgc tatgcgttcg ggccagtccg ggaaagttat tctgagctgg 1020
gattaa 1026
<210> 3
<211> 369
<212> PRT
<213> 未知(Unknown)
<400> 3
Met Asp Thr Leu Arg Tyr Tyr Leu Ile Pro Val Val Thr Ala Cys Gly
1 5 10 15
Leu Ile Gly Phe Tyr Tyr Gly Gly Tyr Trp Val Trp Leu Gly Ala Ala
20 25 30
Thr Phe Pro Ala Leu Met Val Leu Asp Val Ile Leu Pro Lys Asp Phe
35 40 45
Ser Ala Arg Lys Val Ser Pro Phe Phe Ala Asp Leu Thr Gln Tyr Leu
50 55 60
Gln Leu Pro Leu Met Ile Gly Leu Tyr Gly Leu Leu Val Phe Gly Val
65 70 75 80
Glu Asn Gly Arg Ile Glu Leu Ser Glu Pro Leu Gln Val Ala Gly Cys
85 90 95
Ile Leu Ser Leu Ala Trp Leu Ser Gly Val Pro Thr Leu Pro Val Ser
100 105 110
His Glu Leu Met His Arg Arg His Trp Leu Pro Arg Lys Met Ala Gln
115 120 125
Leu Leu Ala Met Phe Tyr Gly Asp Pro Asn Arg Asp Ile Ala His Val
130 135 140
Asn Thr His His Leu Tyr Leu Asp Thr Pro Leu Asp Ser Asp Thr Pro
145 150 155 160
Tyr Arg Gly Gln Thr Ile Tyr Ser Phe Val Ile Ser Ala Thr Val Gly
165 170 175
Ser Val Lys Asp Ala Ile Lys Ile Glu Ala Glu Thr Leu Arg Arg Lys
180 185 190
Gly Gln Ser Pro Trp Asn Leu Ser Asn Lys Thr Tyr Gln Tyr Val Ala
195 200 205
Leu Leu Leu Ala Leu Pro Gly Leu Val Ser Tyr Leu Gly Gly Pro Ala
210 215 220
Leu Gly Leu Val Thr Ile Ala Ser Met Ile Ile Ala Lys Gly Ile Val
225 230 235 240
Glu Gly Phe Asn Tyr Phe Gln His Tyr Gly Leu Val Arg Asp Leu Asp
245 250 255
Gln Pro Ile Leu Leu His His Ala Trp Asn His Met Gly Thr Ile Val
260 265 270
Arg Pro Leu Gly Cys Glu Ile Thr Asn His Ile Asn His His Ile Asp
275 280 285
Gly Tyr Thr Arg Phe Tyr Glu Leu Arg Pro Glu Lys Glu Ala Pro Gln
290 295 300
Met Pro Ser Leu Phe Val Cys Phe Leu Leu Gly Leu Ile Pro Pro Leu
305 310 315 320
Trp Phe Ala Leu Ile Ala Lys Pro Lys Leu Arg Asp Trp Asp Gln Arg
325 330 335
Tyr Ala Thr Pro Gly Glu Arg Glu Leu Ala Met Ala Ala Asn Lys Lys
340 345 350
Ala Gly Trp Pro Leu Trp Cys Glu Ser Glu Leu Gly Arg Val Ala Ser
355 360 365
Ile
<210> 4
<211> 1110
<212> DNA
<213> 未知(Unknown)
<400> 4
atggacacgc ttcgttatta cctgattcct gttgttactg cttgcgggct gatcggattt 60
tactatggtg gctattgggt ttggcttggg gcggcaacat tccctgcact gatggtgctt 120
gatgtcattt taccgaagga tttttcggcc agaaaggtaa gtcccttttt cgcagacctt 180
acccagtatt tgcagttacc attaatgatc ggtctatatg ggctccttgt cttcggagtt 240
gaaaacgggc gtatcgaact tagtgagccg ttacaagtgg cagggtgcat tctttctttg 300
gcttggctta gtggtgtgcc aactcttccg gtttcgcatg agttgatgca tcgtcgccac 360
tggttgcctc ggaaaatggc gcagctattg gctatgtttt atggtgatcc gaaccgagac 420
attgcccatg tcaacacgca tcacctttac ttagatacgc ctctcgatag cgatactccg 480
taccgtggtc agacaattta cagtttcgtg atcagtgcga cagttggttc cgtcaaagat 540
gcgataaaga ttgaggctga aactttacgt agaaaaggac agtcaccgtg gaatttgtcc 600
aacaaaacat atcaatatgt cgcacttctg ctcgctctgc ctggcctggt ttcttatctg 660
ggcgggccag cattagggtt ggttacgatt gcttcgatga ttattgcgaa agggatagtc 720
gagggtttta attactttca gcactatggt ttagtacgcg atttagatca gcctatcctc 780
ctgcaccacg cgtggaatca tatgggaaca attgtgcgcc cgctgggttg cgaaattact 840
aaccatatca atcatcatat tgacggctat acacggttct atgagttgcg tccggaaaaa 900
gaagccccgc agatgccttc gctctttgtg tgtttccttc tagggcttat tccgcctctt 960
tggttcgctc tcattgcaaa accaaagttg agagactggg accagcggta cgcaactcca 1020
ggtgagcgcg aactggctat ggctgcaaat aaaaaagcgg gatggccact gtggtgtgaa 1080
agtgaactgg gtcgggtggc tagcatttga 1110
<210> 5
<211> 366
<212> PRT
<213> 未知(Unknown)
<400> 5
Met Glu Ile Lys Ala Ala Ile Val Arg Gln Lys Asn Gly Pro Phe Leu
1 5 10 15
Leu Glu His Val Ala Leu Asn Glu Pro Ala Glu Asp Gln Val Leu Val
20 25 30
Arg Leu Val Ala Thr Gly Leu Cys His Thr Asp Leu Val Cys Arg Asp
35 40 45
Gln His Tyr Pro Val Pro Leu Pro Met Val Phe Gly His Glu Gly Ala
50 55 60
Gly Val Val Glu Arg Val Gly Ser Ala Val Lys Lys Val Gln Pro Gly
65 70 75 80
Asp His Val Val Leu Thr Phe Tyr Thr Cys Gly Ser Cys Asp Ala Cys
85 90 95
Leu Ser Gly Asp Pro Thr Ser Cys Ala Asn Ser Phe Gly Pro Asn Phe
100 105 110
Met Gly Arg Ser Val Thr Gly Glu Cys Thr Ile His Asp His Gln Gly
115 120 125
Ala Glu Val Gly Ala Ser Phe Phe Gly Gln Ser Ser Phe Ala Thr Tyr
130 135 140
Ala Leu Ser Tyr Glu Arg Asn Thr Val Lys Val Thr Lys Asp Val Pro
145 150 155 160
Leu Glu Leu Leu Gly Pro Leu Gly Cys Gly Ile Gln Thr Gly Ala Gly
165 170 175
Ser Val Leu Asn Ala Leu Asn Pro Pro Ala Gly Ser Ala Ile Ala Ile
180 185 190
Phe Gly Ala Gly Ala Val Gly Leu Ser Ala Val Met Ala Ala Val Val
195 200 205
Ala Gly Cys Thr Thr Ile Ile Ala Val Asp Val Lys Glu Asn Arg Leu
210 215 220
Glu Leu Ala Ser Glu Leu Gly Ala Thr His Ile Ile Asn Pro Ala Ala
225 230 235 240
Asn Asp Pro Ile Glu Ala Ile Lys Glu Ile Phe Ala Asp Gly Val Pro
245 250 255
Tyr Val Leu Glu Thr Ser Gly Leu Pro Ala Val Leu Thr Gln Ala Ile
260 265 270
Leu Ser Ser Ala Ile Gly Gly Glu Ile Gly Ile Val Gly Ala Pro Pro
275 280 285
Met Gly Ala Thr Val Pro Val Asp Ile Asn Phe Leu Leu Phe Asn Arg
290 295 300
Lys Leu Arg Gly Ile Val Glu Gly Gln Ser Ile Ser Asp Ile Phe Ile
305 310 315 320
Pro Arg Leu Val Glu Leu Tyr Arg Gln Gly Lys Phe Pro Phe Asp Lys
325 330 335
Leu Ile Lys Phe Tyr Pro Phe Asp Glu Ile Asn Arg Ala Ala Glu Asp
340 345 350
Ser Glu Lys Gly Val Thr Leu Lys Pro Val Leu Arg Ile Gly
355 360 365
<210> 6
<211> 1101
<212> DNA
<213> 未知(Unknown)
<400> 6
atggaaatca aagcagcaat agttcgccaa aaaaatggcc cgttcttact tgagcatgta 60
gctcttaatg agccagctga agatcaggtt ctcgttagat tggttgcaac cgggctgtgt 120
catacggatc tggtttgtcg cgatcagcat tatccggttc cactaccgat ggtatttggg 180
catgaagggg ctggtgtggt tgagcgggtt gggtccgcgg tcaaaaaggt tcagccgggc 240
gaccatgttg ttttgacatt ttatacctgc gggagttgtg atgcttgtct ttccggagac 300
cctaccagtt gtgcaaactc atttggccct aactttatgg ggcgctcggt aaccggggag 360
tgcaccatcc acgatcacca aggggcagag gtgggagcaa gcttttttgg gcagtcctcc 420
tttgcgacat atgcgctatc ttatgaacgt aacactgtga aggttacaaa agacgtaccg 480
cttgagttgc ttgggcctct tggttgtggc attcaaactg gcgcagggtc tgttctgaat 540
gcgcttaatc cgccagcggg ttctgctatc gcaatttttg gtgctggggc agttggtctt 600
tcggccgtga tggctgccgt tgtagcaggt tgtaccacca tcatcgctgt cgacgttaag 660
gaaaaccggc tggaactagc cagtgaactt ggggcgacgc acattattaa cccggccgct 720
aacgatccca ttgaggcgat caaagagata ttcgctgacg gtgttccgta tgtattggag 780
actagcggtt tgcccgccgt gcttacgcag gcgatcctca gctctgctat aggcggtgag 840
atcggtattg taggggcgcc acctatgggg gccacggtgc ccgttgacat taacttcctg 900
ctattcaatc gtaagcttcg tggaatcgtt gagggtcagt cgatctcgga tattttcatt 960
cccaggctgg tggagcttta tcgccagggg aagtttccgt ttgacaagct gattaagttt 1020
tatccttttg atgaaatcaa tcgagccgcc gaagattcgg aaaaaggcgt gacgcttaag 1080
ccggtactcc ggattggttg a 1101
<210> 7
<211> 487
<212> PRT
<213> 未知(Unknown)
<400> 7
Met Arg Glu Thr Lys Glu Gln Pro Ile Trp Tyr Gly Lys Val Phe Ser
1 5 10 15
Ser Asn Trp Val Glu Ala Arg Gly Gly Val Ala Asn Val Val Asp Pro
20 25 30
Ser Asn Gly Asp Ile Leu Gly Ile Thr Gly Val Ala Asn Gly Glu Asp
35 40 45
Val Asp Ala Ala Val Asn Ala Ala Lys Arg Ala Gln Lys Glu Trp Ala
50 55 60
Ala Ile Pro Phe Ser Glu Arg Ala Ala Ile Val Arg Lys Ala Ala Glu
65 70 75 80
Lys Leu Lys Glu Arg Glu Tyr Glu Phe Ala Asp Trp Asn Val Arg Glu
85 90 95
Cys Gly Ala Ile Arg Pro Lys Gly Leu Trp Glu Ala Gly Ile Ala Tyr
100 105 110
Glu Gln Met His Gln Ala Ala Gly Leu Ala Ser Leu Pro Asn Gly Thr
115 120 125
Leu Phe Pro Ser Ala Val Pro Gly Arg Met Asn Leu Cys Gln Arg Val
130 135 140
Pro Val Gly Val Val Gly Val Ile Ala Pro Trp Asn Phe Pro Leu Phe
145 150 155 160
Leu Ala Met Arg Ser Val Ala Pro Ala Leu Ala Leu Gly Asn Ala Val
165 170 175
Ile Leu Lys Pro Asp Leu Gln Thr Ala Val Thr Gly Gly Ala Leu Ile
180 185 190
Ala Glu Ile Phe Ser Asp Ala Gly Met Pro Asp Gly Val Leu His Val
195 200 205
Leu Pro Gly Gly Ala Asp Val Gly Glu Ser Met Val Ala Asn Ser Gly
210 215 220
Ile Asn Met Ile Ser Phe Thr Gly Ser Thr Gln Val Gly Arg Leu Ile
225 230 235 240
Gly Glu Lys Cys Gly Arg Met Leu Lys Lys Val Ala Leu Glu Leu Gly
245 250 255
Gly Asn Asn Val His Ile Val Leu Pro Asp Ala Asp Leu Glu Gly Ala
260 265 270
Val Ser Cys Ala Ala Trp Gly Thr Phe Leu His Gln Gly Gln Val Cys
275 280 285
Met Ala Ala Gly Arg His Leu Val His Arg Asp Val Ala Gln Gln Tyr
290 295 300
Ala Glu Lys Leu Ala Leu Arg Ala Lys Asn Leu Val Val Gly Asp Pro
305 310 315 320
Asn Ser Asp Gln Val His Leu Gly Pro Leu Ile Asn Glu Lys Gln Val
325 330 335
Val Arg Val His Ala Leu Val Glu Ser Ala Gln Arg Ala Gly Ala Gln
340 345 350
Val Leu Ala Gly Gly Thr Tyr Gln Asp Arg Tyr Tyr Gln Ala Thr Val
355 360 365
Ile Met Asp Val Lys Pro Glu Met Glu Val Phe Lys Ser Glu Ile Phe
370 375 380
Gly Pro Val Ala Pro Ile Thr Val Phe Asp Ser Ile Glu Glu Ala Ile
385 390 395 400
Glu Leu Ala Asn Cys Ser Glu Tyr Gly Leu Ala Ala Ser Ile His Thr
405 410 415
Arg Ala Leu Ala Thr Gly Leu Asp Ile Ala Lys Arg Leu Asn Thr Gly
420 425 430
Met Val His Ile Asn Asp Gln Pro Ile Asn Cys Glu Pro His Val Pro
435 440 445
Phe Gly Gly Met Gly Ala Ser Gly Ser Gly Gly Arg Phe Gly Gly Pro
450 455 460
Ala Ser Ile Glu Glu Phe Thr Gln Ser Gln Trp Ile Ser Met Val Glu
465 470 475 480
Lys Pro Ala Asn Tyr Pro Phe
485
<210> 8
<211> 1464
<212> DNA
<213> 未知(Unknown)
<400> 8
atgcgggaaa caaaagagca gcctatctgg tacgggaagg tgtttagttc taattgggta 60
gaggcgcggg gaggtgttgc caatgttgtc gatccgtcca atggagacat tcttggcatt 120
acgggtgttg ctaacggcga agatgtcgat gctgctgtga acgcagctaa gagagcgcaa 180
aaggaatggg ccgcaatacc atttagtgaa agagccgcca ttgtccgcaa ggctgccgaa 240
aaactaaagg agcgcgaata tgaattcgcc gattggaacg tacgggaatg cggcgcaatt 300
cgtccgaagg gcttatggga ggccggaatt gcgtatgagc aaatgcatca agctgcgggt 360
ctagcttctt tgcctaacgg tacattgttt ccatcggcag ttccagggcg catgaatctt 420
tgtcagcgcg ttccagttgg cgtggtcggc gtaattgcac cttggaattt cccgttgttt 480
ctagcaatgc gttcggtagc accagcctta gcgttgggta atgcggtgat cttaaagccc 540
gaccttcaga ctgctgtcac cgggggggcg ctcattgccg aaatcttttc cgacgctggc 600
atgccggacg gtgttcttca cgttcttcct ggtggagcgg acgtaggaga gtcaatggtt 660
gcgaactccg gaattaacat gatttctttt accgggtcca cacaggtggg ccggttgatc 720
ggagagaaat gcgggagaat gctgaaaaag gttgcgcttg aactgggtgg taataatgtc 780
cacatcgtgt tgcctgacgc cgatttagaa ggggctgtca gctgcgctgc ttggggtacg 840
tttttgcatc agggccaagt gtgcatggcc gccggacgtc atttagtaca tagggacgtt 900
gctcagcaat atgcagagaa actggcgcta cgtgccaaga acttagtggt gggggatcca 960
aactcggatc aagtgcatct cggcccgctt atcaatgaga aacaggtagt tcgcgtccac 1020
gcgctcgttg aatctgcgca aagggccggt gctcaggttt tggcgggagg tacgtatcaa 1080
gatcgctact accaagctac cgtaatcatg gatgtgaagc cggagatgga ggttttcaaa 1140
tctgaaattt tcggcccggt ggctccgatc actgtatttg acagtattga agaggcgatt 1200
gaattggcaa actgttcgga gtatgggttg gccgcatcta tccatactag ggcgttggcg 1260
actggtctag acatcgcaaa gcgtctaaat accggtatgg tccatattaa tgaccagcca 1320
attaactgtg agccgcatgt tcccttcgga ggaatgggtg cctcgggtag cggaggccgg 1380
tttggcggac ctgcaagtat tgaagaattt actcaatctc aatggattag tatggttgag 1440
aagccagcta attacccatt ttga 1464
Claims (10)
1.一种高效合成5-甲基吡嗪-2-羧酸的重组工程菌,其特征在于,所述重组工程菌表达苏氨酸脱氢酶TDH、二甲苯单加氧酶XMO、苯甲醇脱氢酶BADH和苯甲醛脱氢酶BZDH。
2.根据权利要求1所述的重组工程菌,其特征在于,所述的苏氨酸脱氢酶为Escherichia coli来源的苏氨酸脱氢酶,其氨基酸序列如SEQ ID NO:1所示;所述的二甲苯单加氧酶为Pseudomonas aeruginosa strain来源的二甲苯单氧合酶,其氨基酸序列如SEQID NO:3所示;所述的苯甲醇脱氢酶为Pseudomonas aeruginosa strain来源的苯甲醇脱氢酶,其氨基酸序列如SEQ ID NO:5;所述苯甲醛脱氢酶为Pseudomonas aeruginosa strain来源的苯甲醛脱氢酶,其氨基酸序列如SEQ ID NO:7。
3.一种权利要求1或2所述的重组工程菌的构建方法,其特征在于,包括将苏氨酸脱氢酶基因tdh、二甲苯单氧合酶基因xmo、苯甲醇脱氢酶基因badh、苯甲醛脱氢酶基因bzdh克隆到表达载体上构建得到重组表达质粒,然后将构建的表达质粒转入宿主细胞中进行表达。
4.根据权利要求3所述的构建方法,其特征在于,所述的表达载体选自pRSFDuet-1、pET-28a、pETDuet-1、pCDFDuet-1中的一种或几种;优选所述的表达载体为pET-28a和pETDuet-1。
5.根据权利要求3所述的构建方法,其特征在于,所述的宿主细胞是大肠杆菌BL21(DE3)。
6.根据权利要求3所述的构建方法,其特征在于,将苏氨酸脱氢酶基因tdh连接到表达载体pET28a上得到重组表达质粒pET28a-tdh;二甲苯单氧合酶基因xmo与苯甲醇脱氢酶基因badh的连续基因、苯甲醛脱氢酶基因bzdh连接到表达载体pETDuet-1上得到重组表达质粒pETDuet-1-xmobadhbzdh,然后将构建的表达质粒都转入大肠杆菌BL21(DE3)中进行表达。
7.根据权利要求6所述的构建方法,其特征在于,所述二甲苯单氧合酶基因xmo与苯甲醇脱氢酶基因badh连接到表达载体pETDuet-1的第一个克隆位点,苯甲醛脱氢酶基因bzdh连接到表达载体pETDuet-1的第二个克隆位点上。
8.一种高效合成5-甲基吡嗪-2-羧酸的方法,其特征在于,所述方法是利用权利要求1-2任一项所述的重组工程菌为出发菌株,在含有L-苏氨酸的培养基中进行发酵生产得到5-甲基吡嗪-2羧酸。
9.根据权利要求8所述的方法,其特征在于,L-苏氨酸的浓度为8-12g/L。
10.根据权利要求8所述的方法,其特征在于,反应时间为8-12h,反应温度为22~30℃。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110596118.5A CN115478041A (zh) | 2021-05-30 | 2021-05-30 | 一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110596118.5A CN115478041A (zh) | 2021-05-30 | 2021-05-30 | 一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN115478041A true CN115478041A (zh) | 2022-12-16 |
Family
ID=84420228
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110596118.5A Pending CN115478041A (zh) | 2021-05-30 | 2021-05-30 | 一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115478041A (zh) |
-
2021
- 2021-05-30 CN CN202110596118.5A patent/CN115478041A/zh active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112143764B (zh) | 一种生物酶催化制备布瓦西坦中间体化合物的方法 | |
WO2010024445A1 (ja) | 光学活性なアミン誘導体を製造するための方法 | |
CN109735553B (zh) | 一种抗艾滋病药物阿扎那韦中间体的制备方法 | |
CN105779409B (zh) | 一种立体选择性酯酶、编码基因、载体、工程菌及其应用 | |
CN113355366A (zh) | 一种多酶级联制备2-苯乙醇的方法 | |
CN111996176A (zh) | 羰基还原酶突变体及其应用 | |
CN113355299B (zh) | 酮酸还原酶、基因、工程菌及在合成手性芳香2-羟酸中的应用 | |
CN113755539A (zh) | 一种二氢嘧啶氨基水解酶及其应用 | |
CN113817693A (zh) | 一种短链羰基还原酶PpYSDR突变体、编码基因、重组表达载体、基因工程菌及应用 | |
CN109943542B (zh) | 一种用于阿扎那韦中间体生产的醇脱氢酶 | |
CN117844772A (zh) | 一种胺脱氢酶突变体、工程菌及在合成(r)-3-氨基丁醇中的应用 | |
CN115478041A (zh) | 一种高效合成5-甲基吡嗪-2-羧酸工程菌的构建及应用 | |
CN111254170B (zh) | 一种多酶耦合制备(s)-1,2,3,4-四氢异喹啉-3-甲酸的方法 | |
CN110358804B (zh) | R-3-氨基正丁醇的酶法生产工艺 | |
CN114525291A (zh) | 一种羰基还原酶及利用该酶制备(r)-4-氯-3-羟基丁酸乙酯的方法 | |
WO2004022764A9 (en) | Use of malate dehydrogenase for nadh regeneration | |
CN115029329B (zh) | 一种羰基还原酶突变体及其在制备r-扁桃酸中的应用 | |
CN106085993B (zh) | 一种来源于葡萄藤伯克氏菌的酰胺酶、基因、菌株及其应用 | |
US20230107679A1 (en) | Method For Preparing (S)-1,2,3,4-Tetrahydroisoquinoline-1 Carboxylic Acid and Derivatives Thereof | |
CN107287256A (zh) | 全细胞催化合成l-2-哌啶甲酸的方法 | |
CN106701700B (zh) | 一种氧化酶及其应用 | |
CN114836486B (zh) | 酶催化合成手性beta-氨基醇的方法 | |
CN116064441B (zh) | L-泛解酸内酯脱氢酶突变体及其编码基因与应用 | |
JP2011177029A (ja) | 光学活性なアミン誘導体の製造方法 | |
CN106636022B (zh) | 一种氧化酶及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |