CN114703211A - Sucrose isomerase activity inclusion body fused with coiled coil structure domain - Google Patents
Sucrose isomerase activity inclusion body fused with coiled coil structure domain Download PDFInfo
- Publication number
- CN114703211A CN114703211A CN202210142279.1A CN202210142279A CN114703211A CN 114703211 A CN114703211 A CN 114703211A CN 202210142279 A CN202210142279 A CN 202210142279A CN 114703211 A CN114703211 A CN 114703211A
- Authority
- CN
- China
- Prior art keywords
- asp
- tdot
- arg
- asn
- ile
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 210000003000 inclusion body Anatomy 0.000 title claims abstract description 65
- 108010047540 sucrose isomerase Proteins 0.000 title claims abstract description 54
- 230000000694 effects Effects 0.000 title claims abstract description 48
- 108090000623 proteins and genes Proteins 0.000 claims description 32
- 238000006243 chemical reaction Methods 0.000 claims description 27
- PVXPPJIGRGXGCY-TZLCEDOOSA-N 6-O-alpha-D-glucopyranosyl-D-fructofuranose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)C(O)(CO)O1 PVXPPJIGRGXGCY-TZLCEDOOSA-N 0.000 claims description 21
- 229930006000 Sucrose Natural products 0.000 claims description 20
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 claims description 20
- 241000894006 Bacteria Species 0.000 claims description 19
- 239000005720 sucrose Substances 0.000 claims description 19
- 239000002773 nucleotide Substances 0.000 claims description 18
- 125000003729 nucleotide group Chemical group 0.000 claims description 18
- 102000037865 fusion proteins Human genes 0.000 claims description 13
- 108020001507 fusion proteins Proteins 0.000 claims description 13
- 230000001939 inductive effect Effects 0.000 claims description 6
- 125000003275 alpha amino acid group Chemical group 0.000 claims 2
- 239000013604 expression vector Substances 0.000 abstract description 17
- 238000003259 recombinant expression Methods 0.000 abstract description 15
- 241000588724 Escherichia coli Species 0.000 abstract description 14
- 241000410414 Klebsiella sp. LX3 Species 0.000 abstract description 4
- 238000011160 research Methods 0.000 abstract description 4
- 108010076504 Protein Sorting Signals Proteins 0.000 abstract description 3
- 230000002255 enzymatic effect Effects 0.000 abstract description 3
- 238000012512 characterization method Methods 0.000 abstract description 2
- 102000004190 Enzymes Human genes 0.000 description 48
- 108090000790 Enzymes Proteins 0.000 description 48
- 108020004414 DNA Proteins 0.000 description 18
- 229960004793 sucrose Drugs 0.000 description 18
- 238000000034 method Methods 0.000 description 16
- 239000000047 product Substances 0.000 description 15
- 210000004027 cell Anatomy 0.000 description 14
- 239000007788 liquid Substances 0.000 description 10
- 239000006228 supernatant Substances 0.000 description 10
- 230000003197 catalytic effect Effects 0.000 description 9
- 239000007795 chemical reaction product Substances 0.000 description 9
- 239000007853 buffer solution Substances 0.000 description 8
- 230000006698 induction Effects 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 238000002360 preparation method Methods 0.000 description 8
- 230000001965 increasing effect Effects 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 238000004458 analytical method Methods 0.000 description 6
- 239000000872 buffer Substances 0.000 description 6
- 238000012258 culturing Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 6
- 239000000758 substrate Substances 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 229930027917 kanamycin Natural products 0.000 description 5
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 5
- 229960000318 kanamycin Drugs 0.000 description 5
- 229930182823 kanamycin A Natural products 0.000 description 5
- 102000004169 proteins and genes Human genes 0.000 description 5
- 238000011084 recovery Methods 0.000 description 5
- 238000005406 washing Methods 0.000 description 5
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- 101150021746 SI gene Proteins 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- 230000003321 amplification Effects 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 238000006555 catalytic reaction Methods 0.000 description 4
- CBMPTFJVXNIWHP-UHFFFAOYSA-L disodium;hydrogen phosphate;2-hydroxypropane-1,2,3-tricarboxylic acid Chemical compound [Na+].[Na+].OP([O-])([O-])=O.OC(=O)CC(O)(C(O)=O)CC(O)=O CBMPTFJVXNIWHP-UHFFFAOYSA-L 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 239000002609 medium Substances 0.000 description 4
- 238000003199 nucleic acid amplification method Methods 0.000 description 4
- 239000002244 precipitate Substances 0.000 description 4
- 239000013049 sediment Substances 0.000 description 4
- 239000000126 substance Substances 0.000 description 4
- 108010020532 tyrosyl-proline Proteins 0.000 description 4
- WEVYAHXRMPXWCK-UHFFFAOYSA-N Acetonitrile Chemical compound CC#N WEVYAHXRMPXWCK-UHFFFAOYSA-N 0.000 description 3
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 3
- 241000556426 Erwinia rhapontici Species 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 108010093096 Immobilized Enzymes Proteins 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 239000012876 carrier material Substances 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 239000008367 deionised water Substances 0.000 description 3
- 229910021641 deionized water Inorganic materials 0.000 description 3
- 235000013305 food Nutrition 0.000 description 3
- 239000012634 fragment Substances 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 230000003100 immobilizing effect Effects 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 238000002156 mixing Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- 239000011734 sodium Substances 0.000 description 3
- 239000007787 solid Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 229920001351 ε-poly-L-lysine Polymers 0.000 description 3
- SVBWNHOBPFJIRU-UHFFFAOYSA-N 1-O-alpha-D-Glucopyranosyl-D-fructose Natural products OC1C(O)C(O)C(CO)OC1OCC1(O)C(O)C(O)C(O)CO1 SVBWNHOBPFJIRU-UHFFFAOYSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 2
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 2
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 2
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- 108010083946 Asp-Tyr-Leu-Lys Proteins 0.000 description 2
- 102000012410 DNA Ligases Human genes 0.000 description 2
- 108010061982 DNA Ligases Proteins 0.000 description 2
- 239000005715 Fructose Substances 0.000 description 2
- 108010010803 Gelatin Proteins 0.000 description 2
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 2
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 2
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 2
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 2
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 2
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 2
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 2
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 2
- 241000205077 Staphylothermus marinus Species 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 2
- WHJVRIBYQWHRQA-NQCBNZPSSA-N Trp-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 WHJVRIBYQWHRQA-NQCBNZPSSA-N 0.000 description 2
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 2
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 2
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 2
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 2
- 108010047495 alanylglycine Proteins 0.000 description 2
- 150000001413 amino acids Chemical group 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010068265 aspartyltyrosine Proteins 0.000 description 2
- 238000009835 boiling Methods 0.000 description 2
- 238000005119 centrifugation Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000004132 cross linking Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000000855 fermentation Methods 0.000 description 2
- 230000004151 fermentation Effects 0.000 description 2
- 239000008273 gelatin Substances 0.000 description 2
- 229920000159 gelatin Polymers 0.000 description 2
- 235000019322 gelatine Nutrition 0.000 description 2
- 235000011852 gelatine desserts Nutrition 0.000 description 2
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 150000002772 monosaccharides Chemical class 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 239000000843 powder Substances 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 239000012474 protein marker Substances 0.000 description 2
- 230000036632 reaction speed Effects 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 239000011780 sodium chloride Substances 0.000 description 2
- 229910000162 sodium phosphate Inorganic materials 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- NMXLJRHBJVMYPD-IPFGBZKGSA-N trehalulose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@]1(O)CO[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 NMXLJRHBJVMYPD-IPFGBZKGSA-N 0.000 description 2
- 239000012137 tryptone Substances 0.000 description 2
- 229920001817 Agar Polymers 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 1
- CKIBTNMWVMKAHB-RWGOJESNSA-N Ala-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 CKIBTNMWVMKAHB-RWGOJESNSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 1
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 1
- GFFRWIJAFFMQGM-NUMRIWBASA-N Asn-Glu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFFRWIJAFFMQGM-NUMRIWBASA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 1
- XMHFCUKJRCQXGI-CIUDSAMLSA-N Asn-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O XMHFCUKJRCQXGI-CIUDSAMLSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 1
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 1
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- YBMUFUWSMIKJQA-GUBZILKMSA-N Asp-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N YBMUFUWSMIKJQA-GUBZILKMSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 229930091371 Fructose Natural products 0.000 description 1
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- CRRFJBGUGNNOCS-PEFMBERDSA-N Gln-Asp-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CRRFJBGUGNNOCS-PEFMBERDSA-N 0.000 description 1
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 1
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 1
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 1
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 1
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- LKOAAMXDJGEYMS-ZPFDUUQYSA-N Glu-Met-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LKOAAMXDJGEYMS-ZPFDUUQYSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- NZOAFWHVAFJERA-OALUTQOASA-N Gly-Phe-Trp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NZOAFWHVAFJERA-OALUTQOASA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 101001113903 Grapevine leafroll-associated virus 3 (isolate United States/NY1) Protein P4 Proteins 0.000 description 1
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 1
- SYIPVNMWBZXKMU-HJPIBITLSA-N His-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N SYIPVNMWBZXKMU-HJPIBITLSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 1
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 1
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- AREBLHSMLMRICD-PYJNHQTQSA-N Ile-His-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AREBLHSMLMRICD-PYJNHQTQSA-N 0.000 description 1
- APDIECQNNDGFPD-PYJNHQTQSA-N Ile-His-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N APDIECQNNDGFPD-PYJNHQTQSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- ZUPJCJINYQISSN-XUXIUFHCSA-N Ile-Met-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUPJCJINYQISSN-XUXIUFHCSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- 239000012880 LB liquid culture medium Substances 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 1
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 1
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 1
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 1
- UIJVKVHLCQSPOJ-XIRDDKMYSA-N Lys-Ser-Trp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O UIJVKVHLCQSPOJ-XIRDDKMYSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 1
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 101100001794 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) aps-2 gene Proteins 0.000 description 1
- 241000611870 Pantoea dispersa Species 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- RLUMIJXNHJVUCO-JBACZVJFSA-N Phe-Gln-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 RLUMIJXNHJVUCO-JBACZVJFSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 1
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 1
- RVEVENLSADZUMS-IHRRRGAJSA-N Phe-Pro-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RVEVENLSADZUMS-IHRRRGAJSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- 241000222640 Polyporus Species 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 1
- XWYXZPHPYKRYPA-GMOBBJLQSA-N Pro-Asn-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XWYXZPHPYKRYPA-GMOBBJLQSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 1
- YSUZKYSRAFNLRB-ULQDDVLXSA-N Pro-Gln-Trp Chemical compound N([C@@H](CCC(=O)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 YSUZKYSRAFNLRB-ULQDDVLXSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- XFFIGWGYMUFCCQ-ULQDDVLXSA-N Pro-His-Tyr Chemical compound C1=CC(O)=CC=C1C[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)[C@H]1[NH2+]CCC1)CC1=CN=CN1 XFFIGWGYMUFCCQ-ULQDDVLXSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 1
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- 108010091086 Recombinases Proteins 0.000 description 1
- 102000018120 Recombinases Human genes 0.000 description 1
- 240000000111 Saccharum officinarum Species 0.000 description 1
- 235000007201 Saccharum officinarum Nutrition 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241001622809 Serratia plymuthica Species 0.000 description 1
- 241001052560 Thallis Species 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- QHUWWSQZTFLXPQ-FJXKBIBVSA-N Thr-Met-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QHUWWSQZTFLXPQ-FJXKBIBVSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- 229910010413 TiO 2 Inorganic materials 0.000 description 1
- KZIQDVNORJKTMO-WDSOQIARSA-N Trp-Arg-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N KZIQDVNORJKTMO-WDSOQIARSA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- OBWQLWYNNZPWGX-QEJZJMRPSA-N Trp-Gln-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O OBWQLWYNNZPWGX-QEJZJMRPSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- AZBIIKDSDLVJAK-VHWLVUOQSA-N Trp-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N AZBIIKDSDLVJAK-VHWLVUOQSA-N 0.000 description 1
- VUMCLPHXCBIJJB-PMVMPFDFSA-N Trp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N VUMCLPHXCBIJJB-PMVMPFDFSA-N 0.000 description 1
- XXJDYWYVZBHELV-TUSQITKMSA-N Trp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)N[C@@H](CCCCN)C(=O)O)N XXJDYWYVZBHELV-TUSQITKMSA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 1
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000009849 deactivation Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 206010012601 diabetes mellitus Diseases 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 238000001976 enzyme digestion Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000011389 fruit/vegetable juice Nutrition 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- XLYOFNOQVPJJNP-ZSJDYOACSA-N heavy water Substances [2H]O[2H] XLYOFNOQVPJJNP-ZSJDYOACSA-N 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 235000012907 honey Nutrition 0.000 description 1
- 239000005457 ice water Substances 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000002779 inactivation Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 238000009776 industrial production Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- 108010034529 leucyl-lysine Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 238000009630 liquid culture Methods 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 235000020098 plum wine Nutrition 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 239000002994 raw material Substances 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- QEVHRUUCFGRFIF-MDEJGZGSSA-N reserpine Chemical compound O([C@H]1[C@@H]([C@H]([C@H]2C[C@@H]3C4=C(C5=CC=C(OC)C=C5N4)CCN3C[C@H]2C1)C(=O)OC)OC)C(=O)C1=CC(OC)=C(OC)C(OC)=C1 QEVHRUUCFGRFIF-MDEJGZGSSA-N 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- AJPJDKMHJJGVTQ-UHFFFAOYSA-M sodium dihydrogen phosphate Chemical compound [Na+].OP(O)([O-])=O AJPJDKMHJJGVTQ-UHFFFAOYSA-M 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 235000011496 sports drink Nutrition 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N11/00—Carrier-bound or immobilised enzymes; Carrier-bound or immobilised microbial cells; Preparation thereof
- C12N11/16—Enzymes or microbial cells immobilised on or in a biological cell
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/70—Vectors or expression systems specially adapted for E. coli
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/12—Disaccharides
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P19/00—Preparation of compounds containing saccharide radicals
- C12P19/24—Preparation of compounds containing saccharide radicals produced by the action of an isomerase, e.g. fructose
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y504/00—Intramolecular transferases (5.4)
- C12Y504/99—Intramolecular transferases (5.4) transferring other groups (5.4.99)
- C12Y504/99011—Isomaltulose synthase (5.4.99.11)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Biophysics (AREA)
- Medicinal Chemistry (AREA)
- General Chemical & Material Sciences (AREA)
- Gastroenterology & Hepatology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
- Peptides Or Proteins (AREA)
Abstract
The invention discloses a sucrose isomerase activity inclusion body fused with a coiled coil domain, belonging to the field of biological engineering. The invention takes SI derived from Klebsiella sp.LX3 as a research object, Tdot is respectively fused at the N end or the C end of the SI from which a natural signal peptide sequence is removed, a recombinant expression vector is constructed and transferred into Escherichia coli E.coli BL21(DE3) for induced expression, and enzymatic property characterization is carried out after active inclusion bodies are further separated and purified, thereby providing a new strategy for heterologous expression and efficient utilization of recombinant sucrose isomerase.
Description
Technical Field
The invention relates to a sucrose isomerase activity inclusion body fused with a coiled coil structure domain, belonging to the field of biological engineering.
Background
Isomaltulose (alpha-D-glucopyranosyl-1, 6-fructose) is an isomer of sucrose, naturally present in honey, sugarcane juice, and has a sweetness of about 50% of sucrose. At present, isomaltulose is used mainly in the food industry as a substitute for sucrose in the production of health products, diabetic foods and sports drinks. In addition, isomaltulose is also used as a raw material for producing some surfactants and high molecular polymers in the chemical field, and thus is a functional sugar with good development prospects. At present, Sucrose Isomerase (SI) is mainly used for catalyzing Sucrose to be converted into isomaltulose industrially, and compared with a chemical method for synthesis, the method has the advantages of being mild in reaction conditions, environment-friendly in preparation process, high in reaction specificity and the like. Since the application of isomaltulose is more and more widespread and is more popular with consumers, the market demand of isomaltulose is increasing year by year, but the production is mainly monopolized by foreign companies, and the market price per ton is about 1.5 ten thousand yuan, so that the utilization rate of sucrose isomerase is increased, and the production cost of isomaltulose is reduced.
Various sucrose isomerase-producing bacteria have been found, including Erwinia rhapontici (Erwinia rhapontici), Serratia vulgaris (Serratia plymuthica), Polyporus frondosa (Pantoea dispersa), and Klebsiella pneumoniae (Klebsiella sp). However, the wild bacteria have low enzyme production level generally and are difficult to meet the requirements of industrial production, such as the fermentation enzyme activity of Erwinia rhapontici NX-5 is 1.3U/mL, and KlebsThe fermentation enzyme activity of iella sp.LX3 is 15.12U/mL. Therefore, researchers currently prepare recombinant sucrose isomerase mainly through heterologous expression means and immobilize it on a water-insoluble carrier to prepare immobilized enzyme so as to improve the utilization rate. Such as mesoporous TiO modified by epsilon-poly-L-lysine2And the sponge prepared from epsilon-poly-L-lysine and gelatin is used for immobilizing the sucrose isomerase, and the enzyme activity recovery rates are respectively 93.3% and 84.5%; the chitosan-glutaraldehyde crosslinking method, the sodium alginate-sodium carboxymethylcellulose mixed embedding method and the cross-linked enzyme aggregate method are used for immobilizing the sucrose isomerase, so that the enzyme activity recovery rates of 70.3%, 32.5% and 30.9% are respectively realized. In summary, although sucrose isomerase is successfully prepared into immobilized enzyme by the traditional adsorption method, binding method, crosslinking method and embedding method, the pure enzyme needs to be separated and purified by chromatographic technology first, and then the water-insoluble carrier and the pure enzyme are used for carrying out immobilization reaction, and the separation and purification, carrier preparation and chemical reagents related to the immobilization reaction in the process all increase the production cost of the immobilized enzyme, and are not beneficial to the large-scale production and application of the immobilized sucrose isomerase. In recent years, researches show that a specific tag sequence and enzyme protein are used for fusion expression, so that the specific tag sequence can be induced to form an Inclusion body (CatIBs, hereinafter referred to as active Inclusion body) with catalytic activity in cells, the expression quantity is large, the separation and purification are easy, especially the expression and immobilization processes are carried out synchronously, no additional carrier material and immobilization reaction are needed, the novel self-immobilized enzyme is a novel self-immobilized enzyme, the view that the past 'the Inclusion body is a completely inactive protein aggregate' is theoretically changed, and the novel enzyme heterologous expression and immobilization method has better industrial application potential.
Disclosure of Invention
In order to solve the problems in the prior art, the invention provides a sucrose isomerase activity inclusion body fused with a coiled coil domain. The SI derived from Klebsiella sp.LX3 is taken as a research object, and the invention has the advantages of higher sucrose conversion rate (99.5%), isomaltulose serving as a main product and the like. Tdot is respectively fused at the N end or the C end of SI from which a natural signal peptide sequence is removed, a recombinant expression vector is constructed and transferred into Escherichia coli E.coli BL21(DE3) for induced expression, and enzymatic property characterization is carried out after active inclusion bodies are further separated and purified, so that a new strategy is provided for heterologous expression and efficient utilization of the recombinant sucrose isomerase.
The invention provides a gene, the nucleotide sequence of which is shown as SEQ ID NO. 1.
The invention also provides a gene, and the nucleotide sequence is shown as SEQ ID NO. 2.
The invention also provides a fusion protein containing Tdot-SI and encoded by the gene with the nucleotide sequence shown as SEQ ID NO.1, and the amino acid sequence is shown as SEQ ID NO. 3.
The invention also provides a fusion protein containing SI-Tdot and encoded by the gene with the nucleotide sequence shown as SEQ ID NO.2, and the amino acid sequence is shown as SEQ ID NO. 4.
The invention also provides a recombinant engineering bacterium containing the gene with the nucleotide sequence shown as SEQ ID NO. 1.
The invention also provides a recombinant engineering bacterium containing the gene with the nucleotide sequence shown as SEQ ID NO. 2.
The invention also provides a sucrose isomerase active inclusion body which is obtained by inducing fusion protein expression by recombinant engineering bacteria containing a gene with a nucleotide sequence shown as SEQ ID NO.1 or recombinant engineering bacteria containing a gene with a nucleotide sequence shown as SEQ ID NO.2, and crushing, separating and purifying thalli.
Specifically, the preparation method of the inclusion body with sucrose isomerase activity comprises the following steps: a coiled-coil domain Tdot is fused at the N or C end of sucrose isomerase SI to construct a recombinant expression vector, and the recombinant expression vector is transferred into Escherichia coli E.coli BL21(DE3) for induced expression to prepare an active inclusion body.
The preparation method of the sucrose isomerase activity inclusion body of the recombinant engineering bacteria containing the gene with the nucleotide sequence shown as SEQ ID NO.1 specifically comprises the following steps:
(1) constructing a recombinant expression vector pET-24a-Tdot-SI containing a coding gene of Tdot-SI fusion protein, and transferring the recombinant expression vector into an expression host cell to obtain recombinant engineering bacteria;
(2) culturing the recombinant engineering bacteria, and inducing the expression of the fusion protein;
(3) collecting bacterial cells, carrying out cell disruption, separating cell disruption supernatant and a sediment sample containing the inclusion body, and washing and purifying the sediment sample by buffer solution to obtain the active inclusion body.
The preparation method of the sucrose isomerase activity inclusion body of the recombinant engineering bacteria containing the gene with the nucleotide sequence shown as SEQ ID NO.2 specifically comprises the following steps:
(1) constructing a recombinant expression vector pET-24b-SI-Tdot containing an encoding gene of SI-Tdot fusion protein, transferring the recombinant expression vector into an expression host cell to obtain recombinant engineering bacteria;
(2) culturing the recombinant engineering bacteria, and inducing the expression of the fusion protein;
(3) collecting bacterial cells, carrying out cell disruption, separating cell disruption supernatant and a sediment sample containing the inclusion body, and washing and purifying the sediment sample by buffer solution to obtain the active inclusion body.
Preferably, the sucrose isomerase SI is derived from Klebsiella sp.LX3, and the nucleotide sequence of the sucrose isomerase SI is shown as SEQ ID NO. 5.
Preferably, the Tdot gene is derived from Staphylothermus marinus (Staphylothermus marinus), and the nucleotide sequence of the Tdot gene is shown as SEQ ID NO. 6.
By way of example and not limitation, the expression host cell is preferably E.coli BL21(DE 3).
The invention also provides application of the inclusion body with sucrose isomerase activity in catalyzing conversion of sucrose into isomaltulose.
Further, the inclusion body of the sucrose isomerase catalyzes sucrose to be converted into isomaltulose, the pH value is 4.5-6.5, and the temperature is 30-45 ℃.
The present invention expresses the tetrameric coiled-coil domain of the cell-surface protein tetrabrachion (Tdot) derived from the cell surface protein tetrabrachion in fusion with SI and studies the enzymatic properties of the recombinase. Respectively fusing the N/C terminals of Tdot and SI to construct a recombinant expression vector, transferring the recombinant expression vector into E.coli BL21(DE3) for induced expression, and displaying that Tdot-SI and SI-Tdot are expressed as inclusion bodies with catalytic activity, while SI inclusion bodies without fused Tdot have no catalytic activity. Further, active inclusion bodies are purified by centrifugation and washing with a buffer solution, and the study on the enzymology property shows that the optimum reaction temperature of Tdot-SI and SI-Tdot active inclusion bodies is 40 ℃, and the optimum reaction pH is 5.5 and 5.0 respectively. The kinetic constants Km of Tdot-SI active inclusion bodies are 103.9 +/-9.5 mmol/L, kcat/Km is 0.06 +/-0.002L/(mmol.s), the kinetic constants Km of SI-Tdot active inclusion bodies are 54.4 +/-6.6 mmol/L, and kcat/Km is 0.03 +/-0.002L/(mmol.s). The results of the product specificity studies show that as the reaction temperature increases, the isomaltulose content of the product does not change significantly, whereas the trehalulose content of the product decreases as the conversion reaction temperature increases, while the monosaccharide content increases as the conversion reaction temperature increases. The sucrose isomerase active inclusion body fused with the coiled coil structure domain is successfully prepared by a fusion expression technology, has the advantages of synchronous realization of expression and immobilization as a novel self-immobilized enzyme, and provides a new strategy for realizing large-scale preparation and high-efficiency utilization of recombinant sucrose isomerase.
Drawings
FIG. 1 is colony PCR identification of transformed recombinant plasmid engineering bacteria; lane M is 250bp DNA Ladder Marker, and Lane 1 is pET-24 a-Tdot-SI; lane 2 is pET-24 b-SI-Tdot.
FIG. 2 is an SDS-PAGE analysis of recombinant 6 XHis-SI/SI-6 XHis; lane M is Protein Marker; lanes 1-4 are pET24b-6 XHis-SI before induction, after induction, supernatant, and pellet; lanes 5-8 show pET24b-SI-6 XHis, pre-induction, post-induction, supernatant, and pellet.
FIG. 3 is an SDS-PAGE analysis of recombinant Tdot-SI/SI-Tdot; lane M is Protein Marker; lanes 1-4 are inclusion bodies before, after, supernatant, and washing pET24a-Tdot-SI induction; lanes 5-8 are inclusion bodies before, after, supernatant, and wash of pET24b-SI-Tdot induction.
FIG. 4 is a graph of the effect of temperature on Tdot-SI and SI-Tdot enzyme activity.
FIG. 5 is a graph of the effect of pH on Tdot-SI and SI-Tdot enzyme activity.
Detailed Description
The following non-limiting examples will allow one of ordinary skill in the art to more fully understand the present invention, but are not intended to limit the invention in any way.
Sucrose isomerase SI (NCBI access number AAK82938) is derived from Klebsiella sp.LX3, and the nucleotide sequence of the sucrose isomerase SI gene is shown in SEQ ID NO. 5. The plasmid pET-28a-SI carrying the SI gene of sucrose isomerase can be synthesized by conventional means in the art. Plasmids pET24b-6 XHis-SI (nucleotide sequence shown in SEQ ID NO. 13) and pET24b-SI-6 XHis (nucleotide sequence shown in SEQ ID NO. 14) fused with histidine tags at the N/C terminal respectively can be synthesized by the conventional method in the field.
The nucleotide sequence of Tdot is shown in SEQ ID NO.6, and the plasmid pET-24a-Tdot carrying Tdot gene is synthesized by Suzhou Hongxi Biotechnology GmbH.
Kanamycin, Isopropyl-beta-D-thiogalactopyranoside (IPTG), Triton-X100, a SanPrep column type plasmid DNA small extraction kit, a SanPrep column type DNA gel recovery kit and a BCA protein quantitative detection kit are all purchased from the company Limited in the Biotechnology engineering (Shanghai).
HS DNA Polymerase, QuickCutTM BamH I and QuickCutTM Xho I, Dpn I, T4 DNA ligase, DNA and protein relative molecular weight standards were purchased from Takara Bio Inc.
Gene sequencing work was performed by Jilin province, Kuumei, Biotechnology, Inc.
Isomaltulose was purchased from Sigma.
Other chemicals were purchased from chemical reagents ltd, miuiou, department of tianjin.
LB liquid medium: contains 10g/L tryptone, 5g/L yeast powder and 10g/L NaCl, and has the pH value of 7.2; the LB solid medium was LB liquid medium to which 1.5% agar was added.
LLB medium: contains 10g/L tryptone, 5g/L yeast powder and 5g/L NaCl, and has a pH value of 7.2.
Buffer a Buffer: na (Na)2HPO4·12H2O:16.937g、NaH2PO42H 2O: 0.62g of the extract is dissolved in 750mL of deionized water, the pH value is adjusted to 8.0, 50mL of glycerol is added, and the volume is adjusted to 1L by using the deionized water.
Buffer B Buffer: 0.5mL of Triton-X100 was dissolved in 100mL of Buffer A Buffer.
Buffer C Buffer: na (Na)2HPO4·12H2O:16.937g、NaH2PO4·2H2O: 0.62g of the suspension is dissolved in 750mL of deionized water with the concentration of 450-.
EXAMPLE 1 construction of recombinant expression vectors
The pET-24a-Tdot-SI expression vector is constructed by using the inner primers in the table 1 and adopting an enzyme digestion connection method: taking pET-28a-SI preserved in a laboratory as a template, carrying SI coding gene which does not contain natural signal peptide, amplifying by using a primer P1/P2 to obtain an SI gene fragment containing a restriction enzyme cutting site, carrying out double restriction on the SI gene fragment and pET-24a-Tdot by using QuickCutTM BamH I and QuickCutTM Xho I, carrying out electrophoresis separation on restriction enzyme products by using 1% agarose gel, recovering a linearized vector and a target gene fragment by using a DNA gel recovery kit, and continuing carrying out ligation reaction by using T4 DNA ligase. The ligation reaction products were transformed into E.coli DH10B competent cells by heat shock and plated on LB solid plates containing 50. mu.g/mL kanamycin. The single colony is picked and colony PCR identification is carried out by using the primer P5/P6, and the result is shown in FIG. 1(A), and the amplification product has a single band at the position of 1500-2250bp and is consistent with the size of the expected amplification sequence 2055 bp. And continuing culturing the positive monoclonal antibody which is verified to be correct, extracting plasmids, then carrying out gene sequencing, and marking the recombinant expression vector which is correctly sequenced as pET-24 a-Tdot-SI.
The cloning technology of the Restriction free-cloning (RF) is adopted to construct a pET-24b-SI-Tdot expression vector: and carrying out a first round of PCR amplification reaction by using pET-24a-Tdot as a template and using a primer P3/P4, separating a PCR amplification product by using 1% agarose gel electrophoresis, recovering by using a DNA gel recovery kit, using the PCR amplification product as a long primer of a second round of PCR reaction, and continuously carrying out a second round of PCR reaction by using pET24b-SI-6 × His as a template. After the reaction was completed, Dpn I was added to decompose the template plasmid, and the reaction product was transformed into E.coli DH10B competent cells by electric shock and plated on LB solid plates containing 50. mu.g/mL kanamycin. The single colony is picked and subjected to colony PCR identification by using the primer P5/P6, and the result is shown in FIG. 1(B), and the amplification product has a single band at the position of 1500-2250bp and is consistent with the expected amplification sequence 2049bp in size. And continuing culturing the positive monoclonal antibody which is verified to be correct, extracting plasmids, then carrying out gene sequencing, and marking the recombinant expression vector which is correctly sequenced as pET-24 b-SI-Tdot.
TABLE 1 primer sequences used in the present invention
Example 2 inducible expression and isolation and purification of recombinant genes in E.coli
The constructed recombinant expression vectors pET-24a-Tdot-SI and pET-24b-SI-Tdot and the recombinant expression vectors pET24b-6 XHis-SI and pET24b-SI-6 XHis are respectively transformed into E.coli BL21(DE3) competent cells, the positive monoclonals are selected and inoculated into 5mL LB liquid culture medium containing 50 ug/mL kanamycin, and the liquid is cultured at 37 ℃ and 200r/min for 14h to obtain seed liquid. Inoculating the seed liquid into LLB liquid culture medium containing 50 μ g/mL kanamycin at 1% (v/v), culturing at 37 deg.C and 200r/min until OD600nm reaches 0.6-0.8, adding inducer, wherein recombinant engineering bacteria E.coli BL21(DE3)/pET-24a-Tdot-SI is added with IPTG with final concentration of 0.8mmol/L, E.coli BL21(DE3)/pET-24b-SI-Tdot is added with IPTG with final concentration of 0.1mmol/L, E.coli BL21(DE3)/pET24b-6 × His-SI and E.coli BL21(DE3)/pET24b-SI-6 × His is added with IPTG with final concentration of 0.5mmol/L, and culturing the bacterial liquid with IPTG added with the above liquid at 16 deg.C and 200r/min for inducing expression for 20 h.
And (3) separating and purifying the active inclusion bodies by using a buffer solution cleaning method: the bacterial liquid after induction expression is collected by centrifugation and is subjected to ultrasonic disruption in an ice water bath after being resuspended by Buffer A (the power is 500W, the disruption time is 3s, the interval is 5s, and the disruption time is 20 min). And centrifuging the crushed whole bacteria sample at 10000r/min for 15min to separate a soluble supernatant and a precipitate component containing the inclusion body. And washing the precipitate for 2 times and 3 times by using a Buffer B and a Buffer C in sequence, centrifuging the washed inclusion body sample for 15min at 10000r/min for recycling, and fully suspending the washed inclusion body sample by using the Buffer C to obtain the active inclusion body enzyme solution. And (3) adding a small amount of active inclusion body enzyme solution into an isometric 10% SDS solution for treatment for 5min to completely dissolve the inclusion bodies, and then determining the protein concentration by using a BCA protein quantitative detection kit. The cell disruption supernatant and the washed active inclusion body sample were detected by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE).
As shown in FIG. 2, SDS-PAGE analysis of recombinant 6 XHis-SI and SI-6 XHis was performed. After induction, a target band which is consistent with the theoretical relative molecular mass of 6 XHis-SI and SI-6 XHis of 68.14kDa appears at 66.4kDa, and is mainly expressed in the form of inclusion bodies, and enzyme activity detection finds that the inclusion body precipitate has no catalytic activity, which shows that recombinant SI protein is mainly expressed in the form of inactive inclusion bodies no matter histidine tags are fused at the N/C end.
As shown in FIG. 3, SDS-PAGE analysis of recombinant Tdot-SI and SI-Tdot is performed. After induction, a target band which is consistent with the theoretical relative molecular mass of Tdot-SI and SI-Tdot and has 73.18kDa appears on the whole bacteria sample above the 66.4kDa protein relative molecular mass standard, which indicates that the fusion protein is successfully induced and expressed. The expressed fusion proteins Tdot-SI and SI-Tdot still mainly exist in a precipitate sample in the form of inclusion bodies, and the inclusion bodies are subjected to enzyme activity detection to find that the fusion proteins have high catalytic activity, which indicates that the fusion Tdot label induces the SI to be expressed as an active inclusion body.
Example 4 determination of active Inclusion body enzyme Activity
(1) Determination of optimum reaction temperature
Taking 100 mu L of Tdot-SI and SI-Tdot activity inclusion body enzyme liquid with the concentration of 4mg/mL, respectively mixing with 400 mu L of pH 6.0 disodium hydrogen phosphate-citric acid buffer solution containing 4% of cane sugar, respectively reacting at 30, 35, 40, 45, 50 and 55 ℃ at 200r/min for 2h, and then treating a reaction sample in boiling water with the temperature of 100 ℃ for 15min for enzyme inactivation. Then, the reaction product after enzyme deactivation is centrifuged for 15min at 10000r/min, and the obtained supernatant is filtered by a 0.22 mu m filter membrane and then is subjected to HPLC detection. The HPLC detection conditions are as follows: the reaction product was detected using a Hypersil APS-2, 4.6X 250mm, 5 μm amino column, with a differential detector, mobile phase acetonitrile: water (80: 20 v/v), column temperature 35 ℃, sample size 10 μ L, flow rate 1.0 mL/min. And calculating the concentration of the isomaltulose in the reaction product and calculating the enzyme activity according to a labeling curve fitted by the isomaltulose standard substance and the corresponding peak area. Definition of enzyme activity units: the amount of enzyme required to release 1. mu. moL of isomaltulose per minute using sucrose as substrate was 1 enzyme activity unit (U). Definition of relative enzyme activity: the highest enzyme activity of the same group is 100%, and the calculated ratio is the relative enzyme activity.
The results are shown in fig. 4, the Tdot-SI and SI-Tdot active inclusion bodies have the highest enzyme activity at 40 ℃, the enzyme activity increases and increases with the temperature within the range of 30-40 ℃, the both have the enzyme activity of more than 80% within the range of 35-45 ℃, and the enzyme activities of the Tdot-SI and SI-Tdot active inclusion bodies are rapidly reduced after the temperature is higher than 45 ℃.
(2) Determination of optimum reaction pH
Mixing Tdot-SI and SI-Tdot activity inclusion body enzyme liquid with 4% sucrose pH 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0 disodium hydrogen phosphate-citric acid buffer solution, performing enzyme catalysis reaction at 40 deg.C, quantitatively detecting isomaltulose in the product, calculating enzyme activity, and measuring enzyme activity to determine optimum reaction pH.
As shown in FIG. 5, the enzyme activities of Tdot-SI and SI-Tdot activity inclusion bodies were increased with increasing pH in the range of pH 4.0 to 5.0, and had the highest enzyme activities at pH 5.5 and pH 5.0, respectively. In addition, both have more than 80% catalytic activity in the pH range of 5.0-6.0, and the enzyme activity rapidly decreases after pH above 6.5.
(3) Determination of the kinetic constants of the enzymatic reaction
Tdot-SI and SI-Tdot active inclusion body enzyme solutions are respectively mixed with sucrose solutions with different concentrations (10, 20, 40, 50, 100, 200, 250, 400, 500 and 800mmol/L), catalytic reaction is carried out under the optimal reaction conditions (Tdot-SI: 40 ℃, pH 5.5; SI-Tdot: 40 ℃, pH 5), and then the generation amount of the product isomaltulose is detected by HPLC and the reaction speed is calculated. Then nonlinear fitting is carried out by utilizing GraphPad Prism 5.0 based on a Michaelis-Menten equation to obtain kinetic constants Km and Vmax, and kcat/Km is calculated.
The specific activity of Tdot-SI and SI-Tdot active inclusion bodies is 622.6 +/-17.5U/g and 252.6 +/-17.1U/g respectively under the optimal reaction condition. Further detecting the conversion reaction product of the active inclusion body under sucrose substrates with different concentrations, calculating to obtain the catalytic reaction speed, obtaining a kinetic constant through nonlinear fitting, and obtaining the result shown in Table 2, wherein the K of Tdot-SIm103.9. + -. 9.5mmol/L, kcat/KmK of 0.06 + -0.002L/(mmols), SI-TdotmIs 54.4 +/-6.6 mmol/L, kcat/KmThe activity inclusion body formed by fusing the Tdot label with the N/C end of the SI has different kinetic parameters on substrate sucrose, and the SI-Tdot has stronger binding capacity with the substrate sucrose, but the catalytic efficiency is slightly lower than that of the Tdot-SI.
TABLE 2 kinetic parameters of Tdot-SI/SI-Tdot
Research work has been reported on the preparation of immobilized sucrose isomerase, such as mesoporous TiO modified with epsilon-poly-L-lysine2The specific activity of immobilized SI is 39.41U/g, Km is 204.92mmol/L [ Wu L, Liu Y, Chi B, et al, an innovative method for immobilizing a sugar isomer on epsilon-poly-L-lysine modified meso-TiO 2[ J].Food chemistry,2015,187:182-188.]Sucrose Isomerase was immobilized using a Sponge made of epsilon-poly-L-lysine and gelatin with a specific activity of 71.58U/g and a Km of 245.71mmol/L, and was not significantly changed from the unimmobilized free enzyme Km of 259.71mmol/L [ Wu L, Qiu J, Wu S, et al, Bioinspired Production of Antibacterial Sucrose isomer-Sponge for the Synthesis of Isomaltulose [ J L].Advanced Synthesis&Catalysis,2016,358(24):4030-4040.]. The invention synchronously expresses and fixes SI in cellsThe active inclusion body is prepared by the customization, so that a carrier material and an additional immobilization reaction are not needed, and the specific activity per unit mass is better, and the application potential is better. Meanwhile, the invention discovers that the active inclusion bodies formed by fusing the Tdot label at the N/C terminal of the SI show different kinetic parameters to the substrate sucrose, which indicates that the formed active inclusion bodies influence the combination with the substrate and the catalytic process due to different microstructures.
Example 6 product specificity analysis
Mixing 100 μ L of Tdot-SI active inclusion body enzyme solution with concentration of 4mg/mL and 400 μ L of disodium hydrogen phosphate-citric acid buffer solution with pH 5.5 containing 4% sucrose; mu.L of SI-Tdot active inclusion body enzyme solution with the concentration of 4mg/mL is mixed with 400 mu.L of disodium hydrogen phosphate-citric acid buffer solution with the pH value of 5.0 and containing 4 percent of sucrose. And (3) reacting the active inclusion body conversion reaction system at 30, 35 and 40 ℃ for 12h at 200r/min respectively, then treating an enzyme conversion sample by boiling water at 100 ℃ for 15min for inactivating enzyme, centrifuging at 10000r/min for 15min to obtain reaction product supernatant, filtering by using a 0.22 mu m filter membrane, and detecting the conversion reaction product by using HPLC.
As shown in Table 3, the isomaltulose content of the conversion reaction products did not change significantly in the current temperature range of Tdot-SI and SI-Tdot active inclusion bodies, while the trehalulose content of the products generally showed a decreasing trend with increasing conversion reaction temperature, while the monosaccharide (fructose and glucose) content increased with increasing temperature.
TABLE 3 analysis of the composition of the catalytic product of sucrose isomerase fused with Tdot tag at different temperatures
The invention successfully expresses sucrose isomerase as an active inclusion body in Escherichia coli E.coli BL21(DE3) by using a coiled coil domain Tdot as a fusion label through a fusion expression technology, realizes the synchronous expression and immobilization, is used as a novel self-immobilized enzyme, does not need a carrier material and a chemical reagent for a traditional immobilization reaction, has potential industrial application value, provides a new strategy for realizing the efficient preparation and utilization of the recombinant sucrose isomerase, and lays a foundation for further design and optimization.
SEQUENCE LISTING
<110> constitution of plum wine
<120> sucrose isomerase activity inclusion body fused with coiled coil domain
<130> 2022
<160> 14
<170> PatentIn version 3.5
<210> 1
<211> 1869
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 1
atgatcatta acgaaactgc cgatgacatc gtttatcgcc tgacagtcat tatcgatgat 60
cgctacgaat cgctgaaaaa cctgattacc ttacgtgcag atcgcttgga gatgatcatc 120
aatgacaatg tgtccaccat tctcgcgagc attggatccg caccatcctt gaatcaggat 180
attcacgttc aaaaggaaag tgaatatcct gcatggtgga aagaagctgt tttttatcag 240
atctatcctc gctcatttaa agacaccaat gatgatggca ttggcgatat tcgcggtatt 300
attgaaaagc tggactatct gaaatcgctc ggtattgacg ctatctggat caatccccat 360
tacgactctc cgaacaccga taacggctat gacatcagta attatcgtca gataatgaaa 420
gagtatggca caatggagga ttttgatagc cttgttgccg aaatgaaaaa acgaaatatg 480
cgcttaatga tcgacgtggt cattaaccat accagtgatc aacacccgtg gtttattcag 540
agtaaaagcg ataaaaacaa cccttatcgt gactattatt tctggcgtga cggaaaagat 600
aatcagccac ctaataatta cccctcattt ttcggcggct cggcatggca aaaagatgca 660
aagtcaggac agtactattt acactatttt gccagacagc aacctgatct caactgggat 720
aacccgaaag tacgtgagga tctttacgca atgctccgct tctggctgga taaaggcgtt 780
tcaggcatgc gatttgatac ggtggcaact tattccaaaa tcccgggatt tcccaatctg 840
acacctgaac aacagaaaaa ttttgctgaa caatacacca tggggcctaa tattcatcga 900
tacattcagg aaatgaaccg gaaagttctg tcccggtatg atgtggccac cgcgggtgaa 960
atttttggcg tcccgctgga tcgttcgtcg cagttttttg atcgccgccg acatgagctg 1020
aatatggcgt ttatgtttga cctcattcgt ctcgatcgcg acagcaatga acgctggcgt 1080
cacaagtcgt ggtcgctctc tcagttccgc cagatcatca gcaaaatgga tgtcacggtc 1140
ggaaagtatg gctggaacac gttcttctta gataaccatg acaacccccg tgcggtatct 1200
cacttcgggg atgacaggcc gcaatggcgg gaggcgtcgg ctaaggcact ggcgacgatt 1260
accctcactc agcgggcgac gccgtttatt tatcagggtt cagagctggg aatgactaat 1320
tatcccttca ggcaactcaa cgaatttgac gacatcgagg tcaaaggttt ctggcaggat 1380
tatgtccaga gtggaaaagt cacggccaca gagtttctcg ataatgtgcg cctgacgagc 1440
cgcgataaca gcagaacacc tttccagtgg aatgacaccc tgaatgctgg ttttactcgc 1500
ggaaagccgt ggtttcacat caacccaaac tatgtggaga tcaacgccga acgcgaagaa 1560
acccgcgaag attcagtgct gaattactat aaaaaaatga ttcagctacg ccaccatatc 1620
cctgctctgg tatatggcgc ctatcaggat cttaatccac aggacaatac cgtttatgcc 1680
tatacccgaa cgctgggtaa cgagcgttat ctggtcgtgg tgaactttaa ggagtacccg 1740
gtccgctata ctctcccggc taatgatgcc atcgaggaag tggtcattga tactcagcag 1800
caggcggctg cgccgcacag cacatccctg tcattgagcc cctggcaggc aggtgtgtat 1860
aagctgcgg 1869
<210> 2
<211> 1869
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 2
atggcaccat ccttgaatca ggatattcac gttcaaaagg aaagtgaata tcctgcatgg 60
tggaaagaag ctgtttttta tcagatctat cctcgctcat ttaaagacac caatgatgat 120
ggcattggcg atattcgcgg tattattgaa aagctggact atctgaaatc gctcggtatt 180
gacgctatct ggatcaatcc ccattacgac tctccgaaca ccgataacgg ctatgacatc 240
agtaattatc gtcagataat gaaagagtat ggcacaatgg aggattttga tagccttgtt 300
gccgaaatga aaaaacgaaa tatgcgctta atgatcgacg tggtcattaa ccataccagt 360
gatcaacacc cgtggtttat tcagagtaaa agcgataaaa acaaccctta tcgtgactat 420
tatttctggc gtgacggaaa agataatcag ccacctaata attacccctc atttttcggc 480
ggctcggcat ggcaaaaaga tgcaaagtca ggacagtact atttacacta ttttgccaga 540
cagcaacctg atctcaactg ggataacccg aaagtacgtg aggatcttta cgcaatgctc 600
cgcttctggc tggataaagg cgtttcaggc atgcgatttg atacggtggc aacttattcc 660
aaaatcccgg gatttcccaa tctgacacct gaacaacaga aaaattttgc tgaacaatac 720
accatggggc ctaatattca tcgatacatt caggaaatga accggaaagt tctgtcccgg 780
tatgatgtgg ccaccgcggg tgaaattttt ggcgtcccgc tggatcgttc gtcgcagttt 840
tttgatcgcc gccgacatga gctgaatatg gcgtttatgt ttgacctcat tcgtctcgat 900
cgcgacagca atgaacgctg gcgtcacaag tcgtggtcgc tctctcagtt ccgccagatc 960
atcagcaaaa tggatgtcac ggtcggaaag tatggctgga acacgttctt cttagataac 1020
catgacaacc cccgtgcggt atctcacttc ggggatgaca ggccgcaatg gcgggaggcg 1080
tcggctaagg cactggcgac gattaccctc actcagcggg cgacgccgtt tatttatcag 1140
ggttcagagc tgggaatgac taattatccc ttcaggcaac tcaacgaatt tgacgacatc 1200
gaggtcaaag gtttctggca ggattatgtc cagagtggaa aagtcacggc cacagagttt 1260
ctcgataatg tgcgcctgac gagccgcgat aacagcagaa cacctttcca gtggaatgac 1320
accctgaatg ctggttttac tcgcggaaag ccgtggtttc acatcaaccc aaactatgtg 1380
gagatcaacg ccgaacgcga agaaacccgc gaagattcag tgctgaatta ctataaaaaa 1440
atgattcagc tacgccacca tatccctgct ctggtatatg gcgcctatca ggatcttaat 1500
ccacaggaca ataccgttta tgcctatacc cgaacgctgg gtaacgagcg ttatctggtc 1560
gtggtgaact ttaaggagta cccggtccgc tatactctcc cggctaatga tgccatcgag 1620
gaagtggtca ttgatactca gcagcaggcg gctgcgccgc acagcacatc cctgtcattg 1680
agcccctggc aggcaggtgt gtataagctg cggggatcca tcattaacga aactgccgat 1740
gacatcgttt atcgcctgac agtcattatc gatgatcgct acgaatcgct gaaaaacctg 1800
attaccttac gtgcagatcg cttggagatg atcatcaatg acaatgtgtc caccattctc 1860
gcgagcatt 1869
<210> 3
<211> 623
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 3
Met Ile Ile Asn Glu Thr Ala Asp Asp Ile Val Tyr Arg Leu Thr Val
1 5 10 15
Ile Ile Asp Asp Arg Tyr Glu Ser Leu Lys Asn Leu Ile Thr Leu Arg
20 25 30
Ala Asp Arg Leu Glu Met Ile Ile Asn Asp Asn Val Ser Thr Ile Leu
35 40 45
Ala Ser Ile Gly Ser Ala Pro Ser Leu Asn Gln Asp Ile His Val Gln
50 55 60
Lys Glu Ser Glu Tyr Pro Ala Trp Trp Lys Glu Ala Val Phe Tyr Gln
65 70 75 80
Ile Tyr Pro Arg Ser Phe Lys Asp Thr Asn Asp Asp Gly Ile Gly Asp
85 90 95
Ile Arg Gly Ile Ile Glu Lys Leu Asp Tyr Leu Lys Ser Leu Gly Ile
100 105 110
Asp Ala Ile Trp Ile Asn Pro His Tyr Asp Ser Pro Asn Thr Asp Asn
115 120 125
Gly Tyr Asp Ile Ser Asn Tyr Arg Gln Ile Met Lys Glu Tyr Gly Thr
130 135 140
Met Glu Asp Phe Asp Ser Leu Val Ala Glu Met Lys Lys Arg Asn Met
145 150 155 160
Arg Leu Met Ile Asp Val Val Ile Asn His Thr Ser Asp Gln His Pro
165 170 175
Trp Phe Ile Gln Ser Lys Ser Asp Lys Asn Asn Pro Tyr Arg Asp Tyr
180 185 190
Tyr Phe Trp Arg Asp Gly Lys Asp Asn Gln Pro Pro Asn Asn Tyr Pro
195 200 205
Ser Phe Phe Gly Gly Ser Ala Trp Gln Lys Asp Ala Lys Ser Gly Gln
210 215 220
Tyr Tyr Leu His Tyr Phe Ala Arg Gln Gln Pro Asp Leu Asn Trp Asp
225 230 235 240
Asn Pro Lys Val Arg Glu Asp Leu Tyr Ala Met Leu Arg Phe Trp Leu
245 250 255
Asp Lys Gly Val Ser Gly Met Arg Phe Asp Thr Val Ala Thr Tyr Ser
260 265 270
Lys Ile Pro Gly Phe Pro Asn Leu Thr Pro Glu Gln Gln Lys Asn Phe
275 280 285
Ala Glu Gln Tyr Thr Met Gly Pro Asn Ile His Arg Tyr Ile Gln Glu
290 295 300
Met Asn Arg Lys Val Leu Ser Arg Tyr Asp Val Ala Thr Ala Gly Glu
305 310 315 320
Ile Phe Gly Val Pro Leu Asp Arg Ser Ser Gln Phe Phe Asp Arg Arg
325 330 335
Arg His Glu Leu Asn Met Ala Phe Met Phe Asp Leu Ile Arg Leu Asp
340 345 350
Arg Asp Ser Asn Glu Arg Trp Arg His Lys Ser Trp Ser Leu Ser Gln
355 360 365
Phe Arg Gln Ile Ile Ser Lys Met Asp Val Thr Val Gly Lys Tyr Gly
370 375 380
Trp Asn Thr Phe Phe Leu Asp Asn His Asp Asn Pro Arg Ala Val Ser
385 390 395 400
His Phe Gly Asp Asp Arg Pro Gln Trp Arg Glu Ala Ser Ala Lys Ala
405 410 415
Leu Ala Thr Ile Thr Leu Thr Gln Arg Ala Thr Pro Phe Ile Tyr Gln
420 425 430
Gly Ser Glu Leu Gly Met Thr Asn Tyr Pro Phe Arg Gln Leu Asn Glu
435 440 445
Phe Asp Asp Ile Glu Val Lys Gly Phe Trp Gln Asp Tyr Val Gln Ser
450 455 460
Gly Lys Val Thr Ala Thr Glu Phe Leu Asp Asn Val Arg Leu Thr Ser
465 470 475 480
Arg Asp Asn Ser Arg Thr Pro Phe Gln Trp Asn Asp Thr Leu Asn Ala
485 490 495
Gly Phe Thr Arg Gly Lys Pro Trp Phe His Ile Asn Pro Asn Tyr Val
500 505 510
Glu Ile Asn Ala Glu Arg Glu Glu Thr Arg Glu Asp Ser Val Leu Asn
515 520 525
Tyr Tyr Lys Lys Met Ile Gln Leu Arg His His Ile Pro Ala Leu Val
530 535 540
Tyr Gly Ala Tyr Gln Asp Leu Asn Pro Gln Asp Asn Thr Val Tyr Ala
545 550 555 560
Tyr Thr Arg Thr Leu Gly Asn Glu Arg Tyr Leu Val Val Val Asn Phe
565 570 575
Lys Glu Tyr Pro Val Arg Tyr Thr Leu Pro Ala Asn Asp Ala Ile Glu
580 585 590
Glu Val Val Ile Asp Thr Gln Gln Gln Ala Ala Ala Pro His Ser Thr
595 600 605
Ser Leu Ser Leu Ser Pro Trp Gln Ala Gly Val Tyr Lys Leu Arg
610 615 620
<210> 4
<211> 623
<212> PRT
<213> Artificial Sequence (Artificial Sequence)
<400> 4
Met Ala Pro Ser Leu Asn Gln Asp Ile His Val Gln Lys Glu Ser Glu
1 5 10 15
Tyr Pro Ala Trp Trp Lys Glu Ala Val Phe Tyr Gln Ile Tyr Pro Arg
20 25 30
Ser Phe Lys Asp Thr Asn Asp Asp Gly Ile Gly Asp Ile Arg Gly Ile
35 40 45
Ile Glu Lys Leu Asp Tyr Leu Lys Ser Leu Gly Ile Asp Ala Ile Trp
50 55 60
Ile Asn Pro His Tyr Asp Ser Pro Asn Thr Asp Asn Gly Tyr Asp Ile
65 70 75 80
Ser Asn Tyr Arg Gln Ile Met Lys Glu Tyr Gly Thr Met Glu Asp Phe
85 90 95
Asp Ser Leu Val Ala Glu Met Lys Lys Arg Asn Met Arg Leu Met Ile
100 105 110
Asp Val Val Ile Asn His Thr Ser Asp Gln His Pro Trp Phe Ile Gln
115 120 125
Ser Lys Ser Asp Lys Asn Asn Pro Tyr Arg Asp Tyr Tyr Phe Trp Arg
130 135 140
Asp Gly Lys Asp Asn Gln Pro Pro Asn Asn Tyr Pro Ser Phe Phe Gly
145 150 155 160
Gly Ser Ala Trp Gln Lys Asp Ala Lys Ser Gly Gln Tyr Tyr Leu His
165 170 175
Tyr Phe Ala Arg Gln Gln Pro Asp Leu Asn Trp Asp Asn Pro Lys Val
180 185 190
Arg Glu Asp Leu Tyr Ala Met Leu Arg Phe Trp Leu Asp Lys Gly Val
195 200 205
Ser Gly Met Arg Phe Asp Thr Val Ala Thr Tyr Ser Lys Ile Pro Gly
210 215 220
Phe Pro Asn Leu Thr Pro Glu Gln Gln Lys Asn Phe Ala Glu Gln Tyr
225 230 235 240
Thr Met Gly Pro Asn Ile His Arg Tyr Ile Gln Glu Met Asn Arg Lys
245 250 255
Val Leu Ser Arg Tyr Asp Val Ala Thr Ala Gly Glu Ile Phe Gly Val
260 265 270
Pro Leu Asp Arg Ser Ser Gln Phe Phe Asp Arg Arg Arg His Glu Leu
275 280 285
Asn Met Ala Phe Met Phe Asp Leu Ile Arg Leu Asp Arg Asp Ser Asn
290 295 300
Glu Arg Trp Arg His Lys Ser Trp Ser Leu Ser Gln Phe Arg Gln Ile
305 310 315 320
Ile Ser Lys Met Asp Val Thr Val Gly Lys Tyr Gly Trp Asn Thr Phe
325 330 335
Phe Leu Asp Asn His Asp Asn Pro Arg Ala Val Ser His Phe Gly Asp
340 345 350
Asp Arg Pro Gln Trp Arg Glu Ala Ser Ala Lys Ala Leu Ala Thr Ile
355 360 365
Thr Leu Thr Gln Arg Ala Thr Pro Phe Ile Tyr Gln Gly Ser Glu Leu
370 375 380
Gly Met Thr Asn Tyr Pro Phe Arg Gln Leu Asn Glu Phe Asp Asp Ile
385 390 395 400
Glu Val Lys Gly Phe Trp Gln Asp Tyr Val Gln Ser Gly Lys Val Thr
405 410 415
Ala Thr Glu Phe Leu Asp Asn Val Arg Leu Thr Ser Arg Asp Asn Ser
420 425 430
Arg Thr Pro Phe Gln Trp Asn Asp Thr Leu Asn Ala Gly Phe Thr Arg
435 440 445
Gly Lys Pro Trp Phe His Ile Asn Pro Asn Tyr Val Glu Ile Asn Ala
450 455 460
Glu Arg Glu Glu Thr Arg Glu Asp Ser Val Leu Asn Tyr Tyr Lys Lys
465 470 475 480
Met Ile Gln Leu Arg His His Ile Pro Ala Leu Val Tyr Gly Ala Tyr
485 490 495
Gln Asp Leu Asn Pro Gln Asp Asn Thr Val Tyr Ala Tyr Thr Arg Thr
500 505 510
Leu Gly Asn Glu Arg Tyr Leu Val Val Val Asn Phe Lys Glu Tyr Pro
515 520 525
Val Arg Tyr Thr Leu Pro Ala Asn Asp Ala Ile Glu Glu Val Val Ile
530 535 540
Asp Thr Gln Gln Gln Ala Ala Ala Pro His Ser Thr Ser Leu Ser Leu
545 550 555 560
Ser Pro Trp Gln Ala Gly Val Tyr Lys Leu Arg Gly Ser Ile Ile Asn
565 570 575
Glu Thr Ala Asp Asp Ile Val Tyr Arg Leu Thr Val Ile Ile Asp Asp
580 585 590
Arg Tyr Glu Ser Leu Lys Asn Leu Ile Thr Leu Arg Ala Asp Arg Leu
595 600 605
Glu Met Ile Ile Asn Asp Asn Val Ser Thr Ile Leu Ala Ser Ile
610 615 620
<210> 5
<211> 1713
<212> DNA
<213> Sucrose Isomerase (SI)
<400> 5
atggcaccat ccttgaatca ggatattcac gttcaaaagg aaagtgaata tcctgcatgg 60
tggaaagaag ctgtttttta tcagatctat cctcgctcat ttaaagacac caatgatgat 120
ggcattggcg atattcgcgg tattattgaa aagctggact atctgaaatc gctcggtatt 180
gacgctatct ggatcaatcc ccattacgac tctccgaaca ccgataacgg ctatgacatc 240
agtaattatc gtcagataat gaaagagtat ggcacaatgg aggattttga tagccttgtt 300
gccgaaatga aaaaacgaaa tatgcgctta atgatcgacg tggtcattaa ccataccagt 360
gatcaacacc cgtggtttat tcagagtaaa agcgataaaa acaaccctta tcgtgactat 420
tatttctggc gtgacggaaa agataatcag ccacctaata attacccctc atttttcggc 480
ggctcggcat ggcaaaaaga tgcaaagtca ggacagtact atttacacta ttttgccaga 540
cagcaacctg atctcaactg ggataacccg aaagtacgtg aggatcttta cgcaatgctc 600
cgcttctggc tggataaagg cgtttcaggc atgcgatttg atacggtggc aacttattcc 660
aaaatcccgg gatttcccaa tctgacacct gaacaacaga aaaattttgc tgaacaatac 720
accatggggc ctaatattca tcgatacatt caggaaatga accggaaagt tctgtcccgg 780
tatgatgtgg ccaccgcggg tgaaattttt ggcgtcccgc tggatcgttc gtcgcagttt 840
tttgatcgcc gccgacatga gctgaatatg gcgtttatgt ttgacctcat tcgtctcgat 900
cgcgacagca atgaacgctg gcgtcacaag tcgtggtcgc tctctcagtt ccgccagatc 960
atcagcaaaa tggatgtcac ggtcggaaag tatggctgga acacgttctt cttagataac 1020
catgacaacc cccgtgcggt atctcacttc ggggatgaca ggccgcaatg gcgggaggcg 1080
tcggctaagg cactggcgac gattaccctc actcagcggg cgacgccgtt tatttatcag 1140
ggttcagagc tgggaatgac taattatccc ttcaggcaac tcaacgaatt tgacgacatc 1200
gaggtcaaag gtttctggca ggattatgtc cagagtggaa aagtcacggc cacagagttt 1260
ctcgataatg tgcgcctgac gagccgcgat aacagcagaa cacctttcca gtggaatgac 1320
accctgaatg ctggttttac tcgcggaaag ccgtggtttc acatcaaccc aaactatgtg 1380
gagatcaacg ccgaacgcga agaaacccgc gaagattcag tgctgaatta ctataaaaaa 1440
atgattcagc tacgccacca tatccctgct ctggtatatg gcgcctatca ggatcttaat 1500
ccacaggaca ataccgttta tgcctatacc cgaacgctgg gtaacgagcg ttatctggtc 1560
gtggtgaact ttaaggagta cccggtccgc tatactctcc cggctaatga tgccatcgag 1620
gaagtggtca ttgatactca gcagcaggcg gctgcgccgc acagcacatc cctgtcattg 1680
agcccctggc aggcaggtgt gtataagctg cgg 1713
<210> 6
<211> 153
<212> DNA
<213> coiled coil Domain (Tdot)
<400> 6
atgatcatta acgaaactgc cgatgacatc gtttatcgcc tgacagtcat tatcgatgat 60
cgctacgaat cgctgaaaaa cctgattacc ttacgtgcag atcgcttgga gatgatcatc 120
aatgacaatg tgtccaccat tctcgcgagc att 153
<210> 7
<211> 37
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 7
cgcggatccg caccatcctt gaatcaggat attcacg 37
<210> 8
<211> 37
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 8
cgcggatccg caccatcctt gaatcaggat attcacg 37
<210> 9
<211> 55
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 9
caggcaggtg tgtataagct gcggggatcc atcatcatta acgaaactgc cgatg 55
<210> 10
<211> 46
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 10
tcagtggtgg tggtggtggt gttaaatgct cgcgagaatg gtggac 46
<210> 11
<211> 20
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 11
<210> 12
<211> 19
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 12
gctagttatt gctcagcgg 19
<210> 13
<211> 6968
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 13
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980
aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040
ttttgtttaa ctttaagaag gagatataca tatgcatcat catcatcatc acgcaccatc 5100
cttgaatcag gatattcacg ttcaaaagga aagtgaatat cctgcatggt ggaaagaagc 5160
tgttttttat cagatctatc ctcgctcatt taaagacacc aatgatgatg gcattggcga 5220
tattcgcggt attattgaaa agctggacta tctgaaatcg ctcggtattg acgctatctg 5280
gatcaatccc cattacgact ctccgaacac cgataacggc tatgacatca gtaattatcg 5340
tcagataatg aaagagtatg gcacaatgga ggattttgat agccttgttg ccgaaatgaa 5400
aaaacgaaat atgcgcttaa tgatcgacgt ggtcattaac cataccagtg atcaacaccc 5460
gtggtttatt cagagtaaaa gcgataaaaa caacccttat cgtgactatt atttctggcg 5520
tgacggaaaa gataatcagc cacctaataa ttacccctca tttttcggcg gctcggcatg 5580
gcaaaaagat gcaaagtcag gacagtacta tttacactat tttgccagac agcaacctga 5640
tctcaactgg gataacccga aagtacgtga ggatctttac gcaatgctcc gcttctggct 5700
ggataaaggc gtttcaggca tgcgatttga tacggtggca acttattcca aaatcccggg 5760
atttcccaat ctgacacctg aacaacagaa aaattttgct gaacaataca ccatggggcc 5820
taatattcat cgatacattc aggaaatgaa ccggaaagtt ctgtcccggt atgatgtggc 5880
caccgcgggt gaaatttttg gcgtcccgct ggatcgttcg tcgcagtttt ttgatcgccg 5940
ccgacatgag ctgaatatgg cgtttatgtt tgacctcatt cgtctcgatc gcgacagcaa 6000
tgaacgctgg cgtcacaagt cgtggtcgct ctctcagttc cgccagatca tcagcaaaat 6060
ggatgtcacg gtcggaaagt atggctggaa cacgttcttc ttagataacc atgacaaccc 6120
ccgtgcggta tctcacttcg gggatgacag gccgcaatgg cgggaggcgt cggctaaggc 6180
actggcgacg attaccctca ctcagcgggc gacgccgttt atttatcagg gttcagagct 6240
gggaatgact aattatccct tcaggcaact caacgaattt gacgacatcg aggtcaaagg 6300
tttctggcag gattatgtcc agagtggaaa agtcacggcc acagagtttc tcgataatgt 6360
gcgcctgacg agccgcgata acagcagaac acctttccag tggaatgaca ccctgaatgc 6420
tggttttact cgcggaaagc cgtggtttca catcaaccca aactatgtgg agatcaacgc 6480
cgaacgcgaa gaaacccgcg aagattcagt gctgaattac tataaaaaaa tgattcagct 6540
acgccaccat atccctgctc tggtatatgg cgcctatcag gatcttaatc cacaggacaa 6600
taccgtttat gcctataccc gaacgctggg taacgagcgt tatctggtcg tggtgaactt 6660
taaggagtac ccggtccgct atactctccc ggctaatgat gccatcgagg aagtggtcat 6720
tgatactcag cagcaggcgg ctgcgccgca cagcacatcc ctgtcattga gcccctggca 6780
ggcaggtgtg tataagctgc ggtaactcga gcaccaccac caccaccact gagatccggc 6840
tgctaacaaa gcccgaaagg aagctgagtt ggctgctgcc accgctgagc aataactagc 6900
ataacccctt ggggcctcta aacgggtctt gaggggtttt ttgctgaaag gaggaactat 6960
atccggat 6968
<210> 14
<211> 6947
<212> DNA
<213> Artificial Sequence (Artificial Sequence)
<400> 14
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 60
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 180
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 2760
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2880
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 3780
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac accaccacgc 4320
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 4740
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4800
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4860
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980
aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040
ttttgtttaa ctttaagaag gagatataca tatggcacca tccttgaatc aggatattca 5100
cgttcaaaag gaaagtgaat atcctgcatg gtggaaagaa gctgtttttt atcagatcta 5160
tcctcgctca tttaaagaca ccaatgatga tggcattggc gatattcgcg gtattattga 5220
aaagctggac tatctgaaat cgctcggtat tgacgctatc tggatcaatc cccattacga 5280
ctctccgaac accgataacg gctatgacat cagtaattat cgtcagataa tgaaagagta 5340
tggcacaatg gaggattttg atagccttgt tgccgaaatg aaaaaacgaa atatgcgctt 5400
aatgatcgac gtggtcatta accataccag tgatcaacac ccgtggttta ttcagagtaa 5460
aagcgataaa aacaaccctt atcgtgacta ttatttctgg cgtgacggaa aagataatca 5520
gccacctaat aattacccct catttttcgg cggctcggca tggcaaaaag atgcaaagtc 5580
aggacagtac tatttacact attttgccag acagcaacct gatctcaact gggataaccc 5640
gaaagtacgt gaggatcttt acgcaatgct ccgcttctgg ctggataaag gcgtttcagg 5700
catgcgattt gatacggtgg caacttattc caaaatcccg ggatttccca atctgacacc 5760
tgaacaacag aaaaattttg ctgaacaata caccatgggg cctaatattc atcgatacat 5820
tcaggaaatg aaccggaaag ttctgtcccg gtatgatgtg gccaccgcgg gtgaaatttt 5880
tggcgtcccg ctggatcgtt cgtcgcagtt ttttgatcgc cgccgacatg agctgaatat 5940
ggcgtttatg tttgacctca ttcgtctcga tcgcgacagc aatgaacgct ggcgtcacaa 6000
gtcgtggtcg ctctctcagt tccgccagat catcagcaaa atggatgtca cggtcggaaa 6060
gtatggctgg aacacgttct tcttagataa ccatgacaac ccccgtgcgg tatctcactt 6120
cggggatgac aggccgcaat ggcgggaggc gtcggctaag gcactggcga cgattaccct 6180
cactcagcgg gcgacgccgt ttatttatca gggttcagag ctgggaatga ctaattatcc 6240
cttcaggcaa ctcaacgaat ttgacgacat cgaggtcaaa ggtttctggc aggattatgt 6300
ccagagtgga aaagtcacgg ccacagagtt tctcgataat gtgcgcctga cgagccgcga 6360
taacagcaga acacctttcc agtggaatga caccctgaat gctggtttta ctcgcggaaa 6420
gccgtggttt cacatcaacc caaactatgt ggagatcaac gccgaacgcg aagaaacccg 6480
cgaagattca gtgctgaatt actataaaaa aatgattcag ctacgccacc atatccctgc 6540
tctggtatat ggcgcctatc aggatcttaa tccacaggac aataccgttt atgcctatac 6600
ccgaacgctg ggtaacgagc gttatctggt cgtggtgaac tttaaggagt acccggtccg 6660
ctatactctc ccggctaatg atgccatcga ggaagtggtc attgatactc agcagcaggc 6720
ggctgcgccg cacagcacat ccctgtcatt gagcccctgg caggcaggtg tgtataagct 6780
gcggctcgag caccaccacc accaccactg agatccggct gctaacaaag cccgaaagga 6840
agctgagttg gctgctgcca ccgctgagca ataactagca taaccccttg gggcctctaa 6900
acgggtcttg aggggttttt tgctgaaagg aggaactata tccggat 6947
Claims (9)
1. A gene is characterized in that the nucleotide sequence is shown as SEQ ID NO. 1.
2. A gene is characterized in that the nucleotide sequence is shown as SEQ ID NO. 2.
3. The fusion protein encoded by the gene of claim 1 and comprising TdoT-SI, wherein the amino acid sequence is set forth in SEQ ID No. 3.
4. The fusion protein encoded by the gene of claim 2 and comprising SI-Tdot, wherein the amino acid sequence is set forth in SEQ ID No. 4.
5. A recombinant engineered bacterium containing the gene of claim 1.
6. A recombinant engineered bacterium containing the gene of claim 2.
7. An inclusion body of sucrose isomerase activity, which is obtained by inducing the expression of fusion protein by the recombinant engineering bacteria of claim 5 or 6, breaking, separating and purifying the bacteria.
8. Use of an inclusion body of sucrose isomerase activity for catalysing the conversion of sucrose to isomaltulose.
9. The use according to claim 8, wherein the sucrose isomerase activity comprises a sucrose isomerase activity that catalyzes the conversion of sucrose into isomaltulose at a pH of 4.5 to 6.5 and a temperature of 30 to 45 ℃.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210142279.1A CN114703211A (en) | 2022-02-16 | 2022-02-16 | Sucrose isomerase activity inclusion body fused with coiled coil structure domain |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210142279.1A CN114703211A (en) | 2022-02-16 | 2022-02-16 | Sucrose isomerase activity inclusion body fused with coiled coil structure domain |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114703211A true CN114703211A (en) | 2022-07-05 |
Family
ID=82166834
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210142279.1A Pending CN114703211A (en) | 2022-02-16 | 2022-02-16 | Sucrose isomerase activity inclusion body fused with coiled coil structure domain |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114703211A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103501637A (en) * | 2011-05-05 | 2014-01-08 | 赢创工业集团股份有限公司 | Process for preparing isomaltulose from plant juices |
CN109477113A (en) * | 2015-07-24 | 2019-03-15 | 爱沃克斯技术有限责任公司 | Catalytically active protein aggregates and method for the production thereof |
CN113789338A (en) * | 2021-09-15 | 2021-12-14 | 李宪臻 | Catalase activity inclusion body and application |
-
2022
- 2022-02-16 CN CN202210142279.1A patent/CN114703211A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103501637A (en) * | 2011-05-05 | 2014-01-08 | 赢创工业集团股份有限公司 | Process for preparing isomaltulose from plant juices |
CN109477113A (en) * | 2015-07-24 | 2019-03-15 | 爱沃克斯技术有限责任公司 | Catalytically active protein aggregates and method for the production thereof |
CN113789338A (en) * | 2021-09-15 | 2021-12-14 | 李宪臻 | Catalase activity inclusion body and application |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111454978B (en) | Surface display engineering bacterium for specifically adsorbing heavy metal lead and construction method and application thereof | |
CN106676051B (en) | It is a kind of to prepare the method and its application for efficiently synthesizing pantothenic acid genetic engineering bacterium | |
CN111304232B (en) | Method for purifying protein based on membrane surface fusion expression strategy and application thereof | |
CN111850007B (en) | Cellulosobody docking protein combination mutant 36864 applicable to low calcium ion concentration and application | |
CN114774452B (en) | Construction method and application of engineering escherichia coli for adsorbing mercury ions in solution | |
CN110257356B (en) | Enzyme capable of being used for synthesizing carnosine and coding gene thereof | |
CN111848758B (en) | Cellulosome docking protein mutant suitable for low calcium ion concentration and application | |
CN112481280A (en) | Method for preparing rare ginsenoside CK by gene combination transformation and application | |
CN114875004B (en) | High-stereoselectivity R-transketolase mutant and encoding gene and application thereof | |
CN112592877B (en) | Recombinant escherichia coli for over-expressing lsrC gene and construction method and application thereof | |
CN112481282B (en) | Carbohydrate binding module CBM6B protein capable of specifically recognizing xanthan gum side chain and application thereof | |
CN113151214B (en) | Protein PnlipA with lipase activity and gene and application thereof | |
CN111848757B (en) | Cellulosome docking protein combined mutant 36862 suitable for low calcium ion concentration and application | |
CN114703211A (en) | Sucrose isomerase activity inclusion body fused with coiled coil structure domain | |
CN113755460B (en) | Flavone reductase for preparing dihydroquercetin | |
CN113122561B (en) | Expression vector of membrane protein SohB and expression and purification method thereof | |
CN113122558B (en) | Expression vector of membrane protein AmpG and expression and purification method thereof | |
CN114591985B (en) | Mutant pectin lyase and application thereof | |
CN113337491B (en) | Structural domain for improving high-temperature resistance stability of keratinase and application thereof | |
CN112410361B (en) | Method for producing candida antarctica lipase B and specific DNA molecule used by method | |
CN111850006B (en) | Cellulosome docking protein combined mutant 36865 suitable for low calcium ion concentration and application | |
CN111850004B (en) | Cellulosomal dockerin mutant 36740 with improved activity and application thereof | |
CN113355304B (en) | Protein CpoC with zearalenone degrading enzyme activity and gene and application thereof | |
CN111848759B (en) | Cellulosomal dockerin mutant 36741 with improved activity and application thereof | |
CN113122557A (en) | Expression vector of membrane protein AmtB and expression purification method thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |