CN101600803A - 编码用于控制寄生线虫的截短的蔗糖异构酶多肽的多核苷酸 - Google Patents
编码用于控制寄生线虫的截短的蔗糖异构酶多肽的多核苷酸 Download PDFInfo
- Publication number
- CN101600803A CN101600803A CNA2008800039057A CN200880003905A CN101600803A CN 101600803 A CN101600803 A CN 101600803A CN A2008800039057 A CNA2008800039057 A CN A2008800039057A CN 200880003905 A CN200880003905 A CN 200880003905A CN 101600803 A CN101600803 A CN 101600803A
- Authority
- CN
- China
- Prior art keywords
- polynucleotide
- sequence
- asp
- plant
- seq
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108091033319 polynucleotide Proteins 0.000 title claims abstract description 119
- 102000040430 polynucleotide Human genes 0.000 title claims abstract description 119
- 239000002157 polynucleotide Substances 0.000 title claims abstract description 119
- 108010047540 sucrose isomerase Proteins 0.000 title claims abstract description 90
- 241000244206 Nematoda Species 0.000 title claims abstract description 88
- 108090000765 processed proteins & peptides Proteins 0.000 title claims abstract description 68
- 229920001184 polypeptide Polymers 0.000 title claims abstract description 66
- 102000004196 processed proteins & peptides Human genes 0.000 title claims abstract description 66
- 230000003071 parasitic effect Effects 0.000 title description 19
- 230000009261 transgenic effect Effects 0.000 claims abstract description 53
- 238000000034 method Methods 0.000 claims abstract description 41
- 239000013604 expression vector Substances 0.000 claims abstract description 21
- 241000196324 Embryophyta Species 0.000 claims description 241
- 239000002773 nucleotide Substances 0.000 claims description 25
- 244000068988 Glycine max Species 0.000 claims description 24
- 235000010469 Glycine max Nutrition 0.000 claims description 20
- 230000000694 effects Effects 0.000 claims description 15
- 235000021307 Triticum Nutrition 0.000 claims description 13
- 240000008042 Zea mays Species 0.000 claims description 13
- 244000075850 Avena orientalis Species 0.000 claims description 11
- 244000061456 Solanum tuberosum Species 0.000 claims description 11
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 claims description 11
- 235000002017 Zea mays subsp mays Nutrition 0.000 claims description 11
- 235000005822 corn Nutrition 0.000 claims description 11
- 230000002255 enzymatic effect Effects 0.000 claims description 11
- 244000098338 Triticum aestivum Species 0.000 claims description 10
- 239000005645 nematicide Substances 0.000 claims description 10
- 240000007594 Oryza sativa Species 0.000 claims description 9
- 235000007164 Oryza sativa Nutrition 0.000 claims description 9
- 235000002595 Solanum tuberosum Nutrition 0.000 claims description 9
- 240000005979 Hordeum vulgare Species 0.000 claims description 8
- 235000009566 rice Nutrition 0.000 claims description 8
- 235000002566 Capsicum Nutrition 0.000 claims description 7
- 241000219146 Gossypium Species 0.000 claims description 7
- 235000007340 Hordeum vulgare Nutrition 0.000 claims description 7
- 241000209510 Liliopsida Species 0.000 claims description 7
- 240000003768 Solanum lycopersicum Species 0.000 claims description 7
- 244000046109 Sorghum vulgare var. nervosum Species 0.000 claims description 7
- 241001233957 eudicotyledons Species 0.000 claims description 7
- 235000016068 Berberis vulgaris Nutrition 0.000 claims description 6
- 241000335053 Beta vulgaris Species 0.000 claims description 6
- 241000209082 Lolium Species 0.000 claims description 6
- 235000007688 Lycopersicon esculentum Nutrition 0.000 claims description 6
- 235000002637 Nicotiana tabacum Nutrition 0.000 claims description 6
- 244000082988 Secale cereale Species 0.000 claims description 6
- 241000219793 Trifolium Species 0.000 claims description 6
- 235000007319 Avena orientalis Nutrition 0.000 claims description 5
- 240000004713 Pisum sativum Species 0.000 claims description 5
- 235000007238 Secale cereale Nutrition 0.000 claims description 5
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 claims description 4
- 240000003259 Brassica oleracea var. botrytis Species 0.000 claims description 4
- 240000007087 Apium graveolens Species 0.000 claims description 2
- 235000015849 Apium graveolens Dulce Group Nutrition 0.000 claims description 2
- 235000010591 Appio Nutrition 0.000 claims description 2
- 229920000742 Cotton Polymers 0.000 claims description 2
- 235000003228 Lactuca sativa Nutrition 0.000 claims description 2
- 240000008415 Lactuca sativa Species 0.000 claims description 2
- 239000006002 Pepper Substances 0.000 claims description 2
- 235000016761 Piper aduncum Nutrition 0.000 claims description 2
- 235000017804 Piper guineense Nutrition 0.000 claims description 2
- 235000008184 Piper nigrum Nutrition 0.000 claims description 2
- 235000007558 Avena sp Nutrition 0.000 claims 1
- 244000291473 Musa acuminata Species 0.000 claims 1
- 235000018290 Musa x paradisiaca Nutrition 0.000 claims 1
- 244000061176 Nicotiana tabacum Species 0.000 claims 1
- 244000203593 Piper nigrum Species 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 description 56
- 241001442497 Globodera rostochiensis Species 0.000 description 44
- 210000004027 cell Anatomy 0.000 description 39
- 108020004414 DNA Proteins 0.000 description 38
- 241000894007 species Species 0.000 description 38
- 241000588698 Erwinia Species 0.000 description 28
- 210000001519 tissue Anatomy 0.000 description 25
- 206010011732 Cyst Diseases 0.000 description 22
- 241000243785 Meloidogyne javanica Species 0.000 description 22
- 208000031513 cyst Diseases 0.000 description 22
- 239000013612 plasmid Substances 0.000 description 20
- 230000001939 inductive effect Effects 0.000 description 16
- 125000003729 nucleotide group Chemical group 0.000 description 16
- 239000013598 vector Substances 0.000 description 16
- 241001480224 Heterodera Species 0.000 description 15
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 15
- 108010037850 glycylvaline Proteins 0.000 description 15
- 150000007523 nucleic acids Chemical class 0.000 description 15
- 230000009466 transformation Effects 0.000 description 15
- 150000001413 amino acids Chemical group 0.000 description 14
- 235000018102 proteins Nutrition 0.000 description 14
- 102000004169 proteins and genes Human genes 0.000 description 14
- 230000001105 regulatory effect Effects 0.000 description 14
- 238000006243 chemical reaction Methods 0.000 description 13
- 102000039446 nucleic acids Human genes 0.000 description 13
- 108020004707 nucleic acids Proteins 0.000 description 13
- 235000013311 vegetables Nutrition 0.000 description 13
- PVXPPJIGRGXGCY-DJHAAKORSA-N 6-O-alpha-D-glucopyranosyl-alpha-D-fructofuranose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1OC[C@@H]1[C@@H](O)[C@H](O)[C@](O)(CO)O1 PVXPPJIGRGXGCY-DJHAAKORSA-N 0.000 description 11
- 241000607720 Serratia Species 0.000 description 11
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 10
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 10
- 241000208125 Nicotiana Species 0.000 description 10
- 229930006000 Sucrose Natural products 0.000 description 10
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 10
- 108010047495 alanylglycine Proteins 0.000 description 10
- 108010078144 glutaminyl-glycine Proteins 0.000 description 10
- 239000000523 sample Substances 0.000 description 10
- 239000002689 soil Substances 0.000 description 10
- 239000005720 sucrose Substances 0.000 description 10
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 9
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 9
- 241000219193 Brassicaceae Species 0.000 description 9
- MRVZCDSYLJXKKX-ACRUOGEOSA-N His-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N MRVZCDSYLJXKKX-ACRUOGEOSA-N 0.000 description 9
- 208000000291 Nematode infections Diseases 0.000 description 9
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 9
- 210000000056 organ Anatomy 0.000 description 9
- 108010073969 valyllysine Proteins 0.000 description 9
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 8
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 8
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 8
- 102000002322 Egg Proteins Human genes 0.000 description 8
- 108010000912 Egg Proteins Proteins 0.000 description 8
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 8
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 8
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 8
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 8
- 241000219071 Malvaceae Species 0.000 description 8
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 8
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 8
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 8
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 8
- 108010079547 glutamylmethionine Proteins 0.000 description 8
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 8
- 210000004681 ovum Anatomy 0.000 description 8
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- 108010020532 tyrosyl-proline Proteins 0.000 description 8
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 7
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 7
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 7
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 7
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 7
- 241000498254 Heterodera glycines Species 0.000 description 7
- 241000238631 Hexapoda Species 0.000 description 7
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 7
- 241000208202 Linaceae Species 0.000 description 7
- 241000219823 Medicago Species 0.000 description 7
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 7
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 7
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 7
- AOKZOUGUMLBPSS-PMVMPFDFSA-N Phe-Trp-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O AOKZOUGUMLBPSS-PMVMPFDFSA-N 0.000 description 7
- 241000208292 Solanaceae Species 0.000 description 7
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 7
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 7
- 235000001014 amino acid Nutrition 0.000 description 7
- 238000004166 bioassay Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 108010050848 glycylleucine Proteins 0.000 description 7
- 239000003550 marker Substances 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 241000589158 Agrobacterium Species 0.000 description 6
- 244000105624 Arachis hypogaea Species 0.000 description 6
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 6
- QXOPPIDJKPEKCW-GUBZILKMSA-N Asn-Pro-Arg Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O QXOPPIDJKPEKCW-GUBZILKMSA-N 0.000 description 6
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 6
- 235000009438 Gossypium Nutrition 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 6
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 6
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 6
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 6
- PBLLTSKBTAHDNA-KBPBESRZSA-N Lys-Gly-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PBLLTSKBTAHDNA-KBPBESRZSA-N 0.000 description 6
- 241000220225 Malus Species 0.000 description 6
- 240000003183 Manihot esculenta Species 0.000 description 6
- 241001143352 Meloidogyne Species 0.000 description 6
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 6
- PZUUMQPMHBJJKE-AVGNSLFASA-N Met-Leu-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N PZUUMQPMHBJJKE-AVGNSLFASA-N 0.000 description 6
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 6
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 6
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 6
- 240000004980 Rheum officinale Species 0.000 description 6
- 235000008081 Rheum officinale Nutrition 0.000 description 6
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 6
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 6
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 6
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 6
- PLXQRTXVLZUNMU-RNXOBYDBSA-N Tyr-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CC4=CC=C(C=C4)O)N PLXQRTXVLZUNMU-RNXOBYDBSA-N 0.000 description 6
- ITDWWLTTWRRLCC-KJEVXHAQSA-N Tyr-Thr-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ITDWWLTTWRRLCC-KJEVXHAQSA-N 0.000 description 6
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 6
- 108010013835 arginine glutamate Proteins 0.000 description 6
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 6
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 244000038559 crop plants Species 0.000 description 6
- 235000013399 edible fruits Nutrition 0.000 description 6
- 108010084389 glycyltryptophan Proteins 0.000 description 6
- 108010017391 lysylvaline Proteins 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 108010061238 threonyl-glycine Proteins 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 5
- IEAUDUOCWNPZBR-LKTVYLICSA-N Ala-Trp-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IEAUDUOCWNPZBR-LKTVYLICSA-N 0.000 description 5
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 5
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 5
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 5
- 235000005781 Avena Nutrition 0.000 description 5
- 240000008574 Capsicum frutescens Species 0.000 description 5
- 241001672694 Citrus reticulata Species 0.000 description 5
- 241000234642 Festuca Species 0.000 description 5
- 241000220223 Fragaria Species 0.000 description 5
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 5
- MWWOPNQSBXEUHO-ULQDDVLXSA-N His-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 MWWOPNQSBXEUHO-ULQDDVLXSA-N 0.000 description 5
- 241000209219 Hordeum Species 0.000 description 5
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 5
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 5
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 5
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 5
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 5
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 5
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 5
- 241000209094 Oryza Species 0.000 description 5
- 241000209046 Pennisetum Species 0.000 description 5
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 5
- YSUZKYSRAFNLRB-ULQDDVLXSA-N Pro-Gln-Trp Chemical compound N([C@@H](CCC(=O)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C(=O)[C@@H]1CCCN1 YSUZKYSRAFNLRB-ULQDDVLXSA-N 0.000 description 5
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 5
- 241000209056 Secale Species 0.000 description 5
- 235000002634 Solanum Nutrition 0.000 description 5
- 241000207763 Solanum Species 0.000 description 5
- 244000062793 Sorghum vulgare Species 0.000 description 5
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 5
- 241000209140 Triticum Species 0.000 description 5
- BXKWZPXTTSCOMX-AQZXSJQPSA-N Trp-Asn-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BXKWZPXTTSCOMX-AQZXSJQPSA-N 0.000 description 5
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 5
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 5
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 5
- 241000209149 Zea Species 0.000 description 5
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 5
- 108010068380 arginylarginine Proteins 0.000 description 5
- 108010060035 arginylproline Proteins 0.000 description 5
- 239000001390 capsicum minimum Substances 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 230000002068 genetic effect Effects 0.000 description 5
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 5
- 208000015181 infectious disease Diseases 0.000 description 5
- 210000001161 mammalian embryo Anatomy 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 108010029020 prolylglycine Proteins 0.000 description 5
- 241000228158 x Triticosecale Species 0.000 description 5
- 241001075517 Abelmoschus Species 0.000 description 4
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 4
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 4
- DPNHSNLIULPOBH-GUBZILKMSA-N Arg-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DPNHSNLIULPOBH-GUBZILKMSA-N 0.000 description 4
- BIGRHVNFFJTHEB-UBHSHLNASA-N Asn-Trp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O BIGRHVNFFJTHEB-UBHSHLNASA-N 0.000 description 4
- XLDMSQYOYXINSZ-QXEWZRGKSA-N Asn-Val-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLDMSQYOYXINSZ-QXEWZRGKSA-N 0.000 description 4
- 244000025254 Cannabis sativa Species 0.000 description 4
- 241000871189 Chenopodiaceae Species 0.000 description 4
- 241000207199 Citrus Species 0.000 description 4
- 240000007154 Coffea arabica Species 0.000 description 4
- 241000207782 Convolvulaceae Species 0.000 description 4
- 241000219112 Cucumis Species 0.000 description 4
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 4
- 241000219122 Cucurbita Species 0.000 description 4
- 241000221017 Euphorbiaceae Species 0.000 description 4
- 241000220485 Fabaceae Species 0.000 description 4
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 4
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 4
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 4
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 4
- 239000004471 Glycine Substances 0.000 description 4
- 240000000047 Gossypium barbadense Species 0.000 description 4
- 241000208818 Helianthus Species 0.000 description 4
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 4
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 4
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 4
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 4
- 241000208204 Linum Species 0.000 description 4
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 4
- 241000219833 Phaseolus Species 0.000 description 4
- 241000219843 Pisum Species 0.000 description 4
- GNFHQWNCSSPOBT-ULQDDVLXSA-N Pro-Trp-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCC(=O)N)C(=O)O GNFHQWNCSSPOBT-ULQDDVLXSA-N 0.000 description 4
- 235000004789 Rosa xanthina Nutrition 0.000 description 4
- 241000220222 Rosaceae Species 0.000 description 4
- 241001107098 Rubiaceae Species 0.000 description 4
- 241000209051 Saccharum Species 0.000 description 4
- 240000000111 Saccharum officinarum Species 0.000 description 4
- 235000005775 Setaria Nutrition 0.000 description 4
- 241000232088 Setaria <nematode> Species 0.000 description 4
- 241001122767 Theaceae Species 0.000 description 4
- 235000019714 Triticale Nutrition 0.000 description 4
- 108010005233 alanylglutamic acid Proteins 0.000 description 4
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 4
- 235000020971 citrus fruits Nutrition 0.000 description 4
- 235000013305 food Nutrition 0.000 description 4
- 239000012634 fragment Substances 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 238000009396 hybridization Methods 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 235000021374 legumes Nutrition 0.000 description 4
- 108010057821 leucylproline Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 235000021251 pulses Nutrition 0.000 description 4
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 4
- 230000031068 symbiosis, encompassing mutualism through parasitism Effects 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 3
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 3
- 108700028369 Alleles Proteins 0.000 description 3
- 241000207875 Antirrhinum Species 0.000 description 3
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 3
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 3
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 3
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 3
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- 240000002791 Brassica napus Species 0.000 description 3
- 244000045195 Cicer arietinum Species 0.000 description 3
- 240000001980 Cucurbita pepo Species 0.000 description 3
- 244000078127 Eleusine coracana Species 0.000 description 3
- OFPWCBGRYAOLMU-AVGNSLFASA-N Gln-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OFPWCBGRYAOLMU-AVGNSLFASA-N 0.000 description 3
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 3
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 3
- SYDJILXOZNEEDK-XIRDDKMYSA-N Glu-Arg-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SYDJILXOZNEEDK-XIRDDKMYSA-N 0.000 description 3
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 3
- 108010068370 Glutens Proteins 0.000 description 3
- XVYKMNXXJXQKME-XEGUGMAKSA-N Gly-Ile-Tyr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XVYKMNXXJXQKME-XEGUGMAKSA-N 0.000 description 3
- 235000009429 Gossypium barbadense Nutrition 0.000 description 3
- 244000299507 Gossypium hirsutum Species 0.000 description 3
- 206010020649 Hyperkeratosis Diseases 0.000 description 3
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 3
- 241000207783 Ipomoea Species 0.000 description 3
- 240000004322 Lens culinaris Species 0.000 description 3
- 235000010666 Lens esculenta Nutrition 0.000 description 3
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 3
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 3
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 3
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 3
- 235000016735 Manihot esculenta subsp esculenta Nutrition 0.000 description 3
- 241000243784 Meloidogyne arenaria Species 0.000 description 3
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 3
- 244000046052 Phaseolus vulgaris Species 0.000 description 3
- 241000209504 Poaceae Species 0.000 description 3
- 241000193945 Pratylenchidae Species 0.000 description 3
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 3
- 241000220324 Pyrus Species 0.000 description 3
- 235000007201 Saccharum officinarum Nutrition 0.000 description 3
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 3
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 3
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 3
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 3
- 240000006394 Sorghum bicolor Species 0.000 description 3
- 235000011684 Sorghum saccharatum Nutrition 0.000 description 3
- 244000269722 Thea sinensis Species 0.000 description 3
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 3
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 3
- 241000196508 Turbatrix Species 0.000 description 3
- 241000243782 Tylenchida Species 0.000 description 3
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 3
- 240000006677 Vicia faba Species 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 108010011559 alanylphenylalanine Proteins 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 108010062796 arginyllysine Proteins 0.000 description 3
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 3
- 108010093581 aspartyl-proline Proteins 0.000 description 3
- 239000003795 chemical substances by application Substances 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 230000000875 corresponding effect Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 230000018109 developmental process Effects 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 230000012010 growth Effects 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 235000019713 millet Nutrition 0.000 description 3
- 235000016709 nutrition Nutrition 0.000 description 3
- 230000035764 nutrition Effects 0.000 description 3
- 108010024607 phenylalanylalanine Proteins 0.000 description 3
- 238000002360 preparation method Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000000644 propagated effect Effects 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 2
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 2
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 2
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 2
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 2
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 2
- JPOQZCHGOTWRTM-FQPOAREZSA-N Ala-Tyr-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPOQZCHGOTWRTM-FQPOAREZSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- 241000234282 Allium Species 0.000 description 2
- 235000003840 Amygdalus nana Nutrition 0.000 description 2
- 235000003911 Arachis Nutrition 0.000 description 2
- 235000017060 Arachis glabrata Nutrition 0.000 description 2
- 235000010777 Arachis hypogaea Nutrition 0.000 description 2
- 235000018262 Arachis monticola Nutrition 0.000 description 2
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 2
- PHHRSPBBQUFULD-UWVGGRQHSA-N Arg-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N PHHRSPBBQUFULD-UWVGGRQHSA-N 0.000 description 2
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 2
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 2
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 2
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 2
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 2
- PPMTUXJSQDNUDE-CIUDSAMLSA-N Asn-Glu-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PPMTUXJSQDNUDE-CIUDSAMLSA-N 0.000 description 2
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 2
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 2
- XACXDSRQIXRMNS-OLHMAJIHSA-N Asp-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)O XACXDSRQIXRMNS-OLHMAJIHSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- LEYKQPDPZJIRTA-AQZXSJQPSA-N Asp-Trp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LEYKQPDPZJIRTA-AQZXSJQPSA-N 0.000 description 2
- 235000005340 Asparagus officinalis Nutrition 0.000 description 2
- 241000208838 Asteraceae Species 0.000 description 2
- 241001106067 Atropa Species 0.000 description 2
- 241000219198 Brassica Species 0.000 description 2
- 235000011331 Brassica Nutrition 0.000 description 2
- 244000178993 Brassica juncea Species 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 235000001169 Brassica oleracea var oleracea Nutrition 0.000 description 2
- 241000209507 Camellia Species 0.000 description 2
- 241001674939 Caulanthus Species 0.000 description 2
- 241000219312 Chenopodium Species 0.000 description 2
- 235000007516 Chrysanthemum Nutrition 0.000 description 2
- 235000010523 Cicer arietinum Nutrition 0.000 description 2
- 241000219109 Citrullus Species 0.000 description 2
- 244000241235 Citrullus lanatus Species 0.000 description 2
- 244000131522 Citrus pyriformis Species 0.000 description 2
- 241000723377 Coffea Species 0.000 description 2
- 241000020428 Colea Species 0.000 description 2
- 235000004035 Cryptotaenia japonica Nutrition 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 2
- 235000009852 Cucurbita pepo Nutrition 0.000 description 2
- 241000219104 Cucurbitaceae Species 0.000 description 2
- 235000001427 Cylindropuntia imbricata Nutrition 0.000 description 2
- 240000005729 Cylindropuntia imbricata Species 0.000 description 2
- 241000208296 Datura Species 0.000 description 2
- 244000000626 Daucus carota Species 0.000 description 2
- 235000002767 Daucus carota Nutrition 0.000 description 2
- 240000001879 Digitalis lutea Species 0.000 description 2
- 235000007351 Eleusine Nutrition 0.000 description 2
- 241000209215 Eleusine Species 0.000 description 2
- 235000013499 Eleusine coracana subsp coracana Nutrition 0.000 description 2
- 241000556426 Erwinia rhapontici Species 0.000 description 2
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 2
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical compound OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 2
- 241000508723 Festuca rubra Species 0.000 description 2
- 240000009088 Fragaria x ananassa Species 0.000 description 2
- SSWAFVQFQWOJIJ-XIRDDKMYSA-N Gln-Arg-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N SSWAFVQFQWOJIJ-XIRDDKMYSA-N 0.000 description 2
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- SBHVGKBYOQKAEA-SDDRHHMPSA-N Gln-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O SBHVGKBYOQKAEA-SDDRHHMPSA-N 0.000 description 2
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- FTTHLXOMDMLKKW-FHWLQOOXSA-N Gln-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTTHLXOMDMLKKW-FHWLQOOXSA-N 0.000 description 2
- 241001442498 Globodera Species 0.000 description 2
- 241001460043 Globodera millefolii Species 0.000 description 2
- 241001489135 Globodera pallida Species 0.000 description 2
- 241000923667 Globodera tabacum Species 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 2
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 2
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 2
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 2
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- 244000020551 Helianthus annuus Species 0.000 description 2
- 241001481225 Heterodera avenae Species 0.000 description 2
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- 235000021506 Ipomoea Nutrition 0.000 description 2
- 235000013757 Juglans Nutrition 0.000 description 2
- 241000758789 Juglans Species 0.000 description 2
- 241000588748 Klebsiella Species 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- 241000208822 Lactuca Species 0.000 description 2
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 2
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 2
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 2
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- 241000234280 Liliaceae Species 0.000 description 2
- 240000004296 Lolium perenne Species 0.000 description 2
- 241000219745 Lupinus Species 0.000 description 2
- 235000010649 Lupinus albus Nutrition 0.000 description 2
- 240000000894 Lupinus albus Species 0.000 description 2
- 235000010653 Lupinus angustifolius Nutrition 0.000 description 2
- 240000005776 Lupinus angustifolius Species 0.000 description 2
- 235000010648 Lupinus luteus Nutrition 0.000 description 2
- 244000045959 Lupinus luteus Species 0.000 description 2
- 241000227653 Lycopersicon Species 0.000 description 2
- 235000002262 Lycopersicon Nutrition 0.000 description 2
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 2
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 2
- WWEWGPOLIJXGNX-XUXIUFHCSA-N Lys-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N WWEWGPOLIJXGNX-XUXIUFHCSA-N 0.000 description 2
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 2
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 2
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 2
- 240000004658 Medicago sativa Species 0.000 description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 2
- 241001465268 Meloidogyne camelliae Species 0.000 description 2
- 241000243787 Meloidogyne hapla Species 0.000 description 2
- 241000243786 Meloidogyne incognita Species 0.000 description 2
- 241001464670 Meloidogyne mali Species 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 2
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 240000003433 Miscanthus floridulus Species 0.000 description 2
- 240000000249 Morus alba Species 0.000 description 2
- 241000234615 Musaceae Species 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- 235000006508 Nelumbo nucifera Nutrition 0.000 description 2
- 240000002853 Nelumbo nucifera Species 0.000 description 2
- 235000006510 Nelumbo pentapetala Nutrition 0.000 description 2
- 108700026244 Open Reading Frames Proteins 0.000 description 2
- 241000209117 Panicum Species 0.000 description 2
- 235000006443 Panicum miliaceum subsp. miliaceum Nutrition 0.000 description 2
- 235000009037 Panicum miliaceum subsp. ruderale Nutrition 0.000 description 2
- 241001520808 Panicum virgatum Species 0.000 description 2
- 241000208181 Pelargonium Species 0.000 description 2
- 244000115721 Pennisetum typhoides Species 0.000 description 2
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 241000218657 Picea Species 0.000 description 2
- 235000005205 Pinus Nutrition 0.000 description 2
- 241000218602 Pinus <genus> Species 0.000 description 2
- 241000219000 Populus Species 0.000 description 2
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 2
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 2
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 2
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 2
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 2
- 241000220299 Prunus Species 0.000 description 2
- 235000011432 Prunus Nutrition 0.000 description 2
- 241000589516 Pseudomonas Species 0.000 description 2
- 241000218683 Pseudotsuga Species 0.000 description 2
- 241000040495 Punctodera Species 0.000 description 2
- 241000218206 Ranunculus Species 0.000 description 2
- 241000220259 Raphanus Species 0.000 description 2
- 235000003846 Ricinus Nutrition 0.000 description 2
- 241000322381 Ricinus <louse> Species 0.000 description 2
- 240000000528 Ricinus communis Species 0.000 description 2
- 235000011449 Rosa Nutrition 0.000 description 2
- 241000220317 Rosa Species 0.000 description 2
- 241001092459 Rubus Species 0.000 description 2
- 241000780602 Senecio Species 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- WXWDPFVKQRVJBJ-CIUDSAMLSA-N Ser-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N WXWDPFVKQRVJBJ-CIUDSAMLSA-N 0.000 description 2
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 2
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 2
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 2
- 241000219161 Theobroma Species 0.000 description 2
- 244000299461 Theobroma cacao Species 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 2
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 2
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- 102000007641 Trefoil Factors Human genes 0.000 description 2
- 235000015724 Trifolium pratense Nutrition 0.000 description 2
- 241001312519 Trigonella Species 0.000 description 2
- 241000209143 Triticum turgidum subsp. durum Species 0.000 description 2
- TZNNEYFZZAHLBL-BPUTZDHNSA-N Trp-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O TZNNEYFZZAHLBL-BPUTZDHNSA-N 0.000 description 2
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 2
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 2
- WHJVRIBYQWHRQA-NQCBNZPSSA-N Trp-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 WHJVRIBYQWHRQA-NQCBNZPSSA-N 0.000 description 2
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 2
- LQGDFDYGDQEMGA-PXDAIIFMSA-N Tyr-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N LQGDFDYGDQEMGA-PXDAIIFMSA-N 0.000 description 2
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 2
- LVFZXRQQQDTBQH-IRIUXVKKSA-N Tyr-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LVFZXRQQQDTBQH-IRIUXVKKSA-N 0.000 description 2
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 2
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 2
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 2
- 235000012511 Vaccinium Nutrition 0.000 description 2
- 241000736767 Vaccinium Species 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 2
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- 241000219873 Vicia Species 0.000 description 2
- 235000010749 Vicia faba Nutrition 0.000 description 2
- 235000002098 Vicia faba var. major Nutrition 0.000 description 2
- 241000219977 Vigna Species 0.000 description 2
- 235000009392 Vitis Nutrition 0.000 description 2
- 241000219095 Vitis Species 0.000 description 2
- 244000193174 agave Species 0.000 description 2
- 108010044940 alanylglutamine Proteins 0.000 description 2
- 108010087924 alanylproline Proteins 0.000 description 2
- 108010077245 asparaginyl-proline Proteins 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 244000022203 blackseeded proso millet Species 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 235000019504 cigarettes Nutrition 0.000 description 2
- 238000004140 cleaning Methods 0.000 description 2
- 235000018597 common camellia Nutrition 0.000 description 2
- 239000002299 complementary DNA Substances 0.000 description 2
- 239000002131 composite material Substances 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000034994 death Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 235000013601 eggs Nutrition 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 230000008029 eradication Effects 0.000 description 2
- 238000013467 fragmentation Methods 0.000 description 2
- 238000006062 fragmentation reaction Methods 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010089804 glycyl-threonine Proteins 0.000 description 2
- 108010020688 glycylhistidine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 238000009399 inbreeding Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- -1 light Substances 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 235000005739 manihot Nutrition 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 239000002245 particle Substances 0.000 description 2
- 235000020232 peanut Nutrition 0.000 description 2
- 239000000575 pesticide Substances 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 235000014774 prunus Nutrition 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical group OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 229960000268 spectinomycin Drugs 0.000 description 2
- UNFWWIHTNXNPBV-WXKVUWSESA-N spectinomycin Chemical compound O([C@@H]1[C@@H](NC)[C@@H](O)[C@H]([C@@H]([C@H]1O1)O)NC)[C@]2(O)[C@H]1O[C@H](C)CC2=O UNFWWIHTNXNPBV-WXKVUWSESA-N 0.000 description 2
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 2
- 230000008685 targeting Effects 0.000 description 2
- 235000018322 upland cotton Nutrition 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- NWXMGUDVXFXRIG-WESIUVDSSA-N (4s,4as,5as,6s,12ar)-4-(dimethylamino)-1,6,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4,4a,5,5a-tetrahydrotetracene-2-carboxamide Chemical compound C1=CC=C2[C@](O)(C)[C@H]3C[C@H]4[C@H](N(C)C)C(=O)C(C(N)=O)=C(O)[C@@]4(O)C(=O)C3=C(O)C2=C1O NWXMGUDVXFXRIG-WESIUVDSSA-N 0.000 description 1
- 108020004465 16S ribosomal RNA Proteins 0.000 description 1
- CLQMBPJKHLGMQK-UHFFFAOYSA-N 2-(4-isopropyl-4-methyl-5-oxo-4,5-dihydro-1H-imidazol-2-yl)nicotinic acid Chemical compound N1C(=O)C(C(C)C)(C)N=C1C1=NC=CC=C1C(O)=O CLQMBPJKHLGMQK-UHFFFAOYSA-N 0.000 description 1
- GUMOJENFFHZAFP-UHFFFAOYSA-N 2-Ethoxynaphthalene Chemical compound C1=CC=CC2=CC(OCC)=CC=C21 GUMOJENFFHZAFP-UHFFFAOYSA-N 0.000 description 1
- 125000002124 5'-adenosyl group Chemical group N1=CN=C2N(C=NC2=C1N)[C@H]1[C@H](O)[C@H](O)[C@H](O1)C* 0.000 description 1
- 206010000060 Abdominal distension Diseases 0.000 description 1
- 240000004507 Abelmoschus esculentus Species 0.000 description 1
- 108010000700 Acetolactate synthase Proteins 0.000 description 1
- 108010013043 Acetylesterase Proteins 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241001136249 Agriotes lineatus Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 1
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 1
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- PHQXWZGXKAFWAZ-ZLIFDBKOSA-N Ala-Trp-Lys Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 PHQXWZGXKAFWAZ-ZLIFDBKOSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- 240000002234 Allium sativum Species 0.000 description 1
- 244000144730 Amygdalus persica Species 0.000 description 1
- 241000399940 Anguina tritici Species 0.000 description 1
- 241000196509 Anguinidae Species 0.000 description 1
- 241000294569 Aphelenchoides Species 0.000 description 1
- 206010002961 Aplasia Diseases 0.000 description 1
- 241000219194 Arabidopsis Species 0.000 description 1
- 101100060390 Arabidopsis thaliana CLT3 gene Proteins 0.000 description 1
- 101100194010 Arabidopsis thaliana RD29A gene Proteins 0.000 description 1
- 101100204479 Arabidopsis thaliana SWEET12 gene Proteins 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- JTZUZBADHGISJD-SRVKXCTJSA-N Arg-His-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JTZUZBADHGISJD-SRVKXCTJSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- YKZJPIPFKGYHKY-DCAQKATOSA-N Arg-Leu-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKZJPIPFKGYHKY-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 1
- AOJYORNRFWWEIV-IHRRRGAJSA-N Arg-Tyr-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 AOJYORNRFWWEIV-IHRRRGAJSA-N 0.000 description 1
- 240000006891 Artemisia vulgaris Species 0.000 description 1
- 241000209134 Arundinaria Species 0.000 description 1
- 244000189799 Asimina triloba Species 0.000 description 1
- 235000006264 Asimina triloba Nutrition 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- XLHLPYFMXGOASD-CIUDSAMLSA-N Asn-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XLHLPYFMXGOASD-CIUDSAMLSA-N 0.000 description 1
- PTSDPWIHOYMRGR-UGYAYLCHSA-N Asn-Ile-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O PTSDPWIHOYMRGR-UGYAYLCHSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- ZJIFRAPZHAGLGR-MELADBBJSA-N Asn-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZJIFRAPZHAGLGR-MELADBBJSA-N 0.000 description 1
- JTXVXGXTRXMOFJ-FXQIFTODSA-N Asn-Pro-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O JTXVXGXTRXMOFJ-FXQIFTODSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 1
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 1
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 1
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- CUQDCPXNZPDYFQ-ZLUOBGJFSA-N Asp-Ser-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O CUQDCPXNZPDYFQ-ZLUOBGJFSA-N 0.000 description 1
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000589149 Azotobacter vinelandii Species 0.000 description 1
- 241000580315 Belonolaimidae Species 0.000 description 1
- 101000972350 Bombyx mori Lebocin-4 Proteins 0.000 description 1
- 241000743774 Brachypodium Species 0.000 description 1
- 235000005855 Brassica juncea var. subintegrifolia Nutrition 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 240000008100 Brassica rapa Species 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 241000234673 Bromelia Species 0.000 description 1
- 241000234670 Bromeliaceae Species 0.000 description 1
- 241000209200 Bromus Species 0.000 description 1
- 235000004936 Bromus mango Nutrition 0.000 description 1
- 241000209202 Bromus secalinus Species 0.000 description 1
- 241000319930 Bryophyllum <angiosperm> Species 0.000 description 1
- 241000195940 Bryophyta Species 0.000 description 1
- 241001460030 Cactodera Species 0.000 description 1
- 235000010773 Cajanus indicus Nutrition 0.000 description 1
- 244000105627 Cajanus indicus Species 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 240000006846 Camellia reticulata Species 0.000 description 1
- 235000009467 Carica papaya Nutrition 0.000 description 1
- 241000723353 Chrysanthemum Species 0.000 description 1
- 240000005250 Chrysanthemum indicum Species 0.000 description 1
- 241000220455 Cicer Species 0.000 description 1
- 235000010521 Cicer Nutrition 0.000 description 1
- 241000723343 Cichorium Species 0.000 description 1
- 235000009831 Citrullus lanatus Nutrition 0.000 description 1
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 1
- 235000005979 Citrus limon Nutrition 0.000 description 1
- 244000276331 Citrus maxima Species 0.000 description 1
- 235000001759 Citrus maxima Nutrition 0.000 description 1
- 235000005976 Citrus sinensis Nutrition 0.000 description 1
- 240000002319 Citrus sinensis Species 0.000 description 1
- 244000016593 Coffea robusta Species 0.000 description 1
- 241000218631 Coniferophyta Species 0.000 description 1
- 241001540484 Criconematidae Species 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 240000007235 Cyanthillium patulum Species 0.000 description 1
- YAHZABJORDUQGO-NQXXGFSBSA-N D-ribulose 1,5-bisphosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)C(=O)COP(O)(O)=O YAHZABJORDUQGO-NQXXGFSBSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 241000208175 Daucus Species 0.000 description 1
- 241001444943 Dolichodera Species 0.000 description 1
- 206010058314 Dysplasia Diseases 0.000 description 1
- 235000014716 Eleusine indica Nutrition 0.000 description 1
- 244000025670 Eleusine indica Species 0.000 description 1
- 241000758993 Equisetidae Species 0.000 description 1
- 241000283074 Equus asinus Species 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 235000009419 Fagopyrum esculentum Nutrition 0.000 description 1
- 240000008620 Fagopyrum esculentum Species 0.000 description 1
- 241000234645 Festuca pratensis Species 0.000 description 1
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 241000208152 Geranium Species 0.000 description 1
- 108010061711 Gliadin Proteins 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- PONUFVLSGMQFAI-AVGNSLFASA-N Gln-Asn-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PONUFVLSGMQFAI-AVGNSLFASA-N 0.000 description 1
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 1
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- GLEGHWQNGPMKHO-DCAQKATOSA-N Gln-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GLEGHWQNGPMKHO-DCAQKATOSA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 1
- FGWRYRAVBVOHIB-XIRDDKMYSA-N Gln-Pro-Trp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O FGWRYRAVBVOHIB-XIRDDKMYSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 1
- VOUSELYGTNGEPB-NUMRIWBASA-N Gln-Thr-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O VOUSELYGTNGEPB-NUMRIWBASA-N 0.000 description 1
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 1
- NSEKYCAADBNQFE-XIRDDKMYSA-N Gln-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 NSEKYCAADBNQFE-XIRDDKMYSA-N 0.000 description 1
- OEIDWQHTRYEYGG-QEJZJMRPSA-N Gln-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N OEIDWQHTRYEYGG-QEJZJMRPSA-N 0.000 description 1
- CTJRFALAOYAJBX-NWLDYVSISA-N Gln-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N)O CTJRFALAOYAJBX-NWLDYVSISA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- 241001460041 Globodera artemisiae Species 0.000 description 1
- 241000482313 Globodera ellingtonae Species 0.000 description 1
- 241001331154 Globodera hypolysi Species 0.000 description 1
- 241001062604 Globodera mexicana Species 0.000 description 1
- 241000921106 Globodera virginiae Species 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- CBOVGULVQSVMPT-CIUDSAMLSA-N Glu-Pro-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O CBOVGULVQSVMPT-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- YOTHMZZSJKKEHZ-SZMVWBNQSA-N Glu-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCC(O)=O)=CNC2=C1 YOTHMZZSJKKEHZ-SZMVWBNQSA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- UZWUBBRJWFTHTD-LAEOZQHASA-N Glu-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O UZWUBBRJWFTHTD-LAEOZQHASA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- 241000893331 Gyponana mali Species 0.000 description 1
- 235000003222 Helianthus annuus Nutrition 0.000 description 1
- 241000756137 Hemerocallis Species 0.000 description 1
- 241000040385 Heterodera cajani Species 0.000 description 1
- 241000040388 Heterodera carotae Species 0.000 description 1
- 241000040387 Heterodera ciceri Species 0.000 description 1
- 241000040390 Heterodera cruciferae Species 0.000 description 1
- 241001186915 Heterodera elachista Species 0.000 description 1
- 241000040426 Heterodera filipjevi Species 0.000 description 1
- 241000580319 Heterodera goettingiana Species 0.000 description 1
- 241000916533 Heterodera hordecalis Species 0.000 description 1
- 241000040429 Heterodera humuli Species 0.000 description 1
- 241000040431 Heterodera latipons Species 0.000 description 1
- 241000040434 Heterodera medicaginis Species 0.000 description 1
- 241000040432 Heterodera oryzicola Species 0.000 description 1
- 244000309541 Heterodera rosii Species 0.000 description 1
- 241000040484 Heterodera sacchari Species 0.000 description 1
- 241000040489 Heterodera sorghi Species 0.000 description 1
- 241000040492 Heterodera urticae Species 0.000 description 1
- 241000580313 Heterodera zeae Species 0.000 description 1
- WGHJXSONOOTTCZ-JYJNAYRXSA-N His-Glu-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WGHJXSONOOTTCZ-JYJNAYRXSA-N 0.000 description 1
- BCZFOHDMCDXPDA-BZSNNMDCSA-N His-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)O BCZFOHDMCDXPDA-BZSNNMDCSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 1
- BCSGDNGNHKBRRJ-ULQDDVLXSA-N His-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N BCSGDNGNHKBRRJ-ULQDDVLXSA-N 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 241001540512 Hoplolaimidae Species 0.000 description 1
- 241000208278 Hyoscyamus Species 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- ZZHGKECPZXPXJF-PCBIJLKTSA-N Ile-Asn-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZZHGKECPZXPXJF-PCBIJLKTSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- YKLOMBNBQUTJDT-HVTMNAMFSA-N Ile-His-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YKLOMBNBQUTJDT-HVTMNAMFSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- UFRXVQGGPNSJRY-CYDGBPFRSA-N Ile-Met-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N UFRXVQGGPNSJRY-CYDGBPFRSA-N 0.000 description 1
- VOCZPDONPURUHV-QEWYBTABSA-N Ile-Phe-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VOCZPDONPURUHV-QEWYBTABSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- 244000017020 Ipomoea batatas Species 0.000 description 1
- 244000086690 Jasminum curcas Species 0.000 description 1
- 241000221089 Jatropha Species 0.000 description 1
- 241001091572 Kalanchoe Species 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 101710094902 Legumin Proteins 0.000 description 1
- 241000219739 Lens Species 0.000 description 1
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- GPXFZVUVPCFTMG-AVGNSLFASA-N Leu-Arg-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C GPXFZVUVPCFTMG-AVGNSLFASA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- 240000006240 Linum usitatissimum Species 0.000 description 1
- 235000004431 Linum usitatissimum Nutrition 0.000 description 1
- 244000100545 Lolium multiflorum Species 0.000 description 1
- 241000201425 Longidoridae Species 0.000 description 1
- 241001220360 Longidorus Species 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 1
- GGNOBVSOZPHLCE-GUBZILKMSA-N Lys-Gln-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GGNOBVSOZPHLCE-GUBZILKMSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- GQZMPWBZQALKJO-UWVGGRQHSA-N Lys-Gly-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O GQZMPWBZQALKJO-UWVGGRQHSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 1
- XFANQCRHTMOEAP-WDSOQIARSA-N Lys-Pro-Trp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XFANQCRHTMOEAP-WDSOQIARSA-N 0.000 description 1
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 1
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 1
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- 241001300479 Macroptilium Species 0.000 description 1
- 241000218922 Magnoliophyta Species 0.000 description 1
- 241000121629 Majorana Species 0.000 description 1
- 240000007228 Mangifera indica Species 0.000 description 1
- 235000014826 Mangifera indica Nutrition 0.000 description 1
- 241000219828 Medicago truncatula Species 0.000 description 1
- 241000221574 Melampsora lini Species 0.000 description 1
- 241000961910 Melanoleuca microcephala Species 0.000 description 1
- 241000144336 Meloidogyne artiellia Species 0.000 description 1
- 244000309689 Meloidogyne brevicauda Species 0.000 description 1
- 241000611260 Meloidogyne chitwoodi Species 0.000 description 1
- 241001113272 Meloidogyne exigua Species 0.000 description 1
- 241001143337 Meloidogyne graminicola Species 0.000 description 1
- 241000531360 Meloidogyne inornata Species 0.000 description 1
- 241001423344 Meloidogyne microtyla Species 0.000 description 1
- 241001013479 Meloidogyne naasi Species 0.000 description 1
- 244000309690 Meloidogyne thamesi Species 0.000 description 1
- 206010027336 Menstruation delayed Diseases 0.000 description 1
- 101100409013 Mesembryanthemum crystallinum PPD gene Proteins 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- CFRRIZLGFGJEDB-SRVKXCTJSA-N Met-His-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CFRRIZLGFGJEDB-SRVKXCTJSA-N 0.000 description 1
- RKIIYGUHIQJCBW-SRVKXCTJSA-N Met-His-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RKIIYGUHIQJCBW-SRVKXCTJSA-N 0.000 description 1
- ORRNBLTZBBESPN-HJWJTTGWSA-N Met-Ile-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ORRNBLTZBBESPN-HJWJTTGWSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 1
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 1
- 241000218666 Metasequoia Species 0.000 description 1
- 244000179886 Moringa oleifera Species 0.000 description 1
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 1
- 241000234479 Narcissus Species 0.000 description 1
- 241001282315 Nemesis Species 0.000 description 1
- 101710089395 Oleosin Proteins 0.000 description 1
- 241000219830 Onobrychis Species 0.000 description 1
- 102000004020 Oxygenases Human genes 0.000 description 1
- 108090000417 Oxygenases Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 101150101414 PRP1 gene Proteins 0.000 description 1
- 235000007199 Panicum miliaceum Nutrition 0.000 description 1
- 241000611870 Pantoea dispersa Species 0.000 description 1
- 241001148649 Paratylenchidae Species 0.000 description 1
- 241000588701 Pectobacterium carotovorum Species 0.000 description 1
- 235000007195 Pennisetum typhoides Nutrition 0.000 description 1
- 102000003992 Peroxidases Human genes 0.000 description 1
- 240000007377 Petunia x hybrida Species 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- PLNHHOXNVSYKOB-JYJNAYRXSA-N Phe-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N PLNHHOXNVSYKOB-JYJNAYRXSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 1
- 241000018646 Pinus brutia Species 0.000 description 1
- 235000011613 Pinus brutia Nutrition 0.000 description 1
- 240000003889 Piper guineense Species 0.000 description 1
- 241001092090 Pittosporum Species 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- 229920003171 Poly (ethylene oxide) Polymers 0.000 description 1
- 241000168036 Populus alba Species 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- 244000179560 Prunella vulgaris Species 0.000 description 1
- 235000009827 Prunus armeniaca Nutrition 0.000 description 1
- 244000018633 Prunus armeniaca Species 0.000 description 1
- 241001290151 Prunus avium subsp. avium Species 0.000 description 1
- 235000006029 Prunus persica var nucipersica Nutrition 0.000 description 1
- 235000006040 Prunus persica var persica Nutrition 0.000 description 1
- 244000017714 Prunus persica var. nucipersica Species 0.000 description 1
- 235000014443 Pyrus communis Nutrition 0.000 description 1
- 241000588746 Raoultella planticola Species 0.000 description 1
- 235000011483 Ribes Nutrition 0.000 description 1
- 241000220483 Ribes Species 0.000 description 1
- 235000001537 Ribes X gardonianum Nutrition 0.000 description 1
- 235000001535 Ribes X utile Nutrition 0.000 description 1
- 235000016919 Ribes petraeum Nutrition 0.000 description 1
- 244000281247 Ribes rubrum Species 0.000 description 1
- 235000002355 Ribes spicatum Nutrition 0.000 description 1
- 108010003581 Ribulose-bisphosphate carboxylase Proteins 0.000 description 1
- 235000004443 Ricinus communis Nutrition 0.000 description 1
- 244000137827 Rumex acetosa Species 0.000 description 1
- 235000015807 Rumex acetosa ssp. acetosa Nutrition 0.000 description 1
- 235000015447 Rumex acetosa ssp. alpestris Nutrition 0.000 description 1
- 235000015448 Rumex acetosa ssp. ambiguus Nutrition 0.000 description 1
- 235000015437 Rumex acetosa ssp. arifolius Nutrition 0.000 description 1
- 235000015436 Rumex acetosa ssp. lapponicus Nutrition 0.000 description 1
- 235000015441 Rumex acetosa ssp. pseudoxyria Nutrition 0.000 description 1
- 235000015439 Rumex acetosa ssp. thyrsiflorus Nutrition 0.000 description 1
- 241001106018 Salpiglossis Species 0.000 description 1
- 241001530126 Scrophularia Species 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 1
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 1
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 241001622809 Serratia plymuthica Species 0.000 description 1
- 235000008515 Setaria glauca Nutrition 0.000 description 1
- 240000005498 Setaria italica Species 0.000 description 1
- 241000220261 Sinapis Species 0.000 description 1
- 244000258044 Solanum gilo Species 0.000 description 1
- 235000009184 Spondias indica Nutrition 0.000 description 1
- 235000006468 Thea sinensis Nutrition 0.000 description 1
- 235000009470 Theobroma cacao Nutrition 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- ZQUKYJOKQBRBCS-GLLZPBPUSA-N Thr-Gln-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O ZQUKYJOKQBRBCS-GLLZPBPUSA-N 0.000 description 1
- CQNFRKAKGDSJFR-NUMRIWBASA-N Thr-Glu-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O CQNFRKAKGDSJFR-NUMRIWBASA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 1
- HEJJDUDEHLPDAW-CUJWVEQBSA-N Thr-His-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N)O HEJJDUDEHLPDAW-CUJWVEQBSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 1
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 1
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 241001540434 Trichodoridae Species 0.000 description 1
- 235000007264 Triticum durum Nutrition 0.000 description 1
- AWYXDHQQFPZJNE-QEJZJMRPSA-N Trp-Gln-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N AWYXDHQQFPZJNE-QEJZJMRPSA-N 0.000 description 1
- NOBINHCGDUHOBV-NAZCDGGXSA-N Trp-His-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NOBINHCGDUHOBV-NAZCDGGXSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- JZSLIZLZGWOJBJ-PMVMPFDFSA-N Trp-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N JZSLIZLZGWOJBJ-PMVMPFDFSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- 241000196666 Tylenchidae Species 0.000 description 1
- 241001540466 Tylenchulidae Species 0.000 description 1
- KSVMDJJCYKIXTK-IGNZVWTISA-N Tyr-Ala-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 KSVMDJJCYKIXTK-IGNZVWTISA-N 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- NGALWFGCOMHUSN-AVGNSLFASA-N Tyr-Gln-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NGALWFGCOMHUSN-AVGNSLFASA-N 0.000 description 1
- ULHJJQYGMWONTD-HKUYNNGSSA-N Tyr-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ULHJJQYGMWONTD-HKUYNNGSSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- XJPXTYLVMUZGNW-IHRRRGAJSA-N Tyr-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O XJPXTYLVMUZGNW-IHRRRGAJSA-N 0.000 description 1
- UUBKSZNKJUJQEJ-JRQIVUDYSA-N Tyr-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UUBKSZNKJUJQEJ-JRQIVUDYSA-N 0.000 description 1
- AKKYBQGHUAWPJR-MNSWYVGCSA-N Tyr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)O AKKYBQGHUAWPJR-MNSWYVGCSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 244000274883 Urtica dioica Species 0.000 description 1
- 235000009108 Urtica dioica Nutrition 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 235000010726 Vigna sinensis Nutrition 0.000 description 1
- 244000042314 Vigna unguiculata Species 0.000 description 1
- 241001464837 Viridiplantae Species 0.000 description 1
- 229920002494 Zein Polymers 0.000 description 1
- 241000083686 [Pseudomonas] mesoacidophila Species 0.000 description 1
- 101150067314 aadA gene Proteins 0.000 description 1
- 230000036579 abiotic stress Effects 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 235000002783 ambrette Nutrition 0.000 description 1
- 244000096712 ambrette Species 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 238000003556 assay Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008952 bacterial invasion Effects 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 208000024330 bloating Diseases 0.000 description 1
- GDTBXPJZTBHREO-UHFFFAOYSA-N bromine Chemical compound BrBr GDTBXPJZTBHREO-UHFFFAOYSA-N 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- 235000019693 cherries Nutrition 0.000 description 1
- 210000003763 chloroplast Anatomy 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000003967 crop rotation Methods 0.000 description 1
- 239000000287 crude extract Substances 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000006735 deficit Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- NEKNNCABDXGBEN-UHFFFAOYSA-L disodium;4-(4-chloro-2-methylphenoxy)butanoate;4-(2,4-dichlorophenoxy)butanoate Chemical compound [Na+].[Na+].CC1=CC(Cl)=CC=C1OCCCC([O-])=O.[O-]C(=O)CCCOC1=CC=C(Cl)C=C1Cl NEKNNCABDXGBEN-UHFFFAOYSA-L 0.000 description 1
- 210000002615 epidermis Anatomy 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010081551 glycylphenylalanine Proteins 0.000 description 1
- 230000012447 hatching Effects 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- WGCNASOHLSPBMP-UHFFFAOYSA-N hydroxyacetaldehyde Natural products OCC=O WGCNASOHLSPBMP-UHFFFAOYSA-N 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000000968 intestinal effect Effects 0.000 description 1
- 230000001418 larval effect Effects 0.000 description 1
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 208000004141 microcephaly Diseases 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- VYQNWZOUAUKGHI-UHFFFAOYSA-N monobenzone Chemical compound C1=CC(O)=CC=C1OCC1=CC=CC=C1 VYQNWZOUAUKGHI-UHFFFAOYSA-N 0.000 description 1
- 238000010417 needlework Methods 0.000 description 1
- 230000001069 nematicidal effect Effects 0.000 description 1
- 108010018675 nicotianamine synthase Proteins 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000000050 nutritive effect Effects 0.000 description 1
- 210000002221 olecranon process Anatomy 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 230000008775 paternal effect Effects 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 235000021017 pears Nutrition 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- ZPLUZNXSYCCJOE-UHFFFAOYSA-N phosphoric acid;propan-2-one Chemical compound CC(C)=O.OP(O)(O)=O ZPLUZNXSYCCJOE-UHFFFAOYSA-N 0.000 description 1
- 238000003976 plant breeding Methods 0.000 description 1
- 239000002574 poison Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 229920001223 polyethylene glycol Polymers 0.000 description 1
- 229920000642 polymer Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 101150063097 ppdK gene Proteins 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108060006613 prolamin Proteins 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- NHDHVHZZCFYRSB-UHFFFAOYSA-N pyriproxyfen Chemical compound C=1C=CC=NC=1OC(C)COC(C=C1)=CC=C1OC1=CC=CC=C1 NHDHVHZZCFYRSB-UHFFFAOYSA-N 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 210000000614 rib Anatomy 0.000 description 1
- 230000005070 ripening Effects 0.000 description 1
- 238000009666 routine test Methods 0.000 description 1
- 101150015537 rps12 gene Proteins 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000035040 seed growth Effects 0.000 description 1
- 210000000582 semen Anatomy 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000012882 sequential analysis Methods 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 239000001509 sodium citrate Substances 0.000 description 1
- NLJMYIDDQXHKNR-UHFFFAOYSA-K sodium citrate Chemical compound O.O.[Na+].[Na+].[Na+].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O NLJMYIDDQXHKNR-UHFFFAOYSA-K 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 238000009331 sowing Methods 0.000 description 1
- 210000000352 storage cell Anatomy 0.000 description 1
- 235000021012 strawberries Nutrition 0.000 description 1
- 229960005322 streptomycin Drugs 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- AYEKOFBPNLCAJY-UHFFFAOYSA-O thiamine pyrophosphate Chemical compound CC1=C(CCOP(O)(=O)OP(O)(O)=O)SC=[N+]1CC1=CN=C(C)N=C1N AYEKOFBPNLCAJY-UHFFFAOYSA-O 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 108010080629 tryptophan-leucine Proteins 0.000 description 1
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 230000002792 vascular Effects 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 230000009385 viral infection Effects 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
- 239000005019 zein Substances 0.000 description 1
- 229940093612 zein Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8279—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance
- C12N15/8285—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for biotic stress resistance, pathogen resistance, disease resistance for nematode resistance
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/90—Isomerases (5.)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A40/00—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production
- Y02A40/10—Adaptation technologies in agriculture, forestry, livestock or agroalimentary production in agriculture
- Y02A40/146—Genetically Modified [GMO] plants, e.g. transgenic plants
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Chemical & Material Sciences (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Cell Biology (AREA)
- Plant Pathology (AREA)
- Medicinal Chemistry (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明提供编码蔗糖异构酶多肽N末端截短形式的多核苷酸,所述多肽能够赋予植物提高的线虫抗性。本发明还提供产生具有提高的线虫抗性的转基因植物的方法、这种转基因植物的种子及表达载体,其所有均包含本发明的多核苷酸。
Description
相关申请的交叉参考
本申请要求2007年2月8日提交的美国临时申请序列号60/900,228的优先权。
发明领域
本发明涉及控制线虫,尤其是控制大豆胞囊线虫。本文公开的是产生具有提高的线虫抗性的转基因植物的方法、包含编码功能蛋白质的多核苷酸的表达载体和转基因植物及其产生的种子。
发明背景
线虫是微观的蠕虫样动物,其以多于2,000种蔬菜、水果和观赏植物的根、叶和茎为食,引起全世界1万亿美元的农作物损失。线虫的一种常见类型是根癌线虫(RKN),其进食在根上产生特有的虫瘿。其它摄取根为食的线虫是胞囊类型和损伤类型,其更多为宿主特异性的。
线虫在美国普遍存在,但它主要是南部和西部温暖潮湿地区和沙壤土中的问题。大豆胞囊线虫(SCN)Heterodera glycines首先于1954年在美国North Carolina中发现。其为大豆植物最严重的害虫。一些地区被SCN严重感染,以至于在没有控制措施的情况下大豆产量在经济上不再是可能的。尽管大豆是被SCN攻击的主要经济作物,但SCN寄生在总共50种宿主中,其包括农作物、蔬菜、观赏植物和杂草。
线虫损害的标志包括叶的发育障碍和黄化,以及在热时期植物的萎蔫。然而,线虫包括SCN可引起显著的产量损失,而却没有地上症状。此外,感染了SCN的根矮化或短小。线虫侵袭可减少根上的固氮根瘤的数量,并可使根对其它土壤传播植物病原体的攻击更为敏感。
线虫生活周期有三个主要阶段:卵期、幼虫期和成虫期。所述生活周期在线虫种类之间不同。例如,SCN生活周期通常在最适条件下24到30天内完成,而其它种类需要一年或更长时间来完成其生活周期。当温度和湿度水平在春季变得足够时,蠕虫状幼虫在土壤中从卵开始孵化。这些幼虫是可感染大豆根的线虫的唯一生活周期。
SCN的生活周期已经成为许多研究的主题,并因而可用作理解线虫生活周期的实例。在渗入大豆根后,SCN幼虫穿过根移动,直至它们接触到维管组织,在那里它们停止移动并开始摄取食物。线虫注射分泌物,所述分泌物修饰根的某些细胞并将它们转变成专门的进食位点。根细胞在形态上被转变成大的多核合胞体(或在RKN的情况下为巨细胞),其用作线虫的营养来源。活跃进食的线虫因此从植物偷取必需营养,导致产量降低。线虫进食时,它们膨胀,并且最终雌性线虫变得很大,以至于它们突破根组织并暴露于根表面。
在成虫时不膨胀的雄性SCN线虫从根中迁移到土壤中并与柠檬状的成年雌虫受精。然后雄虫死亡,而雌虫仍然附着在根系上并继续进食。膨胀的雌虫中的卵首先在体外的块囊或卵囊中开始发育,稍后在体腔内进行发育。最终成年雌虫的整个体腔内充满卵,雌线虫死亡。死亡雌虫充满卵的尸体被称为胞囊。胞囊最终离开并在土壤中游离存在。胞囊的壁变得很粗糙,这为其中包含的大约200到400个卵提供极好的保护。SCN卵在胞囊内生存,直至出现合适的孵化条件。尽管许多卵可在第一年内进行孵化,但许多也将在胞囊内生存若干年。
线虫依靠自己的力量每年只能沿土壤移动少许英寸。然而,线虫侵袭可以多种方式传播很大距离。可以移动被感染的土壤的任何东西均能够传播所述感染,包括农场机器、运载工具和工具、风、水、动物和农业工人。种子大小的土壤颗粒经常污染所收获的种子。结果,当将来自感染田地的污染种子种植于未感染的田地中时,线虫感染就可以进行传播。甚至有证据表明某些线虫种类通过鸟类传播。仅可以预防这些原因中的一些。
用于治理线虫感染的传统实践包括:在线虫感染的土地中维持适当的土壤营养和土壤pH水平;控制其它植物疾病,及昆虫和杂草害虫;仅在未感染田地工作后使用卫生设施,如耕地、播种和耕作线虫感染的田地;在感染田地中工作完之后用高压水或蒸气彻底清洗设备;除非已经将种子彻底清洗,否则不使用在感染田地中生长的种子用于种植未感染的田地;用非寄主农作物轮种感染田地并更换寄主农作物;使用杀线虫剂;和种植有抵抗力的植物变种。
已经提议方法用于植物的遗传转化,以赋予对植物寄生线虫提高的抵抗力。美国专利号5,589,622和5,824,876涉及线虫附着后在进食位点或其附近特异性表达的植物基因的鉴定。WO2004/005504描述了通过表达蔗糖异构酶基因产生线虫抗性植物的方法。在某些微生物中产生的蔗糖异构酶将蔗糖转化成异麦芽酮糖(palatinose)(参阅,美国专利号5,985,668和5,786,140)。
发明简述
本发明人已惊奇地发现与蔗糖异构酶类似但不具有蔗糖异构酶活性的蛋白质在转基因植物中表达时会赋予线虫抗性。本发明提供了多核苷酸、转基因植物和种子,以及克服或至少减轻有价值的农作物如大豆的线虫感染的方法。
因此,本发明包含分离的多核苷酸,所述分离的多核苷酸编码蔗糖异构酶多肽的N末端截短形式,其当转化入植物中时显示抗线虫活性,其中所述多肽不显示蔗糖异构酶酶促活性。
在另一实施方案中,本发明涉及包含有效连接多核苷酸的转录调节元件的表达载体,所述多核苷酸编码蔗糖异构酶多肽的N末端截短形式,其当转化到植物中时显示抗线虫活性,但不显示蔗糖异构酶酶促活性。
在另一实施方案中,本发明提供转化有表达载体的转基因植物,所述表达载体包含编码蔗糖异构酶多肽的N末端截短形式的分离的多核苷酸,其当转化到植物中时显示抗线虫活性,但不显示蔗糖异构酶酶促活性。本发明的转基因植物与植物的野生型变种相比显示出对线虫的抗性增加。
本发明的另一实施方案提供了对于分离的多核苷酸为不分离的转基因种子,所述多核苷酸编码蔗糖异构酶多肽的N末端截短形式,其当转化到植物中时显示抗线虫活性,但不显示蔗糖异构酶酶促活性。
在另一实施方案中,本发明提供了产生具有提高的线虫抗性的转基因植物的方法,其中所述方法包括以下步骤:将包含有效连接分离的多核苷酸的转录调节元件的表达载体导入植物中,并选择线虫抗性提高的转基因植物,所述分离的多核苷酸编码蔗糖异构酶多肽的N末端截短形式,其当转化到植物中时显示抗线虫活性,但不显示蔗糖异构酶酶促活性。
附图简述
图1a-1c显示了本发明截短的蔗糖异构酶(SEQ ID NO:1)与来自大黄欧文氏菌(Erwinia rhapontici)的全长蔗糖异构酶(登录号AF279281;SEQID NO:3)的DNA序列比对。使用AlignX程序在VNTI中进行比对(逐对比较,空位开放罚分=15,空位延伸罚分=6.66)。
图2显示了SEQ ID NO:2所描述的截短的大黄欧文氏菌氨基酸序列与SEQ ID NO:5所描述的普城沙雷氏菌(Serratia plymuthica)的蔗糖异构酶的截短氨基酸序列的总体百分比同一性。PID=总体百分比同一性。
图3a-3b显示了SEQ ID NO:2所描述的欧文氏菌截短蔗糖异构酶的示例性截短同源物与具有SEQ ID NO:5、14、15、16、17、18、19和20的同源物的氨基酸比对。Vector NTI软件包(空位开放罚分=15,空位延伸罚分=6.66,空位分隔罚分=8)。
图4显示了欧文氏菌截短蔗糖异构酶的示例性截短同源物的总体百分比同一性矩阵表。
优选实施方案详述
可参考本发明实施方案的以下详细描述和本文中包括的实施例来更容易地理解本发明。除非另有指出,本文中所用的术语应根据相关领域中普通技术人员常规用法来理解。
在本申请中,参考多种专利和文献出版物。所有这些出版物的公开内容和这些出版物中引用的那些参考特此以其整体引入本申请中作为参考,以更详尽地描述本发明所属的领域现状。所使用的缩写和命名法在本领域中被认为是标准,并通常用于如此处引用的那些专业杂志中。如此处所用及在所附权利要求中,单数形式“a”、“an”和“the”包括复数参考,除非上下文中清楚地另有说明。如此处所用,单词“或”指特定列表中的任何一个成员,并也包括该列表成员中的任何组合。
术语“大约”此处用于指近似、大致、大约或在一定区域内。当所述术语“大约”与数字范围结合使用时,其通过延伸高于和低于所设数值的边界修饰该范围。通常,术语“大约”此处用于修饰高于和低于所述值10%(高或低,更高或更低)的数值。
如此处所用,词语“核酸”、“核苷酸”或“多核苷酸”旨在包括DNA分子(例如,cDNA或基因组DNA)、RNA分子(例如,mRNA)、天然发生的、突变的、合成的DNA或RNA分子,和使用核苷酸类似物产生的DNA或RNA的类似物。其可以是单链的或双链的。这种核酸或多核苷酸包括,但不限于结构基因的编码序列、反义序列和不编码mRNA或蛋白质产物的非编码调节序列。多核苷酸可编码农艺学上有价值的性状或表型性状。
如此处所用,“分离的”多核苷酸当通过重组技术产生时基本上没有其它细胞物质或培养基,或当化学合成时基本上没有化学前体。
术语“基因”广泛用于指与生物功能相关的核酸的任何片段。因此,基因包括基因组序列中的内含子和外显子,或仅包括其表达所需的cDNA和/或调节序列中的编码序列。例如,基因指表达mRNA或功能RNA,或编码特定蛋白质,并包括调节序列的核酸片段。
术语“多肽”和“蛋白质”此处互换地使用来指连续氨基酸残基的多聚体。
如此处所用,术语“有效连接”或“功能连接”指单个核酸片段上多个核酸序列的连接,使得一个核酸的功能受另一核酸的影响。例如,如果两个DNA的位置使得调节DNA影响编码DNA的表达,那么调节DNA称为“有效连接到”表达RNA或编码多肽的DNA上。
如此处所用,术语“特异性表达”指被限制在一种或少数植物组织(空间限制)和/或一个或几个植物发育阶段(时间限制)的基因产物的表达。公认的是几乎不存在真正的特异性:启动子看起来优选在一些组织中开启,而在其它组织中没有或仅仅具有很少活性。这种现象称为泄漏表达。然而,如此处定义的特异性表达旨在包括在一种或少数植物组织或植物特定部位中的表达。
如此处所用,术语“启动子”指DNA序列,当其连接到目的核苷酸序列上时能够控制目的核苷酸序列转录成mRNA。启动子通常(尽管不是必须的)位于目的核苷酸(其向mRNA的转录受启动子的控制)5’(例如上游)(例如,结构基因转录起始位点附近),并为RNA聚合酶和用于转录起始的其它转录因子的特异性结合提供位点。
如此处所用,术语“转录调节元件”指能够调节有效连接的多核苷酸转录的多核苷酸。其包括,但不限于启动子、增强子、内含子、5’UTR和3’UTR。
如此处所用,术语“载体”指能够转运已经连接了另一核酸的核酸分子。载体的一种类型是“质粒”,其指可连接额外DNA片段的环状双链DNA环。在本说明书中,“质粒”和“载体”可互换使用,因为质粒是最常用的载体形式。载体可以是双元载体或包含左边界和右边界并可以包括中间目的基因的T-DNA。如此处所用,术语“表达载体”指能够在适当宿主细胞中指导特定核苷酸表达的载体。表达载体包含有效连接目的核酸的调节性核酸元件,其任选地有效连接到终止信号和/或其它调节元件上。
如此处所用,术语“同源物”指通过从共同的祖先DNA序列遗传而与第二个基因相关的基因。术语“同源物”可应用于通过物种形成事件(例如,直向同源物)分离的基因之间的关系或应用于通过遗传复制事件(例如,共生同源物)分离的基因之间的关系。
如此处所用,术语“直向同源物”指来自不同物种,但通过物种形成从共同的祖先基因进化而来的基因。直向同源物在进化过程中保留了相同的功能。直向同源物编码具有相同或类似功能的蛋白质。如此处所用,术语“共生同源物”指在基因组内通过复制而相关联的基因。共生同源物通常具有不同的功能或新的功能,但这些功能可能是相关的。
如此处所用,术语“在严格条件下杂交”旨在描述用于杂交和洗涤的条件,在所述条件下彼此之间至少60%相似或同一的核苷酸序列通常彼此保持杂交。在另一实施方案中,条件是使得彼此之间至少约65%、或至少约70%、或至少约75%或更高相似或同一的序列通常彼此保持杂交的条件。这种严格条件为本领域技术人员已知并如下文描述。严格条件的优选的非限制性实例是在6X氯化钠/柠檬酸钠(SSC)中,在约45℃中杂交,随后是在0.2X SSC、0.1%SDS中于50-65℃一次或更多次洗涤。
在两种核酸或多肽序列的上下文中,术语“序列同一性”或“同一性”参考当在特定比较窗(如在总体比对中的完整序列或在局部比对中的相似区域)中为最大对应性进行比对时两条序列中相同的残基。当序列同一性的百分比用于指蛋白质时,公认的是不相同的残基位置经常因保守氨基酸替换而不同,其中氨基酸残基被具有相似化学性质(例如,电荷或疏水性)的其它氨基酸残基替换并因而不改变分子的功能性质。当序列在保守替换中不同时,可向上调节百分比序列同一性以校正替换的保守性。通过这种保守替换而不同的序列被称为具有“序列相似性”或“相似性”。用于进行这些调整的手段为本领域技术人员所熟知。通常,这包括将保守替换评定为部分错配而不是完全错配,由此提高序列相似性的百分比。
如此处所用,“序列同一性的百分比”或“序列同一性百分比”指通过在比较窗内总体或局部比较两条最佳比对序列来测定的值,其中对两条序列的最佳比对而言,比较窗中序列部分可包含空位。原则上,可通过测定位置数目(在所述位置中相同核酸碱基或氨基酸残基出现在两条序列中以产生匹配的位置数),用比较窗中全部位置数除匹配位置数,并将结果乘以100以产生序列同一性的百分比来计算所述百分比。对蛋白质序列而言,可使用相同原理来计算“序列相似性的百分比”,其中保守替换计算为部分错配而非完全错配。因此,例如,给予相同氨基酸分值1,给予非保守替换分值0,给予保守替换0和1之间的分值。可从本领域已知的氨基酸矩阵(例如Blosum或PAM矩阵)中获得保守替换的得分。
用于比较的序列比对方法为本领域所熟知。可使用数学算法来完成两条序列之间百分比同一性或百分比相似性(对蛋白质而言)的测定。这些数学算法的优选非限制性实例为,Myers和Miller的算法(Bioinformatics,4(1):11-17,1988)、Needleman-Wunsch总体比对(J Mol Biol.48(3):443-53,1970)、Smith-Waterman局部比对(Journal of Molecular Biology,147:195-197,1981)、Pearson和Lipman的相似性搜索方法(search-for-similarity-method)(PNAS,85(8):2444-2448,1988)、Karlin和Altschul的算法(J.Mol.Biol.,215(3):403-410,1990;PNAS,90:5873-5877,1993)。可利用计算机实现这些数学算法用于序列比较,以测定序列同一性或鉴定同源物。
如此处所用,术语“保守区”或“保守结构域”指异源多核苷酸或多肽序列中的区域,其中在不同序列之间具有相对高程度的序列同一性。可例如使用Clustal W算法从多序列比对来鉴定“保守区”。
如此处所用,术语“细胞”或“植物细胞”指单细胞,并也包括细胞群体。群体可以是包含一种细胞类型的纯的群体。同样,所述群体可包含多于一种细胞类型。本发明意义中的植物细胞可以是分离的(例如,在悬浮培养中)或包含在任何发育阶段的植物组织、植物器官或植物中。
关于植物(或“植物组织”)的术语“组织”指多个植物细胞的排列,包括植物的分化组织和未分化组织。植物组织可组成植物器官的部分(例如植物叶的表皮),但也组成肿块组织(例如,愈伤组织)和培养中的多种细胞类型(例如,单细胞、原生质体、胚、愈伤组织、原胚体状体等)。植物组织可以在植物中、器官培养中、组织培养或细胞培养中。
关于植物(或“植物器官”)的术语“器官”指植物部分,并可包括,但不限于,例如根、果实、芽、茎、叶、下胚轴、子叶、花药、萼片、花瓣、花粉、种子等。
如此处所用,根据上下文,术语“植物”可理解为整株植物、植物细胞、植物器官、植物种子和植物后代。词语“植物”也指任何植物,尤其指种子植物,并可包括,但不限于农作物植物。植物部分包括,但不限于茎、根、芽、果实、胚珠、雄蕊、叶、胚、分生组织区、愈伤组织、配子体、孢子体、花粉、小孢子、下胚轴、子叶、花药、萼片、花瓣、花粉、种子等。可用于本发明方法中的植物种类通常广泛到易于实施转化技术的高等植物和低等植物种类,包括被子植物(单子叶和双子叶植物)、裸子植物、蕨类植物、楔叶类(horsetails)、草原植物、苔藓植物和多细胞藻类。
如此处所用,术语“转基因”旨在指含有转基因或其基因组已经被引入转基因改变或已经掺入了外源基因或多核苷酸的细胞和/或植物。可通过几种方法来产生转基因细胞、组织、器官和植物,包括将包含多核苷酸(通常是DNA)的“转基因”引入至靶细胞中或通过人为干涉如通过此处描述的方法将转基因整合至靶细胞的染色体中。
如此处所用,术语“不分离”指对特定性状而言多种植物如果其对该性状在一定程度上是纯合的,当不分离变种是自体受精时,观察不到后代中显著量的独立性状分离。
如此处所用,术语“对照植物”或“野生型植物”指在转基因植物或遗传修饰植物中为鉴定增强表型或想要的性状的目的,用于与转基因植物或遗传修饰植物相比较的植物细胞、外植体、种子、植物成分、植物组织、植物器官或整株植物。“对照植物”在一些情况下可以是转基因植物株系,其包含空载体或标记基因,但不含有所评估的转基因植物或遗传修饰植物中存在的目的重组多核苷酸。对照植物可以是与所检测的转基因植物或遗传修饰植物相同株系或变种的植物,或其可以是另一株系或变种,如已知具有特定表型、特征或已知基因型的植物。合适的对照植物将包括用于产生本文转基因植物的亲本株系的遗传未改变植物或非转基因植物。
如此处所用,术语“对线虫感染有抗性”或“具有线虫抗性的植物”指植物避免被线虫感染、杀死线虫或阻碍、减慢或终止线虫发育、生长或增殖的能力。这可以通过主动过程(例如通过产生对线虫有害的物质),或通过被动过程(像对线虫减少营养价值或不发育通过线虫进食位点诱导的结构,如合胞体或巨细胞)来实现。可以多种方式,例如通过计数能够在该植物上建立寄生的线虫,或测定线虫的发育时间、雄性和雌性线虫的比例或所产生胞囊或线虫卵的数量来测定植物的线虫抗性水平。对线虫感染抗性提高的植物是相比于另一植物对线虫感染更具抵抗力的植物,所述另一植物具有类似的或优选相同的基因型,而缺少赋予线虫提高抗性的基因或多个基因,例如对照或野生型植物。
术语“进食位点”或“合胞体位点”可互换地使用并如此处所用指线虫感染后在植物根中形成的进食位点。该位点用作线虫的营养来源。合胞体是胞囊线虫的进食位点,巨细胞是根癌线虫的进食位点。
如此处定义,“蔗糖异构酶多肽的N末端截短形式”指蔗糖异构酶多肽,其缺少在相应天然蔗糖异构酶多肽中发现的至少约5%、10%、15%、18%、20%、21%、22%、23%、24%或25%的N末端氨基酸。本发明蔗糖异构酶多肽的N末端截短形式是具有SEQ ID NO:2中阐明的氨基酸序列的多肽的同源物。可从全长蔗糖异构酶多肽的直向同源物和共生同源物中分离蔗糖异构酶多肽的额外的N末端截短形式。
在一个实施方案中,本发明提供编码不显示蔗糖异构酶酶促活性的蔗糖异构酶多肽的N末端截短形式的分离多核苷酸。根据本发明,任何全长蔗糖异构酶多肽的多核苷酸序列可用于鉴定编码不显示蔗糖异构酶酶促活性的蔗糖异构酶多肽的N末端截短形式的多核苷酸。在下文实施例中公开了确定蔗糖异构酶多肽的N末端截短形式中蔗糖异构酶活性存在或缺失的测定法。在示例性实施方案中,所述多核苷酸选自具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸;编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸;与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸有70%序列同一性的多核苷酸;编码与具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽有70%序列同一性的多肽的多核苷酸;在严格条件下与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸杂交的多核苷酸;和在严格条件下与编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸杂交的多核苷酸。
本发明也具体体现在分离的多核苷酸中,所述分离的多核苷酸与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸至少约50-60%、或至少约60-70%、或至少约70-80%、80-85%、85-90%、90-95%、或至少约95%、96%、97%、98%、99%或更高相同或更相似。在另一实施方案中,本发明的多核苷酸包含编码多肽的多核苷酸,所述多肽与具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的任何多肽至少约50-60%、或至少约60-70%、或至少约70-80%、80-85%、85-90%、90-95%、或至少约95%、96%、97%、98%、99%或更相同或更相似。
本发明分离多核苷酸中还包括的是不显示蔗糖异构酶活性的全长蔗糖异构酶多肽的等位基因变体。如此处所用,术语“等位基因变体”指含有多态性的多核苷酸,所述多态性导致所述核苷酸编码的蛋白质氨基酸序列的改变,并且其存在于自然群体中(例如,植物物种或变种)。这种天然等位基因变异通常可导致编码蛋白质的多核苷酸1-5%变异,或所编码蛋白质中1-5%变异。等位基因变体可通过对在大量不同植物中目的核酸进行测序来鉴定,例如可通过使用杂交探针鉴定那些植物中相同基因遗传位点来容易地进行鉴定。多核苷酸中任何及所有这种核酸变异,和所得氨基酸多态性或因天然等位基因变异并且不改变所编码蛋白质的功能活性的蛋白质变异均属于本发明的范围。
本发明还具体体现在转化有包含分离多核苷酸的表达载体的转基因植物中,所述分离的多核苷酸编码不显示蔗糖异构酶活性的蔗糖异构酶多肽的N末端截短形式,其中所述多核苷酸的表达赋予植物提高的线虫抗性。在一个示例性实施方案中,本发明转基因植物包含具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸。在另一示例性实施方案中,所述转基因植物包含编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸。在另一示例性实施方案中,本发明转基因植物包含多核苷酸,所述多核苷酸与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸至少约50-60%、或至少约60-70%、或至少约70-80%、80-85%、85-90%、90-95%、或至少约95%、96%、97%、98%、99%或更高同一性或相似性。在另一示例性实施方案中,本发明转基因植物包含多核苷酸,所述多核苷酸与具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽有至少约50-60%、或至少约60-70%、或至少约70-80%、80-85%、85-90%、90-95%、或至少约95%、96%、97%、98%、99%或更高同一性或相似性。
本发明还提供转基因种子和来自该种子的后代植物,包括杂种和近亲交配,所述种子对编码不显示蔗糖异构酶活性的蔗糖异构酶多肽的N末端截短形式的多核苷酸而言是不分离的。本发明还提供植物育种的方法,例如以制备杂交可育转基因植物。所述方法包括将包含本发明特定表达载体的可育转基因植物与其自身或第二种植物(例如缺少特定表达载体的植物)杂交,以制备包含特定表达载体的杂交可育转基因植物的种子。然后种植所述种子以获得杂交可育转基因植物。植物可以是单子叶植物或双子叶植物。所述杂交可育转基因植物可以具有通过母本或父本遗传的所述特定表达载体。所述第二种植物可以是近亲交配植物。杂交可育转基因植物可以是杂种。本发明中还包括的是任何这些杂交可育转基因植物的种子。
本发明的另一实施方案涉及包含有效连接一个或更多个本发明多核苷酸的一个或更多个转录调节元件的表达载体,其中所述多核苷酸的表达赋予转基因植物提高的线虫抗性。在一个实施方案中,所述转录调节元件是能够调节有效连接的多核苷酸组成型表达的启动子。“组成型启动子”指能够在植物的所有或几乎所有发育阶段,所有或几乎所有植物组织中表达它控制的可读框的或调节元件启动子。组成型启动子包括,但不限于来自植物病毒的35S CaMV启动子(Franck等,Cell 21:285-294,1980)、Nos启动子、泛蛋白启动子(Christense等,Plant Mol.Biol.12:619-632,1992和18:581-8,1991)、MAS启动子(Velten等,EMBO J.3:2723-30,1984)、玉米H3组蛋白启动子(Lepetit等,Mol Gen.Genet 231:276-85,1992)、ALS启动子(WO96/30530)、19S CaMV启动子(US 5,352,605)、super启动子(US5,955,646)、玄参属花叶病毒启动子(US 6,051,753)、稻肌动蛋白启动子(US5,641,876)和核酮糖二磷酸羧化酶/加氧酶小亚基启动子(US 4,962,028)。
在另一实施方案中,转录调节元件是调节性启动子。“调节性启动子”指不是组成型,而是以时间和/或空间方式指导基因表达的启动子,并包括组织特异性和诱导型启动子。不同的启动子可在不同的组织或细胞类型中,或在不同的发育阶段,或响应不同的环境条件指导基因或调节元件的表达。
“组织特异性启动子”指不在所有植物细胞中表达却仅在特定器官的一种或多种细胞类型(如叶子和种子)、特定组织(如胚胎或子叶)或特定细胞类型(如叶薄壁组织或种子储藏细胞)中表达的调节性启动子。这些也包括时间调节的启动子,如在胚胎发生早期或晚期、发育种子或果实的果实催熟过程中、在完全分化的叶中或在序列开始时。合适的启动子包括来自油菜的油菜籽蛋白基因启动子(US 5,608,152)、来自蚕豆的USP启动子(Baeumlein等,1991 Mol Gen Genet.225(3):459-67)、来自拟南芥的油质蛋白启动子(WO 98/45461)、来自菜豆的菜豆蛋白启动子(US 5,504,200)、来自芸苔的Bce4启动子(WO 91/13980)或豆球蛋白B4启动子(LeB4;Baeumlein等,1992 Plant Journal,2(2):233-9),以及在单子叶植物像玉米、大麦、小麦、黑麦、稻等中赋予种子特异性表达的启动子。应注意的合适的启动子是来自大麦的lpt2或lpt1基因启动子(WO 95/15389和WO95/23230)或在WO 99/16890中描述的那些启动子(来自大麦醇溶蛋白基因、稻谷蛋白基因、水稻素基因、稻谷醇溶蛋白基因、小麦醇溶蛋白基因、小麦谷蛋白基因、玉米醇溶蛋白基因、燕麦谷蛋白基因、高粱kasirin基因和裸麦醇溶蛋白基因的启动子)。适合于在植物根组织中优先表达的启动子包括,例如来自玉米nicotianamine合酶基因的启动子(US 20030131377)和稻RCC3启动子(US 11/075,113)。用于在植物绿色组织中优先表达的合适启动子包括来自基因如玉米醛缩酶基因FDA的启动子(US 20040216189)、来自醛缩酶和丙酮正磷酸二激酶(PPDK)的启动子(Taniguchi等,Plant CellPhysiol.41(1):42-48,2000)。
“诱导型启动子”指那些调节性启动子,其在一种或更多种细胞类型中可通过外界刺激,例如化学品、光、激素、胁迫或病原体(如线虫)而开启。如果期望基因表达以时间特异性方式发生,那么化学诱导型启动子尤其合适。这种启动子的实例是水杨酸诱导型启动子(WO 95/19443)、四环素诱导型启动子(Gatz等,1992 Plant J.2:397-404)、来自核酮糖-1,5-二磷酸羧化酶(ssRUBISCO)的光诱导型启动子和乙醇诱导型启动子(WO 93/21334)。并且,响应生物胁迫或非生物胁迫条件的合适启动子是那些如病原体诱导型PRP1基因启动子(Ward等,1993 Plant.Mol.Biol.22:361-366)、来自番茄的热诱导型hsp80启动子(US 5187267)、来自马铃薯的冷诱导型α淀粉酶启动子(WO 96/12814)、玉米的干旱诱导型启动子(Busk等,Plant J.11:1285-1295,1997)、来自马铃薯的冷、干旱及高盐诱导型启动子(Kirch,Plant Mol.Biol.33:897-909,1997)或来自拟南芥的RD29A启动子(Yamaguchi-Shinozalei等Mol.Gen.Genet.236:331-340,1993),许多冷诱导型启动子如来自拟南芥的cor15a启动子(Genbank登录号U01377)、来自大麦的blt101和blt4.8(Genbank登录号AJ310994和U63993)、来自小麦的wcs120(Genbank登录号AF031235)、来自玉米的mlip15(Genbank登录号D26563)、来自芸苔的bn115(Genbank登录号U01377)和创伤诱导型pinII启动子(欧洲专利号375091)。
优选的启动子是根特异性、进食位点特异性、病原体诱导型或线虫诱导型启动子。
本发明的另一实施方案涉及产生转基因植物的方法,所述转基因植物包含编码不显示蔗糖异构酶活性的蔗糖异构酶多肽N末端截短形式的多核苷酸,其中所述方法包括步骤:将包含本发明多核苷酸的表达载体引入植物中;并选择线虫抗性提高的转基因植物。
用于将多核苷酸引入植物基因组中并从植物组织或植物细胞再生植物的多种方法例如在Plant Molecular Biology and Biotechnology(CRCPress,Boca Raton,Florida),第6/7章,第71-119页(1993);White FF(1993)Vectors for Gene Transfer in Higher Plants;Transgenic Plants,第1卷,Engineering and Utilization,编辑:Kung和Wu R,Academic Press,15-38;Jenes B等(1993)Techniques for Gene Transfer;Transgenic Plants,第1卷,Engineering and Utilization,编辑:Kung和R.Wu,Academic Press,第128-143页;Potrykus(1991)Annu Rev Plant Physiol Plant Molec Biol42:205-225;Halford NG,Shewry PR(2000)Br Med Bull 56(1):62-73中已知。
转化方法可包括直接和间接的转化方法。合适的直接方法包括聚乙二醇诱导的DNA摄取、脂质体介导的转化(US 4,536,475)、使用基因枪的生物射弹方法(Fromm ME等(1990)Bio/Technology.8(9):833-9;Gordon-Kamm等(1990)Plant Cell 2:603)、电穿孔法、干胚在包含DNA的溶液中进行温育和显微注射法。在这些直接转化方法的情况下,所用质粒不需要满足任何特殊的要求。可使用简单质粒,如pUC系列、pBR322、M13mp系列、pACYC184等的那些质粒。如果完整植物待从转化细胞再生,那么额外的选择标记基因优选位于所述质粒上。直接转化技术等同地适用于双子叶和单子叶植物。
也可通过农杆菌的细菌侵染(例如EP 0 116 718)、病毒载体的病毒感染(EP 0 067 553;US 4,407,956;WO 95/34668;WO 93/03161)或花粉(EP 0270356;WO 85/01856;US 4,684,611)来进行转化。基于农杆菌的转化技术(尤其对双子叶植物而言)为本领域所熟知。农杆菌菌株(例如根癌农杆菌或毛根农杆菌)包含在农杆菌感染后转化至植物中的质粒(Ti或Ri质粒)和T-DNA元件。所述T-DNA(转移DNA)整合到植物细胞的基因组中。所述T-DNA可以定位在Ti或Ri质粒上,或分别包含在所谓的双元载体中。用于农杆菌介导转化的方法描述于,例如Horsch RB等(1985)Science225:1229f中。农杆菌介导的转化最适合于双子叶植物,但也适合于单子叶植物。通过农杆菌对植物的转化描述于,例如White FF,Vectors for GeneTransfer in Higher Plants,Transgenic Plants,第1卷,Engineering andUtilization,S.D.Kung和R.Wu编辑,Academic Press,1993,第15-38页;Jenes B等Techniques for Gene Transfer,Transgenic Plants,第1卷,Engineering and Utilization,S.D.Kung和R.Wu编辑,Academic Press,1993,第128-143页;Potrykus(1991)Annu Rev Plant Physiol Plant MolBiol 42:205-225。
转化可能导致暂时的或稳定的转化和表达。尽管本发明的多核苷酸可插入这些广泛类别的任何植物和植物细胞中,但其特别用于农作物植物细胞中。
本发明的多核苷酸可直接转化到质粒基因组中。质粒表达利用优于核表达基因的大量拷贝数目以允许高表达水平,在所述质粒表达中基因通过同源重组插入到每一植物细胞中存在的数千拷贝的环形质粒基因组中。在一个实施方案中,核苷酸插入到质粒靶向载体中,并转化到想要的植物宿主的质粒基因组中。获得了对含有核苷酸序列的质粒基因组同型的植物,并且其优先能够高水平表达所述核苷酸。
质粒转化技术例如广泛描述于美国专利号5,451,513、5,545,817、5,545,818和5,877,462,WO 95/16783和WO 97/32977及McBride等(1994)Proc.Natl.Acad.Sci.USA 91,7301-7305中,此处所有均以其整体引入作为参考。用于质粒转化的基本技术包括例如使用生物射弹或原生质体转化(例如,氯化钙或PEG介导的转化)将位于选择标记侧翼的克隆质粒DNA区与核苷酸序列一起引入合适的靶组织。称为靶序列的1到1.5kb侧翼区促进与质粒基因组的同源重组,并因此允许原质体系特异区域的替换或修饰。开始,赋予壮观霉素和/或链霉素抗性的叶绿体16S rRNA和rps12基因中的点突变用作转化的选择标记(Svab等(1990)PNAS 87,8526-8530;Staub等(1992)Plant Cell 4,39-45)。这些标记之间克隆位点的存在允许用于引入外源基因的质粒靶向载体的产生(Staub等(1993)EMBO J.12,601-606)。通过用显性选择标记——编码壮观霉素解毒酶氨基葡糖苷-3′-腺苷转移酶的细菌aadA基因替换隐性rRNA或r-蛋白抗生素抗性基因来获得转化频率的显著提高(Svab等(1993)PNAS 90,913-917)。用于质粒转化的其它选择标记为本领域已知并包括在本发明范围内。
本发明的转基因植物可以是任何植物,例如,但不限于树、切花、观赏植物、蔬菜或农作物植物。所述植物可选自苜蓿属(Medicago)、番茄属(Lycopersicon)、芸苔属(Brassica)、香瓜属(Cucumis)、茄属(Solanum)、核桃属(Juglans)、棉属(Gossypium)、苹果属(Malus)、葡萄属(Vitis)、金鱼草属(Antirrhinum)、杨属(Populus)、草莓属(Fragaria)、拟南芥属、云杉属(Picea)、辣椒属(Capsicum)、藜属(Chenopodium)、菊属(Dendranthema)、牵牛属(Pharbitis)、松属(Pinus)、豌豆属(Pisum)、稻属(Oryza)、玉蜀黍属(Zea)、小麦属(Triticum)、小黑麦属(Triticale)、黑麦属(Secale)、黑麦草属(Lolium)、大麦属(Hordeum)、大豆属(Glycine)、黄杉属(Pseudotsuga)、伽蓝菜属(Kalanchoe)、甜菜属(Beta)、向日葵属(Helianthus)、烟草属(Nicotiana)、南瓜属(Cucurbita)、蔷薇属(Rosa)、草莓属、百脉根属(Lotus)、苜蓿属、驴食草属(Onobrychis)、车轴草属(trifolium)、胡卢巴属(Trigonella)、豇豆属(Vigna)、橘属(Citrus)、亚麻属(Linum)、老鹳草属(Gcranium)、木薯属(Manihot)、胡芦巴属(Daucus)、萝卜属(Raphanus)、白芥属(Sinapis)、颠茄属(Atropa)、曼陀罗属(Datura)、天仙子属(Hyoscyamus)、烟草属、碧冬茄属(Petunia)、毛地黄属(Digitalis)、Majorana、菊苣属(Ciahorium)、莴苣属(Lactuca)、雀麦属(Bromus)、天门冬属(Asparagus)、金鱼草属、萱草属(Heterocallis)、水仙属(Nemesis)、天竺葵属(Pelargonium)、稷属(Panieum)、狼尾草属(Pennisetum)、毛莨属(Ranunculus)、千里光属(Senecio)、喇叭舌属(Salpiglossis)、蓝英花属(Browaalia)、菜豆属(Phaseolus)、燕麦属(Avena)和葱属(Allium),或者所述植物选自谷类植物(其包括小麦、大麦、高粱、黑麦、黑小麦、玉米、稻、甘蔗)和树(其包括苹果、梨、柑橘、梅、樱桃、桃子、油桃、杏、木瓜、芒果、白杨、松树、水杉、雪松和橡树)。如此处所用,术语“植物”可以是双子叶农作物植物,如豌豆、苜蓿、大豆、胡萝卜、芹菜、番茄、马铃薯、棉花、烟草、胡椒、油菜、甜菜、卷心菜、花椰菜、绿花椰菜、莴苣和拟南芥。在一个实施方案中,所述植物是单子叶植物或双子叶植物。
优选地,植物是农作物植物。农作物植物是农业中使用的所有植物。因此,在一个实施方案中,所述植物是单子叶植物,优选禾本科(Poaceae)、芭蕉科(Musaceae)、百合科(Liliaceae)或凤梨科(Bromeliaceae)的植物,优选禾本科家族的植物。因此,在另一实施方案中,所述植物是禾本科玉蜀黍属、小麦属、稻属、大麦属、黑麦属、燕麦属、甘蔗属(Saccharum)、高粱属(Sorghum)、狼尾草属、狗尾草属(Setaria)、稷属、蟋蟀草属(Eleusine)、芒属(Miscanthus)、短柄草属(Brachypodium)、羊茅属(Festuca)或黑麦草属植物。当植物是玉蜀黍属(Zea)时,优选的物种是玉米(Z.mays)。当植物是小麦属(Triticum)时,优选的物种是普通小麦(T.aestivum)、斯佩尔特小麦(T.speltae)或硬粒小麦(T.durum)。当植物是稻属(Oryza)时,优选的物种是稻(O.sativa)。当植物是大麦属(Hordeum)时,优选的物种是大麦(H.vulgare)。当植物是黑麦属(Secale)时,优选的物种是黑麦(S.cereale)。当植物是燕麦属(Avena)时,优选的物种是燕麦(A.sativa)。当植物是甘蔗属(Saccharum)时,优选的物种是甘蔗(S.officinarum)。当植物是高梁属(Sorghum)时,优选的物种是蜀黍(S.vulgare)、两色蜀黍(S.bicolor)或苏丹草(S.sudanense)。当植物是狼尾草属(Pennisetum)时,优选的物种是御谷(P.glaucum)。当植物是狗尾草属(Setaria)时,优选的物种是谷子(S.italica)。当植物是黍属(Panicum)时,优选的物种是野生稷(P.miliaceum)或柳枝稷(P.virgatum)。当植物是穇属(Eleusine)时,优选的物种是穇子(E.coracana)。当植物是芒属(Miscanthus)时,优选的物种是芒(M.sinensis)。当植物是羊茅属(Festuca)时,优选的物种是苇状羊茅(F.arundinaria)、紫羊茅(F.rubra)或草甸羊茅(F.pratensis)。当植物是黑麦草属(Lolium)时,优选的物种是多年生黑麦草(L.perenne)或多花黑麦草(L.multiflorum)。或者植物可以是小黑麦属(Triticosecale)。
或者在一个实施方案中,植物是双子叶植物,优选地是豆科(Fabaceae)、茄科(Solanaceae)、芸苔科(Brassicaceae)、藜科(Chenopodiaceae)、菊科(Asteraceae)、锦葵科(Malvaceae)、亚麻科(Linaceae)、大戟科(Euphorbiaceae)、旋花科(Convolvulaceae)、蔷薇科(Rosaceae)、葫芦科(Cucurbitaceae)、山茶科(Theaceae)、茜草科(Rubiaceae)、梧桐科(Sterculiaceae)或柑橘(Citrus)科的植物。在一个实施方案中,植物是豆科(Fabaceae)、茄科(Solanaceae)或芸苔科(Brassicaceae)的植物。因此,在一个实施方案中,植物是豆科(Fabaceae),优选地是大豆属(Glycine)、豌豆属(Pisum)、落花生属(Arachis)、鹰嘴豆属(Cicer)、蚕豆属(Vicia)、菜豆属(Phaseolus)、羽扇豆属(Lupinus)、苜蓿属(Medicago)或兵豆属(Lens)。优选的豆科(Fabaceae)物种是截形苜蓿(M.truncatula)、紫苜蓿(M.sativa)、大豆(G.max)、豌豆(P.sativum)、花生(A.hypogea)、鹰嘴豆(C.arietinum)、蚕豆(V.faba)、菜豆(P.vulgaris)、白羽扇豆(Lupinusalbus)、黄羽扇豆(Lupinus luteus)、狭叶羽扇豆(Lupinus angustifolius)或兵豆(Lens culinaris)。更优选的是大豆(G.max)和花生(A.hypogea)、紫苜蓿(M.sativa)物种。最优选的是大豆(G.max)。当植物是茄科(Solanaceae)时,优选的属是茄属(Solanum)、番茄属(Lycopersicon)、烟草属(Nicotiana)或辣椒属(Capsicum)。优选的茄科物种是马铃薯(S.tuberosum)、番茄(L.esculentum)、烟草(N.tabaccum)或黄灯笼辣椒(C.chinense)。更优选的是马铃薯(S.tuberosum)。因此,在一个实施方案中,植物是十字花科(Brassicaceae),优选的是芸苔(Brassica)或萝卜属(Raphanus)。优选的十字花科(Brassicaceae)物种是欧洲油菜(B.napus)、甘蓝(B.oleracea)、芥菜(B.juncea)或芜青(B.rapa)物种。更优选的是欧洲油菜(B.napus)物种。当植物是藜科(Chenopodiaceae)时,优选的属是甜菜属(Beta),优选的物种是甜菜(B.vulgaris)。当植物是菊科(Asteraceae)时,优选的属是向日葵属(Helianthus),优选的物种是向日葵(H.annuus)。当植物是锦葵科(Malvaceae)时,优选的属是棉属(Gossypium)或秋葵属(Abelmoschus)。当属是棉属(Gossypium)时,优选的物种是陆地棉(G.hirsutum)或海岛棉(G.barbadense),最优选的物种是陆地棉(G.hirsutum)。秋葵属(Abelmoschus)的优选的物种是咖啡黄葵(A.esculentus)。当植物是亚麻科(Linaceae)时,优选的属是亚麻属(Linum),优选的物种是亚麻(L.usitatissimum)。当植物是大戟科(Euphorbiaceae)时,优选的属是木薯属(Manihot)、麻风树属(Jatropa)或蓖麻属(Ricinus),优选的物种是木薯(M.esculenta)、麻风树(J.curcas)或蓖麻(R.comunis)。当植物是旋花科(Convolvulaceae)时,优选的属是番薯属(Ipomea),优选的物种是I.batatas。当植物是蔷薇科(Rosaceae)时,优选的属是蔷薇属(Rosa)、苹果属(Malus)、梨属(Pyrus)、李属(Prunus)、悬钩子属(Rubus)、茶藨子属(Ribes)、越橘属(Vaccinium)或草莓属(Fragaria),优选的物种是杂种荷兰草莓(Fragaria x ananassa)。当植物是葫芦科(Cucurbitaceae)时,优选的属是黄瓜属(Cucumis)、西瓜属(Citrullus)或南瓜属(Cucurbita),优选的物种是黄瓜(Cucumis sativus)、西瓜(Citrulluslanatus)或西葫芦(Cucurbita pepo)。当植物是山茶科(Theaceae)时,优选的属是山茶属(Camellia),优选的物种是茶(Camellia sinensis)。当植物是茜草科(Rubiaceae)时,优选的属是咖啡属(Coffea),优选的物种是小果咖啡(C.arabica)或中果咖啡(C.canephora)。当植物是梧桐科(Sterculiaceae)时,优选的属是可可树属(Theobroma),优选的物种是可可树(T.cacao)。当植物是柑橘属(Citrus)时,优选的物种是甜橙(C.sinensis)、柠檬(C.limon)、桔(C.reticulata)、柚(C.maxima)和柑橘物种的杂种等等。在本发明优选的实施方案中,所述植物是大豆、马铃薯或玉米植物。
本发明的转基因植物可用于控制植物寄生线虫对农作物感染的方法中,所述方法包括以下步骤:从包含表达盒的种子生长农作物,所述表达盒包含有效连接本发明多核苷酸的转录调节元件,其中所述表达盒稳定整合到种子基因组中。
因此,本发明包含赋予植物线虫抗性的方法,所述方法包括步骤:制备包含有效连接启动子的本发明多核苷酸的表达盒;用所述表达盒转化受体植物;产生所述受体植物的一株或更多株转基因后代;并选择后代的线虫抗性。优选地,所述启动子是根特异性的或线虫诱导型的启动子或在线虫进食位点例如合胞体或巨细胞中介导表达的启动子。
本发明可用于降低植物寄生线虫对农作物的破坏或赋予植物线虫抗性。所述线虫可以是任何植物寄生线虫,像长针线虫科(Longidoridae)、毛刺线虫科(Trichodoridae)、滑刃线虫科(Aphelenchoidida)、粒线虫科(Anguinidae)、刺线虫科(Belonolaimidae)、环线虫科(Criconematidae)、异皮线虫科(Heterodidae)、纽带线虫科(Hoplolaimidae)、根结线虫科(Mcloidogynidae)、针线虫科(Paratylenchidae)、短体线虫科(Pratylenchidae)、小垫刃线虫科(Tylenchulidae)、垫刃线虫科(Tylenchidae)等科的线虫。优选地,所述寄生线虫属于诱导巨细胞或合胞体细胞的线虫科。诱导巨细胞或合胞体细胞的线虫发现于长针线虫科、毛刺线虫科、异皮线虫科、根结线虫科、短体线虫科或小垫刃线虫科。尤其发现于异皮线虫科和根结线虫科。
因此,本发明靶定的寄生线虫属于选自伪根瘤线虫属(Naccobus)、仙人掌胞囊线虫属(Cactodera)、长形胞囊线虫属(Dolichodera)、球异皮线虫属(Globodera)、异皮线虫属(Heterodera)、Punctodera、长针线虫线虫属(Longidorus)或根结线虫属(Meloidogyne)的一种或更多种。在优选的实施方案中,所述寄生线虫属于选自伪根瘤线虫属(Naccobus)、仙人掌胞囊线虫属、长形胞囊线虫属、球异皮线虫属、异皮线虫属、Punctodera或根结线虫属的一种或更多种。在更优选的实施方案中,所述寄生线虫属于选自球异皮线虫属、异皮线虫属或根结线虫属的一种或更多种。在甚至更优选的实施方案中,所述寄生线虫属于选自球异皮线虫属或异皮线虫的一种或两种。在另一实施方案中,所述寄生线虫属于根结线虫属。
当寄生线虫是球形胞囊线虫属(Globodera)时时,物种优选选自蕃草属球异皮线虫(G.achilleae)、蒿球异皮线虫(G.artemisiae)、G.hypolysi、G.mexicana、G.millefolii、苹果球异皮线虫(G.mali)、马铃薯白线虫(G.pallida)、马铃薯金线虫(G.rostochiensis)、烟草异皮线虫(G.Tabacum)和G.virginiae。在另一优选实施方案中,所述球形胞囊线虫属包括马铃薯白线虫(G.pallida)、烟草异皮线虫(G.Tabacum)或马铃薯金线虫(G.rostochiensis)中的至少一种。当所述寄生线虫是异皮线虫属时,物种优选选自燕麦异皮线虫(H.avenae)、胡萝卜异皮线虫(H.carotae)、鹰嘴豆异皮线虫(H.ciceri)、十字花科异皮线虫(H.cruciferae)、H.delvii、褐藻异皮线虫(H.elachista)、菲力普异皮线虫(H.filipjevi)、H.gambiensis、大豆异皮线虫、豌豆异皮线虫(H.goettingiana)、荞麦异皮线虫(H.graduni)、蛇麻异皮线虫(H.humuli)、大麦异皮线虫(H.hordecalis)、小麦异皮线虫(H.latipons)、燕麦异皮线虫(H.major)、苜蓿异皮线虫(H.medicaginis)、水稻异皮线虫(H.oryzicola)、巴基斯坦异皮线虫(H.pakistanensis)、酸模异皮线虫(H.rosii)、甘蔗异皮线虫(H.sacchari)、甜菜异皮线虫、高粱异皮线虫(H.sorghi)、车轴草异皮线虫、荨麻异皮线虫(H.urticae)、豇豆异皮线虫(H.vigni)和玉米异皮线虫(H.zeae)。在另一优选实施方案中,所述寄生异皮线虫包括大豆异皮线虫、燕麦异皮线虫、木豆异皮线虫、豌豆异皮线虫、车轴草异皮线虫、玉米异皮线虫或甜菜异皮线虫中的至少一种。在更优选的实施方案中,所述寄生线虫包括大豆异皮线虫或甜菜异皮线虫中的至少一种。在最优选的实施方案中,所述寄生线虫是大豆异皮线虫。
当寄生线虫是根结线虫属时,所述寄生线虫选自高粱根结线虫(M.acronea)、M.arabica、花生根结线虫(M.arenaria)、甘蓝根结线虫(M.artiellia)、短尾根结线虫(M.brevicauda)、山茶根结线虫(M.camelliae)、哥伦比亚根结线虫(M.chitwoodi)、咖啡根结线虫(M.cofeicola)、短小根结线虫(M.esigua)、禾草根结线虫(M.graminicola)、北方根结线虫(M.hapla)、南方根结线虫、印度根结线虫(M.indica)、海滨根结线虫(M.inornata)、爪哇根结线虫(M.javanica)、林氏根结线虫(M.lini)、苹果根结线虫(M.mali)、小头根结线虫(M.microcephala)、小突根结线虫(M.microtyla)、纳西根结线虫(M.naasi)、萨拉斯根结线虫(M.salasi)和泰晤士根结线虫(M.thamesi)。在优选的实施方案中,所述寄生线虫包括爪哇根结线虫、南方根结线虫、北方根结线虫、花生根结线虫或哥伦比亚根结线虫中的至少一种。
尽管已经按照某些实施方案描述了本发明的组合物和方法,但对本领域技术人员显而易见的是,变型可应用于此处描述的组合物、方法和方法的步骤或步骤顺序,而不背离本发明的概念、精神和范围。
实施例
实施例1:克隆编码蔗糖异构酶N末端截短形式的多核苷酸
在PCR反应中使用约0.1μg含欧文氏菌(Erwinia)蔗糖异构酶AF279281序列的质粒DNA作为DNA模板。用于PCR扩增截短的蔗糖异构酶序列的引物示于表1中并基于AF279281序列设计。由SEQ ID NO:12描述的引物序列包含用于容易克隆的AscI限制性位点。由SEQ ID NO:13描述的引物序列包含用于容易克隆的XhoI限制性位点。由SEQ ID NO:12和SEQ ID NO:13描述的引物序列用于扩增截短的蔗糖异构酶序列。
扩增到的DNA片段大小通过标准琼脂糖凝胶电泳进行验证并从凝胶中提取DNA。将纯化后的片段使用TOPO TA克隆试剂盒参照制造商说明书(Invitrogen)TOPO克隆至pCR2.1中。使用Applied Biosystem373A(Applied Biosystems,Foster City,California,美国)自动测序仪对克隆到的片段测序并通过使用来自序列分析工具Vector NTI(Invitrogen,Carlsbad,California,美国)中的序列比对ClustalW(EuropeanBioinformatics Institute,Cambridge,英国)验证该序列为预期的序列。由SEQ ID NO:1描述了编码欧文氏菌蔗糖异构酶的N末端截短形式的多核苷酸。为了方便克隆而在引物中引入的限制性位点不包含于序列之中。
表1:用于扩增SEQ ID NO:1多核苷酸的引物
引物名称 | 序列 | 目的 | SEQ ID NO: |
JT28引物 | GGCGCGCCACCATGAAAGAATACGGTACGATGGAAGAC | 5’引物 | 12 |
JT59引物 | CTCGAGCTACGGATTAAGTTTATAAATGCCCGACTG | 3’引物 | 13 |
实施例2:用于转化的载体构建
为了评估所克隆的编码蔗糖异构酶编码基因N末端截短形式的欧文氏菌多核苷酸的功能,使用限制性内切酶AscI和XhoI将对应于SEQ IDNO:1的1-1464位核苷酸的基因片段克隆到启动子下游,以产生表2中描述的有效连接到所述启动子序列上的表达载体。合胞体优选的启动子包括大豆MTN3启动子SEQ ID NO:7(p-47116125)(USSN 60/899,714)、拟南芥过氧化物酶POX启动子SEQ ID NO:8(p-At5g05340)(USSN 60/876,416)、拟南芥TPP 6磷酸海藻糖磷酸酶启动子SEQ ID NO:9(p-At1g35910)(USSN 60/874,375)、MTN21启动子SEQ ID NO:10(p-At1g21890)(USSN60/743,341)和At5g12170类启动子SEQ ID NO:11(USSN 60/899,693)。表2中描述的双元载体中的植物选择标记为来自拟南芥的乙酰羟酸合酶(AHAS)基因的抗除草剂形式(Sathasivan等,Plant Phys.97:1044-50,1991)。使用ARSENAL(灭草烟,BASF Corp,Florham Park,NJ)作为选择剂。
表2:包含SEQ ID NO:1片段的表达载体
载体 | 表达盒的组成(启动子::NCP编码基因) |
RJT21 | 超级启动子::SEQ ID NO:1 |
RJT22 | p-At1g21890::SEQ ID NO:1 |
RJT23 | p-47116125::SEQ ID NO:1 |
RJT51 | p-At5g05340::SEQ ID NO:1 |
RJT52 | p-At5g12170::SEQ ID NO:1 |
RJT53 | p-At1g35910::SEQ ID NO:1 |
实施例3:转基因大豆发根的产生和线虫生物测定
通过电穿孔将双元载体RJT21、RJT22、RJT23、RJT51、RJT52和RJT53转化至毛根农杆菌K599菌株中。使用已知方法用转化的农杆菌菌株诱导大豆发根的形成。通过使用未转化的毛根农杆菌还从大豆栽培种Williams 82(SCN易感)和Jack(SCN抗性)中产生非转基因发根,用作测定中线虫生长的对照。
在转化有载体的转基因发根和作为对照的来自Williams 82和Jack的非转基因发根中进行生物测定以评估线虫抗性。每一双元载体转化中产生几个独立的发根系用于生物测定。对于每一转化株系而言,根据如上概述的方法用SCN接种几个重复孔。线虫接种四周后,对每孔中的胞囊数进行计数并测定雌性指数。雌性指数为胞囊数与易感的栽培种Williams82相比的关系。
对于每一转化株系,测定重复孔的数目(n)、每孔胞囊的平均数(MEAN)和标准差(SE)值。结果表明检测的六个构建体中五个在多个转基因株系上显示出胞囊数显著下降。构建体RJT21、RJT22、RJT23、RJT52和RJT53的生物测定结果显示在多个转基因株系上胞囊数的统计学显著减少(p值<0.05)并且在测定的大部分转基因株系中显示出胞囊数降低的一般趋势。构建体RJT51的生物测定结果未显示出对胞囊数的显著影响。
实施例4:SEQ ID NO:1蔗糖异构酶片段的蔗糖异构酶测定
由SEQ ID NO:1代表的蔗糖异构酶多核苷酸的N末端截短形式是来自大黄欧文氏菌(Erwinia rhapontici)的蔗糖异构酶的截短形式(由SEQ IDNO:3代表的登陆号为AF279281的序列)。SEQ ID NO:1和SEQ ID NO:3的DNA序列比对示于图1中。由SEQ ID NO:1描述的截短的NCP DNA序列编码的多核苷酸(SEQ ID NO:2)导致N末端截短。基于实验数据由SEQ ID NO:2描述的多肽不具有蔗糖异构酶活性。
进行两组实验(下文中的测定A和测定B)来证实NCP的截短形式不能作为蔗糖异构酶起作用(即截短的蛋白质不能催化蔗糖异构化成异麦芽酮糖(palatinose))。
测定A.使用转基因大豆根的蔗糖异构酶活性测定
对转化有RJT51和RJT53的转基因大豆根进行分析。从根样品中提取蔗糖并在HPLC上对一式三份等分试样进行跑样。对照样品由W82和Jack组成并且W82补充有外部异麦芽酮糖(W82+异麦芽酮糖)。除阳性对照(W82+异麦芽酮糖)外,在分析的任何样品中未检测到异麦芽酮糖。
测定B.使用大肠杆菌的蔗糖异构酶活性测定
产生用于在细菌中表达的含全长(SEQ ID NO:3)和截短的(SEQ IDNO:1)欧文氏菌蔗糖异构酶基因的构建体。此外,产生用于在细菌中表达的含全长(SEQ ID NO:6)和截短的(SEQ ID NO:4)普城沙雷氏菌蔗糖异构酶基因的构建体。由SEQ ID NO:5描述的截短的沙雷氏菌蔗糖异构酶氨基酸序列与由SEQ ID NO:2描述的截短的欧文氏菌蔗糖异构酶序列间的氨基酸总体百分比同一性示于图2。四个构建体包含在IPTG诱导型启动子调控下的来自欧文氏菌和沙雷氏菌的指定的全长和截短的蔗糖异构酶基因。将四个构建体转化至大肠杆菌并且蔗糖异构酶表达不受IPTG诱导(样品a)或通过加入IPTG而得以诱导(样品b)。加入IPTG后,来自转基因细菌的粗提物与90mM蔗糖温育。在加入蔗糖后0分钟和60分钟取出样品。在指定的时间点上,中止反应并且将混合物中的等分试样注射进HPLC来测定蔗糖含量。观察到加入IPTG对实验结果没有关键影响,说明本实验中使用的IPTG诱导型启动子在未加入IPTG时在某种程度上也是有活性的。结果显示两种全长基因形式(来自欧文氏菌和沙雷氏菌)都具有蔗糖异构酶活性,因为当蔗糖完全耗尽时,在60分钟温育后均可产生显著量的异麦芽酮糖。与之相反,两个截短的基因形式在相同的实验条件下未能产生任何可检测到的异麦芽酮糖,并且蔗糖峰保持不变。结果示于表3中。
表3.测定在大肠杆菌中表达的构建体的蔗糖含量的HPLC测定
样品名称 | 蔗糖(nmol) | 异麦芽酮糖(nmol) | 海藻糖(nmol) |
SRS73-5a T0 | 1870.0 | n.a. | n.a. |
SRS73-5a T60 | 2239.1 | n.a. | n.a. |
SRS73-5b T0 | 1918.5 | n.a. | n.a. |
SRS73-5b T60 | 2277.7 | n.a. | n.a. |
SRS74-4a T0 | 1944.2 | 239.2 | 1.7 |
SRS74-4a T60 | 17.2 | 1911.0 | 189.8 |
SRS74-4b T0 | 1186.4 | 137.2 | n.a. |
SRS74-4b T60 | 9.3 | 3254.6 | 315.7 |
SRS75-2a T0 | 1834.0 | n.a. | n.a. |
SRS75-2a T60 | 2024.7 | n.a. | n.a. |
SRS75-2b T0 | 1907.3 | n.a. | n.a. |
SRS75-2b T60 | 1700.5 | n.a. | n.a. |
SRS76-3a T0 | 1952.0 | 8.2 | n.a. |
SRS76-3a T60 | 414.6 | 1093.8 | 38.9 |
SRS76-3b T0 | 2315.8 | 10.9 | n.a. |
SRS76-3b T60 | 96.8 | 3238.4 | 144.0 |
SRS73-5 NCP截短的欧文氏菌蔗糖异构酶(SEQ ID NO:1)
SRS74-4 全长欧文氏菌蔗糖异构酶(SEQ ID NO:3)
SRS75-2 截短的沙雷氏菌蔗糖异构酶(SEQ ID NO:4)
SRS76-3 全长沙雷氏菌蔗糖异构酶(SEQ ID NO:6)
表中:“a”样品:未加IPTG;“b”样品:加入IPTG
实施例5:蔗糖异构酶多肽的额外的N末端截短形式
如实施例3中所公开的,当与组成型启动子和线虫诱导型启动子有效连接并在大豆根中表达时,由SEQ ID NO:1描述的欧文氏菌蔗糖异构酶NCP基因的截短形式导致降低的胞囊数。如实施例4中所公开的,已显示欧文氏菌蔗糖异构酶基因的截短形式不具有蔗糖异构酶活性。此外,来自沙雷氏菌的同源蔗糖异构酶基因的截短形式不具有蔗糖异构酶活性,如实施例4中所示。
使用BLAST算法用由SEQ ID NO:2描述的截短的欧文氏菌蔗糖异构酶氨基酸序列来鉴定同源基因。鉴定到与由SEQ ID NO:2描述的欧文氏菌蔗糖异构酶多肽的N末端截短形式同源的几个示例性蔗糖异构酶基因的截短形式并且由SEQ ID NO:5和SEQ ID NO 14-20描述。所述同源物表现出与由SEQ ID NO:2描述的欧文氏菌截短的蔗糖异构酶NCP基因的一定范围的同源性。鉴定到的截短的同源物与由SEQ ID NO:2描述的欧文氏菌截短的蔗糖异构酶的氨基酸比对示于图3中。显示鉴定到的同源物和SEQID NO:2相互之间的百分比同一性的矩阵表示于图4中。
实施例6:同源物的载体构建
将对应于由SEQ ID NO:5和SEQ ID NO 14-20描述的氨基酸序列的核苷酸序列克隆入植物双元载体中,所述核苷酸序列编码与由SEQ IDNO:2描述的欧文氏菌截短的蔗糖异构酶同源的基因的截短形式。SEQ IDNO:4和SEQ ID NO 21-27描述了截短的同源物DNA序列。使用标准克隆技术将所描述的核苷酸序列有效连接到由SEQ ID NO:9描述的线虫诱导型启动子p-At1g35910上。双元载体中的植物选择标记产生对除草剂灭草烟(Imazapyr)的抗性。
实施例7:生物测定和胞囊计数
使用在共有的未决USSN 12/001,234中公开的生根植物测定系统进行生物测定来评估由SEQ ID NO:1的截短的欧文氏菌蔗糖异构酶赋予的线虫抗性。在转化有实施例6中描述的双元载体后产生转基因根。将多个转基因根系传代培养并且以约500J2/孔的水平用表面净化的品种3SCN第二阶段的幼虫(J2)接种。线虫接种后四周,对每孔中的胞囊数进行计数。对于每一转化构建体,计算每株系的胞囊数以测定平均胞囊数和构建体的标准误。每一转化构建体的胞囊数值与平行检测的空载体对照的胞囊数值相比较来确定所检测的构建体是否导致胞囊数目的下降。
本领域中的技术人员将认识到或使用仅仅例行试验能够确认此处描述的本发明的特定实施方案的许多等同方案。此类等同方案旨在包括在以下权利要求中。
序列表
<110>巴斯福植物科学有限公司
A·威格
<120>编码用于控制寄生线虫的截短的蔗糖异构酶多肽的多核苷酸
<130>PF 58853
<160>27
<170>PatentIn版本3.4
<210>1
<211>1464
<212>DNA
<213>大黄欧文氏菌
<400>1
atgaaagaat acggtacgat ggaagacttt gaccgtctta tttcagaaat gaagaaacgc 60
aatatgcgtt tgatgattga tattgttatc aaccacacca gcgatcagca tgcctggttt 120
gttcagagca aatcgggtaa gaacaacccc tacagggact attacttctg gcgtgacggt 180
aaggatggcc atgcccccaa taactatccc tccttcttcg gtggctcagc ctgggaaaaa 240
gacgataaat caggccagta ttacctccat tactttgcca aacagcaacc cgacctcaac 300
tgggacaatc ccaaagtccg tcaagacctg tatgacatgc tccgcttctg gttagataaa 360
ggcgtttctg gtttacgctt tgataccgtt gccacctact cgaaaatccc gaacttccct 420
gaccttagcc aacagcagtt aaaaaatttc gccgaggaat atactaaagg tcctaaaatt 480
cacgactacg tgaatgaaat gaacagagaa gtattatccc actatgatat cgccactgcg 540
ggggaaatat ttggggttcc tctggataaa tcgattaagt ttttcgatcg ccgtagaaat 600
gaattaaata tagcgtttac gtttgatctg atcaggctcg atcgtgatgc tgatgaaaga 660
tggcggcgaa aagactggac cctttcgcag ttccgaaaaa ttgtcgataa ggttgaccaa 720
acggcaggag agtatgggtg gaatgccttt ttcttagaca atcacgacaa tccccgcgcg 780
gtttctcact ttggtgatga tcgaccacaa tggcgcgagc atgcggcgaa agcactggca 840
acattgacgc tgacccagcg tgcaacgccg tttatctatc agggttcaga actcggtatg 900
accaattatc cctttaaaaa aatcgatgat ttcgatgatg tagaggtgaa aggtttttgg 960
caagactacg ttgaaacagg caaagtgaaa gctgaggaat tccttcaaaa cgtacgccaa 1020
accagccgtg ataacagcag aacccccttc cagtgggatg caagcaaaaa cgcgggcttt 1080
accagtggaa ccccctggtt aaaaatcaat cccaattata aagaaatcaa cagcgcagat 1140
cagattaata atccaaattc cgtatttaac tattatagaa agctgattaa cattcgccat 1200
gacatccctg ccttgaccta cggcagttat attgatttag accctgacaa caattcagtc 1260
tatgcttaca cccgaacgct cggcgctgaa aaatatcttg tggtcattaa ttttaaagaa 1320
gaagtgatgc actacaccct gcccggggat ttatccatca ataaggtgat tactgaaaac 1380
aacagtcaca ctattgtgaa taaaaatgac aggcaactcc gtcttgaacc ctggcagtcg 1440
ggcatttata aacttaatcc gtag 1464
<210>2
<211>487
<212>PRT
<213>大黄欧文氏菌
<400>2
Met Lys Glu Tyr Gly Thr Met Glu Asp Phe Asp Arg Leu Ile Ser Glu
1 5 10 15
Met Lys Lys Arg Asn Met Arg Leu Met Ile Asp Ile Val Ile Asn His
20 25 30
Thr Ser Asp Gln His Ala Trp Phe Val Gln Ser Lys Ser Gly Lys Asn
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Phe Trp Arg Asp Gly Lys Asp Gly His
50 55 60
Ala Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Glu Lys
65 70 75 80
Asp Asp Lys Ser Gly Gln Tyr Tyr Leu His Tyr Phe Ala Lys Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Asn Pro Lys Val Arg Gln Asp Leu Tyr Asp
100 105 110
Met Leu Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Leu Arg Phe Asp
115 120 125
Thr Val Ala Thr Tyr Ser Lys Ile Pro Asn Phe Pro Asp Leu Ser Gln
130 135 140
Gln Gln Leu Lys Asn Phe Ala Glu Glu Tyr Thr Lys Gly Pro Lys Ile
145 150 155 160
His Asp Tyr Val Asn Glu Met Asn Arg Glu Val Leu Ser His Tyr Asp
165 170 175
Ile Ala Thr Ala Gly Glu Ile Phe Gly Val Pro Leu Asp Lys Ser Ile
180 185 190
Lys Phe Phe Asp Arg Arg Arg Asn Glu Leu Asn Ile Ala Phe Thr Phe
195 200 205
Asp Leu Ile Arg Leu Asp Arg Asp Ala Asp Glu Arg Trp Arg Arg Lys
210 215 220
Asp Trp Thr Leu Ser Gln Phe Arg Lys Ile Val Asp Lys Val Asp Gln
225 230 235 240
Thr Ala Gly Glu Tyr Gly Trp Asn Ala Phe Phe Leu Asp Asn His Asp
245 250 255
Asn Pro Arg Ala Val Ser His Phe Gly Asp Asp Arg Pro Gln Trp Arg
260 265 270
Glu His Ala Ala Lys Ala Leu Ala Thr Leu Thr Leu Thr Gln Arg Ala
275 280 285
Thr Pro Phe Ile Tyr Gln Gly Ser Glu Leu Gly Met Thr Asn Tyr Pro
290 295 300
Phe Lys Lys Ile Asp Asp Phe Asp Asp Val Glu Val Lys Gly Phe Trp
305 310 315 320
Gln Asp Tyr Val Glu Thr Gly Lys Val Lys Ala Glu Glu Phe Leu Gln
325 330 335
Asn Val Arg Gln Thr Ser Arg Asp Asn Ser Arg Thr Pro Phe Gln Trp
340 345 350
Asp Ala Ser Lys Asn Ala Gly Phe Thr Ser Gly Thr Pro Trp Leu Lys
355 360 365
Ile Asn Pro Asn Tyr Lys Glu Ile Asn Ser Ala Asp Gln Ile Asn Asn
370 375 380
Pro Asn Ser Val Phe Asn Tyr Tyr Arg Lys Leu Ile Asn Ile Arg His
385 390 395 400
Asp Ile Pro Ala Leu Thr Tyr Gly Ser Tyr Ile Asp Leu Asp Pro Asp
405 410 415
Asn Asn Ser Val Tyr Ala Tyr Thr Arg Thr Leu Gly Ala Glu Lys Tyr
420 425 430
Leu Val Val Ile Asn Phe Lys Glu Glu Val Met His Tyr Thr Leu Pro
435 440 445
Gly Asp Leu Ser Ile Asn Lys Val Ile Thr Glu Asn Asn Ser His Thr
450 455 460
Ile Val Asn Lys Asn Asp Arg Gln Leu Arg Leu Glu Pro Trp Gln Ser
165 470 475 480
Gly Ile Tyr Lys Leu Asn Pro
485
<210>3
<211>1803
<212>DNA
<213>大黄欧文氏菌
<400>3
atgtcctctc aaggattgaa aacggctgtc gctatttttc ttgcaaccac tttttctgcc 60
acatcctatc aggcctgcag tgccgggcca gataccgccc cctcactcac cgttcagcaa 120
tcaaatgccc tgcccacatg gtggaagcag gctgtttttt atcaggtata tccacgctca 180
tttaaagata cgaatgggga tggcattggg gatttaaacg gtattattga gaatttagac 240
tatctgaaga aactgggtat tgatgcgatt tggatcaatc cacattacga ttcgccgaat 300
acggataatg gttatgacat ccgggattac cgtaagataa tgaaagaata cggtacgatg 360
gaagactttg accgtcttat ttcagaaatg aagaaacgca atatgcgttt gatgattgat 420
attgttatca accacaccag cgatcagcat gcctggtttg ttcagagcaa atcgggtaag 480
aacaacccct acagggacta ttacttctgg cgtgacggta aggatggcca tgcccccaat 540
aactatccct ccttcttcgg tggctcagcc tgggaaaaag acgataaatc aggccagtat 600
tacctccatt actttgccaa acagcaaccc gacctcaact gggacaatcc caaagtccgt 660
caagacctgt atgacatgct ccgcttctgg ttagataaag gcgtttctgg tttacgcttt 720
gataccgttg ccacctactc gaaaatcccg aacttccctg accttagcca acagcagtta 780
aaaaatttcg ccgaggaata tactaaaggt cctaaaattc acgactacgt gaatgaaatg 840
aacagagaag tattatccca ctatgatatc gccactgcgg gggaaatatt tggggttcct 900
ctggataaat cgattaagtt tttcgatcgc cgtagaaatg aattaaatat agcgtttacg 960
tttgatctga tcaggctcga tcgtgatgct gatgaaagat ggcggcgaaa agactggacc 1020
ctttcgcagt tccgaaaaat tgtcgataag gttgaccaaa cggcaggaga gtatgggtgg 1080
aatgcctttt tcttagacaa tcacgacaat ccccgcgcgg tttctcactt tggtgatgat 1140
cgaccacaat ggcgcgagca tgcggcgaaa gcactggcaa cattgacgct gacccagcgt 1200
gcaacgccgt ttatctatca gggttcagaa ctcggtatga ccaattatcc ctttaaaaaa 1260
atcgatgatt tcgatgatgt agaggtgaaa ggtttttggc aagactacgt tgaaacagge 1320
aaagtgaaag ctgaggaatt ccttcaaaac gtacgccaaa ccagccgtga taacagcaga 1380
acccccttcc agtgggatgc aagcaaaaac gcgggcttta ccagtggaac cccctggtta 1440
aaaatcaatc ccaattataa agaaatcaac agcgcagatc agattaataa tccaaattcc 1500
gtatttaact attatagaaa gctgattaac attcgccatg acatccctgc cttgacctac 1560
ggcagttata ttgatttaga ccctgacaac aattcagtct atgcttacac ccgaacgctc 1620
ggcgctgaaa aatatcttgt ggtcattaat tttaaagaag aagtgatgca ctacaccctg 1680
cccggggatt tatccatcaa taaggtgatt actgaaaaca acagtcacac tattgtgaat 1740
aaaaatgaca ggcaactccg tcttgaaccc tggcagtcgg gcatttataa acttaatccg 1800
tag 1803
<210>4
<211>1464
<212>DNA
<213>沙雷氏菌
<400>4
atgaaagaat atggcacgat ggaggatttt gaccgcctga tttctgaaat gaaaaaacgt 60
aacatgcggt tgatgattga tgtggtcatc aaccacacca gcgatcaaaa cgaatggttt 120
gttaaaagta aaagcagtaa ggataatcct tatcgtggct attacttctg gaaagatgct 180
aaagaagggc aggcgcctaa taattaccct tcattctttg gtggctcggc gtggcaaaaa 240
gatgaaaaga ccaatcaata ctacctgcac tattttgcta aacaacagcc tgacctaaac 300
tgggataacc ccaaagtccg tcaagatctt tatgcaatgt tgcgtttctg gttagataaa 360
ggcgtgtctg gtttacgctt tgatacggta gcgacctact caaaaattcc ggacttccca 420
aatctcaccc aacaacagct gaagaatttt gcagctgagt ataccaaggg ccctaatatt 480
catcgttacg tcaatgaaat gaatagagaa gttttgtctc attacgacat tgccactgcc 540
ggtgaaatct ttggcgtacc cttggatcaa tcgataaaat tcttcgatcg ccgtcgcgat 600
gagctgaaca tcgcatttac ctttgactta atcagactcg atcgagactc tgatcaaaga 660
tggcgtcgaa aagagtggaa attgtcgcaa ttccgacagg tcatcgataa cgttgaccgt 720
actgccggcg aatatggttg gaatgccttc ttcttggata accacgacaa tccgcgcgct 780
gtctcccact ttggcgatga tcgcccacaa tggcgcgagc catcggctaa agcgcttgca 840
accttgacgc tgactcaacg agcaacgcct tttatttatc aaggttcaga attgggcatg 900
accaattacc ccttcaaagc tattgatgaa ttcgatgata ttgaggtgaa aggtttttgg 960
catgactacg ttgagacagg aaaggtgaaa gccgacgagt tcttgcaaaa tgtacgcctg 1020
acgagcaggg ataacagccg gacaccgttc caatgggata cgagcaaaaa tgcaggattc 1080
acgagcggaa aaccttggtt caaggtcaat ccaaactacc aggaaatcaa tgcggtaagt 1140
caagtcgcac agcccgactc ggtatttaat tattatcgtc agttgatcaa gataaggcat 1200
aacatcccgg cactgaccta tggcacatac accgatttgg atcctgcaaa tgattcggtc 1260
tacgcctata cacgcagcct tggggcggaa aaatatcttg ttgtcgttaa cttccaggaa 1320
caagtgatga gatataaatt accggataat ctatccatcg agaaagtgat tatagaaagc 1380
aacagcaaaa acgttgtgaa aaagaatgat tccttactcg aactaaaacc atggcagtca 1440
ggggtttata aactaaatca ataa 1464
<210>5
<211>487
<212>PRT
<213>沙雷氏菌
<400>5
Met Lys Glu Tyr Gly Thr Met Glu Asp Phe Asp Arg Leu Ile Ser Glu
1 5 10 15
Met Lys Lys Arg Asn Met Arg Leu Met Ile Asp Val Val Ile Asn His
20 25 30
Thr Ser Asp Gln Asn Glu Trp Phe Val Lys Ser Lys Ser Ser Lys Asp
35 40 45
Asn Pro Tyr Arg Gly Tyr Tyr Phe Trp Lys Asp Ala Lys Glu Gly Gln
50 55 60
Ala Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Gln Lys
65 70 75 80
Asp Glu Lys Thr Asn Gln Tyr Tyr Leu His Tyr Phe Ala Lys Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Asn Pro Lys Val Arg Gln Asp Leu Tyr Ala
100 105 110
Met Leu Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Leu Arg Phe Asp
115 120 125
Thr Val Ala Thr Tyr Ser Lys Ile Pro Asp Phe Pro Asn Leu Thr Gln
130 135 140
Gln Gln Leu Lys Asn Phe Ala Ala Glu Tyr Thr Lys Gly Pro Asn Ile
145 150 155 160
His Arg Tyr Val Asn Glu Met Asn Arg Glu Val Leu Ser His Tyr Asp
165 170 175
Ile Ala Thr Ala Gly Glu Ile Phe Gly Val Pro Leu Asp Gln Ser Ile
180 185 190
Lys Phe Phe Asp Arg Arg Arg Asp Glu Leu Asn Ile Ala Phe Thr Phe
195 200 205
Asp Leu Ile Arg Leu Asp Arg Asp Ser Asp Gln Arg Trp Arg Arg Lys
210 215 220
Glu Trp Lys Leu Ser Gln Phe Arg Gln Val Ile Asp Asn Val Asp Arg
225 230 235 240
Thr Ala Gly Glu Tyr Gly Trp Asn Ala Phe Phe Leu Asp Asn His Asp
245 250 255
Asn Pro Arg Ala Val Ser His Phe Gly Asp Asp Arg Pro Gln Trp Arg
260 265 270
Glu Pro Ser Ala Lys Ala Leu Ala Thr Leu Thr Leu Thr Gln Arg Ala
275 280 285
Thr Pro Phe Ile Tyr Gln Gly Ser Glu Leu Gly Met Thr Asn Tyr Pro
290 295 300
Phe Lys Ala Ile Asp Glu Phe Asp Asp Ile Glu Val Lys Gly Phe Trp
305 310 315 320
His Asp Tyr Val Glu Thr Gly Lys Val Lys Ala Asp Glu Phe Leu Gln
325 330 335
Asn Val Arg Leu Thr Ser Arg Asp Asn Ser Arg Thr Pro Phe Gln Trp
340 345 350
Asp Thr Ser Lys Asn Ala Gly Phe Thr Ser Gly Lys Pro Trp Phe Lys
355 360 365
Val Asn Pro Asn Tyr Gln Glu Ile Asn Ala Val Ser Gln Val Ala Gln
370 375 380
Pro Asp Ser Val Phe Asn Tyr Tyr Arg Gln Leu Ile Lys Ile Arg His
385 390 395 400
Asn Ile Pro Ala Leu Thr Tyr Gly Thr Tyr Thr Asp Leu Asp Pro Ala
405 410 415
Asn Asp Ser Val Tyr Ala Tyr Thr Arg Ser Leu Gly Ala Glu Lys Tyr
420 425 430
Leu Val Val Val Asn Phe Gln Glu Gln Val Met Arg Tyr Lys Leu Pro
435 440 445
Asp Asn Leu Ser Ile Glu Lys Val Ile Ile Glu Ser Asn Ser Lys Asn
450 455 460
Val Val Lys Lys Asn Asp Ser Leu Leu Glu Leu Lys Pro Trp Gln Ser
465 470 475 480
Gly Val Tyr Lys Leu Asn Gln
485
<210>6
<211>1803
<212>DNA
<213>沙雷氏菌
<400>6
atgccccgtc aaggattgaa aactgcacta gcgatttttc taaccacatc attaagcgtc 60
tcatgccagc aagccttagg tacgcaacaa cccttgctta acgaaaagag tatcgaacag 120
tcgaaaacca tacctaaatg gtggaaggag gctgtttttt atcaggtgta tccgcgttcc 180
tttaaagaca ctaacgggga tggtatcggg gatattaaag gcatcataga aaaattagac 240
tatttaaaag ctttggggat tgatgccatt tggatcaacc cacattatga ctccccgaac 300
acggataatg gttacgatat acgtgattat cgaaaaatca tgaaagaata tggcacgatg 360
gaggattttg accgcctgat ttctgaaatg aaaaaacgta acatgcggtt gatgattgat 420
gtggtcatca accacaccag cgatcaaaac gaatggtttg ttaaaagtaa aagcagtaag 480
gataatcctt atcgtggcta ttacttctgg aaagatgcta aagaagggca ggcgcctaat 540
aattaccctt cattctttgg tggctcggcg tggcaaaaag atgaaaagac caatcaatac 600
tacctgcact attttgctaa acaacagcct gacctaaact gggataaccc caaagtccgt 660
caagatcttt atgcaatgtt gcgtttctgg ttagataaag gcgtgtctgg tttacgcttt 720
gatacggtag cgacctactc aaaaattccg gacttcccaa atctcaccca acaacagctg 780
aagaattttg cagctgagta taccaagggc cctaatattc atcgttacgt caatgaaatg 840
aatagagaag ttttgtctca ttacgacatt gccactgccg gtgaaatctt tggcgtaccc 900
ttggatcaat cgataaaatt cttcgatcgc cgtcgcgatg agctgaacat cgcatttacc 960
tttgacttaa tcagactcga tcgagactct gatcaaagat ggcgtcgaaa agagtggaaa 1020
ttgtcgcaat tccgacaggt catcgataac gttgaccgta ctgccggcga atatggttgg 1080
aatgccttct tcttggataa ccacgacaat ccgcgcgctg tctcccactt tggcgatgat 1140
cgcccacaat ggcgcgagcc atcggctaaa gcgcttgcaa ccttgacgct gactcaacga 1200
gcaacgcctt ttatttatca aggttcagaa ttgggcatga ccaattaccc cttcaaagct 1260
attgatgaat tcgatgatat tgaggtgaaa ggtttttggc atgactacgt tgagacagga 1320
aaggtgaaag ccgacgagtt cttgcaaaat gtacgcctga cgagcaggga taacagccgg 1380
acaccgttcc aatgggatac gagcaaaaat gcaggattca cgagcggaaa accttggttc 1440
aaggtcaatc caaactacca ggaaatcaat gcggtaagtc aagtcgcaca gcccgactcg 1500
gtatttaatt attatcgtca gttgatcaag ataaggcata acatcccggc actgacctat 1560
ggcacataca ccgatttgga tcctgcaaat gattcggtct acgcctatac acgcagcctt 1620
ggggcggaaa aatatcttgt tgtcgttaac ttccaggaac aagtgatgag atataaatta 1680
ccggataatc tatccatcga gaaagtgatt atagaaagca acagcaaaaa cgttgtgaaa 1740
aagaatgatt ccttactcga actaaaacca tggcagtcag gggtttataa actaaatcaa 1800
taa 1803
<210>7
<211>609
<212>DNA
<213>大豆
<400>7
gaagccacgt catgaagagt atatcatttc agtaatgttt tgagacgcct ctataatgct 60
ttaccaacaa aacaaaacaa aaaaaagaac atttgaaacc atttgtatta aaaaaaaaaa 120
ggtatattag gccataatat tataggtaac atgaaatatc aaatgacacg caagagtttt 180
gtcaaaaatg aaaccatcac acatcagaga ttatggcaaa taatgttttg tgtgtctctt 240
gcttcaccca taacataagc ctctataact ggagagaaga aaaaaaaaag tggaggggct 300
agggtgggaa tttggaagaa tacagttata ttgagcattg agcaagttga tagaaagctt 360
ctcaatttgt acaaaatttg catccacatg attattaaag acgtagacag cacttcttcc 420
ttcttttttt ctataagttt cttatatatt gttcttcatg ttttaatatt attactttat 480
gtacgcgtct aacagtagtc ctcccaaact gctataaata gagcctcttc aacgcacctc 540
ttggcagtac aaaaattatt catctcttct aagttctaat tttctaagca ttcagtaaaa 600
gaactaacc 609
<210>8
<211>2085
<212>DNA
<213>拟南芥
<400>8
cgaagagcat aagttttgtt caaatggccc aataacaaat taaaaacatg taaagtagtc 60
agtttaaaca agcatttgca taaagtgtgg ttaatattat attaaacttc acatccaatg 120
agcattcatg taatttaaag taactgaagt taagtatcta gaagcctttt tcttctattg 180
gttattaatt tgcttaattt tctttataag ttaatttctg gttggtgtga aaatgtgacc 240
agagaaggta tctaactttt ttttttcttt aatgaattcc actaaaattt aattctgtat 300
gtaacgcata tagtaaaatc tagaaagcga ccggcgtgcc tcctttggaa agtaatcctg 360
taaaagtaaa agccgcgtag tgtaaaagta tatgacttct tcttcccata attattttat 420
aattagtctt taatctaaat atttaaacat ataattcgtt ttacgagaaa gatcttcaca 480
ctcgattagt atacattaca tttaattccc tagttcataa aatggataac aaaaggctgt 540
gcgagattac aactgtactt gataattttg tataaatata tcctttatga atatatttta 600
gcattgatga ccgtacatgg ttaatccagt ctgcagcata acggagtatg atattaaatg 660
aacactttct gttcgtatca aatggtatcg aatattatta gagtgatcat tcagaagaaa 720
aaaagagaga gaagaaaacc tacagtgtaa acattttttt ttttgctaaa tacctacagt 780
gtaaacatga agtgctataa tttctgcaaa tagaaatcaa gaacagaaag agttgcttgg 840
aggaaaagaa atagaaaatt aagaaatcta gtgatgtaat aaatctttcc ataaaatcaa 900
atgtttggtc caaagtatta gttaaataat taggccacta ttcttgacaa ctctttttaa 960
caaactcttc tatattttct cgtggtacat atgctgaaaa agatgtatgt ctaatccata 1020
atatatctgt ataatgcgac tttcattatc tattagtacg acttctaacc tagaagataa 1080
caagcattag ctagggcatc aaaatcaacg tggaaaaacc tacgaaaagc acgaagtgat 1140
taatctgtgt aggggtggcg taagggtaaa gactaaagac tgagaatcta gggttcaagg 1200
cgtaaacttg ttctgctttt tgggtttcat tttattggcg aacaacattg atgtgtgtgg 1260
accatttggt gttcagggat tgagacaaga taatatgttt gctctcacct tctaggatta 1320
ctcgggtgct aagactcact tagtactatt gctatatcga tatactagtt cattaccaaa 1380
aaatggagtc ttcaaatttc gagttccaat atctgaaagc attgtttaaa gagatttgtt 1440
ttctccctgc acaattagtt tataacttca tatatacaca atcttatcaa tttacaacca 1500
ggtgtgtgtg aaccttcaca taatctctct tattcattca tgtatatatc caataaaagt 1560
tcgatatgtg aaattatata tctccatcta atgttagact attcccgggt cttgactata 1620
aatttaaagt attagacgag ctaattatat ttagcacaaa caatttcttc tgtaacagtg 1680
tcacgcttat cactaccaaa gaataaacac tgatctgttt taatctctta ttttctcacc 1740
catattcaaa gtcaactatt gcaagacttc gagataatta atttgatggc tatactattt 1800
acttgacatt tgggaaaata tattttcgct gataaatttg gtttttactt ctctctccga 1860
cggatataga aacaattcaa ttacatgcga aaatgataat tcaaccctat aaaccaaaac 1920
aaataacaga atgcacattt ttttcaacgc gttaggtcac ctatctttca ctttagaaca 1980
tcccttcacg tctctatata aacctcgact ctgttatcct ttgttcttca agtacaacaa 2040
tcaactctaa gtctattata ttcaagtctt tgttttaacc taaca 2085
<210>9
<211>1999
<212>DNA
<213>拟南芥
<400>9
gtagtgccct tcatggatac caaaagagaa aatttgattt agtgcataca tataacaata 60
taacgccgca taataatact gtataaaaca gtcatgtaac gatatgacag cagtaataca 120
gttccaagag acgttataat cgtatgcaat catatgcttg cgtagatttt ccaacagttt 180
tgtttcgttg ataggaggaa ctcaacactc tagggtagtg attggtagac actattagca 240
caaaaaatat taattttact ctgatgttta ccaaaaaagt taccaatcaa atatttaaga 300
gatcgtactc ttccacggcg actctaaaaa ccaaagatat aggttagact cataactact 360
ttataaagaa aatgtttaac gataactacc gagatctaat aaataaacct tcattttcaa 420
gtatattata tttgcttctt ttgtttatat atcaaaccaa gttctggttt ataaaaatat 480
tagataaaac tcgtctaaat aggtaggtgt aaaataaaat tttaaatttt tatcgataat 540
atttaaaatt tgaaaagtta ataatgatcc acacattttt tctaatattt aatttagtaa 600
tttttgtatt aaataaaatt tcaatcatat acattcgatt tttctataca ttttaactat 660
ctatttctgc ataataaact gtattttcat tttatacgct tcatcttatg gatgatattt 720
aaattttaaa tagtaattca tacacttttt aatatttaat ttagtatttt cttaaatcca 780
aattttaatc ttacaattta aatatctact ttaacataat acaaatacaa tttaatttca 840
ttgtattaaa ttcaaatata atttgattat aataaaatac aatttaattc taaaaagtcc 900
atcttagatt ttaattttcc tttttagttt tgaaaattaa aaatttaaat ttattagata 960
tatatgttac tttttcagtt ttcctattta tttaagaaaa aaatattttt taacacatgt 1020
caacttgtaa acaatagact gaacacgtca ttttatatta tgtttagttt tgaaaattaa 1080
agttaattaa atatttatat ttcttttttt tagcttttct aattattttt aaaatagtaa 1140
atatttttaa tacaaatcaa tatctgaaca atagatttga tacataacat aatcctataa 1200
attattaact tggaaaacga tagtttatat aataaaatta ttttcttaag ttctctaacc 1260
ataacaatta aactatattt tagcgaagaa aagaagagaa taccgagaga acgcaacttg 1320
cactaaaagc taccactttg gcaaatcact catttatatt attatatact atcacctcaa 1380
ttcaatcgaa acctcaaaat aacactaata tatacacaaa gaaacaacag aataacaccg 1440
aagaatatag gtttaggaaa atccagaatt tgttgagact aaagagatca aattttcgat 1500
acaaggtttt gctcaatttg tattttcata ataaaattct ttatttcacc atagacttac 1560
atgattagtt tttcttttaa taaaaaaaaa cacgcgacat gaaaattata ttatctcagt 1620
gttgtcgaat ttgaatttga attttgagtt aaatactaca catttgttga caacttatta 1680
aactttacaa gtctgctaca aatattgtca aatatttact aattaatgga ccaaaatcct 1740
ctaacttgca aatttgtatc tacatcaact taaaaattag gaatatgcga cccaaaaaaa 1800
aaaaaactag gaataataat aaaaaaatgg aatgatgtgg aggaagctct ttactctttg 1860
agaggaagtt tataaattga ccacacattt agtctattat catcacatgt attaagactt 1920
gacaacttgt ctttctcaca ccaaacccct ctcctctgtt tcataacatc tgctctttct 1980
tttttttcct aagccccta 1999
<210>10
<211>1967
<212>DNA
<213>拟南芥
<400>10
cagacaaaga attattggaa aacaatgaga atttttgacg gtggtttgtt ataatgtatt 60
attaaataac atgataatgg aaattacttt gttttagtta aaggaaaatt aatttgttgt 120
ttaataaact agtggtaggt aggaatagtt aaaatgtaag tatcaaagtt ttttgaattt 180
aagattaaga ttctcgaaat tcagttatta gcatacaaat gacataaatt atgaaaaaat 240
aaattaaaat aatgtcatac agatccagat gaaaatgtat aatgtatata catttgataa 300
aaatgaaaat gtattttcgg gttctcagtt tgttttgtga aatatcaata cacaatgtta 360
aaaaagaatc ggcttctttc agcttatgat attcattaat tttccacaca ccatttttca 420
aagggaaata gcaaaaaaaa ttaaaattaa aacagccagc taaattaatc agtgaaatca 480
tccaaactgt tttacaaaga cattttttcg gccaaatcaa ataaaaaaat cgattgttat 540
tgacagtctt tgtgatctta ttggttacgt tatacccacc tgtgcactcc acttttaagt 600
actacttcgt ctctaaatat ggtacggact aacttgaaat tagcctattg atttgcttag 660
aaattgataa atctttggac gagatggtgt ccactcttta aatcaccaca atgtccccta 720
tctattttcc gcgacaagat gaataagaat atgcactaaa cttaaccatc attcgcttat 780
acactatatt tattaaatca gctttctcat cgcctaaaat tcaatatttt tgggtccatt 840
atctacacga cacaatggat cattcacata cggccgcgca tcaaatgatt tcgtaagtcc 900
cggcaaatgt taataaacta tttgaaaaag aaagagtcat gtgtcccgtc aattcaagta 960
cttatttatt gtgatttttt gcacatatat agattaacat atattcatgg ttaaaacttg 1020
ttgatgctgc aaaaaggata attatcaccc acgtacatta ctcatatgaa tataaaaggt 1080
gcataatttt tttttttttt tttgtaatgt tttatgtata tacacatata gtataccaat 1140
tttttaacaa aacaaattac atatagataa caaagaggtg aatagtttcg atcgtgaata 1200
ttcaggttga tactaattag ttctcctttt gtagattcga caagtgtgat gagtggataa 1260
aaaaatggat gacgtcttga gtggattgta catatacaaa tagataatgt aagtgcatgc 1320
tttttgattc ttcgaaacta tttggttata actttcggat atacttataa caaaaaaaaa 1380
aacctttcgg atatacatgg ttcggcttgg acgtacaggt ctatataata atttgatata 1440
tattggtaca tttcatttat atactcttta ttggtacgat acattttgat tcgttatcaa 1500
tatattaata ccacattgac gagaacattc tcattagtga tcgtagatta ataatctagc 1560
catcttaata agcaaaatat ataatccaaa aaatgcgaca ttattttaca tacgcaagtg 1620
ttcacaacca atagtccaat atataaatta attaagtagg tatgtaatat aaccaaggaa 1680
tttacgatct aatccagttt tgattaccta gaacaagacc atagttagcc acacataatg 1740
gatacgtgct tgacaacaat taaaaaccta tatttttaaa agtgatgctt aaatagccaa 1800
tggattgaaa tgtgcactcg catatattgc tttttgtgtc agcacaattt ggctatataa 1860
gcaagtactc tcttgtagta atcattcaca gtcataacta attaagtaca tttgaataca 1920
tcaaatacca agaaagagaa att tagagag aaagagaaag agataaa 1967
<210>11
<211>1476
<212>DNA
<213>拟南芥
<400>11
gctcgcgtta gttccactca aggagtatcc tttcttcctt gcgcaactct ccaccttcgg 60
gtaaagtacc atctctagca tcttgagtct tgatcaactt ctgttttgct tactctcaaa 120
atgcattaat ttttttttat actagatcat agtattatat ctcttaatct acctattgaa 180
atctacttaa tgtttttact aaaacctacg tgtttctctt tagagaattt tgtgctatgc 240
atgaattaga ggttagtaat gtgtaatact tcataagtct agatttattt gttggttaac 300
acgtttagta attcacacac acacaccacc ttagatattt tactgtgaat tagaaaaaga 360
tacatagtta ggagtgtttt tttaaaaaaa ttcaatcatg agaaaattag aggtgtgatg 420
ttatacatta tgaaaatgca aagggcagat acgaataaat tagaaacttg tttaacgggt 480
cagagttggc ttctagtctc tttcgacttg gatacttctt cttctacaat tgggacatta 540
ttgtaggcgc attatatcat ttctctacat gcaatgaatg tacatacatt aattcacatt 600
tatttttgga ataatcatat gagtgatcga agtttgtatt tatatattca atcttcacaa 660
actactttta tttaaaaatc atttgcaaaa tgctatttta ttgacaaaaa gatatatgct 720
ataaaataaa ataaaattca caaactatag tcattaatac aaaaagaaat cattgaatat 780
ggtagagggg aaacaaaaaa aaaacacgac gatgtaagtt ggtggaacca cattatcaaa 840
ataaaagaag gtggtggaac caaattgaat aaagtccgtc catatcatta tccgtccctt 900
aggagcctct aattagtaat attcttatgg gtccactgtg gcttagagga cttgattaaa 960
accattctta tttagtgcta actttgtgag ggttggaata acgaaccaag ctgattcaaa 1020
ccattccaaa acaaagttgt cacatatttc aaaaccaaag tttaccggac agagaaatat 1080
ggtgtgtttt tctcaaacca agctaaatgg aatccattgt aaaccaaaat gttcacacct 1140
acctattctt ttggagtccc ttttccatgt gtttgctgtc tgctagtcaa gtttcattag 1200
ctgattgcct tgcatcatat tcttggatca actttttttt tttttttttt tggggtaatt 1260
aacaaaatgc ttaaatttct caagactata ggatcacatt acctgtgtgc ttaacataac 1320
ttttagatag gctagagaat tgatctatta caagataatc aataatttac agaagaaaac 1380
attctttttt ttgttctatt tccttcatgt aggtatgtag ctgtatatta tactatcttg 1440
tattttcgat atcgtgctgg aactgtcaca gatgca 1476
<210>12
<211>38
<212>DNA
<213>人工的
<220>
<223>引物序列
<400>12
ggcgcgccac catgaaagaa tacggtacga tggaagac 38
<210>13
<211>36
<212>DNA
<213>人工的
<220>
<223>引物序列
<400>13
ctcgagctac ggattaagtt tataaatgcc cgactg 36
<210>14
<211>485
<212>PRT
<213>克雷伯氏菌LX3
<400>14
Met Lys Glu Tyr Gly Thr Met Glu Asp Phe Asp Ser Leu Val Ala Glu
1 5 10 15
Met Lys Lys Arg Asn Met Arg Leu Met Ile Asp Val Val Ile Asn His
20 25 30
Thr Ser Asp Gln His Pro Trp Phe Ile Gln Ser Lys Ser Asp Lys Asn
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Phe Trp Arg Asp Gly Lys Asp Asn Gln
50 55 60
Pro Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Gln Lys
65 70 75 80
Asp Ala Lys Ser Gly Gln Tyr Tyr Leu His Tyr Phe Ala Arg Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Asn Pro Lys Val Arg Glu Asp Leu Tyr Ala
100 105 110
Met Leu Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Met Arg Phe Asp
115 120 125
Thr Val Ala Thr Tyr Ser Lys Ile Pro Gly Phe Pro Asn Leu Thr Pro
130 135 140
Glu Gln Gln Lys Asn Phe Ala Glu Gln Tyr Thr Met Gly Pro Asn Ile
145 150 155 160
His Arg Tyr Ile Gln Glu Met Asn Arg Lys Val Leu Ser Arg Tyr Asp
165 170 175
Val Ala Thr Ala Gly Glu Ile Phe Gly Val Pro Leu Asp Arg Ser Ser
180 185 190
Gln Phe Phe Asp Arg Arg Arg His Glu Leu Asn Met Ala Phe Met Phe
195 200 205
Asp Leu Ile Arg Leu Asp Arg Asp Ser Asn Glu Arg Trp Arg His Lys
210 215 220
Ser Trp Ser Leu Ser Gln Phe Arg Gln Ile Ile Ser Lys Met Asp Val
225 230 235 240
Thr Val Gly Lys Tyr Gly Trp Asn Thr Phe Phe Leu Asp Asn His Asp
245 250 255
Asn Pro Arg Ala Val Ser His Phe Gly Asp Asp Arg Pro Gln Trp Arg
260 265 270
Glu Ala Ser Ala Lys Ala Leu Ala Thr Ile Thr Leu Thr Gln Arg Ala
275 280 285
Thr Pro Phe Ile Tyr Gln Gly Ser Glu Leu Gly Met Thr Asn Tyr Pro
290 295 300
Phe Arg Gln Leu Asn Glu Phe Asp Asp Ile Glu Val Lys Gly Phe Trp
305 310 315 320
Gln Asp Tyr Val Gln Ser Gly Lys Val Thr Ala Thr Glu Phe Leu Asp
325 330 335
Asn Val Arg Leu Thr Ser Arg Asp Asn Ser Arg Thr Pro Phe Gln Trp
340 345 350
Asn Asp Thr Leu Asn Ala Gly Phe Thr Arg Gly Lys Pro Trp Phe His
355 360 365
Ile Asn Pro Asn Tyr Val Glu Ile Asn Ala Glu Arg Glu Glu Thr Arg
370 375 380
Glu Asp Ser Val Leu Asn Tyr Tyr Lys Lys Met Ile Gln Leu Arg His
385 390 395 400
His Ile Pro Ala Leu Val Tyr Gly Ala Tyr Gln Asp Leu Asn Pro Gln
405 410 415
Asp Asn Thr Val Tyr Ala Tyr Thr Arg Thr Leu Gly Asn Glu Arg Tyr
420 425 430
Leu Val Val Val Asn Phe Lys Glu Tyr Pro Val Arg Tyr Thr Leu Pro
435 440 445
Ala Asn Asp Ala Ile Glu Glu Val Val Ile Asp Thr Gln Gln Gln Ala
450 455 460
Ala Ala Pro His Ser Thr Ser Leu Ser Leu Ser Pro Trp Gln Ala Gly
465 470 475 480
Val Tyr Lys Leu Arg
485
<210>15
<211>485
<212>PRT
<213>植生拉乌尔菌(Raoultella planticola)
<400>15
Met Lys Glu Tyr Gly Thr Met Glu Asp Phe Asp Asn Leu Val Ala Glu
1 5 10 15
Met Lys Lys Arg Asn Met Arg Leu Met Ile Asp Val Val Ile Asn His
20 25 30
Thr Ser Asp Gln His Pro Trp Phe Ile Gln Ser Lys Ser Asp Lys Asn
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Phe Trp Arg Asp Gly Lys Asp Asn Gln
50 55 60
Pro Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Gln Lys
65 70 75 80
Asp Ala Lys Ser Gly Gln Tyr Tyr Leu His Tyr Phe Ala Arg Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Asn Pro Lys Val Arg Glu Asp Leu Tyr Ala
100 105 110
Met Leu Arg Phe Trp Leu Asp Lys Gly Val Ser Ser Met Arg Phe Asp
115 120 125
Thr Val Ala Thr Tyr Ser Lys Ile Pro Gly Phe Pro Asn Leu Thr Pro
130 135 140
Glu Gln Gln Lys Asn Phe Ala Glu Gln Tyr Thr Met Gly Pro Asn Ile
145 150 155 160
His Arg Tyr Ile Gln Glu Met Asn Arg Lys Val Leu Ser Arg Tyr Asp
165 170 175
Val Ala Thr Ala Gly Glu Ile Phe Gly Val Pro Leu Asp Arg Ser Ser
180 185 190
Gln Phe Phe Asp Pro Arg Arg His Glu Leu Asn Met Ala Phe Met Phe
195 200 205
Asp Leu Ile Arg Leu Asp Arg Asp Ser Asn Glu Arg Trp Arg His Lys
210 215 220
Ser Trp Ser Leu Ser Gln Phe Arg Gln Ile Ile Ser Lys Met Asp Val
225 230 235 240
Thr Val Gly Lys Tyr Gly Trp Asn Thr Phe Phe Leu Asp Asn His Asp
245 250 255
Asn Pro Arg Ala Val Ser His Phe Gly Asp Asp Arg Pro Gln Trp Arg
260 265 270
Glu Ala Ser Ala Lys Ala Leu Ala Thr Ile Thr Leu Thr Gln Arg Ala
275 280 285
Thr Pro Phe Ile Tyr Gln Gly Ser Glu Leu Gly Met Thr Asn Tyr Pro
290 295 300
Phe Arg Gln Leu Asn Glu Phe Asp Asp Ile Glu Val Lys Gly Phe Trp
305 310 315 320
Gln Asp Tyr Val Gln Ser Gly Lys Val Thr Ala Thr Glu Phe Leu Asp
325 330 335
Asn Val Arg Leu Thr Ser Arg Asp Asn Ser Arg Thr Pro Phe Gln Trp
340 345 350
Asn Asp Thr Leu Asn Ala Gly Phe Thr Arg Gly Lys Pro Trp Phe His
355 360 365
Ile Asn Pro Asn Tyr Val Glu Ile Asn Ala Glu Arg Glu Glu Thr Arg
370 375 380
Glu Asp Ser Val Leu Asn Tyr Tyr Lys Lys Met Ile Gln Leu Arg His
385 390 395 400
His Ile Pro Ala Leu Val Tyr Gly Ala Tyr Gln Asp Leu Asn Pro Gln
405 410 415
Asp Asn Thr Val Tyr Ala Tyr Thr Arg Thr Leu Gly Asn Glu Arg Tyr
420 425 430
Leu Val Val Val Asn Phe Lys Glu Tyr Pro Val Arg Tyr Thr Leu Pro
435 440 445
Ala Asn Asp Ala Ile Glu Glu Val Val Ile Asp Thr Gln Gln Gln Ala
450 455 460
Thr Ala Pro His Ser Thr Ser Leu Ser Leu Ser Pro Trp Gln Ala Gly
465 470 475 480
Val Tyr Lys Leu Arg
485
<210>16
<211>486
<212>PRT
<213>分散泛菌(Pantoea dispersa)
<400>16
Met Lys Glu Tyr Gly Ser Met Ala Asp Phe Asp Arg Leu Val Ala Glu
1 5 10 15
Met Asn Lys Arg Gly Met Arg Leu Met Ile Asp Ile Val Ile Asn His
20 25 30
Thr Ser Asp Arg His Arg Trp Phe Val Gln Ser Arg Ser Gly Lys Asp
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Phe Trp Arg Asp Gly Lys Gln Gly Gln
50 55 60
Ala Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Gln Leu
65 70 75 80
Asp Lys Gln Thr Asp Gln Tyr Tyr Leu His Tyr Phe Ala Pro Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Asn Pro Lys Val Arg Ala Glu Leu Tyr Asp
100 105 110
Ile Leu Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Leu Arg Phe Asp
115 120 125
Thr Val Ala Thr Phe Ser Lys Ile Pro Gly Phe Pro Asp Leu Ser Lys
130 135 140
Ala Gln Leu Lys Asn Phe Ala Glu Ala Tyr Thr Glu Gly Pro Asn Ile
140 150 155 160
His Lys Tyr Ile His Glu Met Asn Arg Gln Val Leu Ser Lys Tyr Asn
165 170 175
Val Ala Thr Ala Gly Glu Ile Phe Gly Val Pro Val Ser Ala Met Pro
180 185 190
Asp Tyr Phe Asp Arg Arg Arg Glu Glu Leu Asn Ile Ala Phe Thr Phe
195 200 205
Asp Leu Ile Arg Leu Asp Arg Tyr Pro Asp Gln Arg Trp Arg Arg Lys
210 215 220
Pro Trp Thr Leu Ser Gln Phe Arg Gln Val Ile Ser Gln Thr Asp Arg
225 230 235 240
Ala Ala Gly Glu Phe Gly Trp Asn Ala Phe Phe Leu Asp Asn His Asp
245 250 255
Asn Pro Arg Gln Val Ser His Phe Gly Asp Asp Ser Pro Gln Trp Arg
260 265 270
Glu Arg Ser Ala Lys Ala Leu Ala Thr Leu Leu Leu Thr Gln Arg Ala
275 280 285
Thr Pro Phe Ile Phe Gln Gly Ala Glu Leu Gly Met Thr Asn Tyr Pro
290 295 300
Phe Lys Asn Ile Glu Glu Phe Asp Asp Ile Glu Val Lys Gly Phe Trp
305 310 315 320
Asn Asp Tyr Val Ala Ser Gly Lys Val Asn Ala Ala Glu Phe Leu Gln
325 330 335
Glu Val Arg Met Thr Ser Arg Asp Asn Ser Arg Thr Pro Met Gln Trp
340 345 350
Asn Asp Ser Val Asn Ala Gly Phe Thr Gln Gly Lys Pro Trp Phe His
355 360 365
Leu Asn Pro Asn Tyr Lys Gln Ile Asn Ala Ala Arg Glu Val Asn Lys
370 375 380
Pro Asp Ser Val Phe Ser Tyr Tyr Arg Gln Leu Ile Asn Leu Arg His
385 390 395 400
Gln Ile Pro Ala Leu Thr Ser Gly Glu Tyr Arg Asp Leu Asp Pro Gln
405 410 415
Asn Asn Gln Val Tyr Ala Tyr Thr Arg Ile Leu Asp Asn Glu Lys Tyr
420 425 430
Leu Val Val Val Asn Phe Lys Pro Glu Gln Leu His Tyr Ala Leu Pro
435 440 445
Asp Asn Leu Thr Ile Ala Ser Ser Leu Leu Glu Asn Val His Gln Pro
450 455 460
Ser Leu Gln Glu Asn Ala Ser Thr Leu Thr Leu Ala Pro Trp Gln Ala
465 470 475 480
Gly Ile Tyr Lys Leu Asn
485
<210>17
<211>485
<212>PRT
<213>嗜中酸假单胞菌(Pseudomonas mesoacidophila)
<400>17
Met Lys Glu Tyr Gly Thr Met Glu Asp Phe Asp Arg Leu Met Ala Glu
1 5 10 15
Leu Lys Lys Arg Gly Met Arg Leu Met Val Asp Val Val Ile Asn His
20 25 30
Ser Ser Asp Gln His Glu Trp Phe Lys Ser Ser Arg Ala Ser Lys Asp
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Phe Trp Arg Asp Gly Lys Asp Gly His
50 55 60
Glu Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Glu Lys
65 70 75 80
Asp Pro Val Thr Gly Gln Tyr Tyr Leu His Tyr Phe Gly Arg Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Thr Pro Lys Leu Arg Glu Glu Leu Tyr Ala
100 105 110
Met Leu Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Met Arg Phe Asp
115 120 125
Thr Val Ala Thr Tyr Ser Lys Thr Pro Gly Phe Pro Asp Leu Thr Pro
130 135 140
Glu Gln Met Lys Asn Phe Ala Glu Ala Tyr Thr Gln Gly Pro Asn Leu
145 150 155 160
His Arg Tyr Leu Gln Glu Met His Glu Lys Val Phe Asp His Tyr Asp
165 170 175
Ala Val Thr Ala Gly Glu Ile Phe Gly Ala Pro Leu Asn Gln Val Pro
180 185 190
Leu Phe Ile Asp Ser Arg Arg Lys Glu Leu Asp Met Ala Phe Thr Phe
195 200 205
Asp Leu Ile Arg Tyr Asp Arg Ala Leu Asp Arg Trp His Thr Ile Pro
210 215 220
Arg Thr Leu Ala Asp Phe Arg Gln Thr Ile Asp Lys Val Asp Ala Ile
225 230 235 240
Ala Gly Glu Tyr Gly Trp Asn Thr Phe Phe Leu Gly Asn His Asp Asn
245 250 255
Pro Arg Ala Val Ser His Phe Gly Asp Asp Arg Pro Gln Trp Arg Glu
260 265 270
Ala Ser Ala Lys Ala Leu Ala Thr Val Thr Leu Thr Gln Arg Gly Thr
275 280 285
Pro Phe Ile Phe Gln Gly Asp Glu Leu Gly Met Thr Asn Tyr Pro Phe
290 295 300
Lys Thr Leu Gln Asp Phe Asp Asp Ile Glu Val Lys Gly Phe Phe Gln
305 310 315 320
Asp Tyr Val Glu Thr Gly Lys Ala Thr Ala Glu Glu Leu Leu Thr Asn
325 330 335
Val Ala Leu Thr Ser Arg Asp Asn Ala Arg Thr Pro Phe Gln Trp Asp
340 345 350
Asp Ser Ala Asn Ala Gly Phe Thr Thr Gly Lys Pro Trp Leu Lys Val
355 360 365
Asn Pro Asn Tyr Thr Glu Ile Asn Ala Ala Arg Glu Ile Gly Asp Pro
370 375 380
Lys Ser Val Tyr Ser Phe Tyr Arg Asn Leu Ile Ser Ile Arg His Glu
385 390 395 400
Thr Pro Ala Leu Ser Thr Gly Ser Tyr Arg Asp Ile Asp Pro Ser Asn
405 410 415
Ala Asp Val Tyr Ala Tyr Thr Arg Ser Gln Asp Gly Glu Thr Tyr Leu
420 425 430
Val Val Val Asn Phe Lys Ala Glu Pro Arg Ser Phe Thr Leu Pro Asp
435 440 445
Gly Met His Ile Ala Glu Thr Leu Ile Glu Ser Ser Ser Pro Ala Ala
450 455 460
Pro Ala Ala Gly Ala Ala Ser Leu Glu Leu Gln Pro Trp Gln Ser Gly
465 470 475 480
Ile Tyr Lys Val Lys
485
<210>18
<211>486
<212>PRT
<213>胡萝卜软腐欧文氏菌(Erwinia carotovora)
<400>18
Met Lys Glu Tyr Gly Thr Met Asp Asp Phe Asp Arg Leu Ile Ala Glu
1 5 10 15
Met Lys Lys Arg Asp Met Arg Leu Met Ile Asp Val Val Val Asn His
20 25 30
Thr Ser Asp Glu His Glu Trp Phe Val Glu Ser Lys Lys Ser Lys Asp
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Ile Trp Arg Asp Gly Lys Asp Gly Thr
50 55 60
Gln Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Gln Lys
65 70 75 80
Asp Asn Ala Thr Gln Gln Tyr Tyr Leu His Tyr Phe Gly Val Gln Gln
85 90 95
Pro Asp Leu Asn Trp Asp Asn Pro Lys Val Arg Glu Glu Val Tyr Asp
100 105 110
Met Leu Arg Phe Trp Ile Asp Lys Gly Val Ser Gly Leu Arg Met Asp
115 120 125
Thr Val Ala Thr Phe Ser Lys Asn Pro Ala Phe Pro Asp Leu Thr Pro
130 135 140
Lys Gln Leu Gln Asn Phe Ala Tyr Thr Tyr Thr Gln Gly Pro Asn Leu
145 150 155 160
His Arg Tyr Ile Gln Glu Met His Gln Lys Val Leu Ala Lys Tyr Asp
165 170 175
Val Val Ser Ala Gly Glu Ile Phe Gly Val Pro Leu Glu Glu Ala Ala
180 185 190
Pro Phe Ile Asp Gln Arg Arg Lys Glu Leu Asp Met Ala Phe Ser Phe
195 200 205
Asp Leu Ile Arg Leu Asp Arg Ala Val Glu Glu Arg Trp Arg Arg Asn
210 215 220
Asp Trp Thr Leu Ser Gln Phe Arg Gln Ile Asn Asn Arg Leu Val Asp
225 230 235 240
Met Ala Gly Gln His Gly Trp Asn Thr Phe Phe Leu Ser Asn His Asp
245 250 255
Asn Pro Arg Ala Val Ser His Phe Gly Asp Asp Arg Pro Glu Trp Arg
260 265 270
Thr Arg Ser Ala Lys Ala Leu Ala Thr Leu Ala Leu Thr Gln Arg Ala
275 280 285
Thr Pro Phe Ile Tyr Gln Gly Asp Glu Leu Gly Met Thr Asn Tyr Pro
290 295 300
Phe Thr Ser Leu Ser Glu Phe Asp Asp Ile Glu Val Lys Gly Phe Trp
305 310 315 320
Gln Asp Phe Val Glu Thr Gly Lys Val Lys Pro Asp Val Phe Leu Glu
325 330 335
Asn Val Lys Gln Thr Ser Arg Asp Asn Ser Arg Thr Pro Phe Gln Trp
340 345 350
Ser Asn Thr Ala Gln Ala Gly Phe Thr Thr Gly Thr Pro Trp Phe Arg
355 360 365
Ile Asn Pro Asn Tyr Lys Asn Ile Asn Ala Glu Glu Gln Thr Gln Asn
370 375 380
Pro Asp Ser Ile Phe His Phe Tyr Arg Gln Leu Ile Glu Leu Arg His
385 390 395 400
Ala Thr Pro Ala Phe Thr Tyr Gly Thr Tyr Gln Asp Leu Asp Pro Asn
405 410 415
Asn Asn Glu Val Leu Ala Tyr Thr Arg Glu Leu Asn Gln Gln Arg Tyr
420 425 430
Leu Val Val Val Asn Phe Lys Glu Lys Pro Val His Tyr Val Leu Pro
435 440 445
Lys Thr Leu Ser Ile Lys Gln Ser Leu Leu Glu Ser Gly Gln Lys Asp
450 455 460
Lys Val Glu Pro Asn Ala Thr Thr Leu Glu Leu Gln Pro Trp Gln Ser
465 470 475 480
Gly Ile Tyr Gln Leu Asn
485
<210>19
<211>483
<212>PRT
<213>维涅兰德固氮菌(Azotobacter vinelandii)
<400>19
Met Ser Glu Phe Gly Asp Met Asp Asp Phe Glu Arg Leu Leu Ala Gly
1 5 10 15
Met Asn Lys Arg Gly Met Arg Leu Ile Ile Asp Leu Val Val Asn His
20 25 30
Ser Ser Asp Glu His Arg Trp Phe Val Glu Ser Arg Arg Ser Lys Asp
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr ThrTrp Arg Asp Gly Lys Asp Gly Ala
50 55 60
Ala Pro Asn Asn Tyr Pro Ser Phe Phe Gly Gly Ser Ala Trp Lys Lys
65 70 75 80
Asp Glu Ala Thr Gly Gln Tyr Tyr Leu His Tyr Phe Ala Gly Lys Gln
85 90 95
Pro Asp Leu Asn Trp Glu Asn Pro Glu Val Arg Ala Glu Val His Asp
100 105 110
Ile Met Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Phe Arg Met Asp
115 120 125
Val Ile Pro Phe Ile Ser Lys Gln Asp Gly Leu Pro Asp Leu Pro Ala
130 135 140
Gln Ala Leu Ala His Pro Glu Phe Val Tyr Ala Asn Gly Pro Arg Ile
145 150 155 160
His Glu Tyr Leu Gln Glu Met Asn Arg Glu Val Leu Ser Arg Tyr Asp
165 170 175
Thr Met Thr Val Gly Glu Ala Phe Gly Ile Thr Phe Glu Gln Ala Pro
180 185 190
Leu Phe Thr Asp Ala Arg Arg His Glu Leu Asn Met Ile Phe His Phe
195 200 205
Asp Leu Val Arg Leu Asp Arg Asp Gly Trp Arg Lys Lys Asp Trp Thr
210 215 220
Leu Pro Glu Leu Lys Ala Thr Tyr Ala Arg Ile Asp Arg Thr Gly Gly
225 230 235 240
Asp His Gly Trp Asn Thr Ser Phe Leu Gly Asn His Asp Asn Pro Arg
245 250 255
Ala Val Ser His Phe Gly Asp Asp Ser Pro Glu Trp Arg Ala Ala Ser
260 265 270
Ala Lys Ala Leu Ala Thr Met Met Leu Thr Gln Arg Ala Thr Pro Phe
275 280 285
Leu Tyr Gln Gly Asp Glu Leu Gly Met Thr Asn Tyr Pro Phe Arg Gly
290 295 300
Leu Glu Asp Tyr Asp Asp Val Glu Val Lys Gly Gln Trp Arg Asp Phe
305 310 315 320
Val Glu Ser Gly Lys Val Ser Ala Asp Glu Tyr Leu Ala His Leu Arg
325 330 335
Gln Thr Ser Arg Asp Asn Ala Arg Thr Pro Met Gln Trp Ser Asp Ala
340 345 350
Pro Asn Gly Gly Phe Thr Thr Gly Lys Pro Trp Leu Ala Val Asn Pro
355 360 365
Asn Tyr Pro Gln Val Asn Ala Ala Ser Gln Val Asp Asp Pro Gly Ser
370 375 380
Ile Tyr His His Tyr Arg Arg Leu Leu Glu Val Arg Arg Gln Thr Pro
385 390 395 400
Ala Leu Ile His Gly Gln Phe Arg Asp Leu Asp Pro Ala Asn Pro Lys
405 410 415
Val Phe Ala Tyr Thr Arg Thr Leu Asp Asp Lys Arg Tyr Leu Val Leu
420 425 430
Ile Asn Phe Thr Arg Glu Thr Val Ala Tyr Asp Leu Pro Glu Gly Leu
435 440 445
Lys Ile Ala Ala Thr Leu Leu Asp Asn Gly Ala Ala Gln Glu Ser Met
450 455 460
Gln Pro Gly Ala Ala Ser Val Thr Leu Gln Pro Trp Gln Ala Thr Ile
465 470 475 480
Tyr Arg Leu
<210>20
<211>475
<212>PRT
<213>柄杆菌K31
<400>20
Met Thr Gln Phe Gly Thr Met Ala Asp Phe Asp Ala Met Leu Ala Gly
1 5 10 15
Met Thr Ala Arg Gly Met Arg Leu Ile Ile Asp Leu Val Val Asn His
20 25 30
Ser Ser Asp Glu His Ala Trp Phe Val Lys Ser Arg Lys Gly Arg Glu
35 40 45
Asn Pro Tyr Arg Asp Tyr Tyr Ile Trp Arg Asp Gly Lys Asp Gly Gly
50 55 60
Pro Pro Asn Asn Tyr Ser Ala Phe Phe Gly Gly Pro Ala Trp Thr Phe
65 70 75 80
Asp Ala Val Thr Asp Gln Tyr Tyr Leu His Tyr Phe Ala Ala Lys Gln
85 90 95
Pro Asp Leu Asn Trp Glu Asn Pro Lys Val Arg Ala Glu Val His Asp
100 105 110
Leu Met Arg Phe Trp Leu Asp Lys Gly Val Ser Gly Phe Arg Met Asp
115 120 125
Val Ile Pro Phe Ile Ser Lys Pro Pro Gly Leu Pro Asp Leu Thr Pro
130 135 140
Gln Glu Arg Arg Ala Pro Gln Phe Val Tyr Ala Ala Asp Pro Lys Leu
145 150 155 160
His Asp Tyr Leu Arg Glu Met Arg Arg Glu Val Leu Asp His Tyr Asp
165 170 175
Thr Met Thr Val Gly Glu Ala Phe Gly Val Thr Pro Asp Ala Ala Arg
180 185 190
Asp Leu Ile Asp Ser Arg Arg Gly Glu Leu Asp Leu Val Phe Asn Phe
195 200 205
Asp Ile Val Arg Met Asp Ile Asp Gly Trp Arg Lys Thr Ser Trp Thr
210 215 220
Leu Pro Arg Leu Lys Ala Leu Tyr Thr Gln Leu Asp Gln Ala Ala Gly
225 230 235 240
Pro Phe Gly Trp Asn Thr Gln Phe Leu Ser Asn His Asp Asn Pro Arg
245 250 255
Ser Val Ser His Phe Gly Asp Asp Asp Pro Ala Trp Val Glu Arg Ser
260 265 270
Ala Lys Val Leu Ala Thr Leu Ile Leu Thr Gln Arg Gly Thr Pro Phe
275 280 285
Leu Tyr Gln Gly Glu Glu Leu Gly Met Thr Asn Tyr Pro Phe Gln Thr
290 295 300
Leu Asp Asp Phe Asp Asp Leu Glu Val Ala Gly Arg Trp Arg Asp Val
305 310 315 320
Lys His Arg Val Ser Glu Glu Glu Tyr Leu Ala Asn Ala Arg Ala Met
325 330 335
Gly Arg Asp Asn Ser Arg Thr Pro Met Gln Trp Thr Gly Asp Pro His
340 345 350
Gly Gly Phe Thr Thr Gly Lys Pro Trp Leu Ala Val Asn Pro Asn Ala
355 360 365
Ala Thr Ile Asn Ala Gln Asp Gln Ala Ala Arg Pro Asp Ser Val Leu
370 375 380
Thr His Cys Arg Ala Leu Ile Ala Trp Arg Arg Gly Ser Val Asp Leu
385 390 395 400
Arg Glu Gly Asp Tyr Arg Asp Ile Asp Pro Asp His Pro Gln Val Phe
405 410 415
Ala Tyr Arg Arg Gly Glu Gly Leu Leu Val Leu Leu Asn Phe Gly Arg
420 425 430
Glu Thr Val Arg Tyr Ala Leu Pro Glu Gly Leu Ala Ile Glu Ser Ala
435 440 445
Ala Phe Gly Ala Val Glu Ile Ala Gly Arg Val Val Ala Leu Thr Gly
450 455 460
Trp Ser Phe Val Ile Leu Thr Val Arg Asp Arg
165 470 475
<210>21
<211>1458
<212>DNA
<213>克雷伯氏菌LX3
<400>21
atgaaagagt atggcacaat ggaggatttt gatagccttg ttgccgaaat gaaaaaacga 60
aatatgcgct taatgatcga cgtggtcatt aaccatacca gtgatcaaca cccgtggttt 120
attcagagta aaagcgataa aaacaaccct tatcgtgact attatttctg gcgtgacgga 180
aaagataatc agccacctaa taattacccc tcatttttcg gcggctcggc atggcaaaaa 240
gatgcaaagt caggacagta ctatttacac tattttgcca gacagcaacc tgatctcaac 300
tgggataacc cgaaagtacg tgaggatctt tacgcaatgc tccgcttctg gctggataaa 360
ggcgtttcag gcatgcgatt tgatacggtg gcaacttatt ccaaaatccc gggatttccc 420
aatctgacac ctgaacaaca gaaaaatttt gctgaacaat acaccatggg gcctaatatt 480
catcgataca ttcaggaaat gaaccggaaa gttctgtccc ggtatgatgt ggccaccgcg 540
ggtgaaattt ttggcgtccc gctggatcgt tcgtcgcagt tttttgatcg ccgccgacat 600
gagctgaata tggcgtttat gtttgacctc attcgtctcg atcgcgacag caatgaacgc 660
tggcgtcaca agtcgtggtc gctctctcag ttccgccaga tcatcagcaa aatggatgtc 720
acggtcggaa agtatggctg gaacacgttc ttcttagata accatgacaa cccccgtgcg 780
gtatctcact tcggggatga caggccgcaa tggcgggagg cgtcggctaa ggcactggcg 840
acgattaccc tcactcagcg ggcgacgccg tttatttatc agggttcaga gctgggaatg 900
actaattatc ccttcaggca actcaacgaa tttgacgaca tcgaggtcaa aggtttctgg 960
caggattatg tccagagtgg aaaagtcacg gccacagagt ttctcgataa tgtgcgcctg 1020
acgagccgcg ataacagcag aacacctttc cagtggaatg acaccctgaa tgctggtttt 1080
actcgcggaa agccgtggtt tcacatcaac ccaaactatg tggagatcaa cgccgaacgc 1140
gaagaaaccc gcgaagattc agtgctgaat tactataaaa aaatgattca gctacgccac 1200
catatccctg ctctggtata tggcgcctat caggatctta atccacagga caataccgtt 1260
tatgcctata cccgaacgct gggtaacgag cgttatctgg tcgtggtgaa ctttaaggag 1320
tacccggtcc gctatactct cccggctaat gatgccatcg aggaagtggt cattgatact 1380
cagcagcagg cggctgcgcc gcacagcaca tccctgtcat tgagcccctg gcaggcaggt 1440
gtgtataagct gcggtaa 1458
<210>22
<211>1458
<212>DNA
<213>植生拉乌尔菌
<400>22
atgaaagagt atggcacaat ggaggatttt gataaccttg ttgccgaaat gaaaaaacga 60
aatatgcgct taatgatcga cgtggtcatt aaccatacca gtgatcaaca cccgtggttt 120
attcagagta aaagcgataa aaacaaccct tatcgtgact actatttctg gcgtgacgga 180
aaagataatc agccacctaa taattacccc tcatttttcg gcggctcggc atggcaaaaa 240
gatgcaaagt caggacagta ctatttacac tattttgcca gacagcaacc tgatctcaac 300
tgggataacc cgaaagtacg tgaggatctt tacgcaatgc tccgcttctg gctggataaa 360
ggcgtttcaa gcatgcgatt tgatacggtg gcaacttatt ccaaaatccc gggatttccc 420
aatctgacac ctgaacaaca gaaaaatttt gctgaacaat acaccatggg gcctaatatt 480
catcgataca ttcaggaaat gaaccggaaa gttctgtccc ggtatgatgt ggccaccgcg 540
ggtgaaattt ttggcgtccc gctggatcgt tcgtcccagt tttttgatcc ccgccgacat 600
gagctgaata tggcgtttat gtttgacctc attcgtctcg atcgcgacag caatgaacgc 660
tggcgtcaca agtcgtggtc gctctctcag ttccgccaga tcatcagcaa aatggatgtc 720
acggtcggaa agtatggctg gaacacgttc ttcttagata accatgacaa cccccgtgcg 780
gtatctcact tcggggatga caggccgcaa tggcgggagg cgtcggctaa ggcactggcg 840
acgattaccc tcactcagcg ggcgacgccg tttatttatc agggttcaga gctgggaatg 900
actaattatc ccttcaggca actcaacgaa tttgacgata tcgaggtcaa aggtttctgg 960
caggattatg tccagagtgg aaaagtcacg gccacagagt ttctcgataa tgtgcgcctg 1020
acgagccgcg ataacagcag aacacctttc cagtggaatg acaccctgaa tgctggtttt 1080
actcgcggaa agccgtggtt tcacatcaac ccaaactatg tggagatcaa cgccgaacgc 1140
gaagaaaccc gcgaagattc agtgctgaat tactataaaa aaatgattca gctacgccac 1200
catatccctg ctctggtata tggcgcctat caggatctta atccacagga caataccgtt 1260
tatgcctata cccgaacgct gggtaacgag cgttatctgg tcgtggtgaa ctttaaggag 1320
tacccggtcc gctatactct cccggctaat gatgccatcg aggaagtggt cattgatact 1380
cagcagcagg cgactgcgcc gcacagcaca tccctgtcat tgagcccctg gcaggcaggt 1440
gtgtataagc tgcggtaa 1458
<210>23
<211>1461
<212>DNA
<213>分散泛菌
<400>23
atgaaggagt acggcagcat ggctgacttt gaccgtctgg ttgccgaaat gaataaacgt 60
ggtatgcgcc tgatgattga tattgttatc aatcatacca gcgatcgtca ccgctggttt 120
gtgcagagcc gttcaggtaa agataatcct taccgcgact attatttctg gcgtgatggt 180
aaacagggac aggctcccaa taactatccc tctttctttg gcggttcagc ctggcaactg 240
gataaacaga ctgaccagta ttatctgcac tattttgcac cacagcagcc ggatctgaac 300
tgggataacc caaaagttcg ggctgaactc tacgatattc tgcgtttctg gctggataaa 360
ggcgtatccg gactacgttt tgataccgtg gctactttct ccaaaattcc tggcttcccg 420
gacctgtcaa aagcgcagct gaagaatttt gccgaagctt atactgaggg gccgaatatt 480
cataaatata tccatgaaat gaaccgccag gtactgtcta aatataatgt tgccaccgct 540
ggtgaaatct tcggtgtgcc agtgagtgct atgccggatt attttgaccg gcggcgtgaa 600
gaactcaata ttgctttcac ctttgatttg atcaggctcg atcgttatcc cgatcagcgc 660
tggcgtcgta aaccatggac attaagccag tttcgtcaag ttatctctca gactgaccgt 720
gccgccggtg aatttggctg gaacgccttt ttccttgata accatgataa cccgcgccag 780
gtctcacact ttggtgacga cagcccacaa tggcgcgaac gctcggcaaa agcactggca 840
acgctgctgc tgacgcagcg tgccacgccg tttatctttc agggggcgga gttgggaatg 900
actaattacc cctttaaaaa tatagaggaa tttgatgata ttgaggttaa aggcttctgg 960
aacgactatg tagccagcgg aaaagtaaac gctgctgaat ttttacagga ggttcgcatg 1020
accagccgcg ataacagccg aacaccaatg cagtggaacg actctgttaa tgccggattc 1080
acccagggca aaccctggtt tcacctcaat cccaactata agcaaatcaa tgccgccagg 1140
gaggtgaata aacccgactc ggtattcagt tactaccgtc aactgatcaa cctgcgtcac 1200
cagatcccgg cactgaccag tggtgaatac cgtgatctcg atccgcagaa taaccaggtc 1260
tatgcctata cccgtatact ggataatgaa aaatatctgg tggtagttaa ttttaaacct 1320
gagcagctgc attacgctct gccagataat ctgactattg ccagcagtct gctggaaaat 1380
gtccaccaac catcactgca agaaaatgcc tccacgctga ctcttgctcc gtggcaagcc 1440
gggatctata agctgaactg a 1461
<210>24
<211>1458
<212>DNA
<213>嗜中酸假单胞菌
<400>24
atgaaggaat atgggacgat ggaggacttc gatcgtctga tggctgagtt gaagaagcgc 60
ggcatgcggc tcatggttga tgtcgtgatc aaccattcga gtgaccaaca cgaatggttc 120
aagagcagcc gggcctccaa agacaatccc taccgtgact attatttctg gcgtgacggc 180
aaagacggtc acgagccaaa caattaccct tccttcttcg gcggttcggc atgggagaag 240
gaccccgtaa ccgggcaata ttacctgcat tatttcggtc gtcagcagcc agatctgaac 300
tgggacacgc cgaagcttcg cgaggaactc tatgcgatgc tgcggttctg gctcgacaag 360
ggcgtatcag gcatgcggtt cgatacggtg gctacctact cgaagacacc gggtttcccg 420
gatctgacac cggagcagat gaagaacttc gcggaggcct atacccaggg gccgaacctt 480
catcgttacc tgcaggaaat gcacgagaag gtcttcgatc attatgacgc ggtcacggcc 540
ggcgaaatct tcggcgctcc gctcaatcaa gtgccgctgt tcatcgacag ccggaggaaa 600
gagctggata tggctttcac cttcgatctg atccgttatg atcgcgcact ggatcgttgg 660
cataccattc cgcgtacctt agcggacttc cgtcaaacga tcgataaggt cgacgccatc 720
gcgggcgaat atggctggaa cacgttcttc ctcggcaatc acgacaatcc ccgtgcggta 780
tcgcattttg gtgacgatcg gccgcaatgg cgcgaagcct cggccaaggc tctggccacc 840
gtcaccttga cccagcgagg aacgccgttc atcttccaag gagatgaact cggaatgacc 900
aactacccct tcaagacgct gcaggacttt gatgatatcg aagtcaaagg cttctttcag 960
gactatgtcg aaaccggaaa ggcaactgcc gaggaattgc tgaccaatgt ggcgttgact 1020
agccgcgaca acgcccgcac gccctttcaa tgggatgaca gtgctaatgc gggattcacg 1080
accggcaagc cttggctaaa ggtcaatcca aactacactg agatcaacgc cgcgcgggaa 1140
attggcgatc ctaaatcggt ctacagcttt taccgcaacc tgatctcaat ccggcatgaa 1200
actcccgctc tttcgaccgg gagctatcgc gacatcgatc cgagtaatgc cgatgtctat 1260
gcctatacgc gcagccagga tggcgagacc tatctggtcg tagtcaactt caaggcagag 1320
ccaaggagtt tcacgcttcc ggacggcatg catattgccg aaaccctgat tgagagcagt 1380
tcgccagcag ctccggcggc gggggctgca agccttgagc tgcagccttg gcagtccggc 1440
atctacaagg tgaagtaa 1458
<210>25
<211>1461
<212>DNA
<213>胡萝卜软腐欧文氏菌
<400>25
atgaaagaat atggcacaat ggatgacttc gaccgactca ttgcagaaat gaaaaagcgt 60
gatatgcgat taatgataga tgttgtcgtt aatcacacca gcgatgagca tgaatggttt 120
gtcgaaagta aaaaatcaaa agataatcct tatcgcgact attatatttg gcgcgatggc 180
aaagatggca cacagcctaa taattacccc tccttcttcg gcggttccgc ctggcagaaa 240
gataacgcaa cacagcaata ttatctgcac tattttggcg tacagcagcc cgatctgaat 300
tgggataatc ccaaagtacg tgaagaagtg tacgacatgc tgcgtttctg gattgataaa 360
ggggtttctg ggctgcgtat ggataccgtg gcaacctttt ccaagaaccc ggctttcccc 420
gacctgacgc caaagcaact gcaaaacttt gcctacacct acacgcaggg ccctaatctg 480
catcgttaca ttcaggaaat gcaccaaaaa gtgctggcaa aatatgacgt cgtttccgca 540
ggtgaaattt tcggtgtacc gctggaggaa gcggccccgt ttatcgatca gcgccgtaaa 600
gagctcgata tggccttctc attcgatctt atccgtctcg atcgcgccgt agaggaaaga 660
tggcggcgga atgactggac gttgtcccag ttccgtcaga tcaacaatcg actggttgat 720
atggccgggc aacatggctg gaataccttc ttcctgagca accatgacaa cccgcgtgcg 780
gtatcacact tcggtgacga tcgcccagag tggcgcaccc gttccgctaa agcactggcg 840
acgttggcgt taacgcagcg cgcaactccg tttatttatc aaggagacga attgggcatg 900
accaactacc cgtttacgtc cttgtctgaa ttcgatgaca ttgaagttaa aggcttctgg 960
caggactttg tagagacagg aaaagtgaaa cctgatgtct tcctggaaaa cgtaaaacaa 1020
accagccgcg ataacagtcg cacaccgttc caatggagca atacggcaca ggcaggcttt 1080
actacaggta ctccctggtt ccgtattaac cccaactata agaacatcaa tgcagaggag 1140
caaacgcaaa atccagactc catcttccat ttctatcgtc aactgatcga attacgtcat 1200
gctacaccag cgttcaccta cggaacttat caggatcttg atccgaataa taacgaggta 1260
cttgcttata ctcgtgaact caatcagcaa cgttatctgg ttgtggtgaa ctttaaagaa 1320
aaacccgtgc attacgttct gccgaaaaca ctttccatca aacagtcttt actggaaagc 1380
gggcaaaaag acaaagtaga accaaacgcg acgacgcttg aattacagcc gtggcaatct 1440
gggatttatc agttgaacta a 1461
<210>26
<211>1452
<212>DNA
<213>维涅兰德固氮菌
<400>26
atgagcgaat tcggcgacat ggacgacttc gagcgcctgc tcgccgggat gaacaagcgc 60
ggcatgcgcc tgatcatcga tctggtggtc aaccacagca gcgacgagca tcgctggttc 120
gtcgagagcc gccggtcgaa ggacaacccc tatcgcgact actacacttg gcgcgacggc 180
aaggacggcg ctgcgccgaa caactatccg tcgttcttcg gcggctcggc ctggaagaag 240
gacgaggcca cggggcagta ctacctccac tacttcgccg gcaagcagcc cgacctgaac 300
tgggaaaacc ccgaggtccg cgccgaggtc cacgacatca tgcgcttctg gctggacaag 360
ggcgtgtccg gcttccgcat ggacgtgatt cccttcatct ccaaacagga cggcctgccc 420
gacctgcctg cgcaagccct ggcccatccc gagttcgtct acgcgaacgg cccgcgcatc 480
catgagtatc tccaggaaat gaaccgcgaa gtcctgtccc gctatgacac catgacggtc 540
ggcgaagcct tcggcatcac cttcgaacag gccccgctgt tcaccgacgc ccgccgtcac 600
gaactgaaca tgatcttcca tttcgacctg gtgcggctgg accgcgacgg ctggcgcaaa 660
aaggactgga cgctgcccga gctcaaggcg acctacgcgc ggatcgaccg caccggcggc 720
gaccatggct ggaacaccag tttcctgggc aaccacgaca atccccgcgc cgtttcccat 780
ttcggcgacg acagccccga atggcgcgcc gcctcggcca aggcgctggc gaccatgatg 840
ctcacccagc gcgccacgcc cttcctctac cagggcgacg aactgggcat gaccaactat 900
cccttccgcg gcctcgagga ctacgacgat gtcgaagtga agggccaatg gcgcgacttc 960
gtggaaagcg gcaaggtgtc ggcggacgag tatctcgccc acctgcgcca gaccagccgc 1020
gacaacgccc gcaccccgat gcagtggagc gacgcgccga acggcggctt caccaccggc 1080
aagccctggc ttgcggtcaa cccgaactat ccgcaggtca atgcggcatc ccaggtcgac 1140
gatcccggct cgatctacca tcactaccgt cgcctgctgg aagtgcgccg ccagaccccc 1200
gcgctcatcc acggccagtt ccgcgatctc gatccggcca atcccaaggt cttcgcctac 1260
acgcgcacgc tcgacgacaa gcgctatctg gtgctgatca acttcacccg cgagacggtc 1320
gcctacgacc tgccggaagg actgaagatc gccgccacgc tgctggacaa cggcgccgcg 1380
caagagtcga tgcaacccgg cgccgcgagc gtaacgctcc agccctggca ggcgacgatc 1440
taccggctct ga 1452
<210>27
<211>1428
<212>DNA
<213>柄杆菌K31
<400>27
atgacgcagt tcgggaccat ggccgatttc gacgccatgc tggccggcat gacggcgcgc 60
ggcatgcggc tgatcatcga cctggtggtc aatcacagca gcgacgaaca cgcctggttc 120
gtcaagagcc gcaagggtcg cgagaacccc tatcgcgact actacatctg gcgcgacggc 180
aaggatggcg gaccgcccaa caactacagc gccttcttcg gcgggccggc ctggaccttc 240
gacgcggtca cggaccagta ctacctccac tatttcgccg ccaagcagcc ggacctgaac 300
tgggaaaacc ccaaggtccg ggccgaggtg catgacctga tgcgcttctg gctcgacaag 360
ggcgtgtcgg ggttccggat ggacgtgatc cccttcatct ccaagccgcc gggcctgccg 420
gacctgacgc cgcaggagcg ccgcgcgccg cagttcgtct atgccgccga ccccaagctg 480
cacgactacc tgcgcgagat gcgccgcgag gtgttggacc actatgacac catgacggtc 540
ggcgaggcgt tcggggtcac gcccgatgcg gcccgcgacc tgatcgacag ccggcgcggc 600
gagctggacc tggtgttcaa tttcgacatc gtccgcatgg acatcgacgg ctggcgcaag 660
acctcctgga ccctgccccg gctgaaggcg ctctataccc agctggacca ggcggcgggg 720
ccgttcggct ggaacaccca gttcctgtcc aaccacgaca atccgcgctc ggtctcgcac 780
ttcggcgacg acgatcccgc atgggtcgag cgttcggcca aggtcctggc gaccctgatc 840
ctgacccaac gcggcacgcc gttcctctat cagggcgagg agctgggcat gaccaactac 900
ccgttccaga cgctggacga cttcgacgac ctggaggtgg ccggccgctg gcgcgacgtg 960
aagcaccggg tgtcggagga agagtacctg gccaacgccc gagccatggg ccgcgacaac 1020
agccgcacgc cgatgcagtg gacgggcgac ccgcacggcg gcttcaccac gggcaagccc 1080
tggctggcgg tcaatccgaa cgccgcgacg atcaacgccc aggaccaggc ggcgcggccg 1140
gactcggtgc tgacccactg ccgcgccctg atcgcctggc ggcgcggctc ggtcgacctg 1200
cgggagggcg actaccgcga catcgaccct gaccatccac aggtcttcgc ctatcgccgg 1260
ggcgaggggc tgctggtgct gctgaacttc gggcgggaaa cggtgcggta cgcgctgccg 1320
gagggcctgg cgatcgagag cgcggcgttc ggcgcggtcg agatcgcggg gcgggtcgtg 1380
gccttgacgg gctggagctt cgtgatcttg accgtcagag accgctag 1428
Claims (20)
1.编码蔗糖异构酶多肽N末端截短形式的分离的多核苷酸,其当转化至植物中时显示抗线虫活性,其中所述多肽不显示蔗糖异构酶酶促活性。
2.权利要求1的分离的多核苷酸,其选自:
a.具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸;
b.编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸;
c.与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸有70%序列同一性的多核苷酸;
d.编码与具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽有70%序列同一性的多肽的多核苷酸;
e.在严格条件下与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸杂交的多核苷酸;和
f.在严格条件下与编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸杂交的多核苷酸。
3.权利要求2的分离的多核苷酸,其中所述多核苷酸具有如SEQ IDNO:1、3、4、6、21、22、23、24、25、26或27中定义的序列。
4.权利要求2的分离的多核苷酸,其中所述多核苷酸编码具有如SEQID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽。
5.转化有表达载体的转基因植物,所述载体包含编码蔗糖异构酶多肽N末端截短形式的分离的多核苷酸,其当转化入植物中时显示抗线虫活性,其中所述多肽不显示蔗糖异构酶酶促活性。
6.权利要求5的转基因植物,其中所述分离的多核苷酸选自:
a)具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸;
b)编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸;
c)与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸有70%序列同一性的多核苷酸;
d)编码与具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽有70%序列同一性的多肽的多核苷酸;
e)在严格条件下与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸杂交的多核苷酸;和
f)在严格条件下与编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸杂交的多核苷酸。
7.权利要求6的植物,其中所述多核苷酸具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列。
8.权利要求6的植物,其中所述多核苷酸编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽。
9.权利要求5的植物,其进一步被限定为单子叶植物。
10.权利要求9的植物,其中所述植物选自玉米、小麦、稻、大麦、燕麦、黑麦、高粱、香蕉和黑麦草。
11.权利要求5的植物,其进一步被限定为双子叶植物。
12.权利要求11的植物,其中所述植物选自豌豆、苜蓿、大豆、胡萝卜、芹菜、番茄、马铃薯、棉花、烟草、胡椒、油菜、甜菜、卷心菜、花椰菜、绿花椰菜、莴苣和拟南芥。
13.权利要求1 2的植物,其中所述植物是大豆。
14.表达载体,其包含有效连接编码蔗糖异构酶多肽N末端截短形式的多核苷酸的启动子,所述多核苷酸当转化到植物中时显示抗线虫活性,其中所述多肽不显示蔗糖异构酶酶促活性。
15.权利要求14的表达载体,其中所述多核苷酸选自:
a)具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸;
b)编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸;
c)与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸有70%序列同一性的多核苷酸;
d)编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸;
e)在严格条件下与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸杂交的多核苷酸;和
f)在严格条件下与编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸杂交的多核苷酸。
16.权利要求14的表达载体,其中所述启动子选自组成型启动子、根特异性启动子和合胞体特异性启动子。
17.权利要求14的表达载体,其中所述多核苷酸具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列。
18.权利要求14的表达载体,其中所述多核苷酸编码具有如SEQ IDNO:2、5、14、15、16、17、18、19或20中定义的序列的多肽。
19.产生具有提高的线虫抗性的转基因植物的方法,其中所述方法包括步骤:
a)将表达载体引入植物中,该表达载体包含有效连接编码蔗糖异构酶多肽N末端截短形式的多核苷酸的启动子,所述多核苷酸当转化到植物中时显示抗线虫活性,其中所述多肽不显示蔗糖异构酶酶促活性;和
b)选择具有提高的线虫抗性的转基因植物。
20.权利要求19的方法,其中所述多核苷酸选自:
a)具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸;
b)编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸;
c)与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸有70%序列同一性的多核苷酸;
d)编码与具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽有70%序列同一性的多肽的多核苷酸;
e)在严格条件下与具有如SEQ ID NO:1、3、4、6、21、22、23、24、25、26或27中定义的序列的多核苷酸杂交的多核苷酸;和
f)在严格条件下与编码具有如SEQ ID NO:2、5、14、15、16、17、18、19或20中定义的序列的多肽的多核苷酸杂交的多核苷酸。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US90022807P | 2007-02-08 | 2007-02-08 | |
US60/900,228 | 2007-02-08 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101600803A true CN101600803A (zh) | 2009-12-09 |
Family
ID=39410297
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2008800039057A Pending CN101600803A (zh) | 2007-02-08 | 2008-02-05 | 编码用于控制寄生线虫的截短的蔗糖异构酶多肽的多核苷酸 |
Country Status (8)
Country | Link |
---|---|
US (1) | US20100064389A1 (zh) |
EP (1) | EP2111455A1 (zh) |
CN (1) | CN101600803A (zh) |
AR (1) | AR065327A1 (zh) |
BR (1) | BRPI0807010A2 (zh) |
CA (1) | CA2676293A1 (zh) |
MX (1) | MX2009007729A (zh) |
WO (1) | WO2008095916A1 (zh) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RS55986B1 (sr) | 2010-01-22 | 2017-09-29 | Bayer Ip Gmbh | Akaricidne i/ili insekticidne kombinacije aktivnih supstanci |
BR112013006612A2 (pt) | 2010-09-22 | 2017-10-24 | Bayer Ip Gmbh | uso de agentes de controle biológico ou químico para controle de insetos e nematódeos em culturas resistentes |
EP2460406A1 (en) | 2010-12-01 | 2012-06-06 | Bayer CropScience AG | Use of fluopyram for controlling nematodes in nematode resistant crops |
WO2013020985A1 (en) | 2011-08-10 | 2013-02-14 | Bayer Intellectual Property Gmbh | Active compound combinations comprising specific tetramic acid derivatives |
BR112014015002A2 (pt) | 2011-12-19 | 2017-06-13 | Bayer Cropscience Ag | uso de derivados de diamida de ácido antranílico para o controle de pragas em culturas transgênicas |
RU2014130906A (ru) * | 2011-12-28 | 2016-02-20 | Юниверсити Оф Теннесси Рисёрч Фаундейшн | Повышение защиты сои от вредителей |
EP2622961A1 (en) | 2012-02-02 | 2013-08-07 | Bayer CropScience AG | Acive compound combinations |
TWI654180B (zh) | 2012-06-29 | 2019-03-21 | 美商艾佛艾姆希公司 | 殺真菌之雜環羧醯胺 |
WO2014090765A1 (en) | 2012-12-12 | 2014-06-19 | Bayer Cropscience Ag | Use of 1-[2-fluoro-4-methyl-5-(2,2,2-trifluoroethylsulfinyl)phenyl]-5-amino-3-trifluoromethyl)-1 h-1,2,4 tfia zole for controlling nematodes in nematode-resistant crops |
KR101768748B1 (ko) * | 2014-07-29 | 2017-08-18 | 경희대학교 산학협력단 | 변이된 수크로스이성화효소 및 이의 제조방법 |
CN114032251A (zh) | 2015-08-07 | 2022-02-11 | 拜尔作物科学公司 | 根优先和胁迫诱导型启动子及其用途 |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE4447472C2 (de) * | 1994-01-19 | 1996-04-11 | Suedzucker Ag | Organismen mit reduziertem Palatinose- und Trehalulosestoffwechsel |
AU2003246353A1 (en) * | 2002-07-04 | 2004-01-23 | Sungene Gmbh And Co. Kgaa | Methods for obtaining pathogen resistance in plants |
-
2008
- 2008-02-05 BR BRPI0807010-5A2A patent/BRPI0807010A2/pt not_active IP Right Cessation
- 2008-02-05 CA CA002676293A patent/CA2676293A1/en not_active Abandoned
- 2008-02-05 MX MX2009007729A patent/MX2009007729A/es active IP Right Grant
- 2008-02-05 US US12/524,868 patent/US20100064389A1/en not_active Abandoned
- 2008-02-05 CN CNA2008800039057A patent/CN101600803A/zh active Pending
- 2008-02-05 WO PCT/EP2008/051382 patent/WO2008095916A1/en active Application Filing
- 2008-02-05 EP EP08708683A patent/EP2111455A1/en not_active Withdrawn
- 2008-02-08 AR ARP080100567A patent/AR065327A1/es unknown
Also Published As
Publication number | Publication date |
---|---|
CA2676293A1 (en) | 2008-08-14 |
US20100064389A1 (en) | 2010-03-11 |
WO2008095916A1 (en) | 2008-08-14 |
BRPI0807010A2 (pt) | 2014-04-22 |
AR065327A1 (es) | 2009-06-03 |
EP2111455A1 (en) | 2009-10-28 |
MX2009007729A (es) | 2009-07-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101605903B (zh) | 丙氨酸消旋酶基因对植物赋予线虫抗性的用途 | |
CN101600803A (zh) | 编码用于控制寄生线虫的截短的蔗糖异构酶多肽的多核苷酸 | |
CN101617050A (zh) | 利用海藻糖酶基因赋予植物线虫抗性 | |
CN101679995A (zh) | 线虫几丁质酶基因用于控制植物寄生性线虫的用途 | |
CN101605896A (zh) | 使用rna干扰控制线虫的组合物和方法 | |
CN101680001A (zh) | 用rna干扰控制线虫的组合物和方法 | |
ES2373614T3 (es) | Composiciones y métodos que utilizan interferencia de arn de tipo cdpk para el control de nemátodos. | |
EP2111452B1 (en) | Compositions and methods using rna interference of opr3-like gene for control of nematodes | |
CN101605904A (zh) | 用靶向mthfr样基因的rna干扰控制线虫的组合物和方法 | |
CN101365786A (zh) | 具有改良的生长特征的植物及其生产方法 | |
US20110258736A1 (en) | Pathogen Control Genes and Methods of Use in Plants | |
CN101605895A (zh) | 用cad-样基因的rna干扰控制线虫的组合物和方法 | |
CN102124025A (zh) | 抗线虫的转基因植物 | |
CN101605897A (zh) | 用sca1-样基因的rna干扰控制线虫的组合物和方法 | |
CN102203260A (zh) | 使用rna干扰控制线虫的组合物和方法 | |
CN101627126A (zh) | pEARLI1样病原体控制基因及其在植物中的使用方法 | |
MX2010011716A (es) | Composiciones y metodos para utilizar rna de interferencia para el control de nematodos. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20091209 |