JP2001169780A - Gene derived from docosahexaenoic acid-producing bacterium - Google Patents
Gene derived from docosahexaenoic acid-producing bacteriumInfo
- Publication number
- JP2001169780A JP2001169780A JP35661499A JP35661499A JP2001169780A JP 2001169780 A JP2001169780 A JP 2001169780A JP 35661499 A JP35661499 A JP 35661499A JP 35661499 A JP35661499 A JP 35661499A JP 2001169780 A JP2001169780 A JP 2001169780A
- Authority
- JP
- Japan
- Prior art keywords
- ala
- leu
- val
- ser
- gly
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 152
- MBMBGCFOFBJSGT-KUBAVDMBSA-N all-cis-docosa-4,7,10,13,16,19-hexaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 title claims abstract description 106
- 235000020669 docosahexaenoic acid Nutrition 0.000 title claims abstract description 76
- 229940090949 docosahexaenoic acid Drugs 0.000 title claims abstract description 30
- 241000894006 Bacteria Species 0.000 title claims abstract description 23
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 80
- 108090000790 Enzymes Proteins 0.000 claims abstract description 41
- 102000004190 Enzymes Human genes 0.000 claims abstract description 39
- 241000592260 Moritella Species 0.000 claims abstract description 4
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 91
- 230000001851 biosynthetic effect Effects 0.000 claims description 31
- 235000020673 eicosapentaenoic acid Nutrition 0.000 claims description 28
- 150000001413 amino acids Chemical class 0.000 claims description 26
- 238000000034 method Methods 0.000 claims description 21
- 239000002773 nucleotide Substances 0.000 claims description 13
- 125000003729 nucleotide group Chemical group 0.000 claims description 13
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 claims description 5
- 229960005135 eicosapentaenoic acid Drugs 0.000 claims description 5
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 3
- 238000012217 deletion Methods 0.000 claims description 2
- 230000037430 deletion Effects 0.000 claims description 2
- 238000006467 substitution reaction Methods 0.000 claims description 2
- 244000005700 microbiome Species 0.000 abstract description 8
- 239000000126 substance Substances 0.000 abstract description 6
- 230000003570 biosynthesizing effect Effects 0.000 abstract 1
- 230000002708 enhancing effect Effects 0.000 abstract 1
- 241000282326 Felis catus Species 0.000 description 83
- 108020004414 DNA Proteins 0.000 description 56
- 241000880493 Leptailurus serval Species 0.000 description 39
- 108010047495 alanylglycine Proteins 0.000 description 34
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 32
- 108010050848 glycylleucine Proteins 0.000 description 32
- 239000002585 base Substances 0.000 description 26
- 230000006870 function Effects 0.000 description 23
- 108010005233 alanylglutamic acid Proteins 0.000 description 19
- 108010053725 prolylvaline Proteins 0.000 description 17
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 16
- 108700026244 Open Reading Frames Proteins 0.000 description 15
- 108010047857 aspartylglycine Proteins 0.000 description 14
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 14
- 108010017391 lysylvaline Proteins 0.000 description 14
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 13
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 13
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 12
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 12
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 12
- 230000015572 biosynthetic process Effects 0.000 description 12
- 108010034529 leucyl-lysine Proteins 0.000 description 12
- 238000003752 polymerase chain reaction Methods 0.000 description 12
- 108010048818 seryl-histidine Proteins 0.000 description 12
- 108010061238 threonyl-glycine Proteins 0.000 description 12
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 11
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 11
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 11
- 108010087924 alanylproline Proteins 0.000 description 11
- 108010093581 aspartyl-proline Proteins 0.000 description 11
- 108010064235 lysylglycine Proteins 0.000 description 11
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 10
- 241000294598 Moritella marina Species 0.000 description 10
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 10
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 10
- 108010037850 glycylvaline Proteins 0.000 description 10
- 101000748061 Acholeplasma phage L2 Uncharacterized 16.1 kDa protein Proteins 0.000 description 9
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 9
- 101000947615 Clostridium perfringens Uncharacterized 38.4 kDa protein Proteins 0.000 description 9
- 101000964391 Enterococcus faecalis UPF0145 protein Proteins 0.000 description 9
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 9
- 101000748063 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 11.1 kDa protein in rep-hol intergenic region Proteins 0.000 description 9
- 101000790840 Klebsiella pneumoniae Uncharacterized 49.5 kDa protein in cps region Proteins 0.000 description 9
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 9
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 9
- 108010092854 aspartyllysine Proteins 0.000 description 9
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 9
- 108010090894 prolylleucine Proteins 0.000 description 9
- 238000003786 synthesis reaction Methods 0.000 description 9
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 8
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 8
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 8
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 8
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 8
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 8
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 8
- 108010079364 N-glycylalanine Proteins 0.000 description 8
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 8
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 8
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 8
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 8
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 8
- 108010041407 alanylaspartic acid Proteins 0.000 description 8
- 108010078144 glutaminyl-glycine Proteins 0.000 description 8
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 8
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 108010078274 isoleucylvaline Proteins 0.000 description 8
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 7
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 7
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 7
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 7
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 7
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 7
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 7
- 241000863430 Shewanella Species 0.000 description 7
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 7
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 7
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 7
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 7
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 7
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 7
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 108010068488 methionylphenylalanine Proteins 0.000 description 7
- 108010051242 phenylalanylserine Proteins 0.000 description 7
- 108010026333 seryl-proline Proteins 0.000 description 7
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 6
- 101710146995 Acyl carrier protein Proteins 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 6
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 6
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 6
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 6
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 6
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 6
- BVLPIIBTWIYOML-ZKWXMUAHSA-N Ala-Val-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BVLPIIBTWIYOML-ZKWXMUAHSA-N 0.000 description 6
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 6
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 6
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 6
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 6
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 6
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 6
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 6
- KPNWAJMEMRCLAL-GUBZILKMSA-N Gln-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KPNWAJMEMRCLAL-GUBZILKMSA-N 0.000 description 6
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 6
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 6
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 6
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 6
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 6
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 6
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 6
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 6
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 6
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 6
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 6
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 6
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 6
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 6
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 6
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 6
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 6
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 6
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 6
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 6
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 6
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 108010016616 cysteinylglycine Proteins 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 108010015792 glycyllysine Proteins 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 6
- 108010009298 lysylglutamic acid Proteins 0.000 description 6
- 108010085203 methionylmethionine Proteins 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 108010004914 prolylarginine Proteins 0.000 description 6
- 239000000523 sample Substances 0.000 description 6
- 108010038745 tryptophylglycine Proteins 0.000 description 6
- 108010020532 tyrosyl-proline Proteins 0.000 description 6
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 5
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 5
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 5
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 5
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 5
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 5
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 5
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 5
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 5
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 5
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 5
- 241000588724 Escherichia coli Species 0.000 description 5
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 5
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 5
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 5
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 5
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 5
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 5
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 5
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 5
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 5
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 5
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 5
- RZJOHSFAEZBWLK-CIUDSAMLSA-N Met-Gln-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N RZJOHSFAEZBWLK-CIUDSAMLSA-N 0.000 description 5
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 5
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 5
- VFWQQZMRKFOGLE-ZLUOBGJFSA-N Ser-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O VFWQQZMRKFOGLE-ZLUOBGJFSA-N 0.000 description 5
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 5
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 5
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 5
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 5
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 5
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- 238000006243 chemical reaction Methods 0.000 description 5
- 235000014113 dietary fatty acids Nutrition 0.000 description 5
- 229930195729 fatty acid Natural products 0.000 description 5
- 239000000194 fatty acid Substances 0.000 description 5
- 150000004665 fatty acids Chemical class 0.000 description 5
- 108010049041 glutamylalanine Proteins 0.000 description 5
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 5
- 230000035772 mutation Effects 0.000 description 5
- 108010070643 prolylglutamic acid Proteins 0.000 description 5
- 108091008146 restriction endonucleases Proteins 0.000 description 5
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 5
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 5
- 108010073969 valyllysine Proteins 0.000 description 5
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 4
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 4
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 4
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 4
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 4
- MIPWEZAIMPYQST-FXQIFTODSA-N Ala-Cys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O MIPWEZAIMPYQST-FXQIFTODSA-N 0.000 description 4
- AWAXZRDKUHOPBO-GUBZILKMSA-N Ala-Gln-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O AWAXZRDKUHOPBO-GUBZILKMSA-N 0.000 description 4
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 4
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 4
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 4
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 4
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 4
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 4
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 4
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 4
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 4
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 4
- RNHKOQHGYMTHFR-UBHSHLNASA-N Ala-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 RNHKOQHGYMTHFR-UBHSHLNASA-N 0.000 description 4
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 4
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 4
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 4
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 4
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 4
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 4
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 4
- IOTKDTZEEBZNCM-UGYAYLCHSA-N Asn-Asn-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOTKDTZEEBZNCM-UGYAYLCHSA-N 0.000 description 4
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 4
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 4
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 4
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 4
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 4
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 4
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 4
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 4
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 4
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 4
- OEUQMKNNOWJREN-AVGNSLFASA-N Asp-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N OEUQMKNNOWJREN-AVGNSLFASA-N 0.000 description 4
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 4
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 4
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 4
- NZWDWXSWUQCNMG-GARJFASQSA-N Asp-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)C(=O)O NZWDWXSWUQCNMG-GARJFASQSA-N 0.000 description 4
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 4
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 4
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 4
- 101710172176 Fasciclin-1 Proteins 0.000 description 4
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 4
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 4
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 4
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 4
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 4
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 4
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 4
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 4
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 4
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 4
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 4
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 4
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 4
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 4
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 4
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 4
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 4
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 4
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 4
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 4
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 4
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 4
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 4
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 4
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 4
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 4
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 4
- XJFITURPHAKKAI-SRVKXCTJSA-N His-Pro-Gln Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(O)=O)C1=CN=CN1 XJFITURPHAKKAI-SRVKXCTJSA-N 0.000 description 4
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 4
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 4
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 4
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 4
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 4
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 4
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 4
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 4
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 4
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 4
- PWDSHAAAFXISLE-SXTJYALSSA-N Ile-Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O PWDSHAAAFXISLE-SXTJYALSSA-N 0.000 description 4
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 4
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 4
- PARSHQDZROHERM-NHCYSSNCSA-N Ile-Lys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N PARSHQDZROHERM-NHCYSSNCSA-N 0.000 description 4
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 4
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 4
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 4
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 4
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 4
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 4
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 4
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 4
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 4
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 4
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 4
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 4
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 4
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 4
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 4
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 4
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 4
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 4
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 4
- VHXMZJGOKIMETG-CQDKDKBSSA-N Lys-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCCCN)N VHXMZJGOKIMETG-CQDKDKBSSA-N 0.000 description 4
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 4
- ZMMDPRTXLAEMOD-BZSNNMDCSA-N Lys-His-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZMMDPRTXLAEMOD-BZSNNMDCSA-N 0.000 description 4
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 4
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 4
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 4
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 4
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 4
- UEEVBGHEGJMDDV-AVGNSLFASA-N Phe-Asp-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEEVBGHEGJMDDV-AVGNSLFASA-N 0.000 description 4
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 4
- KRYSMKKRRRWOCZ-QEWYBTABSA-N Phe-Ile-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KRYSMKKRRRWOCZ-QEWYBTABSA-N 0.000 description 4
- AFNJAQVMTIQTCB-DLOVCJGASA-N Phe-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 AFNJAQVMTIQTCB-DLOVCJGASA-N 0.000 description 4
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 4
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 4
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 4
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 4
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 4
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 4
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 4
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 4
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 4
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 4
- 102100040307 Protein FAM3B Human genes 0.000 description 4
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 4
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 4
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 4
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 4
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 4
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 4
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 4
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 4
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 4
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 4
- PIQRHJQWEPWFJG-UWJYBYFXSA-N Ser-Tyr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PIQRHJQWEPWFJG-UWJYBYFXSA-N 0.000 description 4
- YXGCIEUDOHILKR-IHRRRGAJSA-N Ser-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CO)N YXGCIEUDOHILKR-IHRRRGAJSA-N 0.000 description 4
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 4
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 4
- 102100029437 Serine/threonine-protein kinase A-Raf Human genes 0.000 description 4
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 4
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 4
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 4
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 4
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 4
- BOESUSAIMQGVJD-RYQLBKOJSA-N Trp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N BOESUSAIMQGVJD-RYQLBKOJSA-N 0.000 description 4
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 4
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 4
- 101710134973 Uncharacterized 9.7 kDa protein in cox-rep intergenic region Proteins 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 4
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 4
- WHVSJHJTMUHYBT-SRVKXCTJSA-N Val-Met-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)O)N WHVSJHJTMUHYBT-SRVKXCTJSA-N 0.000 description 4
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 4
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 4
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 4
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 4
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 4
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 4
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 4
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 4
- 108010011559 alanylphenylalanine Proteins 0.000 description 4
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 4
- 235000019867 fractionated palm kernal oil Nutrition 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 108010025306 histidylleucine Proteins 0.000 description 4
- 108010092114 histidylphenylalanine Proteins 0.000 description 4
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 4
- 108010056582 methionylglutamic acid Proteins 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 4
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 3
- YAXNATKKPOWVCP-ZLUOBGJFSA-N Ala-Asn-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O YAXNATKKPOWVCP-ZLUOBGJFSA-N 0.000 description 3
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 3
- DECCMEWNXSNSDO-ZLUOBGJFSA-N Ala-Cys-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DECCMEWNXSNSDO-ZLUOBGJFSA-N 0.000 description 3
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 3
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 3
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 3
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 3
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 3
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 3
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 3
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 3
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 3
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 3
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 3
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 3
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 3
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 3
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 3
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 3
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 3
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 3
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 3
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 3
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 3
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 3
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 3
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 3
- OJNZVYSGVYLQIN-BQBZGAKWSA-N Gly-Met-Asp Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O OJNZVYSGVYLQIN-BQBZGAKWSA-N 0.000 description 3
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 3
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 3
- 101710128038 Hyaluronan synthase Proteins 0.000 description 3
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 3
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 3
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 3
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 3
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 3
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 3
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 3
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 3
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 3
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 3
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 3
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 3
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 3
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 3
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 3
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 3
- JHNOXVASMSXSNB-WEDXCCLWSA-N Lys-Thr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JHNOXVASMSXSNB-WEDXCCLWSA-N 0.000 description 3
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 3
- FBQMBZLJHOQAIH-GUBZILKMSA-N Met-Asp-Met Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O FBQMBZLJHOQAIH-GUBZILKMSA-N 0.000 description 3
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 3
- 241000929620 Moritella marina ATCC 15381 Species 0.000 description 3
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 3
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 3
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 3
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 3
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 3
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 3
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 3
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 3
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 3
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 3
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 3
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 3
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 3
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 3
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 3
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 3
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 3
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 3
- LHHDBONOFZDWMW-AAEUAGOBSA-N Trp-Asp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N LHHDBONOFZDWMW-AAEUAGOBSA-N 0.000 description 3
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 3
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 3
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 3
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 3
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 3
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- WMRWZYSRQUORHJ-YDHLFZDLSA-N Val-Phe-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WMRWZYSRQUORHJ-YDHLFZDLSA-N 0.000 description 3
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- 230000009471 action Effects 0.000 description 3
- 108010070944 alanylhistidine Proteins 0.000 description 3
- 108010013835 arginine glutamate Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 239000013601 cosmid vector Substances 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 235000021323 fish oil Nutrition 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010003700 lysyl aspartic acid Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000013598 vector Substances 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 2
- COEXAQSTZUWMRI-STQMWFEESA-N (2s)-1-[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 COEXAQSTZUWMRI-STQMWFEESA-N 0.000 description 2
- QVOBNSFUVPLVPE-ROUUACIJSA-N 2-[[(2s)-2-[[2-[[(2s)-2-amino-3-phenylpropanoyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 QVOBNSFUVPLVPE-ROUUACIJSA-N 0.000 description 2
- LVPCJMUBOHOZHE-UHFFFAOYSA-N 4-amino-2-[[2-[[2-[(2-amino-3-methylbutanoyl)amino]-3-methylpentanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-4-oxobutanoic acid Chemical compound CC(C)C(N)C(=O)NC(C(C)CC)C(=O)NC(C(=O)NC(CC(N)=O)C(O)=O)CC1=CN=CN1 LVPCJMUBOHOZHE-UHFFFAOYSA-N 0.000 description 2
- 101000621943 Acholeplasma phage L2 Probable integrase/recombinase Proteins 0.000 description 2
- 102000006488 Acyl-Carrier Protein S-Malonyltransferase Human genes 0.000 description 2
- 108010058912 Acyl-Carrier Protein S-Malonyltransferase Proteins 0.000 description 2
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 2
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- VBDMWOKJZDCFJM-FXQIFTODSA-N Ala-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N VBDMWOKJZDCFJM-FXQIFTODSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 2
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 2
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 2
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 2
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 2
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 2
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 2
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 2
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 2
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 2
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- RUQBGIMJQUWXPP-CYDGBPFRSA-N Ala-Leu-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O RUQBGIMJQUWXPP-CYDGBPFRSA-N 0.000 description 2
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 2
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 2
- GFEDXKNBZMPEDM-KZVJFYERSA-N Ala-Met-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GFEDXKNBZMPEDM-KZVJFYERSA-N 0.000 description 2
- HYIDEIQUCBKIPL-CQDKDKBSSA-N Ala-Phe-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N HYIDEIQUCBKIPL-CQDKDKBSSA-N 0.000 description 2
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 2
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 2
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 2
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- CWRBRVZBMVJENN-UVBJJODRSA-N Ala-Trp-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCSC)C(=O)O)N CWRBRVZBMVJENN-UVBJJODRSA-N 0.000 description 2
- BGGAIXWIZCIFSG-XDTLVQLUSA-N Ala-Tyr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O BGGAIXWIZCIFSG-XDTLVQLUSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 2
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 2
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 2
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- 101000618348 Allochromatium vinosum (strain ATCC 17899 / DSM 180 / NBRC 103801 / NCIMB 10441 / D) Uncharacterized protein Alvin_0065 Proteins 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 2
- VYSRNGOMGHOJCK-GUBZILKMSA-N Arg-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N VYSRNGOMGHOJCK-GUBZILKMSA-N 0.000 description 2
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 2
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 2
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 2
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 2
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 2
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 2
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 2
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 2
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 2
- FEZJJKXNPSEYEV-CIUDSAMLSA-N Arg-Gln-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O FEZJJKXNPSEYEV-CIUDSAMLSA-N 0.000 description 2
- RYRQZJVFDVWURI-SRVKXCTJSA-N Arg-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N RYRQZJVFDVWURI-SRVKXCTJSA-N 0.000 description 2
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 2
- DJAIOAKQIOGULM-DCAQKATOSA-N Arg-Glu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O DJAIOAKQIOGULM-DCAQKATOSA-N 0.000 description 2
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 2
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 2
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 2
- RATVAFHGEFAWDH-JYJNAYRXSA-N Arg-Phe-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCN=C(N)N)N RATVAFHGEFAWDH-JYJNAYRXSA-N 0.000 description 2
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 2
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 2
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 2
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 2
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 2
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 2
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 2
- QJWLLRZTJFPCHA-STECZYCISA-N Arg-Tyr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QJWLLRZTJFPCHA-STECZYCISA-N 0.000 description 2
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 2
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 2
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 2
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 2
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 2
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 2
- HJRBIWRXULGMOA-ACZMJKKPSA-N Asn-Gln-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJRBIWRXULGMOA-ACZMJKKPSA-N 0.000 description 2
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 2
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 2
- IHUJUZBUOFTIOB-QEJZJMRPSA-N Asn-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N IHUJUZBUOFTIOB-QEJZJMRPSA-N 0.000 description 2
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 2
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 2
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- YGHCVNQOZZMHRZ-DJFWLOJKSA-N Asn-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N YGHCVNQOZZMHRZ-DJFWLOJKSA-N 0.000 description 2
- SXNJBDYEBOUYOJ-DCAQKATOSA-N Asn-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N SXNJBDYEBOUYOJ-DCAQKATOSA-N 0.000 description 2
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 2
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 2
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 2
- KNENKKKUYGEZIO-FXQIFTODSA-N Asn-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N KNENKKKUYGEZIO-FXQIFTODSA-N 0.000 description 2
- AEZCCDMZZJOGII-DCAQKATOSA-N Asn-Met-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O AEZCCDMZZJOGII-DCAQKATOSA-N 0.000 description 2
- RTFWCVDISAMGEQ-SRVKXCTJSA-N Asn-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N RTFWCVDISAMGEQ-SRVKXCTJSA-N 0.000 description 2
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 2
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 2
- HNXWVVHIGTZTBO-LKXGYXEUSA-N Asn-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O HNXWVVHIGTZTBO-LKXGYXEUSA-N 0.000 description 2
- XHTUGJCAEYOZOR-UBHSHLNASA-N Asn-Ser-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O XHTUGJCAEYOZOR-UBHSHLNASA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 2
- PUUPMDXIHCOPJU-HJGDQZAQSA-N Asn-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O PUUPMDXIHCOPJU-HJGDQZAQSA-N 0.000 description 2
- IPPFAOCLQSGHJV-WFBYXXMGSA-N Asn-Trp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O IPPFAOCLQSGHJV-WFBYXXMGSA-N 0.000 description 2
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 2
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 2
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- VPPXTHJNTYDNFJ-CIUDSAMLSA-N Asp-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N VPPXTHJNTYDNFJ-CIUDSAMLSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- YNQIDCRRTWGHJD-ZLUOBGJFSA-N Asp-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(O)=O YNQIDCRRTWGHJD-ZLUOBGJFSA-N 0.000 description 2
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 2
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 2
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 2
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 2
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 2
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 2
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 2
- LXKLDWVHXNZQGB-SRVKXCTJSA-N Asp-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O LXKLDWVHXNZQGB-SRVKXCTJSA-N 0.000 description 2
- LJRPYAZQQWHEEV-FXQIFTODSA-N Asp-Gln-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O LJRPYAZQQWHEEV-FXQIFTODSA-N 0.000 description 2
- VHQOCWWKXIOAQI-WDSKDSINSA-N Asp-Gln-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VHQOCWWKXIOAQI-WDSKDSINSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 2
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 2
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 2
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 2
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 2
- LNENWJXDHCFVOF-DCAQKATOSA-N Asp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LNENWJXDHCFVOF-DCAQKATOSA-N 0.000 description 2
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 2
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 2
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 2
- IDDMGSKZQDEDGA-SRVKXCTJSA-N Asp-Phe-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 IDDMGSKZQDEDGA-SRVKXCTJSA-N 0.000 description 2
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 2
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 2
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 2
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 2
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 2
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 2
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- CXEFNHOVIIDHFU-IHPCNDPISA-N Asp-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N CXEFNHOVIIDHFU-IHPCNDPISA-N 0.000 description 2
- OYSYWMMZGJSQRB-AVGNSLFASA-N Asp-Tyr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O OYSYWMMZGJSQRB-AVGNSLFASA-N 0.000 description 2
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 2
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 241000972773 Aulopiformes Species 0.000 description 2
- 101000781117 Autographa californica nuclear polyhedrosis virus Uncharacterized 12.4 kDa protein in CTL-LEF2 intergenic region Proteins 0.000 description 2
- 101000708323 Azospirillum brasilense Uncharacterized 28.8 kDa protein in nifR3-like 5'region Proteins 0.000 description 2
- 101000770311 Azotobacter chroococcum mcd 1 Uncharacterized 19.8 kDa protein in nifW 5'region Proteins 0.000 description 2
- 101000748761 Bacillus subtilis (strain 168) Uncharacterized MFS-type transporter YcxA Proteins 0.000 description 2
- 101000765620 Bacillus subtilis (strain 168) Uncharacterized protein YlxP Proteins 0.000 description 2
- 101000916134 Bacillus subtilis (strain 168) Uncharacterized protein YqxJ Proteins 0.000 description 2
- 101000754349 Bordetella pertussis (strain Tohama I / ATCC BAA-589 / NCTC 13251) UPF0065 protein BP0148 Proteins 0.000 description 2
- 101000827633 Caldicellulosiruptor sp. (strain Rt8B.4) Uncharacterized 23.9 kDa protein in xynA 3'region Proteins 0.000 description 2
- 101000947628 Claviceps purpurea Uncharacterized 11.8 kDa protein Proteins 0.000 description 2
- 101000686796 Clostridium perfringens Replication protein Proteins 0.000 description 2
- 241000555825 Clupeidae Species 0.000 description 2
- XEEIQMGZRFFSRD-XVYDVKMFSA-N Cys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N XEEIQMGZRFFSRD-XVYDVKMFSA-N 0.000 description 2
- FMDCYTBSPZMPQE-JBDRJPRFSA-N Cys-Ala-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMDCYTBSPZMPQE-JBDRJPRFSA-N 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- GGIHYKLJUIZYGH-ZLUOBGJFSA-N Cys-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)O GGIHYKLJUIZYGH-ZLUOBGJFSA-N 0.000 description 2
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 2
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 2
- ZOMMHASZJQRLFS-IHRRRGAJSA-N Cys-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N ZOMMHASZJQRLFS-IHRRRGAJSA-N 0.000 description 2
- NGOIQDYZMIKCOK-NAKRPEOUSA-N Cys-Val-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NGOIQDYZMIKCOK-NAKRPEOUSA-N 0.000 description 2
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 2
- ALTQTAKGRFLRLR-GUBZILKMSA-N Cys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N ALTQTAKGRFLRLR-GUBZILKMSA-N 0.000 description 2
- 108010090461 DFG peptide Proteins 0.000 description 2
- -1 DHA ester Chemical class 0.000 description 2
- 101000788129 Escherichia coli Uncharacterized protein in sul1 3'region Proteins 0.000 description 2
- 101000788370 Escherichia phage P2 Uncharacterized 12.9 kDa protein in GpA 3'region Proteins 0.000 description 2
- 108010092526 GKPV peptide Proteins 0.000 description 2
- 101000787096 Geobacillus stearothermophilus Uncharacterized protein in gldA 3'region Proteins 0.000 description 2
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 2
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 2
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 2
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 2
- DLOHWQXXGMEZDW-CIUDSAMLSA-N Gln-Arg-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DLOHWQXXGMEZDW-CIUDSAMLSA-N 0.000 description 2
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 2
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 2
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 2
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 2
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 2
- RKAQZCDMSUQTSS-FXQIFTODSA-N Gln-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RKAQZCDMSUQTSS-FXQIFTODSA-N 0.000 description 2
- PZVJDMJHKUWSIV-AVGNSLFASA-N Gln-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)O PZVJDMJHKUWSIV-AVGNSLFASA-N 0.000 description 2
- IVCOYUURLWQDJQ-LPEHRKFASA-N Gln-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O IVCOYUURLWQDJQ-LPEHRKFASA-N 0.000 description 2
- QFJPFPCSXOXMKI-BPUTZDHNSA-N Gln-Gln-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N QFJPFPCSXOXMKI-BPUTZDHNSA-N 0.000 description 2
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 2
- KDXKFBSNIJYNNR-YVNDNENWSA-N Gln-Glu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDXKFBSNIJYNNR-YVNDNENWSA-N 0.000 description 2
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 2
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 2
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 2
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 2
- XQEAVUJIRZRLQQ-SZMVWBNQSA-N Gln-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)[C@H](CCC(=O)N)N XQEAVUJIRZRLQQ-SZMVWBNQSA-N 0.000 description 2
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 2
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 2
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 2
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 2
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 2
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 2
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 2
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 2
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 2
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 2
- PDXIOFXRBVDSHD-JBACZVJFSA-N Gln-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)NC(=O)[C@H](CCC(=O)N)N PDXIOFXRBVDSHD-JBACZVJFSA-N 0.000 description 2
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 2
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 2
- OKARHJKJTKFQBM-ACZMJKKPSA-N Gln-Ser-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OKARHJKJTKFQBM-ACZMJKKPSA-N 0.000 description 2
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 2
- GTBXHETZPUURJE-KKUMJFAQSA-N Gln-Tyr-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GTBXHETZPUURJE-KKUMJFAQSA-N 0.000 description 2
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 2
- SJMJMEWQMBJYPR-DZKIICNBSA-N Gln-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N SJMJMEWQMBJYPR-DZKIICNBSA-N 0.000 description 2
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 2
- ICRKQMRFXYDYMK-LAEOZQHASA-N Gln-Val-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ICRKQMRFXYDYMK-LAEOZQHASA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 2
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 2
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 2
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 2
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 2
- SVZIKUHLRKVZIF-GUBZILKMSA-N Glu-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N SVZIKUHLRKVZIF-GUBZILKMSA-N 0.000 description 2
- RDDSZZJOKDVPAE-ACZMJKKPSA-N Glu-Asn-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDDSZZJOKDVPAE-ACZMJKKPSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 2
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 2
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 2
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- ZJFNRQHUIHKZJF-GUBZILKMSA-N Glu-His-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O ZJFNRQHUIHKZJF-GUBZILKMSA-N 0.000 description 2
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 2
- ZGKXAUIVGIBISK-SZMVWBNQSA-N Glu-His-Trp Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1c[nH]cn1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O ZGKXAUIVGIBISK-SZMVWBNQSA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 2
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 2
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 2
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 2
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 2
- QMOSCLNJVKSHHU-YUMQZZPRSA-N Glu-Met-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O QMOSCLNJVKSHHU-YUMQZZPRSA-N 0.000 description 2
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 2
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 2
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- LWYUQLZOIORFFJ-XKBZYTNZSA-N Glu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O LWYUQLZOIORFFJ-XKBZYTNZSA-N 0.000 description 2
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 2
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 2
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 2
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 2
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 2
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 2
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- PHONXOACARQMPM-BQBZGAKWSA-N Gly-Ala-Met Chemical compound [H]NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O PHONXOACARQMPM-BQBZGAKWSA-N 0.000 description 2
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 2
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 2
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- GVVKYKCOFMMTKZ-WHFBIAKZSA-N Gly-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)CN GVVKYKCOFMMTKZ-WHFBIAKZSA-N 0.000 description 2
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- AQLHORCVPGXDJW-IUCAKERBSA-N Gly-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN AQLHORCVPGXDJW-IUCAKERBSA-N 0.000 description 2
- GNPVTZJUUBPZKW-WDSKDSINSA-N Gly-Gln-Ser Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GNPVTZJUUBPZKW-WDSKDSINSA-N 0.000 description 2
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 2
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 2
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 2
- SIYTVHWNKGIGMD-HOTGVXAUSA-N Gly-His-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC3=CN=CN3)NC(=O)CN SIYTVHWNKGIGMD-HOTGVXAUSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 2
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 2
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 2
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 2
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 2
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 2
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 2
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 2
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 2
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 2
- RVGMVLVBDRQVKB-UWVGGRQHSA-N Gly-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN RVGMVLVBDRQVKB-UWVGGRQHSA-N 0.000 description 2
- LPHQAFLNEHWKFF-QXEWZRGKSA-N Gly-Met-Ile Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LPHQAFLNEHWKFF-QXEWZRGKSA-N 0.000 description 2
- FJWSJWACLMTDMI-WPRPVWTQSA-N Gly-Met-Val Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O FJWSJWACLMTDMI-WPRPVWTQSA-N 0.000 description 2
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 2
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 2
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 2
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 2
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 2
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 2
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 2
- IHDKKJVBLGXLEL-STQMWFEESA-N Gly-Tyr-Met Chemical compound CSCC[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)CN)C(O)=O IHDKKJVBLGXLEL-STQMWFEESA-N 0.000 description 2
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 2
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 2
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 2
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- 101000976889 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 19.2 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- HTZKFIYQMHJWSQ-INTQDDNPSA-N His-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N HTZKFIYQMHJWSQ-INTQDDNPSA-N 0.000 description 2
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 2
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 2
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 2
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 2
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 2
- TVRMJKNELJKNRS-GUBZILKMSA-N His-Glu-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N TVRMJKNELJKNRS-GUBZILKMSA-N 0.000 description 2
- BQFGKVYHKCNEMF-DCAQKATOSA-N His-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 BQFGKVYHKCNEMF-DCAQKATOSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- CSTNMMIHMYJGFR-IHRRRGAJSA-N His-His-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 CSTNMMIHMYJGFR-IHRRRGAJSA-N 0.000 description 2
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 2
- VTZYMXGGXOFBMX-DJFWLOJKSA-N His-Ile-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O VTZYMXGGXOFBMX-DJFWLOJKSA-N 0.000 description 2
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 2
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 2
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 2
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 2
- SAPLASXFNUYUFE-CQDKDKBSSA-N His-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CN=CN2)N SAPLASXFNUYUFE-CQDKDKBSSA-N 0.000 description 2
- KYFGGRHWLFZXPU-KKUMJFAQSA-N His-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N KYFGGRHWLFZXPU-KKUMJFAQSA-N 0.000 description 2
- BSVLMPMIXPQNKC-KBPBESRZSA-N His-Phe-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O BSVLMPMIXPQNKC-KBPBESRZSA-N 0.000 description 2
- BRQKGRLDDDQWQJ-MBLNEYKQSA-N His-Thr-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O BRQKGRLDDDQWQJ-MBLNEYKQSA-N 0.000 description 2
- JUCZDDVZBMPKRT-IXOXFDKPSA-N His-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O JUCZDDVZBMPKRT-IXOXFDKPSA-N 0.000 description 2
- ALPXXNRQBMRCPZ-MEYUZBJRSA-N His-Thr-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ALPXXNRQBMRCPZ-MEYUZBJRSA-N 0.000 description 2
- UWSMZKRTOZEGDD-CUJWVEQBSA-N His-Thr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O UWSMZKRTOZEGDD-CUJWVEQBSA-N 0.000 description 2
- LPBWRHRHEIYAIP-KKUMJFAQSA-N His-Tyr-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LPBWRHRHEIYAIP-KKUMJFAQSA-N 0.000 description 2
- RNVUQLOKVIPNEM-BZSNNMDCSA-N His-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O RNVUQLOKVIPNEM-BZSNNMDCSA-N 0.000 description 2
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 2
- CSTDQOOBZBAJKE-BWAGICSOSA-N His-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N)O CSTDQOOBZBAJKE-BWAGICSOSA-N 0.000 description 2
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 2
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 2
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 2
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 2
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 2
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 2
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 2
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 2
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 2
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 2
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 2
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 2
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 2
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 2
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 2
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 2
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 2
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 2
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 2
- RCMNUBZKIIJCOI-ZPFDUUQYSA-N Ile-Met-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RCMNUBZKIIJCOI-ZPFDUUQYSA-N 0.000 description 2
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 2
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 2
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 2
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 2
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 2
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 2
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 2
- 101000827627 Klebsiella pneumoniae Putative low molecular weight protein-tyrosine-phosphatase Proteins 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 2
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 2
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 2
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 2
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 2
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- IASQBRJGRVXNJI-YUMQZZPRSA-N Leu-Cys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)NCC(O)=O IASQBRJGRVXNJI-YUMQZZPRSA-N 0.000 description 2
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 2
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 2
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 2
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 2
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 2
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 2
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 2
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 2
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 2
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 2
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 2
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- GAOJCVKPIGHTGO-UWVGGRQHSA-N Lys-Arg-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O GAOJCVKPIGHTGO-UWVGGRQHSA-N 0.000 description 2
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 2
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 2
- NCTDKZKNBDZDOL-GARJFASQSA-N Lys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O NCTDKZKNBDZDOL-GARJFASQSA-N 0.000 description 2
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 2
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 2
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 2
- PHHYNOUOUWYQRO-XIRDDKMYSA-N Lys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N PHHYNOUOUWYQRO-XIRDDKMYSA-N 0.000 description 2
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 2
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 2
- MQMIRLVJXQNTRJ-SDDRHHMPSA-N Lys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N)C(=O)O MQMIRLVJXQNTRJ-SDDRHHMPSA-N 0.000 description 2
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 2
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 2
- 108010003266 Lys-Leu-Tyr-Asp Proteins 0.000 description 2
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 2
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 2
- IPSDPDAOSAEWCN-RHYQMDGZSA-N Lys-Met-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IPSDPDAOSAEWCN-RHYQMDGZSA-N 0.000 description 2
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 2
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 2
- AZOFEHCPMBRNFD-BZSNNMDCSA-N Lys-Phe-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 AZOFEHCPMBRNFD-BZSNNMDCSA-N 0.000 description 2
- LUAJJLPHUXPQLH-KKUMJFAQSA-N Lys-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCCN)N LUAJJLPHUXPQLH-KKUMJFAQSA-N 0.000 description 2
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 2
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 2
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 2
- ZUGVARDEGWMMLK-SRVKXCTJSA-N Lys-Ser-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN ZUGVARDEGWMMLK-SRVKXCTJSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- GIKFNMZSGYAPEJ-HJGDQZAQSA-N Lys-Thr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O GIKFNMZSGYAPEJ-HJGDQZAQSA-N 0.000 description 2
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 2
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 2
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 2
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 2
- FRWZTWWOORIIBA-FXQIFTODSA-N Met-Asn-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FRWZTWWOORIIBA-FXQIFTODSA-N 0.000 description 2
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 2
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 2
- FVKRQMQQFGBXHV-QXEWZRGKSA-N Met-Asp-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FVKRQMQQFGBXHV-QXEWZRGKSA-N 0.000 description 2
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 2
- VOOINLQYUZOREH-SRVKXCTJSA-N Met-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N VOOINLQYUZOREH-SRVKXCTJSA-N 0.000 description 2
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 2
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 2
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 2
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 2
- BKIFWLQFOOKUCA-DCAQKATOSA-N Met-His-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N BKIFWLQFOOKUCA-DCAQKATOSA-N 0.000 description 2
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 2
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 2
- JKXVPNCSAMWUEJ-GUBZILKMSA-N Met-Met-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O JKXVPNCSAMWUEJ-GUBZILKMSA-N 0.000 description 2
- HUURTRNKPBHHKZ-JYJNAYRXSA-N Met-Phe-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 HUURTRNKPBHHKZ-JYJNAYRXSA-N 0.000 description 2
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 2
- XPVCDCMPKCERFT-GUBZILKMSA-N Met-Ser-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XPVCDCMPKCERFT-GUBZILKMSA-N 0.000 description 2
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- SPSSJSICDYYTQN-HJGDQZAQSA-N Met-Thr-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O SPSSJSICDYYTQN-HJGDQZAQSA-N 0.000 description 2
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 2
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 2
- RMLWDZINJUDMEB-IHRRRGAJSA-N Met-Tyr-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RMLWDZINJUDMEB-IHRRRGAJSA-N 0.000 description 2
- VYXIKLFLGRTANT-HRCADAONSA-N Met-Tyr-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N VYXIKLFLGRTANT-HRCADAONSA-N 0.000 description 2
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 2
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- 101001130841 Middle East respiratory syndrome-related coronavirus (isolate United Kingdom/H123990006/2012) Non-structural protein ORF5 Proteins 0.000 description 2
- 102000006833 Multifunctional Enzymes Human genes 0.000 description 2
- 108010047290 Multifunctional Enzymes Proteins 0.000 description 2
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 2
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 2
- 102000004316 Oxidoreductases Human genes 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- WSXKXSBOJXEZDV-DLOVCJGASA-N Phe-Ala-Asn Chemical compound NC(=O)C[C@@H](C([O-])=O)NC(=O)[C@H](C)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 WSXKXSBOJXEZDV-DLOVCJGASA-N 0.000 description 2
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 2
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- UHRNIXJAGGLKHP-DLOVCJGASA-N Phe-Ala-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O UHRNIXJAGGLKHP-DLOVCJGASA-N 0.000 description 2
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 2
- OXUMFAOVGFODPN-KKUMJFAQSA-N Phe-Asn-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OXUMFAOVGFODPN-KKUMJFAQSA-N 0.000 description 2
- ZENDEDYRYVHBEG-SRVKXCTJSA-N Phe-Asp-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZENDEDYRYVHBEG-SRVKXCTJSA-N 0.000 description 2
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 2
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 2
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 2
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 2
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 2
- OYQBFWWQSVIHBN-FHWLQOOXSA-N Phe-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O OYQBFWWQSVIHBN-FHWLQOOXSA-N 0.000 description 2
- NPLGQVKZFGJWAI-QWHCGFSZSA-N Phe-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O NPLGQVKZFGJWAI-QWHCGFSZSA-N 0.000 description 2
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 2
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 2
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 2
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 2
- MJAYDXWQQUOURZ-JYJNAYRXSA-N Phe-Lys-Gln Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MJAYDXWQQUOURZ-JYJNAYRXSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 2
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 2
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 2
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 2
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 2
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 2
- HBXAOEBRGLCLIW-AVGNSLFASA-N Phe-Ser-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HBXAOEBRGLCLIW-AVGNSLFASA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 2
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 2
- AGTHXWTYCLLYMC-FHWLQOOXSA-N Phe-Tyr-Glu Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 AGTHXWTYCLLYMC-FHWLQOOXSA-N 0.000 description 2
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 2
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- 101001135788 Pinus taeda (+)-alpha-pinene synthase, chloroplastic Proteins 0.000 description 2
- 108010030975 Polyketide Synthases Proteins 0.000 description 2
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 2
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 2
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 2
- SMCHPSMKAFIERP-FXQIFTODSA-N Pro-Asn-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 SMCHPSMKAFIERP-FXQIFTODSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- PZSCUPVOJGKHEP-CIUDSAMLSA-N Pro-Gln-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PZSCUPVOJGKHEP-CIUDSAMLSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- KTFZQPLSPLWLKN-KKUMJFAQSA-N Pro-Gln-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KTFZQPLSPLWLKN-KKUMJFAQSA-N 0.000 description 2
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 2
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 2
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 2
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 2
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 2
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 2
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 2
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 2
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 2
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 2
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 2
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 2
- RPLMFKUKFZOTER-AVGNSLFASA-N Pro-Met-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 RPLMFKUKFZOTER-AVGNSLFASA-N 0.000 description 2
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 2
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 2
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 2
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- BJCXXMGGPHRSHV-GUBZILKMSA-N Pro-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BJCXXMGGPHRSHV-GUBZILKMSA-N 0.000 description 2
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 2
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 2
- AJJDPGVVNPUZCR-RHYQMDGZSA-N Pro-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1)O AJJDPGVVNPUZCR-RHYQMDGZSA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 2
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 2
- OOZJHTXCLJUODH-QXEWZRGKSA-N Pro-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 OOZJHTXCLJUODH-QXEWZRGKSA-N 0.000 description 2
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 2
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 101000974028 Rhizobium leguminosarum bv. viciae (strain 3841) Putative cystathionine beta-lyase Proteins 0.000 description 2
- 101000756519 Rhodobacter capsulatus (strain ATCC BAA-309 / NBRC 16581 / SB1003) Uncharacterized protein RCAP_rcc00048 Proteins 0.000 description 2
- 101000948219 Rhodococcus erythropolis Uncharacterized 11.5 kDa protein in thcD 3'region Proteins 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 2
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 2
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 2
- UAJAYRMZGNQILN-BQBZGAKWSA-N Ser-Gly-Met Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UAJAYRMZGNQILN-BQBZGAKWSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 2
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 2
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 2
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 2
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- OWCVUSJMEBGMOK-YUMQZZPRSA-N Ser-Lys-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O OWCVUSJMEBGMOK-YUMQZZPRSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- XKFJENWJGHMDLI-QWRGUYRKSA-N Ser-Phe-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O XKFJENWJGHMDLI-QWRGUYRKSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 2
- QUGRFWPMPVIAPW-IHRRRGAJSA-N Ser-Pro-Phe Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QUGRFWPMPVIAPW-IHRRRGAJSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 2
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 2
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 2
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 2
- HAYADTTXNZFUDM-IHRRRGAJSA-N Ser-Tyr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HAYADTTXNZFUDM-IHRRRGAJSA-N 0.000 description 2
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 2
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- 101000936711 Streptococcus gordonii Accessory secretory protein Asp4 Proteins 0.000 description 2
- 101000929863 Streptomyces cinnamonensis Monensin polyketide synthase putative ketoacyl reductase Proteins 0.000 description 2
- 241000187432 Streptomyces coelicolor Species 0.000 description 2
- 101000788468 Streptomyces coelicolor Uncharacterized protein in mprR 3'region Proteins 0.000 description 2
- 101000845085 Streptomyces violaceoruber Granaticin polyketide synthase putative ketoacyl reductase 1 Proteins 0.000 description 2
- 101150006914 TRP1 gene Proteins 0.000 description 2
- 101000711771 Thiocystis violacea Uncharacterized 76.5 kDa protein in phbC 3'region Proteins 0.000 description 2
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- SWIKDOUVROTZCW-GCJQMDKQSA-N Thr-Asn-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O SWIKDOUVROTZCW-GCJQMDKQSA-N 0.000 description 2
- VIBXMCZWVUOZLA-OLHMAJIHSA-N Thr-Asn-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O VIBXMCZWVUOZLA-OLHMAJIHSA-N 0.000 description 2
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 2
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 2
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 2
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 2
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 2
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 2
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 2
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 2
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 2
- JMBRNXUOLJFURW-BEAPCOKYSA-N Thr-Phe-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N)O JMBRNXUOLJFURW-BEAPCOKYSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 2
- DNCUODYZAMHLCV-XGEHTFHBSA-N Thr-Pro-Cys Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N)O DNCUODYZAMHLCV-XGEHTFHBSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 2
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 2
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 2
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 2
- OMRWDMWXRWTQIU-YJRXYDGGSA-N Thr-Tyr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CS)C(=O)O)N)O OMRWDMWXRWTQIU-YJRXYDGGSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 2
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 2
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 2
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 2
- PNHABSVRPFBUJY-UMPQAUOISA-N Trp-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O PNHABSVRPFBUJY-UMPQAUOISA-N 0.000 description 2
- ZCPCXVJOMUPIDD-IHPCNDPISA-N Trp-Asp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 ZCPCXVJOMUPIDD-IHPCNDPISA-N 0.000 description 2
- SNJAPSVIPKUMCK-NWLDYVSISA-N Trp-Glu-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SNJAPSVIPKUMCK-NWLDYVSISA-N 0.000 description 2
- HJXWDGGIORSQQF-WDSOQIARSA-N Trp-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N HJXWDGGIORSQQF-WDSOQIARSA-N 0.000 description 2
- WKQNLTQSCYXKQK-VFAJRCTISA-N Trp-Lys-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WKQNLTQSCYXKQK-VFAJRCTISA-N 0.000 description 2
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 2
- UQHPXCFAHVTWFU-BVSLBCMMSA-N Trp-Phe-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UQHPXCFAHVTWFU-BVSLBCMMSA-N 0.000 description 2
- XOLLWQIBBLBAHQ-WDSOQIARSA-N Trp-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O XOLLWQIBBLBAHQ-WDSOQIARSA-N 0.000 description 2
- STKZKWFOKOCSLW-UMPQAUOISA-N Trp-Thr-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)[C@@H](C)O)=CNC2=C1 STKZKWFOKOCSLW-UMPQAUOISA-N 0.000 description 2
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 2
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 description 2
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 2
- RKISDJMICOREEL-QRTARXTBSA-N Trp-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RKISDJMICOREEL-QRTARXTBSA-N 0.000 description 2
- PCXFIOFKIMNHGR-UHFFFAOYSA-N Tyr Trp Gly Ser Chemical compound C=1NC2=CC=CC=C2C=1CC(C(=O)NCC(=O)NC(CO)C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 PCXFIOFKIMNHGR-UHFFFAOYSA-N 0.000 description 2
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 2
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 2
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 2
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 2
- QNJYPWZACBACER-KKUMJFAQSA-N Tyr-Asp-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O QNJYPWZACBACER-KKUMJFAQSA-N 0.000 description 2
- TZXFLDNBYYGLKA-BZSNNMDCSA-N Tyr-Asp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 TZXFLDNBYYGLKA-BZSNNMDCSA-N 0.000 description 2
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 2
- BVOCLAPFOBSJHR-KKUMJFAQSA-N Tyr-Cys-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O BVOCLAPFOBSJHR-KKUMJFAQSA-N 0.000 description 2
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 2
- QUILOGWWLXMSAT-IHRRRGAJSA-N Tyr-Gln-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QUILOGWWLXMSAT-IHRRRGAJSA-N 0.000 description 2
- CKHQKYHIZCRTAP-SOUVJXGZSA-N Tyr-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CKHQKYHIZCRTAP-SOUVJXGZSA-N 0.000 description 2
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 2
- GIOBXJSONRQHKQ-RYUDHWBXSA-N Tyr-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O GIOBXJSONRQHKQ-RYUDHWBXSA-N 0.000 description 2
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 2
- LFCQXIXJQXWZJI-BZSNNMDCSA-N Tyr-His-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O LFCQXIXJQXWZJI-BZSNNMDCSA-N 0.000 description 2
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 2
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 2
- HHFMNAVFGBYSAT-IGISWZIWSA-N Tyr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HHFMNAVFGBYSAT-IGISWZIWSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- ARJASMXQBRNAGI-YESZJQIVSA-N Tyr-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N ARJASMXQBRNAGI-YESZJQIVSA-N 0.000 description 2
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 2
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 2
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 2
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 2
- ZZDYJFVIKVSUFA-WLTAIBSBSA-N Tyr-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ZZDYJFVIKVSUFA-WLTAIBSBSA-N 0.000 description 2
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 2
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 2
- RGJZPXFZIUUQDN-BPNCWPANSA-N Tyr-Val-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O RGJZPXFZIUUQDN-BPNCWPANSA-N 0.000 description 2
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 2
- OBKOPLHSRDATFO-XHSDSOJGSA-N Tyr-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OBKOPLHSRDATFO-XHSDSOJGSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 2
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 2
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 2
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 2
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 2
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 2
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 2
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 2
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 2
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- XXWBHOWRARMUOC-NHCYSSNCSA-N Val-Lys-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XXWBHOWRARMUOC-NHCYSSNCSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 2
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 2
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 2
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 2
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 2
- SDHZOOIGIUEPDY-JYJNAYRXSA-N Val-Ser-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CO)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 SDHZOOIGIUEPDY-JYJNAYRXSA-N 0.000 description 2
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 2
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 2
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 2
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 2
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 101000711318 Vibrio alginolyticus Uncharacterized 11.6 kDa protein in scrR 3'region Proteins 0.000 description 2
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 2
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 2
- 108010033011 des-Arg- enterostatin Proteins 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 108010009297 diglycyl-histidine Proteins 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000013604 expression vector Substances 0.000 description 2
- 230000004136 fatty acid synthesis Effects 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 238000004817 gas chromatography Methods 0.000 description 2
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 2
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 2
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 2
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 2
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 2
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 2
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 2
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 2
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 2
- 229930027917 kanamycin Natural products 0.000 description 2
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 2
- 229960000318 kanamycin Drugs 0.000 description 2
- 229930182823 kanamycin A Natural products 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 2
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 2
- 108010054155 lysyllysine Proteins 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010074082 phenylalanyl-alanyl-lysine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010018625 phenylalanylarginine Proteins 0.000 description 2
- 108010012581 phenylalanylglutamate Proteins 0.000 description 2
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 229930001119 polyketide Natural products 0.000 description 2
- 108010079317 prolyl-tyrosine Proteins 0.000 description 2
- 108010029020 prolylglycine Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 102220201851 rs143406017 Human genes 0.000 description 2
- 235000019515 salmon Nutrition 0.000 description 2
- 235000019512 sardine Nutrition 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010051110 tyrosyl-lysine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical compound O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 description 1
- 101000818089 Acholeplasma phage L2 Uncharacterized 25.6 kDa protein Proteins 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 108700037654 Acyl carrier protein (ACP) Proteins 0.000 description 1
- 102000048456 Acyl carrier protein (ACP) Human genes 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- STACJSVFHSEZJV-GHCJXIJMSA-N Ala-Asn-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STACJSVFHSEZJV-GHCJXIJMSA-N 0.000 description 1
- JYEBJTDTPNKQJG-FXQIFTODSA-N Ala-Asn-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N JYEBJTDTPNKQJG-FXQIFTODSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 1
- AJBVYEYZVYPFCF-CIUDSAMLSA-N Ala-Lys-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O AJBVYEYZVYPFCF-CIUDSAMLSA-N 0.000 description 1
- IAUSCRHURCZUJP-CIUDSAMLSA-N Ala-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CS)C(O)=O IAUSCRHURCZUJP-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- HIIJOGIBQXHFKE-HHKYUTTNSA-N Ala-Thr-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O HIIJOGIBQXHFKE-HHKYUTTNSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- RCAUJZASOAFTAJ-FXQIFTODSA-N Arg-Asp-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N RCAUJZASOAFTAJ-FXQIFTODSA-N 0.000 description 1
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- PSZDKHGMNWAHGO-UHFFFAOYSA-N Asn Asn Pro Val Chemical compound CC(C)C(C(O)=O)NC(=O)C1CCCN1C(=O)C(CC(N)=O)NC(=O)C(N)CC(N)=O PSZDKHGMNWAHGO-UHFFFAOYSA-N 0.000 description 1
- PDQBXRSOSCTGKY-ACZMJKKPSA-N Asn-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PDQBXRSOSCTGKY-ACZMJKKPSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- LZLCLRQMUQWUHJ-GUBZILKMSA-N Asn-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N LZLCLRQMUQWUHJ-GUBZILKMSA-N 0.000 description 1
- NNDSLVWAQAUPPP-GUBZILKMSA-N Asn-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N NNDSLVWAQAUPPP-GUBZILKMSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 1
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 1
- FIRWLDUOFOULCA-XIRDDKMYSA-N Asp-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N FIRWLDUOFOULCA-XIRDDKMYSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- 101000770875 Autographa californica nuclear polyhedrosis virus Uncharacterized 14.2 kDa protein in PK1-LEF1 intergenic region Proteins 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101001074429 Bacillus subtilis (strain 168) Polyketide biosynthesis acyltransferase homolog PksD Proteins 0.000 description 1
- 101000936617 Bacillus velezensis (strain DSM 23117 / BGSC 10A6 / FZB42) Polyketide biosynthesis acyltransferase homolog BaeD Proteins 0.000 description 1
- 101000736909 Campylobacter jejuni Probable nucleotidyltransferase Proteins 0.000 description 1
- 102100033029 Carbonic anhydrase-related protein 11 Human genes 0.000 description 1
- 241001149724 Cololabis adocetus Species 0.000 description 1
- 241000195493 Cryptophyta Species 0.000 description 1
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- VCIIDXDOPGHMDQ-WDSKDSINSA-N Cys-Gly-Gln Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VCIIDXDOPGHMDQ-WDSKDSINSA-N 0.000 description 1
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 1
- XKDHARKYRGHLKO-QEJZJMRPSA-N Cys-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N XKDHARKYRGHLKO-QEJZJMRPSA-N 0.000 description 1
- UOEYKPDDHSFMLI-DCAQKATOSA-N Cys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N UOEYKPDDHSFMLI-DCAQKATOSA-N 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 238000007399 DNA isolation Methods 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 1
- FMBWLLMUPXTXFC-SDDRHHMPSA-N Glu-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N)C(=O)O FMBWLLMUPXTXFC-SDDRHHMPSA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- XUDLUKYPXQDCRX-BQBZGAKWSA-N Gly-Arg-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O XUDLUKYPXQDCRX-BQBZGAKWSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 1
- YZPVGIVFMZLQMM-YUMQZZPRSA-N Gly-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN YZPVGIVFMZLQMM-YUMQZZPRSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- 229930186217 Glycolipid Natural products 0.000 description 1
- 102100036738 Guanine nucleotide-binding protein subunit alpha-11 Human genes 0.000 description 1
- 101000748060 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.3 kDa protein in rep-hol intergenic region Proteins 0.000 description 1
- 101000623276 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiBIM 5'region Proteins 0.000 description 1
- 101000623175 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiCIIM 5'region Proteins 0.000 description 1
- 101000626850 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiEIM 5'region Proteins 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- 101000867841 Homo sapiens Carbonic anhydrase-related protein 11 Proteins 0.000 description 1
- 101100283445 Homo sapiens GNA11 gene Proteins 0.000 description 1
- 101001075218 Homo sapiens Gastrokine-1 Proteins 0.000 description 1
- 101000833492 Homo sapiens Jouberin Proteins 0.000 description 1
- 101000651236 Homo sapiens NCK-interacting protein with SH3 domain Proteins 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- RWYCOSAAAJBJQL-KCTSRDHCSA-N Ile-Gly-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N RWYCOSAAAJBJQL-KCTSRDHCSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- RIVKTKFVWXRNSJ-GRLWGSQLSA-N Ile-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RIVKTKFVWXRNSJ-GRLWGSQLSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- 102100024407 Jouberin Human genes 0.000 description 1
- 101000768313 Klebsiella pneumoniae Uncharacterized membrane protein in cps region Proteins 0.000 description 1
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 1
- AUNMOHYWTAPQLA-XUXIUFHCSA-N Leu-Met-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AUNMOHYWTAPQLA-XUXIUFHCSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 1
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- 101001090725 Leuconostoc gelidum Bacteriocin leucocin-A Proteins 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 241000906444 Loristes Species 0.000 description 1
- 239000006142 Luria-Bertani Agar Substances 0.000 description 1
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- CKSBRMUOQDNPKZ-SRVKXCTJSA-N Lys-Gln-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CKSBRMUOQDNPKZ-SRVKXCTJSA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- AEIIJFBQVGYVEV-YESZJQIVSA-N Lys-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCCN)N)C(=O)O AEIIJFBQVGYVEV-YESZJQIVSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- YRNRVKTYDSLKMD-KKUMJFAQSA-N Lys-Ser-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YRNRVKTYDSLKMD-KKUMJFAQSA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- DLAFCQWUMFMZSN-GUBZILKMSA-N Met-Arg-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N DLAFCQWUMFMZSN-GUBZILKMSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- FYRUJIJAUPHUNB-IUCAKERBSA-N Met-Gly-Arg Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N FYRUJIJAUPHUNB-IUCAKERBSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- QGRJTULYDZUBAY-ZPFDUUQYSA-N Met-Ile-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGRJTULYDZUBAY-ZPFDUUQYSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- WXUUEPIDLLQBLJ-DCAQKATOSA-N Met-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N WXUUEPIDLLQBLJ-DCAQKATOSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- VWWGEKCAPBMIFE-SRVKXCTJSA-N Met-Met-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VWWGEKCAPBMIFE-SRVKXCTJSA-N 0.000 description 1
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 1
- RMLLCGYYVZKKRT-CIUDSAMLSA-N Met-Ser-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O RMLLCGYYVZKKRT-CIUDSAMLSA-N 0.000 description 1
- 101000804418 Methanothermobacter thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM 10044 / NBRC 100330 / Delta H) Uncharacterized protein MTH_1463 Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- 239000004677 Nylon Substances 0.000 description 1
- 101150112539 OR gene Proteins 0.000 description 1
- 101710087110 ORF6 protein Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 101000770870 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 37.2 kDa protein Proteins 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- JNRFYJZCMHHGMH-UBHSHLNASA-N Phe-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JNRFYJZCMHHGMH-UBHSHLNASA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- DJPXNKUDJKGQEE-BZSNNMDCSA-N Phe-Asp-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DJPXNKUDJKGQEE-BZSNNMDCSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 1
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 1
- 101001062854 Rattus norvegicus Fatty acid-binding protein 5 Proteins 0.000 description 1
- 241000269821 Scombridae Species 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- QPFJSHSJFIYDJZ-GHCJXIJMSA-N Ser-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO QPFJSHSJFIYDJZ-GHCJXIJMSA-N 0.000 description 1
- WKLJLEXEENIYQE-SRVKXCTJSA-N Ser-Cys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WKLJLEXEENIYQE-SRVKXCTJSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 1
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- 229930006000 Sucrose Natural products 0.000 description 1
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- CTONFVDJYCAMQM-IUKAMOBKSA-N Thr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H]([C@@H](C)O)N CTONFVDJYCAMQM-IUKAMOBKSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- APIQKJYZDWVOCE-VEVYYDQMSA-N Thr-Asp-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O APIQKJYZDWVOCE-VEVYYDQMSA-N 0.000 description 1
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 1
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- FKIGTIXHSRNKJU-IXOXFDKPSA-N Thr-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CN=CN1 FKIGTIXHSRNKJU-IXOXFDKPSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- JRAUIKJSEAKTGD-TUBUOCAGSA-N Thr-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N JRAUIKJSEAKTGD-TUBUOCAGSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 1
- 108010089879 Type I Fatty Acid Synthase Proteins 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- FJBCEFPCVPHPPM-STECZYCISA-N Tyr-Ile-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O FJBCEFPCVPHPPM-STECZYCISA-N 0.000 description 1
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 1
- HNERGSKJJZQGEA-JYJNAYRXSA-N Tyr-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HNERGSKJJZQGEA-JYJNAYRXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 1
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 1
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 1
- 101710110895 Uncharacterized 7.3 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- HWNYVQMOLCYHEA-IHRRRGAJSA-N Val-Ser-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N HWNYVQMOLCYHEA-IHRRRGAJSA-N 0.000 description 1
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 1
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108010055956 beta-ketoacyl-acyl carrier protein synthase I Proteins 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 238000000432 density-gradient centrifugation Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000012467 final product Substances 0.000 description 1
- 235000019688 fish Nutrition 0.000 description 1
- 238000010230 functional analysis Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 235000013402 health food Nutrition 0.000 description 1
- 210000001990 heterocyst Anatomy 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010028295 histidylhistidine Proteins 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 235000020640 mackerel Nutrition 0.000 description 1
- LTYOQGRJFJAKNA-IJCONWDESA-N malonyl-coenzyme a Chemical compound O[C@@H]1[C@@H](OP(O)(O)=O)[C@H](CO[P@](O)(=O)O[P@@](O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-IJCONWDESA-N 0.000 description 1
- 239000006325 marine broth Substances 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 229920001778 nylon Polymers 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 150000003881 polyketide derivatives Chemical class 0.000 description 1
- 125000000830 polyketide group Chemical group 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 239000005720 sucrose Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 108091035539 telomere Proteins 0.000 description 1
- 102000055501 telomere Human genes 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 201000008827 tuberculosis Diseases 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P20/00—Technologies relating to chemical industry
- Y02P20/50—Improvements relating to the production of bulk chemicals
- Y02P20/52—Improvements relating to the production of bulk chemicals using catalysts, e.g. selective catalysts
Landscapes
- Enzymes And Modification Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
【発明の詳細な説明】DETAILED DESCRIPTION OF THE INVENTION
【0001】[0001]
【発明の属する技術分野】本発明は、新規な高度不飽和
脂肪酸合成酵素をコードする遺伝子群、より詳しくはド
コサヘキサエン酸(docosahexaenoic acid、以下「DH
A」という。)生産細菌由来のイコサペンタエン酸(ei
cosapentaenoic acid、以下「EPA」という。)合成
酵素群遺伝子と顕著に類似性のある新規な遺伝子に関す
る。TECHNICAL FIELD The present invention relates to a group of genes encoding a novel highly unsaturated fatty acid synthase, more specifically, docosahexaenoic acid (hereinafter referred to as "DH").
A ". ) Eicosapentaenoic acid derived from producing bacteria (ei
cosapentaenoic acid, hereinafter referred to as "EPA". 2.) Novel genes with significant similarity to the synthase family genes.
【0002】[0002]
【従来の技術】DHAは、高度不飽和脂肪酸の一種であ
り、近年、コレステロール低下作用、抗血液凝固作用、
学習機能向上作用など多彩な生理作用が報告されてい
る。このような多彩な生理作用に着目し、わが国におい
てもDHAを多く含有するイワシ、サバ、サンマ等の青
背魚の摂食が推奨されている。二重結合を6個有する炭
素数22の直鎖の高度不飽和脂肪酸であるDHAは、そ
の化学構造から明らかなように、化学合成することは極
めて困難である。今日、健康食品として市販されている
DHAは、そのほとんどが煮取法によって得られた魚油
の分別物である。原料が魚油の場合には、いずれの精製
方法を採用しても、高純度のDHAエステルを高収率で
回収することは困難である。2. Description of the Related Art DHA is a kind of polyunsaturated fatty acid.
Various physiological actions such as learning function improving action have been reported. Paying attention to such various physiological actions, in Japan, it is recommended to feed blue sardines such as sardines, mackerel, saury, etc., which contain a large amount of DHA. It is extremely difficult to chemically synthesize DHA, a straight-chain highly unsaturated fatty acid having 22 carbon atoms having 6 double bonds, as is clear from its chemical structure. Today, most of the DHA marketed as a health food is a fraction of fish oil obtained by the boiling method. When the raw material is fish oil, it is difficult to recover a high-purity DHA ester in a high yield by any of the purification methods.
【0003】一方、最近、不完全な精製・濃縮では、魚
臭が残るなどの欠点を有した魚油からの抽出法を改善す
ることを目的として、菌類、微細藻類などに選択的にD
HAを産生させる検討が行なわれてきた。しかしなが
ら、DHAを含有した微生物や藻類を原料とした場合で
も、これから高純度のDHAエステルを高収率で回収し
たという報告例は見当たらない。[0003] On the other hand, recently, incomplete purification / concentration, selectively improving fungi and microalgae, etc., for the purpose of improving the extraction method from fish oil, which has a drawback such as remaining fish odor.
Studies have been conducted to produce HA. However, even when a microorganism or algae containing DHA is used as a raw material, there is no report that a high-purity DHA ester was recovered in a high yield.
【0004】ある種の海洋性細菌がDHA等の高度不飽
和脂肪酸を生産することは古くから報告されている。本
発明者らは、培養時間が短く、培養制御が容易であり、
遺伝子の取得も容易な細菌を利用したDHA含有脂質の
生産法を見出す目的で研究を開始した。これまでEPA
生合成酵素群遺伝子に関しては、特開平6-46864
号、特開平8-242867号がある。It has long been reported that certain marine bacteria produce polyunsaturated fatty acids such as DHA. The present inventors have a short culture time, easy culture control,
Research has begun with the aim of finding a method for producing DHA-containing lipids using bacteria that can easily obtain genes. Until now EPA
Regarding biosynthetic enzyme group genes, see JP-A-6-46864.
And JP-A-8-242867.
【0005】しかしながら、DHA合成に関与している
生合成酵素群及びそれをコードする遺伝子に関する遺伝
子は、未だ単離・解析されておらず、該遺伝子の解明が
当業界で待ち望まれている。[0005] However, genes related to biosynthetic enzymes involved in DHA synthesis and genes encoding the same have not yet been isolated and analyzed, and elucidation of these genes has been awaited in the art.
【0006】[0006]
【発明が解決しようとする課題】本発明の目的は、DH
A生産細菌に由来する新規な高度不飽和脂肪酸合成酵素
遺伝子、特にDHAの合成に関与する遺伝子を単離し、
その遺伝子DNAを提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a DH
A novel highly unsaturated fatty acid synthase gene derived from an A-producing bacterium, particularly a gene involved in the synthesis of DHA,
It is to provide the gene DNA.
【0007】[0007]
【課題を解決するための手段】本発明者らは、上記目的
より鋭意研究を重ねた結果、DHA生産細菌の染色体D
NAの一部を単離し、その塩基配列によりコードされる
複数種のアミノ酸配列中においてEPA合成酵素群のア
ミノ酸配列と顕著に類似性のあるものを検索することに
より、従来知られていないDHA合成に関与する酵素タ
ンパク質及びその遺伝子DNAを見出し、本発明を完成
するに至った。Means for Solving the Problems The present inventors have conducted intensive studies for the above purpose and found that the chromosome D
By isolating a part of NA and searching for amino acid sequences remarkably similar to the amino acid sequence of the EPA synthetase group in a plurality of amino acid sequences encoded by the base sequence, DHA synthesis which has not been known before The present inventors have found an enzyme protein involved in the above and its gene DNA, and have completed the present invention.
【0008】すなわち、本発明は、ドコサヘキサエン酸
を生産する能力を有する細菌由来の、イコサペンタエン
酸生合成酵素群類似タンパク質群をコードするDNAを
提供する。前記細菌は、モリテラ属(Moritella)に属
するものであることが好ましい。前記DNAとしては、
(i)配列番号3で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されていて
もよいアミノ酸配列を含み、かつ、ドコサヘキサエン酸
生合成酵素群のメンバーとして機能し得るタンパク質を
コードする塩基配列、(ii)配列番号5で表わされるア
ミノ酸配列において1以上のアミノ酸が置換、欠失、付
加又は挿入されていてもよいアミノ酸配列を含み、か
つ、ドコサヘキサエン酸生合成酵素群のメンバーとして
機能し得るタンパク質をコードする塩基配列、(iii)
配列番号7で表わされるアミノ酸配列において1以上の
アミノ酸が置換、欠失、付加又は挿入されていてもよい
アミノ酸配列を含み、かつ、ドコサヘキサエン酸生合成
酵素群のメンバーとして機能し得るタンパク質をコード
する塩基配列、及び(iv)配列番号9で表わされるアミ
ノ酸配列において1以上のアミノ酸が置換、欠失、付加
又は挿入されていてもよいアミノ酸配列を含み、かつ、
ドコサヘキサエン酸生合成酵素群のメンバーとして機能
し得るタンパク質をコードする塩基配列を含むものであ
ることが好ましく、例えば、配列番号1で表わされる塩
基配列を含むものが挙げられる。That is, the present invention provides a DNA derived from a bacterium having an ability to produce docosahexaenoic acid and encoding a group of proteins similar to the group of icosapentaenoic acid biosynthetic enzymes. Preferably, the bacterium belongs to the genus Moritella . As the DNA,
(I) 1 in the amino acid sequence represented by SEQ ID NO: 3
A base sequence encoding a protein containing the amino acid sequence in which the above amino acids may be substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group; (ii) SEQ ID NO: 5 A nucleotide sequence that includes an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented, and encodes a protein that can function as a member of the docosahexaenoic acid biosynthetic enzyme group; iii)
Encodes a protein that includes an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 7, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group A base sequence, and (iv) an amino acid sequence represented by SEQ ID NO: 9 in which one or more amino acids may be substituted, deleted, added or inserted, and
It preferably contains a base sequence encoding a protein capable of functioning as a member of the docosahexaenoic acid biosynthetic enzyme group, and includes, for example, a base sequence represented by SEQ ID NO: 1.
【0009】さらに、本発明は、以下の(1)又は
(2)に示されるタンパク質を提供する。 (1)配列番号3で表わされるアミノ酸配列を含むタン
パク質。 (2)配列番号3で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。Further, the present invention provides a protein represented by the following (1) or (2). (1) A protein comprising the amino acid sequence represented by SEQ ID NO: 3. (2) In the amino acid sequence represented by SEQ ID NO: 3, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
【0010】さらに、本発明は、以下の(1)又は
(2)に示されるタンパク質をコードするDNAを提供
する。 (1)配列番号3で表わされるアミノ酸配列を含むタン
パク質。 (2)配列番号3で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。このような
DNAとしては、例えば、配列番号2で表わされる塩基
配列を含むものが挙げられる。Further, the present invention provides a DNA encoding a protein represented by the following (1) or (2). (1) A protein comprising the amino acid sequence represented by SEQ ID NO: 3. (2) In the amino acid sequence represented by SEQ ID NO: 3, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group. Examples of such DNA include those containing the base sequence represented by SEQ ID NO: 2.
【0011】さらに、本発明は、以下の(3)又は
(4)に示されるタンパク質を提供する。 (3)配列番号5で表わされるアミノ酸配列を含むタン
パク質。 (4)配列番号5で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。Further, the present invention provides a protein represented by the following (3) or (4). (3) A protein comprising the amino acid sequence represented by SEQ ID NO: 5. (4) 1 in the amino acid sequence represented by SEQ ID NO: 5
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
【0012】さらに、本発明は、以下の(3)又は
(4)に示されるタンパク質をコードするDNAを提供
する。 (3)配列番号5で表わされるアミノ酸配列を含むタン
パク質。 (4)配列番号5で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。このような
DNAとしては、例えば、配列番号4で表わされる塩基
配列を含むものが挙げられる。Further, the present invention provides a DNA encoding a protein represented by the following (3) or (4). (3) A protein comprising the amino acid sequence represented by SEQ ID NO: 5. (4) 1 in the amino acid sequence represented by SEQ ID NO: 5
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group. Examples of such a DNA include a DNA containing the base sequence represented by SEQ ID NO: 4.
【0013】さらに、本発明は、以下の(5)又は
(6)に示されるタンパク質を提供する。 (5)配列番号7で表わされるアミノ酸配列を含むタン
パク質。 (6)配列番号7で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。Further, the present invention provides a protein represented by the following (5) or (6). (5) A protein comprising the amino acid sequence represented by SEQ ID NO: 7. (6) In the amino acid sequence represented by SEQ ID NO: 7, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
【0014】さらに、本発明は、以下の(5)又は
(6)に示されるタンパク質をコードするDNAを提供
する。 (5)配列番号7で表わされるアミノ酸配列を含むタン
パク質。 (6)配列番号7で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。このような
DNAとしては、例えば、配列番号6で表わされる塩基
配列を含むものが挙げられる。Further, the present invention provides a DNA encoding a protein represented by the following (5) or (6). (5) A protein comprising the amino acid sequence represented by SEQ ID NO: 7. (6) In the amino acid sequence represented by SEQ ID NO: 7, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group. Such DNA includes, for example, those containing the base sequence represented by SEQ ID NO: 6.
【0015】さらに、本発明は、以下の(7)又は
(8)に示されるタンパク質を提供する。 (7)配列番号9で表わされるアミノ酸配列を含むタン
パク質。 (8)配列番号9で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。Further, the present invention provides a protein represented by the following (7) or (8). (7) a protein comprising the amino acid sequence represented by SEQ ID NO: 9; (8) 1 in the amino acid sequence represented by SEQ ID NO: 9
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
【0016】さらに、本発明は、以下の(7)又は
(8)に示されるタンパク質をコードするDNAを提供
する。 (7)配列番号9で表わされるアミノ酸配列を含むタン
パク質。 (8)配列番号9で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。このような
DNAとしては、例えば、配列番号8で表わされる塩基
配列を含むものが挙げられる。Further, the present invention provides a DNA encoding a protein represented by the following (7) or (8). (7) a protein comprising the amino acid sequence represented by SEQ ID NO: 9; (8) 1 in the amino acid sequence represented by SEQ ID NO: 9
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group. Examples of such a DNA include a DNA containing the base sequence represented by SEQ ID NO: 8.
【0017】[0017]
【発明の実施の形態】以下、本発明を詳細に説明する。 1.遺伝子源 本発明において遺伝子源として利用できる生物は、特に
属、種あるいは株などを限定するものではなく、DHA
生産能を有する細菌であればいずれのものでも用いるこ
とができる。これらの微生物については、公的微生物寄
託機関で容易に入手することができる。このような微生
物としては、モリテラ属(Moritella)に属する細菌、
例えば、モリテラ・マリナ(Moritella marina)MP−
1株(ATCC15381)が挙げられる。BEST MODE FOR CARRYING OUT THE INVENTION Hereinafter, the present invention will be described in detail. 1. Gene source The organism that can be used as a gene source in the present invention is not particularly limited to any genus, species or strain.
Any bacteria can be used as long as they have productivity. These microorganisms can be easily obtained from a public microorganism depositary organization. Such microorganisms include bacteria belonging to the genus Moritella ,
For example, Moritella marina MP-
1 strain (ATCC15381).
【0018】2.DHA合成酵素遺伝子群のクローニン
グ 本発明においては、遺伝子源の例として、モリテラ・マ
リナ(Moritella marina)MP−1株(ATCC153
81)を用いる場合について具体的に説明する。しかし
ながら、前述の様に、種々のDHA生産細菌を同様にし
て遺伝子源として使用できる。2. Clonin of DHA synthase gene group
In the present invention, as an example of a gene source, Moritella marina MP-1 strain (ATCC153) is used.
81) is specifically described. However, as noted above, various DHA producing bacteria can be used as a gene source in a similar manner.
【0019】DHA生産細菌からDHA合成酵素遺伝子
群を単離するためには、脂肪酸合成酵素遺伝子、ポリケ
タイド合成酵素遺伝子、EPA合成酵素群遺伝子の既知
の塩基配列を利用する。例えば、シェワネラ(Shewanel
la)SCRC-2738株由来のEPA生合成酵素群遺
伝子中5番目のオープンリーディングフレーム(以下
「ORF」という。)にはβ-ケトアシル-[アシルキャ
リアプロテイン(以下「ACP」という。)]シンター
ゼ(以下「KAS」という。)とマロニルコエンザイム
A(以下「CoA」という。)-ACPトランスアシラ
ーゼ(以下「MCT」という。)のドメインが隣り合っ
て存在する部分がある。そこで大腸菌(Escherichia co
li)とマイコバクテリウム・チュバキュロシス(Mycoba
cterium tuberculosis)のKASによく保存されている
アミノ酸配列と、大腸菌とストレプトマイセス・コリカ
ラ(Streptomyces coelicolor)のMCTによく保存さ
れているアミノ酸配列からそれぞれオリゴヌクレオチド
を作製し、ゲノムPCR法によってモリテラ・マリナM
P−1株の該当する遺伝子部分を増幅し、プラスミドベ
クターにクローニング後、常法により塩基配列を決定
し、既知のEPA合成酵素群遺伝子と比較することによ
り、モリテラ・マリナMP−1株より目的の遺伝子の一
部が単離できたことを確認する。In order to isolate DHA synthase genes from DHA-producing bacteria, known base sequences of fatty acid synthase gene, polyketide synthase gene and EPA synthase gene are used. For example, Shewanel
la ) β-ketoacyl- [acyl carrier protein (hereinafter “ACP”)] synthase (hereinafter referred to as “ACP”) is contained in the fifth open reading frame (hereinafter referred to as “ORF”) in the EPA biosynthetic enzyme group gene derived from SCRC-2738 strain. Hereinafter, there is a portion where domains of malonyl coenzyme A (hereinafter, referred to as “CoA”) and ACP transacylase (hereinafter, referred to as “MCT”) are adjacent to each other. So Escherichia co
li ) and Mycobacterium tuberculosis ( Mycoba)
Oligonucleotides were prepared from the amino acid sequence well conserved in KAS of Cterium tuberculosis ) and the amino acid sequence well conserved in MCT of Escherichia coli and Streptomyces coelicolor, respectively. Marina M
After amplifying the corresponding gene portion of the P-1 strain, cloning it into a plasmid vector, determining the nucleotide sequence by a conventional method, and comparing with the known EPA synthase group genes, the objective was obtained from the Moritera marina MP-1 strain. Confirm that a part of the gene could be isolated.
【0020】次に、目的とするDHA合成酵素遺伝子群
は、複数の遺伝子から構成されていることが予想される
ことから、ライブラリーの作製には、コスミドベクター
を用いる。モリテラ・マリナMP−1株のゲノムDNA
を制限酵素で部分消化した後、サイズ分画し、コスミド
ベクターに組み込み、コスミドライブラリーを構築す
る。 PCR法によって得られた目的とする酵素遺伝子
の一部のDNAを[α−32P] dCTPで標識し、それ
をプローブとしてコロニーハイブリダイゼーションを行
い、ポジティブクローンを得る。このポジティブクロー
ンのゲノムDNAについて各種制限酵素によるマッピン
グを行い、更に全塩基配列を決定することにより、目的
とする酵素遺伝子の全体を含んでいるかどうかを確認す
る。Next, since the target DHA synthase gene group is expected to be composed of a plurality of genes, a cosmid vector is used for preparing the library. Genomic DNA of Moritera marina MP-1 strain
Is partially digested with a restriction enzyme, size-fractionated and incorporated into a cosmid vector to construct a cosmid library. Partial DNA of the target enzyme gene obtained by the PCR method is labeled with [α- 32 P] dCTP, and colony hybridization is performed using the labeled DNA as a probe to obtain a positive clone. The genomic DNA of this positive clone is mapped with various restriction enzymes, and the entire base sequence is determined to confirm whether or not the gene contains the entire target enzyme gene.
【0021】以上の操作により、DHA合成酵素遺伝子
群のクローニング及び配列決定を行うことができる。D
HA合成酵素遺伝子群の塩基配列としては、例えば、モ
リテラ・マリナ(Moritella marina)MP−1株(AT
CC15381)のゲノム中に含まれる、配列番号1で
表わされる塩基配列が挙げられる。By the above operation, cloning and sequencing of the DHA synthase gene group can be performed. D
Examples of the base sequence of the HA synthase gene group include, for example, Moritella marina MP-1 strain (AT
CC15381) contained in the genome of SEQ ID NO: 1.
【0022】本遺伝子群の一部を用いて宿主生物体の脂
肪酸組成を変えることも可能である。公知の方法によっ
て得られるこの遺伝子の不要な部分を除いた小型化した
遺伝子群も本発明に含まれる。It is also possible to alter the fatty acid composition of the host organism using a part of this gene group. The present invention also includes a miniaturized gene group obtained by removing unnecessary portions of the gene obtained by a known method.
【0023】3.塩基配列の解析 配列番号1で表わされる塩基配列中には、図3に示すよ
うに、22個のORFが存在する(5'側から3'側に向け
て、それぞれORF1〜22とする)。このうち、OR
F8〜11の推定アミノ酸配列(それぞれ、配列番号
3、5、7及び9に示す)は、それぞれ公知のEPA合
成酵素群遺伝子(DDBJ/EMBL/GenBank登録番号;U73
935)のORF5〜8によりコードされるアミノ酸配
列と、アミノ酸残基数及び配列において類似している。3. Analysis of Nucleotide Sequence As shown in FIG. 3, there are 22 ORFs in the nucleotide sequence represented by SEQ ID NO: 1 (ORFs 1 to 22 from the 5 ′ side to the 3 ′ side, respectively). Of these, OR
The deduced amino acid sequences of F8-11 (shown in SEQ ID NOs: 3, 5, 7 and 9, respectively) are known EPA synthase group genes (DDBJ / EMBL / GenBank accession numbers; U73
935) is similar to the amino acid sequence encoded by ORFs 5 to 8 in the number and sequence of amino acid residues.
【0024】さらに、ORF8には、β-ケトアシル-AC
PシンターゼII、マロニルCoA-ACPトランスアシラーゼ、
アシルキャリアープロテイン(ACP)及びβ-ケトアシル
-ACPリダクターゼと顕著な類似性を有するドメインが存
在する。ORF9には、β-ケトアシル-ACPシンターゼI
I及びマロニルCoA-ACPトランスアシラーゼと顕著な類似
性を有するドメインが存在する。ORF10には、β-
ケトアシル-ACPシンターゼII及び3-ヒドロキシデカノ
イル-ACPデヒドラーゼと顕著な類似性を有するドメイン
が存在する。このようなドメイン構造は、ポリケタイド
合成酵素(以下「PKS」という。)、I型脂肪酸合成
酵素(以下「FAS I」という。)、ヘテロシスト糖
脂質合成酵素(以下「Hgl」という。)等の、長鎖不
飽和化合物の生合成に関与する多機能酵素のドメイン構
造とよく似ている。以上の結果から、上記ORF8〜1
1によりコードされるタンパク質は、DHA合成酵素群
のメンバーであると考えられる。Further, ORF8 contains β-ketoacyl-AC
P synthase II, malonyl CoA-ACP transacylase,
Acyl carrier protein (ACP) and β-ketoacyl
-There are domains with significant similarity to ACP reductase. ORF9 contains β-ketoacyl-ACP synthase I
There are domains with significant similarity to I and malonyl CoA-ACP transacylase. ORF10 contains β-
There are domains with significant similarity to ketoacyl-ACP synthase II and 3-hydroxydecanoyl-ACP dehydrase. Such domain structures include polyketide synthase (hereinafter referred to as “PKS”), type I fatty acid synthase (hereinafter referred to as “FAS I”), and heterocyst glycolipid synthase (hereinafter referred to as “Hgl”). It closely resembles the domain structure of multifunctional enzymes involved in the biosynthesis of long-chain unsaturated compounds. From the above results, the above ORFs 8 to 1
The protein encoded by 1 is considered to be a member of the DHA synthase group.
【0025】4.本発明のDNA及びタンパク質 本発明のDNAは、DHA生産能を有する細菌由来の、
イコサペンタエン酸生合成酵素群類似タンパク質群をコ
ードするDNAである。このようなDNAとしては、上
記ORF8〜11によりコードされるアミノ酸配列(そ
れぞれ、配列番号3、5、7及び9に示す)において1
以上のアミノ酸が置換、欠失、付加又は挿入されていて
もよいアミノ酸配列を含み、かつ、DHA合成酵素群の
メンバーとして機能し得るタンパク質をコードする4種
の塩基配列を含むものであることが好ましく、例えば、
配列番号1で表わされる塩基配列を含むものが挙げられ
る。しかし、本発明のDNAはこのようなものに限定さ
れるものではなく、上記「1.遺伝子源」及び「2.D
HA合成酵素遺伝子群のクローニング」の項で説明した
方法により、他の塩基配列を含むものを得ることもでき
る。4. DNA and protein of the present invention The DNA of the present invention is derived from a bacterium having DHA-producing ability.
DNA encoding an icosapentaenoic acid biosynthetic enzyme group-like protein group. Such DNA includes 1 in the amino acid sequence encoded by ORFs 8 to 11 (shown in SEQ ID NOS: 3, 5, 7, and 9, respectively).
It is preferable that the above amino acids include an amino acid sequence that may be substituted, deleted, added or inserted, and include four base sequences encoding a protein that can function as a member of the DHA synthase group, For example,
One containing the base sequence represented by SEQ ID NO: 1 is exemplified. However, the DNA of the present invention is not limited to such DNA, and the above-mentioned “1. Gene source” and “2.
Cloning of HA synthase gene group "can also be used to obtain those containing other nucleotide sequences.
【0026】本発明のタンパク質は、配列番号1で表わ
される塩基配列中に含まれる上記ORF8〜11により
コードされるアミノ酸配列を含むタンパク質である。こ
れらのアミノ酸配列は、配列番号3(ORF8)、配列
番号5(ORF9)、配列番号7(ORF10)及び配
列番号9(ORF11)に示されるものである。このよ
うなタンパク質はいずれも、DHA合成酵素群のメンバ
ーとして機能し得る。The protein of the present invention is a protein comprising the amino acid sequence encoded by the above ORFs 8 to 11 contained in the nucleotide sequence represented by SEQ ID NO: 1. These amino acid sequences are shown in SEQ ID NO: 3 (ORF8), SEQ ID NO: 5 (ORF9), SEQ ID NO: 7 (ORF10) and SEQ ID NO: 9 (ORF11). Any such protein can function as a member of the DHA synthase family.
【0027】ただし、本発明のタンパク質のアミノ酸配
列は上記のアミノ酸配列に限定されるものではなく、D
HA合成酵素群のメンバーとして機能する限り、各アミ
ノ酸配列において1以上のアミノ酸の置換、欠失、付
加、挿入等の変異が生じていてもよい。このような変異
が生じてもよいアミノ酸残基の数は特に限定されるもの
ではないが、好ましくは1個〜数個である。また、本発
明のタンパク質は、上記ORF8〜11の各塩基配列を
有するDNAとストリンジェントな条件下でハイブリダ
イズするDNAによりコードされるアミノ酸配列を含
み、かつ、DHA合成酵素群のメンバーとして機能し得
るタンパク質も包含する。この場合において、「ストリ
ンジェントな条件」としては、例えば、5×SSC、10
×Denhaldt's溶液、0.1%SDS、100μg/mlサケ精子
DNAを含むハイブリダイゼーションバッファー中で、
ハイブリダイズさせる温度が65℃という条件が挙げられ
る。例えば、酵素の安定性や活性を高めるために、公知
の手法を用いて遺伝子の一部塩基配列を変更することに
より、翻訳されるアミノ酸配列を変更することも可能で
ある。However, the amino acid sequence of the protein of the present invention is not limited to the above-mentioned amino acid sequence.
As long as it functions as a member of the HA synthase group, mutations such as substitution, deletion, addition, and insertion of one or more amino acids in each amino acid sequence may occur. The number of amino acid residues where such a mutation may occur is not particularly limited, but is preferably one to several. In addition, the protein of the present invention contains an amino acid sequence encoded by DNA that hybridizes under stringent conditions with DNA having each of the base sequences of ORFs 8 to 11, and functions as a member of the DHA synthase group. The resulting protein is also included. In this case, “stringent conditions” include, for example, 5 × SSC, 10
× in a hybridization buffer containing Denhaldt's solution, 0.1% SDS, 100 μg / ml salmon sperm DNA,
A condition in which the hybridization temperature is 65 ° C. is exemplified. For example, in order to enhance the stability and activity of the enzyme, it is possible to change the amino acid sequence to be translated by changing a partial base sequence of the gene using a known method.
【0028】さらに、本発明のDNAは、配列番号1で
表わされる塩基配列を含むDNAの一部である上記OR
F8〜11をも包含する。これらの塩基配列としては、
例えば、配列番号2(ORF8)、配列番号4(ORF
9)、配列番号6(ORF10)、配列番号8(ORF
11)で表わされるものが挙げられるが、これらに限定
されるものではなく、上記のような本発明のタンパク質
をコードするものであれば本発明のDNAに含まれる。Further, the DNA of the present invention comprises the above OR which is a part of the DNA containing the base sequence represented by SEQ ID NO: 1.
F8-11 are also included. As these base sequences,
For example, SEQ ID NO: 2 (ORF8), SEQ ID NO: 4 (ORF8)
9), SEQ ID NO: 6 (ORF10), SEQ ID NO: 8 (ORF10)
Examples of the present invention include, but are not limited to, those represented by 11), and any DNA encoding the protein of the present invention as described above is included in the DNA of the present invention.
【0029】なお、DNAに変異を導入するには、Kunk
el法、Gapped duplex法等の公知の手法又はこれに準ず
る方法を採用することができる。例えば、部位特異的突
然変異誘発法を利用した、Mutant-K、Mutant-G(TaKaRa
社製)等の変異導入用キット、又はLA PCR in vitro Mu
tagenesisシリーズキット(TaKaRa社製)を用いて変異
を導入することができる。To introduce a mutation into DNA, use Kunk
Known methods such as the el method and the gapped duplex method or methods similar thereto can be employed. For example, Mutant-K, Mutant-G (TaKaRa) using site-directed mutagenesis
Mutagenesis kit such as LA PCR in vitro Mu
Mutations can be introduced using a tagenesis series kit (TaKaRa).
【0030】一旦本発明のDNAの塩基配列が決定され
ると、その後は化学合成反応等の化学的方法によって該
塩基配列を含むDNAを調製することができ、さらに
は、該塩基配列の全部又は一部を有するDNA断片をプ
ローブとして用いるハイブリダイゼーション法、該塩基
配列の一部を有するDNA断片をプライマーとして用い
るPCR法等の生物化学的方法によって、DHA生産能
を有する細菌のゲノムDNA等から、本発明のDNAを
得ることができる。このような化学合成反応、ハイブリ
ダイゼーション法及びPCR法は、当業者に公知の方法
によって行うことができる。Once the nucleotide sequence of the DNA of the present invention has been determined, a DNA containing the nucleotide sequence can be prepared by a chemical method such as a chemical synthesis reaction. A hybridization method using a DNA fragment having a part as a probe, a biochemical method such as a PCR method using a DNA fragment having a part of the base sequence as a primer, from a genomic DNA of a bacterium having a DHA-producing ability, The DNA of the present invention can be obtained. Such a chemical synthesis reaction, a hybridization method, and a PCR method can be performed by methods known to those skilled in the art.
【0031】上記のようにして取得される本発明のDN
AがDHA合成酵素群遺伝子としての機能を有するか否
か、すなわち、上記のようにして取得される、本発明の
各タンパク質がDHA合成酵素群のメンバーとしての機
能を有するか否かは、以下のような手法により確認する
ことができる。The DN of the present invention obtained as described above
Whether A has a function as a DHA synthase group gene, that is, whether each protein of the present invention obtained as described above has a function as a member of the DHA synthase group, It can be confirmed by such a method as follows.
【0032】すなわち、上記DNAを公知の発現ベクタ
ー中に組み込み、得られるベクターを適当な宿主に導入
して形質転換体を作製し、次いで該形質転換体を培養す
ることによりタンパク質を発現させ、得られるタンパク
質の機能を解析する。ここで、発現ベクターとしては、
例えば、pBluescript II(Stratagene社製)等を用いる
ことができる。また、宿主としては、大腸菌TOP10F'株
(Invitrogen社製)が適当であり、その場合の培養条件
は、培養温度20℃以下とするとよい。タンパク質の機能
解析は、培養した形質転換体の全脂肪酸組成をガスクロ
マトグラフィーによって分析し、DHAの存在を確認す
ることにより行うことができる。That is, the above DNA is incorporated into a known expression vector, the resulting vector is introduced into an appropriate host to prepare a transformant, and then the transformant is cultured to express a protein. Analyze the function of the protein obtained. Here, as the expression vector,
For example, pBluescript II (manufactured by Stratagene) or the like can be used. In addition, Escherichia coli TOP10F '(manufactured by Invitrogen) is suitable as a host, and the culture conditions in this case may be a culture temperature of 20 ° C or lower. Protein function analysis can be performed by analyzing the total fatty acid composition of the cultured transformant by gas chromatography and confirming the presence of DHA.
【0033】あるいは、本発明のDNAの全部又は一部
に外来遺伝子を挿入し、遺伝子源として用いたDHA生
産細菌内で増殖(複製)することのできるベクターに組
み込み、得られるベクターを該DHA生産細菌に導入
し、相同組換えによって外来遺伝子挿入部分とゲノム上
の遺伝子を置き換えた各ORFの挿入変異株を作製する
ことによって、該ORFの機能を解析する。ここで、外
来遺伝子としては、抗生物質耐性遺伝子を用いることが
できる。遺伝子の機能解析は、上記挿入変異株の全脂肪
酸をガスクロマトグラフィーで分析することにより行う
ことができる。すなわち、挿入変異によりDHAが消失
することが確認されれば、該ORFはDHA合成に必須
であることが確認できる。さらに、新たな脂肪酸が確認
されれば、各酵素タンパク質の機能を推定することがで
きる。Alternatively, a foreign gene is inserted into all or a part of the DNA of the present invention, incorporated into a vector capable of growing (replicating) in a DHA-producing bacterium used as a gene source, and the resulting vector is subjected to the DHA-producing process. The function of each ORF is analyzed by introducing into a bacterium and producing an insertion mutant of each ORF in which the gene on the genome is replaced with the foreign gene insertion portion by homologous recombination. Here, an antibiotic resistance gene can be used as the foreign gene. The functional analysis of the gene can be performed by analyzing all the fatty acids of the inserted mutant strain by gas chromatography. That is, if it is confirmed that DHA disappears due to the insertion mutation, it can be confirmed that the ORF is essential for DHA synthesis. Furthermore, if a new fatty acid is confirmed, the function of each enzyme protein can be estimated.
【0034】上述の如く、本発明により提供される遺伝
子は、公知のEPA合成酵素群遺伝子と厳密な類似性を
有し、その類似性ある遺伝子の情報に基づくDHA合成
酵素遺伝子群の解析とそれら解析された遺伝子の機能及
び解析についての研究に利用でき、DHA生産性の向上
を目指す研究や、DHA生合成能を持たない生物へのD
HA生合成能の賦与への応用研究に用いることが可能で
ある。さらに、本発明により提供される遺伝子を利用す
ることにより、高度不飽和脂肪酸合成細菌から、高度不
飽和脂肪酸合成に関与する遺伝子の検出に有用なプライ
マー及び/又はプローブを提供することも可能である。As described above, the genes provided by the present invention have strict similarity to known EPA synthase genes, and the analysis of DHA synthase genes based on information of the similar genes and their analysis. It can be used for research on the function and analysis of the analyzed genes, and studies aimed at improving DHA productivity,
It can be used for application research to impart HA biosynthesis ability. Furthermore, by using the gene provided by the present invention, it is also possible to provide a primer and / or a probe useful for detection of a gene involved in the synthesis of a highly unsaturated fatty acid from a highly unsaturated fatty acid synthesizing bacterium. .
【0035】[0035]
【実施例】以下の実施例により、本発明を理解するため
にさらに詳細に説明する。これらの実施例は本発明の技
術範囲を限定するものではない。 〔実施例1〕供試菌種、培養条件とDNAの単離方法 モリテラ・マリナMP−1株(Moritella marina strai
n MP-1、ATCC15381)は、American Type Cult
ure Collectionより購入した。モリテラ・マリナMP−
1株は、Difco社製マリンブロス培地(タイプ22
16)50mlを500mlのフラスコ内で200rp
mの速度で旋回して、10℃で培養した。培養液を遠心
処理(15000×g, 20分)し、沈殿した細菌を回
収した。モリテラ・マリナMP‐1株のゲノムDNA
は、ニッポンジーン社製イソプラントDNA抽出キット
を用いて、該キットに付帯のプロトコールに従って抽出
した。The following examples are provided to further illustrate the present invention. These examples do not limit the technical scope of the present invention. [Example 1] Test bacterial species, culture conditions and DNA isolation method Moritella marina strain MP-1
n MP-1, ATCC15381) is American Type Cult
Purchased from ure Collection. Moritera Marina MP-
One strain is a marine broth medium (Type 22) manufactured by Difco.
16) 50 ml in a 500 ml flask at 200 rpm
The culture was performed at 10 ° C. while swirling at a speed of m. The culture was centrifuged (15000 × g, 20 minutes), and the precipitated bacteria were collected. Genomic DNA of Moritera Marina MP-1 strain
Was extracted using an isoplant DNA extraction kit manufactured by Nippon Gene according to a protocol attached to the kit.
【0036】〔実施例2〕PCR及びPCR産物のサブ
クローニング 矢澤の報告(DDBJ/EMBL/GenBank塩基配列データベース
登録番号;U73935)によると、シェワネラ(Shew
anella)SCRC-2738株由来のEPA生合成酵素
群遺伝子中5番目のORFにはKASとMCTのドメイ
ンが隣り合って存在する部分がある。そこで、上記KA
Sドメインと、大腸菌(Escherichia coli)及びマイコ
バクテリウム・チュバキュロシス(Mycobacterium tube
rculosis)のKASドメインとの間でよく保存されてい
るアミノ酸配列からセンスプライマーを、また、上記M
CTドメインと、大腸菌及びストレプトマイセス・コリ
カラ(Streptomyces coelicolor)のMCTドメインと
の間でよく保存されているアミノ酸配列からアンチセン
スプライマーを作製し(図1、矢印はプライマーの位置
を示す)、ゲノムPCRを行った。このとき、ゲノムP
CRに用いたプライマーは、下記表1の通りである。[0036] Example 2 PCR and PCR product subcloned Yazawa report (DDBJ / EMBL / GenBank nucleotide sequence database accession numbers; U73935) According to the Shewanella (Shew
anella ) In the fifth ORF of the EPA biosynthetic enzyme group gene derived from SCRC-2738 strain, there is a portion where domains of KAS and MCT are adjacent to each other. Therefore, the above KA
And the S domain, E. coli (Escherichia coli) and Mycobacterium tuberculosis (Mycobacterium tube
rculosis ) from the amino acid sequence that is well conserved with the KAS domain of
An antisense primer was prepared from the amino acid sequence well conserved between the CT domain and the MCT domain of Escherichia coli and Streptomyces coelicolor (FIG. 1, arrows indicate the positions of the primers), and the genome was prepared. PCR was performed. At this time, the genome P
The primers used for CR are as shown in Table 1 below.
【0037】[0037]
【表1】 [Table 1]
【0038】次いで、実施例1で得られたモリテラ・マ
リナMP−1株由来全DNAを鋳型とし、上記プライマ
ーを用いてPCR(ポリメラーゼ連鎖反応)を行った。
DNAポリメラーゼとしては、Taqポリメラーゼ(パー
キンエルマー社製)を用いた。すなわち、以下の表2に
示す組成を有するPCR溶液を調製し、サイクル反応を
行った。Next, PCR (polymerase chain reaction) was carried out using the above primers and the total DNA derived from Moritera marina MP-1 strain obtained in Example 1 as a template.
Taq polymerase (manufactured by PerkinElmer) was used as the DNA polymerase. That is, a PCR solution having the composition shown in Table 2 below was prepared, and a cycle reaction was performed.
【0039】[0039]
【表2】 [Table 2]
【0040】上記サイクル反応の温度条件は、94℃4
分を1回、94℃1分、45℃2分、72℃3分の繰り
返しを30回、最後に72℃10分を1回とした。上記
PCRの結果、図3に示したような710bpのKAS
/MCT断片が増幅された。この断片の塩基配列を常法
に従って決定した。シーケンスデータのデータベース検
索をNational Center of Biochemical Informationのオ
ンラインBLASTを用いて行ったところ、予想される
アミノ酸配列でシェワネラSCRC-2738株由来の
EPA合成酵素群遺伝子の該当する部分と39%の同一
性を示した(図2)。The temperature condition of the above cycle reaction is 94 ° C.4
One minute, 94 ° C. for 1 minute, 45 ° C. for 2 minutes, and 72 ° C. for 3 minutes were repeated 30 times, and finally, 72 ° C. for 10 minutes was performed once. As a result of the PCR, a KAS of 710 bp as shown in FIG.
The / MCT fragment was amplified. The nucleotide sequence of this fragment was determined according to a conventional method. When a database search of sequence data was performed using online BLAST of the National Center of Biochemical Information, the predicted amino acid sequence showed 39% identity with the corresponding portion of the EPA synthase group gene derived from Shewanella SCRC-2738 strain. (FIG. 2).
【0041】〔実施例3〕コスミドライブラリーの構築
とスクリーニング モリテラ・マリナMP−1株から抽出したゲノムDNA
を制限酵素Sau3AIで部分消化した後、サイズ分画
し、制限酵素BamHIで消化して開環したニッポンジ
ーン社製コスミドベクター、ロリスト6につないでコス
ミドライブラリーを構築した。その手順を以下に示す。[Example 3] Construction and screening of cosmid library Genomic DNA extracted from Moritera marina MP-1 strain
Was partially digested with the restriction enzyme Sau3AI, size-fractionated, digested with the restriction enzyme BamHI, and opened to a cosmid vector manufactured by Nippon Gene Co., Ltd., Lorist 6, to construct a cosmid library. The procedure is shown below.
【0042】100μgのゲノムDNAを制限酵素Sa
u3AIで部分消化し、ショ糖密度勾配遠心によってサ
イズ分画した。およそ50kbpの断片が濃縮された分
画のDNA断片とロリスト6クローニングベクターを混
合し、T4DNAリガーゼにより連結した。この連結さ
れたDNA鎖を常法に従ってλファージにパッケージン
グし、大腸菌DH5α株に感染させた。得られたカナマ
イシン耐性コロニーを、カナマイシン(25μg/m
l)を含むLB寒天培地の上に置いたナイロンフィルタ
ー(Hybond N+、Amersham Pharmacia Biotech社製)に
移し、30℃で一晩培養した。常法に従い、フィルター
上に増殖したコロニーをアルカリ溶解させ、遊離したD
NAをフィルター上に固定した。このフィルターを、5
×SSC、10×Denhaldt's溶液、0.1%SDS、100μg
/mlサケ精子DNAを含むハイブリダイゼーションバッ
ファー中に浸し、65℃で一晩プレハイブリダイゼーショ
ンを行った。その後、放射線標識したプローブを加え
て、65℃で一晩ハイブリダイゼーションを行った。プロ
ーブとしては、ランダムプライマーDNAラベリングキ
ット(TaKaRa社製)と[α−32P]dCTP(Amersham P
harmacia Biotech社製)で標識した710bpのKAS
/MCT断片(図2)を用いた。ハイブリダイゼーショ
ンの後、該フィルターについて2×SSC、0.1%SD
S中、室温で10分間の洗浄を行い、さらに、1×SS
C、0.1%SDS中、65℃で30分間の洗浄を2回行っ
た。オートラジオグラフィーにより確認したところ、2
45個のコロニーのうち、2個の陽性コロニーが得られ
た。このうち1個は、およそ40kbpの挿入断片をも
っていた(このコスミドをp3D5と称する)。100 μg of genomic DNA was replaced with the restriction enzyme Sa.
It was partially digested with u3AI and size fractionated by sucrose density gradient centrifugation. The DNA fragment of the fraction enriched with the approximately 50 kbp fragment was mixed with the Loristo 6 cloning vector, and ligated with T4 DNA ligase. The ligated DNA chain was packaged into a λ phage according to a conventional method, and the phage was infected with Escherichia coli DH5α strain. The obtained kanamycin-resistant colonies were transformed with kanamycin (25 μg / m
The mixture was transferred to a nylon filter (Hybond N +, manufactured by Amersham Pharmacia Biotech) placed on an LB agar medium containing 1), and cultured at 30 ° C. overnight. The colonies grown on the filter were dissolved in alkali and the
NA was fixed on the filter. Change this filter to 5
× SSC, 10 × Denhaldt's solution, 0.1% SDS, 100 μg
/ Ml salmon sperm DNA, and prehybridized at 65 ° C overnight. Thereafter, a radiolabeled probe was added, and hybridization was carried out at 65 ° C. overnight. As a probe, a random primer DNA labeling kit (TaKaRa) and [α- 32 P] dCTP (Amersham P
710 bp KAS labeled with harmacia Biotech)
/ MCT fragment (FIG. 2) was used. After hybridization, the filter was 2 × SSC, 0.1% SD
After washing at room temperature for 10 minutes in S,
C, washing twice in 0.1% SDS at 65 ° C for 30 minutes. As confirmed by autoradiography, 2
Of the 45 colonies, two positive colonies were obtained. One of them had an insert of approximately 40 kbp (this cosmid is called p3D5).
【0043】〔実施例4〕コスミドp3D5の塩基配列
の解析 コスミドp3D5中のゲノムDNA挿入部を含むSau3AI
-Sau3AI断片の全塩基配列を配列番号1に示す。この塩
基配列中には、図3に示されるように、22個のORF
があると推定できた。図3において、予想されるORF
の大きさと方向は、矢印で示す。斜線が施されている矢
印は、シェワネラSCRC2738株のEPA合成酵素
群遺伝子と相同性が見られたことを示す。全塩基配列
(配列番号1)とORF8〜11(それぞれ配列番号
2、4、6及び8)の関係は表3の通りである。[Example 4] Analysis of base sequence of cosmid p3D5 Sau3AI containing genomic DNA insertion site in cosmid p3D5
The entire nucleotide sequence of the -Sau3AI fragment is shown in SEQ ID NO: 1. In this base sequence, as shown in FIG.
It was estimated that there was. In FIG. 3, the expected ORF
Are indicated by arrows. The hatched arrow indicates that homology was found to the EPA synthase group gene of Shewanella SCRC2738 strain. Table 3 shows the relationship between the entire base sequence (SEQ ID NO: 1) and ORFs 8 to 11 (SEQ ID NOs: 2, 4, 6, and 8, respectively).
【0044】[0044]
【表3】 [Table 3]
【0045】さらに、EPA合成酵素群遺伝子(DDBJ/E
MBL/GenBank登録番号;U73935)の全塩基配列と
これに含まれるORF5〜8との関係を、下記の表4に
示す。Further, the EPA synthase group gene (DDBJ / E
Table 4 below shows the relationship between the entire nucleotide sequence of MBL / GenBank accession number; U73935) and ORFs 5 to 8 contained therein.
【0046】[0046]
【表4】 [Table 4]
【0047】上記の表3と表4を比較したところ、OR
F5,6,7及び8の長さは、p3D5のORF8,
9,10及び11に対応するそれぞれのORFの長さと
ほぼ同じであった。p3D5のORF8、9、10及び
11の推定アミノ酸配列を、EPA合成酵素群遺伝子の
ORF5、6、7及び8によってコードされるアミノ酸
配列と比較した結果を、下記の表5に示す。When Tables 3 and 4 above were compared, OR
The lengths of F5, 6, 7 and 8 are ORF8, p3D5,
The length of each ORF corresponding to 9, 10, and 11 was almost the same. The results of comparing the deduced amino acid sequences of ORFs 8, 9, 10 and 11 of p3D5 with the amino acid sequences encoded by ORFs 5, 6, 7 and 8 of the EPA synthase group gene are shown in Table 5 below.
【0048】[0048]
【表5】 [Table 5]
【0049】表5からわかるように、p3D5のORF
8、9、10及び11の推定アミノ酸配列は、EPA合
成酵素群遺伝子のORF5、6、7及び8によりコード
されるアミノ酸配列とそれぞれよく類似していた。OR
F8、9、10の推定アミノ酸配列をデータベースと照
らし合わせると、脂肪酸合成の様々な反応にかかわる酵
素やタンパク質に類似した領域がいくつかみられた。そ
の結果を次の表6に示す。As can be seen from Table 5, the ORF of p3D5
The deduced amino acid sequences of 8, 9, 10 and 11 were very similar to the amino acid sequences encoded by ORFs 5, 6, 7 and 8 of the EPA synthase group gene, respectively. OR
When the deduced amino acid sequences of F8, 9, and 10 were compared with a database, several regions similar to enzymes and proteins involved in various reactions of fatty acid synthesis were found. The results are shown in Table 6 below.
【0050】[0050]
【表6】 [Table 6]
【0051】ORF8にはKAS、MCT、ACP、β
-ケトアシル-ACPリダクターゼ(以下「KAR」とい
う。)によく似た領域があった。このうちACP類似領
域は5回繰り返していた。ORF9にはKASとMCT
に類似した領域があった。ORF10には2ヶ所のKA
S領域と2ヶ所のヒドロキシデカノイル-ACPデヒド
ラーゼ(HDD)と類似した領域があった。ORF8 includes KAS, MCT, ACP, β
-There was a region very similar to ketoacyl-ACP reductase (hereinafter referred to as "KAR"). Of these, the ACP-like region was repeated five times. ORF9 has KAS and MCT
There was an area similar to ORF10 has two KA
There was an S region and two regions similar to hydroxydecanoyl-ACP dehydrase (HDD).
【0052】これらの領域はどれも、部分的にそれぞれ
の酵素、タンパク質の配列と似ているというより、酵
素、タンパク質の配列全体を含んでおり、それぞれのO
RFがいくつかの触媒部位をもった多機能タンパク質を
コードしていることを示唆するものである。このような
ORFのドメイン構造は、PKSやFAS Iと類似し
ている(Hopwood, D.A. (1997) Chem. Rev. 97: 2465-2
497.)。Each of these regions contains the entire sequence of the enzyme or protein, rather than being partially similar to the sequence of the respective enzyme or protein.
This suggests that RF encodes a multifunctional protein with several catalytic sites. The domain structure of such an ORF is similar to PKS or FAS I (Hopwood, DA (1997) Chem. Rev. 97: 2465-2).
497.).
【0053】さらに、ORF8のKAS、MCT及びK
ARの各ドメイン、ORF9のKAS及びMCTの各ド
メイン、並びにORF10のKASドメインは、種々微
生物PKSのそれぞれのドメインとよく似ている(相同
性の最高値はそれぞれ、35%、26%、26%、23
%、21%、32%)。ORF8のKAS、MCT及び
KARの各ドメイン、並びにORF10のKASドメイ
ンは、脊椎動物のFAS Iのそれぞれのドメインとよ
く似ている(相同性はそれぞれ、27%、25%、28
%、25%)。ORF8のKAS−MCTドメイン、O
RF9のKAS−MCTドメイン、及びORF10のK
ASドメインは、ノストック・パンクチフォルメ(Nost
oc punctiforme)(Campbell, E.L., et al. (1998) A
rch. Microbiol. 167: 251-258.)とアナベナ(Anabaen
a sp.(GenBank登録番号U13677))のHglのそれぞれ
のドメインとよく似ている(前者との相同性はそれぞ
れ、51%、21%、36%であり、後者との相同性は
それぞれ、30%、29%、47%である)。ORF1
1も枯草菌(Bacillus subtilis)のポリケタイド合成
に関与していると報告されているpksE(Kunst, F.,
et al. (1997) Nature. 390: 249-256.)とよく似てい
る(相同性46%)。Further, the KAS, MCT and K of ORF8
The AR domains, the KAS and MCT domains of ORF9, and the KAS domains of ORF10 are very similar to the respective domains of various microbial PKSs (the highest homology values are 35%, 26%, and 26%, respectively). , 23
%, 21%, 32%). The KAS, MCT and KAR domains of ORF8 and the KAS domain of ORF10 are very similar to the respective domains of vertebrate FAS I (homologies are 27%, 25% and 28%, respectively).
%, 25%). The KAS-MCT domain of ORF8, O
KAS-MCT domain of RF9 and K of ORF10
AS domain is Nostock puncture form ( Nost
oc punctiforme ) (Campbell, EL, et al. (1998) A
rch. Microbiol. 167: 251-258.) and Anabaen ( Anabaen
a sp. Each homology with the respective domains and are very similar (former Hgl of (GenBank Accession No. U13677)) is 51%, 21%, and 36%, respectively, the homology with the latter, 30 %, 29%, and 47%). ORF1
PksE (Kunst, F., et al.) Reported to be involved in polyketide synthesis of Bacillus subtilis .
et al. (1997) Nature. 390: 249-256.) (46% homology).
【0054】このように、これらのORFにコードされ
るタンパク質は、PKS、FASI、Hglのように多
機能酵素として機能していると推測される。これらのP
KS、FAS I、Hglといった酵素は、どれも脂肪酸
やポリケタイドのような長鎖不飽和化合物の生合成に関
わっており、ORF8、9、10及び11にコードされ
るタンパク質がDHAの合成に関与している可能性は高
いと考えられる。Thus, the proteins encoded by these ORFs are presumed to function as multifunctional enzymes such as PKS, FASI and Hgl. These P
Enzymes such as KS, FAS I and Hgl are all involved in the biosynthesis of long-chain unsaturated compounds such as fatty acids and polyketides, and the proteins encoded by ORFs 8, 9, 10 and 11 are involved in the synthesis of DHA. It is highly probable.
【0055】このようなドメイン構造は、EPA合成酵
素群遺伝子にもみられ、本発明で得た遺伝子は、それと
非常によく似た構造をとっている。その結果を図4に示
す。図4において、矢印は、ORFの大きさと方向を示
す。DHA合成酵素遺伝子群(A)とEPA合成酵素群
遺伝子(B)とのドメイン構造の大きな違いとして、A
CPドメインの繰り返し数があげられる。EPA合成酵
素群遺伝子のORF5では6回であるのに対して、DH
A合成酵素遺伝子群のORF8では5回しかない。ま
た、EPA合成酵素群遺伝子のORF6にはみられなか
ったKASドメインが、DHA合成酵素遺伝子群のOR
F9には存在する。さらに、EPA合成酵素群遺伝子の
ORF7にはKASドメインは1ヶ所しかないが、DH
A合成酵素遺伝子群のORF10には2ヶ所存在する。
このようなEPA合成酵素群遺伝子とDHA合成酵素遺
伝子群のドメイン構造の違いが、最終産物の違いをもた
らしている可能性が高いと考えられる。Such a domain structure is also found in EPA synthase group genes, and the gene obtained in the present invention has a very similar structure. FIG. 4 shows the results. In FIG. 4, the arrows indicate the size and direction of the ORF. The major difference in the domain structure between the DHA synthase gene group (A) and the EPA synthase group gene (B) is as follows.
The number of repetitions of the CP domain is given. In EPA synthase group gene ORF5, DH is 6 times, whereas DH is 5 times.
ORF8 of the A synthase gene group has only 5 times. In addition, a KAS domain that was not found in ORF6 of the EPA synthase gene group was OR OR of the DHA synthase gene group.
Present in F9. Furthermore, although there is only one KAS domain in ORF7 of the EPA synthase group gene, DH
There are two locations in ORF10 of the A synthase gene group.
It is highly likely that such a difference in the domain structure between the EPA synthase gene group and the DHA synthase gene group causes a difference in the final product.
【0056】[0056]
【発明の効果】本発明によれば、DHA生産細菌に由来
する、配列番号3、5、7及び9に示すアミノ酸配列を
コードする新規のイコサペンタエン酸合成酵素群類似遺
伝子及びそれから翻訳されるアミノ酸配列を有するポリ
ペプチドが提供される。本発明によれば、確立された遺
伝子発現システムによって該遺伝子を発現させることに
より、DHA生産性の向上を目指す研究や、DHA生合
成能を持たない生物へのDHA生合成能を賦与するこ
と、更にはDHAの有利な製造法を確立することができ
るようになる。更に、本発明から得られる遺伝子情報
は、高度不飽和脂肪酸生産細菌から高度不飽和脂肪酸合
成に関与する遺伝子の検出に有用なプライマー及び/又
はプローブの設計に利用することができる。According to the present invention, a novel gene similar to the icosapentaenoic acid synthase group encoding the amino acid sequences shown in SEQ ID NOS: 3, 5, 7, and 9 derived from a DHA-producing bacterium, and an amino acid sequence translated therefrom Are provided. According to the present invention, a study aimed at improving DHA productivity by expressing the gene by an established gene expression system, or conferring DHA biosynthesis ability to an organism having no DHA biosynthesis ability, Furthermore, an advantageous method for producing DHA can be established. Furthermore, the genetic information obtained from the present invention can be used for designing primers and / or probes useful for detecting genes involved in polyunsaturated fatty acid synthesis from polyunsaturated fatty acid producing bacteria.
【0057】一般に、有用物質(例えば、DHA)を生
産する野生株の有用物質の生産性は低いことが多いが、
本発明の遺伝子を利用して、微生物におけるDHAの生
産性の向上を図ることによって、それらの微生物を工業
的に利用することが可能となる。In general, the productivity of a useful substance in a wild strain producing a useful substance (eg, DHA) is often low,
By utilizing the gene of the present invention to improve the productivity of DHA in microorganisms, those microorganisms can be industrially used.
【0058】[0058]
【配列表】 SEQUENCE LISTING <110> Director-General of Agency of Industrial Science and Technology <120> Gene from Docosahexaenoic Acid Producing Bacteria <130> P99-0665 <160> 11 <170> PatentIn Ver. 2.0 <210> 1 <211> 41587 <212> DNA <213> Moritella marina <400> 1 gatcactctg ctgcatggcg agagctgttt aattacaggt tgaaaaaaac gatgtaatgc 60 acttaattgc ttgctgttct taatgcctga ggcgtcgaag ataataccgt tgaagcgatc 120 tgttttagcg atagcattaa ggctaatagg tgtcgcgact aaagacgttt gattaaattc 180 aatattaaga tcggctaacg ctgacgtgtt attaggataa gaaatcgtga cttcagcatc 240 tttaaatgtg ttaagaatgg gtttaattaa tttgctgttg ctggctgcgc cgatgagtaa 300 gttgccagag atgagatcgg ttccctgatc gtagcgtgtt aacgtaaccg gtcgtggcag 360 attaagcgct ttaaataaac ctgatgtcca cttgccatta gcgagttttg cgtatgtatc 420 cgtcattttc taatccttgt tatagtgaac agtttgaatc tcgaagatgt acatgtgtta 480 aaaattatct gatagctatg acttatctgc cactacgtaa taataaatag accagttcat 540 tacatcgtta atcgatatag tataactaaa tactaagtaa attataatga taagactgtt 600 atcgtactcg gatcaaactc tgatcagcaa ataatcaaat tagagttttt attttaaact 660 tgtatcaaca atgttacatt aatgtatctt acgtctaatg tgctacgggc atatttaagt 720 cactaaatta aaggaataaa ccatgacagg tcaaacaata agaagagtag caattatcgg 780 cggtaaccgt atcccgtttg cacgttcaaa tacagcgtat tcaaaactaa gtaaccaaga 840 tatgctgacg gaaactatcc gtggcttggt ggttaaatat aacctacgtg gtgaacaact 900 gggggaagtt gttgctggtg cggtaattaa gcattctcgt gattttaact taacacgtga 960 agccgtgcta agtgcaggtc ttgcacctga aacgccttgt tatgacattc aacaagcttg 1020 tggtactggt ctagctgcag ctatccaagt agcaaacaaa attgcgcttg gtcaaataga 1080 agcgggtatt gctggtggtt ctgatacgac atcagatgca ccgattgcag tcagtgaagg 1140 catgcgtagt gtattacttg agcttaatcg agctaaaacg ggtaagcaac gtttgaaagc 1200 actatctcgt ctacgtctaa aacactttgc gccactaacg cctgcaaata aagagccgcg 1260 taccaaaatg gcgatgggcg atcattgtca agtaacagcg aaagagtgga atatctcacg 1320 tgaagcacaa gatgcattgg cctgcgcaag tcatcaaaaa ttagctgcag catatgaaga 1380 aggtttcttt gatacgttag tttcacctat ggccggctta acgaaagata acgtattacg 1440 cgcagataca acagttgaga aactggctaa attgaaacct tgttttgata aagtaaacgg 1500 cactatgacg gcgggtaaca gtactaacct taccgatgga gcatcagctg tattacttgc 1560 aagtgaagaa tgggcagcgg cacataactt accagtacaa gcttatctaa catttggtga 1620 aacggccgct atcgacttcg ttgataagaa agaaggtctg ttaatggcgc ctgcatacgc 1680 agtgccaaaa atgttgaagc gtgctggcct tacattacaa gacttcgatt actatgaaat 1740 acatgaagca tttgctgcgc agttattagc aacgctagca gcttgggaag acgaaaaatt 1800 ctgtaaagaa aaactgggtc tagatgctgc gcttggttca attgatatga ccaagttaaa 1860 cgtgaaaggg agtagcttag ccacgggtca cccatttgcc gcaactggtg gtcgtgttgt 1920 cgctacgcta gcgcaattac ttgatcagaa aggttcaggt cgtggtttga tctcgatttg 1980 tgctgctggt ggtcaaggta tcacggcaat tttagagaaa taaacgcact gtttattatc 2040 tattgattaa gctgtcctga gatactggat atttttaaat aaaacgccaa tactgcagag 2100 tattggcgtt tttttgtaat accaattcct atataacggt gcattttaaa cacttaattt 2160 ccggcattgg tatcataaaa aagcagcacc gaagtgctgc ttgattgtag attaacctat 2220 taaaatagag aggctagaat tagtcttcgt atgcttcatt atgtacgcca gctgcacgac 2280 ccgatggatc agcattgttt tggaaacttt catcccaagc taatgcttct acagttgaac 2340 aagcaacgga tttaccaaac ggtacgcatt tcgctgctga atcacctggg aagtgatctt 2400 caaagatggc acgatagtag taaccttctt tcgtatctgg tgtgttaatt gggaacttaa 2460 atgctgcact tgctaacatt tgatcagtta ccgcttcttc aacgtgtact ttaagttggt 2520 caatccaaga ataaccaaca ccatcagaga attgttcttt ttgacgccat acaatttctt 2580 caggtagtaa atcttcaaat gcttctcgaa tgatgttttt ctcaatgcgg tcgcccgtga 2640 tcatttttag ttcagggttt agacgcattg acgcatcaac aaattcttta tctaagaaag 2700 gaacacgtgc ttcgatgccc caagctgcca tagatttgtt tgcacgtaag caatcaaaca 2760 tatgtaattt atttacttta cgtaccgtct cttcatggaa ttctttcgca tttggcgctt 2820 tgtggaagta caagtaacca ccgaacagtt catcagcacc ttcaccagaa agcaccatct 2880 taatccccat ggctttaatt ttacgtgcca ttaggtacat aggggttgat gcacgaattg 2940 ttgttacatc gtaggtttca atgtggtaaa tcacgtcgcg taaagcgtcg ataccttctt 3000 gcacagtaaa ttcaattgaa tgatggatag tacctaagtg atctgccact ttttgtgcag 3060 cggctaaatc tggagaacca tttaggccta cagagaaaga gtgtagttgt ggccaccatg 3120 cttcggtttt accaccgtct tcaatacgac gttttgcata ctgttgggtg attgctgaaa 3180 taacagatga atctaacccg cctgataata atacgccgta aggtacatca cacattaatt 3240 gacgtttaac tgcatcttcc aaaccttgct taacaacgct tttatcacca ccattttgtg 3300 caacgttatc aaaatctttc caatcacgtt gataataagg cgtgactaca ccatccttac 3360 tccacaggta atgacctgct gggaattctt caatttgagt acaaattggc actagtgctt 3420 tcatttcaga ggcaacataa aagttaccgt gttcatcata gcccgtataa agagggatga 3480 taccgatatg gtcacggcca atcaggtaag cgtcctctgt ttcgtcatat aaagcgaaag 3540 caaaaatacc atttagatca tctaaaaatt gtgtgccttt ttctttatat agcgcaagta 3600 tcacttcgca atctgattct gtttggaatt caaagtctac gttcagcgtt ttctttaaat 3660 ctttgtggtt ataaatttca ccattaacag caagtacgtg tgtcttttct tcattatata 3720 gcggctgtgc accattattt acatcgacaa tagcaagacg ttcatgaact aaaatagcat 3780 tgtcacttgt atagatacct gaccaatctg ggccgcggtg acgtagtaac tttgatagtt 3840 ctagtgcttg ttcgcgaaga ggtttaatgt ctgatttgat gtctagaatt ccgaatattg 3900 agcacataac taattccttc tggggctgcg tctgcagcta actttctaaa tagtgtgtct 3960 aatttgccac attgtagatt taatgcaaac attaatgata aaacatttat aaaaaatgta 4020 attcaatgtg gaatcgataa tttaatggct taaaagtgaa gatccattaa ttgtgatggc 4080 gaggtgatag accaatgtag accttaatga ataaagcagg cacgattgaa tccattcaac 4140 gcaaagtggt actaactatt gttttaaacg ttataaatag tgttttaaag gttataagta 4200 aataatttaa aaacaataat aatccacatg cattaaattt atcatgataa accgctatat 4260 ctcaatggca atttgggata agtgtaaaat atatgtaaaa tgaatgagtt gacttgcttt 4320 ttttacacta agtgatgaaa ttaaagctag atgtcgttgt tagcattgat taataacgta 4380 ctaaaatacg acatctagta tagaaattta aaaaacagtt ggttttgata gcataactgc 4440 ataaactaat cagcttattg tctgtaatat ttttgtaatt taaataggtt taataaaatt 4500 atatgtctga taaatataaa ccgtacgacc tttcctttaa aaagacgttt ttgctgccta 4560 agttttggcc tgtgtggttc ggggtgtttg caatatactt attagctttt atgccagtaa 4620 agccgcgtga taaatttgct cgattcatag cgaagaaatt gtttagtcta aaaatgatgg 4680 caaagcgtaa aaaggtagca aagatcaatt tatctatgtg cttccctgaa atggatgata 4740 cggaacaaga ccgtataatc atggtcaatc tagttacttt ttgtcaaact atcttaagtt 4800 atgcagagcc aagtgcgcgt agtcgtgctt ataaccgtga ccgtatgata gtgcatggtg 4860 gcgagaattt atttccgcta cttgaacaag gtaaggcttg tatcttatta gtgccgcata 4920 gcttcgctat tgattttgca ggtttacaca ttgcttctta tggcgcgcca ttttgtacta 4980 tgtttaacaa ttctgagaat gagttgttcg attggctgat gacacgtcaa cgcgctatgt 5040 ttggaggcac tgtttatcac cgcaaggcag ggctaggggc tctagttaaa tcacttaaga 5100 gcggtgaaag ctgttattac ttacctgatg aagaccatgg acctaagcgt agtgtatttg 5160 cgcctttatt tgcgactcaa aaagcaactt tacctgtaat gggcaagcta gcagaaaaaa 5220 caaatgcact cgttgttcct gtttatgcgg catataatga atcactaggt aaatttgaaa 5280 cctttattcg accagcaatg caaaactttc catcagaaag cccagaacaa gatgcagtga 5340 tgatgaataa agagattgaa gccttgattg aatgtggtgt tgatcaatat atgtggacac 5400 ttagattatt gagaacacgt ccggacggta aaaaaatcta ctaataaagt ttaataaaca 5460 ccataatctt cgttgaatat ggtgtttacc cccctgaata ccctctaaat taataacaaa 5520 aaaagccatt tacgtaacat ctaatgatga tttagcctgc acttgctttg tttttagtct 5580 taagagccta ataaacttga tctaggtata gattctgtct ttctttacgt aacgcgatct 5640 atttttttta accgatagtt gttataatta gtttcatatg aaagagatat cgtttcagta 5700 aaagctattt cgtttcaata gataatttat ttatagtcat attttctgta atgacaatca 5760 ttttctcatc tagactatag ataagaatac gaattaagta agaacattaa ttttacaaga 5820 atataaaata tcccatcgga gctataagaa tgaaaaagac taaaattgtt tgtacaattg 5880 gtccaaaaac tgaatcagta gagaaactaa cagagcttgt taatgcaggc atgaacgtta 5940 tgcgtttaaa tttctctcat ggtaactttg ctgaacattc agtgcgtatt caaaatatcc 6000 gtcaagtaag tgaaaacctg aataagaaaa ttgctgtttt actggatact aaaggtccag 6060 aaatccgtac gattaaacta gaaaacggtg acgatgtaat gttgaccgct ggtcagtcat 6120 tcacgtttac aacagacatt aacgtggtag gtaataaaga ctgtgttgct gtaacatatg 6180 ctggttttgc taaagacctt aatcctggtg caatcatcct tgttgatgat ggtttaattg 6240 aaatggaagt tgttgcaaca actgacactg aagttaaatg tacagtatta aatactggtg 6300 cacttggtga aaataaaggc gttaacttac ctaacatcag tgtaggtcta cctgcattgt 6360 cagaaaaaga taaagctgat ttagcgtttg gttgtgagca agaagttgat tttgttgctg 6420 catcatttat tcgtaaggct gatgatgtaa gagaaattcg tgaaatccta tttaataatg 6480 gtggcgaaaa cattcagatt atctcgaaaa ttgaaaacca agaaggtgta gacaatttcg 6540 atgaaatctt agctgaatca gacggtatca tggttgctcg tggcgatctc ggtgttgaga 6600 tcccagttga agaagtgatc atggcacaga agatgatgat caaaaaatgt aataaagcag 6660 gtaaagttgt aattactgca acacaaatgc ttgattcaat gatcagtaac ccacgtccaa 6720 cacgtgcaga agcgggcgat gttgccaatg ctgtgcttga cggtaccgac gcggtaatgc 6780 tttctggtga aactgcgaaa ggtaaatacc cagttgaagc tgtgtctatc atggcaaaca 6840 tctgtgaacg tactgataac tcaatgtctt cggatttagg tgcgaacatt gttgctaaaa 6900 gcatgcgcat tacagaagct gtgtgtaaag gtgcggtaga aacaacagaa aaattgtgtg 6960 ctccacttat tgttgttgca actcgtggcg gtaaatcagc aaaatctgtt cgtaaatact 7020 tcccgaaagc aaatattctt gctatcacaa caaatgaaaa agcagcgcaa cagttatgcc 7080 taactaaagg cgtaagcagc tgcatcgttg agcagattga tagcactgat gagttctacc 7140 gtaaaggtaa agagcttgca ttagcaactg gtttagctaa agaaggcgat atcgttgtta 7200 tggtatcagg tgcgttagta ccatcaggta caacgaatac ggcatctgtt caccaacttt 7260 aagttgccat attgatatta taaaaaagag agcgtatgct ctcttttttt atatctgtag 7320 tttatatgtc tgtacaaaaa aatgataaag agtacataaa ctattaatat agcgtaatat 7380 ataatgatta acggtgatga aagggttaaa taaatggata gtgctaaaca taaaattggc 7440 ttagtccttt ctggcggtgg tgcgaaaggt attgctcatc ttggtgtatt aaaatacctg 7500 ttagagcaag atataagacc gaatgtaatt gcgggtacaa gtgctggctc tatggttggt 7560 gcactttatt gctcaggact tgagattgat gacattttac aattcttcat cgatgtaaaa 7620 cctttttctt ggaagtttac ccgtgcccgt gctggcttta tagacccggc aaaattatat 7680 cctgaagtgc taaaatatat ccccgaggat agctttgagt accttcaacc tgaattgcgc 7740 attgttgcca ccaacatgtt actcggtaaa gagcatatat ttaaagatgg ctccgtgatt 7800 aatgccttat tagcatcagc cagctaccct ttagtttttt ctccgatgat cattgacgat 7860 caagtgtatt cagatggcgg tattgttaat catttccccg tgagtgtcat tgaagatgat 7920 tgcgataaaa taatcggcgt atacgtgtcg cccattcgtc aggtcgaagc tgacgaactc 7980 tcgagtataa aagacgtggt attacgtgcg ttcacgctgc agggtagtgg tgctgaatta 8040 gataaactat cgcaatgtga tgtgcaaatt tatccagaag cgctattgaa ttacaatacg 8100 tttgcaaccg atgaaaaatc attacgggag atctaccaga ttggttatga tgctgcaaaa 8160 gatcaacatg acaaccttat ggcattgaaa gaaagtatca ccaccagcga ggttaaaaag 8220 aacgtcttta gcaaatggtt tggtgataaa cttgctagca acagcggcaa atagcggccc 8280 acacggattt atacactagg ataatgggcg ttaatagcct cactgtcgtt gtgtggtctc 8340 taattttagc taaatcttgt gttatactga cttcctatta atcataaacg atttatcacg 8400 gtaaacatga ctcaaataaa taacccgctt cacggcatga cactcgaaaa agtaattaac 8460 agtctcgttg aacaatatgg ctgggatggt cttggatact acatcaacat tcgttgcttt 8520 actgaaaatc caagtgttaa gtctagtctt aaatttttac gtaaaacccc ttgggcacgt 8580 gataaagtag aagcgctata tatcaaaatg gtgactgaag gctaactgtc tccacgctag 8640 cgaaccgctg tttatagtta atataagtac tataagcagg gctcgttaat tcagtatgta 8700 attaatcctg aataccttcc gcttatttca acattgtact ctctagataa cactctcaac 8760 attacacctt caacatcaca gcctccacat aacatccgat gacatagccc tgttattttt 8820 cacatttatc tatatgctat atattttagc catttgatca attgagttaa tttctgcaat 8880 gacaaagata taccatcatc cagtacaaat ttattatgaa gataccgacc attctggtgt 8940 tgtttaccac cctaactttt taaaatactt tgaacgtgca cgtgagcatg tgataaatag 9000 tgacttacta gcaacattgt ggaatgaacg cggtttaggt tttgcggtgt ataaagccaa 9060 tatgactttt caggatgggg tcgaatttgc tgaagtgtgt gatattcgca cttcttttgt 9120 cctagacggt aagtacaaaa cgatctggcg ccaagaagta tggcgtccga atgcgactag 9180 ggctgccgtt atcggtgata ttgaaatggt gtgcttagac aaacaaaaac gtttacagcc 9240 catccctgat gatgtgttag ctgcaatggt tagtgaataa atggttcatg cataaatagt 9300 taatacatga ttctggcccg tcacgtttac agataagagg catccgatgc ctccttccta 9360 ttaccaatac tactgcttat ccctttctaa ctatctttag cgtccataac acactgagca 9420 tttattctat taatcagtga ttgtgattta attatcttct atatatgtaa tttaatgtaa 9480 ttttcaattt atttttagct acattaaggc ttacgaatgt acgctaaaat gagatgtcag 9540 actaatttta gcttattaat ctgttagccg tttatatttt ataaagatgg gatttaactt 9600 aaatgcaatt aattatggcg taaatagagt gaaaacatgg ctaatattca ctaagtcctg 9660 aattttatat aaagtttaat ctgttatttt agcgtttacc tggtcttatc agtgaggttt 9720 atagccatta ttagtgggat tgaagtgatt tttaaagcta tgtatattat tgcaaatata 9780 aattgtaaca attaagactt tggacacttg agttcaattt cgaattgatt ggcataaaat 9840 ttaaaacagc taaatctacc tcaatcattt tagcaaatgt atgcaggtag atttttttcg 9900 ccatttaaga gtacacttgt acgctaggtt tttgtttagt gtgcaaatga acgttttgat 9960 gagcattgtt tttagagcac aaaatagatc cttacaggag caataacgca atggctaaaa 10020 agaacaccac atcgattaag cacgccaagg atgtgttaag tagtgatgat caacagttaa 10080 attctcgctt gcaagaatgt ccgattgcca tcattggtat ggcatcggtt tttgcagatg 10140 ctaaaaactt ggatcaattc tgggataaca tcgttgactc tgtggacgct attattgatg 10200 tgcctagcga tcgctggaac attgacgacc attactcggc tgataaaaaa gcagctgaca 10260 agacatactg caaacgcggt ggtttcattc cagagcttga ttttgatccg atggagtttg 10320 gtttaccgcc aaatatcctc gagttaactg acatcgctca attgttgtca ttaattgttg 10380 ctcgtgatgt attaagtgat gctggcattg gtagtgatta tgaccatgat aaaattggta 10440 tcacgctggg tgtcggtggt ggtcagaaac aaatttcgcc attaacgtcg cgcctacaag 10500 gcccggtatt agaaaaagta ttaaaagcct caggcattga tgaagatgat cgcgctatga 10560 tcatcgacaa atttaaaaaa gcctacatcg gctgggaaga gaactcattc ccaggcatgc 10620 taggtaacgt tattgctggt cgtatcgcca atcgttttga ttttggtggt actaactgtg 10680 tggttgatgc ggcatgcgct ggctcccttg cagctgttaa aatggcgatc tcagacttac 10740 ttgaatatcg ttcagaagtc atgatatcgg gtggtgtatg ttgtgataac tcgccattca 10800 tgtatatgtc attctcgaaa acaccagcat ttaccaccaa tgatgatatc cgtccgtttg 10860 atgacgattc aaaaggcatg ctggttggtg aaggtattgg catgatggcg tttaaacgtc 10920 ttgaagatgc tgaacgtgac ggcgacaaaa tttattctgt actgaaaggt atcggtacat 10980 cttcagatgg tcgtttcaaa tctatttacg ctccacgccc agatggccaa gcaaaagcgc 11040 taaaacgtgc ttatgaagat gccggttttg cccctgaaac atgtggtcta attgaaggcc 11100 atggtacggg taccaaagcg ggtgatgccg cagaatttgc tggcttgacc aaacactttg 11160 gcgccgccag tgatgaaaag caatatatcg ccttaggctt agttaaatcg caaattggtc 11220 atactaaatc tgcggctggc tctgcgggta tgattaaggc ggcattagcg ctgcatcata 11280 aaatcttacc tgcaacgatc catatcgata aaccaagtga agccttggat atcaaaaaca 11340 gcccgttata cctaaacagc gaaacgcgtc cttggatgcc acgtgaagat ggtattccac 11400 gtcgtgcagg tatcagctca tttggttttg gcggcaccaa cttccatatt attttagaag 11460 agtatcgccc aggtcacgat agcgcatatc gcttaaactc agtgagccaa actgtgttga 11520 tctcggcaaa cgaccaacaa ggtattgttg ctgagttaaa taactggcgt actaaactgg 11580 ctgtcgatgc tgatcatcaa gggtttgtat ttaatgagtt agtgacaacg tggccattaa 11640 aaaccccatc cgttaaccaa gctcgtttag gttttgttgc gcgtaatgca aatgaagcga 11700 tcgcgatgat tgatacggca ttgaaacaat tcaatgcgaa cgcagataaa atgacatggt 11760 cagtacctac cggggtttac tatcgtcaag ccggtattga tgcaacaggt aaagtggttg 11820 cgctattctc agggcaaggt tcgcaatacg tgaacatggg tcgtgaatta acctgtaact 11880 tcccaagcat gatgcacagt gctgcggcga tggataaaga gttcagtgcc gctggtttag 11940 gccagttatc tgcagttact ttccctatcc ctgtttatac ggatgccgag cgtaagctac 12000 aagaagagca attacgttta acgcaacatg cgcaaccagc gattggtagt ttgagtgttg 12060 gtctgttcaa aacgtttaag caagcaggtt ttaaagctga ttttgctgcc ggtcatagtt 12120 tcggtgagtt aaccgcatta tgggctgccg atgtattgag cgaaagcgat tacatgatgt 12180 tagcgcgtag tcgtggtcaa gcaatggctg cgccagagca acaagatttt gatgcaggta 12240 agatggccgc tgttgttggt gatccaaagc aagtcgctgt gatcattgat acccttgatg 12300 atgtctctat tgctaacttc aactcgaata accaagttgt tattgctggt actacggagc 12360 aggttgctgt agcggttaca accttaggta atgctggttt caaagttgtg ccactgccgg 12420 tatctgctgc gttccataca cctttagttc gtcacgcgca aaaaccattt gctaaagcgg 12480 ttgatagcgc taaatttaaa gcgccaagca ttccagtgtt tgctaatggc acaggcttgg 12540 tgcattcaag caaaccgaat gacattaaga aaaacctgaa aaaccacatg ctggaatctg 12600 ttcatttcaa tcaagaaatt gacaacatct atgctgatgg tggccgcgta tttatcgaat 12660 ttggtccaaa gaatgtatta actaaattgg ttgaaaacat tctcactgaa aaatctgatg 12720 tgactgctat cgcggttaat gctaatccta aacaacctgc ggacgtacaa atgcgccaag 12780 ctgcgctgca aatggcagtg cttggtgtcg cattagacaa tattgacccg tacgacgccg 12840 ttaagcgtcc acttgttgcg ccgaaagcat caccaatgtt gatgaagtta tctgcagcgt 12900 cttatgttag tccgaaaacg aagaaagcgt ttgctgatgc attgactgat ggctggactg 12960 ttaagcaagc gaaagctgta cctgctgttg tgtcacaacc acaagtgatt gaaaagatcg 13020 ttgaagttga aaagatagtt gaacgcattg tcgaagtaga gcgtattgtc gaagtagaaa 13080 aaatcgtcta cgttaatgct gacggttcgc ttatatcgca aaataatcaa gacgttaaca 13140 gcgctgttgt tagcaacgtg actaatagct cagtgactca tagcagtgat gctgaccttg 13200 ttgcctctat tgaacgcagt gttggtcaat ttgttgcaca ccaacagcaa ttattaaatg 13260 tacatgaaca gtttatgcaa ggtccacaag actacgcgaa aacagtgcag aacgtacttg 13320 ctgcgcagac gagcaatgaa ttaccggaaa gtttagaccg tacattgtct atgtataacg 13380 agttccaatc agaaacgcta cgtgtacatg aaacgtacct gaacaatcag acgagcaaca 13440 tgaacaccat gcttactggt gctgaagctg atgtgctagc aaccccaata actcaggtag 13500 tgaatacagc cgttgccact agtcacaagg tagttgctcc agttattgct aatacagtga 13560 cgaatgttgt atctagtgtc agtaataacg cggcggttgc agtgcaaact gtggcattag 13620 cgcctacgca agaaatcgct ccaacagtcg ctactacgcc agcacccgca ttggttgcta 13680 tcgtggctga acctgtgatt gttgcgcatg ttgctacaga agttgcacca attacaccat 13740 cagttacacc agttgtcgca actcaagcgg ctatcgatgt agcaactatt aacaaagtaa 13800 tgttagaagt tgttgctgat aaaaccggtt atccaacgga tatgctggaa ctgagcatgg 13860 acatggaagc tgacttaggt atcgactcaa tcaaacgtgt tgagatatta ggcgcagtac 13920 aggaattgat ccctgactta cctgaactta atcctgaaga tcttgctgag ctacgcacgc 13980 ttggtgagat tgtcgattac atgaattcaa aagcccaggc tgtagctcct acaacagtac 14040 ctgtaacaag tgcacctgtt tcgcctgcat ctgctggtat tgatttagcc cacatccaaa 14100 acgtaatgtt agaagtggtt gcagacaaaa ccggttaccc aacagacatg ctagaactga 14160 gcatggatat ggaagctgac ttaggtattg attcaatcaa gcgtgtggaa atcttaggtg 14220 cagtacagga gatcataact gatttacctg agctaaaccc tgaagatctt gttgaattac 14280 gcaccctagg tgaaatcgtt agttacatgc aaagcaaagc gccagtcgct gaaagtgcgc 14340 cagtggcgac ggctcctgta gcaacaagct cagcaccgtc tatcgatttg aaccacattc 14400 aaacagtgat gatggatgta gttgcagata agactggtta tccaactgac atgctagaac 14460 ttggcatgga catggaagct gatttaggta tcgattcaat caaacgtgtg gaaatattag 14520 gcgcagtgca ggagatcatc actgatttac ctgagctaaa cccagaagac ctcgctgaat 14580 tacgcacgct aggtgaaatc gttagttaca tgcaaagcaa agcgccagtc gctgagagtg 14640 cgccagtagc gacggcttct gtagcaacaa gctctgcacc gtctatcgat ttaaaccata 14700 tccaaacagt gatgatggaa gtggttgcag acaaaaccgg ttatccagta gacatgttag 14760 aacttgctat ggacatggaa gctgacctag gtatcgattc aatcaagcgt gtagaaattt 14820 taggtgcggt acaggaaatc attactgact tacctgagct taaccctgaa gatcttgctg 14880 aactacgtac attaggtgaa atcgttagtt acatgcaaag caaagcgccc gtagctgaag 14940 cgcctgcagt acctgttgca gtagaaagtg cacctactag tgtaacaagc tcagcaccgt 15000 ctatcgattt agaccacatc caaaatgtaa tgatggatgt tgttgctgat aagactggtt 15060 atcctgccaa tatgcttgaa ttagcaatgg acatggaagc cgaccttggt attgattcaa 15120 tcaagcgtgt tgaaattcta ggcgcggtac aggagatcat tactgattta cctgaactaa 15180 acccagaaga cttagctgaa ctacgtacgt tagaagaaat tgtaacctac atgcaaagca 15240 aggcgagtgg tgttactgta aatgtagtgg ctagccctga aaataatgct gtatcagatg 15300 catttatgca aagcaatgtg gcgactatca cagcggccgc agaacataag gcggaattta 15360 aaccggcgcc gagcgcaacc gttgctatct ctcgtctaag ctctatcagt aaaataagcc 15420 aagattgtaa aggtgctaac gccttaatcg tagctgatgg cactgataat gctgtgttac 15480 ttgcagacca cctattgcaa actggctgga atgtaactgc attgcaacca acttgggtag 15540 ctgtaacaac gacgaaagca tttaataagt cagtgaacct ggtgacttta aatggcgttg 15600 atgaaactga aatcaacaac attattactg ctaacgcaca attggatgca gttatctatc 15660 tgcacgcaag tagcgaaatt aatgctatcg aatacccaca agcatctaag caaggcctga 15720 tgttagcctt cttattagcg aaattgagta aagtaactca agccgctaaa gtgcgtggcg 15780 cctttatgat tgttactcag cagggtggtt cattaggttt tgatgatatc gattctgcta 15840 caagtcatga tgtgaaaaca gacctagtac aaagcggctt aaacggttta gttaagacac 15900 tgtctcacga gtgggataac gtattctgtc gtgcggttga tattgcttcg tcattaacgg 15960 ctgaacaagt tgcaagcctt gttagtgatg aactacttga tgctaacact gtattaacag 16020 aagtgggtta tcaacaagct ggtaaaggcc ttgaacgtat cacgttaact ggtgtggcta 16080 ctgacagcta tgcattaaca gctggcaata acatcgatgc taactcggta tttttagtga 16140 gtggtggcgc aaaaggtgta actgcacatt gtgttgctcg tatagctaaa gaatatcagt 16200 ctaagttcat cttattggga cgttcaacgt tctcaagtga cgaaccgagc tgggcaagtg 16260 gtattactga tgaagcggcg ttaaagaaag cagcgatgca gtctttgatt acagcaggtg 16320 ataaaccaac acccgttaag atcgtacagc taatcaaacc aatccaagct aatcgtgaaa 16380 ttgcgcaaac cttgtctgca attaccgctg ctggtggcca agctgaatat gtttctgcag 16440 atgtaactaa tgcagcaagc gtacaaatgg cagtcgctcc agctatcgct aagttcggtg 16500 caatcactgg catcattcat ggcgcgggtg tgttagctga ccaattcatt gagcaaaaaa 16560 cactgagtga ttttgagtct gtttacagca ctaaaattga cggtttgtta tcgctactat 16620 cagtcactga agcaagcaac atcaagcaat tggtattgtt ctcgtcagcg gctggtttct 16680 acggtaaccc cggccagtct gattactcga ttgccaatga gatcttaaat aaaaccgcat 16740 accgctttaa atcattgcac ccacaagctc aagtattgag ctttaactgg ggtccttggg 16800 acggtggcat ggtaacgcct gagcttaaac gtatgtttga ccaacgtggt gtttacatta 16860 ttccacttga tgcaggtgca cagttattgc tgaatgaact agccgctaat gataaccgtt 16920 gtccacaaat cctcgtgggt aatgacttat ctaaagatgc tagctctgat caaaagtctg 16980 atgaaaagag tactgctgta aaaaagccac aagttagtcg tttatcagat gctttagtaa 17040 ctaaaagtat caaagcgact aacagtagct ctttatcaaa caagactagt gctttatcag 17100 acagtagtgc ttttcaggtt aacgaaaacc actttttagc tgaccacatg atcaaaggca 17160 atcaggtatt accaacggta tgcgcgattg cttggatgag tgatgcagca aaagcgactt 17220 atagtaaccg agactgtgca ttgaagtatg tcggtttcga agactataaa ttgtttaaag 17280 gtgtggtttt tgatggcaat gaggcggcgg attaccaaat ccaattgtcg cctgtgacaa 17340 gggcgtcaga acaggattct gaagtccgta ttgccgcaaa gatctttagc ctgaaaagtg 17400 acggtaaacc tgtgtttcat tatgcagcga caatattgtt agcaactcag ccacttaatg 17460 ctgtgaaggt agaacttccg acattgacag aaagtgttga tagcaacaat aaagtaactg 17520 atgaagcaca agcgttatac agcaatggca ccttgttcca cggtgaaagt ctgcagggca 17580 ttaagcagat attaagttgt gacgacaagg gcctgctatt ggcttgtcag ataaccgatg 17640 ttgcaacagc taagcaggga tccttcccgt tagctgacaa caatatcttt gccaatgatt 17700 tggtttatca ggctatgttg gtctgggtgc gcaaacaatt tggtttaggt agcttacctt 17760 cggtgacaac ggcttggact gtgtatcgtg aagtggttgt agatgaagta ttttatctgc 17820 aacttaatgt tgttgagcat gatctattgg gttcacgcgg cagtaaagcc cgttgtgata 17880 ttcaattgat tgctgctgat atgcaattac ttgccgaagt gaaatcagcg caagtcagtg 17940 tcagtgacat tttgaacgat atgtcatgat cgagtaaata ataacgatag gcgtcatggt 18000 gagcatggcg tctgctttct tcatttttta acattaacaa tattaatagc taaacgcggt 18060 tgctttaaac caagtaaaca agtgctttta gctattacta ttccaaacag gatattaaag 18120 agaatatgac ggaattagct gttattggta tggatgctaa atttagcgga caagacaata 18180 ttgaccgtgt ggaacgcgct ttctatgaag gtgcttatgt aggtaatgtt agccgcgtta 18240 gtaccgaatc taatgttatt agcaatggcg aagaacaagt tattactgcc atgacagttc 18300 ttaactctgt cagtctacta gcgcaaacga atcagttaaa tatagctgat atcgcggtgt 18360 tgctgattgc tgatgtaaaa agtgctgatg atcagcttgt agtccaaatt gcatcagcaa 18420 ttgaaaaaca gtgtgcgagt tgtgttgtta ttgctgattt aggccaagca ttaaatcaag 18480 tagctgattt agttaataac caagactgtc ctgtggctgt aattggcatg aataactcgg 18540 ttaatttatc tcgtcatgat cttgaatctg taactgcaac aatcagcttt gatgaaacct 18600 tcaatggtta taacaatgta gctgggttcg cgagtttact tatcgcttca actgcgtttg 18660 ccaatgctaa gcaatgttat atatacgcca acattaaggg cttcgctcaa tcgggcgtaa 18720 atgctcaatt taacgttgga aacattagcg atactgcaaa gaccgcattg cagcaagcta 18780 gcataactgc agagcaggtt ggtttgttag aagtgtcagc agtcgctgat tcggcaatcg 18840 cattgtctga aagccaaggt ttaatgtctg cttatcatca tacgcaaact ttgcatactg 18900 cattaagcag tgcccgtagt gtgactggtg aaggcgggtg tttttcacag gtcgcaggtt 18960 tattgaaatg tgtaattggt ttacatcaac gttatattcc ggcgattaaa gattggcaac 19020 aaccgagtga caatcaaatg tcacggtggc ggaattcacc attctatatg cctgtagatg 19080 ctcgaccttg gttcccacat gctgatggct ctgcacacat tgccgcttat agttgtgtga 19140 ctgctgacag ctattgtcat attcttttac aagaaaacgt cttacaagaa cttgttttga 19200 aagaaacagt cttgcaagat aatgacttaa ctgaaagcaa gcttcagact cttgaacaaa 19260 acaatccagt agctgatctg cgcactaatg gttactttgc atcgagcgag ttagcattaa 19320 tcatagtaca aggtaatgac gaagcacaat tacgctgtga attagaaact attacagggc 19380 agttaagtac tactggcata agtactatca gtattaaaca gatcgcagca gactgttatg 19440 cccgtaatga tactaacaaa gcctatagcg cagtgcttat tgccgagact gctgaagagt 19500 taagcaaaga aataaccttg gcgtttgctg gtatcgctag cgtgtttaat gaagatgcta 19560 aagaatggaa aaccccgaag ggcagttatt ttaccgcgca gcctgcaaat aaacaggctg 19620 ctaacagcac acagaatggt gtcaccttca tgtacccagg tattggtgct acatatgttg 19680 gtttagggcg tgatctattt catctattcc cacagattta tcagcctgta gcggctttag 19740 ccgatgacat tggcgaaagt ctaaaagata ctttacttaa tccacgcagt attagtcgtc 19800 atagctttaa agaactcaag cagttggatc tggacctgcg cggtaactta gccaatatcg 19860 ctgaagccgg tgtgggtttt gcttgtgtgt ttaccaaggt atttgaagaa gtctttgccg 19920 ttaaagctga ctttgctaca ggttatagca tgggtgaagt aagcatgtat gcagcactag 19980 gctgctggca gcaaccggga ttgatgagtg ctcgccttgc acaatcgaat acctttaatc 20040 atcaactttg cggcgagtta agaacactac gtcagcattg gggcatggat gatgtagcta 20100 acggtacgtt cgagcagatc tgggaaacct ataccattaa ggcaacgatt gaacaggtcg 20160 aaattgcctc tgcagatgaa gatcgtgtgt attgcaccat tatcaataca cctgatagct 20220 tgttgttagc cggttatcca gaagcctgtc agcgagtcat taagaattta ggtgtgcgtg 20280 caatggcatt gaatatggcg aacgcaattc acagcgcgcc agcttatgcc gaatacgatc 20340 atatggttga gctataccat atggatgtta ctccacgtat taataccaag atgtattcaa 20400 gctcatgtta tttaccgatt ccacaacgca gcaaagcgat ttcccacagt attgctaaat 20460 gtttgtgtga tgtggtggat ttcccacgtt tggttaatac cttacatgac aaaggtgcgc 20520 gggtattcat tgaaatgggt ccaggtcgtt cgttatgtag ctgggtagat aagatcttag 20580 ttaatggcga tggcgataat aaaaagcaaa gccaacatgt atctgttcct gtgaatgcca 20640 aaggcaccag tgatgaactt acttatattc gtgcgattgc taagttaatt agtcatggcg 20700 tgaatttgaa tttagatagc tagtttaacg ggtcaatcct ggttaaagca ggccatatag 20760 caaacacgaa caaatagtca acatcgatat ctagcgctgg tgagttatac ctcattagtt 20820 gaaatatgga tttaaagaga gtaattatgg aaaatattgc agtagtaggt attgctaatt 20880 tgttcccggg ctcacaagca ccggatcaat tttggcagca attgcttgaa caacaagatt 20940 gccgcagtaa ggcgaccgct gttcaaatgg gcgttgatcc tgctaaatat accgccaaca 21000 aaggtgacac agataaattt tactgtgtgc acggcggtta catcagtgat ttcaattttg 21060 atgcttcagg ttatcaactc gataatgatt atttagccgg tttagatgac cttaatcaat 21120 gggggcttta tgttacgaaa caagccctta ccgatgcggg ttattggggc agtactgcac 21180 tagaaaactg tggtgtgatt ttaggtaatt tgtcattccc aactaaatca tctaatcagc 21240 tgtttatgcc tttgtatcat caagttgttg ataatgcctt aaaggcggta ttacatcctg 21300 attttcaatt aacgcattac acagcaccga aaaaaacaca tgctgacaat gcattagtag 21360 caggttatcc agctgcattg atcgcgcaag cggcgggtct tggtggttca cattttgcac 21420 tggatgcggc ttgtgcttca tcttgttata gcgttaagtt agcgtgtgat tacctgcata 21480 cgggtaaagc caacatgatg cttgctggtg cggtatctgc agcagatcct atgttcgtaa 21540 atatgggttt ctcgatattc caagcttacc cagctaacaa tgtacatgcc ccgtttgacc 21600 aaaattcaca aggtctattt gccggtgaag gcgcgggcat gatggtattg aaacgtcaaa 21660 gtgatgcagt acgtgatggt gatcatattt acgccattat taaaggcggc gcattatcga 21720 atgacggtaa aggcgagttt gtattaagcc cgaacaccaa gggccaagta ttagtatatg 21780 aacgtgctta tgccgatgca gatgttgacc cgagtacagt tgactatatt gaatgtcatg 21840 caacgggcac acctaagggt gacaatgttg aattgcgttc gatggaaacc tttttcagtc 21900 gcgtaaataa caaaccatta ctgggctcgg ttaaatctaa ccttggtcat ttgttaactg 21960 ccgctggtat gcctggcatg accaaagcta tgttagcgct aggtaaaggt cttattcctg 22020 caacgattaa cttaaagcaa ccactgcaat ctaaaaacgg ttactttact ggcgagcaaa 22080 tgccaacgac gactgtgtct tggccaacaa ctccgggtgc caaggcagat aaaccgcgta 22140 ccgcaggtgt gagcgtattt ggttttggtg gcagcaacgc ccatttggta ttacaacagc 22200 caacgcaaac actcgagact aattttagtg ttgctaaacc acgtgagcct ttggctatta 22260 ttggtatgga cagccatttt ggtagtgcca gtaatttagc gcagttcaaa accttattaa 22320 ataataatca aaataccttc cgtgaattac cagaacaacg ctggaaaggc atggaaagta 22380 acgctaacgt catgcagtcg ttacaattac gcaaagcgcc taaaggcagt tacgttgaac 22440 agctagatat tgatttcttg cgttttaaag taccgcctaa tgaaaaagat tgcttgatcc 22500 cgcaacagtt aatgatgatg caagtggcag acaatgctgc gaaagacgga ggtctagttg 22560 aaggtcgtaa tgttgcggta ttagtagcga tgggcatgga actggaatta catcagtatc 22620 gtggtcgcgt taatctaacc acccaaattg aagacagctt attacagcaa ggtattaacc 22680 tgactgttga gcaacgtgaa gaactgacca atattgctaa agacggtgtt gcctcggctg 22740 cacagctaaa tcagtatacg agtttcattg gtaatattat ggcgtcacgt atttcggcgt 22800 tatgggattt ttctggtcct gctattaccg tatcggctga agaaaactct gtttatcgtt 22860 gtgttgaatt agctgaaaat ctatttcaaa ccagtgatgt tgaagccgtt attattgctg 22920 ctgttgattt gtctggttca attgaaaaca ttactttacg tcagcactac ggtccagtta 22980 atgaaaaggg atctgtaagt gaatgtggtc cggttaatga aagcagttca gtaaccaaca 23040 atattcttga tcagcaacaa tggctggtgg gtgaaggcgc agcggctatt gtcgttaaac 23100 cgtcatcgca agtcactgct gaacaagttt atgcgcgtat tgatgcggtg agttttgccc 23160 ctggtagcaa tgcgaaagca attacgattg cagcggataa agcattaaca cttgctggta 23220 tcagtgctgc tgatgtagct agtgttgaag cacatgcaag tggttttagt gccgaaaata 23280 atgctgaaaa aaccgcgtta ccgactttat acccaagcgc aagtatcagt tcggtgaaag 23340 ccaatattgg tcatacgttt aatgcctcgg gtatggcgag tattattaaa acggcgctgc 23400 tgttagatca gaatacgagt caagatcaga aaagcaaaca tattgctatt aacggtctag 23460 gtcgtgataa cagctgcgcg catcttatct tatcgagttc agcgcaagcg catcaagttg 23520 caccagcgcc tgtatctggt atggccaagc aacgcccaca gttagttaaa accatcaaac 23580 tcggtggtca gttaattagc aacgcgattg ttaacagtgc gagttcatct ttacacgcta 23640 ttaaagcgca gtttgccggt aagcacttaa acaaagttaa ccagccagtg atgatggata 23700 acctgaagcc ccaaggtatt agcgctcatg caaccaatga gtatgtggtg actggagctg 23760 ctaacactca agcttctaac attcaagcat ctcatgttca agcgtcaagt catgcacaag 23820 agatagcacc aaaccaagtt caaaatatgc aagctacagc agccgctgta agttcacccc 23880 tttctcaaca tcaacacaca gcgcagcccg tagcggcacc gagcgttgtt ggagtgactg 23940 tgaaacataa agcaagtaac caaattcatc agcaagcgtc tacgcataaa gcatttttag 24000 aaagtcgttt agctgcacag aaaaacctat cgcaacttgt tgaattgcaa accaagctgt 24060 caatccaaac tggtagtgac aatacatcta acaatactgc gtcaacaagc aatacagtgc 24120 taacaaatcc tgtatcagca acgccattaa cacttgtgta taatgcgcct gtagtagcga 24180 caaacctaac cagtacagaa gcaaaagcgc aagcagctgc tacacaagct ggttttcaga 24240 taaaaggacc tgttggttac aactatccac cgctgcagtt aattgaacgt tataataaac 24300 cagaaaacgt gatttacgat caagctgatt tggttgaatt cgctgaaggt gatattggta 24360 aggtatttgg tgctgaatac aatattattg atggctattc gcgtcgtgta cgtctgccaa 24420 cctcagatta cttgttagta acacgtgtta ctgaacttga tgccaaggtg catgaataca 24480 agaaatcata catgtgtact gaatatgatg tgcctgttga tgcaccgttc ttaattgatg 24540 gtcagatccc ttggtctgtt gccgtcgaat caggccagtg tgatttgatg ttgatttcat 24600 atatcggtat tgatttccaa gcgaaaggcg aacgtgttta ccgtttactt gattgtgaat 24660 taactttcct tgaagagatg gcttttggtg gcgatacttt acgttacgag atccacattg 24720 attcgtatgc acgtaacggc gagcaattat tattcttctt ccattacgat tgttacgtag 24780 gggataagaa ggtacttatc atgcgtaatg gttgtgctgg tttctttact gacgaagaac 24840 tttctgatgg taaaggcgtt attcataacg acaaagacaa agctgagttt agcaatgctg 24900 ttaaatcatc attcacgccg ttattacaac ataaccgtgg tcaatacgat tataacgaca 24960 tgatgaagtt ggttaatggt gatgttgcca gttgttttgg tccgcaatat gatcaaggtg 25020 gccgtaatcc atcattgaaa ttctcgtctg agaagttctt gatgattgaa cgtattacca 25080 agatagaccc aaccggtggt cattggggac taggcctgtt agaaggtcag aaagatttag 25140 accctgagca ttggtatttc ccttgtcact ttaaaggtga tcaagtaatg gctggttcgt 25200 tgatgtcgga aggttgtggc caaatggcga tgttcttcat gctgtctctt ggtatgcata 25260 ccaatgtgaa caacgctcgt ttccaaccac taccaggtga atcacaaacg gtacgttgtc 25320 gtgggcaagt actgccacag cgcaatacct taacttaccg tatggaagtt actgcgatgg 25380 gtatgcatcc acagccattc atgaaagcta atattgatat tttgcttgac ggtaaagtgg 25440 ttgttgattt caaaaacttg agcgtgatga tcagcgaaca agatgagcat tcagattacc 25500 ctgtaacact gccgagtaat gtggcgctta aagcgattac tgcacctgtt gcgtcagtag 25560 caccagcatc ttcacccgct aacagcgcgg atctagacga acgtggtgtt gaaccgttta 25620 agtttcctga acgtccgtta atgcgtgttg agtcagactt gtctgcaccg aaaagcaaag 25680 gtgtgacacc gattaagcat tttgaagcgc ctgctgttgc tggtcatcat agagtgccta 25740 accaagcacc gtttacacct tggcatatgt ttgagtttgc gacgggtaat atttctaact 25800 gtttcggtcc tgattttgat gtttatgaag gtcgtattcc acctcgtaca ccttgtggcg 25860 atttacaagt tgttactcag gttgtagaag tgcagggcga acgtcttgat cttaaaaatc 25920 catcaagctg tgtagctgaa tactatgtac cggaagacgc ttggtacttt actaaaaaca 25980 gccatgaaaa ctggatgcct tattcattaa tcatggaaat tgcattgcaa ccaaatggct 26040 ttatttctgg ttacatgggc acgacgctta aataccctga aaaagatctg ttcttccgta 26100 accttgatgg tagcggcacg ttattaaagc agattgattt acgcggcaag accattgtga 26160 ataaatcagt cttggttagt acggctattg ctggtggcgc gattattcaa agtttcacgt 26220 ttgatatgtc tgtagatggc gagctatttt atactggtaa agctgtattt ggttacttta 26280 gtggtgaatc actgactaac caactgggca ttgataacgg taaaacgact aatgcgtggt 26340 ttgttgataa caataccccc gcagcgaata ttgatgtgtt tgatttaact aatcagtcat 26400 tggctctgta taaagcgcct gtggataaac cgcattataa attggctggt ggtcagatga 26460 actttatcga tacagtgtca gtggttgaag gcggtggtaa agcgggcgtg gcttatgttt 26520 atggcgaacg tacgattgat gctgatgatt ggttcttccg ttatcacttc caccaagatc 26580 cggtgatgcc aggttcatta ggtgttgaag ctattattga gttgatgcag acctatgcgc 26640 ttaaaaatga tttgggtggc aagtttgcta acccacgttt cattgcgccg atgacgcaag 26700 ttgattggaa ataccgtggg caaattacgc cgctgaataa acagatgtca ctggacgtgc 26760 atatcactga gatcgtgaat gacgctggtg aagtgcgaat cgttggtgat gcgaatctgt 26820 ctaaagatgg tctgcgtatt tatgaagtta aaaacatcgt tttaagtatt gttgaagcgt 26880 aaagggtcaa gtgtaacgtg cttaagcgcc gcattggtta aagacgcttt gcacgccgtg 26940 aatccgtcca tggaggcttg gggttggcat ccatgccaac aacagcaagc ttactttaat 27000 caatacggct tggtgtccat ttagacgcct cgaacttagt agttaataga caaaataatt 27060 tagctgtgga atgaatatag taagtaatca ttcggcagct acaaaaaagg aattaagaat 27120 gtcgagttta ggttttaaca ataacaacgc aattaactgg gcttggaaag tagatccagc 27180 gtcagttcat acacaagatg cagaaattaa agcagcttta atggatctaa ctaaacctct 27240 ctatgtggcg aataattcag gcgtaactgg tatagctaat catacgtcag tagcaggtgc 27300 gatcagcaat aacatcgatg ttgatgtatt ggcgtttgcg caaaagttaa acccagaaga 27360 tctgggtgat gatgcttaca agaaacagca cggcgttaaa tatgcttatc atggcggtgc 27420 gatggcaaat ggtattgcct cggttgaatt ggttgttgcg ttaggtaaag cagggctgtt 27480 atgttcattt ggtgctgcag gtctagtgcc tgatgcggtt gaagatgcaa ttcgtcgtat 27540 tcaagctgaa ttaccaaatg gcccttatgc ggttaacttg atccatgcac cagcagaaga 27600 agcattagag cgtggcgcgg ttgaacgttt cctaaaactt ggcgtcaaga cggtagaggc 27660 ttcagcttac cttggtttaa ctgaacacat tgtttggtat cgtgctgctg gtctaactaa 27720 aaacgcagat ggcagtgtta atatcggtaa caaggttatc gctaaagtat cgcgtaccga 27780 agttggtcgc cgctttatgg aacctgcacc gcaaaaatta ctggataagt tattagaaca 27840 aaataagatc acccctgaac aagctgcttt agcgttgctt gtacctatgg ctgatgatat 27900 tactggggaa gcggattctg gtggtcatac agataaccgt ccgtttttaa cattattacc 27960 gacgattatt ggtctgcgtg atgaagtgca agcgaagtat aacttctctc ctgcattacg 28020 tgttggtgct ggtggtggta tcggaacgcc tgaagcagca ctcgctgcat ttaacatggg 28080 cgcggcttat atcgttctgg gttctgtgaa tcaggcgtgt gttgaagcgg gtgcatctga 28140 atatactcgt aaactgttat cgacagttga aatggctgat gtgactatgg cacctgctgc 28200 agatatgttt gaaatgggtg tgaagctgca agtattaaaa cgcggttcta tgttcgcgat 28260 gcgtgcgaag aaactgtatg acttgtatgt ggcttatgac tcgattgaag atatcccagc 28320 tgctgaacgt gagaagattg aaaaacaaat cttccgtgca aacctagacg agatttggga 28380 tggcactatc gctttcttta ctgaacgcga tccagaaatg ctagcccgtg caacgagtag 28440 tcctaaacgt aaaatggcac ttatcttccg ttggtatctt ggcctttctt cacgctggtc 28500 aaacacaggc gagaagggac gtgaaatgga ttatcagatt tgggcaggcc caagtttagg 28560 tgcattcaac agctgggtga aaggttctta ccttgaagac tatacccgcc gtggcgctgt 28620 agatgttgct ttgcatatgc ttaaaggtgc tgcgtattta caacgtgtaa accagttgaa 28680 attgcaaggt gttagcttaa gtacagaatt ggcaagttat cgtacgagtg attaatgtta 28740 cttgatgata tgtgaattaa ttaaagcgcc tgagggcgct ttttttggtt tttaactcag 28800 gtgttgtaac tcgaaattgc ccctttcaag ttagatcgat tactcactca caatatgttg 28860 atatcgcact tgccatatac ttgctcatcc aaagccctat attgataatg gtgttaatag 28920 tctttaatat ccgagtcttt cttcagcata atactaatat agagactcga ccaatgttaa 28980 acacaacaaa gaatatattc ttgtgtactg ccttattatt aacgagtgcg agtacgacag 29040 ctactacgct aaacaattcg atatcagcaa ttgaacaacg tatttctggt cgtatcggtg 29100 tggctgtttt agatacgcaa aataaacaaa cgtgggctta caatggtgat gcacattttc 29160 cgatgatgag tacattcaaa accctcgctt gcgcgaaaat gctaagtgaa tcgacaaatg 29220 gtaatctgga tcccagtact agctcattga taaaggctga agaattaatc ccttggtcac 29280 cagtcactaa aacgtttgtg aataacacta ttacagtggc gaaagcgtgt gaagcaacaa 29340 tgctgaccag tgataatacc gcggctaata ttgttttaca gtatatcgga ggccctcaag 29400 gcgttactgc attcttgcga gaaattggtg atgaagagag tcagttagat cgtatagaac 29460 ctgaattgaa tgaagctaag gtcggagact tgcgtgatac cacgacaccg aaagccatag 29520 ttaccacgct caacaaacta ctacttggtg atgttctact tgatttggat aaaaaccaac 29580 ttaaaacatg gatgcaaaat aataaagtgt cagatccttt actgcgttct atattaccgc 29640 aaggctggtt tattgccgac cgctcaggtg cgggtggtaa tggttctcga ggtataactg 29700 ctatgctttg gcactccgag cgtcaaccgc taatcatcag tatttattta accgaaactg 29760 agttagcaat ggcaatgcgc aatgagatta ttgttgagat cggtaagctg atattcaaag 29820 aatacgcggt gaaataataa gttatttttt gataatactt taacgagcgt agctatcgaa 29880 gtgagggcgt caattagaca cctttgcttc ccctacaaaa tctaatgtgt attacctcgg 29940 ctagtacaat tgccctaagt tatttctgtc cagctttggc ttagtgcaat tgcgttagcc 30000 aatgtgaaca ccaagggact ttgtcgtacc ataactacca agcgactttg tcgtttttat 30060 cttttcttag acaaacagag gttaaatgag tgacgccttc caaatcacag gaatgaatcc 30120 gcatttcaat aaaatctaac ccgtaccaac tccgtacaag ttgatcttta gttgtttaaa 30180 atctataata aattcaatta cggaattaat ccgtacaact ggaggtttta tggctactgc 30240 aagacttgat atccgtttgg atgaagaaat caaagctaag gctgagaaag catcagcttt 30300 actcggctta aaaagtttaa ccgaatacgt tgttcgctta atggacgaag attcaactaa 30360 agtagtttct gagcatgaga gtattaccgt tgaagcgaat gtattcgacc aatttatggc 30420 tgcttgtgat gaagcgaaag ccccaaataa agcattactt gaagccgctg tatttactca 30480 gaatggtgag tttaagtgag ttattccaaa cgtttcaaag aactggataa atcaaaacat 30540 gacagagcat catttgactg tggcgaaaaa gagctaaatg attttatcca aactcaagca 30600 gccaaacata tgcaagcagg tattagccgc actctggttt tacctgcttc tgcgccgtta 30660 ccaaacaaaa aatatccaat ttgctcattt tatagtatcg cgccaagctc aattagccgc 30720 gatacgttac cacaagcaat ggctaaaaag ttaccacgtt atcctatccc tgtttttctt 30780 ttggctcaac ttgccgtcca taaagagttt catgggagtg ggttaggcaa agttagctta 30840 attaaagcgt tagagtacct ttgggaaatt aactctcaca tgagagctta cgccatcgtt 30900 gttgattgtt taactgaaca agctgagtca ttctacgcta aatatggttt cgacgttctc 30960 tgcgaaataa atggtcgagt aagaatgttc atatcaatga aaacagtcaa tcagttattc 31020 acttaacagt aagagttagt ataacagttg tatgaattaa atttattata ttcggtaatc 31080 tcattgcgat cacgctagaa gtgcgagcgg gtcagaccga ggccacaata gcagccgtta 31140 cgtttagggg atgacttaaa aagataacta ctacgtcagt ggcgatccta gaggattaaa 31200 ggtttatgat tcacaacatt tatttattgt gcttaatttt ttctatccaa tatgcgcaag 31260 ctgtaaatat cactgaagta gacttttatg tcagtgatga tatccctaaa gatgttgcca 31320 aattaaagat aggtgaatcc ataacgaact ccagccttat tctaagtaac tcatctattc 31380 cactctcgcg ggagacgggt aacatatatt actcttcatc aattgctaac ttgaactatg 31440 actcgataga atttgttatg gctcaattga tggccgaaga ttccagcctt tacaagatgc 31500 tggtaaatag cgataggttg tccgtgctag taatgacatc ttcccagtcc acagtctcta 31560 tggctcgact tactcggctt attttcctaa tgttgcggtc atcgatttga attgtgactc 31620 gctaacttta gaacatgagc tcggccatct atacggagct gaacatgaag aaatatatga 31680 cgactatgtc ttctatgctg cgatatgtgg agactatacg actatcatga actctatgca 31740 gcctgaaatg aaagaaaaac aaatgataaa ggcatattca ttccctgaat taaaagtgga 31800 tggcttgcag tgcggaaatg aaaatacgaa taacaaaaag gttattttag acaatattgg 31860 tcggtttaga taggattggg atattattct cattcggctc tacttagtgc tgttattatg 31920 agtgccagtg cttctatcta cgatattggt cttaacaagt atttatctat agacgctaag 31980 gtgttatgta tttaagggat gttcaagatg aaactaggtg taaacgatgt atagttgtat 32040 aacatttttt caacggttgg aacgttcgat tctatcgggt aacaagaccg cgacgatccg 32100 cgataagtcc gatagtcatt acttagttgg tcagatgtta gatgcttgta ctcacgaaga 32160 taatcggaaa atgtgtcaaa tagaaatact gagcattgaa tatgtgacgt ttagtgaatt 32220 aaaccgtgcg cacgccaatg ctgaaggttt accgtttttg tttatgctta agtggatagt 32280 tcgaaagatt tatccgactt caaatgattt atttttcata agtttcagag ttgtaactat 32340 cgatatctta taagtcttag tgcacaaaac agaactattt atagcgctca agaaggcgat 32400 aatttgataa tgaattatcg ccttgttact attaagagac tttaaatgac tgagatataa 32460 gatatgacac ggaagaacat attgatcaca ggcgcaagtt cagggttggg ccgaggtatg 32520 gccatcgaat ttgcaaaatc aggtcataac ttagcacttt gtgcacgtag acttgataat 32580 ttagttgcac tgaaagcaga actcttagcc ctcaatcctc acatccaaat cgaaataaaa 32640 cctcttgatg tcaatgaaca tgaacaagtc ttcactgttt tccatgaatt caaagctgaa 32700 tttggtacgc ttgatcgtat tattgttaat gctggattag gcaagggtgg atccgtcggt 32760 acaggttttt tcaaagctaa tctgcaaact gcacaaacta attttattgc ggcgctcgca 32820 caatgtgaag cggcgctcga aatctttagg gcgcaaaatg ctgggcacct agtgacgatt 32880 tcttctatca gcgctgtacg aggattccgc cgtgcgttaa ctgtgtatgc agctactaaa 32940 tcggcactaa catcattaac tgaaggtatc aggattgacg tgatggatac gccaatcaaa 33000 gtgagttgta ttcatcctgg atttattcgc accgagatga atgaaaaagt aaaaacagca 33060 cctttcatga tagatgctga agcgggttgt aaagcgatag tgaaagcaat taataaagaa 33120 aaagcgaata gttatgtacc tagttaccct tgggctatta tgcacttatt actacgtgtg 33180 gcgccaacgc gtttgatccg cagaatgagt taatatcaca gacgcatcaa taaaatttta 33240 aggttctaga aatgatgaag tctcatgttt ggttcaaggc cggtgtagtc atcatatatg 33300 gctcatctat agatgcctct cctcatcgtc atcatgcaat tcaattagcg gcggtgttac 33360 ccaatcccaa gcgaatgtct gcagcaaccc cttcttctta tgtgctcagc cgtgcggcac 33420 aaatttaaga ctcggtgcga tcattaggcg gatctgttta cctgaaaaac ttataacaaa 33480 agctatcgac tgttgaattt atcctgaatg ctttaataga gtgggctggt ggcattacat 33540 gattggaaag ctgaaagaca agtcgttata tttgcaggca gtaaaattaa cactggtatg 33600 gatacttttg attctgtaaa gttcagagta tcagcccctt aacgagcttt ggtataaaca 33660 aatatgaata atcgacagcc taagaaaacc tcttcgacta tatcgacgct caacgaatta 33720 gcgacgttag caaactattc actcatggac acgctaaact gtgatcctga tgcgacagaa 33780 aacggcgacg atcacgcgcc gagacaagtc ctttacgggt cattatgttc ccgtaaaacc 33840 gactccaatc aaagaccctg aatatgtagc gcatagcaaa aatttatttt ctgaacttgg 33900 ctttgccgac agtatggctg agtccgctga ttttgtccgg atgttctctg gtgatatgtc 33960 aggggttcca gtaccaatgc gccaggtagg ttgggcgagt ggctatgcac tttccattta 34020 tggcaccgag tacacccaac agtgcccgtt ccaaactggt aacggatatg gagacggacg 34080 tgcaatttca gtgcttgaga ccctcatcaa gggtcaacgc tgggaaatgc agctgaaagg 34140 cggtggtcgt acaccatatt gccgcggcgc agacggtcgc gctgttttac ggtctagtat 34200 tcgcgagttc ttggctcaag atcacatgca tgcgctcggg gtacctacat cacggtcttt 34260 aagtctgtac gtttcaaaaa cggagacagt taagcgacct tggtactcac agggctcgcg 34320 ttcagagaat cccgacatgc ttatatctga agctgtcgct atctcgacgc gtgttgcacc 34380 gtcgttcatc cgtgttggtc aactcgaact tttcgcgcgc cgcagccgta gtaatgaaca 34440 cccgaaagcg atggaagaac tcgagaagat tgtgctgcac ttgatcgatc gtgaatacgc 34500 tgacgttatc gatacgcagc tagccactcc agaaaaaatc gtgttgctgg ctcgcgagtt 34560 tcgtggccgc cttacctcaa tggttgcgaa ttggatccgt gttggatttt gccaaggtaa 34620 ctttaacagt gataactgcg cagccggtgg ttttacactt gattatggtc cctttggttt 34680 ttgtgatgtg tttaatccgt attatcaacc ttggacgggg gggggtaatc acttctcgtt 34740 catgaaccaa ccaaatgcag cacaacgaaa tttcgatatg ttttgttcgg cgttacggcc 34800 gttactggta tctcatcagc aggatttgct cgcgtttgac gagatccaaa gtgaattttt 34860 agcagtaatg gatacgaaaa tgaaggcgat gtgggctact aaattgggtc ttattaattt 34920 gaagactgag tctgataaag cactgtgtaa cgtactcatc aaagagctac aaacactcat 34980 gatgcaagca cctgttgatt acactatttt cttccgcgaa ctatcctcaa ttcctgacga 35040 tattggccca ctgaagaaaa gtttttacag taatctatac aatgatgcag cggatgatcc 35100 agatacctta gcgttagaaa aatactggat tgagtggctc gaaaaatggc aaatgctcct 35160 taacagtact tgtgacgcga aaggtatctc gtcccgagcc agtgaggaca tcgctatgca 35220 gatgaaactc gtcaacccta aatacgtttt gcgagaatgg ttcgtgatgc cggcttatca 35280 gcaagccact gcgggtgatt attctctcat tcaagagctg caggccgtaa tgacacagcc 35340 atatgcagag cagtcgaagg agctagagga taaatactat cgattgaaac cgcttgagtt 35400 ctttgaggta ggtggattgt cccatcttag ttgctcgtcg tgaacgataa cgcgtcggta 35460 catgtgtatc gacgtatggg cgcttaattt ttattaatat tagaaacaaa aatcgccagc 35520 aaatgctggc gttttaaaga ttaatgtcaa ttattacatc atgcctatat cacgtaggag 35580 atgtggcgat aagcctttta attgaatatc taaagatttt tcttttttat cactaaataa 35640 aatgtcttta gtgtgtttaa tcagtccttt gatagaaaca gcataagctt ttgtatctaa 35700 agcttgtggg atcatattga tgtgcgctgc gtgtgccatt ttagcctcta tctgaattta 35760 ataatttatg ttttaaccag gtgatgtatt gctcatctgg tgaacatagt agcgcattaa 35820 ataaccatgc aataatgata aaaaataaca ctaagcatta gttttgataa tgcattcggc 35880 gctgtgtgac actgtttact gttttataga tattcattca ctttaattgc atataaattg 35940 aattgtttac tccaaatgta gttaaaataa gcacttgtta catcaatgca acaattatac 36000 gctgttaaaa tagccttgat ataccaatga taaataattc tgagtcttta atatttaaaa 36060 tagatgaatt taattcatta gatatactat tacgttgaat tgcgatttac atgcgcattt 36120 agtgtgtttt ttattaaatg aaaattattt tgacgatttt attaacatat ataagaaata 36180 tgtgacttag atctaagtaa acgttaattt atcgccgata aagcagtagt aagcatgttg 36240 catatcaaac cctctctata gatctcaact agcctcaatt atcatcaagt taactgtggt 36300 tttatttatt gctcgtgcgt tcagttatgc ttaaccatga gttaacttca ttctaatatt 36360 tttaacttac agtgaggggt atactctcgg ctcttagaaa tagagagcca aaacatgttt 36420 gaattcgtta ctaattcctc attgaaaaca cacctattgc ttatcaataa tggctatcaa 36480 tagtggttta ttgtttctta cgccacggct tatttttctg aaaatgtact aaatagataa 36540 attatcaata aaaacacaca tcacattaac cgatgtaaac agggaacatc cccatgtatg 36600 aaaatgaaga aaaactaacg aaagcatttg ttattgccgc cataatttgg ggcgttatag 36660 gcatgtgcat gggtttaatg gcagctctgc agctatatct accgcaattg aattttgcta 36720 atgagtatat aaatttcggg aaaataagac ccttgcatac taacgccatc atttttgggt 36780 tggtttgtaa ctttattatc ggtctgtcgt tatacatagt ggcaaaaaca tcagtcgtga 36840 atctagtatc caaaggttta tcgtggttct tgttctgggg ttggcagata acattggtaa 36900 tcggccttat ctcaatcgct ttagggtata catcaaccaa agaatacgct gaatttgagt 36960 ggccaattga tatcgctatt gtggttctct ggttaacgtt tggatatatc ttttttggaa 37020 cgctagcgaa aagaaaaaca aagcatatat ttgtttcaaa ctggttcagt ggcggtgtca 37080 ttattgttat cggcttaatt tacttgataa acaatttagc cattcccgtg tatgcattta 37140 aaggttattc aatattttct ggtgcgagtg atgcgcttgt acagtggtgg tggggacata 37200 atgcagttgg cttcttattg acagctggct ttgtaggtac caactactat ttcattccca 37260 agttagttaa tagacccatt tattcatatc gactgtcttt aattactttt tggggtctaa 37320 tcggctttta tacttgggct ggtacacacc atttactctt tacatccgtt ccatcttgga 37380 ttcaaaatat tggcgtagtg atgtctattt tattatggat cccgtcatgg gctggcgcat 37440 ttaacgcttg gatgacgtgt acttccaata aagaagaatt gaaaacaaat cccgttgtct 37500 ggtttttctt atcgtcaatt gcctattacg cattagcaac gtttgaaggg cctcttatgg 37560 ctatcagatg gttcaatatg atagctcaca ataccagttg ggttatcgga cacgttcact 37620 ctggggcgtt aggttgggtt ggcatgacgt gtatagcaac cttctactat ttcattccta 37680 agctatacaa aaaagaactc tactcatatg gcttagttaa ggtgcatttt gtactcgctc 37740 acataggcgt actgttctac atagtctccc tgtggatagg gggtataggt caaggtgtta 37800 aatcgttaag cctcactgag tctggttctc tgacttattc gtttgttgat attttacgat 37860 ttatggaacc ttatatgctc ggacgtgcaa ttggcggggc gctgtttatc ttgggtatgt 37920 tagtgatggt atataacctc atcatgacgg tgaacaaacc acaaaaagta gttattgaag 37980 gagcatatta atggaagagt caatatccaa gtcagtaatg gcttttatca ctatcacgac 38040 agtcgtggtg ttattttcat tctttgtgtg ggttttccca gggttcttct tcaccaacga 38100 tcttaaagaa ataacgacag ctaaaccata cacagcctta gagttagctg gacgggatgt 38160 gtatatggct gaaggttgcg tggcatgcca tacccagatg gttagaaact tggaaccgga 38220 aagaaaaaga tacggtcgtc ctaataaaat ggaagatgat gtttatgagt ttaacttttt 38280 gtggagctca caaagaactg gccctgattt aacgaatatt ggtttgaagt acacacaagg 38340 ctggcacaaa cagcatctca tcaatcctca ggcagttgtt ccagcctcaa tcatgccaca 38400 atatccgtgg ctgtttgaaa agcaacttaa cgttggtcat gttattgctt caatgaaagc 38460 gatgaaaaaa ctaggtgtgc cgtatacaga cacgcaaatt gaaaattcat caagcaaagt 38520 ggaaggtaaa acaaaaggtg atgcgcttgt tgcttacttg atgagtcttg gcgtagatac 38580 gcgtgaaaaa ggtggggatt taaattaatg ggatccatga acatattatc aagcgtacta 38640 tcgattatct tcttttttat catggttgcc gttatttatt cacagttccg taagaccaaa 38700 actgcagaca gtaataaaac agtagagcaa tttgatggaa tagatgaaaa agatgcacca 38760 attcctaagg ttttctttgt tgcgtatctt attgcgttta taggcgcaat tgtttacgtc 38820 cttctatacc caagtttagc ttcttggaaa gggtttatcg gttggaccga gaacgatgac 38880 gcgtatgtag ctaaatcaat tgatataaac aataacatta acgcaataat caacgcgaat 38940 accgatgaac aagtctttac gctgttacaa aaagatccgc ttgttttgca gagtggtaaa 39000 tcgttatttg gtgataattg ttctgcttgt catggtcagg atgctaaggg gcaatataac 39060 tacccgagtt tagttgataa agattggtta tacggcggct cacctcaaga tgtctatacg 39120 accatacata atggacgtaa gggtaaaatg ccagcttgga aaggtgtact gagcggtaaa 39180 gacatagatg agcttaccca gtatgtgtct gagctaaata aaggaccatt taaaagcaat 39240 gcgcttttcg atgctaattg ttcatcatgt cacggtaaag aggctcaagg ttcacatagc 39300 gtaggagccc ctaacttaac gaatgatatc tggcttcatg gttcaaccaa tgctgatatc 39360 aaacgtaata ttgagaatgg catgtataac gaaatgcctg attttggtca acgccttagc 39420 agaaatcaaa tattgtcttt aacctcttat attgtgtccc tacagagtga accacaagat 39480 aatatcgata ttatgcaagc gaacacttat atcttctctc gaaacgaaca gcaattgccg 39540 gcagtgctaa cgacttgtgt ggcctgtcat ggcgcagatg gtcttggtac tttacctgga 39600 gcgcctaagt tagcaggatt aaagcaagcg tatatctata accaattaca cttgtttgta 39660 tctggtttaa gaaaaaatgc aacgatgcaa aatatagttg ccgacttaga tgtgaaagac 39720 aagttacttg ctgctagcta tttcagttca ctcgattcac cggcgataag taaaattacc 39780 ccagagaaat cagctgacgg tatcatcaaa gatcctactg agcgcctgat atttcaaggt 39840 gattggcaac gcgctattcc tgcttgttct acttgtcatg gtcaagaaac gcaaggtagc 39900 ccatcatttc caagattggc aggtcaatca tctgactatt tagagaaaca attatttgac 39960 tggcgaacag gcgatagaac cggtgatcaa ggtcatatga tgcaaaacgt cgttaacaag 40020 ctacaagatg atgaaattaa atccctgtcg aaatatttat caaaaatgaa ataacctgtg 40080 agccagttaa aggccaatag atcgaaggtt aacagctcaa agattaatag gatactgtaa 40140 ttatgaaaat gaataagtta agaagggaaa tcattaaagc tggtggctat gtcgctttag 40200 ctgctgcacc attaacggct ttctctaaag agtttatgaa atacggcaaa atgtattcag 40260 atggtgaggg agttagctat gccgatggcc ctaagcctgt attaagcaat tttccgcaaa 40320 aagataatgt tgtgatcgta catactcgac cacctcatct tgaaacgcct tttaatgtat 40380 tcaatgaagg gctaataaca ccaaacaacc gtttctttgt tcgttatcat ctagctgacg 40440 tccccgttgc catagacact gataagtaca ctattactat ttcaggggct gttaatgagg 40500 aagtgacatt aagcttggct gaattaaagt cgattgaagg ccaacaagaa attgtcgcgg 40560 tacaacagtg tactggtaat agtcgaggtt attcatctcc acgtgttttt ggtgcgcaat 40620 taagtaatgg cgctatgggg aatgcgaagt tcaaaggcgt gccacttaaa aatgtgttag 40680 ctaaagcggg aatttctagt gctgcgacaa gtgtcattat cgatggtttg gataagccgg 40740 ttcgagatac cacaccagac tttcaaaaat cattacctat tgatcatatt atgacgggcg 40800 aacctatgct tgtttgggaa atgaatggtg aacctttacc atttttaaat ggctttccag 40860 tgaaattaat cgttccgggt tggtatgcaa catattgggt taaacatgta tcgcacctta 40920 aagttataga gggtgagttt gataactttg atgcgttctt tatgacaact gcataccgtc 40980 tacctgataa cgattccaag agtgaattac caactgccag agcgaaaaag acgttacctg 41040 taaatcgttt cccaataaga agttttgtta ctagcttaga aaatggtgat gaagttaatg 41100 ctgcaactag tattgaaatt aaagggatag cttttgatag tggtagtggt atcaaaaaag 41160 ttgaagtttc agtcgatggt ggcaataagt ggatgcaagc agcgcttggt gaaaatcttg 41220 gtcgtttttc ctttcgaggt tggaagttaa gccataattt taatgaaaaa ggcagaacgc 41280 ttgtgatggt aagagctaca ggtaagagtg gagagacaca acctcttaat gcctcttgga 41340 atcatggcgg ttataaccga aacgcgattg aacgaacaag tattaaggtg gtttaaatgc 41400 ggtttttact tattatatta gcgctatgtt cattgactgt taaagctgag atcgtatcaa 41460 ttaccttacc tatggataat accaagctta agccgtcgac attaccagga tatggcctcg 41520 cgcaatctaa atgtcacctt tgtcattcag tcgattacgt tatgtatcaa ccaccagaaa 41580 tggatcc 41587 <210> 2 <211> 7959 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(7956) <400> 2 atg gct aaa aag aac acc aca tcg att aag cac gcc aag gat gtg tta 48 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 agt agt gat gat caa cag tta aat tct cgc ttg caa gaa tgt ccg att 96 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 gcc atc att ggt atg gca tcg gtt ttt gca gat gct aaa aac ttg gat 144 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 caa ttc tgg gat aac atc gtt gac tct gtg gac gct att att gat gtg 192 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 cct agc gat cgc tgg aac att gac gac cat tac tcg gct gat aaa aaa 240 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 gca gct gac aag aca tac tgc aaa cgc ggt ggt ttc att cca gag ctt 288 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 gat ttt gat ccg atg gag ttt ggt tta ccg cca aat atc ctc gag tta 336 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 act gac atc gct caa ttg ttg tca tta att gtt gct cgt gat gta tta 384 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 agt gat gct ggc att ggt agt gat tat gac cat gat aaa att ggt atc 432 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 acg ctg ggt gtc ggt ggt ggt cag aaa caa att tcg cca tta acg tcg 480 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 cgc cta caa ggc ccg gta tta gaa aaa gta tta aaa gcc tca ggc att 528 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 gat gaa gat gat cgc gct atg atc atc gac aaa ttt aaa aaa gcc tac 576 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 atc ggc tgg gaa gag aac tca ttc cca ggc atg cta ggt aac gtt att 624 Ile Gly Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 gct ggt cgt atc gcc aat cgt ttt gat ttt ggt ggt act aac tgt gtg 672 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 gtt gat gcg gca tgc gct ggc tcc ctt gca gct gtt aaa atg gcg atc 720 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 tca gac tta ctt gaa tat cgt tca gaa gtc atg ata tcg ggt ggt gta 768 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 tgt tgt gat aac tcg cca ttc atg tat atg tca ttc tcg aaa aca cca 816 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 gca ttt acc acc aat gat gat atc cgt ccg ttt gat gac gat tca aaa 864 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 ggc atg ctg gtt ggt gaa ggt att ggc atg atg gcg ttt aaa cgt ctt 912 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 gaa gat gct gaa cgt gac ggc gac aaa att tat tct gta ctg aaa ggt 960 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 atc ggt aca tct tca gat ggt cgt ttc aaa tct att tac gct cca cgc 1008 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Ile Tyr Ala Pro Arg 325 330 335 cca gat ggc caa gca aaa gcg cta aaa cgt gct tat gaa gat gcc ggt 1056 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 ttt gcc cct gaa aca tgt ggt cta att gaa ggc cat ggt acg ggt acc 1104 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 aaa gcg ggt gat gcc gca gaa ttt gct ggc ttg acc aaa cac ttt ggc 1152 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 gcc gcc agt gat gaa aag caa tat atc gcc tta ggc tta gtt aaa tcg 1200 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 caa att ggt cat act aaa tct gcg gct ggc tct gcg ggt atg att aag 1248 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 gcg gca tta gcg ctg cat cat aaa atc tta cct gca acg atc cat atc 1296 Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 gat aaa cca agt gaa gcc ttg gat atc aaa aac agc ccg tta tac cta 1344 Asp Lys Pro Ser Glu Ala Leu Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 aac agc gaa acg cgt cct tgg atg cca cgt gaa gat ggt att cca cgt 1392 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 cgt gca ggt atc agc tca ttt ggt ttt ggc ggc acc aac ttc cat att 1440 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 480 att tta gaa gag tat cgc cca ggt cac gat agc gca tat cgc tta aac 1488 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 tca gtg agc caa act gtg ttg atc tcg gca aac gac caa caa ggt att 1536 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 gtt gct gag tta aat aac tgg cgt act aaa ctg gct gtc gat gct gat 1584 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 cat caa ggg ttt gta ttt aat gag tta gtg aca acg tgg cca tta aaa 1632 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 acc cca tcc gtt aac caa gct cgt tta ggt ttt gtt gcg cgt aat gca 1680 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 aat gaa gcg atc gcg atg att gat acg gca ttg aaa caa ttc aat gcg 1728 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys Gln Phe Asn Ala 565 570 575 aac gca gat aaa atg aca tgg tca gta cct acc ggg gtt tac tat cgt 1776 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 caa gcc ggt att gat gca aca ggt aaa gtg gtt gcg cta ttc tca ggg 1824 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 caa ggt tcg caa tac gtg aac atg ggt cgt gaa tta acc tgt aac ttc 1872 Gln Gly Ser Gln Tyr Val Asn Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 cca agc atg atg cac agt gct gcg gcg atg gat aaa gag ttc agt gcc 1920 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 gct ggt tta ggc cag tta tct gca gtt act ttc cct atc cct gtt tat 1968 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 acg gat gcc gag cgt aag cta caa gaa gag caa tta cgt tta acg caa 2016 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 cat gcg caa cca gcg att ggt agt ttg agt gtt ggt ctg ttc aaa acg 2064 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 ttt aag caa gca ggt ttt aaa gct gat ttt gct gcc ggt cat agt ttc 2112 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 ggt gag tta acc gca tta tgg gct gcc gat gta ttg agc gaa agc gat 2160 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 tac atg atg tta gcg cgt agt cgt ggt caa gca atg gct gcg cca gag 2208 Tyr Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 caa caa gat ttt gat gca ggt aag atg gcc gct gtt gtt ggt gat cca 2256 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 aag caa gtc gct gtg atc att gat acc ctt gat gat gtc tct att gct 2304 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 aac ttc aac tcg aat aac caa gtt gtt att gct ggt act acg gag cag 2352 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 gtt gct gta gcg gtt aca acc tta ggt aat gct ggt ttc aaa gtt gtg 2400 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 cca ctg ccg gta tct gct gcg ttc cat aca cct tta gtt cgt cac gcg 2448 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 caa aaa cca ttt gct aaa gcg gtt gat agc gct aaa ttt aaa gcg cca 2496 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 agc att cca gtg ttt gct aat ggc aca ggc ttg gtg cat tca agc aaa 2544 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 ccg aat gac att aag aaa aac ctg aaa aac cac atg ctg gaa tct gtt 2592 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 cat ttc aat caa gaa att gac aac atc tat gct gat ggt ggc cgc gta 2640 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 ttt atc gaa ttt ggt cca aag aat gta tta act aaa ttg gtt gaa aac 2688 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 att ctc act gaa aaa tct gat gtg act gct atc gcg gtt aat gct aat 2736 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 cct aaa caa cct gcg gac gta caa atg cgc caa gct gcg ctg caa atg 2784 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 gca gtg ctt ggt gtc gca tta gac aat att gac ccg tac gac gcc gtt 2832 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 aag cgt cca ctt gtt gcg ccg aaa gca tca cca atg ttg atg aag tta 2880 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 tct gca gcg tct tat gtt agt ccg aaa acg aag aaa gcg ttt gct gat 2928 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 gca ttg act gat ggc tgg act gtt aag caa gcg aaa gct gta cct gct 2976 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 980 985 990 gtt gtg tca caa cca caa gtg att gaa aag atc gtt gaa gtt gaa aag 3024 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 ata gtt gaa cgc att gtc gaa gta gag cgt att gtc gaa gta gaa aaa 3072 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 atc gtc tac gtt aat gct gac ggt tcg ctt ata tcg caa aat aat caa 3120 Ile Val Tyr Val Asn Ala Asp Gly Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 gac gtt aac agc gct gtt gtt agc aac gtg act aat agc tca gtg act 3168 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 cat agc agt gat gct gac ctt gtt gcc tct att gaa cgc agt gtt ggt 3216 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 caa ttt gtt gca cac caa cag caa tta tta aat gta cat gaa cag ttt 3264 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 atg caa ggt cca caa gac tac gcg aaa aca gtg cag aac gta ctt gct 3312 Met Gln Gly Pro Gln Asp Tyr Ala Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 gcg cag acg agc aat gaa tta ccg gaa agt tta gac cgt aca ttg tct 3360 Ala Gln Thr Ser Asn Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 atg tat aac gag ttc caa tca gaa acg cta cgt gta cat gaa acg tac 3408 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 ctg aac aat cag acg agc aac atg aac acc atg ctt act ggt gct gaa 3456 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 gct gat gtg cta gca acc cca ata act cag gta gtg aat aca gcc gtt 3504 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 gcc act agt cac aag gta gtt gct cca gtt att gct aat aca gtg acg 3552 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr 1170 1175 1180 aat gtt gta tct agt gtc agt aat aac gcg gcg gtt gca gtg caa act 3600 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 gtg gca tta gcg cct acg caa gaa atc gct cca aca gtc gct act acg 3648 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 cca gca ccc gca ttg gtt gct atc gtg gct gaa cct gtg att gtt gcg 3696 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 cat gtt gct aca gaa gtt gca cca att aca cca tca gtt aca cca gtt 3744 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 gtc gca act caa gcg gct atc gat gta gca act att aac aaa gta atg 3792 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 tta gaa gtt gtt gct gat aaa acc ggt tat cca acg gat atg ctg gaa 3840 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 ctg agc atg gac atg gaa gct gac tta ggt atc gac tca atc aaa cgt 3888 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 gtt gag ata tta ggc gca gta cag gaa ttg atc cct gac tta cct gaa 3936 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 ctt aat cct gaa gat ctt gct gag cta cgc acg ctt ggt gag att gtc 3984 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 gat tac atg aat tca aaa gcc cag gct gta gct cct aca aca gta cct 4032 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 gta aca agt gca cct gtt tcg cct gca tct gct ggt att gat tta gcc 4080 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 cac atc caa aac gta atg tta gaa gtg gtt gca gac aaa acc ggt tac 4128 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 cca aca gac atg cta gaa ctg agc atg gat atg gaa gct gac tta ggt 4176 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 att gat tca atc aag cgt gtg gaa atc tta ggt gca gta cag gag atc 4224 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 ata act gat tta cct gag cta aac cct gaa gat ctt gtt gaa tta cgc 4272 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 acc cta ggt gaa atc gtt agt tac atg caa agc aaa gcg cca gtc gct 4320 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 gaa agt gcg cca gtg gcg acg gct cct gta gca aca agc tca gca ccg 4368 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 tct atc gat ttg aac cac att caa aca gtg atg atg gat gta gtt gca 4416 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 gat aag act ggt tat cca act gac atg cta gaa ctt ggc atg gac atg 4464 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 gaa gct gat tta ggt atc gat tca atc aaa cgt gtg gaa ata tta ggc 4512 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 gca gtg cag gag atc atc act gat tta cct gag cta aac cca gaa gac 4560 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 ctc gct gaa tta cgc acg cta ggt gaa atc gtt agt tac atg caa agc 4608 Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 aaa gcg cca gtc gct gag agt gcg cca gta gcg acg gct tct gta gca 4656 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 aca agc tct gca ccg tct atc gat tta aac cat atc caa aca gtg atg 4704 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 atg gaa gtg gtt gca gac aaa acc ggt tat cca gta gac atg tta gaa 4752 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 ctt gct atg gac atg gaa gct gac cta ggt atc gat tca atc aag cgt 4800 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 gta gaa att tta ggt gcg gta cag gaa atc att act gac tta cct gag 4848 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 ctt aac cct gaa gat ctt gct gaa cta cgt aca tta ggt gaa atc gtt 4896 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 agt tac atg caa agc aaa gcg ccc gta gct gaa gcg cct gca gta cct 4944 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 gtt gca gta gaa agt gca cct act agt gta aca agc tca gca ccg tct 4992 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 atc gat tta gac cac atc caa aat gta atg atg gat gtt gtt gct gat 5040 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 aag act ggt tat cct gcc aat atg ctt gaa tta gca atg gac atg gaa 5088 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 gcc gac ctt ggt att gat tca atc aag cgt gtt gaa att cta ggc gcg 5136 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 gta cag gag atc att act gat tta cct gaa cta aac cca gaa gac tta 5184 Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 gct gaa cta cgt acg tta gaa gaa att gta acc tac atg caa agc aag 5232 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 1730 1735 1740 gcg agt ggt gtt act gta aat gta gtg gct agc cct gaa aat aat gct 5280 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 gta tca gat gca ttt atg caa agc aat gtg gcg act atc aca gcg gcc 5328 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 gca gaa cat aag gcg gaa ttt aaa ccg gcg ccg agc gca acc gtt gct 5376 Ala Glu His Lys Ala Glu Phe Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 atc tct cgt cta agc tct atc agt aaa ata agc caa gat tgt aaa ggt 5424 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 gct aac gcc tta atc gta gct gat ggc act gat aat gct gtg tta ctt 5472 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 gca gac cac cta ttg caa act ggc tgg aat gta act gca ttg caa cca 5520 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 act tgg gta gct gta aca acg acg aaa gca ttt aat aag tca gtg aac 5568 Thr Trp Val Ala Val Thr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 ctg gtg act tta aat ggc gtt gat gaa act gaa atc aac aac att att 5616 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 act gct aac gca caa ttg gat gca gtt atc tat ctg cac gca agt agc 5664 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 gaa att aat gct atc gaa tac cca caa gca tct aag caa ggc ctg atg 5712 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 tta gcc ttc tta tta gcg aaa ttg agt aaa gta act caa gcc gct aaa 5760 Leu Ala Phe Leu Leu Ala Lys Leu Ser Lys Val Thr Gln Ala Ala Lys 1905 1910 1915 1920 gtg cgt ggc gcc ttt atg att gtt act cag cag ggt ggt tca tta ggt 5808 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 ttt gat gat atc gat tct gct aca agt cat gat gtg aaa aca gac cta 5856 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 gta caa agc ggc tta aac ggt tta gtt aag aca ctg tct cac gag tgg 5904 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 gat aac gta ttc tgt cgt gcg gtt gat att gct tcg tca tta acg gct 5952 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 gaa caa gtt gca agc ctt gtt agt gat gaa cta ctt gat gct aac act 6000 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 gta tta aca gaa gtg ggt tat caa caa gct ggt aaa ggc ctt gaa cgt 6048 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 atc acg tta act ggt gtg gct act gac agc tat gca tta aca gct ggc 6096 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 aat aac atc gat gct aac tcg gta ttt tta gtg agt ggt ggc gca aaa 6144 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 ggt gta act gca cat tgt gtt gct cgt ata gct aaa gaa tat cag tct 6192 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 aag ttc atc tta ttg gga cgt tca acg ttc tca agt gac gaa ccg agc 6240 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 tgg gca agt ggt att act gat gaa gcg gcg tta aag aaa gca gcg atg 6288 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 cag tct ttg att aca gca ggt gat aaa cca aca ccc gtt aag atc gta 6336 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 cag cta atc aaa cca atc caa gct aat cgt gaa att gcg caa acc ttg 6384 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 tct gca att acc gct gct ggt ggc caa gct gaa tat gtt tct gca gat 6432 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 gta act aat gca gca agc gta caa atg gca gtc gct cca gct atc gct 6480 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 aag ttc ggt gca atc act ggc atc att cat ggc gcg ggt gtg tta gct 6528 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 gac caa ttc att gag caa aaa aca ctg agt gat ttt gag tct gtt tac 6576 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 agc act aaa att gac ggt ttg tta tcg cta cta tca gtc act gaa gca 6624 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 agc aac atc aag caa ttg gta ttg ttc tcg tca gcg gct ggt ttc tac 6672 Ser Asn Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 ggt aac ccc ggc cag tct gat tac tcg att gcc aat gag atc tta aat 6720 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 aaa acc gca tac cgc ttt aaa tca ttg cac cca caa gct caa gta ttg 6768 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 agc ttt aac tgg ggt cct tgg gac ggt ggc atg gta acg cct gag ctt 6816 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 aaa cgt atg ttt gac caa cgt ggt gtt tac att att cca ctt gat gca 6864 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 ggt gca cag tta ttg ctg aat gaa cta gcc gct aat gat aac cgt tgt 6912 Gly Ala Gln Leu Leu Leu Asn Glu Leu Ala Ala Asn Asp Asn Arg Cys 2290 2295 2300 cca caa atc ctc gtg ggt aat gac tta tct aaa gat gct agc tct gat 6960 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 caa aag tct gat gaa aag agt act gct gta aaa aag cca caa gtt agt 7008 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 cgt tta tca gat gct tta gta act aaa agt atc aaa gcg act aac agt 7056 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 agc tct tta tca aac aag act agt gct tta tca gac agt agt gct ttt 7104 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 cag gtt aac gaa aac cac ttt tta gct gac cac atg atc aaa ggc aat 7152 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 cag gta tta cca acg gta tgc gcg att gct tgg atg agt gat gca gca 7200 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 aaa gcg act tat agt aac cga gac tgt gca ttg aag tat gtc ggt ttc 7248 Lys Ala Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 gaa gac tat aaa ttg ttt aaa ggt gtg gtt ttt gat ggc aat gag gcg 7296 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu Ala 2420 2425 2430 gcg gat tac caa atc caa ttg tcg cct gtg aca agg gcg tca gaa cag 7344 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 gat tct gaa gtc cgt att gcc gca aag atc ttt agc ctg aaa agt gac 7392 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 ggt aaa cct gtg ttt cat tat gca gcg aca ata ttg tta gca act cag 7440 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 cca ctt aat gct gtg aag gta gaa ctt ccg aca ttg aca gaa agt gtt 7488 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Ser Val 2485 2490 2495 gat agc aac aat aaa gta act gat gaa gca caa gcg tta tac agc aat 7536 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 ggc acc ttg ttc cac ggt gaa agt ctg cag ggc att aag cag ata tta 7584 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 agt tgt gac gac aag ggc ctg cta ttg gct tgt cag ata acc gat gtt 7632 Ser Cys Asp Asp Lys Gly Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 gca aca gct aag cag gga tcc ttc ccg tta gct gac aac aat atc ttt 7680 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 gcc aat gat ttg gtt tat cag gct atg ttg gtc tgg gtg cgc aaa caa 7728 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 ttt ggt tta ggt agc tta cct tcg gtg aca acg gct tgg act gtg tat 7776 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 cgt gaa gtg gtt gta gat gaa gta ttt tat ctg caa ctt aat gtt gtt 7824 Arg Glu Val Val Val Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 gag cat gat cta ttg ggt tca cgc ggc agt aaa gcc cgt tgt gat att 7872 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 caa ttg att gct gct gat atg caa tta ctt gcc gaa gtg aaa tca gcg 7920 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 caa gtc agt gtc agt gac att ttg aac gat atg tca tga 7959 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 3 <211> 2652 <212> PRT <213> Moritella marina <400> 3 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 Ile Gly Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Ile Tyr Ala Pro Arg 325 330 335 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 Asp Lys Pro Ser Glu Ala Leu Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 480 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys Gln Phe Asn Ala 565 570 575 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 Gln Gly Ser Gln Tyr Val Asn Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 Tyr Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 980 985 990 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 Ile Val Tyr Val Asn Ala Asp Gly Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 Met Gln Gly Pro Gln Asp Tyr Ala Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 Ala Gln Thr Ser Asn Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr 1170 1175 1180 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 1730 1735 1740 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 Ala Glu His Lys Ala Glu Phe Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 Thr Trp Val Ala Val Thr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 Leu Ala Phe Leu Leu Ala Lys Leu Ser Lys Val Thr Gln Ala Ala Lys 1905 1910 1915 1920 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 Ser Asn Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 Gly Ala Gln Leu Leu Leu Asn Glu Leu Ala Ala Asn Asp Asn Arg Cys 2290 2295 2300 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 Lys Ala Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu Ala 2420 2425 2430 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Ser Val 2485 2490 2495 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 Ser Cys Asp Asp Lys Gly Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 Arg Glu Val Val Val Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 4 <211> 2598 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(2595) <400> 4 atg acg gaa tta gct gtt att ggt atg gat gct aaa ttt agc gga caa 48 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 gac aat att gac cgt gtg gaa cgc gct ttc tat gaa ggt gct tat gta 96 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Gly Ala Tyr Val 20 25 30 ggt aat gtt agc cgc gtt agt acc gaa tct aat gtt att agc aat ggc 144 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 gaa gaa caa gtt att act gcc atg aca gtt ctt aac tct gtc agt cta 192 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 cta gcg caa acg aat cag tta aat ata gct gat atc gcg gtg ttg ctg 240 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 att gct gat gta aaa agt gct gat gat cag ctt gta gtc caa att gca 288 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 tca gca att gaa aaa cag tgt gcg agt tgt gtt gtt att gct gat tta 336 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 ggc caa gca tta aat caa gta gct gat tta gtt aat aac caa gac tgt 384 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 cct gtg gct gta att ggc atg aat aac tcg gtt aat tta tct cgt cat 432 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 gat ctt gaa tct gta act gca aca atc agc ttt gat gaa acc ttc aat 480 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 ggt tat aac aat gta gct ggg ttc gcg agt tta ctt atc gct tca act 528 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 gcg ttt gcc aat gct aag caa tgt tat ata tac gcc aac att aag ggc 576 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 ttc gct caa tcg ggc gta aat gct caa ttt aac gtt gga aac att agc 624 Phe Ala Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 gat act gca aag acc gca ttg cag caa gct agc ata act gca gag cag 672 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln 210 215 220 gtt ggt ttg tta gaa gtg tca gca gtc gct gat tcg gca atc gca ttg 720 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 tct gaa agc caa ggt tta atg tct gct tat cat cat acg caa act ttg 768 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 cat act gca tta agc agt gcc cgt agt gtg act ggt gaa ggc ggg tgt 816 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 ttt tca cag gtc gca ggt tta ttg aaa tgt gta att ggt tta cat caa 864 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 cgt tat att ccg gcg att aaa gat tgg caa caa ccg agt gac aat caa 912 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 atg tca cgg tgg cgg aat tca cca ttc tat atg cct gta gat gct cga 960 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 cct tgg ttc cca cat gct gat ggc tct gca cac att gcc gct tat agt 1008 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Ile Ala Ala Tyr Ser 325 330 335 tgt gtg act gct gac agc tat tgt cat att ctt tta caa gaa aac gtc 1056 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 tta caa gaa ctt gtt ttg aaa gaa aca gtc ttg caa gat aat gac tta 1104 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 act gaa agc aag ctt cag act ctt gaa caa aac aat cca gta gct gat 1152 Thr Glu Ser Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 ctg cgc act aat ggt tac ttt gca tcg agc gag tta gca tta atc ata 1200 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 gta caa ggt aat gac gaa gca caa tta cgc tgt gaa tta gaa act att 1248 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 aca ggg cag tta agt act act ggc ata agt act atc agt att aaa cag 1296 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 atc gca gca gac tgt tat gcc cgt aat gat act aac aaa gcc tat agc 1344 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445 gca gtg ctt att gcc gag act gct gaa gag tta agc aaa gaa ata acc 1392 Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 ttg gcg ttt gct ggt atc gct agc gtg ttt aat gaa gat gct aaa gaa 1440 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 480 tgg aaa acc ccg aag ggc agt tat ttt acc gcg cag cct gca aat aaa 1488 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 cag gct gct aac agc aca cag aat ggt gtc acc ttc atg tac cca ggt 1536 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 att ggt gct aca tat gtt ggt tta ggg cgt gat cta ttt cat cta ttc 1584 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 cca cag att tat cag cct gta gcg gct tta gcc gat gac att ggc gaa 1632 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 agt cta aaa gat act tta ctt aat cca cgc agt att agt cgt cat agc 1680 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 ttt aaa gaa ctc aag cag ttg gat ctg gac ctg cgc ggt aac tta gcc 1728 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Leu Ala 565 570 575 aat atc gct gaa gcc ggt gtg ggt ttt gct tgt gtg ttt acc aag gta 1776 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 ttt gaa gaa gtc ttt gcc gtt aaa gct gac ttt gct aca ggt tat agc 1824 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 atg ggt gaa gta agc atg tat gca gca cta ggc tgc tgg cag caa ccg 1872 Met Gly Glu Val Ser Met Tyr Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 gga ttg atg agt gct cgc ctt gca caa tcg aat acc ttt aat cat caa 1920 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 635 640 ctt tgc ggc gag tta aga aca cta cgt cag cat tgg ggc atg gat gat 1968 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 gta gct aac ggt acg ttc gag cag atc tgg gaa acc tat acc att aag 2016 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 gca acg att gaa cag gtc gaa att gcc tct gca gat gaa gat cgt gtg 2064 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 tat tgc acc att atc aat aca cct gat agc ttg ttg tta gcc ggt tat 2112 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 cca gaa gcc tgt cag cga gtc att aag aat tta ggt gtg cgt gca atg 2160 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 gca ttg aat atg gcg aac gca att cac agc gcg cca gct tat gcc gaa 2208 Ala Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 tac gat cat atg gtt gag cta tac cat atg gat gtt act cca cgt att 2256 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 aat acc aag atg tat tca agc tca tgt tat tta ccg att cca caa cgc 2304 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 agc aaa gcg att tcc cac agt att gct aaa tgt ttg tgt gat gtg gtg 2352 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 gat ttc cca cgt ttg gtt aat acc tta cat gac aaa ggt gcg cgg gta 2400 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 ttc att gaa atg ggt cca ggt cgt tcg tta tgt agc tgg gta gat aag 2448 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 atc tta gtt aat ggc gat ggc gat aat aaa aag caa agc caa cat gta 2496 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 tct gtt cct gtg aat gcc aaa ggc acc agt gat gaa ctt act tat att 2544 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 cgt gcg att gct aag tta att agt cat ggc gtg aat ttg aat tta gat 2592 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 agc tag 2598 Ser 865 <210> 5 <211> 865 <212> PRT <213> Moritella marina <400> 5 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Gly Ala Tyr Val 20 25 30 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 Phe Ala Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln 210 215 220 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Ile Ala Ala Tyr Ser 325 330 335 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 Thr Glu Ser Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445 Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 480 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Leu Ala 565 570 575 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 Met Gly Glu Val Ser Met Tyr Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 635 640 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 Ala Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 Ser 865 <210> 6 <211> 6036 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(6033) <400> 6 atg gaa aat att gca gta gta ggt att gct aat ttg ttc ccg ggc tca 48 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 caa gca ccg gat caa ttt tgg cag caa ttg ctt gaa caa caa gat tgc 96 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 cgc agt aag gcg acc gct gtt caa atg ggc gtt gat cct gct aaa tat 144 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 acc gcc aac aaa ggt gac aca gat aaa ttt tac tgt gtg cac ggc ggt 192 Thr Ala Asn Lys Gly Asp Thr Asp Lys Phe Tyr Cys Val His Gly Gly 50 55 60 tac atc agt gat ttc aat ttt gat gct tca ggt tat caa ctc gat aat 240 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 gat tat tta gcc ggt tta gat gac ctt aat caa tgg ggg ctt tat gtt 288 Asp Tyr Leu Ala Gly Leu Asp Asp Leu Asn Gln Trp Gly Leu Tyr Val 85 90 95 acg aaa caa gcc ctt acc gat gcg ggt tat tgg ggc agt act gca cta 336 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 gaa aac tgt ggt gtg att tta ggt aat ttg tca ttc cca act aaa tca 384 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 tct aat cag ctg ttt atg cct ttg tat cat caa gtt gtt gat aat gcc 432 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 tta aag gcg gta tta cat cct gat ttt caa tta acg cat tac aca gca 480 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 ccg aaa aaa aca cat gct gac aat gca tta gta gca ggt tat cca gct 528 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 gca ttg atc gcg caa gcg gcg ggt ctt ggt ggt tca cat ttt gca ctg 576 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 gat gcg gct tgt gct tca tct tgt tat agc gtt aag tta gcg tgt gat 624 Asp Ala Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 tac ctg cat acg ggt aaa gcc aac atg atg ctt gct ggt gcg gta tct 672 Tyr Leu His Thr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 gca gca gat cct atg ttc gta aat atg ggt ttc tcg ata ttc caa gct 720 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 tac cca gct aac aat gta cat gcc ccg ttt gac caa aat tca caa ggt 768 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 cta ttt gcc ggt gaa ggc gcg ggc atg atg gta ttg aaa cgt caa agt 816 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 gat gca gta cgt gat ggt gat cat att tac gcc att att aaa ggc ggc 864 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 gca tta tcg aat gac ggt aaa ggc gag ttt gta tta agc ccg aac acc 912 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 aag ggc caa gta tta gta tat gaa cgt gct tat gcc gat gca gat gtt 960 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 gac ccg agt aca gtt gac tat att gaa tgt cat gca acg ggc aca cct 1008 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His Ala Thr Gly Thr Pro 325 330 335 aag ggt gac aat gtt gaa ttg cgt tcg atg gaa acc ttt ttc agt cgc 1056 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 gta aat aac aaa cca tta ctg ggc tcg gtt aaa tct aac ctt ggt cat 1104 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 ttg tta act gcc gct ggt atg cct ggc atg acc aaa gct atg tta gcg 1152 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 cta ggt aaa ggt ctt att cct gca acg att aac tta aag caa cca ctg 1200 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 caa tct aaa aac ggt tac ttt act ggc gag caa atg cca acg acg act 1248 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 gtg tct tgg cca aca act ccg ggt gcc aag gca gat aaa ccg cgt acc 1296 Val Ser Trp Pro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 gca ggt gtg agc gta ttt ggt ttt ggt ggc agc aac gcc cat ttg gta 1344 Ala Gly Val Ser Val Phe Gly Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 tta caa cag cca acg caa aca ctc gag act aat ttt agt gtt gct aaa 1392 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 cca cgt gag cct ttg gct att att ggt atg gac agc cat ttt ggt agt 1440 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 480 gcc agt aat tta gcg cag ttc aaa acc tta tta aat aat aat caa aat 1488 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asn Asn Asn Gln Asn 485 490 495 acc ttc cgt gaa tta cca gaa caa cgc tgg aaa ggc atg gaa agt aac 1536 Thr Phe Arg Glu Leu Pro Glu Gln Arg Trp Lys Gly Met Glu Ser Asn 500 505 510 gct aac gtc atg cag tcg tta caa tta cgc aaa gcg cct aaa ggc agt 1584 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 tac gtt gaa cag cta gat att gat ttc ttg cgt ttt aaa gta ccg cct 1632 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 aat gaa aaa gat tgc ttg atc ccg caa cag tta atg atg atg caa gtg 1680 Asn Glu Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 560 gca gac aat gct gcg aaa gac gga ggt cta gtt gaa ggt cgt aat gtt 1728 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Arg Asn Val 565 570 575 gcg gta tta gta gcg atg ggc atg gaa ctg gaa tta cat cag tat cgt 1776 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 ggt cgc gtt aat cta acc acc caa att gaa gac agc tta tta cag caa 1824 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 ggt att aac ctg act gtt gag caa cgt gaa gaa ctg acc aat att gct 1872 Gly Ile Asn Leu Thr Val Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 aaa gac ggt gtt gcc tcg gct gca cag cta aat cag tat acg agt ttc 1920 Lys Asp Gly Val Ala Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 635 640 att ggt aat att atg gcg tca cgt att tcg gcg tta tgg gat ttt tct 1968 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 ggt cct gct att acc gta tcg gct gaa gaa aac tct gtt tat cgt tgt 2016 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 gtt gaa tta gct gaa aat cta ttt caa acc agt gat gtt gaa gcc gtt 2064 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 att att gct gct gtt gat ttg tct ggt tca att gaa aac att act tta 2112 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 cgt cag cac tac ggt cca gtt aat gaa aag gga tct gta agt gaa tgt 2160 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 ggt ccg gtt aat gaa agc agt tca gta acc aac aat att ctt gat cag 2208 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 caa caa tgg ctg gtg ggt gaa ggc gca gcg gct att gtc gtt aaa ccg 2256 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Val Lys Pro 740 745 750 tca tcg caa gtc act gct gaa caa gtt tat gcg cgt att gat gcg gtg 2304 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 agt ttt gcc cct ggt agc aat gcg aaa gca att acg att gca gcg gat 2352 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 aaa gca tta aca ctt gct ggt atc agt gct gct gat gta gct agt gtt 2400 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Asp Val Ala Ser Val 785 790 795 800 gaa gca cat gca agt ggt ttt agt gcc gaa aat aat gct gaa aaa acc 2448 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 gcg tta ccg act tta tac cca agc gca agt atc agt tcg gtg aaa gcc 2496 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 aat att ggt cat acg ttt aat gcc tcg ggt atg gcg agt att att aaa 2544 Asn Ile Gly His Thr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 acg gcg ctg ctg tta gat cag aat acg agt caa gat cag aaa agc aaa 2592 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 cat att gct att aac ggt cta ggt cgt gat aac agc tgc gcg cat ctt 2640 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 880 atc tta tcg agt tca gcg caa gcg cat caa gtt gca cca gcg cct gta 2688 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 tct ggt atg gcc aag caa cgc cca cag tta gtt aaa acc atc aaa ctc 2736 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 ggt ggt cag tta att agc aac gcg att gtt aac agt gcg agt tca tct 2784 Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 tta cac gct att aaa gcg cag ttt gcc ggt aag cac tta aac aaa gtt 2832 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 aac cag cca gtg atg atg gat aac ctg aag ccc caa ggt att agc gct 2880 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 cat gca acc aat gag tat gtg gtg act gga gct gct aac act caa gct 2928 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 tct aac att caa gca tct cat gtt caa gcg tca agt cat gca caa gag 2976 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 ata gca cca aac caa gtt caa aat atg caa gct aca gca gcc gct gta 3024 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Val 995 1000 1005 agt tca ccc ctt tct caa cat caa cac aca gcg cag ccc gta gcg gca 3072 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 ccg agc gtt gtt gga gtg act gtg aaa cat aaa gca agt aac caa att 3120 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 cat cag caa gcg tct acg cat aaa gca ttt tta gaa agt cgt tta gct 3168 His Gln Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 gca cag aaa aac cta tcg caa ctt gtt gaa ttg caa acc aag ctg tca 3216 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 atc caa act ggt agt gac aat aca tct aac aat act gcg tca aca agc 3264 Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 aat aca gtg cta aca aat cct gta tca gca acg cca tta aca ctt gtg 3312 Asn Thr Val Leu Thr Asn Pro Val Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 tat aat gcg cct gta gta gcg aca aac cta acc agt aca gaa gca aaa 3360 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 gcg caa gca gct gct aca caa gct ggt ttt cag ata aaa gga cct gtt 3408 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 ggt tac aac tat cca ccg ctg cag tta att gaa cgt tat aat aaa cca 3456 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 gaa aac gtg att tac gat caa gct gat ttg gtt gaa ttc gct gaa ggt 3504 Glu Asn Val Ile Tyr Asp Gln Ala Asp Leu Val Glu Phe Ala Glu Gly 1155 1160 1165 gat att ggt aag gta ttt ggt gct gaa tac aat att att gat ggc tat 3552 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 tcg cgt cgt gta cgt ctg cca acc tca gat tac ttg tta gta aca cgt 3600 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 gtt act gaa ctt gat gcc aag gtg cat gaa tac aag aaa tca tac atg 3648 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Ser Tyr Met 1205 1210 1215 tgt act gaa tat gat gtg cct gtt gat gca ccg ttc tta att gat ggt 3696 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 cag atc cct tgg tct gtt gcc gtc gaa tca ggc cag tgt gat ttg atg 3744 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 ttg att tca tat atc ggt att gat ttc caa gcg aaa ggc gaa cgt gtt 3792 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 tac cgt tta ctt gat tgt gaa tta act ttc ctt gaa gag atg gct ttt 3840 Tyr Arg Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 ggt ggc gat act tta cgt tac gag atc cac att gat tcg tat gca cgt 3888 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 aac ggc gag caa tta tta ttc ttc ttc cat tac gat tgt tac gta ggg 3936 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 1310 gat aag aag gta ctt atc atg cgt aat ggt tgt gct ggt ttc ttt act 3984 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 gac gaa gaa ctt tct gat ggt aaa ggc gtt att cat aac gac aaa gac 4032 Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 aaa gct gag ttt agc aat gct gtt aaa tca tca ttc acg ccg tta tta 4080 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 caa cat aac cgt ggt caa tac gat tat aac gac atg atg aag ttg gtt 4128 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 aat ggt gat gtt gcc agt tgt ttt ggt ccg caa tat gat caa ggt ggc 4176 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 cgt aat cca tca ttg aaa ttc tcg tct gag aag ttc ttg atg att gaa 4224 Arg Asn Pro Ser Leu Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 cgt att acc aag ata gac cca acc ggt ggt cat tgg gga cta ggc ctg 4272 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 tta gaa ggt cag aaa gat tta gac cct gag cat tgg tat ttc cct tgt 4320 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 cac ttt aaa ggt gat caa gta atg gct ggt tcg ttg atg tcg gaa ggt 4368 His Phe Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 tgt ggc caa atg gcg atg ttc ttc atg ctg tct ctt ggt atg cat acc 4416 Cys Gly Gln Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 aat gtg aac aac gct cgt ttc caa cca cta cca ggt gaa tca caa acg 4464 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 gta cgt tgt cgt ggg caa gta ctg cca cag cgc aat acc tta act tac 4512 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 cgt atg gaa gtt act gcg atg ggt atg cat cca cag cca ttc atg aaa 4560 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 gct aat att gat att ttg ctt gac ggt aaa gtg gtt gtt gat ttc aaa 4608 Ala Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 aac ttg agc gtg atg atc agc gaa caa gat gag cat tca gat tac cct 4656 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 gta aca ctg ccg agt aat gtg gcg ctt aaa gcg att act gca cct gtt 4704 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 gcg tca gta gca cca gca tct tca ccc gct aac agc gcg gat cta gac 4752 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 gaa cgt ggt gtt gaa ccg ttt aag ttt cct gaa cgt ccg tta atg cgt 4800 Glu Arg Gly Val Glu Pro Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 gtt gag tca gac ttg tct gca ccg aaa agc aaa ggt gtg aca ccg att 4848 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 aag cat ttt gaa gcg cct gct gtt gct ggt cat cat aga gtg cct aac 4896 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 caa gca ccg ttt aca cct tgg cat atg ttt gag ttt gcg acg ggt aat 4944 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 att tct aac tgt ttc ggt cct gat ttt gat gtt tat gaa ggt cgt att 4992 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 cca cct cgt aca cct tgt ggc gat tta caa gtt gtt act cag gtt gta 5040 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 gaa gtg cag ggc gaa cgt ctt gat ctt aaa aat cca tca agc tgt gta 5088 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 gct gaa tac tat gta ccg gaa gac gct tgg tac ttt act aaa aac agc 5136 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 cat gaa aac tgg atg cct tat tca tta atc atg gaa att gca ttg caa 5184 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 cca aat ggc ttt att tct ggt tac atg ggc acg acg ctt aaa tac cct 5232 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 1730 1735 1740 gaa aaa gat ctg ttc ttc cgt aac ctt gat ggt agc ggc acg tta tta 5280 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 aag cag att gat tta cgc ggc aag acc att gtg aat aaa tca gtc ttg 5328 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 gtt agt acg gct att gct ggt ggc gcg att att caa agt ttc acg ttt 5376 Val Ser Thr Ala Ile Ala Gly Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 gat atg tct gta gat ggc gag cta ttt tat act ggt aaa gct gta ttt 5424 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 ggt tac ttt agt ggt gaa tca ctg act aac caa ctg ggc att gat aac 5472 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 ggt aaa acg act aat gcg tgg ttt gtt gat aac aat acc ccc gca gcg 5520 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 aat att gat gtg ttt gat tta act aat cag tca ttg gct ctg tat aaa 5568 Asn Ile Asp Val Phe Asp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 gcg cct gtg gat aaa ccg cat tat aaa ttg gct ggt ggt cag atg aac 5616 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gln Met Asn 1860 1865 1870 ttt atc gat aca gtg tca gtg gtt gaa ggc ggt ggt aaa gcg ggc gtg 5664 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 gct tat gtt tat ggc gaa cgt acg att gat gct gat gat tgg ttc ttc 5712 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 cgt tat cac ttc cac caa gat ccg gtg atg cca ggt tca tta ggt gtt 5760 Arg Tyr His Phe His Gln Asp Pro Val Met Pro Gly Ser Leu Gly Val 1905 1910 1915 1920 gaa gct att att gag ttg atg cag acc tat gcg ctt aaa aat gat ttg 5808 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 ggt ggc aag ttt gct aac cca cgt ttc att gcg ccg atg acg caa gtt 5856 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 gat tgg aaa tac cgt ggg caa att acg ccg ctg aat aaa cag atg tca 5904 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 ctg gac gtg cat atc act gag atc gtg aat gac gct ggt gaa gtg cga 5952 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 atc gtt ggt gat gcg aat ctg tct aaa gat ggt ctg cgt att tat gaa 6000 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 gtt aaa aac atc gtt tta agt att gtt gaa gcg taa 6036 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 7 <211> 2011 <212> PRT <213> Moritella marina <400> 7 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 Thr Ala Asn Lys Gly Asp Thr Asp Lys Phe Tyr Cys Val His Gly Gly 50 55 60 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 Asp Tyr Leu Ala Gly Leu Asp Asp Leu Asn Gln Trp Gly Leu Tyr Val 85 90 95 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 Asp Ala Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 Tyr Leu His Thr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His Ala Thr Gly Thr Pro 325 330 335 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 Val Ser Trp Pro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 Ala Gly Val Ser Val Phe Gly Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 480 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asn Asn Asn Gln Asn 485 490 495 Thr Phe Arg Glu Leu Pro Glu Gln Arg Trp Lys Gly Met Glu Ser Asn 500 505 510 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 Asn Glu Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 560 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Arg Asn Val 565 570 575 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 Gly Ile Asn Leu Thr Val Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 Lys Asp Gly Val Ala Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 635 640 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Val Lys Pro 740 745 750 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Asp Val Ala Ser Val 785 790 795 800 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 Asn Ile Gly His Thr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 880 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Val 995 1000 1005 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 His Gln Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 Asn Thr Val Leu Thr Asn Pro Val Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 Glu Asn Val Ile Tyr Asp Gln Ala Asp Leu Val Glu Phe Ala Glu Gly 1155 1160 1165 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Ser Tyr Met 1205 1210 1215 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 Tyr Arg Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 1310 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 Arg Asn Pro Ser Leu Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 His Phe Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 Cys Gly Gln Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 Ala Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 Glu Arg Gly Val Glu Pro Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 1730 1735 1740 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 Val Ser Thr Ala Ile Ala Gly Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 Asn Ile Asp Val Phe Asp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gln Met Asn 1860 1865 1870 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 Arg Tyr His Phe His Gln Asp Pro Val Met Pro Gly Ser Leu Gly Val 1905 1910 1915 1920 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 8 <211> 1617 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(1614) <400> 8 atg tcg agt tta ggt ttt aac aat aac aac gca att aac tgg gct tgg 48 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 aaa gta gat cca gcg tca gtt cat aca caa gat gca gaa att aaa gca 96 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 gct tta atg gat cta act aaa cct ctc tat gtg gcg aat aat tca ggc 144 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 gta act ggt ata gct aat cat acg tca gta gca ggt gcg atc agc aat 192 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 aac atc gat gtt gat gta ttg gcg ttt gcg caa aag tta aac cca gaa 240 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 gat ctg ggt gat gat gct tac aag aaa cag cac ggc gtt aaa tat gct 288 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 tat cat ggc ggt gcg atg gca aat ggt att gcc tcg gtt gaa ttg gtt 336 Tyr His Gly Gly Ala Met Ala Asn Gly Ile Ala Ser Val Glu Leu Val 100 105 110 gtt gcg tta ggt aaa gca ggg ctg tta tgt tca ttt ggt gct gca ggt 384 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 cta gtg cct gat gcg gtt gaa gat gca att cgt cgt att caa gct gaa 432 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 tta cca aat ggc cct tat gcg gtt aac ttg atc cat gca cca gca gaa 480 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 gaa gca tta gag cgt ggc gcg gtt gaa cgt ttc cta aaa ctt ggc gtc 528 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 aag acg gta gag gct tca gct tac ctt ggt tta act gaa cac att gtt 576 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 tgg tat cgt gct gct ggt cta act aaa aac gca gat ggc agt gtt aat 624 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 atc ggt aac aag gtt atc gct aaa gta tcg cgt acc gaa gtt ggt cgc 672 Ile Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg 210 215 220 cgc ttt atg gaa cct gca ccg caa aaa tta ctg gat aag tta tta gaa 720 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 caa aat aag atc acc cct gaa caa gct gct tta gcg ttg ctt gta cct 768 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 atg gct gat gat att act ggg gaa gcg gat tct ggt ggt cat aca gat 816 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 aac cgt ccg ttt tta aca tta tta ccg acg att att ggt ctg cgt gat 864 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thr Ile Ile Gly Leu Arg Asp 275 280 285 gaa gtg caa gcg aag tat aac ttc tct cct gca tta cgt gtt ggt gct 912 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 ggt ggt ggt atc gga acg cct gaa gca gca ctc gct gca ttt aac atg 960 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 ggc gcg gct tat atc gtt ctg ggt tct gtg aat cag gcg tgt gtt gaa 1008 Gly Ala Ala Tyr Ile Val Leu Gly Ser Val Asn Gln Ala Cys Val Glu 325 330 335 gcg ggt gca tct gaa tat act cgt aaa ctg tta tcg aca gtt gaa atg 1056 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 gct gat gtg act atg gca cct gct gca gat atg ttt gaa atg ggt gtg 1104 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 aag ctg caa gta tta aaa cgc ggt tct atg ttc gcg atg cgt gcg aag 1152 Lys Leu Gln Val Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 aaa ctg tat gac ttg tat gtg gct tat gac tcg att gaa gat atc cca 1200 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 390 395 400 gct gct gaa cgt gag aag att gaa aaa caa atc ttc cgt gca aac cta 1248 Ala Ala Glu Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 gac gag att tgg gat ggc act atc gct ttc ttt act gaa cgc gat cca 1296 Asp Glu Ile Trp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 gaa atg cta gcc cgt gca acg agt agt cct aaa cgt aaa atg gca ctt 1344 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 atc ttc cgt tgg tat ctt ggc ctt tct tca cgc tgg tca aac aca ggc 1392 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 gag aag gga cgt gaa atg gat tat cag att tgg gca ggc cca agt tta 1440 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 ggt gca ttc aac agc tgg gtg aaa ggt tct tac ctt gaa gac tat acc 1488 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 cgc cgt ggc gct gta gat gtt gct ttg cat atg ctt aaa ggt gct gcg 1536 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 tat tta caa cgt gta aac cag ttg aaa ttg caa ggt gtt agc tta agt 1584 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 aca gaa ttg gca agt tat cgt acg agt gat taa 1617 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 9 <211> 538 <212> PRT <213> Moritella marina <400> 9 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 Tyr His Gly Gly Ala Met Ala Asn Gly Ile Ala Ser Val Glu Leu Val 100 105 110 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 Ile Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg 210 215 220 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thr Ile Ile Gly Leu Arg Asp 275 280 285 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 Gly Ala Ala Tyr Ile Val Leu Gly Ser Val Asn Gln Ala Cys Val Glu 325 330 335 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 Lys Leu Gln Val Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 390 395 400 Ala Ala Glu Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 Asp Glu Ile Trp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 10 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence:Primer <220> <221> Degenerate <222> (6) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (12) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (15) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (18) <223> "n" is a, t, c or g <400> 10 ttyggnttyg gnggnacnaa 20 <210> 11 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence:Primer <220> <221> Degenerate <222> (4) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (7) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (10) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (16) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (19) <223> "n" is a, t, c or g <400> 11 ytcnccnarn swrtgnccng c 21 [Sequence List] SEQUENCE LISTING <110> Director-General of Agency of Industrial Science and Technology <120> Gene from Docosahexaenoic Acid Producing Bacteria <130> P99-0665 <160> 11 <170> PatentIn Ver. 2.0 <210> 1 <211> 41587 <212> DNA <213> Moritella marina <400> 1 gatcactctg ctgcatggcg agagctgttt aattacaggt tgaaaaaaac gatgtaatgc 60 acttaattgc ttgctgttct taatgcctga ggcgtcgaag ataataccgt tgaagcgatc 120 tgttttagcg atagcattaa ggctaatagg tgtcgcgact aaagacgttt gattaaattc 180 aatattaaga tcggctaacg ctgacgtgtt attaggataa gaaatcgtga cttcagcatc 240 tttaaatgtg ttaagaatgg gtttaattaa tttgctgttg ctggctgcgc cgatgagtaa 300 gttgccagag atgagatcgg ttccctgatc gtagcgtgtt aacgtaaccg gtcgtggcag 360 attaagcgct ttaaataaac ctgatgtcca cttgccatta gcgagttttg cgtatgtatc 420 cgtcattttc taatccttgt tatagtgaac agtttgaatc tcgaagatgt acatgtgtta 480 aaaattatct gatagctatg acttatctgc cactacgtaa taataaatag accagttcat 540 tacatcgtta atcgatatag tataactaaa tactaagtaa attataatga taagactgtt 600 atcgtactcg gatcaaactc tgatcagcaa ataatcaaat tagagttttt attttaaact 660 tgtatcaaca atgttacatt aatgtatctt acgtctaatg tgctacgggc atatttaagt 720 cactaaatta aaggaataaa ccatgacagg tcaaacaata agaagagtag caattatcgg 780 cggtaaccgt atcccgtttg cacgttcaaa tacagcgtat tcaaaactaa gtaaccaaga 840 tatgctgacg gaaactatcc gtggcttggt ggttaaatat aacctacgtg gtgaacaact 900 gggggaagtt gttgctggtg cggtaattaa gcattctcgt gattttaact taacacgtga 960 agccgtgcta agtgcaggtc ttgcacctga aacgccttgt tatgacattc aacaagcttg 1020 tggtactggt ctagctgcag ctatccaagt agcaaacaaa attgcgcttg gtcaaataga 1080 agcgggtatt gctggtggtt ctgatacgac atcagatgca ccgattgcag tcagtgaagg 1140 catgcgtagt gtattacttg agcttaatcg agctaaaacg ggtaagcaac gtttgaaagc 1200 actatctcgt ctacgtctaa aacactttgc gccactaacg cctgcaaata aagagccgcg 1260 taccaaaatg gcgatgggcg atcattgtca agtaacagcg aaagagtgga atatctcacg 1320 tgaagcacaa gatgcattgg cctgcgcaag tcatcaaaaa ttagctgcag catatgaaga 1380 aggtttcttt gatacgttag tttcacctat ggccggctta acgaaagata acgtattacg 1440 cgcagataca acagttgaga aactggctaa attgaaacct tgttttgata aagtaaacgg 1500 cactatgacg gcgggtaaca gtactaacct taccgatgga gcatcagctg tattacttgc 1560 aagtgaagaa tgggcagcgg cacataactt accagtacaa gcttatctaa catttggtga 1620 aacggccgct atcgacttcg ttgataagaa agaaggtctg ttaatggcgc ctgcatacgc 1680 agtgccaaaa atgttgaagc gtgctggcct tacattacaa gacttcgatt actatgaaat 1740 acatgaagca tttgctgcgc agttattagc aacgctagca gcttgggaag acgaaaaatt 1800 ctgtaaagaa aaactgggtc tagatgctgc gcttggttca attgatatga ccaagttaaa 1860 cgtgaaaggg agtagcttag ccacgggtca cccatttgcc gcaactggtg gtcgtgttgt 1920 cgctacgcta gcgcaattac ttgatcagaa aggttcaggt cgtggtttga tctcgatttg 1980 tgctgctggt ggtcaaggta tcacggcaat tttagagaaa taaacgcact gtttattatc 2040 tattgattaa gctgtcctga gatactggat atttttaaat aaaacgccaa tactgcagag 2100 tattggcgtt tttttgtaat accaattcct atataacggt gcattttaaa cacttaattt 2160 ccggcattgg tatcataaaa aagcagcacc gaagtgctgc ttgattgtag attaacctat 2220 taaaatagag aggctagaat tagtcttcgt atgcttcatt atgtacgcca gctgcacgac 2280 ccgatggatc agcattgttt tggaaacttt catcccaagc taatgcttct acagttgaac 2340 aagcaacgga tttaccaaac ggtacgcatt tcgctgctga atcacctggg aagtgatctt 2400 caaagatggc acgatagtag taaccttctt tcgtatctgg tgtgttaatt gggaacttaa 2460 atgctgcact tgctaacatt tgatcagtta ccgcttcttc aacgtgtact ttaagttggt 2520 caatccaaga ataaccaaca ccatcagaga attgttcttt ttgacgccat acaatttctt 2580 caggtagtaa atcttcaaat gcttctcgaa tgatgttttt ctcaatgcgg tcgcccgtga 2640 tcatttttag ttcagggttt agacgcattg acgcatcaac aaattcttta tctaagaaag 2700 gaacacgtgc ttcgatgccc caagctgcca tagatttgtt tgcacgtaag caatcaaaca 2760 tatgtaattt atttacttta cgtaccgtct cttcatggaa ttctttcgca tttggcgctt 2820 tgtggaagta caagtaacca ccgaacagtt catcagcacc ttcaccagaa agcaccatct 2880 taatccccat ggctttaatt ttacgtgcca ttaggtacat aggggttgat gcacgaattg 2940 ttgttacatc gtaggtttca atgtggtaaa tcacgtcgcg taaagcgtcg ataccttctt 3000 gcacagtaaa ttcaattgaa tgatggatag tacctaagtg atctgccact ttttgtgcag 3060 cggctaaatc tggagaacca tttaggccta cagagaaaga gtgtagttgt ggccaccatg 3120 cttcggtttt accaccgtct tcaatacgac gttttgcata ctgttgggtg attgctgaaa 3180 taacagatga atctaacccg cctgataata atacgccgta aggtacatca cacattaatt 3240 gacgtttaac tgcatcttcc aaaccttgct taacaacgct tttatcacca ccattttgtg 3300 caacgttatc aaaatctttc caatcacgtt gataataagg cgtgactaca ccatccttac 3360 tccacaggta atgacctgct gggaattctt caatttgagt acaaattggc actagtgctt 3420 tcatttcaga ggcaacataa aagttaccgt gttcatcata gcccgtataa agagggatga 3480 taccgatatg gtcacggcca atcaggtaag cgtcctctgt ttcgtcatat aaagcgaaag 3540 caaaaatacc atttagatca tctaaaaatt gtgtgccttt ttctttatat agcgcaagta 3600 tcacttcgca atctgattct gtttggaatt caaagtctac gttcagcgtt ttctttaaat 3660 ctttgtggtt ataaatttca ccattaacag caagtacgtg tgtcttttct tcattatata 3720 gcggctgtgc accattattt acatcgacaa tagcaagacg ttcatgaact aaaatagcat 3780 tgtcacttgt atagatacct gaccaatctg ggccgcggtg acgtagtaac tttgatagtt 3840 ctagtgcttg ttcgcgaaga ggtttaatgt ctgatttgat gtctagaatt ccgaatattg 3900 agcacataac taattccttc tggggctgcg tctgcagcta actttctaaa tagtgtgtct 3960 aatttgccac attgtagatt taatgcaaac attaatgata aaacatttat aaaaaatgta 4020 attcaatgtg gaatcgataa tttaatggct taaaagtgaa gatccattaa ttgtgatggc 4080 gaggtgatag accaatgtag accttaatga ataaagcagg cacgattgaa tccattcaac 4140 gcaaagtggt actaactatt gttttaaacg ttataaatag tgttttaaag gttataagta 4200 aataatttaa aaacaataat aatccacatg cattaaattt atcatgataa accgctatat 4260 ctcaatggca atttgggata agtgtaaaat atatgtaaaa tgaatgagtt gacttgcttt 4320 ttttacacta agtgatgaaa ttaaagctag atgtcgttgt tagcattgat taataacgta 4380 ctaaaatacg acatctagta tagaaattta aaaaacagtt ggttttgata gcataactgc 4440 ataaactaat cagcttattg tctgtaatat ttttgtaatt taaataggtt taataaaatt 4500 atatgtctga taaatataaa ccgtacgacc tttcctttaa aaagacgttt ttgctgccta 4560 agttttggcc tgtgtggttc ggggtgtttg caatatactt attagctttt atgccagtaa 4620 agccgcgtga taaatttgct cgattcatag cgaagaaatt gtttagtcta aaaatgatgg 4680 caaagcgtaa aaaggtagca aagatcaatt tatctatgtg cttccctgaa atggatgata 4740 cggaacaaga ccgtataatc atggtcaatc tagttacttt ttgtcaaact atcttaagtt 4800 atgcagagcc aagtgcgcgt agtcgtgctt ataaccgtga ccgtatgata gtgcatggtg 4860 gcgagaattt atttccgcta cttgaacaag gtaaggcttg tatcttatta gtgccgcata 4920 gcttcgctat tgattttgca ggtttacaca ttgcttctta tggcgcgcca ttttgtacta 4980 tgtttaacaa ttctgagaat gagttgttcg attggctgat gacacgtcaa cgcgctatgt 5040 ttggaggcac tgtttatcac cgcaaggcag ggctaggggc tctagttaaa tcacttaaga 5100 gcggtgaaag ctgttattac ttacctgatg aagaccatgg acctaagcgt agtgtatttg 5160 cgcctttatt tgcgactcaa aaagcaactt tacctgtaat gggcaagcta gcagaaaaaa 5220 caaatgcact cgttgttcct gtttatgcgg catataatga atcactaggt aaatttgaaa 5280 cctttattcg accagcaatg caaaactttc catcagaaag cccagaacaa gatgcagtga 5340 tgatgaataa agagattgaa gccttgattg aatgtggtgt tgatcaatat atgtggacac 5400 ttagattatt gagaacacgt ccggacggta aaaaaatcta ctaataaagt ttaataaaca 5460 ccataatctt cgttgaatat ggtgtttacc cccctgaata ccctctaaat taataacaaa 5520 aaaagccatt tacgtaacat ctaatgatga tttagcctgc acttgctttg tttttagtct 5580 taagagccta ataaacttga tctaggtata gattctgtct ttctttacgt aacgcgatct 5640 atttttttta accgatagtt gttataatta gtttcatatg aaagagatat cgtttcagta 5700 aaagctattt cgtttcaata gataatttat ttatagtcat attttctgta atgacaatca 5760 ttttctcatc tagactatag ataagaatac gaattaagta agaacattaa ttttacaaga 5820 atataaaata tcccatcgga gctataagaa tgaaaaagac taaaattgtt tgtacaattg 5880 gtccaaaaac tgaatcagta gagaaactaa cagagcttgt taatgcaggc atgaacgtta 5940 tgcgtttaaa tttctctcat ggtaactttg ctgaacattc agtgcgtatt caaaatatcc 6000 gtcaagtaag tgaaaacctg aataagaaaa ttgctgtttt actggatact aaaggtccag 6060 aaatccgtac gattaaacta gaaaacggtg acgatgtaat gttgaccgct ggtcagtcat 6120 tcacgtttac aacagacatt aacgtggtag gtaataaaga ctgtgttgct gtaacatatg 6180 ctggttttgc taaagacctt aatcctggtg caatcatcct tgttgatgat ggtttaattg 6240 aaatggaagt tgttgcaaca actgacactg aagttaaatg tacagtatta aatactggtg 6300 cacttggtga aaataaaggc gttaacttac ctaacatcag tgtaggtcta cctgcattgt 6360 cagaaaaaga taaagctgat ttagcgtttg gttgtgagca agaagttgat tttgttgctg 6420 catcatttat tcgtaaggct gatgatgtaa gagaaattcg tgaaatccta tttaataatg 6480 gtggcgaaaa cattcagatt atctcgaaaa ttgaaaacca agaaggtgta gacaatttcg 6540 atgaaatctt agctgaatca gacggtatca tggttgctcg tggcgatctc ggtgttgaga 6600 tcccagttga agaagtgatc atggcacaga agatgatgat caaaaaatgt aataaagcag 6660 gtaaagttgt aattactgca acacaaatgc ttgattcaat gatcagtaac ccacgtccaa 6720 cacgtgcaga agcgggcgat gttgccaatg ctgtgcttga cggtaccgac gcggtaatgc 6780 tttctggtga aactgcgaaa ggtaaatacc cagttgaagc tgtgtctatc atggcaaaca 6840 tctgtgaacg tactgataac tcaatgtctt cggatttagg tgcgaacatt gttgctaaaa 6900 gcatgcgcat tacagaagct gtgtgtaaag gtgcggtaga aacaacagaa aaattgtgtg 6960 ctccacttat tgttgttgca actcgtggcg gtaaatcagc aaaatctgtt cgtaaatact 7020 tcccgaaagc aaatattctt gctatcacaa caaatgaaaa agcagcgcaa cagttatgcc 7080 taactaaagg cgtaagcagc tgcatcgttg agcagattga tagcactgat gagttctacc 7140 gtaaaggtaa agagcttgca ttagcaactg gtttagctaa agaaggcgat atcgttgtta 7200 tggtatcagg tgcgttagta ccatcaggta caacgaatac ggcatctgtt caccaacttt 7260 aagttgccat attgatatta taaaaaagag agcgtatgct ctcttttttt atatctgtag 7320 tttatatgtc tgtacaaaaa aatgataaag agtacataaa ctattaatat agcgtaatat 7380 ataatgatta acggtgatga aagggttaaa taaatggata gtgctaaaca taaaattggc 7440 ttagtccttt ctggcggtgg tgcgaaaggt attgctcatc ttggtgtatt aaaatacctg 7500 ttagagcaag atataagacc gaatgtaatt gcgggtacaa gtgctggctc tatggttggt 7560 gcactttatt gctcaggact tgagattgat gacattttac aattcttcat cgatgtaaaa 7620 cctttttctt ggaagtttac ccgtgcccgt gctggcttta tagacccggc aaaattatat 7680 cctgaagtgc taaaatatat ccccgaggat agctttgagt accttcaacc tgaattgcgc 7740 attgttgcca ccaacatgtt actcggtaaa gagcatatat ttaaagatgg ctccgtgatt 7800 aatgccttat tagcatcagc cagctaccct ttagtttttt ctccgatgat cattgacgat 7860 caagtgtatt cagatggcgg tattgttaat catttccccg tgagtgtcat tgaagatgat 7920 tgcgataaaa taatcggcgt atacgtgtcg cccattcgtc aggtcgaagc tgacgaactc 7980 tcgagtataa aagacgtggt attacgtgcg ttcacgctgc agggtagtgg tgctgaatta 8040 gataaactat cgcaatgtga tgtgcaaatt tatccagaag cgctattgaa ttacaatacg 8100 tttgcaaccg atgaaaaatc attacgggag atctaccaga ttggttatga tgctgcaaaa 8160 gatcaacatg acaaccttat ggcattgaaa gaaagtatca ccaccagcga ggttaaaaag 8220 aacgtcttta gcaaatggtt tggtgataaa cttgctagca acagcggcaa atagcggccc 8280 acacggattt atacactagg ataatgggcg ttaatagcct cactgtcgtt gtgtggtctc 8340 taattttagc taaatcttgt gttatactga cttcctatta atcataaacg atttatcacg 8400 gtaaacatga ctcaaataaa taacccgctt cacggcatga cactcgaaaa agtaattaac 8460 agtctcgttg aacaatatgg ctgggatggt cttggatact acatcaacat tcgttgcttt 8520 actgaaaatc caagtgttaa gtctagtctt aaatttttac gtaaaacccc ttgggcacgt 8580 gataaagtag aagcgctata tatcaaaatg gtgactgaag gctaactgtc tccacgctag 8640 cgaaccgctg tttatagtta atataagtac tataagcagg gctcgttaat tcagtatgta 8700 attaatcctg aataccttcc gcttatttca acattgtact ctctagataa cactctcaac 8760 attacacctt caacatcaca gcctccacat aacatccgat gacatagccc tgttattttt 8820 cacatttatc tatatgctat atattttagc catttgatca attgagttaa tttctgcaat 8880 gacaaagata taccatcatc cagtacaaat ttattatgaa gataccgacc attctggtgt 8940 tgtttaccac cctaactttt taaaatactt tgaacgtgca cgtgagcatg tgataaatag 9000 tgacttacta gcaacattgt ggaatgaacg cggtttaggt tttgcggtgt ataaagccaa 9060 tatgactttt caggatgggg tcgaatttgc tgaagtgtgt gatattcgca cttcttttgt 9120 cctagacggt aagtacaaaa cgatctggcg ccaagaagta tggcgtccga atgcgactag 9180 ggctgccgtt atcggtgata ttgaaatggt gtgcttagac aaacaaaaac gtttacagcc 9240 catccctgat gatgtgttag ctgcaatggt tagtgaataa atggttcatg cataaatagt 9300 taatacatga ttctggcccg tcacgtttac agataagagg catccgatgc ctccttccta 9360 ttaccaatac tactgcttat ccctttctaa ctatctttag cgtccataac acactgagca 9420 tttattctat taatcagtga ttgtgattta attatcttct atatatgtaa tttaatgtaa 9480 ttttcaattt atttttagct acattaaggc ttacgaatgt acgctaaaat gagatgtcag 9540 actaatttta gcttattaat ctgttagccg tttatatttt ataaagatgg gatttaactt 9600 aaatgcaatt aattatggcg taaatagagt gaaaacatgg ctaatattca ctaagtcctg 9660 aattttatat aaagtttaat ctgttatttt agcgtttacc tggtcttatc agtgaggttt 9720 atagccatta ttagtgggat tgaagtgatt tttaaagcta tgtatattat tgcaaatata 9780 aattgtaaca attaagactt tggacacttg agttcaattt cgaattgatt ggcataaaat 9840 ttaaaacagc taaatctacc tcaatcattt tagcaaatgt atgcaggtag atttttttcg 9900 ccatttaaga gtacacttgt acgctaggtt tttgtttagt gtgcaaatga acgttttgat 9960 gagcattgtt tttagagcac aaaatagatc cttacaggag caataacgca atggctaaaa 10020 agaacaccac atcgattaag cacgccaagg atgtgttaag tagtgatgat caacagttaa 10080 attctcgctt gcaagaatgt ccgattgcca tcattggtat ggcatcggtt tttgcagatg 10140 ctaaaaactt ggatcaattc tgggataaca tcgttgactc tgtggacgct attattgatg 10200 tgcctagcga tcgctggaac attgacgacc attactcggc tgataaaaaa gcagctgaca 10260 agacatactg caaacgcggt ggtttcattc cagagcttga ttttgatccg atggagtttg 10320 gtttaccgcc aaatatcctc gagttaactg acatcgctca attgttgtca ttaattgttg 10380 ctcgtgatgt attaagtgat gctggcattg gtagtgatta tgaccatgat aaaattggta 10440 tcacgctggg tgtcggtggt ggtcagaaac aaatttcgcc attaacgtcg cgcctacaag 10500 gcccggtatt agaaaaagta ttaaaagcct caggcattga tgaagatgat cgcgctatga 10560 tcatcgacaa atttaaaaaa gcctacatcg gctgggaaga gaactcattc ccaggcatgc 10620 taggtaacgt tattgctggt cgtatcgcca atcgttttga ttttggtggt actaactgtg 10680 tggttgatgc ggcatgcgct ggctcccttg cagctgttaa aatggcgatc tcagacttac 10740 ttgaatatcg ttcagaagtc atgatatcgg gtggtgtatg ttgtgataac tcgccattca 10800 tgtatatgtc attctcgaaa acaccagcat ttaccaccaa tgatgatatc cgtccgtttg 10860 atgacgattc aaaaggcatg ctggttggtg aaggtattgg catgatggcg tttaaacgtc 10920 ttgaagatgc tgaacgtgac ggcgacaaaa tttattctgt actgaaaggt atcggtacat 10980 cttcagatgg tcgtttcaaa tctatttacg ctccacgccc agatggccaa gcaaaagcgc 11040 taaaacgtgc ttatgaagat gccggttttg cccctgaaac atgtggtcta attgaaggcc 11100 atggtacggg taccaaagcg ggtgatgccg cagaatttgc tggcttgacc aaacactttg 11160 gcgccgccag tgatgaaaag caatatatcg ccttaggctt agttaaatcg caaattggtc 11220 atactaaatc tgcggctggc tctgcgggta tgattaaggc ggcattagcg ctgcatcata 11280 aaatcttacc tgcaacgatc catatcgata aaccaagtga agccttggat atcaaaaaca 11340 gcccgttata cctaaacagc gaaacgcgtc cttggatgcc acgtgaagat ggtattccac 11400 gtcgtgcagg tatcagctca tttggttttg gcggcaccaa cttccatatt attttagaag 11460 agtatcgccc aggtcacgat agcgcatatc gcttaaactc agtgagccaa actgtgttga 11520 tctcggcaaa cgaccaacaa ggtattgttg ctgagttaaa taactggcgt actaaactgg 11580 ctgtcgatgc tgatcatcaa gggtttgtat ttaatgagtt agtgacaacg tggccattaa 11640 aaaccccatc cgttaaccaa gctcgtttag gttttgttgc gcgtaatgca aatgaagcga 11700 tcgcgatgat tgatacggca ttgaaacaat tcaatgcgaa cgcagataaa atgacatggt 11760 cagtacctac cggggtttac tatcgtcaag ccggtattga tgcaacaggt aaagtggttg 11820 cgctattctc agggcaaggt tcgcaatacg tgaacatggg tcgtgaatta acctgtaact 11880 tcccaagcat gatgcacagt gctgcggcga tggataaaga gttcagtgcc gctggtttag 11940 gccagttatc tgcagttact ttccctatcc ctgtttatac ggatgccgag cgtaagctac 12000 aagaagagca attacgttta acgcaacatg cgcaaccagc gattggtagt ttgagtgttg 12060 gtctgttcaa aacgtttaag caagcaggtt ttaaagctga ttttgctgcc ggtcatagtt 12120 tcggtgagtt aaccgcatta tgggctgccg atgtattgag cgaaagcgat tacatgatgt 12180 tagcgcgtag tcgtggtcaa gcaatggctg cgccagagca acaagatttt gatgcaggta 12240 agatggccgc tgttgttggt gatccaaagc aagtcgctgt gatcattgat acccttgatg 12300 atgtctctat tgctaacttc aactcgaata accaagttgt tattgctggt actacggagc 12360 aggttgctgt agcggttaca accttaggta atgctggttt caaagttgtg ccactgccgg 12420 tatctgctgc gttccataca cctttagttc gtcacgcgca aaaaccattt gctaaagcgg 12480 ttgatagcgc taaatttaaa gcgccaagca ttccagtgtt tgctaatggc acaggcttgg 12540 tgcattcaag caaaccgaat gacattaaga aaaacctgaa aaaccacatg ctggaatctg 12600 ttcatttcaa tcaagaaatt gacaacatct atgctgatgg tggccgcgta tttatcgaat 12660 ttggtccaaa gaatgtatta actaaattgg ttgaaaacat tctcactgaa aaatctgatg 12720 tgactgctat cgcggttaat gctaatccta aacaacctgc ggacgtacaa atgcgccaag 12780 ctgcgctgca aatggcagtg cttggtgtcg cattagacaa tattgacccg tacgacgccg 12840 ttaagcgtcc acttgttgcg ccgaaagcat caccaatgtt gatgaagtta tctgcagcgt 12900 cttatgttag tccgaaaacg aagaaagcgt ttgctgatgc attgactgat ggctggactg 12960 ttaagcaagc gaaagctgta cctgctgttg tgtcacaacc acaagtgatt gaaaagatcg 13020 ttgaagttga aaagatagtt gaacgcattg tcgaagtaga gcgtattgtc gaagtagaaa 13080 aaatcgtcta cgttaatgct gacggttcgc ttatatcgca aaataatcaa gacgttaaca 13140 gcgctgttgt tagcaacgtg actaatagct cagtgactca tagcagtgat gctgaccttg 13200 ttgcctctat tgaacgcagt gttggtcaat ttgttgcaca ccaacagcaa ttattaaatg 13260 tacatgaaca gtttatgcaa ggtccacaag actacgcgaa aacagtgcag aacgtacttg 13320 ctgcgcagac gagcaatgaa ttaccggaaa gtttagaccg tacattgtct atgtataacg 13380 agttccaatc agaaacgcta cgtgtacatg aaacgtacct gaacaatcag acgagcaaca 13440 tgaacaccat gcttactggt gctgaagctg atgtgctagc aaccccaata actcaggtag 13500 tgaatacagc cgttgccact agtcacaagg tagttgctcc agttattgct aatacagtga 13560 cgaatgttgt atctagtgtc agtaataacg cggcggttgc agtgcaaact gtggcattag 13620 cgcctacgca agaaatcgct ccaacagtcg ctactacgcc agcacccgca ttggttgcta 13680 tcgtggctga acctgtgatt gttgcgcatg ttgctacaga agttgcacca attacaccat 13740 cagttacacc agttgtcgca actcaagcgg ctatcgatgt agcaactatt aacaaagtaa 13800 tgttagaagt tgttgctgat aaaaccggtt atccaacgga tatgctggaa ctgagcatgg 13860 acatggaagc tgacttaggt atcgactcaa tcaaacgtgt tgagatatta ggcgcagtac 13920 aggaattgat ccctgactta cctgaactta atcctgaaga tcttgctgag ctacgcacgc 13980 ttggtgagat tgtcgattac atgaattcaa aagcccaggc tgtagctcct acaacagtac 14040 ctgtaacaag tgcacctgtt tcgcctgcat ctgctggtat tgatttagcc cacatccaaa 14100 acgtaatgtt agaagtggtt gcagacaaaa ccggttaccc aacagacatg ctagaactga 14160 gcatggatat ggaagctgac ttaggtattg attcaatcaa gcgtgtggaa atcttaggtg 14220 cagtacagga gatcataact gatttacctg agctaaaccc tgaagatctt gttgaattac 14280 gcaccctagg tgaaatcgtt agttacatgc aaagcaaagc gccagtcgct gaaagtgcgc 14340 cagtggcgac ggctcctgta gcaacaagct cagcaccgtc tatcgatttg aaccacattc 14400 aaacagtgat gatggatgta gttgcagata agactggtta tccaactgac atgctagaac 14460 ttggcatgga catggaagct gatttaggta tcgattcaat caaacgtgtg gaaatattag 14520 gcgcagtgca ggagatcatc actgatttac ctgagctaaa cccagaagac ctcgctgaat 14580 tacgcacgct aggtgaaatc gttagttaca tgcaaagcaa agcgccagtc gctgagagtg 14640 cgccagtagc gacggcttct gtagcaacaa gctctgcacc gtctatcgat ttaaaccata 14700 tccaaacagt gatgatggaa gtggttgcag acaaaaccgg ttatccagta gacatgttag 14760 aacttgctat ggacatggaa gctgacctag gtatcgattc aatcaagcgt gtagaaattt 14820 taggtgcggt acaggaaatc attactgact tacctgagct taaccctgaa gatcttgctg 14880 aactacgtac attaggtgaa atcgttagtt acatgcaaag caaagcgccc gtagctgaag 14940 cgcctgcagt acctgttgca gtagaaagtg cacctactag tgtaacaagc tcagcaccgt 15000 ctatcgattt agaccacatc caaaatgtaa tgatggatgt tgttgctgat aagactggtt 15060 atcctgccaa tatgcttgaa ttagcaatgg acatggaagc cgaccttggt attgattcaa 15120 tcaagcgtgt tgaaattcta ggcgcggtac aggagatcat tactgattta cctgaactaa 15180 acccagaaga cttagctgaa ctacgtacgt tagaagaaat tgtaacctac atgcaaagca 15240 aggcgagtgg tgttactgta aatgtagtgg ctagccctga aaataatgct gtatcagatg 15300 catttatgca aagcaatgtg gcgactatca cagcggccgc agaacataag gcggaattta 15360 aaccggcgcc gagcgcaacc gttgctatct ctcgtctaag ctctatcagt aaaataagcc 15420 aagattgtaa aggtgctaac gccttaatcg tagctgatgg cactgataat gctgtgttac 15480 ttgcagacca cctattgcaa actggctgga atgtaactgc attgcaacca acttgggtag 15540 ctgtaacaac gacgaaagca tttaataagt cagtgaacct ggtgacttta aatggcgttg 15600 atgaaactga aatcaacaac attattactg ctaacgcaca attggatgca gttatctatc 15660 tgcacgcaag tagcgaaatt aatgctatcg aatacccaca agcatctaag caaggcctga 15720 tgttagcctt cttattagcg aaattgagta aagtaactca agccgctaaa gtgcgtggcg 15780 cctttatgat tgttactcag cagggtggtt cattaggttt tgatgatatc gattctgcta 15840 caagtcatga tgtgaaaaca gacctagtac aaagcggctt aaacggttta gttaagacac 15900 tgtctcacga gtgggataac gtattctgtc gtgcggttga tattgcttcg tcattaacgg 15960 ctgaacaagt tgcaagcctt gttagtgatg aactacttga tgctaacact gtattaacag 16020 aagtgggtta tcaacaagct ggtaaaggcc ttgaacgtat cacgttaact ggtgtggcta 16080 ctgacagcta tgcattaaca gctggcaata acatcgatgc taactcggta tttttagtga 16140 gtggtggcgc aaaaggtgta actgcacatt gtgttgctcg tatagctaaa gaatatcagt 16200 ctaagttcat cttattggga cgttcaacgt tctcaagtga cgaaccgagc tgggcaagtg 16260 gtattactga tgaagcggcg ttaaagaaag cagcgatgca gtctttgatt acagcaggtg 16320 ataaaccaac acccgttaag atcgtacagc taatcaaacc aatccaagct aatcgtgaaa 16380 ttgcgcaaac cttgtctgca attaccgctg ctggtggcca agctgaatat gtttctgcag 16440 atgtaactaa tgcagcaagc gtacaaatgg cagtcgctcc agctatcgct aagttcggtg 16500 caatcactgg catcattcat ggcgcgggtg tgttagctga ccaattcatt gagcaaaaaa 16560 cactgagtga ttttgagtct gtttacagca ctaaaattga cggtttgtta tcgctactat 16620 cagtcactga agcaagcaac atcaagcaat tggtattgtt ctcgtcagcg gctggtttct 16680 acggtaaccc cggccagtct gattactcga ttgccaatga gatcttaaat aaaaccgcat 16740 accgctttaa atcattgcac ccacaagctc aagtattgag ctttaactgg ggtccttggg 16800 acggtggcat ggtaacgcct gagcttaaac gtatgtttga ccaacgtggt gtttacatta 16860 ttccacttga tgcaggtgca cagttattgc tgaatgaact agccgctaat gataaccgtt 16920 gtccacaaat cctcgtgggt aatgacttat ctaaagatgc tagctctgat caaaagtctg 16980 atgaaaagag tactgctgta aaaaagccac aagttagtcg tttatcagat gctttagtaa 17040 ctaaaagtat caaagcgact aacagtagct ctttatcaaa caagactagt gctttatcag 17100 acagtagtgc ttttcaggtt aacgaaaacc actttttagc tgaccacatg atcaaaggca 17160 atcaggtatt accaacggta tgcgcgattg cttggatgag tgatgcagca aaagcgactt 17220 atagtaaccg agactgtgca ttgaagtatg tcggtttcga agactataaa ttgtttaaag 17280 gtgtggtttt tgatggcaat gaggcggcgg attaccaaat ccaattgtcg cctgtgacaa 17340 gggcgtcaga acaggattct gaagtccgta ttgccgcaaa gatctttagc ctgaaaagtg 17400 acggtaaacc tgtgtttcat tatgcagcga caatattgtt agcaactcag ccacttaatg 17460 ctgtgaaggt agaacttccg acattgacag aaagtgttga tagcaacaat aaagtaactg 17520 atgaagcaca agcgttatac agcaatggca ccttgttcca cggtgaaagt ctgcagggca 17580 ttaagcagat attaagttgt gacgacaagg gcctgctatt ggcttgtcag ataaccgatg 17640 ttgcaacagc taagcaggga tccttcccgt tagctgacaa caatatcttt gccaatgatt 17700 tggtttatca ggctatgttg gtctgggtgc gcaaacaatt tggtttaggt agcttacctt 17760 cggtgacaac ggcttggact gtgtatcgtg aagtggttgt agatgaagta ttttatctgc 17820 aacttaatgt tgttgagcat gatctattgg gttcacgcgg cagtaaagcc cgttgtgata 17880 ttcaattgat tgctgctgat atgcaattac ttgccgaagt gaaatcagcg caagtcagtg 17940 tcagtgacat tttgaacgat atgtcatgat cgagtaaata ataacgatag gcgtcatggt 18000 gagcatggcg tctgctttct tcatttttta acattaacaa tattaatagc taaacgcggt 18060 tgctttaaac caagtaaaca agtgctttta gctattacta ttccaaacag gatattaaag 18120 agaatatgac ggaattagct gttattggta tggatgctaa atttagcgga caagacaata 18180 ttgaccgtgt ggaacgcgct ttctatgaag gtgcttatgt aggtaatgtt agccgcgtta 18240 gtaccgaatc taatgttatt agcaatggcg aagaacaagt tattactgcc atgacagttc 18300 ttaactctgt cagtctacta gcgcaaacga atcagttaaa tatagctgat atcgcggtgt 18360 tgctgattgc tgatgtaaaa agtgctgatg atcagcttgt agtccaaatt gcatcagcaa 18420 ttgaaaaaca gtgtgcgagt tgtgttgtta ttgctgattt aggccaagca ttaaatcaag 18480 tagctgattt agttaataac caagactgtc ctgtggctgt aattggcatg aataactcgg 18540 ttaatttatc tcgtcatgat cttgaatctg taactgcaac aatcagcttt gatgaaacct 18600 tcaatggtta taacaatgta gctgggttcg cgagtttact tatcgcttca actgcgtttg 18660 ccaatgctaa gcaatgttat atatacgcca acattaaggg cttcgctcaa tcgggcgtaa 18720 atgctcaatt taacgttgga aacattagcg atactgcaaa gaccgcattg cagcaagcta 18780 gcataactgc agagcaggtt ggtttgttag aagtgtcagc agtcgctgat tcggcaatcg 18840 cattgtctga aagccaaggt ttaatgtctg cttatcatca tacgcaaact ttgcatactg 18900 cattaagcag tgcccgtagt gtgactggtg aaggcgggtg tttttcacag gtcgcaggtt 18960 tattgaaatg tgtaattggt ttacatcaac gttatattcc ggcgattaaa gattggcaac 19020 aaccgagtga caatcaaatg tcacggtggc ggaattcacc attctatatg cctgtagatg 19080 ctcgaccttg gttcccacat gctgatggct ctgcacacat tgccgcttat agttgtgtga 19140 ctgctgacag ctattgtcat attcttttac aagaaaacgt cttacaagaa cttgttttga 19200 aagaaacagt cttgcaagat aatgacttaa ctgaaagcaa gcttcagact cttgaacaaa 19260 acaatccagt agctgatctg cgcactaatg gttactttgc atcgagcgag ttagcattaa 19320 tcatagtaca aggtaatgac gaagcacaat tacgctgtga attagaaact attacagggc 19380 agttaagtac tactggcata agtactatca gtattaaaca gatcgcagca gactgttatg 19440 cccgtaatga tactaacaaa gcctatagcg cagtgcttat tgccgagact gctgaagagt 19500 taagcaaaga aataaccttg gcgtttgctg gtatcgctag cgtgtttaat gaagatgcta 19560 aagaatggaa aaccccgaag ggcagttatt ttaccgcgca gcctgcaaat aaacaggctg 19620 ctaacagcac acagaatggt gtcaccttca tgtacccagg tattggtgct acatatgttg 19680 gtttagggcg tgatctattt catctattcc cacagattta tcagcctgta gcggctttag 19740 ccgatgacat tggcgaaagt ctaaaagata ctttacttaa tccacgcagt attagtcgtc 19800 atagctttaa agaactcaag cagttggatc tggacctgcg cggtaactta gccaatatcg 19860 ctgaagccgg tgtgggtttt gcttgtgtgt ttaccaaggt atttgaagaa gtctttgccg 19920 ttaaagctga ctttgctaca ggttatagca tgggtgaagt aagcatgtat gcagcactag 19980 gctgctggca gcaaccggga ttgatgagtg ctcgccttgc acaatcgaat acctttaatc 20040 atcaactttg cggcgagtta agaacactac gtcagcattg gggcatggat gatgtagcta 20100 acggtacgtt cgagcagatc tgggaaacct ataccattaa ggcaacgatt gaacaggtcg 20160 aaattgcctc tgcagatgaa gatcgtgtgt attgcaccat tatcaataca cctgatagct 20220 tgttgttagc cggttatcca gaagcctgtc agcgagtcat taagaattta ggtgtgcgtg 20280 caatggcatt gaatatggcg aacgcaattc acagcgcgcc agcttatgcc gaatacgatc 20340 atatggttga gctataccat atggatgtta ctccacgtat taataccaag atgtattcaa 20400 gctcatgtta tttaccgatt ccacaacgca gcaaagcgat ttcccacagt attgctaaat 20460 gtttgtgtga tgtggtggat ttcccacgtt tggttaatac cttacatgac aaaggtgcgc 20520 gggtattcat tgaaatgggt ccaggtcgtt cgttatgtag ctgggtagat aagatcttag 20580 ttaatggcga tggcgataat aaaaagcaaa gccaacatgt atctgttcct gtgaatgcca 20640 aaggcaccag tgatgaactt acttatattc gtgcgattgc taagttaatt agtcatggcg 20700 tgaatttgaa tttagatagc tagtttaacg ggtcaatcct ggttaaagca ggccatatag 20760 caaacacgaa caaatagtca acatcgatat ctagcgctgg tgagttatac ctcattagtt 20820 gaaatatgga tttaaagaga gtaattatgg aaaatattgc agtagtaggt attgctaatt 20880 tgttcccggg ctcacaagca ccggatcaat tttggcagca attgcttgaa caacaagatt 20940 gccgcagtaa ggcgaccgct gttcaaatgg gcgttgatcc tgctaaatat accgccaaca 21000 aaggtgacac agataaattt tactgtgtgc acggcggtta catcagtgat ttcaattttg 21060 atgcttcagg ttatcaactc gataatgatt atttagccgg tttagatgac cttaatcaat 21120 gggggcttta tgttacgaaa caagccctta ccgatgcggg ttattggggc agtactgcac 21180 tagaaaactg tggtgtgatt ttaggtaatt tgtcattccc aactaaatca tctaatcagc 21240 tgtttatgcc tttgtatcat caagttgttg ataatgcctt aaaggcggta ttacatcctg 21300 attttcaatt aacgcattac acagcaccga aaaaaacaca tgctgacaat gcattagtag 21360 caggttatcc agctgcattg atcgcgcaag cggcgggtct tggtggttca cattttgcac 21420 tggatgcggc ttgtgcttca tcttgttata gcgttaagtt agcgtgtgat tacctgcata 21480 cgggtaaagc caacatgatg cttgctggtg cggtatctgc agcagatcct atgttcgtaa 21540 atatgggttt ctcgatattc caagcttacc cagctaacaa tgtacatgcc ccgtttgacc 21600 aaaattcaca aggtctattt gccggtgaag gcgcgggcat gatggtattg aaacgtcaaa 21660 gtgatgcagt acgtgatggt gatcatattt acgccattat taaaggcggc gcattatcga 21720 atgacggtaa aggcgagttt gtattaagcc cgaacaccaa gggccaagta ttagtatatg 21780 aacgtgctta tgccgatgca gatgttgacc cgagtacagt tgactatatt gaatgtcatg 21840 caacgggcac acctaagggt gacaatgttg aattgcgttc gatggaaacc tttttcagtc 21900 gcgtaaataa caaaccatta ctgggctcgg ttaaatctaa ccttggtcat ttgttaactg 21960 ccgctggtat gcctggcatg accaaagcta tgttagcgct aggtaaaggt cttattcctg 22020 caacgattaa cttaaagcaa ccactgcaat ctaaaaacgg ttactttact ggcgagcaaa 22080 tgccaacgac gactgtgtct tggccaacaa ctccgggtgc caaggcagat aaaccgcgta 22140 ccgcaggtgt gagcgtattt ggttttggtg gcagcaacgc ccatttggta ttacaacagc 22200 caacgcaaac actcgagact aattttagtg ttgctaaacc acgtgagcct ttggctatta 22260 ttggtatgga cagccatttt ggtagtgcca gtaatttagc gcagttcaaa accttattaa 22320 ataataatca aaataccttc cgtgaattac cagaacaacg ctggaaaggc atggaaagta 22380 acgctaacgt catgcagtcg ttacaattac gcaaagcgcc taaaggcagt tacgttgaac 22440 agctagatat tgatttcttg cgttttaaag taccgcctaa tgaaaaagat tgcttgatcc 22500 cgcaacagtt aatgatgatg caagtggcag acaatgctgc gaaagacgga ggtctagttg 22560 aaggtcgtaa tgttgcggta ttagtagcga tgggcatgga actggaatta catcagtatc 22620 gtggtcgcgt taatctaacc acccaaattg aagacagctt attacagcaa ggtattaacc 22680 tgactgttga gcaacgtgaa gaactgacca atattgctaa agacggtgtt gcctcggctg 22740 cacagctaaa tcagtatacg agtttcattg gtaatattat ggcgtcacgt atttcggcgt 22800 tatgggattt ttctggtcct gctattaccg tatcggctga agaaaactct gtttatcgtt 22860 gtgttgaatt agctgaaaat ctatttcaaa ccagtgatgt tgaagccgtt attattgctg 22920 ctgttgattt gtctggttca attgaaaaca ttactttacg tcagcactac ggtccagtta 22980 atgaaaaggg atctgtaagt gaatgtggtc cggttaatga aagcagttca gtaaccaaca 23040 atattcttga tcagcaacaa tggctggtgg gtgaaggcgc agcggctatt gtcgttaaac 23100 cgtcatcgca agtcactgct gaacaagttt atgcgcgtat tgatgcggtg agttttgccc 23160 ctggtagcaa tgcgaaagca attacgattg cagcggataa agcattaaca cttgctggta 23220 tcagtgctgc tgatgtagct agtgttgaag cacatgcaag tggttttagt gccgaaaata 23280 atgctgaaaa aaccgcgtta ccgactttat acccaagcgc aagtatcagt tcggtgaaag 23340 ccaatattgg tcatacgttt aatgcctcgg gtatggcgag tattattaaa acggcgctgc 23400 tgttagatca gaatacgagt caagatcaga aaagcaaaca tattgctatt aacggtctag 23460 gtcgtgataa cagctgcgcg catcttatct tatcgagttc agcgcaagcg catcaagttg 23520 caccagcgcc tgtatctggt atggccaagc aacgcccaca gttagttaaa accatcaaac 23580 tcggtggtca gttaattagc aacgcgattg ttaacagtgc gagttcatct ttacacgcta 23640 ttaaagcgca gtttgccggt aagcacttaa acaaagttaa ccagccagtg atgatggata 23700 acctgaagcc ccaaggtatt agcgctcatg caaccaatga gtatgtggtg actggagctg 23760 ctaacactca agcttctaac attcaagcat ctcatgttca agcgtcaagt catgcacaag 23820 agatagcacc aaaccaagtt caaaatatgc aagctacagc agccgctgta agttcacccc 23880 tttctcaaca tcaacacaca gcgcagcccg tagcggcacc gagcgttgtt ggagtgactg 23940 tgaaacataa agcaagtaac caaattcatc agcaagcgtc tacgcataaa gcatttttag 24000 aaagtcgttt agctgcacag aaaaacctat cgcaacttgt tgaattgcaa accaagctgt 24060 caatccaaac tggtagtgac aatacatcta acaatactgc gtcaacaagc aatacagtgc 24120 taacaaatcc tgtatcagca acgccattaa cacttgtgta taatgcgcct gtagtagcga 24180 caaacctaac cagtacagaa gcaaaagcgc aagcagctgc tacacaagct ggttttcaga 24240 taaaaggacc tgttggttac aactatccac cgctgcagtt aattgaacgt tataataaac 24300 cagaaaacgt gatttacgat caagctgatt tggttgaatt cgctgaaggt gatattggta 24360 aggtatttgg tgctgaatac aatattattg atggctattc gcgtcgtgta cgtctgccaa 24420 cctcagatta cttgttagta acacgtgtta ctgaacttga tgccaaggtg catgaataca 24480 agaaatcata catgtgtact gaatatgatg tgcctgttga tgcaccgttc ttaattgatg 24540 gtcagatccc ttggtctgtt gccgtcgaat caggccagtg tgatttgatg ttgatttcat 24600 atatcggtat tgatttccaa gcgaaaggcg aacgtgttta ccgtttactt gattgtgaat 24660 taactttcct tgaagagatg gcttttggtg gcgatacttt acgttacgag atccacattg 24720 attcgtatgc acgtaacggc gagcaattat tattcttctt ccattacgat tgttacgtag 24780 gggataagaa ggtacttatc atgcgtaatg gttgtgctgg tttctttact gacgaagaac 24840 tttctgatgg taaaggcgtt attcataacg acaaagacaa agctgagttt agcaatgctg 24900 ttaaatcatc attcacgccg ttattacaac ataaccgtgg tcaatacgat tataacgaca 24960 tgatgaagtt ggttaatggt gatgttgcca gttgttttgg tccgcaatat gatcaaggtg 25020 gccgtaatcc atcattgaaa ttctcgtctg agaagttctt gatgattgaa cgtattacca 25080 agatagaccc aaccggtggt cattggggac taggcctgtt agaaggtcag aaagatttag 25140 accctgagca ttggtatttc ccttgtcact ttaaaggtga tcaagtaatg gctggttcgt 25200 tgatgtcgga aggttgtggc caaatggcga tgttcttcat gctgtctctt ggtatgcata 25260 ccaatgtgaa caacgctcgt ttccaaccac taccaggtga atcacaaacg gtacgttgtc 25320 gtgggcaagt actgccacag cgcaatacct taacttaccg tatggaagtt actgcgatgg 25380 gtatgcatcc acagccattc atgaaagcta atattgatat tttgcttgac ggtaaagtgg 25440 ttgttgattt caaaaacttg agcgtgatga tcagcgaaca agatgagcat tcagattacc 25500 ctgtaacact gccgagtaat gtggcgctta aagcgattac tgcacctgtt gcgtcagtag 25560 caccagcatc ttcacccgct aacagcgcgg atctagacga acgtggtgtt gaaccgttta 25620 agtttcctga acgtccgtta atgcgtgttg agtcagactt gtctgcaccg aaaagcaaag 25680 gtgtgacacc gattaagcat tttgaagcgc ctgctgttgc tggtcatcat agagtgccta 25740 accaagcacc gtttacacct tggcatatgt ttgagtttgc gacgggtaat atttctaact 25800 gtttcggtcc tgattttgat gtttatgaag gtcgtattcc acctcgtaca ccttgtggcg 25860 atttacaagt tgttactcag gttgtagaag tgcagggcga acgtcttgat cttaaaaatc 25920 catcaagctg tgtagctgaa tactatgtac cggaagacgc ttggtacttt actaaaaaca 25980 gccatgaaaa ctggatgcct tattcattaa tcatggaaat tgcattgcaa ccaaatggct 26040 ttatttctgg ttacatgggc acgacgctta aataccctga aaaagatctg ttcttccgta 26100 accttgatgg tagcggcacg ttattaaagc agattgattt acgcggcaag accattgtga 26160 ataaatcagt cttggttagt acggctattg ctggtggcgc gattattcaa agtttcacgt 26220 ttgatatgtc tgtagatggc gagctatttt atactggtaa agctgtattt ggttacttta 26280 gtggtgaatc actgactaac caactgggca ttgataacgg taaaacgact aatgcgtggt 26340 ttgttgataa caataccccc gcagcgaata ttgatgtgtt tgatttaact aatcagtcat 26400 tggctctgta taaagcgcct gtggataaac cgcattataa attggctggt ggtcagatga 26460 actttatcga tacagtgtca gtggttgaag gcggtggtaa agcgggcgtg gcttatgttt 26520 atggcgaacg tacgattgat gctgatgatt ggttcttccg ttatcacttc caccaagatc 26580 cggtgatgcc aggttcatta ggtgttgaag ctattattga gttgatgcag acctatgcgc 26640 ttaaaaatga tttgggtggc aagtttgcta acccacgttt cattgcgccg atgacgcaag 26700 ttgattggaa ataccgtggg caaattacgc cgctgaataa acagatgtca ctggacgtgc 26760 atatcactga gatcgtgaat gacgctggtg aagtgcgaat cgttggtgat gcgaatctgt 26820 ctaaagatgg tctgcgtatt tatgaagtta aaaacatcgt tttaagtatt gttgaagcgt 26880 aaagggtcaa gtgtaacgtg cttaagcgcc gcattggtta aagacgcttt gcacgccgtg 26940 aatccgtcca tggaggcttg gggttggcat ccatgccaac aacagcaagc ttactttaat 27000 caatacggct tggtgtccat ttagacgcct cgaacttagt agttaataga caaaataatt 27060 tagctgtgga atgaatatag taagtaatca ttcggcagct acaaaaaagg aattaagaat 27120 gtcgagttta ggttttaaca ataacaacgc aattaactgg gcttggaaag tagatccagc 27180 gtcagttcat acacaagatg cagaaattaa agcagcttta atggatctaa ctaaacctct 27240 ctatgtggcg aataattcag gcgtaactgg tatagctaat catacgtcag tagcaggtgc 27300 gatcagcaat aacatcgatg ttgatgtatt ggcgtttgcg caaaagttaa acccagaaga 27360 tctgggtgat gatgcttaca agaaacagca cggcgttaaa tatgcttatc atggcggtgc 27420 gatggcaaat ggtattgcct cggttgaatt ggttgttgcg ttaggtaaag cagggctgtt 27480 atgttcattt ggtgctgcag gtctagtgcc tgatgcggtt gaagatgcaa ttcgtcgtat 27540 tcaagctgaa ttaccaaatg gcccttatgc ggttaacttg atccatgcac cagcagaaga 27600 agcattagag cgtggcgcgg ttgaacgttt cctaaaactt ggcgtcaaga cggtagaggc 27660 ttcagcttac cttggtttaa ctgaacacat tgtttggtat cgtgctgctg gtctaactaa 27720 aaacgcagat ggcagtgtta atatcggtaa caaggttatc gctaaagtat cgcgtaccga 27780 agttggtcgc cgctttatgg aacctgcacc gcaaaaatta ctggataagt tattagaaca 27840 aaataagatc acccctgaac aagctgcttt agcgttgctt gtacctatgg ctgatgatat 27900 tactggggaa gcggattctg gtggtcatac agataaccgt ccgtttttaa cattattacc 27960 gacgattatt ggtctgcgtg atgaagtgca agcgaagtat aacttctctc ctgcattacg 28020 tgttggtgct ggtggtggta tcggaacgcc tgaagcagca ctcgctgcat ttaacatggg 28080 cgcggcttat atcgttctgg gttctgtgaa tcaggcgtgt gttgaagcgg gtgcatctga 28140 atatactcgt aaactgttat cgacagttga aatggctgat gtgactatgg cacctgctgc 28200 agatatgttt gaaatgggtg tgaagctgca agtattaaaa cgcggttcta tgttcgcgat 28260 gcgtgcgaag aaactgtatg acttgtatgt ggcttatgac tcgattgaag atatcccagc 28320 tgctgaacgt gagaagattg aaaaacaaat cttccgtgca aacctagacg agatttggga 28380 tggcactatc gctttcttta ctgaacgcga tccagaaatg ctagcccgtg caacgagtag 28440 tcctaaacgt aaaatggcac ttatcttccg ttggtatctt ggcctttctt cacgctggtc 28500 aaacacaggc gagaagggac gtgaaatgga ttatcagatt tgggcaggcc caagtttagg 28560 tgcattcaac agctgggtga aaggttctta ccttgaagac tatacccgcc gtggcgctgt 28620 agatgttgct ttgcatatgc ttaaaggtgc tgcgtattta caacgtgtaa accagttgaa 28680 attgcaaggt gttagcttaa gtacagaatt ggcaagttat cgtacgagtg attaatgtta 28740 cttgatgata tgtgaattaa ttaaagcgcc tgagggcgct ttttttggtt tttaactcag 28800 gtgttgtaac tcgaaattgc ccctttcaag ttagatcgat tactcactca caatatgttg 28860 atatcgcact tgccatatac ttgctcatcc aaagccctat attgataatg gtgttaatag 28920 tctttaatat ccgagtcttt cttcagcata atactaatat agagactcga ccaatgttaa 28980 acacaacaaa gaatatattc ttgtgtactg ccttattatt aacgagtgcg agtacgacag 29040 ctactacgct aaacaattcg atatcagcaa ttgaacaacg tatttctggt cgtatcggtg 29100 tggctgtttt agatacgcaa aataaacaaa cgtgggctta caatggtgat gcacattttc 29160 cgatgatgag tacattcaaa accctcgctt gcgcgaaaat gctaagtgaa tcgacaaatg 29220 gtaatctgga tcccagtact agctcattga taaaggctga agaattaatc ccttggtcac 29280 cagtcactaa aacgtttgtg aataacacta ttacagtggc gaaagcgtgt gaagcaacaa 29340 tgctgaccag tgataatacc gcggctaata ttgttttaca gtatatcgga ggccctcaag 29400 gcgttactgc attcttgcga gaaattggtg atgaagagag tcagttagat cgtatagaac 29460 ctgaattgaa tgaagctaag gtcggagact tgcgtgatac cacgacaccg aaagccatag 29520 ttaccacgct caacaaacta ctacttggtg atgttctact tgatttggat aaaaaccaac 29580 ttaaaacatg gatgcaaaat aataaagtgt cagatccttt actgcgttct atattaccgc 29640 aaggctggtt tattgccgac cgctcaggtg cgggtggtaa tggttctcga ggtataactg 29700 ctatgctttg gcactccgag cgtcaaccgc taatcatcag tatttattta accgaaactg 29760 agttagcaat ggcaatgcgc aatgagatta ttgttgagat cggtaagctg atattcaaag 29820 aatacgcggt gaaataataa gttatttttt gataatactt taacgagcgt agctatcgaa 29880 gtgagggcgt caattagaca cctttgcttc ccctacaaaa tctaatgtgt attacctcgg 29940 ctagtacaat tgccctaagt tatttctgtc cagctttggc ttagtgcaat tgcgttagcc 30000 aatgtgaaca ccaagggact ttgtcgtacc ataactacca agcgactttg tcgtttttat 30060 cttttcttag acaaacagag gttaaatgag tgacgccttc caaatcacag gaatgaatcc 30120 gcatttcaat aaaatctaac ccgtaccaac tccgtacaag ttgatcttta gttgtttaaa 30180 atctataata aattcaatta cggaattaat ccgtacaact ggaggtttta tggctactgc 30240 aagacttgat atccgtttgg atgaagaaat caaagctaag gctgagaaag catcagcttt 30300 actcggctta aaaagtttaa ccgaatacgt tgttcgctta atggacgaag attcaactaa 30360 agtagtttct gagcatgaga gtattaccgt tgaagcgaat gtattcgacc aatttatggc 30420 tgcttgtgat gaagcgaaag ccccaaataa agcattactt gaagccgctg tatttactca 30480 gaatggtgag tttaagtgag ttattccaaa cgtttcaaag aactggataa atcaaaacat 30540 gacagagcat catttgactg tggcgaaaaa gagctaaatg attttatcca aactcaagca 30600 gccaaacata tgcaagcagg tattagccgc actctggttt tacctgcttc tgcgccgtta 30660 ccaaacaaaa aatatccaat ttgctcattt tatagtatcg cgccaagctc aattagccgc 30720 gatacgttac cacaagcaat ggctaaaaag ttaccacgtt atcctatccc tgtttttctt 30780 ttggctcaac ttgccgtcca taaagagttt catgggagtg ggttaggcaa agttagctta 30840 attaaagcgt tagagtacct ttgggaaatt aactctcaca tgagagctta cgccatcgtt 30900 gttgattgtt taactgaaca agctgagtca ttctacgcta aatatggttt cgacgttctc 30960 tgcgaaataa atggtcgagt aagaatgttc atatcaatga aaacagtcaa tcagttattc 31020 acttaacagt aagagttagt ataacagttg tatgaattaa atttattata ttcggtaatc 31080 tcattgcgat cacgctagaa gtgcgagcgg gtcagaccga ggccacaata gcagccgtta 31140 cgtttagggg atgacttaaa aagataacta ctacgtcagt ggcgatccta gaggattaaa 31200 ggtttatgat tcacaacatt tatttattgt gcttaatttt ttctatccaa tatgcgcaag 31260 ctgtaaatat cactgaagta gacttttatg tcagtgatga tatccctaaa gatgttgcca 31320 aattaaagat aggtgaatcc ataacgaact ccagccttat tctaagtaac tcatctattc 31380 cactctcgcg ggagacgggt aacatatatt actcttcatc aattgctaac ttgaactatg 31440 actcgataga atttgttatg gctcaattga tggccgaaga ttccagcctt tacaagatgc 31500 tggtaaatag cgataggttg tccgtgctag taatgacatc ttcccagtcc acagtctcta 31560 tggctcgact tactcggctt attttcctaa tgttgcggtc atcgatttga attgtgactc 31620 gctaacttta gaacatgagc tcggccatct atacggagct gaacatgaag aaatatatga 31680 cgactatgtc ttctatgctg cgatatgtgg agactatacg actatcatga actctatgca 31740 gcctgaaatg aaagaaaaac aaatgataaa ggcatattca ttccctgaat taaaagtgga 31800 tggcttgcag tgcggaaatg aaaatacgaa taacaaaaag gttattttag acaatattgg 31860 tcggtttaga taggattggg atattattct cattcggctc tacttagtgc tgttattatg 31920 agtgccagtg cttctatcta cgatattggt cttaacaagt atttatctat agacgctaag 31980 gtgttatgta tttaagggat gttcaagatg aaactaggtg taaacgatgt atagttgtat 32040 aacatttttt caacggttgg aacgttcgat tctatcgggt aacaagaccg cgacgatccg 32100 cgataagtcc gatagtcatt acttagttgg tcagatgtta gatgcttgta ctcacgaaga 32160 taatcggaaa atgtgtcaaa tagaaatact gagcattgaa tatgtgacgt ttagtgaatt 32220 aaaccgtgcg cacgccaatg ctgaaggttt accgtttttg tttatgctta agtggatagt 32280 tcgaaagatt tatccgactt caaatgattt atttttcata agtttcagag ttgtaactat 32340 cgatatctta taagtcttag tgcacaaaac agaactattt atagcgctca agaaggcgat 32400 aatttgataa tgaattatcg ccttgttact attaagagac tttaaatgac tgagatataa 32460 gatatgacac ggaagaacat attgatcaca ggcgcaagtt cagggttggg ccgaggtatg 32520 gccatcgaat ttgcaaaatc aggtcataac ttagcacttt gtgcacgtag acttgataat 32580 ttagttgcac tgaaagcaga actcttagcc ctcaatcctc acatccaaat cgaaataaaa 32640 cctcttgatg tcaatgaaca tgaacaagtc ttcactgttt tccatgaatt caaagctgaa 32700 tttggtacgc ttgatcgtat tattgttaat gctggattag gcaagggtgg atccgtcggt 32760 acaggttttt tcaaagctaa tctgcaaact gcacaaacta attttattgc ggcgctcgca 32820 caatgtgaag cggcgctcga aatctttagg gcgcaaaatg ctgggcacct agtgacgatt 32880 tcttctatca gcgctgtacg aggattccgc cgtgcgttaa ctgtgtatgc agctactaaa 32940 tcggcactaa catcattaac tgaaggtatc aggattgacg tgatggatac gccaatcaaa 33000 gtgagttgta ttcatcctgg atttattcgc accgagatga atgaaaaagt aaaaacagca 33060 cctttcatga tagatgctga agcgggttgt aaagcgatag tgaaagcaat taataaagaa 33120 aaagcgaata gttatgtacc tagttaccct tgggctatta tgcacttatt actacgtgtg 33180 gcgccaacgc gtttgatccg cagaatgagt taatatcaca gacgcatcaa taaaatttta 33240 aggttctaga aatgatgaag tctcatgttt ggttcaaggc cggtgtagtc atcatatatg 33300 gctcatctat agatgcctct cctcatcgtc atcatgcaat tcaattagcg gcggtgttac 33360 ccaatcccaa gcgaatgtct gcagcaaccc cttcttctta tgtgctcagc cgtgcggcac 33420 aaatttaaga ctcggtgcga tcattaggcg gatctgttta cctgaaaaac ttataacaaa 33480 agctatcgac tgttgaattt atcctgaatg ctttaataga gtgggctggt ggcattacat 33540 gattggaaag ctgaaagaca agtcgttata tttgcaggca gtaaaattaa cactggtatg 33600 gatacttttg attctgtaaa gttcagagta tcagcccctt aacgagcttt ggtataaaca 33660 aatatgaata atcgacagcc taagaaaacc tcttcgacta tatcgacgct caacgaatta 33720 gcgacgttag caaactattc actcatggac acgctaaact gtgatcctga tgcgacagaa 33780 aacggcgacg atcacgcgcc gagacaagtc ctttacgggt cattatgttc ccgtaaaacc 33840 gactccaatc aaagaccctg aatatgtagc gcatagcaaa aatttatttt ctgaacttgg 33900 ctttgccgac agtatggctg agtccgctga ttttgtccgg atgttctctg gtgatatgtc 33960 aggggttcca gtaccaatgc gccaggtagg ttgggcgagt ggctatgcac tttccattta 34020 tggcaccgag tacacccaac agtgcccgtt ccaaactggt aacggatatg gagacggacg 34080 tgcaatttca gtgcttgaga ccctcatcaa gggtcaacgc tgggaaatgc agctgaaagg 34140 cggtggtcgt acaccatatt gccgcggcgc agacggtcgc gctgttttac ggtctagtat 34200 tcgcgagttc ttggctcaag atcacatgca tgcgctcggg gtacctacat cacggtcttt 34260 aagtctgtac gtttcaaaaa cggagacagt taagcgacct tggtactcac agggctcgcg 34320 ttcagagaat cccgacatgc ttatatctga agctgtcgct atctcgacgc gtgttgcacc 34380 gtcgttcatc cgtgttggtc aactcgaact tttcgcgcgc cgcagccgta gtaatgaaca 34440 cccgaaagcg atggaagaac tcgagaagat tgtgctgcac ttgatcgatc gtgaatacgc 34500 tgacgttatc gatacgcagc tagccactcc agaaaaaatc gtgttgctgg ctcgcgagtt 34560 tcgtggccgc cttacctcaa tggttgcgaa ttggatccgt gttggatttt gccaaggtaa 34620 ctttaacagt gataactgcg cagccggtgg ttttacactt gattatggtc cctttggttt 34680 ttgtgatgtg tttaatccgt attatcaacc ttggacgggg gggggtaatc acttctcgtt 34740 catgaaccaa ccaaatgcag cacaacgaaa tttcgatatg ttttgttcgg cgttacggcc 34800 gttactggta tctcatcagc aggatttgct cgcgtttgac gagatccaaa gtgaattttt 34860 agcagtaatg gatacgaaaa tgaaggcgat gtgggctact aaattgggtc ttattaattt 34920 gaagactgag tctgataaag cactgtgtaa cgtactcatc aaagagctac aaacactcat 34980 gatgcaagca cctgttgatt acactatttt cttccgcgaa ctatcctcaa ttcctgacga 35040 tattggccca ctgaagaaaa gtttttacag taatctatac aatgatgcag cggatgatcc 35100 agatacctta gcgttagaaa aatactggat tgagtggctc gaaaaatggc aaatgctcct 35160 taacagtact tgtgacgcga aaggtatctc gtcccgagcc agtgaggaca tcgctatgca 35220 gatgaaactc gtcaacccta aatacgtttt gcgagaatgg ttcgtgatgc cggcttatca 35280 gcaagccact gcgggtgatt attctctcat tcaagagctg caggccgtaa tgacacagcc 35340 atatgcagag cagtcgaagg agctagagga taaatactat cgattgaaac cgcttgagtt 35400 ctttgaggta ggtggattgt cccatcttag ttgctcgtcg tgaacgataa cgcgtcggta 35460 catgtgtatc gacgtatggg cgcttaattt ttattaatat tagaaacaaa aatcgccagc 35520 aaatgctggc gttttaaaga ttaatgtcaa ttattacatc atgcctatat cacgtaggag 35580 atgtggcgat aagcctttta attgaatatc taaagatttt tcttttttat cactaaataa 35640 aatgtcttta gtgtgtttaa tcagtccttt gatagaaaca gcataagctt ttgtatctaa 35700 agcttgtggg atcatattga tgtgcgctgc gtgtgccatt ttagcctcta tctgaattta 35760 ataatttatg ttttaaccag gtgatgtatt gctcatctgg tgaacatagt agcgcattaa 35820 ataaccatgc aataatgata aaaaataaca ctaagcatta gttttgataa tgcattcggc 35880 gctgtgtgac actgtttact gttttataga tattcattca ctttaattgc atataaattg 35940 aattgtttac tccaaatgta gttaaaataa gcacttgtta catcaatgca acaattatac 36000 gctgttaaaa tagccttgat ataccaatga taaataattc tgagtcttta atatttaaaa 36060 tagatgaatt taattcatta gatatactat tacgttgaat tgcgatttac atgcgcattt 36120 agtgtgtttt ttattaaatg aaaattattt tgacgatttt attaacatat ataagaaata 36180 tgtgacttag atctaagtaa acgttaattt atcgccgata aagcagtagt aagcatgttg 36240 catatcaaac cctctctata gatctcaact agcctcaatt atcatcaagt taactgtggt 36300 tttatttatt gctcgtgcgt tcagttatgc ttaaccatga gttaacttca ttctaatatt 36360 tttaacttac agtgaggggt atactctcgg ctcttagaaa tagagagcca aaacatgttt 36420 gaattcgtta ctaattcctc attgaaaaca cacctattgc ttatcaataa tggctatcaa 36480 tagtggttta ttgtttctta cgccacggct tatttttctg aaaatgtact aaatagataa 36540 attatcaata aaaacacaca tcacattaac cgatgtaaac agggaacatc cccatgtatg 36600 aaaatgaaga aaaactaacg aaagcatttg ttattgccgc cataatttgg ggcgttatag 36660 gcatgtgcat gggtttaatg gcagctctgc agctatatct accgcaattg aattttgcta 36720 atgagtatat aaatttcggg aaaataagac ccttgcatac taacgccatc atttttgggt 36780 tggtttgtaa ctttattatc ggtctgtcgt tatacatagt ggcaaaaaca tcagtcgtga 36840 atctagtatc caaaggttta tcgtggttct tgttctgggg ttggcagata acattggtaa 36900 tcggccttat ctcaatcgct ttagggtata catcaaccaa agaatacgct gaatttgagt 36960 ggccaattga tatcgctatt gtggttctct ggttaacgtt tggatatatc ttttttggaa 37020 cgctagcgaa aagaaaaaca aagcatatat ttgtttcaaa ctggttcagt ggcggtgtca 37080 ttattgttat cggcttaatt tacttgataa acaatttagc cattcccgtg tatgcattta 37140 aaggttattc aatattttct ggtgcgagtg atgcgcttgt acagtggtgg tggggacata 37200 atgcagttgg cttcttattg acagctggct ttgtaggtac caactactat ttcattccca 37260 agttagttaa tagacccatt tattcatatc gactgtcttt aattactttt tggggtctaa 37320 tcggctttta tacttgggct ggtacacacc atttactctt tacatccgtt ccatcttgga 37380 ttcaaaatat tggcgtagtg atgtctattt tattatggat cccgtcatgg gctggcgcat 37440 ttaacgcttg gatgacgtgt acttccaata aagaagaatt gaaaacaaat cccgttgtct 37500 ggtttttctt atcgtcaatt gcctattacg cattagcaac gtttgaaggg cctcttatgg 37560 ctatcagatg gttcaatatg atagctcaca ataccagttg ggttatcgga cacgttcact 37620 ctggggcgtt aggttgggtt ggcatgacgt gtatagcaac cttctactat ttcattccta 37680 agctatacaa aaaagaactc tactcatatg gcttagttaa ggtgcatttt gtactcgctc 37740 acataggcgt actgttctac atagtctccc tgtggatagg gggtataggt caaggtgtta 37800 aatcgttaag cctcactgag tctggttctc tgacttattc gtttgttgat attttacgat 37860 ttatggaacc ttatatgctc ggacgtgcaa ttggcggggc gctgtttatc ttgggtatgt 37920 tagtgatggt atataacctc atcatgacgg tgaacaaacc acaaaaagta gttattgaag 37980 gagcatatta atggaagagt caatatccaa gtcagtaatg gcttttatca ctatcacgac 38040 agtcgtggtg ttattttcat tctttgtgtg ggttttccca gggttcttct tcaccaacga 38100 tcttaaagaa ataacgacag ctaaaccata cacagcctta gagttagctg gacgggatgt 38160 gtatatggct gaaggttgcg tggcatgcca tacccagatg gttagaaact tggaaccgga 38220 aagaaaaaga tacggtcgtc ctaataaaat ggaagatgat gtttatgagt ttaacttttt 38280 gtggagctca caaagaactg gccctgattt aacgaatatt ggtttgaagt acacacaagg 38340 ctggcacaaa cagcatctca tcaatcctca ggcagttgtt ccagcctcaa tcatgccaca 38400 atatccgtgg ctgtttgaaa agcaacttaa cgttggtcat gttattgctt caatgaaagc 38460 gatgaaaaaa ctaggtgtgc cgtatacaga cacgcaaatt gaaaattcat caagcaaagt 38520 ggaaggtaaa acaaaaggtg atgcgcttgt tgcttacttg atgagtcttg gcgtagatac 38580 gcgtgaaaaa ggtggggatt taaattaatg ggatccatga acatattatc aagcgtacta 38640 tcgattatct tcttttttat catggttgcc gttatttatt cacagttccg taagaccaaa 38700 actgcagaca gtaataaaac agtagagcaa tttgatggaa tagatgaaaa agatgcacca 38760 attcctaagg ttttctttgt tgcgtatctt attgcgttta taggcgcaat tgtttacgtc 38820 cttctatacc caagtttagc ttcttggaaa gggtttatcg gttggaccga gaacgatgac 38880 gcgtatgtag ctaaatcaat tgatataaac aataacatta acgcaataat caacgcgaat 38940 accgatgaac aagtctttac gctgttacaa aaagatccgc ttgttttgca gagtggtaaa 39000 tcgttatttg gtgataattg ttctgcttgt catggtcagg atgctaaggg gcaatataac 39060 tacccgagtt tagttgataa agattggtta tacggcggct cacctcaaga tgtctatacg 39120 accatacata atggacgtaa gggtaaaatg ccagcttgga aaggtgtact gagcggtaaa 39180 gacatagatg agcttaccca gtatgtgtct gagctaaata aaggaccatt taaaagcaat 39240 gcgcttttcg atgctaattg ttcatcatgt cacggtaaag aggctcaagg ttcacatagc 39300 gtaggagccc ctaacttaac gaatgatatc tggcttcatg gttcaaccaa tgctgatatc 39360 aaacgtaata ttgagaatgg catgtataac gaaatgcctg attttggtca acgccttagc 39420 agaaatcaaa tattgtcttt aacctcttat attgtgtccc tacagagtga accacaagat 39480 aatatcgata ttatgcaagc gaacacttat atcttctctc gaaacgaaca gcaattgccg 39540 gcagtgctaa cgacttgtgt ggcctgtcat ggcgcagatg gtcttggtac tttacctgga 39600 gcgcctaagt tagcaggatt aaagcaagcg tatatctata accaattaca cttgtttgta 39660 tctggtttaa gaaaaaatgc aacgatgcaa aatatagttg ccgacttaga tgtgaaagac 39720 aagttacttg ctgctagcta tttcagttca ctcgattcac cggcgataag taaaattacc 39780 ccagagaaat cagctgacgg tatcatcaaa gatcctactg agcgcctgat atttcaaggt 39840 gattggcaac gcgctattcc tgcttgttct acttgtcatg gtcaagaaac gcaaggtagc 39900 ccatcatttc caagattggc aggtcaatca tctgactatt tagagaaaca attatttgac 39960 tggcgaacag gcgatagaac cggtgatcaa ggtcatatga tgcaaaacgt cgttaacaag 40020 ctacaagatg atgaaattaa atccctgtcg aaatatttat caaaaatgaa ataacctgtg 40080 agccagttaa aggccaatag atcgaaggtt aacagctcaa agattaatag gatactgtaa 40140 ttatgaaaat gaataagtta agaagggaaa tcattaaagc tggtggctat gtcgctttag 40200 ctgctgcacc attaacggct ttctctaaag agtttatgaa atacggcaaa atgtattcag 40260 atggtgaggg agttagctat gccgatggcc ctaagcctgt attaagcaat tttccgcaaa 40320 aagataatgt tgtgatcgta catactcgac cacctcatct tgaaacgcct tttaatgtat 40380 tcaatgaagg gctaataaca ccaaacaacc gtttctttgt tcgttatcat ctagctgacg 40440 tccccgttgc catagacact gataagtaca ctattactat ttcaggggct gttaatgagg 40500 aagtgacatt aagcttggct gaattaaagt cgattgaagg ccaacaagaa attgtcgcgg 40560 tacaacagtg tactggtaat agtcgaggtt attcatctcc acgtgttttt ggtgcgcaat 40620 taagtaatgg cgctatgggg aatgcgaagt tcaaaggcgt gccacttaaa aatgtgttag 40680 ctaaagcggg aatttctagt gctgcgacaa gtgtcattat cgatggtttg gataagccgg 40740 ttcgagatac cacaccagac tttcaaaaat cattacctat tgatcatatt atgacgggcg 40800 aacctatgct tgtttgggaa atgaatggtg aacctttacc atttttaaat ggctttccag 40860 tgaaattaat cgttccgggt tggtatgcaa catattgggt taaacatgta tcgcacctta 40920 aagttataga gggtgagttt gataactttg atgcgttctt tatgacaact gcataccgtc 40980 tacctgataa cgattccaag agtgaattac caactgccag agcgaaaaag acgttacctg 41040 taaatcgttt cccaataaga agttttgtta ctagcttaga aaatggtgat gaagttaatg 41100 ctgcaactag tattgaaatt aaagggatag cttttgatag tggtagtggt atcaaaaaag 41160 ttgaagtttc agtcgatggt ggcaataagt ggatgcaagc agcgcttggt gaaaatcttg 41220 gtcgtttttc ctttcgaggt tggaagttaa gccataattt taatgaaaaa ggcagaacgc 41280 ttgtgatggt aagagctaca ggtaagagtg gagagacaca acctcttaat gcctcttgga 41340 atcatggcgg ttataaccga aacgcgattg aacgaacaag tattaaggtg gtttaaatgc 41400 ggtttttact tattatatta gcgctatgtt cattgactgt taaagctgag atcgtatcaa 41460 ttaccttacc tatggataat accaagctta agccgtcgac attaccagga tatggcctcg 41520 cgcaatctaa atgtcacctt tgtcattcag tcgattacgt tatgtatcaa ccaccagaaa 41580 tggatcc 41587 <210> 2 <211> 7959 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. (7956) <400> 2 atg gct aaa aag aac acc aca tcg att aag cac gcc aag gat gtg tta 48 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 agt agt gat gat caa cag tta aat tct cgc ttg caa gaa tgt ccg att 96 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 gcc atc att ggt atg gca tcg gtt ttt gca gat gct aaa aac ttg gat 144 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 caa ttc tgg gat aac atc gtt gac tct gtg gac gct att att gat gtg 192 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 cct agc gat cgc tgg aac att gac gac cat tac tcg gct gat aaa aaa 240 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 gca gct gac aag aca tac tgc aaa cgc ggt ggt ttc att cca gag ctt 288 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 gat ttt gat ccg atg gag ttt ggt tta ccg cca aat atc ctc gag tta 336 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 act gac atc gct caa ttg ttg tca tta att gtt gct cgt gat gta tta 384 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 agt gat gct ggc att ggt agt gat tat gac cat gat aaa att ggt atc 432 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 acg ctg ggt gtc ggt ggt ggt cag aaa caa att tcg cca tta acg tcg 480 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 cgc cta caa ggc ccg gta tta gaa aaa gta tta aaa gcc tca ggc att 528 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 gat gaa gat gat cgc gct atg atc atc gac aaa ttt aaa aaa gcc tac 576 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 atc ggc tgg gaa gag aac tca ttc cca ggc atg cta ggt aac gtt att 624 Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 gct ggt cgt atc gcc aat cgt ttt gat ttt ggt ggt act aac tgt gtg 672 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 gtt gat gcg gca tgc gct ggc tcc ctt gca gct gtt aaa atg gcg atc 720 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 tca gac tta ctt gaa tat cgt tca gaa gt atg ata tcg ggt ggt gta 768 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 tgt tgt gat aac tcg cca ttc atg tat atg tca ttc tcg aaa aca cca 816 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 gca ttt acc acc aat gat gat atc cgt ccg ttt gat gac gat tca aaa 864 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 ggc atg ctg gtt ggt gaa ggt att ggc atg atg gcg ttt aaa cgt ctt 912 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 gaa gat gct gaa cgt gac ggc gac aaa att tat tct 960 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 atc ggt aca tct tca gat ggt cgt ttc aaa tct att tac gct cca cgc 1008 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Il e Tyr Ala Pro Arg 325 330 335 cca gat ggc caa gca aaa gcg cta aaa cgt gct tat gaa gat gcc ggt 1056 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 350 ttt gcc cct gaa tac ggt cta att gaa ggc cat ggt acg ggt acc 1104 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 aaa gcg ggt gat gcc gca gaa ttt gct ggc ttg acc aaa cac ttt ggc 1152 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 gcc gcc agt gat gaa aag caa tat atc gcc tta ggc tta gtt aaa tcg 1200 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 caa att ggt cat act aaa tct gcg gct ggc tct gcg ggt atg att aag 1248 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 gcg gca tta gcg ctg cat cat aaa atc tta cct gca acg atc cat atc 1296 Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 gat aaa cca agt gaa gcc ttg gat atc aaa aac agc ccg tta tac cta 1344 Asp Lys Pro Ser Glu Ala Le u Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 aac agcc gaa acg cgt cct tgg atg cca cgt gaa gat ggt att cca cgt 1392 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 cgt gca ggt atc agc tca ttt ggt ttt ggc ggc acc aac ttc cat att 1440 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 475 att tta tta gaa gag tat cgc cca ggt cac gat agc gca tta aac 1488 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 tca gtg agc caa act gtg ttg atc tcg gca aac gac caa caa ggt att 1536 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 gtt gct gag tta aat aac tgg cgt act aaa ctg gct gtc gat gct gat 1584 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 525 cat caa ggg ttt gta ttt aat gag tta gtg aca acg tgg cca tta aaa 1632 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 acc cca tcc gtt aac caa gct cgt tta ggt ttt gtt gcg cgt aat gca 1680 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 aat gaa gcg atc gcg atg att gat acg gca ttg aaa caa ttc aat gcg 1728 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys G Ala 565 570 575 aac gca gat aaa atg aca tgg tca gta cct acc ggg gtt tac tat cgt 1776 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 caa gcc ggt att gat gca aca ggt aaa gtg gtt gcg cta ttc tca ggg 1824 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 caa ggt tcg caa tac gtg aac atg ggt cgt gaa tta acc tgt aac ttc 1872 Gln Gly Ser Gln Tyr Val Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 cca agc atg atg cac agt gct gcg gcg atg gat aaa gag ttc agt gcc 1920 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 gct ggt tta ggc cag tta tct gca gtt act ttc cct atc cct gtt tat 1968 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 acg gat gcc gag cgt aag cta caa gaa gag caa tta cgt tta acg caa 2016 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 cat gcg caa cca gcg att ggt agt ttg agt gtt ggt ctg ttc aaa acg 2064 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 ttt aag caa gca ggt ttt aaa gct gat ttt gct gcc ggt cat agt ttc 2112 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 ggt gag tta acc gca tta tgg gct gcc gat gta ttg agc gaa agc gat 2160 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 tac atg atg tta gcg cgt agt cgt ggt caa gca atg gct gcg cca gag 2 Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 caa caa gat ttt gat gca ggt aag atg gcc gct gtt gtt ggt gat cca 2256 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 aag caa gtc gct gtg atc att gat acc ctt gat gat gtc tct att gct 2304 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 aac ttc aac tcg aat aac caa gtt g tt att gct ggt act acg gag cag 2352 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 gtt gct gta gcg gtt aca acc tta ggt aat gct ggt ttc aaa gtt gtg 2400 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 cca ctg ccg gta tct gct gcg ttc cat aca cct tta gtt cgt cac gcg 2448 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 caa aaa cca ttt gct aaa gcg gtt gat agc gct aaa ttt aaa gcg cca 2496 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 agc att cca gtg ttt gct aat ggc acaggc tca agc aaa 2544 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 ccg aat gac att aag aaa aac ctg aaa aac cac atg ctg gaa tct gtt 2592 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 cat ttc aat caa gaa att gac aac atc tat gct gat ggt ggc cgc gta 2640 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 ttt atc ga a ttt ggt cca aag aat gta tta act aaa ttg gtt gaa aac 2688 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 att ctc act gaa aaa tct gat gtg act gct atc gcg gtt aat gct 2736 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 cct aaa caa cct gcg gac gta caa atg cgc caa gct gcg ctg caa atg 2784 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 gca gtg ctt ggt gtc gca tta gac aat att gac ccg tac gac gcc gtt 2832 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 aag cgt cca ctt gtt gcg cc gca tca cca atg ttg atg aag tta 2880 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 tct gca gcg tct tat gtt agt ccg aaa acg aag aaa gcg ttt gct gat 2928 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 gca ttg act gat ggc tgg act gtt aag caa gcg aaa gct gta cct gct 2976 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 98 0 985 990 gtt gtg tca caa cca caa gtg att gaa aag atc gtt gaa gtt gaa aag 3024 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 ata gtt gaa cgc att gtc gaa gt gag cgt gtc gaa gta gaa aaa 3072 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 atc gtc tac gtt aat gct gac ggt tcg ctt ata tcg caa aat aat caa 3120 Ile Val Tyr Val Asn Ala Asp Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 gac gtt aac agc gct gtt gtt agc aac gtg act aat agc tca gtg act 3168 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 cat agc agt gat gct gac ctt gtt gcc tct att gaa cgc agt gtt ggt 3216 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 caa ttt gtt gca cac caa cag caa tta tta aat gta cat gaa cag ttt 3264 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 atg caa ggt cca caa gac tac gcg aaa aca gtg cag aac gta ctt gct 3312 Met Gln Gly Pro Gln Asp Tyr A la Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 gcg cag acg agc aat gaa tta ccg gaa agt tta gac cgt aca ttg tct 3360 Ala Gln Thr Ser Sern Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 atg tat aac gag ttc caa tca gaa acg cta cgt gta cat gaa acg tac 3408 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 ctg aac aat cag acg agc aac atg aac acc atg ctt act ggt gaa 3456 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 gct gat gtg cta gca acc cca ata act cag gta gtg aat aca gcc gtt 3504 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 gcc act agt cac aag gta gtt gct cca gtt att gct aat aca gtg acg 3552 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr 1170 1175 1180 aat gtt gta tct agt gtc agt aat aac gcg gcg gtt gca gtg caa act 3600 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 gtg gca tta gcg cct acg caa gaa atc gct cca aca gtc gct act acg 3648 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 cca gca ccc gca ttg gtt gct atc gtg gct gaa cct gtg att gtt gcg 3696 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 cat gtt gct aca gaa gtt gca cca att aca cca tca gtt aca cca gtt 3744 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 gtc gca act caa gcg gct atc gat gta gca act att aac aaa gta atg 3792 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 tta gaa gtt gtt gct gat aaa acc ggt tat cca acg gat atg ctg gaa 3840 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 ctg agc atg gac atg gaa gct gac tta ggt atc gac tca atc aaa cgt 3888 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 gtt gag ata tta ggc gca gta cag gaa ttg atc cct gac tta cct gaa 3936 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 ctt aat cc t gaa gat ctt gct gag cta cgc acg ctt ggt gag att gtc 3984 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 gat tac atg aat tca aaa gcc cag gct gta gct cct aca aca gta 4032 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 gta aca agt gca cct gtt tcg cct gca tct gct ggt att gat tta gcc 4080 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 cac atc caa aac gta atg tta gaa gtg gtt gca gac aaa acc ggt tac 4128 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 cca aca gac atg cta gaag agc atg gat atg gaa gct gac tta ggt 4176 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 att gat tca atc aag cgt gtg gaa atc tta ggt gca gta cag gag atc 4224 Ile Asp Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 ata act gat tta cct gag cta aac cct gaa gat ctt gtt gaa tta cgc 4272 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 acc cta ggt gaa atc gtt agt tac atg caa agc aaa gcg cca gtc gct 4320 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 gaa agt gcgcca gg acg gct cct gta gca aca agc tca gca ccg 4368 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 tct atc gat ttg aac cac att caa aca gtg atg atg gat gta gtt gca 4416 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 gat aag act ggt tat cca act gac atg cta gaa ctt ggc atg gac atg 4464 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 gaa gct gat tta ggt atc gat tca atc aaa cgt gtg gaa ata tta ggc 4512 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 gca gtg cag gag atc atc act gat c ctta g aac cca gaa gac 4560 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 ctc gct gaa tta cgc acg cta ggt gaa atc gtt agt tac atg caa agc 4608 L eu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 aaa gcg cca gtc gct gag agt gcg cca gta gcg acg gct tct gta gca 4656 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 aca agc tct gca ccg tct atc gat tta aac cat atc caa aca gtg atg 4704 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 atg gaa gtg gtt gca gac aaa acc ggt tat cca gta gac atg tta gaa 4752 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 ctt gct atg gac atg gaa gct gac cta ggt atc gat tca atc aag cgt 4800 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 gta gaa att tta ggt gcg gta cag gaa atc att act gac tta cct gag 4848 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 ctt aac cct gaa gat ctt gct gaa cta cgt aca tta ggt gaa atc gtt 4896 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 agt tac atg caa agc aaa gcg c cc gta gct gaa gcg cct gca gta cct 4944 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 gtt gca gta gaa agt gca cct act agt gta aca agc tca gca ccg tct 4992 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 atc gat tta gac cac atc caa aat gta atg atg gat gtt gtt gct gat 5040 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 aag act ggt tat cct gcc aat atg ctt gaa tta gca atg gac atg gaa 5088 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 gcc gac ctt ggt att gat tca atc aag cgt gtt att cta ggc gcg 5136 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 gta cag gag atc att act gat tta cct gaa cta aac cca gaa gac tta 5184 Val Gln Glu Ile Ile Thr Asp Leu Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 gct gaa cta cgt acg tta gaa gaa att gta acc tac atg caa agc aag 5232 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 173 0 1735 1740 gcg agt ggt gtt act gta aat gta gtg gct agc cct gaa aat aat gct 5280 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 gta tca gat gca ttt atg caa agc aat gtg gcg act atc aca gcg gcc 5328 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 gca gaa cat aag gcg gaa ttt aaa ccg gcg ccg agc gca acc gtt gct 5376 Ala Glu His Lys Alu Glu Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 atc tct cgt cta agc tct atc agt aaa ata agc caa gat tgt aaa ggt 5424 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 gct aac gcc tta atc gta gct gat ggc act gat aat gct gtg tta ctt 5472 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 gca gac cac cta ttg caa act ggc tgg aat gta act gca ttg caa 5520 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 act tgg gta gct gta aca acg acg aaa gca ttt aat aag tca gtg aac 5568 Thr Trp Val Ala Val T hr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 ctg gtg act tta aat ggc gtt gat gaa act gaa atc aac aac att att 5616 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 act gct aac gca caa ttg gat gca gtt atc tat ctg cac gca agt agc 5664 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 gaa att aat gct atc gaa tac cca caa gca tct aag caa ctg atg 5712 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 tta gcc ttc tta tta gcg aaa ttg agt aaa gta act caa gcc gct aaa 5760 Leu Ala Phe Leu Leu Ala Lys Leu Sers Thr Gln Ala Ala Lys 1905 1910 1915 1920 gtg cgt ggc gcc ttt atg att gtt act cag cag ggt ggt tca tta ggt 5808 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 ttt gat gat atc gat tct gct aca agt cat gat gtg aaa aca gac cta 5856 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 gta caa agc ggc tta aac ggt tta gtt aag aca ctg tct cac gag tgg 5904 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 gat aac gta ttc tgt cgt gcg gtt gat att gct tcg tca tta acg gct 5952 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 gaa caa gtt gca agc ctt gtt agt gat gaa cta ctt gat gct aac act 6000 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 gta tta aca gaa gtg ggt tat caa caa gct ggt aaa ggc ctt gaa cgt 6048 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 atc acg tta act ggt gtg gct act gac agc tat gca tta aca gct ggc 60 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 aat aac atc gat gct aac tcg gta ttt tta gtg agt ggt ggc gca aaa 6144 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 ggt gta act gca cat tgt gtt gct cgt ata gct aaa gaa tat cag tct 6192 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 aag ttc atc tta ttg gga cgt tca acg ttc tca agt gac gaa ccg agc 6240 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 tgg gca agt ggt att act gat gaa gcg gcg tta aag aaa atg 6288 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 cag tct ttg att aca gca ggt gat aaa cca aca ccc gtt aag atc gta 6336 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 cag cta atc aaa cca atc caa gct aat cgt gaa att gcg caa acc ttg 6384 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 tct gca att acc gct gct ggt ggc caa gct gaa tat gtt tct gca gat 6432 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 gta act aat gca gca agc gta caa atg gca gtc gct cca gct atc gct 6480 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 aag ttc ggt gca atc act ggc atc att cat ggc gcg ggt gtg tta gct 6528 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 gac caa ttc att gag caa aaa aca ctg agt gat ttt gag tct gtt tac 6576 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 agc act aaa att gac ggt ttg tta tcg cta cta tca gtc act gaa gca 6624 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 agc aac atc aag caa ttg gta ttg ttc tcg tca gcg gct ggt ttct Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 ggt aac ccc ggc cag tct gat tac tcg att gcc aat gag atc tta aat 6720 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 aaa acc gca tac cgc ttt aaa tca ttg cac cca caa gct caa gta ttg 6768 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 agc ttt aac tgg ggt cct tgg gac ggt atg gta acg cct gag ctt 6816 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 aaa cgt atg ttt gac caa cgt ggt gtt tac att att cca ctt gat gca 686 4 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 ggt gca cag tta ttg ctg aat gaa cta gcc gct aat gat aac cgt tgt 6912 Gly Ala Gln Leu Leu Leu Asn Glu Lea Ala Asn Asn Arg Cys 2290 2295 2300 cca caa atc ctc gtg ggt aat gac tta tct aaa gat gct agc tct gat 6960 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 caa aag tct gat gaa aag act gct gta aaa aag cca caa gtt agt 7008 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 cgt tta tca gat gct tta gta act aaa agt atc aaa gcg act aac agt 7056 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 agc tct tta tca aac aag act agt gct tta tca gac agt agt gct ttt 7104 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 cag gtt aac gaa aac cac ttt tta gct gac cac atg atc aaa ggc aat 7152 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 cag gta tta cca acg gta tg c gcg att gct tgg atg agt gat gca gca 7200 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 aaa gcg act tat agt aac cga gac tgt gca ttg aag tat gtc ggt ttc 7248 Lys A Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 gaa gac tat aaa ttg ttt aaa ggt gtg gtt ttt gat ggc aat gag gcg 7296 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu 2420 2425 2430 gcg gat tac caa atc caa ttg tcg cct gtg aca agg gcg tca gaa cag 7344 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 gat tct gaa gtc cgt att gcc gca agc ctg aaa agt gac 7392 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 ggt aaa cct gtg ttt cat tat gca gcg aca ata ttg tta gca act cag 7440 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 cca ctt aat gct gtg aag gta gaa ctt ccg aca ttg aca gaa agt gtt 7488 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Se r Val 2485 2490 2495 gat agc aac aat aaa gta act gat gaa gca caa gcg tta tac agc aat 7536 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 ggc acc ttg ttc cac ggt gaagt cag ggc att aag cag ata tta 7584 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 agt tgt gac gac aag ggc ctg cta ttg gct tgt cag ata acc gat gtt 7632 Ser Cys Asp Asp Lys Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 gca aca gct aag cag gga tcc ttc ccg tta gct gac aac aat atc ttt 7680 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 gcc aat gat ttg gtt tat cag gct atg ttg gtc tgg gtg cgc aaa caa 7728 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 ttt ggt tta ggt agc tta cct tcg gtg agg ggt tat 7776 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 cgt gaa gtg gtt gta gat gaa gta ttt tat ctg caa ctt aat gtt gtt 7824 Arg Glu Val Val V al Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 gag cat gat cta ttg ggt tca cgc ggc agt aaa gcc cgt tgt gat att 7872 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 caa ttg att gct gct gat atg caa tta ctt gcc gaa gtg aaa tca gcg 7920 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 caa gtc agt gtc agt gac att atg tga 7959 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 3 <211> 2652 <212> PRT <213> Moritella marina <400> 3 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 Ile Gly Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 Ala Gly Arg I le Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Ile Tyr Ala Pro Arg 325 330 335 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 Ala Ala Leu A la Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 Asp Lys Pro Ser Glu Ala Leu Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 480 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 495 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys Gln Phe Asn Ala 565 570 575 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 Gln Gly Ser Gln Tyr Val Asn Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 Pro Ser Met MetHis Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 715 720 Tyr Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 980 985 990 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 Ile Val Tyr Val Asn Ala Asp Gly Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 Asp Va l Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 Met Gln Gly Pro Gln Asp Tyr Ala Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 Ala Gln Thr Ser Asn Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr Thr 1170 1175 1180 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 Glu S er Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Al a Pro Ala Val Pro 1635 1640 1645 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 1730 1735 1740 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 Ala Glu His Lys Ala Glu Phe Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 1840 Thr Trp Val Ala Val Thr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 Leu Ala Phe Leu Leu Ala Lys Leu Ser Lys Val Thr Gln Ala Ala Lys 1905 1910 1915 1920 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val S er Gly Gly Ala Lys 2035 2040 2045 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 2190 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 Ser Asn Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 Gly Ala Gln Leu Leu Leu Asn Glu Leu Ala Ala Asn Asp Asn Arg Cys 2290 2295 2300 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 Lys Ala Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu Ala 2420 2425 2430 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Ser Val 2485 2490 2495 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 Ser Cys Asp Asp Lys Gly Leu Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 Arg Glu Val Val Val Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 Gl n Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 4 <211> 2598 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. (2595) <400> 4 atg acg gaa tta gct gtt att ggt atg gat gct aaa ttt agc gga caa 48 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 gac aat att gac cgt gtg gaa cgc gct ttc tat gaa ggt gct tat gta 96 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Gly Ala Tyr Val 20 25 30 ggt aat gtt agc cgc gtt agt acc gaa tct aat gtt att agc aat ggc 144 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 gaa gaa caa gtt att act gcc atg aca gtt ctt aac tct gtc agt cta 192 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 cta gcg caa acg aat cag tta aat ata gct gat atc gcg gtg ttg ctg 240 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 att gct gat gta aaa agt gct gat gat cag ctt gta gtc att gca 288 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 tca gca att gaa aaa cag tgt gcg agt tgt gtt gtt att gct gat tta 336 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 ggc caa gca tta aat caa gta gct gat tta gtt aat aac caa gac tgt 384 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 cct gtg gct gta att ggc atg aat aac tcg gtt tatat cgt cat 432 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 gat ctt gaa tct gta act gca aca atc agc ttt gat gaa acc ttc aat 480 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 ggt tat aac aat gta gct ggg ttc gcg agt tta ctt atc gct tca act 528 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 gcg ttt gcc aat gct aag caa tgt tat ata tac gcc aac att aag ggc 576 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 ttc gct caa tcg ggc gta aat gct caa ttt aac gtt gga aac att ag ag Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 gat act gca aag acc gca ttg cag caa gct agc ata act gca gag cag 672 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln210 215 220 gtt ggt ttg tta gaa gtg tca gca gtc gct gat tcg gca atc gca ttg 720 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 tct gaa agc caa ggt tta atg tct gct cat cat acg caa act ttg 768 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 cat act gca tta agc agt gcc cgt agt gtg act ggt gaa ggc ggg tgt 816 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 ttt tca cag gtc gca ggt tta ttg aaa tgt gta att ggt tta cat caa 864 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 cgt tat att ccg gcg att aaa gat tgg caa caa ccg agt gac aat caa 912 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 atg tca cgg tgg cgg aat tca cca ttc tat atg cct gta gat gct 960 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 cct tgg ttc cca cat gct gat ggc tct gca cac att gcc gct tat agt 1008 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Il e Ala Ala Tyr Ser 325 330 335 tgt gtg act gct gac agc tat tgt cat att ctt tta caa gaa aac gtc 1056 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 tta caa gaa ctt gtt ttg aaa gaa aca gtc ttg caa gat aat gac tta 1104 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 act gaa agc aag ctt cag act ctt gaa caa aac aat cca gta gct gat 1152 Thr Glu Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 ctg cgc act aat ggt tac ttt gca tcg agc gag tta gca tta atc ata 1200 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 gta caa ggt aat gac gaa gca caa tta cgc tgt gaa tta gaa act att 1248 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 aca ggg cag tta agt act act ggc ata agt act atc agt att aaa cag 1296 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 atc gca gca gac tgt tat gcc cgt aat gat act aac aaa gcc tat agc 1344 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445 gca gtg ctt att gcc gag act gct gaa gag tta agc aaa gaa ata acc 1392 Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 ttg gc ttt gct ggt atc gct agc gtg ttt aat gaa gat gct aaa gaa 1440 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 475 480 tgg aaa acc ccg aag ggc agt tat ttt acc gcag cag aaa 1488 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 cag gct gct aac agc aca cag aat ggt gtc acc ttc atg tac cca ggt 1536 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 att ggt gct aca tat gtt ggt tta ggg cgt gat cta ttt cat cta ttc 1584 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 cca cag att tat cag cct gta gcg gct tta gcc gat gac att ggc gaa 1632 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 agt cta aaa gat act tta ctt aat cca cgc agt att agt cgt cat agc 1680 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 ttt aaa gaa ctc aag cag ttg gat ctg gac ctg cgc ggt aac tta gcc 1728 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Ala 565 570 575 aat atc gct gaa gcc ggt gtg ggt ttt gct tgt gtg ttt acc aag gta 1776 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 ttt gaa gaa gtc ttt gcc gtt gtt aaa ttt gct aca ggt tat agc 1824 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 atg ggt gaa gta agc atg tat gca gca cta ggc tgc tgg cag caa ccg 1872 Met Gly Glu Val Ser Met Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 gga ttg atg agt gct cgc ctt gca caa tcg aat acc ttt aat cat caa 1920 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 630 635 640 ctt tgc ggc gag tta aga aca cta cgt cag cat tgg ggc atg gat gat 1968 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 gta gct aac ggt acg ttc gag cag atc tgg gaa acctat cc att aag 2016 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 gca acg att gaa cag gtc gaa att gcc tct gca gat gaa gat cgt gtg 2064 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 tat tgc acc att atc aat aca cct gat agc ttg ttg tta gcc ggt tat 2112 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 cca gaa gcc tgt ca cga gtc att aag aat tta ggt gtg cgt gca atg 2160 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 gca ttg aat atg gcg aac gca att cac agc gcg cca gc tat gcc Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 tac gat cat atg gtt gag cta tac cat atg gat gtt act cca cgt att 2256 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 aat acc aag atg tat tca agc tca tgt tat tta ccg att cca caa cgc 2304 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 agc aaa gcg att tcc cac agt att gc t aaa tgt ttg tgt gat gtg gtg 2352 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 gat ttc cca cgt ttg gtt aat acc tta cat gac aaa ggt gcg cgg gta 2400 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 ttc att gaa atg ggt cca ggt cgt tcg tta tgt agc tgg gta gat aag 2448 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 atc tta gtt aat ggc gat ggc gat aat aaa aag caa agc caa cat gta 2496 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 tct gtt cct gtg aat gcc aaa ggc acc agt gatga act tat att 2544 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 cgt gcg att gct aag tta att agt cat ggc gtg aat ttg aat tta gat 2592 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 agc tag 2598 Ser 865 <210> 5 <211> 865 <212> PRT <213> Moritella marina <400> 5 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Glu Gly Ala Tyr Val 20 25 30 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 Phe Ala Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 Asp Thr Ala Ly s Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln 210 215 220 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Ile Ala Ala Tyr Ser 325 330 335 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 Thr Glu Ser Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 Thr Gly Gln L eu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445 Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 480 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 490 495 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 510 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 560 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Leu Ala 565 570 570 575 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 Met Gly Glu Val Ser Met Tyr Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 Gly Leu Met SerAla Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 635 640 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 710 715 720 Ala Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 730 735 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 775 780 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 Ser Val Pro ValAsn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 Ser 865 <210> 6 <211> 6036 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. (6033) <400> 6 atg gaa aat att gca gta gta ggt att gct aat ttg ttc ccg ggc tca 48 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 caa gca ccg gat caa ttt tgg cag caa ttg ctt gaa caa caa gat tgc 96 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 cgc agt aag gcg acc gct gtt caa atg ggc gtt gat cct gct aaa tat 144 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 acc gcc aac aaa ggt gac aca gat aaa ttt tac tgt gtg cac ggc ggt 192 Thr Ala Asn Lys Gly Asp Thrasp As Lys Phe Tyr Cys Val His Gly Gly 50 55 60 tac atc agt gat ttc aat ttt gat gct tca ggt tat caa ctc gat aat 240 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 gat tat tta gcc ggt tta gat gac ctt aat caa tgg ggg ctt tat gtt 288 Asp Tyr Leu Ala Gly Leu Asp Asp Leu Asn Gln Trp Gly Leu Tyr Val 85 90 95 acg aaa caa gcc ctt acc gat gcg ggt tat tgg ggc agt act gca cta 336 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 gaa aac tgt ggt gtg att tta ggt aat ttg tca ttc cca act aaa tca 384 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 tct aat cag ctg ttt atg cct ttg tat cat caa gtt gtt gat aat gcc 432 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 tta aag gcg gta tta cat cct gat ttt caa tta acg cat tac aca gca 480 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 ccg aaa aaa aca cat gct gac aat gca tta gta gca ggt tat cca gct 528 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 gca ttg atc gcg caa gcg gcg ggt ctt ggt ggt tca cat ttt gca ctg 576 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 gat gcg gct tgt gct tca tct tgt tat agc gtt aag tta gcg tgt gat 624 Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 tac ctg cat acg ggt aaa gcc aac atg atg ctt gct ggt gcg gta tct 672 Tyr Leu His Thr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 gca gca gat cct atg ttc gta aat atg ggt ttc tcg ata ttc caa gct 720 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 tac cca gct aac aat gta cat gcc ccg ttt gac caa aat tca caa ggt 768 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 cta ttt gcc ggt gaa ggc gcg ggc atg atg gta ttg aaa cgt caa agt 816 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 270 gat gca gta cgt gat ggt gat cat att tac gcc att att aaa ggc ggc 864 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 gca tta tcg aat gac ggt aaa ggc gag ttt gta tta agc ccg aac acc 912 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 aag ggc caa gta tta gta tat gaa cgt gct tat gcc gat gca gat 960 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 gac ccg agt aca gtt gac tat att gaa tgt cat gca acg ggc aca cct 1008 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His A la Thr Gly Thr Pro 325 330 335 aag ggt gac aat gtt gaa ttg cgt tcg atg gaa acc ttt ttc agt cgc 1056 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 gta aat aac aaa cca tta ctg ggc tcg gtt aaa tct aac ctt ggt cat 1104 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 ttg tta act gcc gct ggt atg cct ggc atg acc aaa gct atg tta gcg 1152 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 cta ggt aaa ggt ctt att cct gca acg att aac tta aag caa cca ctg 1200 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 caa tct aaa aac ggt tac ttt act ggc gag caa atg cca acg acg act 1248 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 gtg tct tgg cca aca act ccg ggt gcc aag gca gat aaa ccg cgt acc 1296 Val Ser Trp Pro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 gca ggt gtg agc gta ttt ggt ttt ggt ggc agc aac gcc cat ttg gta 1344 Ala Gly Val Ser Val Phe Gl y Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 tta caa cag cca acg caa aca ctc gag act aat ttt agt gtt gct aaa 1392 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 cca cgt gag cct ttg gct att att ggt atg gac agc cat ttt ggt agt 1440 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 475 480 gcc agt aat tta gcg cag ttc aaa acc tta tta aat aat aat caa aat 1488 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asn Asn Asn Gln Asn 485 490 495 acc ttc cgt gaa tta cca gaa caa cgc tgg aaa ggc atg gaa agt aac 1536 Thr Phe Arg Glu Leu Pro Glu Gln Arg Gly Met Glu Ser Asn 500 505 510 gct aac gtc atg cag tcg tta caa tta cgc aaa gcg cct aaa ggc agt 1584 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 tac gtt gat gag cacta att gat ttc ttg cgt ttt aaa gta ccg cct 1632 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 aat gaa aaa gat tgc ttg atc ccg caa cag tta atg atg atg Asa glugg 1680 Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 560 gca gac aat gct gcg aaa gac gga ggt cta gtt gaa ggt cgt aat gtt 1728 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Asg Val 565 570 575 gcg gta tta gta gcg atg ggc atg gaa ctg gaa tta cat cag tat cgt 1776 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 ggt cgc gtt aat cta accga caat gac agc tta tta cag caa 1824 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 ggt att aac ctg act gtt gag caa cgt gaa gaa ctg acc aat att gct 1872 Gly Ile Asn Leu Thr Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 aaa gac ggt gtt gcc tcg gct gca cag cta aat cag tat acg agt ttc 1920 Lys Asp Gly Val Ala Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 630 630 att ggt aat att atg gcg tca cgt att tcg gcg tta tgg gat ttt tct 1968 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 ggt cct gct att acc gta tcg gct gaa gaa aac tct tat cgt tgt 2016 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 gtt gaa tta gct gaa aat cta ttt caa acc agt gat gtt gaa gcc gtt 2064 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 att att gct gct gtt gat ttg tct ggt tca att gaa aac att act tta 2112 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 cgt cag cac tac ggt cca gtt aat gaa aag gga tct gta agt gaa tgt 2160 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 ggt ccg gtt aat gaa agc agt tca gta acc aac aat att ctt gat cag 2208 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 caa caa tgg ctg gtg ggt gaa ggc gca gcg gct att gtc gtt aaa ccg 2256 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Vals Pro 740 745 750 tca tcg caa gtc act gct gaa caa gtt tat gcg cgt att gat gcg gtg 2304 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 agt ttt gcc cct ggt agc aat gc g aa gca att acg att gca gcg gat 2352 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 aaa gca tta aca ctt gct ggt atc agt gct gct gat gta gct agt gtt 2400 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Asp Val Ala Ser Val 785 790 795 800 gaa gca cat gca agt ggt ttt agt gcc gaa aat aat gct gaa aaa acc 2448 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 gcg tta ccg act tta tac cca agc gca agt atc agt tcg gtg aaa gcc 2496 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 aat att ggt cat acg ttt aat gcc tcg ggt atg gcgt att att aaa 2544 Asn Ile Gly His Thr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 acg gcg ctg ctg tta gat cag aat acg agt caa gat cag aaa agc aaa 2592 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 cat att gct att aac ggt cta ggt cgt gat aac agc tgc gcg cat ctt 2640 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 875 880 atc tta tc g agt tca gcg caa gcg cat caa gtt gca cca gcg cct gta 2688 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 tct ggt atg gcc aag caa cgc cca cag tta gtt aaa acc atc aaa ctc 2736 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 ggt ggt cag tta att agc aac gcg att gtt aac agt gcg agt tca tct 2784 Gly Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 tta cac gct att aaa gcg cag ttt gcc ggt aag cac tta aac aaa gtt 2832 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 aac cag cca gtg atg atg gat aac ctg aag ccc caa ggt att agc gct 2880 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 cat gca acc aat gag tat gtg gtg act gga gct gct aac act caa gct 2928 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 tct aac att caa gca tct cat gtt caa gcg tca agt cat gca caa gag 2976 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 ata gca cca aac caa gtt caa aat atg caa gct aca gca gcc gct gta 3024 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Ala Val 995 1000 1005 agt tca ccc ctt tct caa cat caa cac aca gcg cag ccc gta gcg gca 3072 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 ccg agc gtt gtt gga gtg act gtg aaa cat aaa gca agt aac caa att 3120 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 cat cag caa gcg tct acg cat aaa gca ttt tta gaa agt cgt tta gct 3168 His Gln Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 gca cag aaa aac cta tcg caa ctt gtt gaa ttg caa acc aag ctg tca 3216 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 atc caa act ggt agt gac aat aca tct aac aat act gcg 3ca aca ag Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 aat aca gtg cta aca aat cct gta tca gca acg cca tta aca ctt gtg 3312 Asn Thr Val Leu Thr Asn Pro V al Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 tat aat gcg cct gta gta gcg aca aac cta acc agt aca gaa gca aaa 3360 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 gcg caa gca gct gct aca caa gct ggt ttt cag ata aaa gga cct gtt 3408 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 ggt tac aac tat cca ccg ctg cag tta att gaa cgt tat a cca 3456 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 gaa aac gtg att tac gat caa gct gat ttg gtt gaa ttc gct gaa ggt 3504 Glu Asn Val Ile Tyr Asp Gln Ala Asp Le Phe Ala Glu Gly 1155 1160 1165 gat att ggt aag gta ttt ggt gct gaa tac aat att att gat ggc tat 3552 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 tcg cgt cgt gcac acc tca gat tac ttg tta gta aca cgt 3600 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 gtt act gaa ctt gat gcc aag gtg cat gaa tac aag aaa tca tac atg 3648 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Ser Tyr Met 1205 1210 1215 tgt act gaa tat gat gtg cct gtt gat gca ccg ttc tta att gat ggt 3696 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 cag atc cct tgg tct gtt gcc gtc gaa tca ggc cag tgt gat ttg atg 3744 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 ttg att tca tat at ggt att gat ttc caa gcg aaa ggc gaa cgt gtt 3792 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 tac cgt tta ctt gat tgt gaa tta act ttc ctt gaa gag atg gtt gtt Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 ggt ggc gat act tta cgt tac gag atc cac att gat tcg tat gca cgt 3888 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 aac ggc gag caa tta tta ttc ttc ttc cat tac gat tgt tac gta ggg 3936 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 1310 gat aag aag gta ctt atc atg cgt aat ggt tgt gct ggt ttc ttt act 3984 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 gac gaa gaa ctt tct gat ggt aaa ggc gtt att cat aac gac aaa g Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 aaa gct gag ttt agc aat gct gtt aaa tca tca ttc acg ccg tta tta 4080 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 caa cat aac cgt ggt caa tac gat tat aac gac atg atg aag ttg gtt 4128 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 aat ggt gat gtt gcc agt tgt ggt ccg caa tat gat caa ggt ggc 4176 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 cgt aat cca tca ttg aaa ttc tcg tct gag aag ttc ttg atg att gaa 4224 Arg Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 cgt att acc aag ata gac cca acc ggt ggt cat tgg gga cta ggc ctg 4272 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 tta gaa ggt cag aaa gat tta gac cct gag cat tgg tat ttc cct tgt 4320 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 cac ttt aaa ggt gat ca gta atg gct ggt tcg ttg atg tcg gaa ggt 4368 His Phe Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 tgt ggc caa atg gcg atg ttc ttc atg ctg tct ctt ggt atg cat Gcc 4416 Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 aat gtg aac aac gct cgt ttc caa cca cta cca ggt gaa tca caa acg 4464 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 gta cgt tgt cgt ggg caa gta ctg cca cag cgc aat acc tta act tac 4512 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 cgt atg gaa gtt act gcg atg ggt atg cat cca cag cca ttc atg aaa 4560 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 gct aat att gat att ttg ctt gac ggt aaa gtg gtt gtt gat ttc aaa 4608 Al a Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 aac ttg agc gtg atg atc agc gaa caa gat gag cat tca gat tac cct 4656 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 gta aca ctg ccg agt aat gtg gcg ctt aaa gcg att act gca cct gtt 4704 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 gcg tca gta gca cca gca tct tca ccc gct aac agc gcg gat cta gac 4752 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 gaa cgt ggt gtt gaa ccg ttt aag ttt cct gaa cgt ccg tta atg cgt 4800 Glu Arg Gly Val Glu Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 gtt gag tca gac ttg tct gca ccg aaa agc aaa ggt gtg aca ccg att 4848 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 aag cat ttt gaa gcg cct gct gtt gct ggt cat cat aga gtg cct aac 4896 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 caa gca ccg ttt aca cct tgg c at atg ttt gag ttt gcg acg ggt aat 4944 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 att tct aac tgt ttc ggt cct gat ttt gat gtt tat gaa ggt cgt att 4992 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 cca cct cgt aca cct tgt ggc gat tta caa gtt gtt act cag gtt gta 5040 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 gaa gtg cag ggc gaa cgt ctt gat ctt aaa aat cca tca agc tgt gta 5088 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 gct gaa tac tat gta ccg gaa gac gct act aaa aac agc 5136 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 cat gaa aac tgg atg cct tat tca tta atc atg gaa att gca ttg caa 5184 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 cca aat ggc ttt att tct ggt tac atg ggc acg acg ctt aaa tac cct 5232 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 173 0 1735 1740 gaa aaa gat ctg ttc ttc cgt aac ctt gat ggt agc ggc acg tta tta 5280 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 aag cag att gat c cag acc at gtg aat aaa tca gtc ttg 5328 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 gtt agt acg gct att gct ggt ggc gcg att att caa agt ttc acg ttt 5376 Val Ser Thr Ala Ile Ala Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 gat atg tct gta gat ggc gag cta ttt tat act ggt aaa gct gta ttt 5424 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 ggt tac ttt agt ggt gaa tca ctg act aac caa ctg ggc att gat aac 5472 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 ggt aaa acg act aat gcg tgg ttt gtt gat aac ac acc gcc gca 5520 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 aat att gat gtg ttt gat tta act aat cag tca ttg gct ctg tat aaa 5568 Asn Ile Asp Val Phe A sp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 gcg cct gtg gat aaa ccg cat tat aaa ttg gct ggt ggt cag atg aac 5616 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gln Met Asn 1860 1865 1870 ttt atc gat aca gtg tca gtg gtt gaa ggc ggt ggt aaa gcg ggc gtg 5664 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 gct tat gtt tat ggc gaa cgt acg att gat gct ttc ttc 5712 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 cgt tat cac ttc cac caa gat ccg gtg atg cca ggt tca tta ggt gtt 5760 Arg Tyr His Phe His Gln Asp Pro Val Met Gly Ser Leu Gly Val 1905 1910 1915 1920 gaa gct att att gag ttg atg cag acc tat gcg ctt aaa aat gat ttg 5808 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 ggt ggc aag t aac cca cgt ttc att gcg ccg atg acg caa gtt 5856 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 gat tgg aaa tac cgt ggg caa att acg ccg ctg aat aaa cag atg tca 5904 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 ctg gac gtg cat atc act gag atc gtg aat gac gct ggt gaa gtg cga 5952 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 atc gtt ggt gat gcg aat ctg tct aaa gat ggt ctg cgt att tat gaa 6000 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 gtt aaa aac atc gtt tta agt att gtt gaa gcg taa 6036 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 7 <211> 2011 <212> PRT <213> Moritella marina <400> 7 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 Thr Ala Asn Lys Gly Asp Thr Asp Lys Phe Tyr Cys Val His Gly Gly 50 55 60 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 Asp Tyr Leu Ala Gly Leu Asp Asp Lep Asn Gln Trp Gly Leu Tyr Val 85 90 95 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 Asp Ala Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 Tyr Leu His T hr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 285 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His Ala Thr Gly Thr Pro 325 330 335 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 Val Ser Trp P ro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 Ala Gly Val Ser Val Phe Gly Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 480 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asu Asn Asn Asn Gln Asn 485 490 495 495 Thr Phe Arg Glu Leu Pro Glu Gln Arg Trp Lys Gly Met Glu Ser Asn 500 505 510 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 Asn Glu Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 555 560 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Arg Asn Val 565 570 575 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 Gly Ile Asn Leu Thr Val Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 lys Asp Gly ValAla Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 635 640 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 720 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Val Lys Pro 740 745 750 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Ala Asp Val Ala Ser Val 785 790 795 800 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 Asn Ile Gly HisThr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 880 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Val 995 1000 1005 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 His Gl n Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 Asn Thr Val Leu Thr Asn Pro Val Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 Glu Asn Val Ile Tyr Asp Gln Ala Asp Leu Val Glu Phe Ala Glu Gly 1155 1160 1165 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Lys Ser Tyr Met 1205 1210 1215 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 Tyr Arg Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 1310 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 Arg Asn Pro Ser Leu Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 His P he Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 Cys Gly Gln Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 Ala Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 Glu Arg Gly Val Glu Pro Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 1730 1735 1740 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 Val Ser Thr Ala Ile Ala Gly Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 Asn Ile Asp Val Phe Asp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gly Gln Met Asn 1860 1865 1870 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 Arg Tyr His Phe His Gln Asp Pro Val Met Pro Gly Ser Leu Gly Val 1905 1910 1915 1920 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 8 <211> 1617 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. (1614) <400> 8 atg tcg agt tta ggt ttt aac aat aac aac gca att aac tgg gct tgg 48 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 aaa gta gat cca gcg tca gtt cat aca caa gat gca gaa att aaa gca 96 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 gct tta atg gat cta act aaa cct ctc tat gtg gcg aat aat tca ggc 144 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 gta act ggt ata gct aat cat acg tca gta gca ggt gcg atc agc aat 192 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 aac atc gat gtt gat gta ttg gcg ttt gcg caa aag tta aac cca gaa 240 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 gat ctg ggt gat gat gct tac aag aaa cag cac ggc gtt aaa tat gct 288 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 tat cat ggc ggt gcg atg gca aat ggt att gcc tcg gtt gaa ttg gtt 336 Tyr His Gly Gly Ala Met Ala Asn Gly Ale Ser Val Glu Leu Val 100 105 110 gtt gcg tta ggt aaa gca ggg ctg tta tgt tca ttt ggt gct gca ggt 384 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 cta gtg cct gat gcg gtt gaa gat gca att cgt cgt gct gaa 432 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 tta cca aat ggc cct tat gcg gtt aac ttg atc cat gca cca gca gaa 480 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 gaa gca tta gag cgt ggc gcg gtt gaa cgt ttc cta aaa ctt ggc gtc 528 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 aag acct gta tca gct tac ctt ggt tta act gaa cac att gtt 576 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 tgg tat cgt gct gct ggt cta act aaa aac gca gat ggc agt gtt aat 624 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 atc ggt aac aag gtt atc gct aaa gta tcg cgt acc gaa gtt ggt cgc 672 Ile Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg210 215 220 cgc ttt atg gaa cct gca ccg caa aaa tta ctg gat aag tta tta gaa 720 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 caa aat aag atc acc cct gga caa gct tta gcg ttg ctt gta cct 768 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 atg gct gat gat att act ggg gaa gcg gat tct ggt ggt cat aca gat 816 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 aac cgt ccg ttt tta aca tta tta ccg acg att att ggt ctg cgt gat 864 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thrh Ile Ile Gly Leu Arg Asp 275 280 285 gaa gtg caa gcg aag tat aac ttc tct cct gca tta cgt gtt ggt gct 912 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 ggt ggt ggt atc gga acg cct gaa gca gca ctc gct gca ttt ata 960 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 ggc gcg gct tat atc gtt ctg ggt tct gtg aat cag gcg tgt gtt gaa 1008 Gly Ala Ala Tla Iyr Val Leu Gly Ser Val Asn Ala Cys Val Glu 325 330 335 gcg ggt gca tct gaa tat act cgt aaa ctg tta tcg aca gtt gaa atg 1056 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 gct gat gtg act atg gca cct gct gca gat atg ttt gaa atg ggt gtg 1104 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 aag ctg caa gta tta aaa cgc ggt tct atg ttc gcg atg cgt gcg aag 1152 Lys Le Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 aaa ctg tat gac ttg tat gtg gct tat gac tcg att gaa gat atc cca 1200 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 395 400 gct gct gaa cgt gag aag att gaa aaa caa atc ttc cgt gca aac cta 1248 Ala Ala Glu Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 gac gag att tgg gat ggc act atc gct ttc gaa cgc gat cca 1296 Asp Glu Ile Trp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 gaa atg cta gcc cgt gca acg agt agt cct aaa cgt aaa atg gca ctt 1344 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 atc ttc cgt tgg tat ctt ggc ctt tct tca cgc tgg tca aac aca ggc 1392 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 gag aag gga cgt gaa atg gat tat cag att tgg gca ggc cca agt tta 1440 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 ggt gca ttc aac agc tgg gtg aaa ggt tct tac ctt acc 1488 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 cgc cgt ggc gct gta gat gtt gct ttg cat atg ctt aaa ggt gct gcg 1536 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 tat tta caa cgt gta aac cag ttg aaa ttg caa ggt gtt agc tta agt 1584 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 aca gaa ttg gca gtg agt acg agt gat taa 1617 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 9 <211> 538 <212> PRT <213> Moritella marina <400> 9 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 Tyr His Gly Gly Ala Met Ala Asn Gly Ile Ala Ser Val Glu Leu Val 100 105 110 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 Ile Gly Asn Ly s Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg 210 215 220 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thr Ile Ile Gly Leu Arg Asp 275 280 285 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 300 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 Gly Ala Ala Tyr Ile Val Leu Gly Ser Val Asn Gln Ala Cys Val Glu 325 330 335 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 Lys Leu Gln Val Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 390 395 400 Ala Ala Gla Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 Asp Glu Ile T rp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 10 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Primer <220> <221> Degenerate <222> (6) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (12) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (15) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (18) <223> "n" is a, t, c or g <400> 10 ttyggnttyg gnggnacnaa 20 <210> 11 <211> 21 <212> DNA <213> Artificial Sequence <220> <223> Description of Artificial Sequence: Primer <220> <221> Degenerate <222> (4) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (7) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (10) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (16) <223> "n" is a, t, c or g <220> <221> Degenerate <222> (19) <223> "n" is a, t, c or g <400> 11 ytcnccnarn swrtgnccng c 21
【0059】[0059]
【0060】[0060]
【配列番号10】プライマー。第6塩基、第12塩基、
第15塩基及び第18塩基のnは、a、t、c又はgで
ある。[SEQ ID NO: 10] Primer. 6th base, 12th base,
N of the 15th and 18th bases is a, t, c or g.
【0061】[0061]
【配列番号11】プライマー。第4塩基、第7塩基、第
10塩基、第16塩基及び第19塩基のnは、a、t、
c又はgである。[SEQ ID NO: 11] Primer. N of the fourth, seventh, tenth, sixteenth, and nineteenth bases is a, t,
c or g.
【図1】シェワネラSCRC2738株のEPA合成酵
素群遺伝子ORF5中のKAS−MCTドメインのアミ
ノ酸配列と、微生物由来のKASドメイン及びMCTド
メインのアミノ酸配列との比較を示す図である。FIG. 1 is a diagram showing a comparison between the amino acid sequence of the KAS-MCT domain in the EPA synthase group gene ORF5 of Shewanella SCRC2738 strain and the amino acid sequences of the KAS domain and MCT domain derived from microorganisms.
【図2】PCRによって得られたモリテラ・マリナMP
−1株のKAS−MCT断片のアミノ酸配列とシェワネ
ラSCRC2738株由来EPA合成酵素群遺伝子OR
F5中のKAS−MCTドメインのアミノ酸配列との比
較を示す図である。FIG. 2: Moritera Marina MP obtained by PCR
Amino acid sequence of KAS-MCT fragment of -1 strain and OR gene of EPA synthase group derived from Shewanella SCRC2738 strain
It is a figure which shows the comparison with the amino acid sequence of the KAS-MCT domain in F5.
【図3】コスミドクローンp3D5に含まれるORFの
概略を示す図である。FIG. 3 is a diagram showing an outline of an ORF contained in a cosmid clone p3D5.
【図4】コスミドクローンp3D5(A)とシェワネラ
SCRC2738株のEPA合成酵素群遺伝子(B)に
見られるドメイン構造の比較を示す図である。FIG. 4 is a view showing a comparison of the domain structures of the cosmid clone p3D5 (A) and the EPA synthase gene (B) of Shewanella SCRC2738 strain.
───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.7 識別記号 FI テーマコート゛(参考) C12R 1:01) C12R 1:01) (72)発明者 奥山 英登志 北海道札幌市豊平区月寒東2条17丁目2番 1号 工業技術院北海道工業技術研究所内 Fターム(参考) 4B024 AA05 BA10 CA03 DA05 DA11 EA04 GA11 4B050 CC03 DD02 LL02 4B064 AD90 CA02 CA19 CC24 DA06 DA10 4H045 AA10 BA10 CA11 DA89 EA01 FA72 FA74 ──────────────────────────────────────────────────続 き Continued on the front page (51) Int.Cl. 7 Identification symbol FI Theme coat ゛ (Reference) C12R 1:01) C12R 1:01) (72) Inventor Hidetoshi Okuyama Tsukikan Higashijo, Toyohira-ku, Sapporo, Hokkaido 17-chome 2-1-1 F-term in the National Institute of Advanced Industrial Science and Technology, Hokkaido Institute of Technology 4B024 AA05 BA10 CA03 DA05 DA11 EA04 GA11 4B050 CC03 DD02 LL02 4B064 AD90 CA02 CA19 CC24 DA06 DA10 4H045 AA10 BA10 CA11 DA89 EA01 FA72 FA74
Claims (16)
する細菌由来の、イコサペンタエン酸生合成酵素群類似
タンパク質群をコードするDNA。1. A DNA encoding an icosapentaenoic acid biosynthetic enzyme group-like protein group derived from a bacterium capable of producing docosahexaenoic acid.
に属するものである請求項1記載のDNA。2. The method according to claim 1, wherein the bacterium is of the genus Moritella .
The DNA according to claim 1, which belongs to
配列において1以上のアミノ酸が置換、欠失、付加又は
挿入されていてもよいアミノ酸配列を含み、かつ、ドコ
サヘキサエン酸生合成酵素群のメンバーとして機能し得
るタンパク質をコードする塩基配列、(ii)配列番号5
で表わされるアミノ酸配列において1以上のアミノ酸が
置換、欠失、付加又は挿入されていてもよいアミノ酸配
列を含み、かつ、ドコサヘキサエン酸生合成酵素群のメ
ンバーとして機能し得るタンパク質をコードする塩基配
列、(iii)配列番号7で表わされるアミノ酸配列にお
いて1以上のアミノ酸が置換、欠失、付加又は挿入され
ていてもよいアミノ酸配列を含み、かつ、ドコサヘキサ
エン酸生合成酵素群のメンバーとして機能し得るタンパ
ク質をコードする塩基配列、及び(iv)配列番号9で表
わされるアミノ酸配列において1以上のアミノ酸が置
換、欠失、付加又は挿入されていてもよいアミノ酸配列
を含み、かつ、ドコサヘキサエン酸生合成酵素群のメン
バーとして機能し得るタンパク質をコードする塩基配列
を含む請求項1記載のDNA。(I) a member of the docosahexaenoic acid biosynthetic enzyme group, which comprises an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 3; Sequence encoding a protein capable of functioning as: (ii) SEQ ID NO: 5
In the amino acid sequence represented by, one or more amino acids substitution, deletion, including an amino acid sequence that may be added or inserted, and, a nucleotide sequence encoding a protein that can function as a member of the docosahexaenoic acid biosynthetic enzyme group, (Iii) a protein comprising an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 7, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group And (iv) an amino acid sequence represented by SEQ ID NO: 9 in which one or more amino acids may be substituted, deleted, added or inserted, and a group of docosahexaenoic acid biosynthetic enzymes 2. The method according to claim 1, which comprises a nucleotide sequence encoding a protein capable of functioning as a member of the protein. NA.
請求項1記載のDNA。4. The DNA according to claim 1, comprising the base sequence represented by SEQ ID NO: 1.
パク質。 (1)配列番号3で表わされるアミノ酸配列を含むタン
パク質。 (2)配列番号3で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。5. A protein represented by the following (1) or (2): (1) A protein comprising the amino acid sequence represented by SEQ ID NO: 3. (2) In the amino acid sequence represented by SEQ ID NO: 3, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
パク質をコードするDNA。 (1)配列番号3で表わされるアミノ酸配列を含むタン
パク質。 (2)配列番号3で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。6. A DNA encoding a protein represented by the following (1) or (2): (1) A protein comprising the amino acid sequence represented by SEQ ID NO: 3. (2) In the amino acid sequence represented by SEQ ID NO: 3, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
請求項6記載のDNA。7. The DNA according to claim 6, which comprises the base sequence represented by SEQ ID NO: 2.
パク質。 (3)配列番号5で表わされるアミノ酸配列を含むタン
パク質。 (4)配列番号5で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。8. A protein represented by the following (3) or (4): (3) A protein comprising the amino acid sequence represented by SEQ ID NO: 5. (4) 1 in the amino acid sequence represented by SEQ ID NO: 5
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
パク質をコードするDNA。 (3)配列番号5で表わされるアミノ酸配列を含むタン
パク質。 (4)配列番号5で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。9. A DNA encoding a protein represented by the following (3) or (4): (3) A protein comprising the amino acid sequence represented by SEQ ID NO: 5. (4) 1 in the amino acid sequence represented by SEQ ID NO: 5
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
む請求項9記載のDNA。10. The DNA according to claim 9, which comprises the base sequence represented by SEQ ID NO: 4.
ンパク質。 (5)配列番号7で表わされるアミノ酸配列を含むタン
パク質。 (6)配列番号7で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。11. A protein represented by the following (5) or (6): (5) A protein comprising the amino acid sequence represented by SEQ ID NO: 7. (6) In the amino acid sequence represented by SEQ ID NO: 7, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
ンパク質をコードするDNA。 (5)配列番号7で表わされるアミノ酸配列を含むタン
パク質。 (6)配列番号7で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。12. A DNA encoding a protein represented by the following (5) or (6): (5) A protein comprising the amino acid sequence represented by SEQ ID NO: 7. (6) In the amino acid sequence represented by SEQ ID NO: 7, 1
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
む請求項12記載のDNA。13. The DNA according to claim 12, comprising the base sequence represented by SEQ ID NO: 6.
ンパク質。 (7)配列番号9で表わされるアミノ酸配列を含むタン
パク質。 (8)配列番号9で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。14. A protein represented by the following (7) or (8): (7) a protein comprising the amino acid sequence represented by SEQ ID NO: 9; (8) 1 in the amino acid sequence represented by SEQ ID NO: 9
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
ンパク質をコードするDNA。 (7)配列番号9で表わされるアミノ酸配列を含むタン
パク質。 (8)配列番号9で表わされるアミノ酸配列において1
以上のアミノ酸が置換、欠失、付加又は挿入されたアミ
ノ酸配列を含み、かつ、ドコサヘキサエン酸生合成酵素
群のメンバーとして機能し得るタンパク質。15. A DNA encoding a protein represented by the following (7) or (8): (7) a protein comprising the amino acid sequence represented by SEQ ID NO: 9; (8) 1 in the amino acid sequence represented by SEQ ID NO: 9
A protein comprising the amino acid sequence in which the above amino acids have been substituted, deleted, added or inserted, and which can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
む請求項15記載のDNA。16. The DNA according to claim 15, comprising the base sequence represented by SEQ ID NO: 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP35661499A JP2001169780A (en) | 1999-12-15 | 1999-12-15 | Gene derived from docosahexaenoic acid-producing bacterium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP35661499A JP2001169780A (en) | 1999-12-15 | 1999-12-15 | Gene derived from docosahexaenoic acid-producing bacterium |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2001169780A true JP2001169780A (en) | 2001-06-26 |
Family
ID=18449912
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP35661499A Pending JP2001169780A (en) | 1999-12-15 | 1999-12-15 | Gene derived from docosahexaenoic acid-producing bacterium |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP2001169780A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7807849B2 (en) | 2004-04-22 | 2010-10-05 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US7834250B2 (en) | 2004-04-22 | 2010-11-16 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US8809559B2 (en) | 2008-11-18 | 2014-08-19 | Commonwelath Scientific And Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
US8816111B2 (en) | 2012-06-15 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
US8816106B2 (en) | 2006-08-29 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of fatty acids |
US9718759B2 (en) | 2013-12-18 | 2017-08-01 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
US10005713B2 (en) | 2014-06-27 | 2018-06-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the sn-2 position |
-
1999
- 1999-12-15 JP JP35661499A patent/JP2001169780A/en active Pending
Cited By (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9453183B2 (en) | 2004-04-22 | 2016-09-27 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US7834250B2 (en) | 2004-04-22 | 2010-11-16 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US10443079B2 (en) | 2004-04-22 | 2019-10-15 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US8071341B2 (en) | 2004-04-22 | 2011-12-06 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US8106226B2 (en) | 2004-04-22 | 2012-01-31 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US8158392B1 (en) | 2004-04-22 | 2012-04-17 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US8535917B2 (en) | 2004-04-22 | 2013-09-17 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US8575377B2 (en) | 2004-04-22 | 2013-11-05 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US8778644B2 (en) | 2004-04-22 | 2014-07-15 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US9458410B2 (en) | 2004-04-22 | 2016-10-04 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US9963723B2 (en) | 2004-04-22 | 2018-05-08 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US9970033B2 (en) | 2004-04-22 | 2018-05-15 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US8853432B2 (en) | 2004-04-22 | 2014-10-07 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US10781463B2 (en) | 2004-04-22 | 2020-09-22 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US7932438B2 (en) | 2004-04-22 | 2011-04-26 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US11220698B2 (en) | 2004-04-22 | 2022-01-11 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US9951357B2 (en) | 2004-04-22 | 2018-04-24 | Commonweatlh Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US9994880B2 (en) | 2004-04-22 | 2018-06-12 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US7807849B2 (en) | 2004-04-22 | 2010-10-05 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US11597953B2 (en) | 2004-04-22 | 2023-03-07 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cells |
US9926579B2 (en) | 2004-04-22 | 2018-03-27 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of long-chain polyunsaturated fatty acids by recombinant cell |
US8816106B2 (en) | 2006-08-29 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of fatty acids |
US10513717B2 (en) | 2006-08-29 | 2019-12-24 | Commonwealth Scientific And Industrial Research Organisation | Synthesis of fatty acids |
US9938486B2 (en) | 2008-11-18 | 2018-04-10 | Commonwealth Scientific And Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
US8809559B2 (en) | 2008-11-18 | 2014-08-19 | Commonwelath Scientific And Industrial Research Organisation | Enzymes and methods for producing omega-3 fatty acids |
US9550718B2 (en) | 2012-06-15 | 2017-01-24 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
US9932289B2 (en) | 2012-06-15 | 2018-04-03 | Commonwealth Scientific And Industrial Research Ogranisation | Process for producing ethyl esters of polyunsaturated fatty acids |
US9556102B2 (en) | 2012-06-15 | 2017-01-31 | Commonwealth Scientific And Industrial Research Organisation | Process for producing ethyl esters of polyunsaturated fatty acids |
US10335386B2 (en) | 2012-06-15 | 2019-07-02 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
US8946460B2 (en) | 2012-06-15 | 2015-02-03 | Commonwealth Scientific And Industrial Research Organisation | Process for producing polyunsaturated fatty acids in an esterified form |
US8816111B2 (en) | 2012-06-15 | 2014-08-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising polyunsaturated fatty acids |
US9718759B2 (en) | 2013-12-18 | 2017-08-01 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
US10190073B2 (en) | 2013-12-18 | 2019-01-29 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
US10125084B2 (en) | 2013-12-18 | 2018-11-13 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
US10800729B2 (en) | 2013-12-18 | 2020-10-13 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
US9725399B2 (en) | 2013-12-18 | 2017-08-08 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising long chain polyunsaturated fatty acids |
US11623911B2 (en) | 2013-12-18 | 2023-04-11 | Commonwealth Scientific And Industrial Research Organisation | Lipid comprising docosapentaenoic acid |
US10793507B2 (en) | 2014-06-27 | 2020-10-06 | Commonwealth Scientific And Industrial Research Organisation | Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the SN-2 position |
US10005713B2 (en) | 2014-06-27 | 2018-06-26 | Commonwealth Scientific And Industrial Research Organisation | Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the sn-2 position |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR101539470B1 (en) | Chimeric pufa polyketide synthase systems and uses thereof | |
KR101506347B1 (en) | Plant seed oils containing polyunsaturated fatty acids | |
KR20070084187A (en) | Pufa polyketide synthase systems and uses thereof | |
TW201038734A (en) | Polyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof | |
MXPA01007153A (en) | Schizochytrium pks genes. | |
KR101234198B1 (en) | PUFA Polyketide Synthase Systems and Uses Thereof | |
CN108368491A (en) | The algae mutant of lipid production rate with raising | |
AU673359B2 (en) | Gene which codes for eicosapentaenoic acid synthetase group and process for producing eicosapentaenoic acid | |
US6908992B2 (en) | Methanotrophic carbon metabolism pathway genes and enzymes | |
JP2001169780A (en) | Gene derived from docosahexaenoic acid-producing bacterium | |
EP0836611A1 (en) | Sequences for production of 2,4-diacetylphloroglucinol and methods | |
US6537786B2 (en) | Genes encoding exopolysaccharide production | |
US20030157673A1 (en) | Genes involved in cyclododecanone degradation pathway | |
JP4221476B2 (en) | Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid | |
CA2391131C (en) | Genes and proteins for rosaramicin biosynthesis | |
CN1325959B (en) | Genes from genome | |
US20030215930A1 (en) | Genes involved in cyclododecanone degradation pathway | |
KR20130097538A (en) | Chejuenolide biosynthetic gene cluster from hahella chejuensis | |
KR20110092510A (en) | Tridecaptin synthetase and gene thereof | |
JP5110511B2 (en) | Method for producing highly unsaturated fatty acids and highly unsaturated lipids using microorganisms | |
JPWO2009147984A1 (en) | DNA encoding a polypeptide involved in the biosynthesis of herboxidiene | |
CN101142313A (en) | Genes encoding the synthetic pathway for the production of disorazole | |
JPH0646864A (en) | Gene capable of coding eicosapentaenoic acid synthase and production of elcosapentaenoic acid | |
JPH08242867A (en) | Gene coding for biosynthetic enzyme group for eicosapentaenoic acid and production of eicosapentaenoic acid | |
JP2002315579A (en) | Structural gene on gene cluster |