CN112980907A - 一种用于糖蛋白合成的基于原核生物的无细胞系统 - Google Patents
一种用于糖蛋白合成的基于原核生物的无细胞系统 Download PDFInfo
- Publication number
- CN112980907A CN112980907A CN202110180948.XA CN202110180948A CN112980907A CN 112980907 A CN112980907 A CN 112980907A CN 202110180948 A CN202110180948 A CN 202110180948A CN 112980907 A CN112980907 A CN 112980907A
- Authority
- CN
- China
- Prior art keywords
- leu
- ala
- xaa
- ile
- phe
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000288 Glycoproteins Proteins 0.000 title claims abstract description 98
- 102000003886 Glycoproteins Human genes 0.000 title claims abstract description 98
- 210000004671 cell-free system Anatomy 0.000 title abstract description 16
- 230000015572 biosynthetic process Effects 0.000 title description 12
- 238000003786 synthesis reaction Methods 0.000 title description 10
- 241000894006 Bacteria Species 0.000 title description 9
- 108010089072 Dolichyl-diphosphooligosaccharide-protein glycotransferase Proteins 0.000 claims abstract description 109
- 150000004676 glycans Chemical class 0.000 claims abstract description 85
- 150000002632 lipids Chemical class 0.000 claims abstract description 68
- 238000000034 method Methods 0.000 claims abstract description 54
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 48
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 34
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 34
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 30
- 238000004519 manufacturing process Methods 0.000 claims abstract description 21
- 102000035122 glycosylated proteins Human genes 0.000 claims abstract description 15
- 108091005608 glycosylated proteins Proteins 0.000 claims abstract description 15
- 150000001413 amino acids Chemical group 0.000 claims description 505
- 229940024606 amino acid Drugs 0.000 claims description 442
- 235000001014 amino acid Nutrition 0.000 claims description 442
- 108090000623 proteins and genes Proteins 0.000 claims description 80
- 102000004169 proteins and genes Human genes 0.000 claims description 72
- 238000006206 glycosylation reaction Methods 0.000 claims description 67
- 235000018102 proteins Nutrition 0.000 claims description 67
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 41
- 241000589876 Campylobacter Species 0.000 claims description 40
- 238000012546 transfer Methods 0.000 claims description 33
- 102000004190 Enzymes Human genes 0.000 claims description 26
- 108090000790 Enzymes Proteins 0.000 claims description 26
- 150000002482 oligosaccharides Chemical class 0.000 claims description 25
- 229920001542 oligosaccharide Polymers 0.000 claims description 21
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 14
- 239000003153 chemical reaction reagent Substances 0.000 claims description 14
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 13
- -1 asparagine amino acid Chemical class 0.000 claims description 10
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 claims description 10
- 230000002194 synthesizing effect Effects 0.000 claims description 10
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 9
- 235000009582 asparagine Nutrition 0.000 claims description 9
- 229960001230 asparagine Drugs 0.000 claims description 9
- 239000011541 reaction mixture Substances 0.000 claims description 8
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 7
- OVRNDRQMDRJTHS-RTRLPJTCSA-N N-acetyl-D-glucosamine Chemical group CC(=O)N[C@H]1C(O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-RTRLPJTCSA-N 0.000 claims description 6
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 6
- 239000004473 Threonine Substances 0.000 claims description 6
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 claims description 6
- 125000000311 mannosyl group Chemical group C1([C@@H](O)[C@@H](O)[C@H](O)[C@H](O1)CO)* 0.000 claims description 5
- RSLLXTJELTWVHR-FNORWQNLSA-N (3e)-undeca-1,3-diene Chemical compound CCCCCCC\C=C\C=C RSLLXTJELTWVHR-FNORWQNLSA-N 0.000 claims description 4
- 229910019142 PO4 Inorganic materials 0.000 claims description 4
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 4
- ZTOKCBJDEGPICW-GWPISINRSA-N alpha-D-Manp-(1->3)-[alpha-D-Manp-(1->6)]-beta-D-Manp-(1->4)-beta-D-GlcpNAc-(1->4)-beta-D-GlcpNAc Chemical compound O[C@@H]1[C@@H](NC(=O)C)[C@H](O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O[C@H]2[C@H]([C@@H](O[C@@H]3[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)[C@H](O)[C@@H](CO[C@@H]3[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)O2)O)[C@@H](CO)O1 ZTOKCBJDEGPICW-GWPISINRSA-N 0.000 claims description 4
- 235000003704 aspartic acid Nutrition 0.000 claims description 4
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 4
- KSEVLWYNDHVCJB-DMZKWVAKSA-N n-[(2r,3r,4r,5s,6r)-2-[(2r,3r,4s,5r)-2-acetamido-6-hydroxy-1-oxo-4-[(2s,3s,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-5-[(2r,3s,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxyhexan-3-yl]oxy-4,5-dihydroxy-6-(hydroxymethyl)oxan-3-yl Chemical compound O([C@H]([C@H](C=O)NC(=O)C)[C@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H](CO)O[C@@H]1[C@H]([C@@H](O)[C@H](O)[C@@H](CO)O1)O)[C@@H]1O[C@H](CO)[C@@H](O)[C@H](O)[C@H]1NC(C)=O KSEVLWYNDHVCJB-DMZKWVAKSA-N 0.000 claims description 4
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 claims description 4
- 239000010452 phosphate Substances 0.000 claims description 4
- 238000012360 testing method Methods 0.000 claims description 4
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 claims description 3
- ZTOKCBJDEGPICW-UHFFFAOYSA-N Man3GlcNAc2 Natural products OC1C(NC(=O)C)C(O)OC(CO)C1OC1C(NC(C)=O)C(O)C(OC2C(C(OC3C(C(O)C(O)C(CO)O3)O)C(O)C(COC3C(C(O)C(O)C(CO)O3)O)O2)O)C(CO)O1 ZTOKCBJDEGPICW-UHFFFAOYSA-N 0.000 claims 2
- 235000013922 glutamic acid Nutrition 0.000 claims 2
- 239000004220 glutamic acid Substances 0.000 claims 2
- 230000013595 glycosylation Effects 0.000 description 70
- 210000004027 cell Anatomy 0.000 description 64
- 230000014616 translation Effects 0.000 description 54
- 241000588724 Escherichia coli Species 0.000 description 49
- 241000589875 Campylobacter jejuni Species 0.000 description 46
- 238000013519 translation Methods 0.000 description 45
- 239000000370 acceptor Substances 0.000 description 33
- 230000001580 bacterial effect Effects 0.000 description 33
- 230000004988 N-glycosylation Effects 0.000 description 29
- 101150099625 STT3 gene Proteins 0.000 description 27
- 238000000338 in vitro Methods 0.000 description 25
- 108010034529 leucyl-lysine Proteins 0.000 description 25
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 24
- 241000205160 Pyrococcus Species 0.000 description 24
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 23
- 102000051366 Glycosyltransferases Human genes 0.000 description 21
- 108700023372 Glycosyltransferases Proteins 0.000 description 21
- 108010037850 glycylvaline Proteins 0.000 description 21
- 239000002773 nucleotide Substances 0.000 description 21
- 125000003729 nucleotide group Chemical group 0.000 description 21
- 108010050848 glycylleucine Proteins 0.000 description 20
- 108010015792 glycyllysine Proteins 0.000 description 20
- 108010051242 phenylalanylserine Proteins 0.000 description 20
- 108091035707 Consensus sequence Proteins 0.000 description 19
- 230000014509 gene expression Effects 0.000 description 19
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 18
- 108010003137 tyrosyltyrosine Proteins 0.000 description 18
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 17
- 108090000765 processed proteins & peptides Proteins 0.000 description 17
- 108010073969 valyllysine Proteins 0.000 description 17
- 241000589877 Campylobacter coli Species 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 16
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 16
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 16
- 108010029020 prolylglycine Proteins 0.000 description 16
- 241000880493 Leptailurus serval Species 0.000 description 15
- 108010047495 alanylglycine Proteins 0.000 description 15
- 108700014210 glycosyltransferase activity proteins Proteins 0.000 description 15
- 108010009298 lysylglutamic acid Proteins 0.000 description 15
- 108010061238 threonyl-glycine Proteins 0.000 description 15
- 108020004414 DNA Proteins 0.000 description 14
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 14
- 238000006243 chemical reaction Methods 0.000 description 14
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 13
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 13
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 13
- 241000205156 Pyrococcus furiosus Species 0.000 description 13
- 108010005233 alanylglutamic acid Proteins 0.000 description 13
- 108010081551 glycylphenylalanine Proteins 0.000 description 13
- 108010057821 leucylproline Proteins 0.000 description 13
- 108010012058 leucyltyrosine Proteins 0.000 description 13
- 108010017391 lysylvaline Proteins 0.000 description 13
- 239000012528 membrane Substances 0.000 description 13
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 13
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 13
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 12
- 241000222722 Leishmania <genus> Species 0.000 description 12
- 241000222732 Leishmania major Species 0.000 description 12
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 12
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 12
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 12
- 102000004196 processed proteins & peptides Human genes 0.000 description 12
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 11
- NMKJPMCEKQHRPD-IRXDYDNUSA-N Tyr-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NMKJPMCEKQHRPD-IRXDYDNUSA-N 0.000 description 11
- 108010062796 arginyllysine Proteins 0.000 description 11
- 108010068265 aspartyltyrosine Proteins 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- 239000000758 substrate Substances 0.000 description 11
- 238000013518 transcription Methods 0.000 description 11
- 230000035897 transcription Effects 0.000 description 11
- 108010045269 tryptophyltryptophan Proteins 0.000 description 11
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 10
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 10
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 10
- NLEBIOOXCVAHBD-QKMCSOCLSA-N dodecyl beta-D-maltoside Chemical compound O[C@@H]1[C@@H](O)[C@H](OCCCCCCCCCCCC)O[C@H](CO)[C@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 NLEBIOOXCVAHBD-QKMCSOCLSA-N 0.000 description 10
- 108010087823 glycyltyrosine Proteins 0.000 description 10
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 10
- 108010054155 lysyllysine Proteins 0.000 description 10
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 9
- 241000203069 Archaea Species 0.000 description 9
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 9
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 9
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 9
- 108010011559 alanylphenylalanine Proteins 0.000 description 9
- 108010070783 alanyltyrosine Proteins 0.000 description 9
- 108010093581 aspartyl-proline Proteins 0.000 description 9
- 108010054812 diprotin A Proteins 0.000 description 9
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 9
- 230000035772 mutation Effects 0.000 description 9
- 229920001184 polypeptide Polymers 0.000 description 9
- 108010053725 prolylvaline Proteins 0.000 description 9
- 108010051110 tyrosyl-lysine Proteins 0.000 description 9
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 8
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 8
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 8
- 101710150311 Dolichyl-phosphooligosaccharide-protein glycotransferase Proteins 0.000 description 8
- 101710202156 Dolichyl-phosphooligosaccharide-protein glycotransferase 1 Proteins 0.000 description 8
- 101710202150 Dolichyl-phosphooligosaccharide-protein glycotransferase 2 Proteins 0.000 description 8
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 8
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 8
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 8
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 8
- 108010087924 alanylproline Proteins 0.000 description 8
- 108010013835 arginine glutamate Proteins 0.000 description 8
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 8
- 108010010147 glycylglutamine Proteins 0.000 description 8
- 108010025306 histidylleucine Proteins 0.000 description 8
- 108010092114 histidylphenylalanine Proteins 0.000 description 8
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 8
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 8
- 108010064235 lysylglycine Proteins 0.000 description 8
- 108010026333 seryl-proline Proteins 0.000 description 8
- 108010005652 splenotritin Proteins 0.000 description 8
- 108010020532 tyrosyl-proline Proteins 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- XCBKBPRFACFFOO-AQZXSJQPSA-N Asn-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O XCBKBPRFACFFOO-AQZXSJQPSA-N 0.000 description 7
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 7
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 7
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 7
- WEWCEPOYKANMGZ-MMWGEVLESA-N Ile-Cys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WEWCEPOYKANMGZ-MMWGEVLESA-N 0.000 description 7
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 7
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 7
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 7
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 7
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 7
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 7
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 7
- 241000235347 Schizosaccharomyces pombe Species 0.000 description 7
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 7
- OFCKFBGRYHOKFP-IHPCNDPISA-N Trp-Asp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N OFCKFBGRYHOKFP-IHPCNDPISA-N 0.000 description 7
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 7
- 108010081404 acein-2 Proteins 0.000 description 7
- 108010070944 alanylhistidine Proteins 0.000 description 7
- 108010049041 glutamylalanine Proteins 0.000 description 7
- 108010089804 glycyl-threonine Proteins 0.000 description 7
- 238000003119 immunoblot Methods 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 230000037361 pathway Effects 0.000 description 7
- 108010031719 prolyl-serine Proteins 0.000 description 7
- 238000002864 sequence alignment Methods 0.000 description 7
- 235000000346 sugar Nutrition 0.000 description 7
- 230000009466 transformation Effects 0.000 description 7
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 6
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 6
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 6
- 240000005528 Arctium lappa Species 0.000 description 6
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 6
- 108010010777 Arg-Gly-Asp-Gly Proteins 0.000 description 6
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 6
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 6
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 6
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 6
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 6
- 241000589986 Campylobacter lari Species 0.000 description 6
- 241000206602 Eukaryota Species 0.000 description 6
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 6
- SLQVFYWBGNNOTK-BYULHYEWSA-N Ile-Gly-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N SLQVFYWBGNNOTK-BYULHYEWSA-N 0.000 description 6
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 6
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 6
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 6
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 6
- ZDBMWELMUCLUPL-QEJZJMRPSA-N Leu-Phe-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ZDBMWELMUCLUPL-QEJZJMRPSA-N 0.000 description 6
- XIZQPFCRXLUNMK-BZSNNMDCSA-N Lys-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCCCN)N XIZQPFCRXLUNMK-BZSNNMDCSA-N 0.000 description 6
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 6
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 6
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 6
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 6
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 6
- 108010079364 N-glycylalanine Proteins 0.000 description 6
- CSDMCMITJLKBAH-SOUVJXGZSA-N Phe-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O CSDMCMITJLKBAH-SOUVJXGZSA-N 0.000 description 6
- 108010076504 Protein Sorting Signals Proteins 0.000 description 6
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 6
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 6
- COLXBVRHSKPKIE-NYVOZVTQSA-N Trp-Trp-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O COLXBVRHSKPKIE-NYVOZVTQSA-N 0.000 description 6
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 6
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 6
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 239000003795 chemical substances by application Substances 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 238000001727 in vivo Methods 0.000 description 6
- 108010078274 isoleucylvaline Proteins 0.000 description 6
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 6
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 6
- 201000000626 mucocutaneous leishmaniasis Diseases 0.000 description 6
- 101150104606 pgl gene Proteins 0.000 description 6
- 238000001243 protein synthesis Methods 0.000 description 6
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 5
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 5
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 5
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 5
- QHASENCZLDHBGX-ONGXEEELSA-N Ala-Gly-Phe Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QHASENCZLDHBGX-ONGXEEELSA-N 0.000 description 5
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 5
- MAEQBGQTDWDSJQ-LSJOCFKGSA-N Ala-Met-His Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MAEQBGQTDWDSJQ-LSJOCFKGSA-N 0.000 description 5
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 5
- XAXHGSOBFPIRFG-LSJOCFKGSA-N Ala-Pro-His Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XAXHGSOBFPIRFG-LSJOCFKGSA-N 0.000 description 5
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 5
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 5
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 5
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 5
- VZNOVQKGJQJOCS-SRVKXCTJSA-N Asp-Asp-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VZNOVQKGJQJOCS-SRVKXCTJSA-N 0.000 description 5
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 5
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 5
- 241000193830 Bacillus <bacterium> Species 0.000 description 5
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 5
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 5
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 5
- BKMOHWJHXQLFEX-IRIUXVKKSA-N Glu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N)O BKMOHWJHXQLFEX-IRIUXVKKSA-N 0.000 description 5
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 5
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 5
- JSHOVJTVPXJFTE-HOCLYGCPSA-N His-Gly-Trp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JSHOVJTVPXJFTE-HOCLYGCPSA-N 0.000 description 5
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 5
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 5
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 5
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 5
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 5
- 241000222727 Leishmania donovani Species 0.000 description 5
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 5
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 5
- RIMMMMYKGIBOSN-DCAQKATOSA-N Leu-Asn-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O RIMMMMYKGIBOSN-DCAQKATOSA-N 0.000 description 5
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 5
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 5
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 5
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 5
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 5
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 5
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 5
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 5
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 5
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 5
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 5
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 5
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 5
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 5
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 5
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 5
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 5
- LPNWWHBFXPNHJG-AVGNSLFASA-N Met-Val-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN LPNWWHBFXPNHJG-AVGNSLFASA-N 0.000 description 5
- OVRNDRQMDRJTHS-UHFFFAOYSA-N N-acelyl-D-glucosamine Natural products CC(=O)NC1C(O)OC(CO)C(O)C1O OVRNDRQMDRJTHS-UHFFFAOYSA-N 0.000 description 5
- MBLBDJOUHNCFQT-LXGUWJNJSA-N N-acetylglucosamine Natural products CC(=O)N[C@@H](C=O)[C@@H](O)[C@H](O)[C@H](O)CO MBLBDJOUHNCFQT-LXGUWJNJSA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- HGNGAMWHGGANAU-WHOFXGATSA-N Phe-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HGNGAMWHGGANAU-WHOFXGATSA-N 0.000 description 5
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 5
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 5
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 5
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 5
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 5
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 5
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 5
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 5
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 5
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 5
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 5
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 5
- URPSJRMWHQTARR-MBLNEYKQSA-N Thr-Ile-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O URPSJRMWHQTARR-MBLNEYKQSA-N 0.000 description 5
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 5
- QAXCHNZDPLSFPC-PJODQICGSA-N Trp-Ala-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QAXCHNZDPLSFPC-PJODQICGSA-N 0.000 description 5
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 5
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 5
- GYKDRHDMGQUZPU-MGHWNKPDSA-N Tyr-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N GYKDRHDMGQUZPU-MGHWNKPDSA-N 0.000 description 5
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 5
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 5
- WYOBRXPIZVKNMF-IRXDYDNUSA-N Tyr-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CC=C(O)C=C1 WYOBRXPIZVKNMF-IRXDYDNUSA-N 0.000 description 5
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 5
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 5
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 5
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 5
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 5
- 206010047505 Visceral leishmaniasis Diseases 0.000 description 5
- 229920004482 WACKER® Polymers 0.000 description 5
- 238000007792 addition Methods 0.000 description 5
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 108091008053 gene clusters Proteins 0.000 description 5
- 108010078144 glutaminyl-glycine Proteins 0.000 description 5
- 108010028295 histidylhistidine Proteins 0.000 description 5
- 108010038320 lysylphenylalanine Proteins 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 108010079317 prolyl-tyrosine Proteins 0.000 description 5
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 5
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 5
- 108010002837 tyrosyl-isoleucyl-phenylalanyl-valine Proteins 0.000 description 5
- ZRLAPVCGIOJNSE-XJTSNBOBSA-N (2s)-2-[[(2r)-2-[[(2s,3s)-2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-3-methylpentanoyl]amino]-3-phenylpropanoyl]amino]-3-methylbutanoic acid Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRLAPVCGIOJNSE-XJTSNBOBSA-N 0.000 description 4
- VUDQSRFCCHQIIU-UHFFFAOYSA-N 1-(3,5-dichloro-2,6-dihydroxy-4-methoxyphenyl)hexan-1-one Chemical compound CCCCCC(=O)C1=C(O)C(Cl)=C(OC)C(Cl)=C1O VUDQSRFCCHQIIU-UHFFFAOYSA-N 0.000 description 4
- KQFRUSHJPKXBMB-BHDSKKPTSA-N Ala-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 KQFRUSHJPKXBMB-BHDSKKPTSA-N 0.000 description 4
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 4
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 4
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 4
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 4
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 4
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 4
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 4
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 4
- XMIAMUXIMWREBJ-HERUPUMHSA-N Ala-Trp-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N XMIAMUXIMWREBJ-HERUPUMHSA-N 0.000 description 4
- VQBULXOHAZSTQY-GKCIPKSASA-N Ala-Trp-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VQBULXOHAZSTQY-GKCIPKSASA-N 0.000 description 4
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 4
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 4
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 4
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 4
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 4
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 4
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 4
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 4
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 4
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 4
- QEQVUHQQYDZUEN-GUBZILKMSA-N Asn-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N QEQVUHQQYDZUEN-GUBZILKMSA-N 0.000 description 4
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 4
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 4
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 4
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 4
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 4
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 4
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 4
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 4
- GGBQDSHTXKQSLP-NHCYSSNCSA-N Asp-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N GGBQDSHTXKQSLP-NHCYSSNCSA-N 0.000 description 4
- 241001135528 Campylobacter upsaliensis Species 0.000 description 4
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 4
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 4
- SHZGCJCMOBCMKK-UHFFFAOYSA-N D-mannomethylose Natural products CC1OC(O)C(O)C(O)C1O SHZGCJCMOBCMKK-UHFFFAOYSA-N 0.000 description 4
- WQZGKKKJIJFFOK-QTVWNMPRSA-N D-mannopyranose Chemical compound OC[C@H]1OC(O)[C@@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-QTVWNMPRSA-N 0.000 description 4
- 241000224495 Dictyostelium Species 0.000 description 4
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 4
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 4
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 4
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 4
- CYHBMLHCQXXCCT-AVGNSLFASA-N Glu-Asp-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CYHBMLHCQXXCCT-AVGNSLFASA-N 0.000 description 4
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 4
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 4
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 4
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 4
- QIZJOTQTCAGKPU-KWQFWETISA-N Gly-Ala-Tyr Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 QIZJOTQTCAGKPU-KWQFWETISA-N 0.000 description 4
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 4
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 4
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 4
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 4
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 4
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 4
- WMGHDYWNHNLGBV-ONGXEEELSA-N Gly-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WMGHDYWNHNLGBV-ONGXEEELSA-N 0.000 description 4
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 4
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 4
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 4
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 4
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 4
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 4
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 4
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 4
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 4
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 4
- LRAUKBMYHHNADU-DKIMLUQUSA-N Ile-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 LRAUKBMYHHNADU-DKIMLUQUSA-N 0.000 description 4
- JSLIXOUMAOUGBN-JUKXBJQTSA-N Ile-Tyr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JSLIXOUMAOUGBN-JUKXBJQTSA-N 0.000 description 4
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 4
- 108010065920 Insulin Lispro Proteins 0.000 description 4
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 4
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 4
- 241000222740 Leishmania braziliensis Species 0.000 description 4
- 241000222697 Leishmania infantum Species 0.000 description 4
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 4
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 4
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 4
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 4
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 4
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 4
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 4
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 4
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 4
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 4
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 4
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 4
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 4
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 4
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 4
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 4
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 4
- HGLKOTPFWOMPOB-MEYUZBJRSA-N Leu-Thr-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HGLKOTPFWOMPOB-MEYUZBJRSA-N 0.000 description 4
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 4
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 4
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 4
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 4
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 4
- ODTZHNZPINULEU-KKUMJFAQSA-N Lys-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N ODTZHNZPINULEU-KKUMJFAQSA-N 0.000 description 4
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 4
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 4
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 4
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 4
- 241000192041 Micrococcus Species 0.000 description 4
- MBLBDJOUHNCFQT-UHFFFAOYSA-N N-acetyl-D-galactosamine Natural products CC(=O)NC(C=O)C(O)C(O)C(O)CO MBLBDJOUHNCFQT-UHFFFAOYSA-N 0.000 description 4
- DRBBFCLWYRJSJZ-UHFFFAOYSA-N N-phosphocreatine Chemical compound OC(=O)CN(C)C(=N)NP(O)(O)=O DRBBFCLWYRJSJZ-UHFFFAOYSA-N 0.000 description 4
- PXHVJJICTQNCMI-UHFFFAOYSA-N Nickel Chemical compound [Ni] PXHVJJICTQNCMI-UHFFFAOYSA-N 0.000 description 4
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 4
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 4
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 4
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 4
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 4
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 4
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 4
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 4
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 4
- ZTVSVSFBHUVYIN-UFYCRDLUSA-N Phe-Tyr-Met Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=C(O)C=C1 ZTVSVSFBHUVYIN-UFYCRDLUSA-N 0.000 description 4
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 4
- HATVCTYBNCNMAA-AVGNSLFASA-N Pro-Leu-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O HATVCTYBNCNMAA-AVGNSLFASA-N 0.000 description 4
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 4
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 4
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 4
- 241000191025 Rhodobacter Species 0.000 description 4
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 4
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 4
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 4
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 4
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 4
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 4
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 4
- VAIWUNAAPZZGRI-IHPCNDPISA-N Ser-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N VAIWUNAAPZZGRI-IHPCNDPISA-N 0.000 description 4
- TYIHBQYLIPJSIV-NYVOZVTQSA-N Ser-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CO)N TYIHBQYLIPJSIV-NYVOZVTQSA-N 0.000 description 4
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 4
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 4
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 4
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 4
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 4
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 4
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 4
- UGFOSENEZHEQKX-PJODQICGSA-N Trp-Val-Ala Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(=O)N[C@@H](C)C(O)=O UGFOSENEZHEQKX-PJODQICGSA-N 0.000 description 4
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 4
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 4
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 4
- XOVDRAVPGHTYLP-JYJNAYRXSA-N Tyr-Pro-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O XOVDRAVPGHTYLP-JYJNAYRXSA-N 0.000 description 4
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 4
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 4
- PWKMJDQXKCENMF-MEYUZBJRSA-N Tyr-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O PWKMJDQXKCENMF-MEYUZBJRSA-N 0.000 description 4
- KUXCBJFJURINGF-PXDAIIFMSA-N Tyr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N KUXCBJFJURINGF-PXDAIIFMSA-N 0.000 description 4
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 4
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 4
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 4
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 4
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 4
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 4
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 4
- WDIGUPHXPBMODF-UMNHJUIQSA-N Val-Glu-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N WDIGUPHXPBMODF-UMNHJUIQSA-N 0.000 description 4
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 4
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 4
- SVFRYKBZHUGKLP-QXEWZRGKSA-N Val-Met-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVFRYKBZHUGKLP-QXEWZRGKSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 150000001412 amines Chemical class 0.000 description 4
- 108010060035 arginylproline Proteins 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 210000004899 c-terminal region Anatomy 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 235000011180 diphosphates Nutrition 0.000 description 4
- 238000002523 gelfiltration Methods 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 4
- 108010091871 leucylmethionine Proteins 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 108010005942 methionylglycine Proteins 0.000 description 4
- 108010068488 methionylphenylalanine Proteins 0.000 description 4
- 101150068826 pglB gene Proteins 0.000 description 4
- 108010012581 phenylalanylglutamate Proteins 0.000 description 4
- 229920001282 polysaccharide Polymers 0.000 description 4
- 239000005017 polysaccharide Substances 0.000 description 4
- 108010015796 prolylisoleucine Proteins 0.000 description 4
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- FATXTKJILXPNJL-UHFFFAOYSA-N 2-[[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 FATXTKJILXPNJL-UHFFFAOYSA-N 0.000 description 3
- 241000606750 Actinobacillus Species 0.000 description 3
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 3
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 3
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 3
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 3
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 3
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 3
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 3
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 3
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 3
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 3
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 3
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 3
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 3
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 3
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 3
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 3
- PZBSKYJGKNNYNK-ULQDDVLXSA-N Arg-Leu-Tyr Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O PZBSKYJGKNNYNK-ULQDDVLXSA-N 0.000 description 3
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 3
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 3
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 3
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- GOVUDFOGXOONFT-VEVYYDQMSA-N Asn-Arg-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GOVUDFOGXOONFT-VEVYYDQMSA-N 0.000 description 3
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 3
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 3
- XIDSGDJNUJRUHE-VEVYYDQMSA-N Asn-Thr-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O XIDSGDJNUJRUHE-VEVYYDQMSA-N 0.000 description 3
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 3
- QIRJQYQOIKBPBZ-IHRRRGAJSA-N Asn-Tyr-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QIRJQYQOIKBPBZ-IHRRRGAJSA-N 0.000 description 3
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 3
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 3
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 3
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 3
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 3
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 3
- SXLCDCZHNCLFGZ-BPUTZDHNSA-N Asp-Pro-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SXLCDCZHNCLFGZ-BPUTZDHNSA-N 0.000 description 3
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 3
- JDDYEZGPYBBPBN-JRQIVUDYSA-N Asp-Thr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JDDYEZGPYBBPBN-JRQIVUDYSA-N 0.000 description 3
- XOASPVGNFAMYBD-WFBYXXMGSA-N Asp-Trp-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O XOASPVGNFAMYBD-WFBYXXMGSA-N 0.000 description 3
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 3
- 108020004705 Codon Proteins 0.000 description 3
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 3
- IAJILQKETJEXLJ-UHFFFAOYSA-N Galacturonsaeure Natural products O=CC(O)C(O)C(O)C(O)C(O)=O IAJILQKETJEXLJ-UHFFFAOYSA-N 0.000 description 3
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 3
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 3
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 3
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 3
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 3
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 3
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 3
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 3
- VAZZOGXDUQSVQF-NUMRIWBASA-N Glu-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)O VAZZOGXDUQSVQF-NUMRIWBASA-N 0.000 description 3
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 3
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 3
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 3
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 3
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 3
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 3
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 3
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 3
- TZXOPHFCAATANZ-QEJZJMRPSA-N Glu-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N TZXOPHFCAATANZ-QEJZJMRPSA-N 0.000 description 3
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 3
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 3
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 3
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 3
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 3
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 3
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 3
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 3
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 3
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 3
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 3
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 3
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 3
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- NIOPEYHPOBWLQO-KBPBESRZSA-N Gly-Trp-Glu Chemical compound NCC(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOPEYHPOBWLQO-KBPBESRZSA-N 0.000 description 3
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 3
- HOLOYAZCIHDQNS-YVNDNENWSA-N Ile-Gln-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HOLOYAZCIHDQNS-YVNDNENWSA-N 0.000 description 3
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 3
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 3
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 3
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 3
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 3
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 3
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 3
- WSSGUVAKYCQSCT-XUXIUFHCSA-N Ile-Met-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)O)N WSSGUVAKYCQSCT-XUXIUFHCSA-N 0.000 description 3
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 3
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 3
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 3
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 3
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 3
- 241000347881 Kadua laxiflora Species 0.000 description 3
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 3
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 3
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 3
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 3
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 3
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 3
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 3
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 3
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 3
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 3
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 3
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 3
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 3
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 3
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 3
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 3
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 3
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 3
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 3
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 3
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 3
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 3
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 3
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 3
- VQHUBNVKFFLWRP-ULQDDVLXSA-N Leu-Tyr-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 VQHUBNVKFFLWRP-ULQDDVLXSA-N 0.000 description 3
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 3
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 3
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 3
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 3
- DGAAQRAUOFHBFJ-CIUDSAMLSA-N Lys-Asn-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DGAAQRAUOFHBFJ-CIUDSAMLSA-N 0.000 description 3
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 3
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 3
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 3
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 3
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 3
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 3
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 3
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 3
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 3
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 3
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 3
- 229910021380 Manganese Chloride Inorganic materials 0.000 description 3
- GLFNIEUTAYBVOC-UHFFFAOYSA-L Manganese chloride Chemical compound Cl[Mn]Cl GLFNIEUTAYBVOC-UHFFFAOYSA-L 0.000 description 3
- MVQGZYIOMXAFQG-GUBZILKMSA-N Met-Ala-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MVQGZYIOMXAFQG-GUBZILKMSA-N 0.000 description 3
- BQVJARUIXRXDKN-DCAQKATOSA-N Met-Asn-His Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BQVJARUIXRXDKN-DCAQKATOSA-N 0.000 description 3
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 3
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 3
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 3
- OVRNDRQMDRJTHS-CBQIKETKSA-N N-Acetyl-D-Galactosamine Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@H](O)[C@@H]1O OVRNDRQMDRJTHS-CBQIKETKSA-N 0.000 description 3
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 3
- 241000588650 Neisseria meningitidis Species 0.000 description 3
- 244000020186 Nymphaea lutea Species 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 3
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 3
- CTNODEMQIKCZGQ-JYJNAYRXSA-N Phe-Gln-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 CTNODEMQIKCZGQ-JYJNAYRXSA-N 0.000 description 3
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 3
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 3
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 3
- OVJMCXAPGFDGMG-HKUYNNGSSA-N Phe-Gly-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OVJMCXAPGFDGMG-HKUYNNGSSA-N 0.000 description 3
- GXDPQJUBLBZKDY-IAVJCBSLSA-N Phe-Ile-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GXDPQJUBLBZKDY-IAVJCBSLSA-N 0.000 description 3
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 3
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 3
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 3
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 3
- FENSZYFJQOFSQR-FIRPJDEBSA-N Phe-Phe-Ile Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FENSZYFJQOFSQR-FIRPJDEBSA-N 0.000 description 3
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 3
- WEQJQNWXCSUVMA-RYUDHWBXSA-N Phe-Pro Chemical compound C([C@H]([NH3+])C(=O)N1[C@@H](CCC1)C([O-])=O)C1=CC=CC=C1 WEQJQNWXCSUVMA-RYUDHWBXSA-N 0.000 description 3
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 3
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 3
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 3
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 3
- 108010066816 Polypeptide N-acetylgalactosaminyltransferase Proteins 0.000 description 3
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 3
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 3
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 3
- LNOWDSPAYBWJOR-PEDHHIEDSA-N Pro-Ile-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LNOWDSPAYBWJOR-PEDHHIEDSA-N 0.000 description 3
- WCNVGGZRTNHOOS-ULQDDVLXSA-N Pro-Lys-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O WCNVGGZRTNHOOS-ULQDDVLXSA-N 0.000 description 3
- DSGSTPRKNYHGCL-JYJNAYRXSA-N Pro-Phe-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DSGSTPRKNYHGCL-JYJNAYRXSA-N 0.000 description 3
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 3
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 3
- 208000004160 Rasmussen subacute encephalitis Diseases 0.000 description 3
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 3
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 3
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 3
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 3
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 3
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 3
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 3
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 3
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 3
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 3
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 3
- TVPQRPNBYCRRLL-IHRRRGAJSA-N Ser-Phe-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O TVPQRPNBYCRRLL-IHRRRGAJSA-N 0.000 description 3
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 3
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 3
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 3
- 241000194017 Streptococcus Species 0.000 description 3
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 3
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 3
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 3
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 3
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 3
- DXPURPNJDFCKKO-RHYQMDGZSA-N Thr-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DXPURPNJDFCKKO-RHYQMDGZSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- 241000209140 Triticum Species 0.000 description 3
- 235000021307 Triticum Nutrition 0.000 description 3
- 241001504505 Troglodytes troglodytes Species 0.000 description 3
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 3
- YTCNLMSUXPCFBW-SXNHZJKMSA-N Trp-Ile-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O YTCNLMSUXPCFBW-SXNHZJKMSA-N 0.000 description 3
- QUIXRGCMQOXUSV-SZMVWBNQSA-N Trp-Pro-Pro Chemical compound O=C([C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)N1CCC[C@H]1C(O)=O QUIXRGCMQOXUSV-SZMVWBNQSA-N 0.000 description 3
- NSOMQRHZMJMZIE-GVARAGBVSA-N Tyr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NSOMQRHZMJMZIE-GVARAGBVSA-N 0.000 description 3
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 3
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 3
- YWXMGBUGMLJMIP-IHPCNDPISA-N Tyr-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YWXMGBUGMLJMIP-IHPCNDPISA-N 0.000 description 3
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 3
- WDGDKHLSDIOXQC-ACRUOGEOSA-N Tyr-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WDGDKHLSDIOXQC-ACRUOGEOSA-N 0.000 description 3
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 3
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 3
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 3
- FASACHWGQBNSRO-ZEWNOJEFSA-N Tyr-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FASACHWGQBNSRO-ZEWNOJEFSA-N 0.000 description 3
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 3
- YYLHVUCSTXXKBS-IHRRRGAJSA-N Tyr-Pro-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YYLHVUCSTXXKBS-IHRRRGAJSA-N 0.000 description 3
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 3
- TYFLVOUZHQUBGM-IHRRRGAJSA-N Tyr-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TYFLVOUZHQUBGM-IHRRRGAJSA-N 0.000 description 3
- CLEGSEJVGBYZBJ-MEYUZBJRSA-N Tyr-Thr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CLEGSEJVGBYZBJ-MEYUZBJRSA-N 0.000 description 3
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 3
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 3
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 3
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 3
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 3
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 3
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 3
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 3
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 3
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 3
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 3
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 3
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 3
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 3
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 3
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 3
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 3
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 3
- 241000607598 Vibrio Species 0.000 description 3
- 241000605939 Wolinella succinogenes Species 0.000 description 3
- 238000001042 affinity chromatography Methods 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 239000000427 antigen Substances 0.000 description 3
- 108010008355 arginyl-glutamine Proteins 0.000 description 3
- SQVRNKJHWKZAKO-UHFFFAOYSA-N beta-N-Acetyl-D-neuraminic acid Natural products CC(=O)NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO SQVRNKJHWKZAKO-UHFFFAOYSA-N 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 230000003111 delayed effect Effects 0.000 description 3
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 3
- 239000002158 endotoxin Substances 0.000 description 3
- 239000003623 enhancer Substances 0.000 description 3
- 210000003527 eukaryotic cell Anatomy 0.000 description 3
- 239000013604 expression vector Substances 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 3
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 150000002386 heptoses Chemical class 0.000 description 3
- 230000000977 initiatory effect Effects 0.000 description 3
- 238000002955 isolation Methods 0.000 description 3
- 229920006008 lipopolysaccharide Polymers 0.000 description 3
- 239000011565 manganese chloride Substances 0.000 description 3
- 235000002867 manganese chloride Nutrition 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 108010056582 methionylglutamic acid Proteins 0.000 description 3
- 210000001589 microsome Anatomy 0.000 description 3
- 239000000203 mixture Substances 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 150000002772 monosaccharides Chemical group 0.000 description 3
- 239000008188 pellet Substances 0.000 description 3
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000012846 protein folding Effects 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 102000005962 receptors Human genes 0.000 description 3
- 108020003175 receptors Proteins 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 210000001995 reticulocyte Anatomy 0.000 description 3
- 230000028327 secretion Effects 0.000 description 3
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 150000008163 sugars Chemical class 0.000 description 3
- 239000006228 supernatant Substances 0.000 description 3
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 2
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 2
- WPXFILQZNKUYQO-BZSNNMDCSA-N 2-[[(2s)-2-[[(2s)-1-[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 WPXFILQZNKUYQO-BZSNNMDCSA-N 0.000 description 2
- JUEUYDRZJNQZGR-UHFFFAOYSA-N 2-[[2-[[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]amino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JUEUYDRZJNQZGR-UHFFFAOYSA-N 0.000 description 2
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 2
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 2
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 2
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 2
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 2
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- IHRGVZXPTIQNIP-NAKRPEOUSA-N Ala-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C)N IHRGVZXPTIQNIP-NAKRPEOUSA-N 0.000 description 2
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 2
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 2
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 2
- JVMKBJNSRZWDBO-FXQIFTODSA-N Arg-Cys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O JVMKBJNSRZWDBO-FXQIFTODSA-N 0.000 description 2
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 2
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 2
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 2
- OISWSORSLQOGFV-AVGNSLFASA-N Arg-Met-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N OISWSORSLQOGFV-AVGNSLFASA-N 0.000 description 2
- UGZUVYDKAYNCII-ULQDDVLXSA-N Arg-Phe-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UGZUVYDKAYNCII-ULQDDVLXSA-N 0.000 description 2
- SLQQPJBDBVPVQV-JYJNAYRXSA-N Arg-Phe-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O SLQQPJBDBVPVQV-JYJNAYRXSA-N 0.000 description 2
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 2
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 2
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 2
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 2
- FANGHKQYFPYDNB-UBHSHLNASA-N Asn-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N FANGHKQYFPYDNB-UBHSHLNASA-N 0.000 description 2
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 2
- MECFLTFREHAZLH-ACZMJKKPSA-N Asn-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N MECFLTFREHAZLH-ACZMJKKPSA-N 0.000 description 2
- OLGCWMNDJTWQAG-GUBZILKMSA-N Asn-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(N)=O OLGCWMNDJTWQAG-GUBZILKMSA-N 0.000 description 2
- GYOHQKJEQQJBOY-QEJZJMRPSA-N Asn-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N GYOHQKJEQQJBOY-QEJZJMRPSA-N 0.000 description 2
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 2
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 2
- QDXQWFBLUVTOFL-FXQIFTODSA-N Asn-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(=O)N)N QDXQWFBLUVTOFL-FXQIFTODSA-N 0.000 description 2
- OMSMPWHEGLNQOD-UWVGGRQHSA-N Asn-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMSMPWHEGLNQOD-UWVGGRQHSA-N 0.000 description 2
- MVXJBVVLACEGCG-PCBIJLKTSA-N Asn-Phe-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVXJBVVLACEGCG-PCBIJLKTSA-N 0.000 description 2
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 2
- UYCPJVYQYARFGB-YDHLFZDLSA-N Asn-Phe-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O UYCPJVYQYARFGB-YDHLFZDLSA-N 0.000 description 2
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 2
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 2
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 2
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 2
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 2
- 241001364856 Draba discoidea Species 0.000 description 2
- 241000588722 Escherichia Species 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 2
- CITDWMLWXNUQKD-FXQIFTODSA-N Gln-Gln-Asn Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CITDWMLWXNUQKD-FXQIFTODSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- JNENSVNAUWONEZ-GUBZILKMSA-N Gln-Lys-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JNENSVNAUWONEZ-GUBZILKMSA-N 0.000 description 2
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 2
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 2
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 2
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 2
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 2
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 2
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 2
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 2
- HMHRTKOWRUPPNU-RCOVLWMOSA-N Gly-Ile-Gly Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O HMHRTKOWRUPPNU-RCOVLWMOSA-N 0.000 description 2
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 2
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 2
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 2
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- CVFOYJJOZYYEPE-KBPBESRZSA-N Gly-Lys-Tyr Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CVFOYJJOZYYEPE-KBPBESRZSA-N 0.000 description 2
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 2
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 2
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 2
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 2
- KOYUSMBPJOVSOO-XEGUGMAKSA-N Gly-Tyr-Ile Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KOYUSMBPJOVSOO-XEGUGMAKSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 2
- 241000606790 Haemophilus Species 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 2
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 2
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 2
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 2
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- YVCGJPIKRMGNPA-LSJOCFKGSA-N His-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O YVCGJPIKRMGNPA-LSJOCFKGSA-N 0.000 description 2
- XDIVYNSPYBLSME-DCAQKATOSA-N His-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N XDIVYNSPYBLSME-DCAQKATOSA-N 0.000 description 2
- YAEKRYQASVCDLK-JYJNAYRXSA-N His-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N YAEKRYQASVCDLK-JYJNAYRXSA-N 0.000 description 2
- FBVHRDXSCYELMI-PBCZWWQYSA-N His-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O FBVHRDXSCYELMI-PBCZWWQYSA-N 0.000 description 2
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 2
- XGBVLRJLHUVCNK-DCAQKATOSA-N His-Val-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O XGBVLRJLHUVCNK-DCAQKATOSA-N 0.000 description 2
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 2
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 2
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 2
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 2
- CWJQMCPYXNVMBS-STECZYCISA-N Ile-Arg-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N CWJQMCPYXNVMBS-STECZYCISA-N 0.000 description 2
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 2
- KIMHKBDJQQYLHU-PEFMBERDSA-N Ile-Glu-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KIMHKBDJQQYLHU-PEFMBERDSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- MQFGXJNSUJTXDT-QSFUFRPTSA-N Ile-Gly-Ile Chemical compound N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)O MQFGXJNSUJTXDT-QSFUFRPTSA-N 0.000 description 2
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 2
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 2
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- FFJQAEYLAQMGDL-MGHWNKPDSA-N Ile-Lys-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FFJQAEYLAQMGDL-MGHWNKPDSA-N 0.000 description 2
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 2
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- AGGIYSLVUKVOPT-HTFCKZLJSA-N Ile-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N AGGIYSLVUKVOPT-HTFCKZLJSA-N 0.000 description 2
- PELCGFMHLZXWBQ-BJDJZHNGSA-N Ile-Ser-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)O)N PELCGFMHLZXWBQ-BJDJZHNGSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- GMUYXHHJAGQHGB-TUBUOCAGSA-N Ile-Thr-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GMUYXHHJAGQHGB-TUBUOCAGSA-N 0.000 description 2
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 2
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- DLEBSGAVWRPTIX-PEDHHIEDSA-N Ile-Val-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)[C@@H](C)CC DLEBSGAVWRPTIX-PEDHHIEDSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 2
- SHZGCJCMOBCMKK-DHVFOXMCSA-N L-fucopyranose Chemical compound C[C@@H]1OC(O)[C@@H](O)[C@H](O)[C@@H]1O SHZGCJCMOBCMKK-DHVFOXMCSA-N 0.000 description 2
- PNNNRSAQSRJVSB-UHFFFAOYSA-N L-rhamnose Natural products CC(O)C(O)C(O)C(O)C=O PNNNRSAQSRJVSB-UHFFFAOYSA-N 0.000 description 2
- 241000222734 Leishmania mexicana Species 0.000 description 2
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 2
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 2
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 2
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 2
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 2
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 2
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 2
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 2
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 2
- PKKMDPNFGULLNQ-AVGNSLFASA-N Leu-Met-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O PKKMDPNFGULLNQ-AVGNSLFASA-N 0.000 description 2
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 2
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 2
- AIRUUHAOKGVJAD-JYJNAYRXSA-N Leu-Phe-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIRUUHAOKGVJAD-JYJNAYRXSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 2
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 2
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 2
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 2
- YUTNOGOMBNYPFH-XUXIUFHCSA-N Leu-Pro-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YUTNOGOMBNYPFH-XUXIUFHCSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 2
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 2
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 2
- ONHCDMBHPQIPAI-YTQUADARSA-N Leu-Trp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N ONHCDMBHPQIPAI-YTQUADARSA-N 0.000 description 2
- WPIKRJDRQVFRHP-TUSQITKMSA-N Leu-Trp-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O WPIKRJDRQVFRHP-TUSQITKMSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- 241000186781 Listeria Species 0.000 description 2
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 2
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 2
- RFQATBGBLDAKGI-VHSXEESVSA-N Lys-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCCN)N)C(=O)O RFQATBGBLDAKGI-VHSXEESVSA-N 0.000 description 2
- HQXSFFSLXFHWOX-IXOXFDKPSA-N Lys-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N)O HQXSFFSLXFHWOX-IXOXFDKPSA-N 0.000 description 2
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 2
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- BXPHMHQHYHILBB-BZSNNMDCSA-N Lys-Lys-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BXPHMHQHYHILBB-BZSNNMDCSA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 2
- SBQDRNOLGSYHQA-YUMQZZPRSA-N Lys-Ser-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SBQDRNOLGSYHQA-YUMQZZPRSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- ZFNYWKHYUMEZDZ-WDSOQIARSA-N Lys-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCCN)N ZFNYWKHYUMEZDZ-WDSOQIARSA-N 0.000 description 2
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 2
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 2
- VWJFOUBDZIUXGA-AVGNSLFASA-N Lys-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N VWJFOUBDZIUXGA-AVGNSLFASA-N 0.000 description 2
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 2
- VEJMLLWTNKQWKM-SHYLFXLQSA-N Man3GlcNAc Chemical compound CC(=O)N[C@H]1[C@@H](O)O[C@H](CO)[C@@H](O[C@@H]2O[C@H](CO[C@H]3O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]3O)[C@@H](O)[C@H](O[C@H]3O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]3O)[C@@H]2O)[C@@H]1O VEJMLLWTNKQWKM-SHYLFXLQSA-N 0.000 description 2
- 241000604449 Megasphaera Species 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- MDXAULHWGWETHF-SRVKXCTJSA-N Met-Arg-Val Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCNC(N)=N MDXAULHWGWETHF-SRVKXCTJSA-N 0.000 description 2
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 2
- MCNGIXXCMJAURZ-VEVYYDQMSA-N Met-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCSC)N)O MCNGIXXCMJAURZ-VEVYYDQMSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 2
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 2
- YLBUMXYVQCHBPR-ULQDDVLXSA-N Met-Leu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YLBUMXYVQCHBPR-ULQDDVLXSA-N 0.000 description 2
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- FNYBIOGBMWFQRJ-SRVKXCTJSA-N Met-Pro-Met Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)O)N FNYBIOGBMWFQRJ-SRVKXCTJSA-N 0.000 description 2
- XIGAHPDZLAYQOS-SRVKXCTJSA-N Met-Pro-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 XIGAHPDZLAYQOS-SRVKXCTJSA-N 0.000 description 2
- MNGBICITWAPGAS-BPUTZDHNSA-N Met-Ser-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MNGBICITWAPGAS-BPUTZDHNSA-N 0.000 description 2
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 2
- ATBJCCFCJXCNGZ-UFYCRDLUSA-N Met-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 ATBJCCFCJXCNGZ-UFYCRDLUSA-N 0.000 description 2
- OVRNDRQMDRJTHS-FMDGEEDCSA-N N-acetyl-beta-D-glucosamine Chemical compound CC(=O)N[C@H]1[C@H](O)O[C@H](CO)[C@@H](O)[C@@H]1O OVRNDRQMDRJTHS-FMDGEEDCSA-N 0.000 description 2
- 108010047562 NGR peptide Proteins 0.000 description 2
- 241000606012 Pectinatus Species 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- LBSARGIQACMGDF-WBAXXEDZSA-N Phe-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 LBSARGIQACMGDF-WBAXXEDZSA-N 0.000 description 2
- BRDYYVQTEJVRQT-HRCADAONSA-N Phe-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BRDYYVQTEJVRQT-HRCADAONSA-N 0.000 description 2
- QCHNRQQVLJYDSI-DLOVCJGASA-N Phe-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 QCHNRQQVLJYDSI-DLOVCJGASA-N 0.000 description 2
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 2
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 2
- WLYPRKLMRIYGPP-JYJNAYRXSA-N Phe-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 WLYPRKLMRIYGPP-JYJNAYRXSA-N 0.000 description 2
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 2
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 2
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 2
- OWSLLRKCHLTUND-BZSNNMDCSA-N Phe-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OWSLLRKCHLTUND-BZSNNMDCSA-N 0.000 description 2
- CBENHWCORLVGEQ-HJOGWXRNSA-N Phe-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CBENHWCORLVGEQ-HJOGWXRNSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- BPIMVBKDLSBKIJ-FCLVOEFKSA-N Phe-Thr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BPIMVBKDLSBKIJ-FCLVOEFKSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- BPIFSOUEUYDJRM-DCPHZVHLSA-N Phe-Trp-Ala Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](C)C(O)=O)C1=CC=CC=C1 BPIFSOUEUYDJRM-DCPHZVHLSA-N 0.000 description 2
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 2
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 2
- VDTYRPWRWRCROL-UFYCRDLUSA-N Phe-Val-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VDTYRPWRWRCROL-UFYCRDLUSA-N 0.000 description 2
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 2
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 2
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 2
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 2
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 2
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 2
- ZZCJYPLMOPTZFC-SRVKXCTJSA-N Pro-Met-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O ZZCJYPLMOPTZFC-SRVKXCTJSA-N 0.000 description 2
- XYAFCOJKICBRDU-JYJNAYRXSA-N Pro-Phe-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O XYAFCOJKICBRDU-JYJNAYRXSA-N 0.000 description 2
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 2
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 2
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 2
- QHSSUIHLAIWXEE-IHRRRGAJSA-N Pro-Tyr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O QHSSUIHLAIWXEE-IHRRRGAJSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 2
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 2
- 241001148023 Pyrococcus abyssi Species 0.000 description 2
- 241000522615 Pyrococcus horikoshii Species 0.000 description 2
- 241001467519 Pyrococcus sp. Species 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 2
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 2
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 2
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 2
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 2
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- UICKAKRRRBTILH-GUBZILKMSA-N Ser-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N UICKAKRRRBTILH-GUBZILKMSA-N 0.000 description 2
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 2
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 2
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 2
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 2
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- SDFUZKIAHWRUCS-QEJZJMRPSA-N Ser-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N SDFUZKIAHWRUCS-QEJZJMRPSA-N 0.000 description 2
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 2
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 2
- OSFZCEQJLWCIBG-BZSNNMDCSA-N Ser-Tyr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OSFZCEQJLWCIBG-BZSNNMDCSA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 241000607720 Serratia Species 0.000 description 2
- 102000003838 Sialyltransferases Human genes 0.000 description 2
- 108090000141 Sialyltransferases Proteins 0.000 description 2
- 241000191940 Staphylococcus Species 0.000 description 2
- 241000205188 Thermococcus Species 0.000 description 2
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 2
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 2
- DKDHTRVDOUZZTP-IFFSRLJSSA-N Thr-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O DKDHTRVDOUZZTP-IFFSRLJSSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 2
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 2
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 2
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- KKPOGALELPLJTL-MEYUZBJRSA-N Thr-Lys-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KKPOGALELPLJTL-MEYUZBJRSA-N 0.000 description 2
- PUEWAXRPXOEQOW-HJGDQZAQSA-N Thr-Met-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(O)=O PUEWAXRPXOEQOW-HJGDQZAQSA-N 0.000 description 2
- WRUWXBBEFUTJOU-XGEHTFHBSA-N Thr-Met-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N)O WRUWXBBEFUTJOU-XGEHTFHBSA-N 0.000 description 2
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 2
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 2
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 2
- GRIUMVXCJDKVPI-IZPVPAKOSA-N Thr-Thr-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GRIUMVXCJDKVPI-IZPVPAKOSA-N 0.000 description 2
- NDLHSJWPCXKOGG-VLCNGCBASA-N Thr-Trp-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N)O NDLHSJWPCXKOGG-VLCNGCBASA-N 0.000 description 2
- PELIQFPESHBTMA-WLTAIBSBSA-N Thr-Tyr-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 PELIQFPESHBTMA-WLTAIBSBSA-N 0.000 description 2
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 2
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 2
- 102000004357 Transferases Human genes 0.000 description 2
- 108090000992 Transferases Proteins 0.000 description 2
- 239000007983 Tris buffer Substances 0.000 description 2
- NKUIXQOJUAEIET-AQZXSJQPSA-N Trp-Asp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 NKUIXQOJUAEIET-AQZXSJQPSA-N 0.000 description 2
- PHNBFZBKLWEBJN-BPUTZDHNSA-N Trp-Glu-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PHNBFZBKLWEBJN-BPUTZDHNSA-N 0.000 description 2
- FEZASNVQLJQBHW-CABZTGNLSA-N Trp-Gly-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O)=CNC2=C1 FEZASNVQLJQBHW-CABZTGNLSA-N 0.000 description 2
- BONYBFXWMXBAND-GQGQLFGLSA-N Trp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BONYBFXWMXBAND-GQGQLFGLSA-N 0.000 description 2
- AIISTODACBDQLW-WDSOQIARSA-N Trp-Leu-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 AIISTODACBDQLW-WDSOQIARSA-N 0.000 description 2
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 2
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 2
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 2
- ZWZOCUWOXSDYFZ-CQDKDKBSSA-N Tyr-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZWZOCUWOXSDYFZ-CQDKDKBSSA-N 0.000 description 2
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 2
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 2
- NSTPFWRAIDTNGH-BZSNNMDCSA-N Tyr-Asn-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NSTPFWRAIDTNGH-BZSNNMDCSA-N 0.000 description 2
- TWAVEIJGFCBWCG-JYJNAYRXSA-N Tyr-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N TWAVEIJGFCBWCG-JYJNAYRXSA-N 0.000 description 2
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 2
- HDSKHCBAVVWPCQ-FHWLQOOXSA-N Tyr-Glu-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HDSKHCBAVVWPCQ-FHWLQOOXSA-N 0.000 description 2
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 2
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 2
- ARSHSYUZHSIYKR-ACRUOGEOSA-N Tyr-His-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ARSHSYUZHSIYKR-ACRUOGEOSA-N 0.000 description 2
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 2
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 2
- QSFJHIRIHOJRKS-ULQDDVLXSA-N Tyr-Leu-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QSFJHIRIHOJRKS-ULQDDVLXSA-N 0.000 description 2
- BJCILVZEZRDIDR-PMVMPFDFSA-N Tyr-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 BJCILVZEZRDIDR-PMVMPFDFSA-N 0.000 description 2
- WURLIFOWSMBUAR-SLFFLAALSA-N Tyr-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O WURLIFOWSMBUAR-SLFFLAALSA-N 0.000 description 2
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 2
- SOEGLGLDSUHWTI-STECZYCISA-N Tyr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 SOEGLGLDSUHWTI-STECZYCISA-N 0.000 description 2
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 2
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 2
- ZPFLBLFITJCBTP-QWRGUYRKSA-N Tyr-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O ZPFLBLFITJCBTP-QWRGUYRKSA-N 0.000 description 2
- JAQGKXUEKGKTKX-HOTGVXAUSA-N Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 2
- HZDQUVQEVVYDDA-ACRUOGEOSA-N Tyr-Tyr-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 HZDQUVQEVVYDDA-ACRUOGEOSA-N 0.000 description 2
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 2
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 2
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- VJOWWOGRNXRQMF-UVBJJODRSA-N Val-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C)C(O)=O)=CNC2=C1 VJOWWOGRNXRQMF-UVBJJODRSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- XPYNXORPPVTVQK-SRVKXCTJSA-N Val-Arg-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCSC)C(=O)O)N XPYNXORPPVTVQK-SRVKXCTJSA-N 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 2
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 2
- VLOYGOZDPGYWFO-LAEOZQHASA-N Val-Asp-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VLOYGOZDPGYWFO-LAEOZQHASA-N 0.000 description 2
- OVLIFGQSBSNGHY-KKHAAJSZSA-N Val-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N)O OVLIFGQSBSNGHY-KKHAAJSZSA-N 0.000 description 2
- XEYUMGGWQCIWAR-XVKPBYJWSA-N Val-Gln-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N XEYUMGGWQCIWAR-XVKPBYJWSA-N 0.000 description 2
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 2
- OQWNEUXPKHIEJO-NRPADANISA-N Val-Glu-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N OQWNEUXPKHIEJO-NRPADANISA-N 0.000 description 2
- PMXBARDFIAPBGK-DZKIICNBSA-N Val-Glu-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PMXBARDFIAPBGK-DZKIICNBSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- BZMIYHIJVVJPCK-QSFUFRPTSA-N Val-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N BZMIYHIJVVJPCK-QSFUFRPTSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 2
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 2
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 2
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 2
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 2
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 2
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 2
- 241000605941 Wolinella Species 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 2
- NTXGVHCCXVHYCL-RDQGWRCRSA-N all-trans-undecaprenyl diphosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\COP(O)(=O)OP(O)(O)=O NTXGVHCCXVHYCL-RDQGWRCRSA-N 0.000 description 2
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010047857 aspartylglycine Proteins 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- UOKKJQVOZSYEJM-JGWLITMVSA-N bacillosamine Chemical compound C[C@@H](O)[C@@H](N)[C@H](O)[C@@H](N)C=O UOKKJQVOZSYEJM-JGWLITMVSA-N 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 2
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 2
- 108010064886 beta-D-galactoside alpha 2-6-sialyltransferase Proteins 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- NKLPQNGYXWVELD-UHFFFAOYSA-M coomassie brilliant blue Chemical compound [Na+].C1=CC(OCC)=CC=C1NC1=CC=C(C(=C2C=CC(C=C2)=[N+](CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=2C=CC(=CC=2)N(CC)CC=2C=C(C=CC=2)S([O-])(=O)=O)C=C1 NKLPQNGYXWVELD-UHFFFAOYSA-M 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- UFPHFKCTOZIAFY-NTDVEAECSA-N ditrans,polycis-undecaprenyl phosphate Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C/CC\C(C)=C/CC\C(C)=C/CC\C(C)=C/CC\C(C)=C/CC\C(C)=C/CC\C(C)=C/CC\C(C)=C/COP(O)(O)=O UFPHFKCTOZIAFY-NTDVEAECSA-N 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 229940097043 glucuronic acid Drugs 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010077515 glycylproline Proteins 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- OUUQCZGPVNCOIJ-UHFFFAOYSA-N hydroperoxyl Chemical compound O[O] OUUQCZGPVNCOIJ-UHFFFAOYSA-N 0.000 description 2
- 230000002779 inactivation Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 230000003834 intracellular effect Effects 0.000 description 2
- 108010027338 isoleucylcysteine Proteins 0.000 description 2
- 210000001630 jejunum Anatomy 0.000 description 2
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 108010085203 methionylmethionine Proteins 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 230000003228 microsomal effect Effects 0.000 description 2
- 229950006780 n-acetylglucosamine Drugs 0.000 description 2
- 108010009920 neokyotorphin (1-4) Proteins 0.000 description 2
- 229910052759 nickel Inorganic materials 0.000 description 2
- 108010084572 phenylalanyl-valine Proteins 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 2
- 229950007002 phosphocreatine Drugs 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 108010048818 seryl-histidine Proteins 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 2
- 108010044292 tryptophyltyrosine Proteins 0.000 description 2
- 108010005834 tyrosyl-alanyl-glycine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 238000005199 ultracentrifugation Methods 0.000 description 2
- 101150017134 wecA gene Proteins 0.000 description 2
- TXKJNHBRVLCYFX-UHFFFAOYSA-N (2Z,6Z,10Z,14Z,18Z,22Z,26Z,30E,34E,38E)-3,7,11,15,19,23,27,31,35,39,43-undecamethyl-tetratetraconta-2,6,10,14,18,22,26,30,34,38,42-undecaen-1-ol Natural products CC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCCC(C)=CCO TXKJNHBRVLCYFX-UHFFFAOYSA-N 0.000 description 1
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- RVLOMLVNNBWRSR-KNIFDHDWSA-N (2s)-2-aminopropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound C[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O RVLOMLVNNBWRSR-KNIFDHDWSA-N 0.000 description 1
- PNNNRSAQSRJVSB-ARQDHWQXSA-N (2s,3s,4s,5r)-2,3,4,5-tetrahydroxyhexanal Chemical compound C[C@@H](O)[C@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-ARQDHWQXSA-N 0.000 description 1
- WEZDRVHTDXTVLT-GJZGRUSLSA-N 2-[[(2s)-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WEZDRVHTDXTVLT-GJZGRUSLSA-N 0.000 description 1
- ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylpentanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylpentanoic acid Chemical compound CCC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)CC)C(O)=O ZWZOCNTYMUOGPQ-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- 125000003143 4-hydroxybenzyl group Chemical group [H]C([*])([H])C1=C([H])C([H])=C(O[H])C([H])=C1[H] 0.000 description 1
- PZUPAGRIHCRVKN-UHFFFAOYSA-N 5-[5-[3,4-dihydroxy-6-[(3,4,5-trihydroxyoxan-2-yl)oxymethyl]-5-[3,4,5-trihydroxy-6-[(3,4,5-trihydroxyoxan-2-yl)oxymethyl]oxan-2-yl]oxyoxan-2-yl]oxy-3,4-dihydroxy-6-[(3,4,5-trihydroxyoxan-2-yl)oxymethyl]oxan-2-yl]oxy-6-(hydroxymethyl)oxane-2,3,4-triol Chemical compound OCC1OC(O)C(O)C(O)C1OC1C(O)C(O)C(OC2C(C(O)C(OC3C(C(O)C(O)C(COC4C(C(O)C(O)CO4)O)O3)O)C(COC3C(C(O)C(O)CO3)O)O2)O)C(COC2C(C(O)C(O)CO2)O)O1 PZUPAGRIHCRVKN-UHFFFAOYSA-N 0.000 description 1
- 241000607534 Aeromonas Species 0.000 description 1
- 241000607519 Aeromonas sp. Species 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 1
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- JQDFGZKKXBEANU-IMJSIDKUSA-N Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(O)=O JQDFGZKKXBEANU-IMJSIDKUSA-N 0.000 description 1
- UQJUGHFKNKGHFQ-VZFHVOOUSA-N Ala-Cys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UQJUGHFKNKGHFQ-VZFHVOOUSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 1
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- UHMQKOBNPRAZGB-CIUDSAMLSA-N Ala-Glu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N UHMQKOBNPRAZGB-CIUDSAMLSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- FVNAUOZKIPAYNA-BPNCWPANSA-N Ala-Met-Tyr Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FVNAUOZKIPAYNA-BPNCWPANSA-N 0.000 description 1
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 1
- CJQAEJMHBAOQHA-DLOVCJGASA-N Ala-Phe-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CJQAEJMHBAOQHA-DLOVCJGASA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 1
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UFBFGSQYSA-N Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UFBFGSQYSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 1
- QDGMZAOSMNGBLP-MRFFXTKBSA-N Ala-Trp-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N QDGMZAOSMNGBLP-MRFFXTKBSA-N 0.000 description 1
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 1
- YCTIYBUTCKNOTI-UWJYBYFXSA-N Ala-Tyr-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCTIYBUTCKNOTI-UWJYBYFXSA-N 0.000 description 1
- GCTANJIJJROSLH-GVARAGBVSA-N Ala-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C)N GCTANJIJJROSLH-GVARAGBVSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 description 1
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 description 1
- KGSJCPBERYUXCN-BPNCWPANSA-N Arg-Ala-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KGSJCPBERYUXCN-BPNCWPANSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 1
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- BNODVYXZAAXSHW-IUCAKERBSA-N Arg-His Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 BNODVYXZAAXSHW-IUCAKERBSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 1
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 1
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- PYZPXCZNQSEHDT-GUBZILKMSA-N Arg-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N PYZPXCZNQSEHDT-GUBZILKMSA-N 0.000 description 1
- JCROZIFVIYMXHM-GUBZILKMSA-N Arg-Met-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCCN=C(N)N JCROZIFVIYMXHM-GUBZILKMSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- CGWVCWFQGXOUSJ-ULQDDVLXSA-N Arg-Tyr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O CGWVCWFQGXOUSJ-ULQDDVLXSA-N 0.000 description 1
- LFWOQHSQNCKXRU-UFYCRDLUSA-N Arg-Tyr-Phe Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 LFWOQHSQNCKXRU-UFYCRDLUSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- AKEBUSZTMQLNIX-UWJYBYFXSA-N Asn-Ala-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N AKEBUSZTMQLNIX-UWJYBYFXSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- BVLIJXXSXBUGEC-SRVKXCTJSA-N Asn-Asn-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BVLIJXXSXBUGEC-SRVKXCTJSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- XVVOVPFMILMHPX-ZLUOBGJFSA-N Asn-Asp-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XVVOVPFMILMHPX-ZLUOBGJFSA-N 0.000 description 1
- ZWASIOHRQWRWAS-UGYAYLCHSA-N Asn-Asp-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZWASIOHRQWRWAS-UGYAYLCHSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- OKZOABJQOMAYEC-NUMRIWBASA-N Asn-Gln-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OKZOABJQOMAYEC-NUMRIWBASA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 1
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 1
- JZDZLBJVYWIIQU-AVGNSLFASA-N Asn-Glu-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JZDZLBJVYWIIQU-AVGNSLFASA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- UDSVWSUXKYXSTR-QWRGUYRKSA-N Asn-Gly-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UDSVWSUXKYXSTR-QWRGUYRKSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 1
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 1
- RAUPFUCUDBQYHE-AVGNSLFASA-N Asn-Phe-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RAUPFUCUDBQYHE-AVGNSLFASA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 1
- VBKIFHUVGLOJKT-FKZODXBYSA-N Asn-Thr Chemical compound C[C@@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)N)O VBKIFHUVGLOJKT-FKZODXBYSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- CPYHLXSGDBDULY-IHPCNDPISA-N Asn-Trp-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CPYHLXSGDBDULY-IHPCNDPISA-N 0.000 description 1
- JPPLRQVZMZFOSX-UWJYBYFXSA-N Asn-Tyr-Ala Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 JPPLRQVZMZFOSX-UWJYBYFXSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- DPSUVAPLRQDWAO-YDHLFZDLSA-N Asn-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)N)N DPSUVAPLRQDWAO-YDHLFZDLSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- ZELQAFZSJOBEQS-ACZMJKKPSA-N Asp-Asn-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZELQAFZSJOBEQS-ACZMJKKPSA-N 0.000 description 1
- UGKZHCBLMLSANF-CIUDSAMLSA-N Asp-Asn-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O UGKZHCBLMLSANF-CIUDSAMLSA-N 0.000 description 1
- HOQGTAIGQSDCHR-SRVKXCTJSA-N Asp-Asn-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HOQGTAIGQSDCHR-SRVKXCTJSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 1
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- BKXPJCBEHWFSTF-ACZMJKKPSA-N Asp-Gln-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O BKXPJCBEHWFSTF-ACZMJKKPSA-N 0.000 description 1
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 1
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 1
- GBSUGIXJAAKZOW-GMOBBJLQSA-N Asp-Ile-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GBSUGIXJAAKZOW-GMOBBJLQSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- LBFYTUPYYZENIR-GHCJXIJMSA-N Asp-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N LBFYTUPYYZENIR-GHCJXIJMSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 1
- JTRDJYIZIKCIRC-AJNGGQMLSA-N Asp-Leu-Leu-Gln Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTRDJYIZIKCIRC-AJNGGQMLSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- XWSIYTYNLKCLJB-CIUDSAMLSA-N Asp-Lys-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O XWSIYTYNLKCLJB-CIUDSAMLSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 1
- BPTFNDRZKBFMTH-DCAQKATOSA-N Asp-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N BPTFNDRZKBFMTH-DCAQKATOSA-N 0.000 description 1
- YZQCXOFQZKCETR-UWVGGRQHSA-N Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YZQCXOFQZKCETR-UWVGGRQHSA-N 0.000 description 1
- GWIJZUVQVDJHDI-AVGNSLFASA-N Asp-Phe-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GWIJZUVQVDJHDI-AVGNSLFASA-N 0.000 description 1
- PCJOFZYFFMBZKC-PCBIJLKTSA-N Asp-Phe-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PCJOFZYFFMBZKC-PCBIJLKTSA-N 0.000 description 1
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 1
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 1
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- UTLCRGFJFSZWAW-OLHMAJIHSA-N Asp-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O UTLCRGFJFSZWAW-OLHMAJIHSA-N 0.000 description 1
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 1
- XAPPCWUWHNWCPQ-PBCZWWQYSA-N Asp-Thr-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XAPPCWUWHNWCPQ-PBCZWWQYSA-N 0.000 description 1
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- SQIARYGNVQWOSB-BZSNNMDCSA-N Asp-Tyr-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQIARYGNVQWOSB-BZSNNMDCSA-N 0.000 description 1
- BPAUXFVCSYQDQX-JRQIVUDYSA-N Asp-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(=O)O)N)O BPAUXFVCSYQDQX-JRQIVUDYSA-N 0.000 description 1
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- 241000223651 Aureobasidium Species 0.000 description 1
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 1
- 241000604933 Bdellovibrio Species 0.000 description 1
- 241000186146 Brevibacterium Species 0.000 description 1
- 241001508395 Burkholderia sp. Species 0.000 description 1
- 102100027098 CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase 1 Human genes 0.000 description 1
- 101100465853 Caenorhabditis elegans psf-2 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 101100297352 Campylobacter jejuni subsp. jejuni serotype O:2 (strain ATCC 700819 / NCTC 11168) pglC gene Proteins 0.000 description 1
- 101100463764 Campylobacter jejuni subsp. jejuni serotype O:2 (strain ATCC 700819 / NCTC 11168) pglD gene Proteins 0.000 description 1
- 101100463767 Campylobacter jejuni subsp. jejuni serotype O:2 (strain ATCC 700819 / NCTC 11168) pglH gene Proteins 0.000 description 1
- 101100463769 Campylobacter jejuni subsp. jejuni serotype O:2 (strain ATCC 700819 / NCTC 11168) pglJ gene Proteins 0.000 description 1
- 101100463770 Campylobacter jejuni subsp. jejuni serotype O:2 (strain ATCC 700819 / NCTC 11168) pglK gene Proteins 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 102100040428 Chitobiosyldiphosphodolichol beta-mannosyltransferase Human genes 0.000 description 1
- 241000123346 Chrysosporium Species 0.000 description 1
- 241000588923 Citrobacter Species 0.000 description 1
- 241001478240 Coccus Species 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 241000699800 Cricetinae Species 0.000 description 1
- 241000192700 Cyanobacteria Species 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 1
- BDWIZLQVVWQMTB-XKBZYTNZSA-N Cys-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)O BDWIZLQVVWQMTB-XKBZYTNZSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- KKUVRYLJEXJSGX-MXAVVETBSA-N Cys-Ile-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CS)N KKUVRYLJEXJSGX-MXAVVETBSA-N 0.000 description 1
- PDRMRVHPAQKTLT-NAKRPEOUSA-N Cys-Ile-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O PDRMRVHPAQKTLT-NAKRPEOUSA-N 0.000 description 1
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 1
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- RJPKQCFHEPPTGL-ZLUOBGJFSA-N Cys-Ser-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RJPKQCFHEPPTGL-ZLUOBGJFSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- AEMOLEFTQBMNLQ-AQKNRBDQSA-N D-glucopyranuronic acid Chemical compound OC1O[C@H](C(O)=O)[C@@H](O)[C@H](O)[C@H]1O AEMOLEFTQBMNLQ-AQKNRBDQSA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102100035784 Decorin Human genes 0.000 description 1
- 108090000738 Decorin Proteins 0.000 description 1
- 241000605716 Desulfovibrio Species 0.000 description 1
- 241000605762 Desulfovibrio vulgaris Species 0.000 description 1
- 241000605809 Desulfuromonas Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 241000588697 Enterobacter cloacae Species 0.000 description 1
- 241000305071 Enterobacterales Species 0.000 description 1
- 241000190844 Erythrobacter Species 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 241000186394 Eubacterium Species 0.000 description 1
- 241000589565 Flavobacterium Species 0.000 description 1
- 241000589564 Flavobacterium sp. Species 0.000 description 1
- PNNNRSAQSRJVSB-SLPGGIOYSA-N Fucose Natural products C[C@H](O)[C@@H](O)[C@H](O)[C@H](O)C=O PNNNRSAQSRJVSB-SLPGGIOYSA-N 0.000 description 1
- 108010019236 Fucosyltransferases Proteins 0.000 description 1
- 102000006471 Fucosyltransferases Human genes 0.000 description 1
- 108010072062 GEKG peptide Proteins 0.000 description 1
- 108060003306 Galactosyltransferase Proteins 0.000 description 1
- 102000030902 Galactosyltransferase Human genes 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- AAOBFSKXAVIORT-GUBZILKMSA-N Gln-Asn-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O AAOBFSKXAVIORT-GUBZILKMSA-N 0.000 description 1
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- NPTGGVQJYRSMCM-GLLZPBPUSA-N Gln-Gln-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPTGGVQJYRSMCM-GLLZPBPUSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 1
- DQLVHRFFBQOWFL-JYJNAYRXSA-N Gln-Lys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)O DQLVHRFFBQOWFL-JYJNAYRXSA-N 0.000 description 1
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 1
- DOMHVQBSRJNNKD-ZPFDUUQYSA-N Gln-Met-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DOMHVQBSRJNNKD-ZPFDUUQYSA-N 0.000 description 1
- DFRYZTUPVZNRLG-KKUMJFAQSA-N Gln-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DFRYZTUPVZNRLG-KKUMJFAQSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- PIUPHASDUFSHTF-CIUDSAMLSA-N Gln-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O PIUPHASDUFSHTF-CIUDSAMLSA-N 0.000 description 1
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 1
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- WTJIWXMJESRHMM-XDTLVQLUSA-N Gln-Tyr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O WTJIWXMJESRHMM-XDTLVQLUSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- AKDOUBMVLRCHBD-SIUGBPQLSA-N Gln-Tyr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AKDOUBMVLRCHBD-SIUGBPQLSA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- YYOBUPFZLKQUAX-FXQIFTODSA-N Glu-Asn-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YYOBUPFZLKQUAX-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 1
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- OBIHEDRRSMRKLU-ACZMJKKPSA-N Glu-Cys-Asp Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OBIHEDRRSMRKLU-ACZMJKKPSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- NJPQBTJSYCKCNS-HVTMNAMFSA-N Glu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N NJPQBTJSYCKCNS-HVTMNAMFSA-N 0.000 description 1
- ZPASCJBSSCRWMC-GVXVVHGQSA-N Glu-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N ZPASCJBSSCRWMC-GVXVVHGQSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 1
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 1
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 1
- YHOJJFFTSMWVGR-HJGDQZAQSA-N Glu-Met-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YHOJJFFTSMWVGR-HJGDQZAQSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- MXJYXYDREQWUMS-XKBZYTNZSA-N Glu-Thr-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O MXJYXYDREQWUMS-XKBZYTNZSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- LLEUXCDZPQOJMY-AAEUAGOBSA-N Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 LLEUXCDZPQOJMY-AAEUAGOBSA-N 0.000 description 1
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 1
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- NTHIHAUEXVTXQG-KKUMJFAQSA-N Glu-Tyr-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O NTHIHAUEXVTXQG-KKUMJFAQSA-N 0.000 description 1
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- QEJKKJNDDDPSMU-KKUMJFAQSA-N Glu-Tyr-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(O)=O QEJKKJNDDDPSMU-KKUMJFAQSA-N 0.000 description 1
- LSYFGBRDBIQYAQ-FHWLQOOXSA-N Glu-Tyr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LSYFGBRDBIQYAQ-FHWLQOOXSA-N 0.000 description 1
- HBMRTXJZQDVRFT-DZKIICNBSA-N Glu-Tyr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O HBMRTXJZQDVRFT-DZKIICNBSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- FVGOGEGGQLNZGH-DZKIICNBSA-N Glu-Val-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FVGOGEGGQLNZGH-DZKIICNBSA-N 0.000 description 1
- 108010055629 Glucosyltransferases Proteins 0.000 description 1
- 102000000340 Glucosyltransferases Human genes 0.000 description 1
- 108010092364 Glucuronosyltransferase Proteins 0.000 description 1
- 102000016354 Glucuronosyltransferase Human genes 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- ZKLYPEGLWFVRGF-IUCAKERBSA-N Gly-His-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZKLYPEGLWFVRGF-IUCAKERBSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- UYPPAMNTTMJHJW-KCTSRDHCSA-N Gly-Ile-Trp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UYPPAMNTTMJHJW-KCTSRDHCSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 1
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 1
- PFMUCCYYAAFKTH-YFKPBYRVSA-N Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CN PFMUCCYYAAFKTH-YFKPBYRVSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- OMOZPGCHVWOXHN-BQBZGAKWSA-N Gly-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)CN OMOZPGCHVWOXHN-BQBZGAKWSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- YLEIWGJJBFBFHC-KBPBESRZSA-N Gly-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 YLEIWGJJBFBFHC-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 1
- YABRDIBSPZONIY-BQBZGAKWSA-N Gly-Ser-Met Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O YABRDIBSPZONIY-BQBZGAKWSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- AJHCSUXXECOXOY-NSHDSACASA-N Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-NSHDSACASA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- DNAZKGFYFRGZIH-QWRGUYRKSA-N Gly-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 DNAZKGFYFRGZIH-QWRGUYRKSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- 102000005744 Glycoside Hydrolases Human genes 0.000 description 1
- 108010031186 Glycoside Hydrolases Proteins 0.000 description 1
- 239000007995 HEPES buffer Substances 0.000 description 1
- 241000606841 Haemophilus sp. Species 0.000 description 1
- 241000589989 Helicobacter Species 0.000 description 1
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- JFFAPRNXXLRINI-NHCYSSNCSA-N His-Asp-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JFFAPRNXXLRINI-NHCYSSNCSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- SWSVTNGMKBDTBM-DCAQKATOSA-N His-Gln-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SWSVTNGMKBDTBM-DCAQKATOSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 1
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 1
- FIMNVXRZGUAGBI-AVGNSLFASA-N His-Glu-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FIMNVXRZGUAGBI-AVGNSLFASA-N 0.000 description 1
- WEIYKCOEVBUJQC-JYJNAYRXSA-N His-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WEIYKCOEVBUJQC-JYJNAYRXSA-N 0.000 description 1
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- IDXZDKMBEXLFMB-HGNGGELXSA-N His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 IDXZDKMBEXLFMB-HGNGGELXSA-N 0.000 description 1
- VJJSDSNFXCWCEJ-DJFWLOJKSA-N His-Ile-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O VJJSDSNFXCWCEJ-DJFWLOJKSA-N 0.000 description 1
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 1
- WTJBVCUCLWFGAH-JUKXBJQTSA-N His-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N WTJBVCUCLWFGAH-JUKXBJQTSA-N 0.000 description 1
- MJUUWJJEUOBDGW-IHRRRGAJSA-N His-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 MJUUWJJEUOBDGW-IHRRRGAJSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- FJCGVRRVBKYYOU-DCAQKATOSA-N His-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N FJCGVRRVBKYYOU-DCAQKATOSA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- ZHMZWSFQRUGLEC-JYJNAYRXSA-N His-Tyr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZHMZWSFQRUGLEC-JYJNAYRXSA-N 0.000 description 1
- GBMSSORHVHAYLU-QTKMDUPCSA-N His-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N)O GBMSSORHVHAYLU-QTKMDUPCSA-N 0.000 description 1
- LCWXJXMHJVIJFK-UHFFFAOYSA-N Hydroxylysine Natural products NCC(O)CC(N)CC(O)=O LCWXJXMHJVIJFK-UHFFFAOYSA-N 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 1
- RWIKBYVJQAJYDP-BJDJZHNGSA-N Ile-Ala-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RWIKBYVJQAJYDP-BJDJZHNGSA-N 0.000 description 1
- WZPIKDWQVRTATP-SYWGBEHUSA-N Ile-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 WZPIKDWQVRTATP-SYWGBEHUSA-N 0.000 description 1
- ATXGFMOBVKSOMK-PEDHHIEDSA-N Ile-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N ATXGFMOBVKSOMK-PEDHHIEDSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- FJWYJQRCVNGEAQ-ZPFDUUQYSA-N Ile-Asn-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N FJWYJQRCVNGEAQ-ZPFDUUQYSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- JXMSHKFPDIUYGS-SIUGBPQLSA-N Ile-Glu-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N JXMSHKFPDIUYGS-SIUGBPQLSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- KOPIAUWNLKKELG-SIGLWIIPSA-N Ile-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N KOPIAUWNLKKELG-SIGLWIIPSA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- CSQNHSGHAPRGPQ-YTFOTSKYSA-N Ile-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)O)N CSQNHSGHAPRGPQ-YTFOTSKYSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- UWLHDGMRWXHFFY-HPCHECBXSA-N Ile-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1CCC[C@@H]1C(=O)O)N UWLHDGMRWXHFFY-HPCHECBXSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- NZGTYCMLUGYMCV-XUXIUFHCSA-N Ile-Lys-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N NZGTYCMLUGYMCV-XUXIUFHCSA-N 0.000 description 1
- RMNMUUCYTMLWNA-ZPFDUUQYSA-N Ile-Lys-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RMNMUUCYTMLWNA-ZPFDUUQYSA-N 0.000 description 1
- PNTWNAXGBOZMBO-MNXVOIDGSA-N Ile-Lys-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PNTWNAXGBOZMBO-MNXVOIDGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- MSASLZGZQAXVFP-PEDHHIEDSA-N Ile-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N MSASLZGZQAXVFP-PEDHHIEDSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- UYNXBNHVWFNVIN-HJWJTTGWSA-N Ile-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=CC=C1 UYNXBNHVWFNVIN-HJWJTTGWSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- KLJKJVXDHVUMMZ-KKPKCPPISA-N Ile-Phe-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N KLJKJVXDHVUMMZ-KKPKCPPISA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- CIJLNXXMDUOFPH-HJWJTTGWSA-N Ile-Pro-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CIJLNXXMDUOFPH-HJWJTTGWSA-N 0.000 description 1
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- TWVKGYNQQAUNRN-ACZMJKKPSA-N Ile-Ser Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O TWVKGYNQQAUNRN-ACZMJKKPSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- SAEWJTCJQVZQNZ-IUKAMOBKSA-N Ile-Thr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SAEWJTCJQVZQNZ-IUKAMOBKSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 1
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- JTBFQNHKNRZJDS-SYWGBEHUSA-N Ile-Trp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](C)C(=O)O)N JTBFQNHKNRZJDS-SYWGBEHUSA-N 0.000 description 1
- WJBOZUVRPOIQNN-KJYZGMDISA-N Ile-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)C1=CN=CN1 WJBOZUVRPOIQNN-KJYZGMDISA-N 0.000 description 1
- DTPGSUQHUMELQB-GVARAGBVSA-N Ile-Tyr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 DTPGSUQHUMELQB-GVARAGBVSA-N 0.000 description 1
- HQLSBZFLOUHQJK-STECZYCISA-N Ile-Tyr-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HQLSBZFLOUHQJK-STECZYCISA-N 0.000 description 1
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- HODVZHLJUUWPKY-STECZYCISA-N Ile-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)CC)CC1=CC=C(O)C=C1 HODVZHLJUUWPKY-STECZYCISA-N 0.000 description 1
- NSPNUMNLZNOPAQ-SJWGOKEGSA-N Ile-Tyr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N NSPNUMNLZNOPAQ-SJWGOKEGSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 1
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- ZSESFIFAYQEKRD-CYDGBPFRSA-N Ile-Val-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N ZSESFIFAYQEKRD-CYDGBPFRSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 108060003951 Immunoglobulin Proteins 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 102000006496 Immunoglobulin Heavy Chains Human genes 0.000 description 1
- 108010019476 Immunoglobulin Heavy Chains Proteins 0.000 description 1
- 241001454354 Kingella Species 0.000 description 1
- 241000588748 Klebsiella Species 0.000 description 1
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000272168 Laridae Species 0.000 description 1
- 241000589248 Legionella Species 0.000 description 1
- 208000007764 Legionnaires' Disease Diseases 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- MLTRLIITQPXHBJ-BQBZGAKWSA-N Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O MLTRLIITQPXHBJ-BQBZGAKWSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 1
- USTCFDAQCLDPBD-XIRDDKMYSA-N Leu-Asn-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N USTCFDAQCLDPBD-XIRDDKMYSA-N 0.000 description 1
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 1
- NHHKSOGJYNQENP-SRVKXCTJSA-N Leu-Cys-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N NHHKSOGJYNQENP-SRVKXCTJSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 1
- POMXSEDNUXYPGK-IHRRRGAJSA-N Leu-Met-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N POMXSEDNUXYPGK-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 1
- GSSMYQHXZNERFX-WDSOQIARSA-N Leu-Met-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N GSSMYQHXZNERFX-WDSOQIARSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 1
- LRKCBIUDWAXNEG-CSMHCCOUSA-N Leu-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRKCBIUDWAXNEG-CSMHCCOUSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- WBRJVRXEGQIDRK-XIRDDKMYSA-N Leu-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 WBRJVRXEGQIDRK-XIRDDKMYSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- YIRIDPUGZKHMHT-ACRUOGEOSA-N Leu-Tyr-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YIRIDPUGZKHMHT-ACRUOGEOSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- MPOHDJKRBLVGCT-CIUDSAMLSA-N Lys-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N MPOHDJKRBLVGCT-CIUDSAMLSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- BTSXLXFPMZXVPR-DLOVCJGASA-N Lys-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BTSXLXFPMZXVPR-DLOVCJGASA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- CKSXSQUVEYCDIW-AVGNSLFASA-N Lys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N CKSXSQUVEYCDIW-AVGNSLFASA-N 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 1
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- KZOHPCYVORJBLG-AVGNSLFASA-N Lys-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N KZOHPCYVORJBLG-AVGNSLFASA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 1
- DTUZCYRNEJDKSR-NHCYSSNCSA-N Lys-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN DTUZCYRNEJDKSR-NHCYSSNCSA-N 0.000 description 1
- FGMHXLULNHTPID-KKUMJFAQSA-N Lys-His-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CN=CN1 FGMHXLULNHTPID-KKUMJFAQSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- YWJQHDDBFAXNIR-MXAVVETBSA-N Lys-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N YWJQHDDBFAXNIR-MXAVVETBSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- JQSIGLHQNSZZRL-KKUMJFAQSA-N Lys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N JQSIGLHQNSZZRL-KKUMJFAQSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 1
- BEGQVWUZFXLNHZ-IHPCNDPISA-N Lys-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 BEGQVWUZFXLNHZ-IHPCNDPISA-N 0.000 description 1
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- KXYLFJIQDIMURW-IHPCNDPISA-N Lys-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCCN)=CNC2=C1 KXYLFJIQDIMURW-IHPCNDPISA-N 0.000 description 1
- ZVZRQKJOQQAFCF-ULQDDVLXSA-N Lys-Tyr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZVZRQKJOQQAFCF-ULQDDVLXSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- OHXUUQDOBQKSNB-AVGNSLFASA-N Lys-Val-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OHXUUQDOBQKSNB-AVGNSLFASA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- 108010087568 Mannosyltransferases Proteins 0.000 description 1
- 102000006722 Mannosyltransferases Human genes 0.000 description 1
- 241000816479 Melittis Species 0.000 description 1
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 1
- LMKSBGIUPVRHEH-FXQIFTODSA-N Met-Ala-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(N)=O LMKSBGIUPVRHEH-FXQIFTODSA-N 0.000 description 1
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 1
- PWPBGAJJYJJVPI-PJODQICGSA-N Met-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 PWPBGAJJYJJVPI-PJODQICGSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- CWFYZYQMUDWGTI-GUBZILKMSA-N Met-Arg-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O CWFYZYQMUDWGTI-GUBZILKMSA-N 0.000 description 1
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 1
- NKDSBBBPGIVWEI-RCWTZXSCSA-N Met-Arg-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NKDSBBBPGIVWEI-RCWTZXSCSA-N 0.000 description 1
- DRXODWRPPUFIAY-DCAQKATOSA-N Met-Asn-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN DRXODWRPPUFIAY-DCAQKATOSA-N 0.000 description 1
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 description 1
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- UZWMJZSOXGOVIN-LURJTMIESA-N Met-Gly-Gly Chemical compound CSCC[C@H](N)C(=O)NCC(=O)NCC(O)=O UZWMJZSOXGOVIN-LURJTMIESA-N 0.000 description 1
- MVBZBRKNZVJEKK-DTWKUNHWSA-N Met-Gly-Pro Chemical compound CSCC[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N MVBZBRKNZVJEKK-DTWKUNHWSA-N 0.000 description 1
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- BEZJTLKUMFMITF-AVGNSLFASA-N Met-Lys-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCNC(N)=N BEZJTLKUMFMITF-AVGNSLFASA-N 0.000 description 1
- JCMMNFZUKMMECJ-DCAQKATOSA-N Met-Lys-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O JCMMNFZUKMMECJ-DCAQKATOSA-N 0.000 description 1
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- FMYLZGQFKPHXHI-GUBZILKMSA-N Met-Met-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O FMYLZGQFKPHXHI-GUBZILKMSA-N 0.000 description 1
- WUYLWZRHRLLEGB-AVGNSLFASA-N Met-Met-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O WUYLWZRHRLLEGB-AVGNSLFASA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- CNAGWYQWQDMUGC-IHRRRGAJSA-N Met-Phe-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CNAGWYQWQDMUGC-IHRRRGAJSA-N 0.000 description 1
- SJLPOVNXMJFKHJ-ULQDDVLXSA-N Met-Phe-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N SJLPOVNXMJFKHJ-ULQDDVLXSA-N 0.000 description 1
- RSOMVHWMIAZNLE-HJWJTTGWSA-N Met-Phe-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSOMVHWMIAZNLE-HJWJTTGWSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- ZWBCVBHKXHPCEI-BVSLBCMMSA-N Met-Phe-Trp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N ZWBCVBHKXHPCEI-BVSLBCMMSA-N 0.000 description 1
- VQILILSLEFDECU-GUBZILKMSA-N Met-Pro-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O VQILILSLEFDECU-GUBZILKMSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- LHXFNWBNRBWMNV-DCAQKATOSA-N Met-Ser-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LHXFNWBNRBWMNV-DCAQKATOSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- QZUCCDSNETVAIS-RYQLBKOJSA-N Met-Trp-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N3CCC[C@@H]3C(=O)O)N QZUCCDSNETVAIS-RYQLBKOJSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- CKAVKDJBSNTJDB-SRVKXCTJSA-N Met-Val-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCSC CKAVKDJBSNTJDB-SRVKXCTJSA-N 0.000 description 1
- VYDLZDRMOFYOGV-TUAOUCFPSA-N Met-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N VYDLZDRMOFYOGV-TUAOUCFPSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- PVSPJQWHEIQTEH-JYJNAYRXSA-N Met-Val-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PVSPJQWHEIQTEH-JYJNAYRXSA-N 0.000 description 1
- 101710094503 Metallothionein-1 Proteins 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 241000863420 Myxococcus Species 0.000 description 1
- 102000007524 N-Acetylgalactosaminyltransferases Human genes 0.000 description 1
- 108010046220 N-Acetylgalactosaminyltransferases Proteins 0.000 description 1
- 108010093077 N-Acetylglucosaminyltransferases Proteins 0.000 description 1
- 102000002493 N-Acetylglucosaminyltransferases Human genes 0.000 description 1
- SQVRNKJHWKZAKO-PFQGKNLYSA-N N-acetyl-beta-neuraminic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-PFQGKNLYSA-N 0.000 description 1
- SQVRNKJHWKZAKO-LUWBGTNYSA-N N-acetylneuraminic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)CC(O)(C(O)=O)O[C@H]1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-LUWBGTNYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 230000004989 O-glycosylation Effects 0.000 description 1
- 108020002230 Pancreatic Ribonuclease Proteins 0.000 description 1
- 102000005891 Pancreatic ribonuclease Human genes 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 241000606580 Pasteurella sp. Species 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- KIAWKQJTSGRCSA-AVGNSLFASA-N Phe-Asn-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KIAWKQJTSGRCSA-AVGNSLFASA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- WMGVYPPIMZPWPN-SRVKXCTJSA-N Phe-Asp-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N WMGVYPPIMZPWPN-SRVKXCTJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- RJYBHZVWJPUSLB-QEWYBTABSA-N Phe-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N RJYBHZVWJPUSLB-QEWYBTABSA-N 0.000 description 1
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- QPVFUAUFEBPIPT-CDMKHQONSA-N Phe-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QPVFUAUFEBPIPT-CDMKHQONSA-N 0.000 description 1
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 1
- JWBLQDDHSDGEGR-DRZSPHRISA-N Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWBLQDDHSDGEGR-DRZSPHRISA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- HQPWNHXERZCIHP-PMVMPFDFSA-N Phe-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 HQPWNHXERZCIHP-PMVMPFDFSA-N 0.000 description 1
- KNYPNEYICHHLQL-ACRUOGEOSA-N Phe-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 KNYPNEYICHHLQL-ACRUOGEOSA-N 0.000 description 1
- DMEYUTSDVRCWRS-ULQDDVLXSA-N Phe-Lys-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DMEYUTSDVRCWRS-ULQDDVLXSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- KLXQWABNAWDRAY-ACRUOGEOSA-N Phe-Lys-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 KLXQWABNAWDRAY-ACRUOGEOSA-N 0.000 description 1
- GPSMLZQVIIYLDK-ULQDDVLXSA-N Phe-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O GPSMLZQVIIYLDK-ULQDDVLXSA-N 0.000 description 1
- PYOHODCEOHCZBM-RYUDHWBXSA-N Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 PYOHODCEOHCZBM-RYUDHWBXSA-N 0.000 description 1
- PTLMYJOMJLTMCB-KKUMJFAQSA-N Phe-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N PTLMYJOMJLTMCB-KKUMJFAQSA-N 0.000 description 1
- QRUOLOPKCOEZKU-HJWJTTGWSA-N Phe-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N QRUOLOPKCOEZKU-HJWJTTGWSA-N 0.000 description 1
- LYCOGHUNJCETDK-JYJNAYRXSA-N Phe-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N LYCOGHUNJCETDK-JYJNAYRXSA-N 0.000 description 1
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 1
- YVIVIQWMNCWUFS-UFYCRDLUSA-N Phe-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N YVIVIQWMNCWUFS-UFYCRDLUSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- DEZCWWXTRAKZKJ-UFYCRDLUSA-N Phe-Phe-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DEZCWWXTRAKZKJ-UFYCRDLUSA-N 0.000 description 1
- WKLMCMXFMQEKCX-SLFFLAALSA-N Phe-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O WKLMCMXFMQEKCX-SLFFLAALSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 1
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 1
- YMIZSYUAZJSOFL-SRVKXCTJSA-N Phe-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O YMIZSYUAZJSOFL-SRVKXCTJSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 1
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- XNQMZHLAYFWSGJ-HTUGSXCWSA-N Phe-Thr-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XNQMZHLAYFWSGJ-HTUGSXCWSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 1
- GTMSCDVFQLNEOY-BZSNNMDCSA-N Phe-Tyr-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N GTMSCDVFQLNEOY-BZSNNMDCSA-N 0.000 description 1
- QUUCAHIYARMNBL-FHWLQOOXSA-N Phe-Tyr-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N QUUCAHIYARMNBL-FHWLQOOXSA-N 0.000 description 1
- DBNGDEAQXGFGRA-ACRUOGEOSA-N Phe-Tyr-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DBNGDEAQXGFGRA-ACRUOGEOSA-N 0.000 description 1
- IPVPGAADZXRZSH-RNXOBYDBSA-N Phe-Tyr-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O IPVPGAADZXRZSH-RNXOBYDBSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241000607000 Plesiomonas Species 0.000 description 1
- 102100020947 Polypeptide N-acetylgalactosaminyltransferase 1 Human genes 0.000 description 1
- 102100020950 Polypeptide N-acetylgalactosaminyltransferase 2 Human genes 0.000 description 1
- 102100039685 Polypeptide N-acetylgalactosaminyltransferase 3 Human genes 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- ONPFOYPPPOHMNH-UVBJJODRSA-N Pro-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@@H]3CCCN3 ONPFOYPPPOHMNH-UVBJJODRSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- UUHXBJHVTVGSKM-BQBZGAKWSA-N Pro-Gly-Asn Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UUHXBJHVTVGSKM-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- OCYROESYHWUPBP-CIUDSAMLSA-N Pro-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 OCYROESYHWUPBP-CIUDSAMLSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- XYHMFGGWNOFUOU-QXEWZRGKSA-N Pro-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 XYHMFGGWNOFUOU-QXEWZRGKSA-N 0.000 description 1
- LXLFEIHKWGHJJB-XUXIUFHCSA-N Pro-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 LXLFEIHKWGHJJB-XUXIUFHCSA-N 0.000 description 1
- CFVRJNZJQHDQPP-CYDGBPFRSA-N Pro-Ile-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 CFVRJNZJQHDQPP-CYDGBPFRSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- DRKAXLDECUGLFE-ULQDDVLXSA-N Pro-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O DRKAXLDECUGLFE-ULQDDVLXSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- SXMSEHDMNIUTSP-DCAQKATOSA-N Pro-Lys-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SXMSEHDMNIUTSP-DCAQKATOSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- HBBBLSVBQGZKOZ-GUBZILKMSA-N Pro-Met-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O HBBBLSVBQGZKOZ-GUBZILKMSA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 1
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 1
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- JDJMFMVVJHLWDP-UNQGMJICSA-N Pro-Thr-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JDJMFMVVJHLWDP-UNQGMJICSA-N 0.000 description 1
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- DLZBBDSPTJBOOD-BPNCWPANSA-N Pro-Tyr-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O DLZBBDSPTJBOOD-BPNCWPANSA-N 0.000 description 1
- DYJTXTCEXMCPBF-UFYCRDLUSA-N Pro-Tyr-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CC=CC=C3)C(=O)O DYJTXTCEXMCPBF-UFYCRDLUSA-N 0.000 description 1
- FIDNSJUXESUDOV-JYJNAYRXSA-N Pro-Tyr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O FIDNSJUXESUDOV-JYJNAYRXSA-N 0.000 description 1
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 1
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- 241000186429 Propionibacterium Species 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 241000588770 Proteus mirabilis Species 0.000 description 1
- 241000334216 Proteus sp. Species 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589774 Pseudomonas sp. Species 0.000 description 1
- 102000013009 Pyruvate Kinase Human genes 0.000 description 1
- 108020005115 Pyruvate Kinase Proteins 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 241000607142 Salmonella Species 0.000 description 1
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 1
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 1
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 1
- BRGQQXQKPUCUJQ-KBIXCLLPSA-N Ser-Glu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRGQQXQKPUCUJQ-KBIXCLLPSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- HMRAQFJFTOLDKW-GUBZILKMSA-N Ser-His-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O HMRAQFJFTOLDKW-GUBZILKMSA-N 0.000 description 1
- BXLYSRPHVMCOPS-ACZMJKKPSA-N Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO BXLYSRPHVMCOPS-ACZMJKKPSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 1
- NFDYGNFETJVMSE-BQBZGAKWSA-N Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CO NFDYGNFETJVMSE-BQBZGAKWSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- HDBOEVPDIDDEPC-CIUDSAMLSA-N Ser-Lys-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O HDBOEVPDIDDEPC-CIUDSAMLSA-N 0.000 description 1
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 1
- VXYQOFXBIXKPCX-BQBZGAKWSA-N Ser-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N VXYQOFXBIXKPCX-BQBZGAKWSA-N 0.000 description 1
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 1
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 1
- NQZFFLBPNDLTPO-DLOVCJGASA-N Ser-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CO)N NQZFFLBPNDLTPO-DLOVCJGASA-N 0.000 description 1
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 1
- PPCZVWHJWJFTFN-ZLUOBGJFSA-N Ser-Ser-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPCZVWHJWJFTFN-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- ZVBCMFDJIMUELU-BZSNNMDCSA-N Ser-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N ZVBCMFDJIMUELU-BZSNNMDCSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- 101001010097 Shigella phage SfV Bactoprenol-linked glucose translocase Proteins 0.000 description 1
- 241000589970 Spirochaetales Species 0.000 description 1
- 241000122971 Stenotrophomonas Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- QAOWNCQODCNURD-UHFFFAOYSA-L Sulfate Chemical compound [O-]S([O-])(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-L 0.000 description 1
- 239000012505 Superdex™ Substances 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- 241000605118 Thiobacillus Species 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- BWUHENPAEMNGQJ-ZDLURKLDSA-N Thr-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O BWUHENPAEMNGQJ-ZDLURKLDSA-N 0.000 description 1
- GCXFWAZRHBRYEM-NUMRIWBASA-N Thr-Gln-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O GCXFWAZRHBRYEM-NUMRIWBASA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- BNGDYRRHRGOPHX-IFFSRLJSSA-N Thr-Glu-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)C(O)=O BNGDYRRHRGOPHX-IFFSRLJSSA-N 0.000 description 1
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- LUMXICQAOKVQOB-YWIQKCBGSA-N Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O LUMXICQAOKVQOB-YWIQKCBGSA-N 0.000 description 1
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 1
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 1
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 1
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 1
- OHDXOXIZXSFCDN-RCWTZXSCSA-N Thr-Met-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OHDXOXIZXSFCDN-RCWTZXSCSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 1
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 1
- OLFOOYQTTQSSRK-UNQGMJICSA-N Thr-Pro-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLFOOYQTTQSSRK-UNQGMJICSA-N 0.000 description 1
- YGZWVPBHYABGLT-KJEVXHAQSA-N Thr-Pro-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YGZWVPBHYABGLT-KJEVXHAQSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- GQPQJNMVELPZNQ-GBALPHGKSA-N Thr-Ser-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O GQPQJNMVELPZNQ-GBALPHGKSA-N 0.000 description 1
- UQCNIMDPYICBTR-KYNKHSRBSA-N Thr-Thr-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UQCNIMDPYICBTR-KYNKHSRBSA-N 0.000 description 1
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 1
- BJJRNAVDQGREGC-HOUAVDHOSA-N Thr-Trp-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O BJJRNAVDQGREGC-HOUAVDHOSA-N 0.000 description 1
- MYNYCUXMIIWUNW-IEGACIPQSA-N Thr-Trp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MYNYCUXMIIWUNW-IEGACIPQSA-N 0.000 description 1
- IJKNKFJZOJCKRR-GBALPHGKSA-N Thr-Trp-Ser Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 IJKNKFJZOJCKRR-GBALPHGKSA-N 0.000 description 1
- BGHVVGPELPHRCI-HZTRNQAASA-N Thr-Trp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N)O BGHVVGPELPHRCI-HZTRNQAASA-N 0.000 description 1
- WCRFXRIWBFRZBR-GGVZMXCHSA-N Thr-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WCRFXRIWBFRZBR-GGVZMXCHSA-N 0.000 description 1
- LXXCHJKHJYRMIY-FQPOAREZSA-N Thr-Tyr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O LXXCHJKHJYRMIY-FQPOAREZSA-N 0.000 description 1
- VMSSYINFMOFLJM-KJEVXHAQSA-N Thr-Tyr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCSC)C(=O)O)N)O VMSSYINFMOFLJM-KJEVXHAQSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- SBYQHZCMVSPQCS-RCWTZXSCSA-N Thr-Val-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O SBYQHZCMVSPQCS-RCWTZXSCSA-N 0.000 description 1
- 208000007536 Thrombosis Diseases 0.000 description 1
- 241001664469 Tibicina haematodes Species 0.000 description 1
- 102000004338 Transferrin Human genes 0.000 description 1
- 108090000901 Transferrin Proteins 0.000 description 1
- 102220503491 Transmembrane protease serine 9_S30T_mutation Human genes 0.000 description 1
- 101000980463 Treponema pallidum (strain Nichols) Chaperonin GroEL Proteins 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- NMCBVGFGWSIGSB-NUTKFTJISA-N Trp-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NMCBVGFGWSIGSB-NUTKFTJISA-N 0.000 description 1
- HYNAKPYFEYJMAS-XIRDDKMYSA-N Trp-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HYNAKPYFEYJMAS-XIRDDKMYSA-N 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- HQVKQINPFOCIIV-BVSLBCMMSA-N Trp-Arg-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 HQVKQINPFOCIIV-BVSLBCMMSA-N 0.000 description 1
- YXONONCLMLHWJX-SZMVWBNQSA-N Trp-Glu-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 YXONONCLMLHWJX-SZMVWBNQSA-N 0.000 description 1
- JVTHMUDOKPQBOT-NSHDSACASA-N Trp-Gly-Gly Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O)=CNC2=C1 JVTHMUDOKPQBOT-NSHDSACASA-N 0.000 description 1
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 1
- ILDJYIDXESUBOE-HSCHXYMDSA-N Trp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ILDJYIDXESUBOE-HSCHXYMDSA-N 0.000 description 1
- LYMVXFSTACVOLP-ZFWWWQNUSA-N Trp-Leu Chemical compound C1=CC=C2C(C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 LYMVXFSTACVOLP-ZFWWWQNUSA-N 0.000 description 1
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 1
- UPNRACRNHISCAF-SZMVWBNQSA-N Trp-Lys-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UPNRACRNHISCAF-SZMVWBNQSA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- RIKLKPANMFNREP-FDARSICLSA-N Trp-Met-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)=CNC2=C1 RIKLKPANMFNREP-FDARSICLSA-N 0.000 description 1
- LFMLXCJYCFZBKE-IHPCNDPISA-N Trp-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N LFMLXCJYCFZBKE-IHPCNDPISA-N 0.000 description 1
- NECCMBOBBANRIT-RNXOBYDBSA-N Trp-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NECCMBOBBANRIT-RNXOBYDBSA-N 0.000 description 1
- XGFOXYJQBRTJPO-PJODQICGSA-N Trp-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XGFOXYJQBRTJPO-PJODQICGSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 1
- CUHBVKUVJIXRFK-DVXDUOKCSA-N Trp-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CUHBVKUVJIXRFK-DVXDUOKCSA-N 0.000 description 1
- ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
- MBLJBGZWLHTJBH-SZMVWBNQSA-N Trp-Val-Arg Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 MBLJBGZWLHTJBH-SZMVWBNQSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- 241000223104 Trypanosoma Species 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 1
- DXYWRYQRKPIGGU-BPNCWPANSA-N Tyr-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DXYWRYQRKPIGGU-BPNCWPANSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- MBFJIHUHHCJBSN-AVGNSLFASA-N Tyr-Asn-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MBFJIHUHHCJBSN-AVGNSLFASA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- GFHYISDTIWZUSU-QWRGUYRKSA-N Tyr-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GFHYISDTIWZUSU-QWRGUYRKSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- QHEGAOPHISYNDF-XDTLVQLUSA-N Tyr-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHEGAOPHISYNDF-XDTLVQLUSA-N 0.000 description 1
- IYHNBRUWVBIVJR-IHRRRGAJSA-N Tyr-Gln-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IYHNBRUWVBIVJR-IHRRRGAJSA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- KOVXHANYYYMBRF-IRIUXVKKSA-N Tyr-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KOVXHANYYYMBRF-IRIUXVKKSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- OLWFDNLLBWQWCP-STQMWFEESA-N Tyr-Gly-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OLWFDNLLBWQWCP-STQMWFEESA-N 0.000 description 1
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 1
- ZQOOYCZQENFIMC-STQMWFEESA-N Tyr-His Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=C(O)C=C1 ZQOOYCZQENFIMC-STQMWFEESA-N 0.000 description 1
- FIRUOPRJKCBLST-KKUMJFAQSA-N Tyr-His-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O FIRUOPRJKCBLST-KKUMJFAQSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- AXWBYOVVDRBOGU-SIUGBPQLSA-N Tyr-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AXWBYOVVDRBOGU-SIUGBPQLSA-N 0.000 description 1
- HFJJDMOFTCQGEI-STECZYCISA-N Tyr-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N HFJJDMOFTCQGEI-STECZYCISA-N 0.000 description 1
- YMUQBRQQCPQEQN-CXTHYWKRSA-N Tyr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YMUQBRQQCPQEQN-CXTHYWKRSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- WOAQYWUEUYMVGK-ULQDDVLXSA-N Tyr-Lys-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOAQYWUEUYMVGK-ULQDDVLXSA-N 0.000 description 1
- VTCKHZJKWQENKX-KBPBESRZSA-N Tyr-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O VTCKHZJKWQENKX-KBPBESRZSA-N 0.000 description 1
- GZOCMHSZGGJBCX-ULQDDVLXSA-N Tyr-Lys-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O GZOCMHSZGGJBCX-ULQDDVLXSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- SBLZVFCEOCWRLS-BPNCWPANSA-N Tyr-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SBLZVFCEOCWRLS-BPNCWPANSA-N 0.000 description 1
- QPBJXNYYQTUTDD-KKUMJFAQSA-N Tyr-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QPBJXNYYQTUTDD-KKUMJFAQSA-N 0.000 description 1
- UBKKNELWDCBNCF-STQMWFEESA-N Tyr-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UBKKNELWDCBNCF-STQMWFEESA-N 0.000 description 1
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 1
- CGWAPUBOXJWXMS-HOTGVXAUSA-N Tyr-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 CGWAPUBOXJWXMS-HOTGVXAUSA-N 0.000 description 1
- WTTRJMAZPDHPGS-KKXDTOCCSA-N Tyr-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(O)=O WTTRJMAZPDHPGS-KKXDTOCCSA-N 0.000 description 1
- BGFCXQXETBDEHP-BZSNNMDCSA-N Tyr-Phe-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O BGFCXQXETBDEHP-BZSNNMDCSA-N 0.000 description 1
- UPODKYBYUBTWSV-BZSNNMDCSA-N Tyr-Phe-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 UPODKYBYUBTWSV-BZSNNMDCSA-N 0.000 description 1
- SCZJKZLFSSPJDP-ACRUOGEOSA-N Tyr-Phe-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SCZJKZLFSSPJDP-ACRUOGEOSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 1
- QFXVAFIHVWXXBJ-AVGNSLFASA-N Tyr-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O QFXVAFIHVWXXBJ-AVGNSLFASA-N 0.000 description 1
- MFEVVAXTBZELLL-GGVZMXCHSA-N Tyr-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MFEVVAXTBZELLL-GGVZMXCHSA-N 0.000 description 1
- NZBSVMQZQMEUHI-WZLNRYEVSA-N Tyr-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NZBSVMQZQMEUHI-WZLNRYEVSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- LDKDSFQSEUOCOO-RPTUDFQQSA-N Tyr-Thr-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LDKDSFQSEUOCOO-RPTUDFQQSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- ZYVAAYAOTVJBSS-GMVOTWDCSA-N Tyr-Trp-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZYVAAYAOTVJBSS-GMVOTWDCSA-N 0.000 description 1
- ABZWHLRQBSBPTO-RNXOBYDBSA-N Tyr-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CC=C(C=C4)O)N ABZWHLRQBSBPTO-RNXOBYDBSA-N 0.000 description 1
- JRMCISZDVLOTLR-BVSLBCMMSA-N Tyr-Trp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CC=C(C=C3)O)N JRMCISZDVLOTLR-BVSLBCMMSA-N 0.000 description 1
- MWUYSCVVPVITMW-IGNZVWTISA-N Tyr-Tyr-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 MWUYSCVVPVITMW-IGNZVWTISA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- KHPLUFDSWGDRHD-SLFFLAALSA-N Tyr-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N)C(=O)O KHPLUFDSWGDRHD-SLFFLAALSA-N 0.000 description 1
- FZADUTOCSFDBRV-RNXOBYDBSA-N Tyr-Tyr-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=C(O)C=C1 FZADUTOCSFDBRV-RNXOBYDBSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 1
- VKYDVKAKGDNZED-STECZYCISA-N Tyr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N VKYDVKAKGDNZED-STECZYCISA-N 0.000 description 1
- HZWPGKAKGYJWCI-ULQDDVLXSA-N Tyr-Val-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O HZWPGKAKGYJWCI-ULQDDVLXSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- YKBUNNNRNZZUID-UFYCRDLUSA-N Tyr-Val-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YKBUNNNRNZZUID-UFYCRDLUSA-N 0.000 description 1
- 108010008393 UDP-N-acetylglucosamine N-acetyl-D-glucosaminyl-1-6-(N-acetylglucosaminyl-1-2)mannopyranosyl-1-R(N-acetylglucosamine to mannose)-1,4N-acetylglucosaminyltransferase VI Proteins 0.000 description 1
- 108010090473 UDP-N-acetylglucosamine-peptide beta-N-acetylglucosaminyltransferase Proteins 0.000 description 1
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- XKVXSCHXGJOQND-ZOBUZTSGSA-N Val-Asp-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XKVXSCHXGJOQND-ZOBUZTSGSA-N 0.000 description 1
- YLHLNFUXDBOAGX-DCAQKATOSA-N Val-Cys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YLHLNFUXDBOAGX-DCAQKATOSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 1
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 1
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- BMOFUVHDBROBSE-DCAQKATOSA-N Val-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N BMOFUVHDBROBSE-DCAQKATOSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- XPKCFQZDQGVJCX-RHYQMDGZSA-N Val-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N)O XPKCFQZDQGVJCX-RHYQMDGZSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 1
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- AIWLHFZYOUUJGB-UFYCRDLUSA-N Val-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 AIWLHFZYOUUJGB-UFYCRDLUSA-N 0.000 description 1
- MJOUSKQHAIARKI-JYJNAYRXSA-N Val-Phe-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 MJOUSKQHAIARKI-JYJNAYRXSA-N 0.000 description 1
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 1
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 1
- JPBGMZDTPVGGMQ-ULQDDVLXSA-N Val-Tyr-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N JPBGMZDTPVGGMQ-ULQDDVLXSA-N 0.000 description 1
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 1
- BGTDGENDNWGMDQ-KJEVXHAQSA-N Val-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N)O BGTDGENDNWGMDQ-KJEVXHAQSA-N 0.000 description 1
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- XNLUVJPMPAZHCY-JYJNAYRXSA-N Val-Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 XNLUVJPMPAZHCY-JYJNAYRXSA-N 0.000 description 1
- YKZVPMUGEJXEOR-JYJNAYRXSA-N Val-Val-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N YKZVPMUGEJXEOR-JYJNAYRXSA-N 0.000 description 1
- IOUPEELXVYPCPG-UHFFFAOYSA-N Valylglycine Chemical compound CC(C)C(N)C(=O)NCC(O)=O IOUPEELXVYPCPG-UHFFFAOYSA-N 0.000 description 1
- IXKSXJFAGXLQOQ-XISFHERQSA-N WHWLQLKPGQPMY Chemical compound C([C@@H](C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CNC=N1 IXKSXJFAGXLQOQ-XISFHERQSA-N 0.000 description 1
- 241000604961 Wolbachia Species 0.000 description 1
- 241000604955 Wolbachia sp. Species 0.000 description 1
- 241000589634 Xanthomonas Species 0.000 description 1
- 241001148118 Xanthomonas sp. Species 0.000 description 1
- 241000607734 Yersinia <bacteria> Species 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 1
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 1
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- TXKJNHBRVLCYFX-RDQGWRCRSA-N all-trans-undecaprenol Chemical compound CC(C)=CCC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CC\C(C)=C\CO TXKJNHBRVLCYFX-RDQGWRCRSA-N 0.000 description 1
- 108010042381 alpha 1,3-mannosyltransferase Proteins 0.000 description 1
- 108010039255 alpha 1,6-mannosyltransferase Proteins 0.000 description 1
- 108010021726 alpha-1,3-mannosylglycoprotein beta-1,4-N-acetylglucosaminyltransferase Proteins 0.000 description 1
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 1
- 230000004888 barrier function Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 108010057005 beta-galactoside alpha-2,3-sialyltransferase Proteins 0.000 description 1
- 230000001851 biosynthetic effect Effects 0.000 description 1
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 230000003197 catalytic effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 108010004777 chitobiosyldiphosphodolichol beta-mannosyltransferase Proteins 0.000 description 1
- 229960005091 chloramphenicol Drugs 0.000 description 1
- WIIZWVCIJKGZOK-RKDXNWHRSA-N chloramphenicol Chemical compound ClC(Cl)C(=O)N[C@H](CO)[C@H](O)C1=CC=C([N+]([O-])=O)C=C1 WIIZWVCIJKGZOK-RKDXNWHRSA-N 0.000 description 1
- 230000006690 co-activation Effects 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 238000011033 desalting Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 229920002672 di-trans,poly-cis-Undecaprenol Polymers 0.000 description 1
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013613 expression plasmid Substances 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 235000021550 forms of sugar Nutrition 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 101150002054 galE gene Proteins 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 101150002807 glcT gene Proteins 0.000 description 1
- 101150087955 glf gene Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- 239000000937 glycosyl acceptor Substances 0.000 description 1
- 230000001279 glycosylating effect Effects 0.000 description 1
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010008671 glycyl-tryptophyl-methionine Proteins 0.000 description 1
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 230000023597 hemostasis Effects 0.000 description 1
- 108010045383 histidyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 238000004191 hydrophobic interaction chromatography Methods 0.000 description 1
- QJHBJHUKURJDLG-UHFFFAOYSA-N hydroxy-L-lysine Natural products NCCCCC(NO)C(O)=O QJHBJHUKURJDLG-UHFFFAOYSA-N 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 102000018358 immunoglobulin Human genes 0.000 description 1
- 230000016784 immunoglobulin production Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 229940035901 lactobacillus sp Drugs 0.000 description 1
- 108010009932 leucyl-alanyl-glycyl-valine Proteins 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010000761 leucylarginine Proteins 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 230000001050 lubricating effect Effects 0.000 description 1
- 108010056787 lysyl-arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 229940060155 neuac Drugs 0.000 description 1
- CERZMXAJYMMUDR-UHFFFAOYSA-N neuraminic acid Natural products NC1C(O)CC(O)(C(O)=O)OC1C(O)C(O)CO CERZMXAJYMMUDR-UHFFFAOYSA-N 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 101150073640 ompF gene Proteins 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000001590 oxidative effect Effects 0.000 description 1
- 125000004430 oxygen atom Chemical group O* 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 101150012061 pglA gene Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 1
- 108010066642 phenylalanyl-valyl-valyl-tyrosine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 229930029653 phosphoenolpyruvate Natural products 0.000 description 1
- DTBNBXWJWCWCIK-UHFFFAOYSA-N phosphoenolpyruvic acid Chemical compound OC(=O)C(=C)OP(O)(O)=O DTBNBXWJWCWCIK-UHFFFAOYSA-N 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920001550 polyprenyl Polymers 0.000 description 1
- 125000001185 polyprenyl group Polymers 0.000 description 1
- 150000004804 polysaccharides Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 125000001500 prolyl group Chemical group [H]N1C([H])(C(=O)[*])C([H])([H])C([H])([H])C1([H])[H] 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 239000003223 protective agent Substances 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000005664 protein glycosylation in endoplasmic reticulum Effects 0.000 description 1
- 230000003161 proteinsynthetic effect Effects 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- 239000002213 purine nucleotide Substances 0.000 description 1
- 150000003212 purines Chemical class 0.000 description 1
- 239000002719 pyrimidine nucleotide Substances 0.000 description 1
- 150000003230 pyrimidines Chemical class 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000004366 reverse phase liquid chromatography Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- SQVRNKJHWKZAKO-OQPLDHBCSA-N sialic acid Chemical compound CC(=O)N[C@@H]1[C@@H](O)C[C@@](O)(C(O)=O)OC1[C@H](O)[C@H](O)CO SQVRNKJHWKZAKO-OQPLDHBCSA-N 0.000 description 1
- 239000000243 solution Substances 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 239000012536 storage buffer Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
- 230000009885 systemic effect Effects 0.000 description 1
- 238000004885 tandem mass spectrometry Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 1
- FGMPLJWBKKVCDB-UHFFFAOYSA-N trans-L-hydroxy-proline Natural products ON1CCCC1C(O)=O FGMPLJWBKKVCDB-UHFFFAOYSA-N 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 239000012581 transferrin Substances 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 108010084932 tryptophyl-proline Proteins 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 101150105580 wbbL gene Proteins 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P21/00—Preparation of peptides or proteins
- C12P21/005—Glycopeptides, glycoproteins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/1048—Glycosyltransferases (2.4)
- C12N9/1081—Glycosyltransferases (2.4) transferring other glycosyl groups (2.4.99)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y204/00—Glycosyltransferases (2.4)
- C12Y204/99—Glycosyltransferases (2.4) transferring other glycosyl groups (2.4.99)
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Biochemistry (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明涉及一种用于生产糖基化蛋白的无细胞系统。该系统包括能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶,一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接,以及包括一个或多个聚糖接受体氨基酸残基或编码所述糖蛋白靶点的核酸分子的糖蛋白靶点。本发明进一步涉及用于在该无细胞系统中生产糖基化蛋白的试剂盒和方法。
Description
本申请是申请号为201280066129.1、申请日为2012年11月5日、发明名称为“一种用于糖蛋白合成的基于原核生物的无细胞系统”的中国发明专利申请的分案申请,原申请为国际申请号为PCT/US2012/063590的国家阶段申请,该国际申请要求2011 年11月4日提交的美国临时专利申请号61/555,854的优先权,其通过引用整体并入本申请。
发明领域
本发明涉及用于生产糖基化蛋白或肽的无细胞系统、试剂盒和方法。
发明背景
无细胞蛋白合成系统正在逐渐成为有吸引力的替代依靠活细胞的常规表达系统的方案(Katzen等,“The Past,Present and Future of Cell-Free Protein Synthesis,”Trends Biotechnol.23:150-156(2005))。这是因为,在过去的十年中,无细胞蛋白合成反应: (i)能够在不到一天的时间内完成;(ii)使用的试剂成本降低;(iii)通过常规形成二硫键折叠复杂的蛋白;以及(iv)能够扩展至100L。两种主要方法已被用于体外转录/翻译:一种是基于无细胞提取物(CEF),其通常来源于大肠杆菌、家兔网状细胞或小麦胚芽,另一种是基于纯化组分的重构的蛋白合成(Shimizu等,“Cell-Free TranslationReconstituted With Purified Components,”Nat.Biotechnol.19:751-755 (2001))。由于其有能力在单一的集成平台上共同活化多种生化网络(Jewett等,“An IntegratedCell-Free Metabolic Platform for Protein Production and Synthetic Biology,”Mol. Syst.Biol.4:220(2008)),因而无细胞系统越来越多地用于多种重要的生物技术和合成生物学应用中(Ryabova等,“Functional Antibody Production Using Cell-FreeTranslation: Effects of Protein Disulfide Isomerase and Chaperones,”Nat.Biotechnol.15:79-84(1997); Noireaux等,“Principles of Cell-Free GeneticCircuit Assembly,”Proc.Nat’l.Acad.Sci. U.S.A.100:12672-12677(2003);Yang等,“Rapid Expression of Vaccine Proteins for B-Cell Lymphoma in a Cell-FreeSystem,”Biotechnol.Bioeng.89:503-511(2005))。
在无细胞系统中准确和有效地将蛋白糖基化的能力将在基础和应用研究的多个领域中具有优势,特别是考虑到N-连接糖基化在蛋白折叠、定量控制、分类、降解、分泌和活化中的重要性。(Helenius&Aebi,“Roles of N-Linked Glycans in the EndoplasmicReticulum,”Annu.Rev.Biochem.73:1019-1049(2004))。不幸的是,最佳表征和最广泛使用的基于大肠杆菌的无细胞翻译系统不能制备糖蛋白,因为大肠杆菌缺乏糖基化机制。同样地,家兔网状细胞和小麦胚芽CFE系统也不能进行这种翻译后修饰,因为其缺乏微粒体(Tarui等,“A Novel Cell-Free Translation/Glycosylation System Prepared FromInsect Cells,”J.Biosci.Bioeng.90:508-514(2000))。这可以通过补充具有微粒体囊泡的真核生物CFE克服(例如犬胰腺微粒体)(Lingappa等,“Coupled Cell-Free Synthesis,Segregation,and Core Glycosylation of a Secretory Protein,”Proc. Nat’l.Acad.Sci.U.S.A.75:2338-2342(1978);Rothblatt&Meyer,“Secretion in Yeast:Reconstitution of the Translocation and Glycosylation of Alpha-Factor andInvertase in a Homologous Cell-Free System,”Cell 44:619-628(1986)),但是由于一些CFE与微粒体囊泡之间的相容性较差所得到的系统并不总是忠实地加工靶蛋白(Rothblatt&Meyer, “Secretion in Yeast:Reconstitution of the Translocation andGlycosylation of Alpha-Factor and Invertase in a Homologous Cell-FreeSystem,”Cell 44:619-628(1986);Moreno等,“An mRNA-Dependent in VitroTranslation System from Trypanosoma brucei,”Mol.Biochem. Parasitol.46:265-274(1991))。形成能够进行N-连接糖基化的无细胞翻译系统的替代策略是由特定细胞制备CFE,如杂交瘤(Mikami等,“A Hybridoma-Based in Vitro Translation System ThatEfficiently Synthesizes Glycoproteins,”J.Biotechnol.127:65-78 (2006))、锥虫(Moreno等,“An mRNA-Dependent in Vitro Translation System from Trypanosomabrucei,”Mol.Biochem.Parasitol.46:265-274(1991))、昆虫细胞(Tarui 等,“A NovelCell-Free Translation/Glycosylation System Prepared From Insect Cells,”J.Biosci.Bioeng.90:508-514(2000))或哺乳动物细胞(Shibutani等,“Preparation of aCell-Free Translation System From PC12 Cell,”Neurochem.Res.21:801-807(1996))。然而,这些系统在技术上难以制备并且通常会产生低效的糖基化和较低的产物收率。而且,在上述所有系统中,糖基化过程实际上是“黑箱”,因此难以控制。
本发明的目的是克服本领域中的这些和其他缺陷。
发明概述
本发明的第一个方面涉及一种用于生产糖基化蛋白的无细胞系统。该系统包括能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶(OST);一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接;以及糖蛋白靶点,其包括一个或多个聚糖接受体氨基酸残基,或者编码所述糖蛋白靶点的核酸分子。
本发明的另一个方面涉及一种试剂盒,所述试剂盒包括能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶,以及一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接。
本发明的另一个方面涉及一种在无细胞系统中生产糖基化蛋白的方法。该方法包括提供能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶,提供一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接,以及提供包括一种或多种聚糖接受体氨基酸残基的糖蛋白靶点。该方法进一步包括将所述寡糖转移酶、一种或多种分离的聚糖和糖蛋白靶点组合以形成无细胞糖基化反应混合物,以及将所述无细胞糖基化反应混合物置于使寡糖转移酶有效地促使聚糖由脂质载体分子转移至糖蛋白靶点的一个或多个聚糖接受体残基的条件下以产生糖基化蛋白。
为解决其他无细胞细胞不能准确而有效地将蛋白糖基化的问题,本申请建立了两种新型的无细胞翻译/糖基化系统——称为“glycoCFE”和“glycoPURE”。这些系统将现有的体外翻译系统与重组N-连接糖基化途径相组合。纯化的糖基化组分来源于在革兰氏阴性菌空肠弯曲杆菌(Campylobacter jejuni)基因组中存在的蛋白糖基化基因座(pgl)(图1A)。该基因簇编码与真核生物和古生菌在功能上相似的N-连接糖基化系统,其包括寡糖转移酶,该寡糖转移酶催化预组装的寡糖由脂质载体整体转移至多肽保守基序[在真核生物中为N-X1-S/T和在细菌中为D/E-X1-N-X2-S/T(SEQ ID NO: 1)(Kowarik等,“Definition ofthe Bacterial N-Glycosylation Site Consensus Sequence,” EMBO J.25:1957-1966(2006),其通过引用整体并入本申请),其中X1和X2为除脯氨酸以外的任意残基]中的天冬酰胺残基(图1B)。空肠弯曲杆菌糖基化机制是非常适合用于无细胞翻译/糖基化系统中的,其原因如下。首先,使用整个pgl基因簇转化的大肠杆菌能够进行N连接蛋白糖基化(Wacker等,“N-Linked Glycosylation in Campylobacter jejuni and its FunctionalTransfer Into E.coli,”Science 298:1790-1793 (2002),其通过引用整体并入本申请),从而提供了用于生产纯的活性形式的必要组分的适宜宿主。由于大肠杆菌缺乏天然的糖基化机制,因此避免了来自背景N或O连接系统的潜在污染。其次,称为PglB的空肠弯曲杆菌OST(CjPglB)是一种当溶解在去垢剂中时活化的单亚基酶(Lizak等,“X-ray Structure ofa Bacterial Oligosaccharyltransferase,”Nature 474:350-355(2011),其通过引用整体并入本申请),并且其活性不需要任何辅助组分。再次,CjPglB能够在翻译后将糖转移至折叠蛋白局部的柔性结构(Kowarik等,“N-Linked Glycosylation of Folded Proteinsby the Bacterial Oligosaccharyltransferase,”Science 314:1148-1150(2006),其通过引用整体并入本申请),这表明无需添加功能性膜系统(例如微粒体)就能够实现蛋白糖基化。
附图简述
图1A-1B显示了细菌和真核生物N-连接糖基化的方面。图1A显示了编码N连接糖基化机制的空肠弯曲杆菌17-kb pgl基因座,其已在大肠杆菌中完全重建。图1B 显示了对在原核生物(左侧)和真核生物(右侧)中N-连接糖基化的比较。在这两个系统中,若干糖基转移酶通过将核苷酸活化的糖依次加入内膜胞质面上的脂质载体上合成聚糖。一旦组装后,翻转酶将脂质连接的聚糖(也称为脂质连接的寡糖或LLO) 跨膜转运,在膜上寡糖转移酶催化该转移至周质或内质网底物蛋白上的Asn残基。 PglB是单亚基、整合膜蛋白,其与真核生物OST STT3的催化亚基同源(注意Pg1B 和STT3的复合物未按照比例表示)。尽管真核生物和古生菌使用N-X-S/T接受体序列(其中X是除Pro以外的任意氨基酸),但是PglB需要在-2位包括Asp或Glu残基的延长的基序(D/E-X1-N-X2-S/T(SEQ ID NO:1),其中X1和X2可以是除Pro以外的任意氨基酸)。PglB能够在翻译后将糖转移至折叠蛋白局部的柔性结构。
图2A-2B显示了细菌OST的纯化。CjPglB在大肠杆菌C43(DE3)细胞中表达并纯化至接近均一。使用SDS-PAGE对从凝胶过滤柱上得到的洗脱组分(如所示的) 进行检测,并且将考马斯亮蓝染色凝胶图(图2B)与洗脱图(图2A)一起给出。MW,分子量标准品。
图3A-3C显示了使用确定的组分重建的糖基化。图3A,使用在大肠杆菌中生产的纯化的OST、提取的LLO和纯化的接受体蛋白进行的体外糖基化检测。图3A的免疫印迹显示了对接受体蛋白AcrA和scFv13-R4-GT(均抗-His)或聚糖(抗-聚糖)的检测结果。反应包括3μg野生型CjPglB、5(+)或10(++)μL LLO和5μg接受体蛋白。对照包括不同组分(-)的省略物、灭活的PglB(mut)和来自具有空pACYC的SCM6细胞的LLO(+/-)。糖基化产生了由未经修饰的(g0)向糖基化形式(g1和g2)迁移率的改变。图3B是与图3A中所述的相同的检测,但具有纯化的来自于红嘴鸥弯曲杆菌(Campylobacter lari)(ClPglB)的Pg1B。图3C显示了使用贮存3个月的冻融组分进行体外糖基化后检测的AcrA的免疫印迹。
图4A-4B显示了AcrA的无细胞翻译/糖基化。图4A是检测由通过使用大肠杆菌 CFE或纯化的翻译组分(PURE)体外翻译产生的不同AcrA构建体(抗-AcrA)的免疫印迹。通过将条带强度与第1道上样的纯化AcrA进行比较估算AcrA的浓度。图 4B是检测ΔssAcrA表达(抗-AcrA)和糖基化(抗-聚糖)的免疫印迹。ΔssAcrA由经 pET24(AcrA-cyt)启动的CFE或PURE系统通过无细胞翻译/糖基化产生。对照包括不同组分(-)的省略物或来自具有空pACYC的SCM6细胞的LLO(+/-)。
图5A-5B显示了scFv13-R4-GT的无细胞翻译/糖基化。图5A是检测由通过使用大肠杆菌无细胞提取物(CFE)或纯化的翻译组分(PURE)体外翻译产生的不同 scFv13-R4-GT(抗-FLAG)的免疫印迹。通过将条带强度与第1道上样的纯化 scFv13-R4-GT样品进行比较估算scFv13-R4-GT的浓度。图5B是检测scFv13-R4-GT 表达(抗-FLAG)和糖基化(抗-聚糖)的免疫印迹。scFv13-R4-GT蛋白由经 pET24-ssDsbAscFv13-R4-GT启动的CFE或PURE系统通过无细胞翻译/糖基化产生。对照包括不同组分(-)的省略物。
图6A-6C显示了适用于本发明的系统、试剂盒和方法的不同弯曲杆菌属Pg1B蛋白的氨基酸序列比对结果。PglB氨基酸序列来自于空肠弯曲杆菌(C.jejuni)(SEQ ID NO:2)、红嘴鸥弯曲杆菌(C.lari)(SEQ ID NO:4)、大肠弯曲杆菌(C.coli)(SEQ ID NO:6)和乌普萨拉弯曲杆菌(C.upsaliensis)(SEQ ID NO:8)。(*)表示具有单一的、完全保守的残基的位置;(:)表示保守基团之间具有较强的相似性质;以及(.)表示保守基团之间具有较弱的相似性质。基于弯曲杆菌属Pg1B序列比对的PglB共有序列如 SEQ ID NO:10所示。在四个弯曲杆菌属序列之间不完全保守的残基以X表示,其中 X可以是任意氨基酸残基。或者,X选自在四个所示的弯曲杆菌属序列之一的相应位置的氨基酸残基。
图7A-7E显示了适用于本发明的系统、试剂盒和方法的不同火球菌属(Pyrococcus)OST STT3亚基蛋白的氨基酸序列比对结果。OST氨基酸序列来自于激烈火球菌(P.furiosus)(SEQ ID NO:11)、火球菌属(Pyrococcus sp).ST04(SEQ ID NO:13)、火球菌属(Pyrococcus sp).(菌株NA2)(SEQ ID NO:14)、超嗜热火球菌(P. horikoshii)(SEQID NO:15)、深海火球菌(P.abyssi)(SEQ ID NO:16)和专性嗜压超嗜热火球菌(P.yayanosii)(SEQ ID NO:17)。(*)表示具有单一的、完全保守的残基的位置;(:)表示保守基团之间具有较强的相似性质;以及(.)表示保守基团之间具有较弱的相似性质。基于热球菌属STT3序列比对的STT3共有序列如SEQ ID NO:18 所示。在六个热球菌属序列之间不完全保守的残基以X表示,其中X可以是任意氨基酸残基。或者,X选自在六个所示的热球菌属序列之一的相应位置的氨基酸残基。
图8A-8D显示了适用于本发明的系统、试剂盒和方法的不同利什曼原虫属(Leishmania)OST STT3亚基蛋白的氨基酸序列比对结果。OST氨基酸序列来自于硕大利什曼原虫(L.major)(SEQ ID NO:19)、杜氏利什曼原虫(L.donovani)(SEQ ID NO:21)、婴儿利什曼原虫(L.infantum)(SEQ ID NO:22)、墨西哥利什曼原虫(L. mexicana)(SEQ ID NO:23)和巴西利什曼原虫(L.braziliensis)(SEQ ID NO:24)。 (*)表示具有单一的、完全保守的残基的位置;(:)表示保守基团之间具有较强的相似性质;以及(.)表示保守基团之间具有较弱的相似性质。基于利什曼原虫属STT3序列比对的STT3共有序列如SEQ ID NO:25所示。在五个利什曼原虫属序列之间不完全保守的残基以X表示,其中X可以是任意氨基酸残基。或者,X选自在五个所示的利什曼原虫属序列之一的相应位置的氨基酸残基。
图9A-9J包含了适用于本发明的系统、试剂盒和方法的真核生物STT3寡糖转移酶的列表。寡糖转移酶以提供了蛋白的氨基酸序列的UniProtKB输入编号(第1列)、UniProtKB输入名称(第2列)、蛋白名称(第3列)、基因名称(第4列)、生物体(第5列)和提供了蛋白编码的核苷酸序列的欧洲分子生物学实验室(EMBL)数据库登录号(第6列)表示。
发明详述
本发明的第一个方面涉及一种用于生产糖基化蛋白的无细胞系统。该系统包括能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶;一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接;以及糖蛋白靶点,其包括一个或多个聚糖接受体氨基酸残基,或者编码所述糖蛋白靶点的核酸分子。
根据本发明的这一方面和所有方面,“寡糖转移酶”(“OST”)通常指能够将聚糖即寡糖或多糖由供体底物转移至特定接受体底物的糖基化酶或糖基化酶复合物的亚基。供体底物通常是与聚糖连接的脂质载体分子,接受体底物通常是靶糖蛋白的特定氨基酸残基。适宜的OST包括将聚糖转移至天冬酰胺残基的那些酶即参与N-连接糖基化的OST,以及将聚糖或活化的糖部分转移至氨基酸残基羟基氧分子的那些酶即参与O-连接糖基化的OST。本发明的分离的OST可以是单一亚基的酶、多亚基的酶复合物或者来自于多亚基的酶复合物的单一亚基。尽管下文中描述了多种示例性的 OST,但是本领域技术人员将理解本领域公知的任意寡糖转移酶均适用于本发明。
根据本发明的这一方面和所有方面,所述OST可以是原核生物OST。仅作为举例,来自于弯曲空肠杆菌的单一的、整合膜OST蛋白PglB适用于本发明。PglB将七糖与糖蛋白靶点的天冬酰胺残基连接(Kowarik等,“Definition of the Bacterial N-glycosylationSite Consensus Sequence,”Embo J.25:1957-66(2006),其整体通过引用并入本申请)。编码弯曲空肠杆菌(C.jejuni)PglB(UniProtKB登录号Q9S4V7)的氨基酸序列如下述SEQ IDNO:2所示:
编码SEQ ID NO:2的氨基酸序列的核酸序列如下述SEQ ID NO:3所示(EMBL 核苷酸序列数据库编号AAD51383):
SEQ ID NO:2和3所示的氨基酸和核苷酸序列分别为代表性的空肠弯曲杆菌的PglB蛋白及其核酸序列。本领域技术人员将理解有至少70个亚种的空肠弯曲杆菌具有Pg1B蛋白,其与SEQ ID NO:2氨基酸序列的序列一致性可能不同,但是仍具有相同的功能。因此,其特征为与SEQ ID NO:2所示的空肠弯曲杆菌的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%的氨基酸序列一致性的来自空肠弯曲杆菌其他亚种和菌株的同源性Pg1B蛋白序列也适用于本发明。相关空肠弯曲杆菌Pg1B蛋白的氨基酸序列及编码其的核苷酸序列是已知的和本领域技术人员易于获得的。
与空肠弯曲杆菌的PglB具有序列一致性和/或能够将寡糖部分转移至靶糖蛋白的来自弯曲杆菌属其他种的OST也适用于本发明的这一方面和所有方面。例如,如本申请所示,来自红嘴鸥弯曲杆菌(Campylobacter lari)的PglB(ClPglB),其仅与空肠弯曲杆菌的氨基酸序列具有56%的序列一致性(Schwarz等,“Relaxed Acceptor Site Specificityof Bacterial Oligosaccharyltransferase in Vivo,”Glycobiology 21:45-54(2011),其全部内容通过引用并入本申请),能够在本发明的无细胞糖基化系统中将聚糖转移至靶糖蛋白的接受体氨基酸残基(即天冬酰胺)。编码C.lari PglB(UniProtKB登录号B9KDD4)的氨基酸序列如下述SEQ ID NO:4所示:
与SEQ ID NO:4所示的红嘴鸥弯曲杆菌(C.lari)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:4所示的氨基酸序列的核酸序列如下述SEQ ID NO: 5所示(EMBL核苷酸序列数据库编号ACM64573.1):
适用于本发明这一方面和所有方面的来自弯曲杆菌属的另一个N连接OST是来自大肠弯曲杆菌(C.Coli)的Pg1B。编码来自大肠弯曲杆菌(C.coli)的Pg1B的氨基酸序列(UniProtKB登录号H7WI6),其与弯曲空肠杆菌(C.jejuni)具有81%的一致性,如下述SEQID NO:6所示:
与SEQ ID NO:6所示的大肠弯曲杆菌(C.coli)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:6所示的氨基酸序列的核酸序列如下述SEQ ID NO: 7所示(EMBL核苷酸序列数据库编号EIB14175):
适用于本发明这一方面和所有方面的另一个弯曲杆菌属的OST是来自乌普萨拉弯曲杆菌(C.upsaliensis)的Pg1B。编码来自乌普萨拉弯曲杆菌(C.upsaliensis)的 Pg1B的氨基酸序列(UniProtKB登录号E6LAJ2),其与弯曲空肠杆菌(C.jejuni)具有57%的一致性,如下述SEQ ID NO:8所示:
与SEQ ID NO:8所示的乌普萨拉弯曲杆菌(C.upsaliensis)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:8所示的氨基酸序列的核酸序列如下述 SEQ IDNO:9所示(EMBL核苷酸序列数据库编号EFU71695):
对弯曲杆菌属Pg1B序列的比对见图6A-6C,基于该比对的Pg1B共有序列如图6 中的SEQ ID NO:10所示。在四个弯曲杆菌属序列之间不完全保守的残基以X表示,其中X可以是任意氨基酸残基。或者,X选自在所示弯曲杆菌属序列的相应位置所示的四个氨基酸残基之一。
在本发明的另一个实施方式中,OST是古生菌寡糖转移酶。例如,能够将聚糖转移至靶糖蛋白的天冬酰胺残基的来自激烈火球菌(Pyrococcus furiosus)的OST STT3亚基适用于本发明的这一方面和所有方面。激烈火球菌(P.furiosus)(UniProtKB登录号 Q8U4D2)的氨基酸序列如下述SEQ ID NO:11所示:
与SEQ ID NO:11所示的激烈火球菌(P.furiosus)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:11所示的氨基酸序列的核酸序列如下述SEQ ID NO:12所示(EMBL核苷酸序列数据库编号AAL80280):
与激烈火球菌(P.furiosus)OST STT3亚基相关蛋白具有序列一致性和/或能够将寡糖部分转移至靶糖蛋白的来自火球菌属其他种或菌株的OST也适用于本发明的这一方面和所有方面。例如,来源于火球菌种属(Pyrococcus sp.)ST04(SEQ ID NO:13; UniProtKBNo.I3RCF1)、火球菌种属(菌株NA2)(SEQ ID NO:14;UniProtKB No. F4HM23)、超嗜热火球菌(P.Horikoshii)(SEQ ID NO:15;UniProtKB No.O74088)、深海火球菌(P.Abyssi)(SEQID NO:16;UniProtKB No.Q9V250)和专性嗜压超嗜热火球菌(P.yayanosii)(SEQ ID NO:17;UniProtKB No.F8AIG3)的同源OST均与激烈火球菌(P.furiosus)OST(参见图7的比对结构)的氨基酸序列具有70%以上序列一致性,其适用于本发明的这一方面和所有方面。编码前述火球菌属OST的核苷酸序列是已知的和本领域易于获得的。基于火球菌属STT3序列比对的STT3共有序列如图7中的SEQ ID NO:18所示。在六个火球菌属序列之间不完全保守的残基以X表示,其中X可以是任意氨基酸残基。或者,X选自在六个所示的火球菌属序列之一的相应位置的氨基酸残基。
在本发明的另一个实施方式中,所述OST是真核生物寡糖转移酶。例如,硕大利什曼原虫(Leishmania major)OST的STT3亚基,其能够将聚糖转移至靶糖蛋白的天冬酰胺残基,其适用于本发明的这一方面和所有方面。硕大利什曼原虫(L.major)的氨基酸残基(UniProtKB登录号Q9U5N8)如下述SEQ ID NO:19所示。
与SEQ ID NO:19所示的硕大利什曼原虫(L.major)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:19所示的氨基酸序列(L.major STT3)的核酸序列如下述SEQ ID NO:20所示(EMBL核苷酸序列数据库编号CAB61569):
与硕大利什曼原虫(L.major)OST STT3亚基相关蛋白具有序列一致性和/或能够将寡糖部分转移至靶糖蛋白的来自利什曼原虫属其他种或菌株的OST也适用于本发明。例如,来源于杜氏利什曼原虫(L.donovani)(SEQ ID NO:21;UniProtKB No. E9BRZ2)、婴儿利什曼原虫(L.infantum)(SEQ ID NO:22;UniProtKB No.A4IB10)、墨西哥利什曼原虫(L.mexicana)(SEQ ID NO:23;UniProtKBKB No.E9B5Z4)和巴西利什曼原虫(L.braziliensis)(SEQ ID NO:24;UniProtKB No.A4HMD6)的同源性 OST,其均与硕大利什曼原虫(L.major)OST的氨基酸序列(参见图8的比对结果) 具有70%以上的序列一致性,也适用于本发明的这一方面和所有方面。基于利什曼原虫属STT3序列比对的STT3共有序列如图8中的SEQ ID NO:25所示。在五个利什曼原虫属序列之间不完全保守的残基以X表示,其中X可以是任意氨基酸残基。或者, X选自在五个所示的利什曼原虫属序列之一的相应位置的氨基酸残基。
在本发明的另一个实施方式中,所述真核生物寡糖转移酶是酿酒酵母(Saccharomyces cerevisiae)的STT3。酿酒酵母(S.cerevisiae)的氨基酸序列(UniProtKB登录号P39007)如下述SEQ ID NO:26所示。
与SEQ ID NO:26所示的酿酒酵母(S.cerevisiae)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:26所示的氨基酸序列(酿酒酵母(S.cerevisiae) STT3)的核酸序列如下述SEQ ID NO:27所示(EMBL核苷酸序列数据库编号 BAA06079)。
在本发明的另一个实施方式中,所述真核生物寡糖转移酶是栗酒裂殖酵母(Schizosaccharomyces pombe)的STT3。栗酒裂殖酵母(S.pombe)的氨基酸序列(UniProtKB登录号O94335)如下述SEQ ID NO:28所示。
与SEQ ID NO:28所示的栗酒裂殖酵母(S.pombe)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:28所示的氨基酸序列(栗酒裂殖酵母(S.pombe) STT3)的核酸序列如下述SEQ ID NO:29所示(EMBL核苷酸序列数据库编号 BAA76479)。
在本发明的另一个实施方式中,所述真核生物寡糖转移酶是盘基网柄菌(Dictyostelium discoideum)的STT3。盘基网柄菌(D.discoideum)的氨基酸序列(UniProtKB登录号Q54NM9)如下述SEQ ID NO:30所示。
与SEQ ID NO:30所示的盘基网柄菌(D.discoideum)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:30所示的氨基酸序列(盘基网柄菌(D. discoideum)STT3)的核酸序列如下述SEQ ID NO:31所示(EMBL核苷酸序列数据库编号EAL64892)。
能够在本发明的这一方面和所有方面使用的其他真核生物寡糖转移酶列于图9A-9G的表中。该表中以提供了所述蛋白的氨基酸序列的UniProtKB项目流水号以及提供了编码的核苷酸序列的EMBL数据库登录号表示各寡糖转移酶。各寡糖转移酶列于图9中的UniProtKB和EMBL登录号以及相应的氨基酸和核苷酸序列信息整体通过引用并入本申请。
在本发明的另一个实施方式中,所述寡糖转移酶是O连接的寡糖转移酶。一个示例性的O连接OST是铜绿假单胞菌(Pseudomonas aeruginosa)的PilO。PilO负责将寡糖由与脂质连接的供体整块转移至丝氨酸和苏氨酸残基的氧原子(Faridmoayer等,“FunctionalCharacterization of Bacterial Oligosaccharyltransferases Involved in O-LinkedProtein Glycosylation,”J.Bacteriol.189(22):8088-8098(2007),其通过引用整体并入本申请)。铜绿假单胞菌(P.aeruginosa)的氨基酸序列(UniProtKB登录号Q51353) 如下述SEQ ID NO:32所示:
与SEQ ID NO:32所示的铜绿假单胞菌(P.aeruginosa)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:33所示的氨基酸序列(铜绿假单胞菌(P. aeruginosa)PilO)的核酸序列如下述SEQ ID NO:33所示(EMBL核苷酸序列数据库编号AAA87404)。
适用于本发明所有方面的另一个示例性O连接的OST是脑膜炎双球菌(Neisseriameningitidis)的PglL(Faridmoayer等,“Functional Characterization of BacterialOligosaccharyltransferases Involved in O-Linked Protein Glycosylation,”J.Bacteriol. 189(22):8088-8098(2007),其通过引用整体并入本申请)。脑膜炎双球菌(N. meningitidis)的氨基酸序列(UniProtKB登录号G1FG65)如下述SEQ ID NO:34所示:
与SEQ ID NO:34所示的脑膜炎双球菌(N.menigitidis)的氨基酸序列具有至少约70%、更优选地至少约75%或80%、最优选地至少约85%或90%或95%一致性的氨基酸序列也适用于本发明。编码SEQ ID NO:34所示的氨基酸序列(脑膜炎双球菌(N.menigitidis)PglL)的核酸序列如下述SEQ ID NO:35所示(EMBL核苷酸序列数据库编号AEK98518)。
如本申请所使用的,“分离的”寡糖转移酶指基本上是纯的或基本上分离自在其天然的宿主细胞中天然伴随着天然蛋白的其他细胞组分的寡糖转移酶。通常地,本发明的分离的寡糖转移的纯度约80%,通常至少约90%以及更优选地至少约95%。可以采用本领域公知的任意方法对纯度进行评估,例如聚丙烯酰胺凝胶电泳、HPLC等。所述分离的寡糖转移酶可以从作为其直接来源的生物体中获得,或者从本申请实施例所述的宿主细胞中或使用下文所述的本领域公知的技术重组生产和纯化。
在通常情况下,使用重组表达系统生产和分离所目的蛋白涉及将编码所需蛋白的氨基酸序列的核酸分子插入表达系统中,所述分子在其中是异源性的(即通常是不存在的)。可以将编码一种或多种蛋白的一种或多种所需的核酸分子插入所述载体中。当插入多种核酸分子时,所述多种核酸分子可以编码相同或不同的酶。将异源性核酸分子以相对于启动子和任意其他的5’调控分子以及正确的阅读框架的正确的有意义 (5’→3’)方向插入表达系统或载体中。
可以采用本领域公知的标准克隆程序制备核酸构建体,如Joseph Sambrook等,MOLECULAR CLONING:A LABORATORY MANUAL(Cold Springs Harbor 1989)以及Cohen 和Boyer的美国专利号4,237,224所述的,其通过引用整体并入本申请。然后通过转化的方法将这些重组质粒引入适宜的宿主细胞并使其在宿主细胞中复制。
可以将控制多个层面上基因表达(例如DNA转录和信使RNA(“mRNA”)翻译)的多种遗传信号和加工事件引入核酸构建体以便最大限度的提高酶的生产。为了表达编码一种或多种所需酶的克隆的核酸序列,使用强启动子以获得高水平的转录是有益的。根据所使用的宿主系统,可以使用多种适宜的启动子中的任意一种。例如,当在大肠杆菌(E.coli)中克隆时,可以使用其噬菌体或质粒启动子如T7噬菌体启动子、lac启动子、trp启动子、recA启动子、核糖体RNA启动子、大肠杆菌噬菌体λ的PR和PL启动子等其他的包括但不限于lacUV5、ompF、bla、lpp等以使得邻近的DNA区段高水平的转录。此外,可以使用由重组DNA或其他合成DNA技术生产的杂交trp-lacUV5(tac)启动子或其他大肠杆菌(E.coli)启动子以便将插入的基因转录。适于在哺乳动物细胞中引导表达的常见启动子包括但不限于SV40、MMTV、金属硫蛋白-1、腺病毒Ela、CMV、立即早期、免疫球蛋白重链启动子和增强子以及RSV-LTR。
可以在核酸构建体中引入在原核细胞中有效基因转录和翻译所需的其他特异性起始信号以便最大限度的产生肽,例如Shine-Dalgarno核糖体结合位点。根据所使用的载体系统和宿主,可以使用任意数量适宜的转录和/或翻译元件包括组成型、诱导型和阻遏型启动子,以及最小的5’启动子元件、增强子或先导序列。最大限度地提高基因表达的综述见Roberts和Lauer,“Maximizing Gene Expression on a Plasmid Using RecombinationIn Vitro,”Methods in Enzymology 68:473–82(1979),其通过引用整体并入本申请。
使用本领域的标准克隆程序将编码寡糖转移酶或本发明的其他蛋白组分(例如糖蛋白靶点、参与聚糖生产的酶)的核酸分子、所选择的启动子分子包括但不限于增强子和先导序列、使得在宿主中转录的适宜的3’调控区以及任意其他所需的组分如报告子或标记物基因克隆至所选择的载体中,如根据Joseph Sambrook等,MOLECULAR CLONING:A LABORATORYMANUAL(Cold Springs Harbor 1989);Frederick M.Ausubel, SHORT PROTOCOLS IN MOLECULARBIOLOGY(Wiley 1999)以及Cohen和Boyer的美国专利号4,237,224所述的,其通过引用整体并入本申请。
一旦编码蛋白的核酸分子克隆至表达载体中,其已准备好掺入宿主中。可以使用本领域公知的标准克隆程序将重组分子引入细胞中,不限于通过转染(如果宿主是真核生物)、转导、偶联、带动转移、电穿孔、脂质转染、原生质体融合、氯化钙转化、带动转移、使用噬菌体转染或粒子轰击,如JOSEPH SAMBROOK等,MOLECULAR CLONING: A LABORATORY MANUAL(ColdSprings Harbor 1989)所述的,其通过引用整体并入本申请。
用于重组蛋白生产的适宜宿主细胞包括原核和真核细胞。适宜的原核宿主细胞包括但不限于大肠杆菌和其他肠杆菌科细菌埃希氏菌属(Escherichia sp.)、弯曲杆菌属(Campylobacter sp.)、沃廉菌属(Wolinella sp.)、脱硫弧菌属(Desulfovibrio sp.)、弧菌属 (Vibrio sp.)、假单胞菌属(Pseudomonas sp.)、芽孢杆菌属(Bacillus sp.)、李斯特氏菌属 (Listeria sp.)、葡萄球菌属(Staphylococcus sp.)、链球菌属(Streptococcussp.)、消化链球菌属(Peptostreptococcus sp.)、巨球型菌属(Megasphaera sp.)、梳状菌属(Pectinatus sp.)、月形单胞菌属(Selenomonas sp.)、嗜发酵菌属(Zymophilus sp.)、放线菌属 (Actinomyces sp.)、节杆菌属(Arthrobacter sp.)、弗兰克菌(Frankia sp.)、单孢丝菌 (Micromonospora sp.)、诺卡氏菌(Nocardia sp.)、丙酸杆菌属(Propionibacterium sp.)、链霉菌属(Streptomyces sp.)、乳杆菌属(Lactobacillussp.)、乳球菌属(Lactococcus sp.)、明串珠菌(Leuconostoc sp.)、片球菌(Pediococcussp.)、醋酸杆菌属(Acetobacterium sp.)、真杆菌属(Eubacterium sp.)、太阳杆菌属(Heliobacterium sp.)、螺旋阳光菌属 (Heliospirillum sp.)、鼠孢菌属(Sporomusasp.)、螺原体(Spiroplasma sp.)、尿支原体属 (Ureaplasma sp.)、丹毒丝菌属(Erysipelothrix,sp.)、棒杆菌属(Corynebacterium sp.)、肠球菌属(Enterococcussp.)、梭菌属(Clostridium sp.)、支原体属(Mycoplasma sp.)、分枝杆菌属(Mycobacterium sp.)、放线菌属(Actinobacteria sp.)、沙门氏菌属(Salmonella sp.)、志贺氏菌属(Shigella sp.)、莫拉氏菌属(Moraxella sp.)、缠绕杆菌属(Helicobactersp.)、寡养单胞菌属(Stenotrophomonas sp.)、微球菌属(Micrococcus sp.)、奈瑟氏菌属(Neisseria sp.)、蛭弧菌属(Bdellovibrio sp.)、嗜血杆菌属(Hemophilus sp.)、克雷伯氏菌属(Klebsiella sp.)、奇异变形杆菌(Proteus mirabilis)、阴沟肠杆菌(Enterobacter cloacae)、沙雷氏菌属(Serratia sp.)、枸橼酸杆菌属(Citrobactersp.)、变形杆菌属(Proteus sp.)、沙雷氏菌属(Serratia sp.)、耶尔森氏菌属(Yersiniasp.)、不动杆菌属(Acinetobacter sp.)、放线杆菌属(Actinobacillus sp.)、博德特氏菌属(Bordetella sp.)、布鲁氏菌(Brucella sp.)、二氧化碳嗜纤维菌属(Capnocytophagasp.)、心杆菌属(Cardiobacterium sp.)、艾肯菌属(Eikenella sp.)、弗朗西斯氏菌(Francisella sp.)、嗜血杆菌属(Haemophilus sp.)、金氏菌属(Kingella sp.)、巴斯德菌属(Pasteurella sp.)、黄杆菌属(Flavobacterium sp.)、黄单胞菌属(Xanthomonas sp.)、鼻疽菌属(Burkholderia sp.)、气单胞菌属(Aeromonas sp.)、邻单胞菌属(Plesiomonassp.)、军团菌属(Legionella sp.)和α-变形菌如沃尔巴克氏体属(Wolbachia sp.)、蓝藻、螺旋体、绿色硫黄菌和绿色非硫磺菌、革兰氏阴性球菌、苛求的革兰氏阴性杆菌、肠杆菌-葡萄糖-发酵的革兰氏阴性杆菌、革兰氏阴性杆菌-非葡萄糖发酵菌、革兰氏阴性杆菌-葡萄糖发酵的氧化酶阳性菌。除了细菌细胞以外,真核细胞如哺乳动物、昆虫和酵母系统也是用于重组蛋白生产的表达载体转染/ 转化的适宜宿主细胞。在本领域中能够获得的用于表达异源性蛋白或多肽的哺乳动物细胞系包括中国仓鼠卵巢细胞、HeLa细胞、仓鼠崽肾细胞、COS细胞和其他多种。
可以采用本领域公知的若干方法从宿主细胞中获得纯化的蛋白,包括离子交换层析、疏水性相互作用层析、亲和层析、凝胶过滤和反相层析。所述肽优选地通过常规技术以纯化形式生产(优选地纯度至少约70至约75%、或者纯度约80%至85%、更优选地纯度至少约90%或95%)。根据是否将重组的宿主细胞制成将蛋白分泌至生长培养基中(参见Bauer等的美国专利号6,596,509,其通过引用整体并入本申请),可以通过离心(将细胞组分与含有分泌蛋白的上清液分离)随后将上清液进行逐级硫酸铵沉淀分离和纯化蛋白。可以将含有蛋白的组分在适宜尺寸的葡聚糖或聚丙烯酰胺柱进行凝胶过滤以便将所述蛋白与其他细胞组分和蛋白分离。如有必要,可以使用HPLC 对蛋白组分进行进一步纯化。
寡糖转移酶催化聚糖由脂质供体转移至接受体蛋白、肽或多肽。在本发明的一个实施方式中,所述脂质供体或载体分子是原核脂质供体,即其在原核生物中制备或对原核生物是天然的。原核脂质供体的例子包括十一异戊烯-磷酸酯和十一异戊烯-磷酸酯连接的杆菌胺(bacillosamine)(Weerapana等,“Investigating Bacterial N-LinkedGlycosylation:Synthesis and Glycosyl Acceptor Activity of the UndecaprenylPyrophosphate-linked Bacillosamine,”J.Am.Chem.Soc.127:13766-67(2005),其通过引用整体并入本申请)。在本发明的另一个实施方式中,所述脂质供体是真核脂质供体,即其在真核细胞中制备或对真核细胞是天然的。示例性的真核脂质供体是多萜基焦磷酸酯。
根据本发明的这一方面和所有方面,所述聚糖包含与脂质供体分子连接的寡糖或多糖。构成寡糖或多糖链的聚糖组分的组合物在单糖单元的数量和类型上是不同的。聚糖的单糖组分包括但不限于一个或多个葡萄糖(Glc)、半乳糖(Gal)、甘露糖(Man)、岩藻糖(Fuc)、N-乙酰半乳糖胺(GalNAc)、N-乙酰葡糖胺(GlcNAc)、葡糖醛酸 (glucorionicacid)、木糖、唾液酸(例如N-乙酰-神经氨酸(NeuAc))、6-脱氧-塔罗糖和鼠李糖单糖。
根据本发明的这一方面和所有方面,所述聚糖可以是原核生物、古生菌或真核生物聚糖。或者,所述聚糖可以包含完全非天然的聚糖组合物。
在本发明的一个实施方式中,所述聚糖是由一种或多种原核生物糖基转移酶生产的原核生物聚糖。在本发明的另一个实施方式中,所述原核生物聚糖使用原核生物和真核生物糖基转移酶的组合生产,但是其具有模拟原核生物聚糖结构的单糖组合物。在本发明的另一个实施方式中,所述原核生物聚糖是合成生产的(Seeberger等, Chemical andEnzymatic Synthesis of Glycans and Glycoconjugates,in ESSENTIALS OF GLYCOBIOLOGY(A.Varki等eds.,2009),其通过引用整体并入本申请)。
示例性的原核生物聚糖是由空肠弯曲杆菌(C.jejuni)、大肠弯曲杆菌(C.Coli)、红嘴鸥弯曲杆菌(C.lari)或乌普萨拉弯曲杆菌(C.upsaliensis)Pgl基因簇或经修饰的空肠弯曲杆菌(C.jejuni)、大肠弯曲杆菌(C.Coli)、红嘴鸥弯曲杆菌(C.lari或乌普萨拉弯曲杆菌(C.upsaliensis)Pgl基因簇的糖基转移酶生产的聚糖。Pgl簇的基因包括wlaA、galE、wlaB、pglH、pglI、pglJ、pglB、pglA、pglC、pglD、wlaJ、pglE、 pglF和pglG(Szymanski和Wren,“Protein Glycosylation in Bacterial Mucosal Pathogens,” NatureMicrobiol.3:225-237(2005),其通过引用整体并入本申请)。原核生物聚糖通常包含二乙酰胺基-三脱氧糖,杆菌胺(bacillosamine)(Bac;2,4-二乙酰胺基-2,4,6- 三脱氧葡萄糖)。本发明这一方面和所有方面的适宜的原核生物聚糖是庚糖包括葡萄糖、N-乙酰半乳糖胺和杆菌胺,即GlcGalNAc5Bac。
如本申请中的实施例所述,本发明这一方面和所有方面的聚糖可以重组生产。例如,可以将编码进行GlcGalNac5Bac庚糖和其他聚糖结构生物合成的酶的修饰或未经修饰的空肠弯曲杆菌(C.jejuni)pgl基因簇分离并转移至用于生产脂质连接的聚糖的适宜宿主细胞中(亦参见Wacker等,“N-Linked Glycosylation in Campylobacter jejuni and itsFunctional Transfer into E.coli,”Science 298(5599):1790-93(2002),其通过引用整体并入本申请)。来源于其他弯曲杆菌种属例如大肠弯曲杆菌(C.coli)、红嘴鸥弯曲杆菌(C.lari)和乌普萨拉弯曲杆菌(C.upsaliensis)的Pgl基因簇也适于重组生产用于本发明所有方面的聚糖(Szymanski和Wren,“Protein Glycosylation in Bacterial MucosalPathogens,”Nature Microbiol.3:225-237(2005),其通过引用整体并入本申请)。此外,已在产琥珀酸沃林氏菌(Wolinella succinogens)、硫酸盐还原菌(Desulfovibriodesulfuricans)和普通脱硫弧菌(D.vulgaris)中鉴定得到的类似Pg1样糖基化基因基因座也适于重组生产本发明的聚糖(Baar等,“Complete Genome Sequence and Analysis ofWolinella succinogenes,”Proc.Natl.Acad.Sci.USA 100:11690-11695(2003)以及Szymanski和Wren,“Protein Glycosylation in Bacterial Mucosal Pathogens,”NatureMicrobiol.3:225-237(2005),其通过引用整体并入本申请)。
可以对Pg1基因簇进行修饰以增强脂质连接的聚糖在宿主细胞中的产生、累积和分离。例如,将基因簇的寡糖转移酶组分(例如在pgl基因簇中的pglB基因)灭活是所需的以阻止脂质连接的聚糖向宿主细胞的糖蛋白靶点转移。此外,在本发明的一些实施方式中,可能需要减弱、破坏或缺失宿主细胞中的竞争性聚糖生物合成反应。特别地,可能也需要将参与将聚糖转移或连接至宿主细胞接受体部分的宿主细胞糖基转移酶(N-连接或O-连接反应酶)或其他酶灭活。例如,当使用大肠杆菌(E.coli)作为宿主细胞时,将聚糖由十一异戊烯脂质载体转移至脂质A的WaaL酶的缺失,反而使寡糖穿梭至外膜的外叶,这将确保重组产生的脂质连接的聚糖在内膜中累积。可以缺失、破坏或修饰的其他大肠杆菌宿主细胞糖基化相关酶包括但不限于wecA、wbbL、 glcT、glf、gafT、wzx、wzy以及O16抗原生物合成通路的酶。
在本发明的另一个实施方式中,所述聚糖是真核生物聚糖,即由一种或多种真核糖基转移酶生产的聚糖。在本发明的一个实施方式中,真核生物聚糖仅由真核糖基转移酶生产。在本发明的另一个实施方式中,所述真核聚糖使用原核生物和真核生物糖基转移酶的组合生产,但是其模拟真核生物聚糖的结构。在本发明的另一个实施方式中,所述真核生物聚糖是合成生产的(Seeberger等,Chemical and Enzymatic Synthesis of Glycansand Glycoconjugates,in ESSENTIALS OF GLYCOBIOLOGY(A.Varki等,eds., 2009),其通过引用整体并入本申请)。
在一个实施方式中,所述真核生物聚糖包含GlcNAc2核。所述GlcNac2核可以进一步包含至少一个甘露糖残基。适宜的真核生物聚糖结构可以包括但不限于 Man1GlcNAc2、Man2GlcNAc2和Man3GlcNAc2。
如上文所述,可以通过在适宜的宿主细胞中引入一种或多种真核糖基转移酶重组生产真核生物脂质连接的聚糖。如本申请所使用的真核糖基转移酶指催化糖基从供体底物例如从活化的核苷酸的糖转移至接受体底物例如增长的脂质连接的寡糖链的酶。能够在宿主细胞中被利用以促进系统的真核生物脂质连接的聚糖重组生产的适宜的糖基转移酶包括但不限于半乳糖转移酶(例如β1,4-半乳糖转移酶、β1,3-半乳糖转移酶)、岩藻糖转移酶、葡萄糖转移酶、N-乙酰半乳糖胺转移酶(例如GalNAcT、 GalNAc-T1、GalNAc-T2、GalNAc-T3)、N-乙酰葡糖胺转移酶(例如β-1,2-N-乙酰葡糖氨基转移酶I(GnTI-)、GnT-II、GnT-III、GnT-IV、GnT-V、GnT-VI和GvT-IVH)、葡糖醛酸转移酶、唾液酸转移酶(例如α(2,3)唾液酸转移酶、α-N-乙酰半乳糖胺α-2,6- 唾液酸转移酶I、Galβ1,3GalNAcα2,3-唾液酸转移酶、β半乳糖苷-α-2,6-唾液酸转移酶和α2,8-唾液酸转移酶)、甘露糖转移酶(例如α-1,6-甘露糖转移酶、α-1,3-甘露糖转移酶、β-1,4-甘露糖转移酶)、葡糖醛酸转移酶、半乳糖醛酸转移酶等。已经在多种真核生物系统中对上述糖基转移酶进行了广泛的研究。因此,这些酶的核酸和氨基酸序列是本领域技术人员公知的和易于获得的。此外,这些酶中的多种是市售的(例如 Sigma-Aldrich,St.Louis,MO)。
用于生产原核生物或真核生物脂质连接聚糖的适宜的宿主细胞包括原核生物宿主细胞和真核生物宿主细胞。示例性的适宜宿主细胞列表如上文所示。当在原核宿主细胞中使用真核糖基转移酶时,可以对真核糖基转移酶的核苷酸序列进行密码子优化以克服与大肠杆菌(E.coli)(及其他细菌)和更高级的生物体之间密码子使用偏好相关的局限性,如酵母和哺乳动物细胞。密码子使用偏好指生物体在蛋白编码DNA序列 (基因)的密码子出现频率上存在差异。密码子是编码多肽链中特定氨基酸残基的一系列三联核苷酸(三联体)。可以通过制备特异性的颠换核苷酸改变,即嘌呤变成嘧啶或嘧啶变成嘌呤的核苷酸改变,或者转换核苷酸改变,即嘌呤变为嘌呤或嘧啶变为嘧啶的核苷酸改变。
根据本发明的这一方面和所有方面,“糖蛋白靶点”包括包含一个或多个聚糖接受体氨基酸残基的任意肽、多肽或蛋白。典型地聚糖接受体残基包含天冬酰胺(N或 Asn)以形成N-连接糖蛋白,或者在羟基赖氨酸、羟基脯氨酸、丝氨酸、苏氨酸或酪氨酸侧链上的羟基氧以形成O-连接糖蛋白。多种多样的糖蛋白靶点存在于包括但不限于结构分子(例如胶原蛋白)、润滑和保护剂(例如粘蛋白)、转运蛋白(例如转铁蛋白)、免疫蛋白(免疫球蛋白、组织相容性抗原)、激素、酶、细胞连接识别位点、受体、蛋白折叠伴侣、发育调控蛋白和参与止血和血栓形成的蛋白。治疗性蛋白如抗体是本发明系统重要的糖蛋白靶点。
根据本发明的这一方面和所有方面,糖蛋白靶点的一个或多个寡糖接受体残基可以是天冬酰胺(N或Asn)残基。天冬酰胺残基位于包含N-X1-S/T(真核生物共有序列)或D/E-X1-N-X2-S/T(SEQ ID NO:1)(原核生物共有序列)的糖基化共有序列中,其中D是天冬氨酸,X1和X2是除了脯氨酸以外的任意氨基酸,N是天冬酰胺和 T是苏氨酸。
根据本发明这一方面和所有方面的糖蛋白靶点可以是包含所需聚糖接受体残基的纯化的蛋白、肽或多肽。或者,所述糖蛋白靶点可以是编码所述糖蛋白靶点的分离的核酸分子形式。根据本发明的这个实施方式,所述系统进一步包括适于由所述核酸分子合成糖蛋白靶点的试剂,即翻译试剂。
用于在体外(即无细胞环境)由核酸分子合成蛋白的试剂是本领域公知的。这些试剂或系统通常由家兔网状细胞、小麦胚芽和大肠杆菌(E.coli)的提取物组成。所述提取物含有翻译外源性RNA分子所必需的所有大分子组分,包括例如核糖体, tRNA,氨酰基-tRNA合成酶,起始、延伸和终止因子。该系统所需的其他组分包括氨基酸、能量来源(例如ATP、GTP)、能量再生系统(用于真核生物系统的磷酸肌酸和肌酸磷酸激酶和用于原核生物系统的磷酸烯醇丙酮酸和丙酮酸激酶)以及其他辅因子(例如Mg2+、K+等)。如果编码糖蛋白靶点的核酸分子是DNA分子,则无细胞翻译反应与利用RNA聚合酶的起始转录反应偶联或连接。
本发明的另一个方面涉及一种试剂盒,所述试剂盒包括能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶和一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接。
根据本发明的这一方面,所述试剂盒的分离的寡糖转移酶可以是纯化的蛋白或者可以是编码寡糖转移酶的核酸的形式。核酸分子可以是DNA或RNA分子,并且其可以是线性的(裸露的)或环状的(位于表达载体中)。示例性的原核生物、古生菌和真核生物寡糖转移酶如上文所述。
如上文所述,所述一种或多种聚糖与脂质载体分子连接(例如十一异戊烯醇-焦磷酸酯、十一异戊烯焦磷酸酯连接的杆菌胺或多萜基焦磷酸酯)。亦如上文所述,所述聚糖可以包含原核生物、古生菌、真核生物或完全非天然合成的聚糖。适宜的原核生物核心聚糖结构包括包含葡萄糖、N-乙酰半乳糖胺和任选地杆菌胺(例如 GlcGalNAc5Bac)的庚糖。适宜的真核生物聚糖核心结构包括N-乙酰葡糖胺和甘露糖 (例如Man1GlcNAc2、Man2GlcNAc2和Man3GlcNAc2)。
在本发明这一方面的一个实施方式中,所述试剂盒的一种或多种与脂质载体分子连接的分离的聚糖是组合和纯化的形式。或者,本发明的试剂盒包括编码一种或多种真核生物和/或原核生物糖基转移酶的一种或多种核酸分子以及含有聚异戊二烯基焦磷酸酯聚糖载体并且能够表达一种或多种核酸分子的宿主细胞(真核或原核)。根据本发明的这一实施方式,所述试剂盒可以进一步含有用于在使用其他的试剂盒组分之前在宿主细胞中重组生产和分离脂质连接的聚糖的说明书。
本发明的试剂盒可以进一步包括用于合成所选择的寡糖转移酶和/或糖蛋白、肽或多肽的体外或无细胞转录和/或翻译试剂。
本发明的另一个方面涉及一种在无细胞系统中生产糖基化蛋白的方法。该方法涉及提供一种能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶,提供一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接,以及提供包括一种或多种聚糖接受体氨基酸残基的糖蛋白靶点。该方法进一步涉及将所述寡糖转移酶、一种或多种分离的聚糖以及糖蛋白靶点组合以形成无细胞糖基化反应混合物,以及将所述无细胞糖基化反应混合物置于使寡糖转移酶有效地将聚糖由脂质载体分子转移至糖蛋白靶点的一种或多种聚糖接受体残基的条件下以生产糖基化的蛋白。
本发明方法的组分即寡糖转移酶、与脂质载体分子连接的分离的聚糖和糖蛋白靶点已在上文中详细地描述。
本发明的方法可以包括一个或多个附加步骤。例如,通过提供适于由核酸分子合成糖蛋白靶点的试剂可以将糖蛋白靶点的翻译与糖基化偶联。在本发明的这个实施方式中,将编码糖蛋白靶点的核酸分子、翻译试剂、寡糖转移酶、分离的聚糖全部组合以形成翻译-糖基化反应混合物。然后,在糖基化反应之前或同时由靶核酸分子合成糖蛋白靶点。
本申请还包括以下实施方式:
实施方式1.一种用于生产糖基化蛋白的无细胞系统,所述系统包括:
能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶;
一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接;以及
糖蛋白靶点,其包括一个或多个聚糖接受体氨基酸残基,或者编码所述糖蛋白靶点的核酸分子。
实施方式2.根据实施方式1所述的系统,其中所述寡糖转移酶是原核生物寡糖转移酶。
实施方式3.根据实施方式2所述的系统,其中所述原核生物寡糖转移酶来源于弯曲杆菌属(Campylobacter)。
实施方式4.根据实施方式1所述的系统,其中所述寡糖转移酶是古生菌寡糖转移酶。
实施方式5.根据实施方式1所述的系统,其中所述寡糖转移酶是真核生物寡糖转移酶。
实施方式6.根据实施方式1所述的系统,其中所述脂质载体分子包括十一碳二烯磷酸酯。
实施方式7.根据实施方式1所述的系统,其中所述一种或多种分离的聚糖包括原核生物聚糖。
实施方式8.根据实施方式1所述的系统,其中所述原核生物聚糖包括GlcGalNAc5Bac。
实施方式9.根据实施方式1所述的系统,其中所述一种或多种分离的聚糖包括真核生物聚糖。
实施方式10.根据实施方式9所述的系统,其中所述真核生物聚糖包括GlcNAc2。
实施方式11.根据实施方式10所述的系统,其中所述真核生物聚糖进一步包括至少一个甘露糖残基。
实施方式12.根据实施方式9所述的系统,其中所述真核生物聚糖包括选自Man1GlcNAc2、Man2GlcNAc2和Man3GlcNAc2的组分。
实施方式13.根据实施方式1所述的系统,其中所述糖蛋白靶点的一个或多个聚糖接受体氨基酸残基是天冬酰胺残基。
实施方式14.根据实施方式13所述的系统,其中糖蛋白靶点进一步包括N-X1-S/T或D/E-X1-N-X2-S/T(SEQ ID NO:1)聚糖接受体氨基酸序列基序,其中D是天冬氨酸、X1和X2是脯氨酸以外的任意氨基酸、N是天冬酰胺和T是苏氨酸。
实施方式15.根据实施方式1所述的系统,其进一步包括:适于由所述核酸分子合成糖蛋白靶点的试剂。
实施方式16.根据实施方式1所述的系统,其中所述糖蛋白靶点包括抗体。
实施方式17.一种试剂盒,包括:
能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶,以及
一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接。
实施方式18.根据实施方式17所述的试剂盒,其进一步包括:
适于由编码所述糖蛋白靶点的核酸分子合成糖蛋白靶点的试剂。
实施方式19.一种在无细胞系统中生产糖基化蛋白的方法,所述方法包括:
提供能够将聚糖由脂质载体分子转移至糖蛋白靶点的分离的寡糖转移酶;
提供一种或多种分离的聚糖,其中各聚糖均与脂质载体分子连接;
提供包括一种或多种聚糖接受体氨基酸残基的糖蛋白靶点;
将所述寡糖转移酶、一种或多种分离的聚糖和糖蛋白靶点组合以形成无细胞糖基化反应混合;以及
将所述无细胞糖基化反应混合物置于使寡糖转移酶有效地促使聚糖由脂质载体分子转移至糖蛋白靶点的一个或多个聚糖接受体残基的条件下以产生糖基化蛋白。
实施方式20.根据实施方式19所述的方法,其中所述寡糖转移酶是原核生物寡糖转移酶。
实施方式21.根据实施方式20所述的方法,其中所述原核生物寡糖转移酶来源于弯曲杆菌属(Campylobacter)。
实施方式22.根据实施方式19所述的方法,其中所述寡糖转移酶是古生菌寡糖转移酶。
实施方式23.根据实施方式19所述的方法,其中所述寡糖转移酶是真核生物寡糖转移酶。
实施方式24.根据实施方式19所述的方法,其中所述脂质载体分子包括十一碳二烯磷酸酯。
实施方式25.根据实施方式19所述的方法,其中所述一种或多种分离的聚糖包括原核生物聚糖。
实施方式26.根据实施方式25所述的方法,其中所述原核生物聚糖包括GlcGalNAc5Bac。
实施方式27.根据实施方式19所述的方法,其中一种或多种分离的聚糖包括真核生物聚糖。
实施方式28.根据实施方式27所述的方法,其中所述一种或多种真核生物聚糖包括GlcNAc2。
实施方式29.根据实施方式28所述的方法,其中所述一种或多种真核生物聚糖进一步包括至少一个甘露糖残基。
实施方式30.根据实施方式28所述的方法,其中所述一种或多种真核生物聚糖包含选自Man1GlcNAc2、Man2GlcNAc2和Man3GlcNAc2的组分。
实施方式31.根据实施方式19所述的方法,其中所述提供糖蛋白靶点包括提供编码所述糖蛋白的核酸序列,所述方法进一步包括:
提供适于由所述核酸分子合成糖蛋白靶点的试剂,以及
先于所述操作或与所述操作同时地将所述试剂在有效地从所述核酸分子合成所述糖蛋白靶点的条件下与糖基化反应混合。
实施方式32.根据实施方式19所述的方法,其中所述糖蛋白靶点的一种或多种聚糖接受体氨基酸残基是天冬酰胺残基。
实施方式33.根据实施方式32所述的方法,其中所述糖蛋白靶点进一步包括 N-X1-S/T或D/E-X1-N-X2-S/T(SEQ ID NO:1)聚糖接受体氨基酸序列基序,其中D 是天冬氨酸、X1和X2是脯氨酸以外的任意氨基酸、N是天冬酰胺和T是苏氨酸。
实施方式34.根据实施方式19所述的方法,其中所述蛋白包括抗体。
实施例
实施例1-4的材料和方法
蛋白纯化。对于CjPglB的纯化而言,使用质粒pSN18对大肠杆菌菌株C43(DE3)(Lucigen,Middleton,WI)进行新鲜转化(Kowarik等,“N-Linked Glycosylation ofFolded Proteins by the Bacterial Oligosaccharyltransferase,”Science 314:1148-1150 (2006),其通过引用整体并入本申请),所述质粒是编码具有C-末端十个组氨酸亲和标签的空肠弯曲杆菌(C.jejuni)pglB的经修饰的pBAD表达质粒。将细胞在37℃下在补充了100μg/mL氨苄西林的1.5L优质肉汤培养基中培养。当培养物的光密度 (A600)达到~1.0时,加入0.02%的阿拉伯糖(w/v)在30℃下诱导细胞4.5h。除非有不同的说明,否则所有的下述步骤均在4℃下进行。离心收集细胞,将其重悬于25 mM Tris,pH 8.0和250mM NaCl中并通过三通道的弗氏细胞压碎器裂解细胞 (SLM-Aminco;10,000PSI,SLM Instruments,Inc.,Urbana,IL)。离心除去细胞碎片后,经100,000×g超离心1h将膜组分分离。将含有Pg1B的膜重悬于25mM Tris–HCl,pH 8.0、250mM NaCl、10%甘油(v/v)和1%DDM(w/v)(DDM,Anatrace,Affymetrix, Inc.,Santa Clara,CA)中并孵育2h。经100,000×g超离心1h除去不溶性组分。所有随后的缓冲液中均含有DDM作为去垢剂。在溶解的膜中加入10mM咪唑,将其上样于Ni-NTA超流亲和柱(Qiagen,Valencia,CA)上并在使用200mM咪唑洗脱Pg1B前使用60mM咪唑洗涤。然后将经纯化的蛋白进样至使用AKTA-FPLC的Superdex 200 凝胶过滤柱(GE Healthcare,Waukesha,WI)上。对洗脱物组分进行十二烷基硫酸钠- 聚丙烯酰胺凝胶电泳(SDS–PAGE)并使用考马斯亮蓝染色以鉴定含有Pg1B的组分 (图2)。使用PD10脱盐柱(GE Healthcare)将蛋白脱盐至20mM Tris,pH 7.5、100 mM NaCl、5%甘油(w/v)和0.05%DDM(w/v)中并在截止分子量为100kDa的Amicon centricon超滤管中将其浓缩至5–10mg/mL。对无活性的CjPglB突变体进行同样操作的表达和纯化,其使用携带质粒pSN18.1的C43(DE3)细胞,其编码来自pACYCpglmut 的pglB亚克隆无活性的拷贝(如下所示)。从携带质粒pSF2的BL2-Gold(DE3)细胞(Stratagene,La Jolla,CA)中纯化ClPglB,如其他地方所述(Lizak等,“X-ray Structure of a Bacterial Oligosaccharyltransferase,”Nature474:350-355(2011),其通过引用整体并入本申请)。在-20℃下长期保存时,将PglB样品中的甘油含量增加至10%(w/v)。从分离自携带质粒pET24(AcrA-per)(Nita-Lazar等,“TheN-X-S/T Consensus Sequence is Required but not Sufficient for Bacterial N-Linked Protein Glycosylation,”
Glycobiology 15:361-367(2005),其通过引用整体并入本申请)或
pET24-ssDsbAscFv13-R4-GT(见下文)的BL21(DE3)细胞的周质组分中纯化AcrA和scFv13-R4-GT。如此前所述(Schwarz等,“Relaxed Acceptor Site Specificity ofBacterial Oligosaccharyltransferase in Vivo,”Glycobiology 21:45-54(2011),其通过引用整体并入本申请)制备周质提取物,补充咪唑使其终浓度达到10mM,无菌过滤(0.22μm)并通过使用Ni-NTA超流亲和柱(Qiagen,Valencia,CA)的镍亲和层析进行纯化。
脂质连接聚糖的分离。将使用pACYCpglmut(Wacker等,“N-Linked Glycosylationin Campylobacter jejuni and its Functional Transfer Into E.coli,”Science 298:1790-1793 (2002),其通过引用整体并入本申请)转化的大肠杆菌(Escherichia coli)SCM6细胞于37℃下在补充了25μg/mL氯霉素的1L Luria-Burtani中培养,pACYCpglmut编码空肠弯曲杆菌(C.jejuni)LLO和无活性的空肠弯曲杆菌(C.jejuni)pglB基因(W458A 和D459A)的生物合成。当A600达到~1.0时,经离心收集细胞并将细胞团在-80℃和0.04mbar下冷冻干燥20h。所有的后续步骤均使用玻璃试管和玻璃移液管进行。在25mL 10:20:3的CHCl3:MeOH:H2O中提取经均质化的细胞团,随后3000×g离心 30min。使用旋转蒸发仪(Büchi,Flawil,Sankt Gallen,Switzerland)蒸发上清液,随后将所得到的沉淀重悬于1mL10:20:3CHCl3:MeOH:H2O中并超声直至均质。在37℃下在氮气流中干燥样品,将其溶解在10mM HEPES(4-(2-羟乙基)-1-哌嗪乙磺酸), pH 7.5、1mM MnCl2和0.1%DDM(w/v)中并在-20℃下保存。采用相同的程序从携带空pACYC的SCM6细胞中提取脂质。
无细胞翻译和糖基化。为了对纯化的接受体蛋白进行体外糖基化,将在10mMHEPES,pH 7.5、1mM MnCl2和0.1%DDM(w/v)中含3μg纯化的PglB、5–10μL 提取的LLO和5μg纯化的AcrA或scFv13-R4-GT的50μL溶液在30℃下孵育12h。为了对没有糖基化的AcrA和scFv13-R4-GT进行体外翻译,根据生产厂商的说明使用 S30 T7高产出表达系统(Promega,Fitchburg,WI)或PURExpress(New England Biolabs, Ipswich,MA)制备50μL反应物。在每个反应物中加入总计1μg的下述质粒:pET24b (Novagen,Madison,WI);编码具有C-末端六个组氨酸标签的全长空肠弯曲杆菌(C. jejuni)AcrA的pET24-AcrA(Nita-Lazar等,“TheN-X-S/T Consensus Sequence is Required but not Sufficient for Bacterial N-Linked Protein Glycosylation,”Glycobiology 15:361-367(2005),其通过引用整体并入本申请);编码在其天然输出信号的位置具有N-末端PelB信号肽的AcrA版本的pET24(AcrA-per)(Nita-Lazar等,“The N-X-S/T Consensus Sequence is Required but notSufficient for Bacterial N-Linked Protein Glycosylation,”Glycobiology 15:361-367(2005),其通过引用整体并入本申请);编码不具有N-末端输出信号的AcrA版本(ΔssAcrA)的pET24(AcrA-cyt)(Nita-Lazar 等,“The N-X-S/T Consensus Sequence isRequired but not Sufficient for Bacterial N-Linked Protein Glycosylation,”Glycobiology 15:361-367(2005),其通过引用整体并入本申请)以及编码具有来自大肠杆菌(E.coli)DsbA的用于分泌的N-末端信号肽和 C-末端GT(Fisher等,“Production ofSecretory and Extracellular N-Linked Glycoproteins in Escherichia coli,”Appl.Environ.Microbiol.77:871-881(2011),其通过引用整体并入本申请)后接FLAG和六个组氨酸表位标签的表达优化的scFv13-R4胞内抗体基因 (Martineau等,“Expression ofan Antibody Fragment at High Levels in the Bacterial Cytoplasm,”J.Mol.Biol.280:117-127(1998),其通过引用整体并入本申请)的 pET24-ssDsbA-scFv13-R4-GT。为了进行体外翻译/糖基化反应,在50μL反应反应物中加入3μg纯化的PglB、5μL提取的LLO、1μg纯化的质粒DNA、1mM MnCl2和 0.1%DDM(w/v)并在30℃下孵育12h。选择DDM用于体外翻译/糖基化,因为此前已经观察到其在大肠杆菌(E.coli)来源的CFE系统中具有良好的耐受性(Klammt 等,“Evaluation of Detergents for the Soluble Expression ofAlpha-Helical and Beta-Barrel-Type Integral Membrane Proteins by aPreparative Scale Individual Cell-Free Expression System,”Febs J.272:6024-6038(2005),其通过引用整体并入本申请)。
Western印迹分析。在SDS-PAGE之后进行免疫印迹对AcrA和scFv13-R4-GT 的表达和糖基化情况进行分析。使用单克隆的抗-His抗体(Qiagen,Valencia,CA)、单克隆的抗-FLAG抗体(Abcam,Cambridge,MA)、多克隆的抗-AcrA血清(Wacker 等,“N-LinkedGlycosylation in Campylobacter jejuni and its Functional Transfer Into E.coli,”Science 298:1790-1793(2002),其通过引用整体并入本申请)和多克隆的抗-聚糖血清hR6进行免疫检测。在SDS-PAGE之前使用RNase A(Roche Diagnostics GmbH,Mannheim,Germany)处理所有的体外翻译样品以减轻由过量的RNA导致的凝胶电泳不规则。所有实验均至少重复三次,并显示代表性的样品。
实施例1-N-连接糖基化组分的制备
首先,尝试在体外对细菌N-连接糖基化进行功能性重建。最低限度,这需要三个组分:OST、脂质连接的寡糖(LLO)(即脂质连接的聚糖)和携带D/E-X1-N-X2-S/T 基序的接受体蛋白。对于OST而言,在大肠杆菌(E.coli)细胞的膜组分中表达CjPglB,使用1%的N-十二烷基-β-D-麦芽糖苷(DDM)溶解并通过镍亲和层析后接凝胶过滤纯化至接近均质(图2B)。另外,使用携带空肠弯曲杆菌(C.jejuni)pgl基因座的大肠杆菌(E.coli)细胞生产寡糖供体。该基因簇编码实施GlcGalNAc5Bac庚糖(其中 Bac是杆菌胺)生物合成并且将其由膜锚定的十一异戊烯焦磷酸酯(UndPP)转移至天冬酰胺残基的酶。这里,将该基因簇携带无活性的pg1B基因(Wacker等,“N-Linked Glycosylation in Campylobacter jejuni and itsFunctional Transfer Into E.coli,”Science 298:1790-1793(2002),其通过引用整体并入本申请)的经修饰的版本转移至大肠杆菌 SCM6细胞并用于制备LLO。选择SCM6细胞的几个原因是:首先,这些细胞缺乏 WaaL酶,该酶天然地将寡糖(例如O-抗原,聚糖)由脂质载体十一异戊烯基转移至脂质A,其反而使寡糖穿梭至外膜的外叶(Feldman等,“EngineeringN-Linked Protein Glycosylation With Diverse O Antigen LipopolysaccharideStructures in Escherichia coli,” Proc.Nat’l.Acad.Sci.U.S.A.102:3016-3021(2005),其通过引用整体并入本申请)。这样,在缺乏WaaL时,所需的脂质连接的聚糖在内膜中累积。其次,将启动GlcNAc 转移酶的脂多糖及肠道细菌常见抗原WecA除去。因此,该菌株应该仅生产在缩小的末端具有GlcGalNAc5Bac的LLO。在这一观点的支持下,此前对从大肠杆菌菌株中提取的LLO进行质谱分析的结果与本申请中使用的一个(即ΔwaaLΔwecA)类似,这表明仅检测到了含有GlcGalNAc5Bac庚糖的LLO(Reid等,“Affinity-Capture TandemMass Spectrometric Characterization of Polyprenyl-Linked Oligosaccharides:Tool to Study Protein N-Glycosylation Pathways,”Anal.Chem.80:5468-5475(2008),其通过引用整体并入本申请)。对于寡糖接受体而言,从周质中纯化来自空肠弯曲杆菌(C.jejuni) (Nita-Lazar等,“The N-X-S/T Consensus Sequence is Required but notSufficient for Bacterial N-Linked Protein Glycosylation,”Glycobiology 15:361-367(2005),其通过引用整体并入本申请)的模型糖蛋白AcrA。AcrA存在两个共有的D/E-X1-N-X2-S/T位点,其被CjPglB糖基化(Kowarik等,“Definition of the Bacterial N-Glycosylation Site Consensus Sequence,”EMBO J.25:1957-1966(2006),其通过引用整体并入本申请)。或者,对称为scFv13-R4-GT的糖基工程化的单链可变片段(scFv)进行简单纯化,所述片段携带由四个被连续的甘氨酸残基彼此间隔的连续DQNAT基序组成的C-末端糖基化标签(GT)(Fisher等,“Production of Secretory and Extracellular N-LinkedGlycoproteins in Escherichia coli,”Appl.Environ.Microbiol.77:871-881(2011),其通过引用整体并入本申请)。
实施例2-在体外对空肠弯曲杆菌(C.jejuni)蛋白糖基化途径进行功能性重建
为评估重建的糖基化途径,将CjPglB OST与从大肠杆菌细胞中提取的LLO和纯化的AcrA组合。该反应使两个AcrA位点均有效地糖基化,其由接近全部的AcrA由未经修饰的(g0)形式迁移至完全糖基化的(g2)形式的迁移率所证实(图3A)。该活性取决于PglB和LLO。将LLO的浓度加倍导致了除了g2以外,还出现了AcrA的 g0和g1形式,这表明糖基化的效率略有降低。重要的是,当使用缺乏pg1簇或无活性CjPglB突变的细胞的脂质提取物时,糖基化活性丧失(图3A)。通过检测对空肠弯曲杆菌(C.jejuni)N-聚糖具有血清特异性的糖基化AcrA对这些结果进行了确证 (图3A)。当使用糖基工程化的scFv13-R4-GT蛋白作为寡糖接受体时观察到了几乎相同的结果(图3A)。应注意的是,g2、g3和g4是本申请中检测到的主要糖形式,几乎无法检测g1。为证实其他OST能够在这个系统中使用,还使用红嘴鸥弯曲杆菌(Campylobacter lari)PglB(ClPglB)进行了AcrA的体外糖基化,其与空肠弯曲杆菌(C.jejuni)具有56%的一致性(Schwarz等,“Relaxed Acceptor Site Specificity ofBacterial Oligosaccharyltransferase in Vivo,”Glycobiology 21:45-54(2011),其通过引用整体并入本申请)。在检测条件下结果得到几乎相同量的g0、g1和g2形式(图3B)。为了在翻译/糖基化反应中使用,纯化的糖基化组分必须能够耐受长期贮存和冻融循环。为了对这方面进行检测,将所述组分分别在-20℃下贮存3个月。除了将Pg1B 样品中甘油的终浓度增加至10%以外不改变贮存缓冲液。在此期间将各组分均冻融 5-10次,随后使用ClPglB进行体外反应。该反应产生的AcrA糖基化的效率仅略低于新鲜纯化组分的糖基化(比较图3B和3C)。
实施例3-蛋白靶点的无细胞翻译
为确定是否存在能够合成目的蛋白靶点的无细胞翻译系统,对基于大肠杆菌(E.coli)CFE的蛋白合成系统和使用纯化的翻译组分和T7 RNA聚合酶的PURE(使用重组元件的蛋白合成)系统(Shimizu等,“Cell-Free Translation Reconstituted With PurifiedComponents,”Nat.Biotechnol.19:751-755(2001),其通过引用整体并入本申请)进行了评估。其涉及启动具有在T7启动子驱动的pET载体中克隆的三个不同的AcrA DNA 序列的CFE和PURE系统。使用CFE系统,在1h内生产~150–250μg/mL各AcrA变体作为全长多肽(图4A)。携带其天然信号肽的AcrA累积至最高水平但也出现了最大量的降解。而相反的是,在天然信号位置携带PelB信号肽的AcrA和缺乏信号肽的 AcrA均累积至略低的浓度但是未出现可见的降解。同样地,PURE系统生产全部三种 AcrA变体作为全长多肽,尽管其水平(均为~100μg/mL/h)略低于基于CFE的系统 (图4A)。这两个系统均能够产生显著量的scFv13-R4-GT(图5A)。应注意的是,此前已在无氧化性条件下(即缺乏二硫键)对该scFv的表达进行了优化(Martineau 等,“Expression of an Antibody Fragment at High Levels in theBacterial Cytoplasm,”J. Mol.Biol.280:117-127(1998),其通过引用整体并入本申请),因此其不需要特别的转录/翻译条件。
实施例4-靶糖蛋白的无细胞翻译和糖基化
受这些结果的鼓舞,通过将纯化的糖基化组分(扣除接受体蛋白)与无细胞翻译系统之一组合构建glycoCFE和glycoPURE翻译/糖基化系统。选择编码不含N-末端信号肽的AcrA的质粒pET24(AcrA-cyt)评估这些系统,因为其在这两个系统中均产生显著量的靶蛋白且无可检测的降解。当使用这种质粒以及CjPglB和LLO启动CFE 或PURE系统时,AcrA主要以双重糖基化的g2糖形式生产,其具有较少量的g1和实质上不含可检测的未经修饰的AcrA(图4B)。据估计在12h后在1mL反应体积中产生~100–150μg糖基化的AcrA。同样地,利用glycoCFE和glycoPURE系统均有效地产生scFv13-R4-GT,其蛋白的~50%具有完全糖基化的g4形式,50%为g3形式(图 5B)。这两个系统均在12h内产生~50–100μg/mL糖基化的scFv13-R4-GT。因此, glycoCFE和glycoPURE系统含有有效地翻译N-连接糖蛋白所必需的全部组分。
实施例1-4的讨论
本申请开发开放的基于原核生物的翻译/糖基化系统的主要优势是能够以精确比率提供纯化的糖基化组分及其底物和辅助因子(Lizak等,“X-ray Structure of aBacterial Oligosaccharyltransferase,”Nature 474:350-355(2011),其通过引用整体并入本申请)。同样地,能够完全降低或消除抑制性底物如蛋白酶和催化糖苷键水解的糖苷酶的浓度。此外,该体外系统允许引入可能与体内系统不相容的组分如某些在体内不能生产或翻转的LLO。在此前任意的翻译/糖基化系统中都难以获得该可控性水平并且其具有下述几个方面的意义。首先,其有助于避免糖蛋白的异质性,这在评估具体聚糖结构贡献的基础研究中或在药用糖蛋白的生产中特别麻烦。按照这些原则,glycoCFE和 glycoPURE系统应允许检验与糖基化机制相互作用或对其产生刺激或者促进接受体位点占有率增加的因素。尽管在本申请中通过CjPglB观察到的的糖基化效率超出了通常在体内观察到的水平(Kowarik等,“N-Linked Glycosylation of Folded Proteins by the BacterialOligosaccharyltransferase,”Science 314:1148-1150(2006);Kowarik等,“Definitionof the Bacterial N-Glycosylation Site Consensus Sequence,”EMBO J. 25:1957-1966(2006);Fisher等,“Production of Secretory and Extracellular N-LinkedGlycoproteins in Escherichia coli,”Appl.Environ.Microbiol.77:871-881(2011),其通过引用整体并入本申请),但是应指出的是对反应条件的进一步研究能提高生产率和糖基化效率。其次,其促进了多种复合的代谢系统和途径在体外的整合/共活化,包括转录、翻译、蛋白折叠和糖基化。因此,glycoCFE和glycoPURE系统为在降低系统复杂性和除去结构性障碍的条件下对这些重要机制相互作用的研究提供了独特的机会。例如,由于细菌OST能够将折叠蛋白(Kowarik等,“N-Linked Glycosylation of Folded Proteins by theBacterial Oligosaccharyltransferase,”Science 314:1148-1150(2006),其通过引用整体并入本申请)和一些蛋白构造性结构域的局部柔性结构糖基化,因而这些系统有助于解释蛋白结构对糖基化效率的影响。而且,因为细菌和真核生物糖基化机制显示出显著的相似性,所以这些细菌系统能够为理解更复杂的真核生物过程提供简化的模型框架。再次,其允许通过在糖基化途径中重建附加的或替代性步骤(天然和非天然的)对所述系统进行进一步定制。例如,已在体外重建了在pg1途径中糖基转移酶的依次活化(Glover等,“InVitro Assembly of the Undecaprenylpyrophosphate-Linked Heptasaccharide forProkaryotic N-Linked Glycosylation,”Proc.Nat’l.Acad.Sci.U.S.A.102:14255-14259(2005),其通过引用整体并入本申请)以及能够容易地将翻译/糖基化反映整合至单一的整合平台。在糖基工程化的大肠杆菌具有提供大量的UndPP-连接聚糖的潜能的同时(Feldman等,“Engineering N-Linked Protein Glycosylation With Diverse O AntigenLipopolysaccharide Structures in Escherichia coli,”Proc.Nat’l.Acad.Sci.U.S.A.102:3016-3021(2005); Yavuz等,“Glycomimicry:Display ofFucosylation on the Lipo-Oligosaccharide of Recombinant Escherichia coliK12,”Glycoconj.J.28:39-47(2011),其通过引用整体并入本申请),可以通过添加特异性的糖基转移酶和必需的活化的糖实现将其能力延伸至细菌聚糖以外。这种方法能够用于制备原核生物聚糖模拟物(Schwarz等,“A Combined Method for Producing HomogeneousGlycoproteins With Eukaryotic N-Glycosylation,”Nat. Chem.Biol.6:264-266(2010),其通过引用整体并入本申请)并且能够更精细地控制能够用于在体外修饰靶蛋白的糖形式的多样性。由于CjPglB对聚糖结构的特异性较低 (Feldman等,“Engineering N-Linked Protein Glycosylation With Diverse O Antigen LipopolysaccharideStructures in Escherichia coli,”Proc.Nat’l.Acad.Sci.U.S.A. 102:3016-3021(2005),其通过引用整体并入本申请),所以所有这些UndPP-连接聚糖均可能是适宜的底物。就算CjPglB不足以确证,本申请中对两种不同的OST能够互换使用的证实表明实质上包括那些来自其他细菌、古生菌和甚至是一些真核生物 (Nasab等,“All in One:LeishmaniaMajor STT3 Proteins Substitute for the Whole OligosaccharyltransferaseComplex in Saccharomyces cerevisiae,”Mol.Biol.Cell 19:3758-3768(2008),其通过引用整体并入本申请)的任意单个亚基OST均能够用于这些系统中。在这一概念的支持下,可以在大肠杆菌膜上功能性表达硕大利什曼原虫 (Leishmania major)和激烈火球菌(Pyrococcus furiosus)的单个亚基OST(Igura& Kohda,“Selective Control ofOligosaccharide Transfer Efficiency for the N-Glycosylation Sequon by a PointMutation in Oligosaccharyltransferase,”J.Biol.Chem.286:13255-13260 (2011),其通过引用整体并入本申请)。最后,因为其不仅限于天然聚糖,glycoCFE 和glycoPURE系统允许杂交的天然/非天然的或者甚至是完全人工的聚糖的合成。例如,加入合成的糖-核苷酸供体底物和/或突变的糖基转移酶以及具有新的特异性的 OST将能够构建建立在不规范的聚糖编码上的糖基化系统。基于所有这些原因, glycoCFE和glycoPURE系统为无细胞翻译和糖生物学工具包提供了有益的补充。
尽管在本申请中已对优选的实施方式进行了详细的图示和描述,但是在不脱离本发明主旨的前提下可以进行多种修饰、增加、取代等,这对相关领域的技术人员是显而易见的,因此认为这些在权利要求所定义的本发明的范围内。
序列表
<110>康奈尔大学
<120> 一种用于糖蛋白合成的基于原核生物的无细胞系统
<130> 29543.7021
<150> US 61/555,854
<151> 2011-11-04
<160> 35
<170> PatentIn 3.5版
<210> 1
<211> 5
<212> PRT
<213> 人工的
<220>
<223> 细菌糖基化基序
<220>
<221> MISC_FEATURE
<222> (1)..(1)
<223> 在位置1的X是D或E
<220>
<221> MISC_FEATURE
<222> (2)..(2)
<223> 在位置2的X是除了脯氨酸以外的任意氨基酸
<220>
<221> MISC_FEATURE
<222> (4)..(4)
<223> 在位置4的X是除了脯氨酸以外的任意氨基酸
<220>
<221> MISC_FEATURE
<222> (5)..(5)
<223> 在位置5的X是S或T
<400> 1
Xaa Xaa Asn Xaa Xaa
1 5
<210> 2
<211> 664
<212> PRT
<213> 空肠弯曲杆菌
<400> 2
Ile Ile Ser Asn Asp Gly Tyr Ala Phe Ala Glu Gly Ala Arg Asp Met
1 5 10 15
Ile Ala Gly Phe His Gln Pro Asn Asp Leu Ser Tyr Tyr Gly Ser Ser
20 25 30
Leu Ser Thr Leu Thr Tyr Trp Leu Tyr Lys Ile Thr Pro Phe Ser Phe
35 40 45
Glu Ser Ile Ile Leu Tyr Met Ser Thr Phe Leu Ser Ser Leu Val Val
50 55 60
Ile Pro Ile Ile Leu Leu Ala Asn Glu Tyr Lys Arg Pro Leu Met Gly
65 70 75 80
Phe Val Ala Ala Leu Leu Ala Ser Ile Ala Asn Ser Tyr Tyr Asn Arg
85 90 95
Thr Met Ser Gly Tyr Tyr Asp Thr Asp Met Leu Val Ile Val Leu Pro
100 105 110
Met Phe Ile Leu Phe Phe Met Val Arg Met Ile Leu Lys Lys Asp Phe
115 120 125
Phe Ser Leu Ile Ala Leu Pro Leu Phe Ile Gly Ile Tyr Leu Trp Trp
130 135 140
Tyr Pro Ser Ser Tyr Thr Leu Asn Val Ala Leu Ile Gly Leu Phe Leu
145 150 155 160
Ile Tyr Thr Leu Ile Phe His Arg Lys Glu Lys Ile Phe Tyr Ile Ala
165 170 175
Val Ile Leu Ser Ser Leu Thr Leu Ser Asn Ile Ala Trp Phe Tyr Gln
180 185 190
Ser Thr Ile Ile Val Ile Leu Phe Ala Leu Phe Ala Leu Glu Gln Lys
195 200 205
Arg Leu Asn Phe Val Ile Ile Gly Ile Leu Ala Ser Val Thr Leu Ile
210 215 220
Phe Leu Ile Leu Ser Gly Gly Val Asp Pro Ile Leu Tyr Gln Leu Lys
225 230 235 240
Phe Tyr Ile Phe Arg Ser Asp Glu Ser Ala Asn Leu Thr Gln Gly Phe
245 250 255
Met Tyr Phe Asn Val Asn Gln Thr Ile Gln Glu Val Glu Asn Val Asp
260 265 270
Leu Ser Glu Phe Met Arg Arg Ile Ser Gly Ser Glu Ile Val Phe Leu
275 280 285
Phe Ser Leu Phe Gly Phe Val Trp Leu Leu Arg Lys His Lys Ser Met
290 295 300
Ile Met Ala Leu Pro Ile Leu Val Leu Gly Phe Leu Ala Leu Lys Gly
305 310 315 320
Gly Leu Arg Phe Thr Ile Tyr Ser Val Pro Val Met Ala Leu Gly Phe
325 330 335
Gly Phe Leu Leu Ser Glu Phe Lys Ala Ile Leu Val Lys Lys Tyr Ser
340 345 350
Gln Leu Thr Ser Asn Val Cys Ile Val Phe Ala Thr Ile Leu Thr Leu
355 360 365
Ala Pro Val Phe Ile His Ile Tyr Asn Tyr Lys Ala Pro Thr Val Phe
370 375 380
Ser Gln Asn Glu Ala Ser Leu Leu Asn Gln Leu Lys Asn Ile Ala Asn
385 390 395 400
Arg Glu Asp Tyr Val Val Thr Trp Trp Asp Tyr Gly Tyr Pro Val Arg
405 410 415
Tyr Tyr Ser Asp Val Lys Thr Leu Val Asp Gly Gly Lys His Leu Gly
420 425 430
Lys Asp Asn Phe Phe Pro Ser Phe Ala Leu Ser Lys Asp Glu Gln Ala
435 440 445
Ala Ala Asn Met Ala Arg Leu Ser Val Glu Tyr Thr Glu Lys Ser Phe
450 455 460
Tyr Ala Pro Gln Asn Asp Ile Leu Lys Thr Asp Ile Leu Gln Ala Met
465 470 475 480
Met Lys Asp Tyr Asn Gln Ser Asn Val Asp Leu Phe Leu Ala Ser Leu
485 490 495
Ser Lys Pro Asp Phe Lys Ile Asp Thr Pro Lys Thr Arg Asp Ile Tyr
500 505 510
Leu Tyr Met Pro Ala Arg Met Ser Leu Ile Phe Ser Thr Val Ala Ser
515 520 525
Phe Ser Phe Ile Asn Leu Asp Thr Gly Val Leu Asp Lys Pro Phe Thr
530 535 540
Phe Ser Thr Ala Tyr Pro Leu Asp Val Lys Asn Gly Glu Ile Tyr Leu
545 550 555 560
Ser Asn Gly Val Val Leu Ser Asp Asp Phe Arg Ser Phe Lys Ile Gly
565 570 575
Asp Asn Val Val Ser Val Asn Ser Ile Val Glu Ile Asn Ser Ile Lys
580 585 590
Gln Gly Glu Tyr Lys Ile Thr Pro Ile Asp Asp Lys Ala Gln Phe Tyr
595 600 605
Ile Phe Tyr Leu Lys Asp Ser Ala Ile Pro Tyr Ala Gln Phe Ile Leu
610 615 620
Met Asp Lys Thr Met Phe Asn Ser Ala Tyr Val Gln Met Phe Phe Leu
625 630 635 640
Gly Asn Tyr Asp Lys Asn Leu Phe Asp Leu Val Ile Asn Ser Arg Asp
645 650 655
Ala Lys Val Phe Lys Leu Lys Ile
660
<210> 3
<211> 1995
<212> DNA
<213> 空肠弯曲杆菌
<400> 3
atcatttcaa acgatggtta tgcttttgct gagggtgcaa gagatatgat agcaggtttt 60
catcagccta atgatttgag ttattatgga tcttctttat ctacgcttac ttattggctt 120
tataaaatca cacctttttc tttcgaaagt attattttat atatgagtac ttttttatct 180
tctttggtgg tgattcctat tattttacta gctaatgaat acaaacgtcc tttaatgggc 240
tttgtagctg ctcttttagc aagtatagca aacagttatt ataatcgcac tatgagtggg 300
tattatgata cggatatgct ggtaattgtt ttacctatgt ttattttatt ttttatggta 360
agaatgattt taaaaaaaga ctttttttca ttgattgcct taccgttatt tataggaatt 420
tatctttggt ggtatccttc aagctatact ttaaatgtag ctttaattgg acttttttta 480
atttatacac ttatttttca tagaaaagaa aagatttttt atatagctgt gattttgtct 540
tctcttactc tttcaaatat agcatggttt tatcaaagta ctattatagt aatacttttt 600
gctttatttg ctttagagca aaaacgctta aattttgtaa ttataggaat tttagctagt 660
gtaactttga tatttttgat tttaagtgga ggggttgatc ctatacttta tcagcttaaa 720
ttttatattt ttagaagtga tgaaagtgcg aatttaacgc agggttttat gtattttaat 780
gtcaatcaaa ccatacaaga agttgaaaat gtagatctta gcgaatttat gcgaagaatt 840
agtggtagtg aaattgtttt tttgttttct ttgtttggtt ttgtatggct tttgagaaaa 900
cataaaagta tgattatggc tttacctata ttggtgcttg ggtttttagc cttaaaaggg 960
gggcttagat ttaccattta ttctgtacct gtaatggcct taggatttgg ttttttattg 1020
agcgagttta aggctatatt ggttaaaaaa tatagccaat taacttcaaa tgtttgtatt 1080
gtttttgcaa ctattttgac tttagctcca gtatttatcc atatttacaa ctataaagca 1140
ccaacagttt tttctcaaaa tgaagcatca ttattaaatc aattaaaaaa tatagccaat 1200
agagaagatt atgtggtaac ttggtgggat tatggttatc ctgtgcgtta ttatagtgat 1260
gtgaaaactt tagtagatgg tggaaagcat ttaggtaagg ataatttttt cccttctttt 1320
gctttaagca aagatgaaca agctgcagct aatatggcaa gacttagtgt agaatataca 1380
gaaaaaagct tttatgctcc gcaaaatgat attttaaaaa cagacatttt acaagccatg 1440
atgaaagatt ataatcaaag caatgtggat ttgtttctag cttcattatc aaaacctgat 1500
tttaaaatcg atacaccaaa aactcgtgat atttatcttt atatgcccgc tagaatgtct 1560
ttgatttttt ctacggtggc tagtttttct tttattaatt tagatacagg agttttggat 1620
aaacctttta cctttagcac agcttatcca cttgatgtta aaaatggaga aatttatctt 1680
agcaacggag tggttttaag cgatgatttt agaagtttta aaataggtga taatgtggtt 1740
tctgtaaata gtatcgtaga gattaattct attaaacaag gtgaatacaa aatcactcca 1800
attgatgata aggctcagtt ttatattttt tatttaaagg atagtgctat tccttacgca 1860
caatttattt taatggataa aaccatgttt aatagtgctt atgtgcaaat gtttttttta 1920
ggaaattatg ataagaattt atttgacttg gtgattaatt ctagagatgc taaggttttt 1980
aaacttaaaa tttaa 1995
<210> 4
<211> 711
<212> PRT
<213> 红嘴鸥弯曲杆菌
<400> 4
Met Lys Leu Gln Gln Asn Phe Thr Asp Asn Asn Ser Ile Lys Tyr Thr
1 5 10 15
Cys Ile Leu Ile Leu Ile Ala Phe Ala Phe Ser Val Leu Cys Arg Leu
20 25 30
Tyr Trp Val Ala Trp Ala Ser Glu Phe Tyr Glu Phe Phe Phe Asn Asp
35 40 45
Gln Leu Met Ile Thr Thr Asn Asp Gly Tyr Ala Phe Ala Glu Gly Ala
50 55 60
Arg Asp Met Ile Ala Gly Phe His Gln Pro Asn Asp Leu Ser Tyr Phe
65 70 75 80
Gly Ser Ser Leu Ser Thr Leu Thr Tyr Trp Leu Tyr Ser Ile Leu Pro
85 90 95
Phe Ser Phe Glu Ser Ile Ile Leu Tyr Met Ser Ala Phe Phe Ala Ser
100 105 110
Leu Ile Val Val Pro Ile Ile Leu Ile Ala Arg Glu Tyr Lys Leu Thr
115 120 125
Thr Tyr Gly Phe Ile Ala Ala Leu Leu Gly Ser Ile Ala Asn Ser Tyr
130 135 140
Tyr Asn Arg Thr Met Ser Gly Tyr Tyr Asp Thr Asp Met Leu Val Leu
145 150 155 160
Val Leu Pro Met Leu Ile Leu Leu Thr Phe Ile Arg Leu Thr Ile Asn
165 170 175
Lys Asp Ile Phe Thr Leu Leu Leu Ser Pro Val Phe Ile Met Ile Tyr
180 185 190
Leu Trp Trp Tyr Pro Ser Ser Tyr Ser Leu Asn Phe Ala Met Ile Gly
195 200 205
Leu Phe Gly Leu Tyr Thr Leu Val Phe His Arg Lys Glu Lys Ile Phe
210 215 220
Tyr Leu Thr Ile Ala Leu Met Ile Ile Ala Leu Ser Met Leu Ala Trp
225 230 235 240
Gln Tyr Lys Leu Ala Leu Ile Val Leu Leu Phe Ala Ile Phe Ala Phe
245 250 255
Lys Glu Glu Lys Ile Asn Phe Tyr Met Ile Trp Ala Leu Ile Phe Ile
260 265 270
Ser Ile Leu Ile Leu His Leu Ser Gly Gly Leu Asp Pro Val Leu Tyr
275 280 285
Gln Leu Lys Phe Tyr Val Phe Lys Ala Ser Asp Val Gln Asn Leu Lys
290 295 300
Asp Ala Ala Phe Met Tyr Phe Asn Val Asn Glu Thr Ile Met Glu Val
305 310 315 320
Asn Thr Ile Asp Pro Glu Val Phe Met Gln Arg Ile Ser Ser Ser Val
325 330 335
Leu Val Phe Ile Leu Ser Phe Ile Gly Phe Ile Leu Leu Cys Lys Asp
340 345 350
His Lys Ser Met Leu Leu Ala Leu Pro Met Leu Ala Leu Gly Phe Met
355 360 365
Ala Leu Arg Ala Gly Leu Arg Phe Thr Ile Tyr Ala Val Pro Val Met
370 375 380
Ala Leu Gly Phe Gly Tyr Phe Leu Tyr Ala Phe Phe Asn Phe Leu Glu
385 390 395 400
Lys Lys Gln Ile Lys Leu Ser Leu Arg Asn Lys Asn Ile Leu Leu Ile
405 410 415
Leu Ile Ala Phe Phe Ser Ile Ser Pro Ala Leu Met His Ile Tyr Tyr
420 425 430
Tyr Lys Ser Ser Thr Val Phe Thr Ser Tyr Glu Ala Ser Ile Leu Asn
435 440 445
Asp Leu Lys Asn Lys Ala Gln Arg Glu Asp Tyr Val Val Ala Trp Trp
450 455 460
Asp Tyr Gly Tyr Pro Ile Arg Tyr Tyr Ser Asp Val Lys Thr Leu Ile
465 470 475 480
Asp Gly Gly Lys His Leu Gly Lys Asp Asn Phe Phe Ser Ser Phe Val
485 490 495
Leu Ser Lys Glu Gln Ile Pro Ala Ala Asn Met Ala Arg Leu Ser Val
500 505 510
Glu Tyr Thr Glu Lys Ser Phe Lys Glu Asn Tyr Pro Asp Val Leu Lys
515 520 525
Ala Met Val Lys Asp Tyr Asn Lys Thr Ser Ala Lys Asp Phe Leu Glu
530 535 540
Ser Leu Asn Asp Lys Asp Phe Lys Phe Asp Thr Asn Lys Thr Arg Asp
545 550 555 560
Val Tyr Ile Tyr Met Pro Tyr Arg Met Leu Arg Ile Met Pro Val Val
565 570 575
Ala Gln Phe Ala Asn Thr Asn Pro Asp Asn Gly Glu Gln Glu Lys Ser
580 585 590
Leu Phe Phe Ser Gln Ala Asn Ala Ile Ala Gln Asp Lys Thr Thr Gly
595 600 605
Ser Val Met Leu Asp Asn Gly Val Glu Ile Ile Asn Asp Phe Arg Ala
610 615 620
Leu Lys Val Glu Gly Ala Ser Ile Pro Leu Lys Ala Phe Val Asp Ile
625 630 635 640
Glu Ser Ile Thr Asn Gly Lys Phe Tyr Tyr Asn Glu Ile Asp Ser Lys
645 650 655
Ala Gln Ile Tyr Leu Leu Phe Leu Arg Glu Tyr Lys Ser Phe Val Ile
660 665 670
Leu Asp Glu Ser Leu Tyr Asn Ser Ser Tyr Ile Gln Met Phe Leu Leu
675 680 685
Asn Gln Tyr Asp Gln Asp Leu Phe Glu Gln Ile Thr Asn Asp Thr Arg
690 695 700
Ala Lys Ile Tyr Arg Leu Lys
705 710
<210> 5
<211> 2139
<212> DNA
<213> 红嘴鸥弯曲杆菌
<400> 5
atgaaactac aacaaaattt cacggataat aattctataa aatatacctg tattttaatc 60
cttatagcct ttgcttttag tgttttgtgt agattatact gggtagcttg ggcaagtgag 120
ttttatgagt ttttctttaa tgatcaactc atgattacta ctaatgatgg ctatgctttt 180
gcagaaggtg caagagatat gatagcaggt tttcatcaac ctaatgactt atcttatttt 240
ggaagctcac tttctacttt gacttattgg ctttatagta ttttgccttt tagctttgaa 300
agtattattt tatatatgag tgcttttttt gcttctttga ttgttgtgcc tattatatta 360
atcgcaagag agtataaact cactacctat ggctttatag cagctttact tggaagcatt 420
gcaaatagtt attataaccg cactatgagt gggtattacg atacagatat gctagtgtta 480
gttttaccaa tgcttatttt gcttaccttt atacgcttaa ctattaataa agacattttc 540
accctacttt taagtccggt ttttatcatg atttatttgt ggtggtatcc atcaagttat 600
tctttaaatt ttgctatgat aggacttttt ggactttata ctttagtatt tcatagaaaa 660
gaaaagattt tttatctaac tattgctttg atgatcatag ctttaagtat gctagcatgg 720
caatataagc ttgctttgat tgtattatta tttgctattt ttgcttttaa agaagaaaaa 780
atcaattttt atatgatttg ggctttgatt tttattagca ttttgatatt gcatttaagt 840
ggcggcttag atcctgtttt ataccaactt aaattttatg tatttaaagc ttctgatgtg 900
caaaatttaa aagatgctgc ctttatgtat tttaatgtca atgaaaccat tatggaagta 960
aatactatcg atcctgaagt atttatgcaa agaattagct ctagtgtttt agtatttatc 1020
ctttctttta taggttttat cttactttgc aaagatcaca aaagcatgct tttggctcta 1080
cctatgcttg cactaggttt tatggcttta agagctggac ttagatttac catttatgca 1140
gttcctgtga tggctttggg ttttgggtat tttttatatg cattttttaa ttttttagaa 1200
aaaaaacaaa tcaaacttag cctaagaaat aaaaatatct tacttatact cattgcattt 1260
tttagtataa gccctgcttt gatgcatatt tattattata aatcctctac tgtttttact 1320
tcttatgaag ctagtatttt aaatgattta aaaaataaag ctcaaagaga agattatgtt 1380
gttgcttggt gggattatgg ttatccaata cgctattata gcgatgtaaa aaccttaatc 1440
gatggtggaa aacacctagg aaaagataat tttttctcat cttttgtctt aagcaaagaa 1500
caaattccag cagccaatat ggcaagactt agcgtagaat acactgaaaa atctttcaaa 1560
gaaaactatc ctgatgtttt aaaagctatg gttaaagatt ataataaaac aagtgctaaa 1620
gattttttag aaagtttaaa tgataaagat tttaaatttg ataccaataa aactagagat 1680
gtatacattt atatgcctta tagaatgttg cgtatcatgc ctgtggtggc acaatttgca 1740
aatacaaatc ctgataatgg agagcaagaa aaaagtttat ttttctccca agctaatgcc 1800
atagctcaag ataaaaccac aggttctgtt atgcttgata atggagtaga aattattaat 1860
gattttagag ccttaaaagt agaaggtgca agcatacctt taaaagcttt tgtggatata 1920
gaatccatta ctaatggcaa attttattac aatgaaattg attcaaaagc tcaaatttat 1980
ttgctctttt taagagaata taaaagcttt gtgattttag atgaaagtct ttataatagt 2040
tcttatatac aaatgttttt gttaaatcaa tacgatcaag atttatttga acaaattact 2100
aatgatacaa gagcaaaaat ttataggcta aaaagatga 2139
<210> 6
<211> 714
<212> PRT
<213> 结肠弯曲杆菌
<400> 6
Met Leu Lys Lys Glu Tyr Phe Lys Asn Pro Thr Phe Ile Leu Leu Ala
1 5 10 15
Phe Ile Ile Leu Ala Tyr Val Phe Ser Val Leu Cys Arg Phe Tyr Trp
20 25 30
Ile Phe Trp Ala Ser Glu Phe Asn Glu Tyr Phe Phe Asn Asn Glu Leu
35 40 45
Met Ile Ile Ser Asn Asp Gly Tyr Ala Phe Ala Glu Gly Ala Arg Asp
50 55 60
Met Ile Ala Gly Phe His Gln Pro Asn Asp Leu Ser Tyr Tyr Gly Ser
65 70 75 80
Ser Leu Ser Thr Leu Thr Tyr Trp Phe Tyr Lys Ile Thr Pro Phe Ser
85 90 95
Leu Glu Ser Ile Phe Ile Tyr Ile Ser Thr Phe Leu Ser Ser Leu Val
100 105 110
Val Ile Pro Leu Ile Leu Ile Ala Asn Glu Tyr Lys Arg Pro Leu Met
115 120 125
Gly Phe Val Ala Ala Leu Leu Ala Ser Ile Ala Asn Ser Tyr Tyr Asn
130 135 140
Arg Thr Met Ser Gly Tyr Tyr Asp Thr Asp Met Leu Val Ile Val Leu
145 150 155 160
Ala Met Met Ile Val Phe Phe Met Ile Arg Leu Ile Leu Lys Lys Asp
165 170 175
Leu Leu Ser Leu Ile Thr Leu Pro Leu Phe Val Gly Ile Tyr Leu Trp
180 185 190
Trp Tyr Pro Ser Ser Tyr Thr Leu Asn Val Ala Leu Leu Gly Leu Phe
195 200 205
Phe Ile Tyr Thr Leu Val Phe His Ile Lys Glu Lys Thr Leu Tyr Met
210 215 220
Ala Ile Ile Leu Ala Ser Ile Thr Leu Ser Asn Ile Ala Trp Phe Tyr
225 230 235 240
Gln Ser Ala Ile Ile Val Ile Leu Phe Ser Leu Phe Val Leu Gln Asn
245 250 255
Lys Arg Phe Ser Phe Ala Leu Leu Gly Ile Leu Gly Leu Ala Thr Leu
260 265 270
Val Phe Leu Ile Leu Ser Gly Gly Ile Asp Pro Ile Leu Tyr Gln Leu
275 280 285
Lys Phe Tyr Ile Phe Arg Ser Asp Glu Ser Ala Asn Leu Ala Gln Gly
290 295 300
Phe Met Tyr Phe Asn Val Asn Gln Thr Ile Gln Glu Val Glu Ser Ile
305 310 315 320
Asp Leu Ser Ile Phe Met Gln Arg Ile Ser Gly Ser Glu Leu Val Phe
325 330 335
Phe Val Ser Leu Ile Gly Phe Ile Phe Leu Val Arg Lys His Lys Ser
340 345 350
Met Ile Leu Ala Leu Pro Met Leu Ala Leu Gly Phe Leu Ala Leu Lys
355 360 365
Ser Gly Leu Arg Phe Thr Ile Tyr Ala Val Pro Val Leu Ala Leu Gly
370 375 380
Phe Gly Phe Leu Met Ser Leu Leu Gln Glu Arg Lys Gln Lys Asn Asn
385 390 395 400
Asn Thr Tyr Trp Trp Ala Asn Ile Gly Val Phe Ile Phe Thr Phe Leu
405 410 415
Ser Leu Ile Pro Met Phe Tyr His Ile Asn Asn Tyr Lys Ala Pro Thr
420 425 430
Val Phe Ser Gln Asn Glu Ala Thr Lys Leu Asp Glu Leu Lys Lys Ile
435 440 445
Ala Gln Arg Glu Asp Tyr Val Val Thr Trp Trp Asp Tyr Gly Tyr Pro
450 455 460
Ile Arg Tyr Tyr Ser Asp Val Lys Thr Leu Ala Asp Gly Gly Lys His
465 470 475 480
Leu Gly Lys Asp Asn Phe Phe Pro Ser Phe Val Leu Ser Lys Asp Gln
485 490 495
Val Ala Ala Ala Asn Met Ala Arg Leu Ser Val Glu Tyr Thr Glu Lys
500 505 510
Ser Phe Tyr Ala Pro Leu Asn Asp Ile Leu Lys Asn Asp Leu Leu Gln
515 520 525
Ala Met Met Lys Asp Tyr Asn Gln Asn Asn Val Asp Leu Phe Leu Ala
530 535 540
Ser Leu Ser Lys Pro Asp Phe Lys Ile Asn Thr Pro Lys Thr Arg Asp
545 550 555 560
Val Tyr Ile Tyr Met Pro Ala Arg Met Ser Leu Ile Phe Ser Thr Val
565 570 575
Ala Ser Phe Ser Phe Val Asp Leu Glu Thr Gly Glu Ile Asn Lys Pro
580 585 590
Phe Thr Phe Ser Ala Ala Tyr Pro Leu Asp Val Lys Asn Gly Glu Ile
595 600 605
Tyr Leu Ser Asn Gly Ile Ala Leu Ser Asp Asp Phe Arg Ser Phe Lys
610 615 620
Ile Asn Asn Ser Thr Ile Ser Val Asn Ser Ile Ile Glu Ile Asn Ser
625 630 635 640
Ile Lys Gln Gly Glu Tyr Lys Ile Thr Pro Ile Asp Asp Met Ala Gln
645 650 655
Phe Tyr Ile Phe Tyr Leu Lys Asp Ser Thr Ile Pro Tyr Ala Gln Phe
660 665 670
Ile Leu Met Asp Lys Thr Met Phe Asn Ser Ala Tyr Val Gln Met Phe
675 680 685
Phe Leu Gly Asn Tyr Asp Lys Asn Leu Tyr Asp Leu Val Ile Asn Ala
690 695 700
Arg Asp Ala Lys Val Phe Lys Leu Lys Ile
705 710
<210> 7
<211> 2145
<212> DNA
<213> 结肠弯曲杆菌
<400> 7
atgttaaaaa aagaatactt taaaaaccca acttttattt tattggcttt tataatttta 60
gcgtatgtct ttagtgtttt atgtaggttt tattggattt tttgggcaag tgagtttaat 120
gaatattttt tcaataacga gcttatgatt atctcaaatg atggatatgc ttttgcagag 180
ggtgcaagag atatgatagc gggttttcat caacctaatg atttgagtta ttatggttct 240
tcgctttcaa cgctcacata ttggttttat aaaataactc ctttttcttt agaaagcatt 300
tttatatata tcagtacttt tttatcttct ttggtggtta tacctttgat tttgattgct 360
aatgaataca aacgcccttt aatggggttt gttgcagcat tgctagccag tatagctaat 420
agctattata atcgcacgat gagcggatat tatgatactg atatgcttgt tatagttctt 480
gcaatgatga tagttttctt tatgataagg ctgattttga aaaaagattt attatcttta 540
ataacactgc ctttgtttgt aggaatttat ctttggtggt atccatcaag ctatacttta 600
aatgttgctt tactaggact tttctttatt tataccttgg tttttcatat aaaagaaaaa 660
acgctttata tggctattat cctagcttct atcacacttt caaatatagc ttggttttat 720
caaagcgcca tcattgtcat actttttagt ctttttgttt tgcaaaataa gcgttttagc 780
tttgctttgc ttggaatttt aggtttggca actttggtat ttttgatact aagcggtgga 840
attgatccta tactctatca acttaaattt tatattttta gaagtgatga gagtgcaaat 900
ttggctcaag gttttatgta ttttaatgta aatcaaacca tacaagaggt agaaagtata 960
gatttaagta tttttatgca aaggattagc ggaagcgagc ttgtattttt tgtatcttta 1020
atcggcttta ttttccttgt tagaaaacat aaaagtatga ttttggcttt gccgatgtta 1080
gctttaggat ttttagcact taagagtgga cttcgtttta ctatttatgc agtacctgtt 1140
ttagcacttg gatttggttt tttaatgagt cttttgcaag aaagaaaaca aaaaaacaat 1200
aatacctatt ggtgggccaa tataggcgtt tttattttta cttttttaag tttaattcct 1260
atgttctatc atatcaacaa ttataaagca ccaactgttt tttctcaaaa tgaggctacg 1320
aaattagatg agcttaaaaa aattgcacaa agagaagatt atgtagtaac ttggtgggat 1380
tatggatatc ctattaggta ttacagcgat gttaaaactt tggctgatgg gggtaagcat 1440
ttaggcaagg ataatttttt cccatctttt gttctaagta aagatcaagt ggctgctgca 1500
aatatggcaa gacttagtgt agaatacaca gaaaaaagtt tttacgcccc tttaaatgat 1560
attttaaaaa atgatctttt acaagccatg atgaaagatt ataatcaaaa taatgtggat 1620
ttgtttttag cttcgctttc caagcctgat tttaaaatca atacgccaaa aacacgcgat 1680
gtgtatatct atatgccagc tagaatgtct ttgatttttt caactgtggc tagtttttct 1740
tttgtggatt tggagacagg tgagataaat aaacctttta cttttagtgc agcttatcca 1800
cttgatgtta aaaatggaga aatttatctt agcaatggta ttgcattaag tgatgatttt 1860
agaagtttta aaataaataa tagtactata tccgtaaata gtatcataga gattaattct 1920
atcaaacaag gtgaatataa aatcactcct attgatgata tggctcaatt ttatattttt 1980
tatcttaaag atagcaccat accttatgct cagtttattt taatggataa aactatgttt 2040
aatagtgctt atgtgcaaat gtttttcctt ggaaattatg ataaaaattt gtatgattta 2100
gtgattaatg ctagagatgc aaaagttttt aaactcaaaa tttaa 2145
<210> 8
<211> 785
<212> PRT
<213> 乌普萨拉弯曲杆菌
<400> 8
Met Lys Asn Glu Ala Val Lys Asn Ala Asn Leu Arg Leu Val Phe Phe
1 5 10 15
Ile Leu Leu Ala Phe Gly Phe Ser Val Leu Cys Arg Phe Tyr Trp Ile
20 25 30
Tyr Trp Ala Ser Asp Phe Asn Glu Tyr Phe Phe Asn Asn Gln Leu Met
35 40 45
Ile Ser Ser Asn Asp Gly Tyr Thr Phe Ala Glu Gly Ala Arg Asp Lys
50 55 60
Ile Ala Gly Phe His Gln Glu Asn Asp Leu Ser Phe Ile Asn Ser Ser
65 70 75 80
Leu Ser Ile Leu Thr Tyr Val Leu Tyr Lys Ile Thr Pro Phe Ser Phe
85 90 95
Glu Ser Ile Ile Leu Tyr Met Ser Val Phe Phe Ser Ser Leu Ile Val
100 105 110
Val Pro Leu Ile Leu Ile Ala Asn Glu Leu Lys Arg Pro Leu Met Gly
115 120 125
Leu Phe Ala Ala Phe Leu Ala Ser Ile Ala Lys Ser Tyr Tyr Asn Arg
130 135 140
Thr Met Ala Gly Tyr Tyr Asp Thr Asp Met Leu Ala Ile Val Leu Pro
145 150 155 160
Met Phe Ile Leu Tyr Phe Phe Ile Arg Leu Ile Leu Arg Lys Asp Asp
165 170 175
Phe Ser Leu Leu Ala Leu Pro Phe Phe Met Gly Leu Tyr Leu Trp Trp
180 185 190
Tyr Pro Ser Ser Tyr Thr Leu Asn Val Ala Phe Ile Ala Leu Phe Thr
195 200 205
Leu Tyr Val Leu Ile Tyr His Arg Lys Glu Arg Ser Phe Tyr Met Ala
210 215 220
Ala Leu Leu Cys Ala Ile Thr Leu Ser Asn Ile Ala Trp Phe Tyr Gln
225 230 235 240
Ser Ala Ile Ile Val Leu Leu Phe Ala Leu Phe Met Leu Lys Asn Ser
245 250 255
Phe Phe Asn Phe Lys Phe Ile Ala Leu Leu Ala Leu Gly Val Leu Val
260 265 270
Phe Leu Ala Leu Ser Gly Gly Ile Asp Pro Ile Leu Tyr Gln Leu Lys
275 280 285
Phe Tyr Leu Leu Arg Ser Asp Glu Ser Ala Ser Leu Ala Arg Gly Phe
290 295 300
Ala Tyr Phe Asn Val Asn Leu Thr Ile Gln Glu Val Glu Ser Ile Asp
305 310 315 320
Leu Ser Thr Phe Met Gln Arg Ile Ser Gly Ser Glu Leu Val Phe Leu
325 330 335
Leu Ser Leu Phe Gly Phe Leu Trp Leu Leu Lys Lys His Lys Val Met
340 345 350
Leu Leu Thr Leu Pro Met Leu Leu Leu Gly Phe Leu Ala Leu Arg Gly
355 360 365
Gly Leu Arg Phe Thr Ile Tyr Ala Val Pro Ile Met Ala Leu Gly Phe
370 375 380
Gly Phe Leu Ser Val Gln Ile Leu Ser Leu Ile Gln Lys Met Arg Pro
385 390 395 400
Leu Lys Glu Thr Arg Lys Leu Arg Ile Phe Phe Tyr Gly Ile Phe Pro
405 410 415
Leu Phe Val Leu Val Leu Gly Ala Tyr Phe Tyr Phe Ser Gln Ser Ala
420 425 430
Ile Tyr Glu Ser Met Gly Val Glu Phe Gln Lys Asn Phe Val Ser Phe
435 440 445
Phe Val Glu Asp Thr Leu Leu Phe Ser Leu Leu Ile Leu Ala Ile Phe
450 455 460
Thr Pro Leu Ile Phe Glu Leu Leu Trp Arg Lys Lys Asp Ile Arg Phe
465 470 475 480
Val Cys Ser Phe Tyr Ile Val Gly Val Leu Leu Phe Ser Leu Trp Ala
485 490 495
Asn Leu Ser His Ile Tyr Asn Tyr Arg Ala His Thr Val Phe Ser Tyr
500 505 510
Asn Glu Ala Ser Ile Leu Asp Asn Leu Lys Ala Asn Val Ser Arg Glu
515 520 525
Asp Tyr Ile Val Ala Trp Trp Asp Tyr Gly Tyr Pro Ile Arg Tyr Tyr
530 535 540
Ser Asp Val Lys Thr Leu Ala Asp Gly Gly Lys His Leu Gly Lys Asp
545 550 555 560
Asn Phe Phe Pro Ser Phe Val Leu Ser Gln Asn Pro Arg Ala Ala Ala
565 570 575
Asn Met Ala Arg Leu Ser Val Glu Tyr Thr Glu Lys Gly Phe Lys Thr
580 585 590
Pro Tyr Asn Asp Leu Leu Glu Ala Met Met Lys Asp Tyr Asn Tyr Ser
595 600 605
Asn Val Asn Leu Phe Leu Ala Ala Leu Ser Lys Glu Asp Phe Thr Leu
610 615 620
Gln Thr Pro Lys Thr Arg Asp Ile Tyr Ile Tyr Met Pro Ser Arg Met
625 630 635 640
Ala Ala Ile Phe Gly Thr Val Ala Ser Phe Ser Tyr Met Ser Leu Glu
645 650 655
Thr Gly Glu Leu Glu Asn Pro Phe Val Tyr Ser Val Ala Tyr Tyr Leu
660 665 670
Gly Asn Glu Asp Gly Lys Leu Val Leu Ser Asn Asn Met Leu Leu His
675 680 685
Ser Asp Phe Arg Ser Phe Asp Leu Asn Gly Lys Asn Tyr Ala Ile Asn
690 695 700
Ser Leu Val Glu Phe Thr Ser Val Gln Gln Lys Tyr Tyr Ser Val Val
705 710 715 720
Glu Ile Asp Lys Asn Ala Lys Tyr Tyr Leu Phe His Ile Lys Asp Ala
725 730 735
Asn Ile Pro Asn Val Gln Phe Ile Leu Met Asp Lys Ala Met Tyr Glu
740 745 750
Ser Ala Phe Val Gln Met Phe Phe Phe Gly Lys Tyr Asp Glu Ser Leu
755 760 765
Tyr Glu Leu Ile Val Asp Ser Lys Glu Ala Lys Val Tyr Lys Leu Lys
770 775 780
Leu
785
<210> 9
<211> 2358
<212> DNA
<213> 乌普萨拉弯曲杆菌
<400> 9
atgaaaaacg aggctgtgaa aaatgcgaat ttgaggctag tattttttat cttactagct 60
tttggtttta gtgttttatg tcgcttttat tggatttatt gggcgagtga ttttaacgaa 120
tattttttta ataatcagct tatgataagc tcaaatgacg gctacacttt tgcagagggt 180
gctagagata agatagcggg ctttcatcag gaaaatgatt taagctttat taattcctct 240
ctttctattt tgacttatgt gctttataaa atcacgcctt ttagttttga aagcattatt 300
ttatatatga gtgtattttt ttcttcactt atagttgtgc cgcttatttt aattgcaaat 360
gagcttaaac gccctttaat gggacttttt gcggcatttt tagcaagtat tgcaaaaagc 420
tattataacc gcactatggc aggatattat gatacagata tgttagccat tgtgcttcct 480
atgtttattt tatatttttt catcaggctt attttaagaa aagatgattt ttctttactt 540
gccttgccgt tttttatggg actttatctt tggtggtatc catcaagcta tactctaaat 600
gtcgctttta tcgcactttt taccctttat gttttgattt atcatagaaa agaaaggtct 660
ttttatatgg cagcactttt gtgtgccatt accctttcaa atattgcttg gttttatcaa 720
agtgctatta ttgttttact ttttgctctt tttatgctta aaaattcgtt ttttaatttt 780
aaatttatcg cacttttagc cttaggagtt ttagtttttt tggctttaag tggggggata 840
gaccccatac tttatcagct taaattttat cttttaagaa gtgatgaaag tgcaagttta 900
gcgcgtggtt ttgcgtattt taatgtaaat ttaaccatac aagaggttga aagtatcgat 960
ttaagcactt ttatgcaaag aattagcgga agtgagcttg tgtttttact ttctcttttt 1020
ggctttttat ggcttttaaa aaagcataag gtgatgcttt taaccctacc tatgcttttg 1080
ctcggttttt tagcacttag aggtgggctt agatttacta tttatgctgt gcctattatg 1140
gcgcttggct ttggcttttt aagcgttcaa attttaagct taatccaaaa aatgcgtccc 1200
ttaaaagaaa ctcgaaaatt aagaatattt ttttatggaa tctttccgct ttttgtgctt 1260
gttttggggg cttattttta ttttagtcaa agtgctattt atgagagtat gggagtggaa 1320
tttcaaaaga actttgtgag cttttttgta gaagatactt tgcttttttc tttgctgatt 1380
ttggctattt ttacgccttt aatttttgag cttttgtgga gaaaaaagga cattcgtttt 1440
gtgtgtagct tttatattgt gggggttttg cttttttctt tatgggcaaa tttaagtcat 1500
atttataatt atagagcaca caccgttttt agctacaatg aagcgagtat tttggataat 1560
cttaaagcta atgtttctag ggaagattat attgtggctt ggtgggatta tggctatcct 1620
attcgttatt atagcgatgt gaaaacctta gctgatgggg gtaagcattt gggtaaggat 1680
aattttttcc cttcttttgt tttaagtcaa aatccacgcg cagcggcaaa tatggcaaga 1740
cttagcgtag aatacacaga aaaaggcttt aaaacgcctt ataatgatct tttagaagcg 1800
atgatgaagg attataatta tagcaatgta aatttatttt tagcggcact ttctaaggag 1860
gattttactc ttcaaacgcc caaaactaga gatatttaca tctatatgcc ttctcgtatg 1920
gcggcgattt ttggcacggt ggcaagtttt tcttatatga gcttagaaac gggtgagctt 1980
gaaaatcctt ttgtttatag tgtggcgtat tatttgggaa atgaggacgg caaactcgtc 2040
ttaagtaata atatgctcct tcatagcgac tttagaagct ttgaccttaa tggcaagaat 2100
tatgctatta attctttggt tgaatttact tcggtgcagc aaaaatatta tagtgttgtg 2160
gagattgata aaaatgctaa atattatctc tttcacatca aagacgctaa tatccctaat 2220
gtgcaattta tcctaatgga taaggcgatg tatgagagtg ctttcgtgca aatgtttttc 2280
tttggtaagt atgatgagag tttgtatgaa ttaattgtag atagtaaaga agcaaaggtg 2340
tataaattaa aattatga 2358
<210> 10
<211> 661
<212> PRT
<213> 人工的
<220>
<223> 弯曲杆菌属Pg1B共有序列
<220>
<221> misc_feature
<222> (1)..(2)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (7)..(7)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (15)..(15)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (22)..(22)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (27)..(29)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (34)..(34)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (38)..(39)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (41)..(41)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (47)..(47)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (51)..(52)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (54)..(54)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (56)..(56)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (58)..(60)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (62)..(62)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (64)..(64)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (66)..(66)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (69)..(69)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (71)..(71)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (73)..(73)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (75)..(78)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (80)..(81)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (84)..(84)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (86)..(86)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (90)..(90)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (98)..(98)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (107)..(108)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (111)..(111)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (113)..(113)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (115)..(124)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (127)..(129)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (131)..(133)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (135)..(135)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (137)..(139)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (149)..(149)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (152)..(152)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (154)..(156)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (159)..(160)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (162)..(162)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (164)..(165)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (167)..(167)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (170)..(172)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (174)..(177)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (179)..(182)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (185)..(186)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (189)..(189)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (191)..(194)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (197)..(197)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (200)..(201)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (203)..(210)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (212)..(216)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (218)..(224)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (226)..(226)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (231)..(231)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (234)..(234)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (240)..(240)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (242)..(250)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (252)..(255)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (257)..(257)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (263)..(263)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (266)..(266)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (269)..(271)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (273)..(275)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (278)..(278)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (282)..(282)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (284)..(285)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (288)..(289)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (291)..(292)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (295)..(296)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (298)..(301)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (303)..(303)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (305)..(307)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (310)..(310)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (312)..(312)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (316)..(316)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (319)..(320)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (328)..(328)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (331)..(332)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (338)..(348)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (350)..(373)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (376)..(377)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (379)..(381)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (385)..(387)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (390)..(391)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (393)..(394)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (397)..(400)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (405)..(405)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (407)..(407)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (415)..(415)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (421)..(421)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (425)..(425)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (438)..(438)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (441)..(441)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (444)..(448)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (463)..(463)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (465)..(469)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (471)..(471)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (473)..(473)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (475)..(476)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (481)..(486)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (489)..(490)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (492)..(494)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (497)..(499)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (501)..(501)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (506)..(506)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (508)..(508)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (512)..(512)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (515)..(516)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (518)..(520)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (523)..(523)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (525)..(531)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (533)..(540)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (542)..(542)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (544)..(552)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (554)..(556)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (558)..(558)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (560)..(565)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (569)..(586)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (588)..(601)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (603)..(604)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (606)..(618)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (620)..(622)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (624)..(628)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (630)..(632)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (636)..(639)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (642)..(643)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (645)..(653)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (656)..(661)
<223> Xaa可以是任意天然存在的氨基酸
<400> 10
Xaa Xaa Asn Asp Gly Tyr Xaa Phe Ala Glu Gly Ala Arg Asp Xaa Ile
1 5 10 15
Ala Gly Phe His Gln Xaa Asn Asp Leu Ser Xaa Xaa Xaa Ser Ser Leu
20 25 30
Ser Xaa Leu Thr Tyr Xaa Xaa Tyr Xaa Ile Leu Pro Phe Ser Xaa Glu
35 40 45
Ser Ile Xaa Xaa Tyr Xaa Ser Xaa Phe Xaa Xaa Xaa Leu Xaa Val Xaa
50 55 60
Pro Xaa Ile Leu Xaa Ala Xaa Glu Xaa Lys Xaa Xaa Xaa Xaa Gly Xaa
65 70 75 80
Xaa Ala Ala Xaa Leu Xaa Ser Ile Ala Xaa Ser Tyr Tyr Asn Arg Thr
85 90 95
Met Xaa Gly Tyr Tyr Asp Thr Asp Met Leu Xaa Xaa Val Leu Xaa Met
100 105 110
Xaa Ile Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Asp Xaa Xaa
115 120 125
Xaa Leu Xaa Xaa Xaa Pro Xaa Phe Xaa Xaa Xaa Tyr Leu Trp Trp Tyr
130 135 140
Pro Ser Ser Tyr Xaa Leu Asn Xaa Ala Xaa Xaa Xaa Leu Phe Xaa Xaa
145 150 155 160
Tyr Xaa Leu Xaa Xaa His Xaa Lys Glu Xaa Xaa Xaa Tyr Xaa Xaa Xaa
165 170 175
Xaa Leu Xaa Xaa Xaa Xaa Leu Ser Xaa Xaa Ala Trp Xaa Tyr Xaa Xaa
180 185 190
Xaa Xaa Ile Val Xaa Leu Phe Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa
195 200 205
Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa
210 215 220
Leu Xaa Leu Ser Gly Gly Xaa Asp Pro Xaa Leu Tyr Gln Leu Lys Xaa
225 230 235 240
Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe
245 250 255
Xaa Tyr Phe Asn Val Asn Xaa Thr Ile Xaa Glu Val Xaa Xaa Xaa Asp
260 265 270
Xaa Xaa Xaa Phe Met Xaa Arg Ile Ser Xaa Ser Xaa Xaa Val Phe Xaa
275 280 285
Xaa Ser Xaa Xaa Gly Phe Xaa Xaa Leu Xaa Xaa Xaa Xaa Lys Xaa Met
290 295 300
Xaa Xaa Xaa Leu Pro Xaa Leu Xaa Leu Gly Phe Xaa Ala Leu Xaa Xaa
305 310 315 320
Gly Leu Arg Phe Thr Ile Tyr Xaa Val Pro Xaa Xaa Ala Leu Gly Phe
325 330 335
Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa
340 345 350
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
355 360 365
Xaa Xaa Xaa Xaa Xaa His Ile Xaa Xaa Tyr Xaa Xaa Xaa Thr Val Phe
370 375 380
Xaa Xaa Xaa Glu Ala Xaa Xaa Leu Xaa Xaa Leu Lys Xaa Xaa Xaa Xaa
385 390 395 400
Arg Glu Asp Tyr Xaa Val Xaa Trp Trp Asp Tyr Gly Tyr Pro Xaa Arg
405 410 415
Tyr Tyr Ser Asp Xaa Lys Thr Leu Xaa Asp Gly Gly Lys His Leu Gly
420 425 430
Lys Asp Asn Phe Phe Xaa Ser Phe Xaa Leu Ser Xaa Xaa Xaa Xaa Xaa
435 440 445
Ala Ala Asn Met Ala Arg Leu Ser Val Glu Tyr Thr Glu Lys Xaa Phe
450 455 460
Xaa Xaa Xaa Xaa Xaa Asp Xaa Leu Xaa Ala Xaa Xaa Lys Asp Tyr Asn
465 470 475 480
Xaa Xaa Xaa Xaa Xaa Xaa Phe Leu Xaa Xaa Leu Xaa Xaa Xaa Asp Phe
485 490 495
Xaa Xaa Xaa Thr Xaa Lys Thr Arg Asp Xaa Tyr Xaa Tyr Met Pro Xaa
500 505 510
Arg Met Xaa Xaa Ile Xaa Xaa Xaa Val Ala Xaa Phe Xaa Xaa Xaa Xaa
515 520 525
Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Ala Xaa
530 535 540
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Leu Xaa Asn Xaa
545 550 555 560
Xaa Xaa Xaa Xaa Xaa Asp Phe Arg Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
565 570 575
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa
580 585 590
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala Xaa Xaa Tyr Xaa Xaa Xaa
595 600 605
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Asp Xaa
610 615 620
Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Gln Met Phe Xaa Xaa Xaa Xaa Tyr
625 630 635 640
Asp Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ala Lys Xaa
645 650 655
Xaa Xaa Xaa Xaa Xaa
660
<210> 11
<211> 967
<212> PRT
<213> 激烈火球菌
<400> 11
Met Val Lys Thr Gln Ile Lys Glu Lys Lys Lys Asp Glu Lys Val Thr
1 5 10 15
Ile Pro Leu Pro Gly Lys Ile Lys Thr Val Leu Ala Phe Leu Val Val
20 25 30
Leu Ala Phe Ala Ala Tyr Gly Phe Tyr Ile Arg His Leu Thr Ala Gly
35 40 45
Lys Tyr Phe Ser Asp Pro Asp Thr Phe Tyr His Phe Glu Ile Tyr Lys
50 55 60
Leu Val Leu Lys Glu Gly Leu Pro Arg Tyr Tyr Pro Met Ala Asp Ala
65 70 75 80
Pro Phe Gly Ser Leu Ile Gly Glu Pro Leu Gly Leu Tyr Ile Leu Pro
85 90 95
Ala Ile Phe Tyr Lys Ile Ile Ser Ile Phe Gly Tyr Asn Glu Leu Glu
100 105 110
Ala Phe Leu Leu Trp Pro Pro Phe Val Gly Phe Leu Ser Val Ile Gly
115 120 125
Val Tyr Leu Leu Gly Arg Lys Val Leu Asn Glu Trp Ala Gly Met Trp
130 135 140
Gly Ala Ile Ile Leu Ser Val Leu Thr Ala Asn Phe Ser Arg Thr Phe
145 150 155 160
Ser Gly Asn Ala Arg Gly Asp Gly Pro Phe Met Met Leu Phe Thr Phe
165 170 175
Ser Ala Val Leu Met Leu Tyr Tyr Leu Thr Glu Glu Asn Lys Asn Lys
180 185 190
Lys Ile Ile Trp Gly Thr Leu Phe Val Leu Leu Ala Gly Ile Ser Thr
195 200 205
Ala Ala Trp Asn Gly Ser Pro Phe Gly Leu Met Val Leu Leu Gly Phe
210 215 220
Ala Ser Phe Gln Thr Ile Ile Leu Phe Ile Phe Gly Lys Ile Asn Glu
225 230 235 240
Leu Arg Glu Phe Ile Lys Glu Tyr Tyr Pro Ala Tyr Leu Gly Ile Leu
245 250 255
Ala Ile Ser Tyr Leu Leu Thr Ile Pro Gly Ile Gly Lys Ile Gly Gly
260 265 270
Phe Val Arg Phe Ala Phe Glu Val Phe Leu Gly Leu Val Phe Leu Ala
275 280 285
Ile Val Met Leu Tyr Gly Gly Lys Tyr Leu Asn Tyr Ser Asp Lys Lys
290 295 300
His Arg Phe Ala Val Val Ala Val Ile Val Ile Ala Gly Phe Ala Gly
305 310 315 320
Ala Tyr Ile Tyr Val Gly Pro Lys Leu Phe Thr Leu Met Gly Gly Ala
325 330 335
Tyr Gln Ser Thr Gln Val Tyr Glu Thr Val Gln Glu Leu Ala Lys Thr
340 345 350
Asp Trp Gly Asp Val Lys Val Tyr Tyr Gly Val Glu Lys Pro Asn Gly
355 360 365
Ile Val Phe Phe Leu Gly Leu Val Gly Ala Met Ile Val Thr Ala Arg
370 375 380
Tyr Leu Tyr Lys Leu Phe Lys Asp Gly Arg Arg Pro His Glu Glu Leu
385 390 395 400
Phe Ala Ile Thr Phe Tyr Val Met Ser Ile Tyr Leu Leu Trp Thr Ala
405 410 415
Ala Arg Phe Leu Phe Leu Ala Ser Tyr Ala Ile Ala Leu Met Ser Gly
420 425 430
Val Phe Ala Gly Tyr Val Leu Glu Thr Val Glu Lys Met Lys Glu Ser
435 440 445
Ile Pro Ile Lys Ala Ala Leu Gly Gly Val Ile Ala Ile Met Leu Leu
450 455 460
Leu Ile Pro Leu Thr His Gly Pro Leu Leu Ala Gln Ser Ala Lys Ser
465 470 475 480
Met Arg Thr Thr Glu Ile Glu Thr Ser Gly Trp Glu Asp Ala Leu Lys
485 490 495
Trp Leu Arg Glu Asn Thr Pro Glu Tyr Ser Thr Ala Thr Ser Trp Trp
500 505 510
Asp Tyr Gly Tyr Trp Ile Glu Ser Ser Leu Leu Gly Gln Arg Arg Ala
515 520 525
Ser Ala Asp Gly Gly His Ala Arg Asp Arg Asp His Ile Leu Ala Leu
530 535 540
Phe Leu Ala Arg Asp Gly Asn Ile Ser Glu Val Asp Phe Glu Ser Trp
545 550 555 560
Glu Leu Asn Tyr Phe Leu Val Tyr Leu Asn Asp Trp Ala Lys Phe Asn
565 570 575
Ala Ile Ser Tyr Leu Gly Gly Ala Ile Thr Arg Arg Glu Tyr Asn Gly
580 585 590
Asp Glu Ser Gly Arg Gly Ala Val Thr Thr Leu Leu Pro Leu Pro Arg
595 600 605
Tyr Gly Glu Lys Tyr Val Asn Leu Tyr Ala Lys Val Ile Val Asp Val
610 615 620
Ser Asn Ser Ser Val Lys Val Thr Val Gly Asp Arg Glu Cys Asp Pro
625 630 635 640
Leu Met Val Thr Phe Thr Pro Ser Gly Lys Thr Ile Lys Gly Thr Gly
645 650 655
Thr Cys Ser Asp Gly Asn Ala Phe Pro Tyr Val Leu His Leu Thr Pro
660 665 670
Thr Ile Gly Val Leu Ala Tyr Tyr Lys Val Ala Thr Ala Asn Phe Ile
675 680 685
Lys Leu Ala Phe Gly Val Pro Ala Ser Thr Ile Pro Gly Phe Ser Asp
690 695 700
Lys Leu Phe Ser Asn Phe Glu Pro Val Tyr Glu Ser Gly Asn Val Ile
705 710 715 720
Val Tyr Arg Phe Thr Pro Phe Gly Ile Tyr Lys Ile Glu Glu Asn Ile
725 730 735
Asn Gly Thr Trp Lys Gln Val Tyr Asn Leu Thr Pro Gly Lys His Glu
740 745 750
Leu Lys Leu Tyr Ile Ser Ala Phe Gly Arg Asp Ile Glu Asn Ala Thr
755 760 765
Leu Tyr Ile Tyr Ala Ile Asn Asn Glu Lys Ile Ile Glu Lys Ile Lys
770 775 780
Ile Ala Glu Ile Ser His Met Asp Tyr Leu Asn Glu Tyr Pro Ile Ala
785 790 795 800
Val Asn Val Thr Leu Pro Asn Ala Thr Ser Tyr Arg Phe Val Leu Val
805 810 815
Gln Lys Gly Pro Ile Gly Val Leu Leu Asp Ala Pro Lys Val Asn Gly
820 825 830
Glu Ile Arg Ser Pro Thr Asn Ile Leu Arg Glu Gly Glu Ser Gly Glu
835 840 845
Ile Glu Leu Lys Val Gly Val Asp Lys Asp Tyr Thr Ala Asp Leu Tyr
850 855 860
Leu Arg Ala Thr Phe Ile Tyr Leu Val Arg Lys Ser Gly Lys Asp Asn
865 870 875 880
Glu Asp Tyr Asp Ala Ala Phe Glu Pro Gln Met Asp Val Phe Phe Ile
885 890 895
Thr Lys Ile Gly Glu Asn Ile Gln Leu Lys Glu Gly Glu Asn Thr Val
900 905 910
Lys Val Arg Ala Glu Leu Pro Glu Gly Val Ile Ser Ser Tyr Lys Asp
915 920 925
Glu Leu Gln Arg Lys Tyr Gly Asp Lys Leu Ile Ile Arg Gly Ile Arg
930 935 940
Val Glu Pro Val Phe Ile Ala Glu Lys Glu Tyr Leu Met Leu Glu Val
945 950 955 960
Ser Ala Ser Ala Pro His His
965
<210> 12
<211> 2904
<212> DNA
<213> 激烈火球菌
<400> 12
atggtgaaaa cccaaataaa ggagaaaaag aaagatgaaa aagttactat tccacttcct 60
gggaagataa aaactgtttt ggccttccta gtcgttttgg catttgccgc atatggattt 120
tacattagac atttaacagc cggaaagtat ttctcagatc cagatacctt ctaccatttc 180
gaaatttata agctagtcct caaagagggc cttcctaggt attacccaat ggcagatgct 240
ccatttggaa gtctcatagg agaacctctt ggactataca tccttccagc aatattctac 300
aaaataatct caatatttgg gtacaatgag ctagaggcat ttcttctttg gcccccattc 360
gtaggatttc tcagtgttat aggtgtttac ttactcggaa gaaaagttct gaacgaatgg 420
gcagggatgt ggggtgctat aattctctca gtcctcacgg caaacttttc aagaacattc 480
tcaggcaacg caagaggcga cggcccattc atgatgttgt ttacgttttc agcagtccta 540
atgctctatt atctaaccga ggaaaataaa aacaagaaaa taatctgggg aacactgttt 600
gtactcttgg caggaatatc aactgcagca tggaacggtt caccatttgg actaatggtt 660
ctccttggat tcgcatcgtt ccagacaata atcctcttta tttttggaaa gatcaatgag 720
cttagagaat tcataaagga atactaccca gcatacctgg gaattttagc tataagctac 780
cttctaacga tcccaggaat tggaaaaata ggaggatttg taagatttgc atttgaggtt 840
ttcttagggt tagttttctt agccatcgtc atgctctatg gaggaaaata cttgaactat 900
tctgacaaga agcacaggtt cgcagtggtt gcagttatag ttattgcggg gttcgcagga 960
gcttatattt acgttggtcc aaaactcttc actctaatgg gtggagctta tcagtcaacg 1020
caagtttatg aaacagtaca ggagctcgca aaaactgatt ggggagatgt aaaagtctat 1080
tatggagtag aaaagccaaa cggaatagtc ttcttccttg gattagttgg agcaatgatt 1140
gttacagcta ggtacctcta caaattattt aaagatggaa ggcgcccaca cgaagagtta 1200
tttgcaataa ctttctatgt aatgtcaatt tacctcctct ggacagctgc tagattccta 1260
ttcctagcga gttatgcgat agcattgatg tcaggtgtct ttgcaggata cgtcctagag 1320
actgtagaaa agatgaaaga gagtatacca ataaaagcag cactaggagg agtaattgct 1380
attatgcttc ttctaatacc cttaactcat ggcccactct tagctcaaag cgctaaaagt 1440
atgagaacaa ccgagatcga gactagtgga tgggaagatg cgctcaaatg gctcagagaa 1500
aacactccag aatattcgac cgcaacctct tggtgggact atggatattg gatagagtca 1560
agcctcctag gacagagaag ggccagtgct gatggtggac atgcaagaga tagagatcat 1620
atcttagccc tatttctagc cagagacggt aacattagtg aagtagactt tgagagttgg 1680
gagcttaact acttcctagt ttaccttaat gattgggcaa agttcaatgc aatcagctat 1740
ctaggcgggg ctataacgag gagagaatac aatggagatg aaagtggaag aggagccgta 1800
actacgctac ttcctctccc aaggtatgga gagaaatacg tcaacctcta tgccaaagtt 1860
atagttgatg tttcaaactc gagcgtaaag gttactgtag gagacagaga gtgtgatcca 1920
ctaatggtta cgtttactcc aagtggaaag acgataaaag gaactggaac ctgtagtgat 1980
ggcaacgcct tcccatatgt tttacactta actccaacaa ttggagtact tgcatactac 2040
aaagtagcaa ctgcaaactt cattaagtta gccttcggtg ttccagcttc aacaattcca 2100
ggattctctg ataagctatt ctcaaacttt gagccagtgt atgagtcagg aaacgtaata 2160
gtatatcgct tcacaccatt tggaatatac aaaattgagg aaaacattaa cggaacttgg 2220
aagcaagttt ataacctaac tcctggaaaa cacgagctca aactgtacat ttcagcattc 2280
ggaagagaca tcgaaaatgc aacgctgtac atttacgcca taaacaacga gaagatcata 2340
gagaaaatta agattgccga gatatcccac atggactatc taaatgaata cccgatagca 2400
gtgaacgtaa ccctaccaaa tgctacaagc tacaggtttg tactagttca aaaaggccca 2460
ataggtgttc ttctagatgc accaaaagtc aatggtgaga taagaagtcc aaccaacata 2520
ctaagggaag gagaaagtgg agaaatagag cttaaagttg gggttgataa agactacact 2580
gccgatctat acttaagggc tacgttcata tatttagtca gaaaaagtgg aaaggataac 2640
gaagattatg acgcagcgtt tgagccccaa atggatgttt tctttatcac aaagatcgga 2700
gaaaacattc aacttaaaga aggagagaat acagtaaagg ttagggcgga gcttccagaa 2760
ggagttatat ctagctacaa agatgaacta cagagaaaat acggagacaa gttgataatc 2820
agaggaataa gagtagagcc agtgttcata gcagaaaaag agtacctaat gctcgaggtc 2880
agtgcatcgg ctcctcatca ctaa 2904
<210> 13
<211> 980
<212> PRT
<213> 人工的
<220>
<223> 火球菌属 ST04 OST
<400> 13
Met Lys Ser Leu Val Lys Val Glu Val Lys Arg Glu Lys Lys Asp Arg
1 5 10 15
Lys Glu Lys Arg Glu Ile Gly Asn Ile Ser Arg His Tyr Gly Lys Ile
20 25 30
Lys Leu Ala Leu Thr Phe Ile Val Thr Leu Ile Phe Ala Trp Tyr Ala
35 40 45
Phe His Ile Arg His Leu Thr Ala Gly Lys Tyr Phe Pro Asp Pro Asp
50 55 60
Thr Phe Tyr His Tyr Glu Ile Tyr Lys Leu Val Leu Lys Glu Gly Leu
65 70 75 80
Pro Lys Tyr Tyr Pro Met Ser Asp Ala Pro Phe Gly Ser Leu Ile Gly
85 90 95
Glu Pro Leu Gly Leu Tyr Ile Leu Pro Ala Ile Phe Tyr Lys Ile Leu
100 105 110
Ser Ala Phe Gly Tyr Asn Glu Phe Gln Ala Phe Leu Leu Trp Pro Pro
115 120 125
Phe Val Gly Phe Leu Ser Val Ile Gly Val Tyr Leu Leu Gly Arg Lys
130 135 140
Ile Leu Asn Glu Trp Ala Gly Leu Trp Ala Ala Ala Ile Leu Ala Val
145 150 155 160
Ser Thr Ala Asn Phe Ser Arg Thr Phe Ser Gly Asn Ala Arg Gly Asp
165 170 175
Gly Pro Phe Met Met Leu Phe Val Phe Ser Met Val Ala Leu Leu Tyr
180 185 190
Tyr Leu Glu Glu Ala Arg Ile Lys Arg Lys Ala Val Trp Gly Ala Leu
195 200 205
Phe Val Ile Leu Ala Gly Leu Ser Thr Met Ala Trp Asn Gly Ser Pro
210 215 220
Phe Gly Leu Met Val Leu Leu Gly Phe Ala Ser Leu Gln Thr Ile Ala
225 230 235 240
Leu Phe Ile Phe Gly Lys Ile Asp Glu Leu Lys Lys Phe Ile Lys Glu
245 250 255
Phe Tyr Pro Ala Tyr Val Ser Val Leu Ile Leu Ser Tyr Leu Leu Thr
260 265 270
Ile Pro Gly Leu Ala Lys Ile Gln Ser Phe Ile Arg Phe Ala Phe Glu
275 280 285
Val Phe Leu Gly Leu Val Phe Leu Ala Ile Val Met Leu Tyr Gly Glu
290 295 300
Lys Phe Leu Asn Tyr Ser Asp Lys Lys His Arg Phe Leu Val Val Ala
305 310 315 320
Ile Ile Val Leu Ile Gly Phe Ala Gly Ala Tyr Ala Tyr Val Gly Pro
325 330 335
Lys Leu Phe Arg Leu Met Gly Gly Ala Tyr Gln Ser Thr Gln Val Tyr
340 345 350
Gln Thr Val Gln Glu Leu Ala Lys Thr Ser Met Gln Asp Ile Lys Leu
355 360 365
Tyr Tyr Gly Val Glu Lys Ala Asn Gly Leu Ile Phe Phe Leu Ser Ile
370 375 380
Pro Gly Phe Leu Ile Met Leu Ser Leu Tyr Leu Ile Gly Leu Trp Ser
385 390 395 400
Lys Ser Glu Ser Pro Asn Lys Glu Leu Leu Gly Ile Thr Phe Tyr Val
405 410 415
Met Ser Ile Tyr Leu Met Ser Leu Ala Val Arg Phe Leu Phe Leu Ala
420 425 430
Ser Tyr Ala Ile Ala Leu Phe Ala Gly Ile Leu Val Gly Tyr Gly Leu
435 440 445
Glu Val Ile Glu Lys Met Lys Glu Asn Val Gly Ile Lys Ala Ala Leu
450 455 460
Ala Ile Val Ile Ser Ile Met Ile Leu Leu Ile Pro Ile Thr His Gly
465 470 475 480
Pro Val Leu Ala Arg Ser Ala Lys Ala Met Ser Lys Thr Glu Val Glu
485 490 495
Thr Ser Gly Trp Glu Gln Ala Leu Lys Trp Leu Arg Asn Asn Thr Pro
500 505 510
Lys Tyr Ala Thr Ala Thr Ser Trp Trp Asp Tyr Gly Tyr Trp Ile Glu
515 520 525
Ser Ser Leu Leu Gly Asn Arg Arg Ala Ser Ala Asp Gly Gly His Ala
530 535 540
Arg Asp Arg Asp His Ile Leu Ala Leu Phe Leu Ala Arg Asp Gly Asn
545 550 555 560
Val Ser Glu Val Asp Phe Glu Ser Trp Glu Leu Asn Tyr Phe Ile Val
565 570 575
Tyr Leu Asn Asp Trp Ala Lys Phe Asn Ala Ile Ser Tyr Leu Gly Gly
580 585 590
Ala Ile Thr Lys Arg Glu Tyr Ser Gly Asp Glu Lys Gly Arg Gly Ser
595 600 605
Ile Pro Thr Ile Ile Leu Ala Pro Arg Phe Gly Glu Gln Tyr Ile Asn
610 615 620
Pro Tyr Asn Gly Val Ser Ile Lys Val Leu Asn Asn Ser Gln Val Thr
625 630 635 640
Val Thr Ile Gly Ser Thr Thr Cys Ser Pro Leu Met Thr Val Phe Ile
645 650 655
Pro Gly Asn Lys Lys Val Lys Gly Gln Gly Ser Cys Thr Asn Gly Gly
660 665 670
Ser Phe Pro Phe Val Val Tyr Leu Thr Pro Thr Leu Gly Val Ile Ser
675 680 685
Tyr Tyr Lys Val Ala Thr Ser Asn Phe Leu Lys Leu Ala Tyr Gly Ile
690 695 700
Pro Ala Ser Lys Glu Pro Gly Phe Thr Asp Lys Leu Phe Ser Asn Phe
705 710 715 720
Lys Met Val Tyr Gln Glu Gly Asn Val Val Ile Tyr Glu Phe Arg Pro
725 730 735
Phe Ala Ile Tyr Lys Leu Gln Glu Phe Thr Asn Gly Thr Trp Lys Thr
740 745 750
Ile Thr Thr Leu Ser Pro Gly Lys His Thr Leu Lys Leu Tyr Ile Ser
755 760 765
Ala Phe Gly Arg Asp Ile Lys Asn Ala Thr Leu Tyr Ile Asp Ala Ile
770 775 780
Lys Asp Asn Arg Thr Ile Gln Arg Ile Lys Ile Gly Glu Ile Lys Tyr
785 790 795 800
Met Ser His Leu Asn Glu Thr Pro Ile Thr Val Asn Val Thr Leu Pro
805 810 815
Asp Ala Asp Lys Tyr Lys Phe Val Leu Val Gln Lys Gly Pro Val Gly
820 825 830
Val Leu Thr Ala Pro Pro Lys Val Asn Gly Lys Ile Ala Asn Pro Val
835 840 845
Arg Val Leu Asn Asp Gly Glu Ser Gly Arg Leu Glu Leu Lys Val Gly
850 855 860
Val Asp Lys Asp Tyr Lys Ala Asp Leu Tyr Leu Arg Ala Thr Phe Ile
865 870 875 880
Tyr Leu Val Arg Lys Ser Gly Thr Ser Asn Asp Asp Tyr Asn Ala Ala
885 890 895
Phe Glu Pro His Met Asp Val Phe Phe Ile Thr Lys Leu Lys Ser Gly
900 905 910
Ile Ser Leu His Lys Gly Glu Asn Glu Val Thr Val Glu Ala Lys Met
915 920 925
Pro Glu Asn Val Ile Ser Asp Tyr Lys Lys Lys Leu Glu Ala Glu Tyr
930 935 940
Gly Asp Lys Leu Ile Ile Arg Gly Ile Arg Val Glu Pro Val Phe Ile
945 950 955 960
Ala Glu Lys Glu Tyr Val Met Leu Glu Val Arg Ala Ser Ala Pro His
965 970 975
His Ser Ser Glu
980
<210> 14
<211> 973
<212> PRT
<213> 人工的
<220>
<223> 火球菌属菌株NA2 OST
<400> 14
Met Val Lys Arg Lys Lys Glu Glu Lys Glu Ile Lys Gly Glu Lys Arg
1 5 10 15
Glu Phe Tyr Ser Lys Ile Lys Arg Met Ile Ile Pro Ile Ile Val Leu
20 25 30
Gly Phe Ala Thr Tyr Gly Phe Tyr Leu Arg His Leu Thr Ala Gly Arg
35 40 45
Tyr Phe Pro Asp Pro Asp Thr Phe Tyr His Phe Glu Ile Tyr Lys Leu
50 55 60
Val Ile Lys Glu Gly Leu Pro Lys Tyr Tyr Pro Leu Ser Asp Ala Pro
65 70 75 80
Phe Gly Ser Leu Ile Gly Glu Pro Leu Gly Leu Tyr Ile Leu Pro Ala
85 90 95
Ile Phe Tyr Lys Val Ile Ser Ala Phe Gly Tyr Asn Glu Phe Gln Ala
100 105 110
Phe Leu Leu Trp Pro Pro Phe Val Gly Phe Leu Ser Val Val Gly Ile
115 120 125
Tyr Leu Leu Gly Arg Lys Val Leu Asn Glu Trp Ala Gly Leu Trp Ala
130 135 140
Ala Val Ile Leu Ser Val Ser Thr Ala Asn Phe Ser Arg Thr Phe Ser
145 150 155 160
Gly Asn Ala Arg Gly Asp Gly Pro Phe Met Met Leu Phe Val Phe Ser
165 170 175
Ala Ile Leu Met Phe His Tyr Leu Arg Glu Thr Ser Lys Thr Lys Lys
180 185 190
Val Leu Tyr Gly Thr Leu Phe Val Ile Leu Ala Ser Ile Ser Leu Gly
195 200 205
Ala Trp Asn Gly Ser Pro Phe Gly Leu Met Val Leu Leu Gly Phe Ala
210 215 220
Ser Phe Gln Thr Ile Ala Leu Phe Ile Phe Gly Lys Ile Ser Glu Leu
225 230 235 240
Lys Lys Phe Ala Thr Glu Phe Tyr Pro Ala Tyr Leu Gly Ile Leu Ala
245 250 255
Leu Gly Tyr Leu Leu Thr Ile Pro Gly Ile Val Lys Ile Gly Ser Phe
260 265 270
Ile Lys Phe Ala Phe Glu Val Phe Leu Gly Leu Val Val Leu Leu Thr
275 280 285
Ile Met Leu Tyr Gly Gly Arg Tyr Leu Asn Tyr Ser Asp Lys Lys His
290 295 300
Arg Phe Leu Val Val Ala Val Val Val Leu Ile Gly Phe Ala Gly Ala
305 310 315 320
Tyr Ala Tyr Val Gly Pro Lys Leu Phe Arg Leu Met Gly Gly Ala Tyr
325 330 335
Gln Ser Thr Gln Val Tyr Glu Thr Val Gln Glu Leu Ala Lys Thr Thr
340 345 350
Met Arg Asp Ile Lys Val Tyr Tyr Gly Val Glu Asn Pro Asn Gly Leu
355 360 365
Ile Phe Phe Leu Ser Ile Pro Gly Ile Ile Ile Ile Leu Val Lys Tyr
370 375 380
Leu Val Asp Leu Phe Arg Lys Ser Glu Ser Ser Asn Glu Thr Leu Phe
385 390 395 400
Ala Ala Val Phe Tyr Ile Met Ser Ile Tyr Leu Leu Ser Leu Ala Val
405 410 415
Arg Phe Leu Phe Leu Ala Ser Tyr Ala Ile Ala Leu Phe Ala Gly Ile
420 425 430
Phe Ala Gly Phe Val Ile Glu Ile Val Glu Lys Met Lys Glu Ser Ile
435 440 445
Gly Ile Lys Ala Ala Leu Gly Ile Val Ile Ser Ile Met Ile Leu Met
450 455 460
Ile Pro Ile Thr His Ala Pro Val Leu Ala Arg Ser Ala Arg Ser Leu
465 470 475 480
Ser Arg Thr Glu Val Glu Thr Thr Gly Trp Glu Gln Val Leu Lys Trp
485 490 495
Leu Arg Ser Asn Thr Ser Gln Tyr Ala Thr Ala Thr Ser Trp Trp Asp
500 505 510
Tyr Gly Tyr Trp Ile Glu Ser Ser Leu Leu Gly Asn Arg Arg Ala Ser
515 520 525
Ala Asp Gly Gly His Ala Arg Asp Arg Asp His Ile Leu Ala Leu Phe
530 535 540
Leu Ala Arg Asp Gly Asn Val Ser Glu Val Asp Phe Glu Ser Trp Glu
545 550 555 560
Leu Asn Tyr Phe Ile Val Tyr Leu Asn Asp Trp Ala Lys Phe Asn Ala
565 570 575
Ile Ser Tyr Leu Gly Gly Ala Leu Thr Arg Arg Glu Tyr Lys Gly Asp
580 585 590
Glu Thr Gly Arg Gly Ser Val Thr Ser Ile Leu Ile Thr Gln Gly Ala
595 600 605
Gly Asn Val Tyr Val Asn Pro Tyr Ala Gly Ile Thr Ile Lys Val Val
610 615 620
Glu Glu Asn Lys Thr Arg Lys Val Val Val Asn Ile Gly Arg Leu Glu
625 630 635 640
Cys Ser Pro Met Thr Thr Val Val Phe Pro Gly Asn Ile His Ile Lys
645 650 655
Gly Thr Gly Ser Cys Asn Asn Gly Ser Ser Phe Pro Tyr Val Val Tyr
660 665 670
Leu Thr Pro Ser Leu Gly Ile Ile Ala Tyr Tyr Lys Val Ala Thr Ser
675 680 685
Asn Phe Ile Lys Leu Ala Phe Gly Ile Pro Val Ser Asn Tyr Lys Gly
690 695 700
Phe Thr Glu Lys Leu Phe Ser Asn Phe Val Pro Val Tyr Gln Ala Gly
705 710 715 720
Asn Val Ile Val Tyr Glu Phe Arg Pro Phe Ala Ile Tyr Gly Met Glu
725 730 735
Glu Leu Val Asn Gly Ser Trp Arg Tyr Ile Gly Tyr Leu Thr Pro Gly
740 745 750
Lys His Thr Leu Arg Leu Tyr Ile Ser Ala Phe Gly Arg Asp Ile Lys
755 760 765
Asn Ala Thr Leu Tyr Val Tyr Ala Ile Asn Gly Thr Glu Ile Thr Ala
770 775 780
Lys Ile Arg Leu Thr Lys Ile Asp Tyr Met Asn His Leu Asn Glu Tyr
785 790 795 800
Pro Ile Thr Val Asn Val Thr Leu Pro Pro Ala Gln Lys Tyr Arg Phe
805 810 815
Val Leu Val Gln Lys Gly Pro Val Gly Val Leu Thr Gly Pro Pro Lys
820 825 830
Leu Asn Gly Lys Ile Val Asn Pro Ile Ser Val Leu Lys Glu Gly Glu
835 840 845
Glu Gly Glu Leu Glu Leu Lys Val Gly Val Asp Lys Asn Tyr Thr Ala
850 855 860
Asp Leu Tyr Leu Arg Ala Thr Phe Ile Tyr Leu Val Arg Lys Gly Gly
865 870 875 880
Thr Ser Asn Glu Asp Tyr Asn Ala Ala Phe Glu Pro His Met Asp Val
885 890 895
Phe Phe Ile Ser Arg Val Lys Glu Gly Ile Lys Leu His Pro Gly Asp
900 905 910
Asn Tyr Val Lys Ala His Val Glu Met Pro Lys Gly Val Ile Ser Ser
915 920 925
Tyr Lys Glu Glu Leu Glu Lys Lys Tyr Gly Asp Arg Leu Ile Ile Arg
930 935 940
Gly Ile Arg Val Glu Pro Val Phe Ile Ala Glu Lys Glu Tyr Thr Met
945 950 955 960
Leu Glu Val Ser Ala Ser Ala Pro His His Ser Ser Glu
965 970
<210> 15
<211> 976
<212> PRT
<213> 超嗜热火球菌
<400> 15
Met Val Lys Ser Lys Val Lys Lys Val Glu Lys Gly Lys Glu Gly Glu
1 5 10 15
Glu Lys Arg Ser Thr Tyr Val Leu Leu Lys Lys Val Leu Ile Pro Ile
20 25 30
Leu Val Phe Gly Phe Ala Ile Tyr Ala Phe Tyr Leu Arg His Leu Thr
35 40 45
Ala Gly Lys Tyr Phe Pro Asp Pro Asp Thr Phe Tyr His Phe Glu Ile
50 55 60
Tyr Lys Leu Val Leu Lys Glu Gly Leu Pro Arg Tyr Tyr Pro Met Ser
65 70 75 80
Asp Ala Pro Phe Gly Ser Leu Ile Gly Glu Pro Leu Gly Leu Tyr Leu
85 90 95
Leu Pro Ala Ala Phe Tyr Lys Val Val Ser Leu Phe Gly Tyr Asn Glu
100 105 110
Leu Gln Ala Phe Leu Leu Trp Pro Pro Phe Val Gly Phe Leu Gly Val
115 120 125
Ile Ala Val Tyr Leu Leu Gly Arg Lys Val Leu Asn Glu Trp Thr Gly
130 135 140
Leu Trp Gly Ala Val Val Leu Thr Val Ser Thr Ala Asn Phe Ser Arg
145 150 155 160
Thr Phe Ser Gly Asn Ala Arg Gly Asp Gly Pro Phe Met Ala Leu Phe
165 170 175
Ile Phe Ala Ser Val Ala Met Leu Tyr Tyr Leu Lys Glu Ser Asn Lys
180 185 190
Thr Arg Lys Ile Ile Tyr Gly Thr Leu Phe Val Leu Leu Thr Val Ile
195 200 205
Ser Leu Gly Ala Trp Asn Gly Ser Pro Phe Gly Leu Met Val Leu Leu
210 215 220
Gly Phe Ala Ser Leu Gln Thr Ile Ile Leu Phe Ile Phe Gly Lys Leu
225 230 235 240
Glu Glu Leu Lys Lys Phe Val Lys Glu Phe Tyr Pro Ala Tyr Leu Ala
245 250 255
Ile Leu Ala Phe Gly Tyr Ala Leu Thr Phe Pro Gly Ile Val Lys Ile
260 265 270
Gly Gly Phe Ile Arg Phe Ala Phe Glu Val Phe Leu Gly Leu Ile Phe
275 280 285
Leu Leu Val Ile Met Leu Tyr Gly Gly Arg Tyr Leu Asn Tyr Ser Asp
290 295 300
Lys Lys His Arg Phe Leu Val Val Thr Ile Ile Val Leu Leu Gly Phe
305 310 315 320
Gly Gly Ala Tyr Ala Tyr Val Gly Pro Lys Leu Phe Arg Leu Met Gly
325 330 335
Gly Ala Tyr Gln Ser Thr Gln Val Tyr Glu Thr Val Gln Glu Leu Ala
340 345 350
Lys Thr Thr Ile Gly Asp Val Lys Ala Tyr Tyr Gly Val Glu Ser Gly
355 360 365
Asn Gly Leu Ile Phe Phe Leu Ser Ile Pro Gly Leu Leu Ile Leu Leu
370 375 380
Thr Lys Tyr Leu Tyr Asp Leu Phe Lys Lys Ala Lys Ser Asp Asn Glu
385 390 395 400
Thr Leu Phe Ala Leu Val Phe Tyr Thr Met Ser Leu Tyr Leu Leu Tyr
405 410 415
Leu Ala Val Arg Phe Leu Phe Leu Ala Ser Tyr Ala Val Ala Leu Phe
420 425 430
Phe Gly Ile Phe Ile Gly Phe Ser Met Asp Val Ile Glu Lys Met Lys
435 440 445
Glu Asn Ile Gly Ile Lys Ala Ala Leu Gly Ile Val Leu Ser Leu Met
450 455 460
Ile Leu Val Ile Pro Phe Val His Ala Pro Val Leu Ala Arg Ser Ala
465 470 475 480
Arg Ala Leu Lys Asn Thr Glu Ile Glu Val Thr Gly Trp Glu Gln Ala
485 490 495
Leu Lys Trp Leu Arg Ser Asn Thr Ser Lys Tyr Ala Thr Ala Thr Ser
500 505 510
Trp Trp Asp Tyr Gly Tyr Trp Ile Glu Ser Ser Leu Leu Gly Asn Arg
515 520 525
Arg Ala Ser Ala Asp Gly Gly His Ala Arg Asp Arg Asp His Ile Leu
530 535 540
Ala Leu Phe Leu Ala Arg Asp Gly Asn Ile Ser Glu Val Asp Phe Glu
545 550 555 560
Ser Trp Glu Leu Asn Tyr Phe Ile Ile Tyr Leu Asn Asp Trp Ala Lys
565 570 575
Phe Asn Ala Ile Ser Tyr Leu Gly Gly Ala Ile Thr Arg Lys Glu Tyr
580 585 590
Asn Gly Asp Glu Asn Gly Arg Gly Arg Val Thr Thr Ile Leu Leu Thr
595 600 605
Gln Ala Ala Gly Asn Val Tyr Val Asn Pro Tyr Ala Arg Ile Val Ile
610 615 620
Lys Val Ile Gln Gln Asn Lys Thr Arg Arg Ile Ala Val Asn Ile Gly
625 630 635 640
Gln Leu Glu Cys Ser Pro Ile Leu Ser Val Ala Phe Pro Gly Asn Ile
645 650 655
Lys Ile Lys Gly Ser Gly Arg Cys Ser Asp Gly Ser Pro Phe Pro Tyr
660 665 670
Val Val Tyr Leu Thr Pro Ser Leu Gly Val Leu Ala Tyr Tyr Lys Val
675 680 685
Ala Thr Ser Asn Phe Val Lys Leu Ala Phe Gly Ile Pro Thr Ser Ser
690 695 700
Tyr Ser Glu Phe Ala Glu Lys Leu Phe Ser Asn Phe Ile Pro Val Tyr
705 710 715 720
Gln Tyr Gly Ser Val Ile Val Tyr Glu Phe Arg Pro Phe Ala Ile Tyr
725 730 735
Lys Ile Glu Asp Phe Ile Asn Gly Thr Trp Arg Glu Val Gly Lys Leu
740 745 750
Ser Pro Gly Lys His Thr Leu Arg Leu Tyr Ile Ser Ala Phe Gly Arg
755 760 765
Asp Ile Lys Asn Ala Thr Leu Tyr Val Tyr Ala Leu Asn Gly Thr Lys
770 775 780
Ile Ile Lys Arg Ile Lys Val Gly Glu Ile Lys Tyr Met Asn His Leu
785 790 795 800
Glu Glu Tyr Pro Ile Ile Val Asn Val Thr Leu Pro Thr Ala Gln Lys
805 810 815
Tyr Arg Phe Ile Leu Ala Gln Lys Gly Pro Val Gly Val Leu Thr Gly
820 825 830
Pro Val Arg Val Asn Gly Lys Ile Thr Asn Pro Ala Tyr Ile Met Arg
835 840 845
Glu Gly Glu Ser Gly Arg Leu Glu Leu Lys Val Gly Val Asp Lys Glu
850 855 860
Tyr Thr Ala Asp Leu Tyr Leu Arg Ala Thr Phe Ile Tyr Leu Val Arg
865 870 875 880
Lys Gly Gly Lys Ser Asn Glu Asp Tyr Asp Ala Ser Phe Glu Pro His
885 890 895
Met Asp Thr Phe Phe Ile Thr Lys Leu Lys Glu Gly Ile Lys Leu Arg
900 905 910
Pro Gly Glu Asn Glu Ile Val Val Asn Ala Glu Met Pro Lys Asn Ala
915 920 925
Ile Ser Ser Tyr Lys Glu Lys Leu Glu Lys Glu His Gly Asp Lys Leu
930 935 940
Ile Ile Arg Gly Ile Arg Val Glu Pro Val Phe Ile Val Glu Lys Glu
945 950 955 960
Tyr Thr Met Ile Glu Val Ser Ala Ser Ala Pro His His Ser Ser Glu
965 970 975
<210> 16
<211> 976
<212> PRT
<213> 深海火球菌
<400> 16
Met Val Lys Thr Lys Val Lys Glu Glu Lys Glu Glu Lys Ser Glu Lys
1 5 10 15
Ser Glu Gly Lys Ser Leu Tyr Pro Leu Leu Lys Arg Ile Leu Ile Pro
20 25 30
Leu Ala Val Ile Gly Phe Gly Ile Tyr Ala Tyr Tyr Leu Arg His Leu
35 40 45
Thr Ala Gly Lys Tyr Phe Pro Asp Pro Asp Thr Phe Tyr His Phe Glu
50 55 60
Ile Tyr Lys Leu Val Leu Lys Glu Gly Leu Pro Lys Tyr Tyr Pro Met
65 70 75 80
Ala Glu Ala Pro Phe Gly Ser Leu Ile Gly Glu Pro Leu Gly Leu Tyr
85 90 95
Ile Leu Pro Ala Ile Phe Tyr Lys Val Val Ser Val Phe Gly Tyr Asn
100 105 110
Glu Phe Gln Ala Phe Leu Met Trp Pro Pro Phe Val Gly Phe Leu Gly
115 120 125
Val Ile Ala Val Tyr Leu Leu Gly Arg Lys Val Leu Asn Glu Trp Ala
130 135 140
Gly Leu Trp Ala Ala Val Ile Leu Ser Val Ser Thr Ala Asn Phe Ser
145 150 155 160
Arg Thr Phe Ser Gly Asn Ala Arg Gly Asp Gly Pro Phe Met Thr Leu
165 170 175
Phe Leu Phe Ser Leu Val Ala Met Leu Tyr Tyr Leu Lys Glu Asn Asp
180 185 190
Ile Lys Lys Lys Ser Leu Trp Gly Ala Val Phe Val Leu Leu Ala Ser
195 200 205
Ile Ser Leu Gly Ala Trp Asn Gly Ser Pro Phe Gly Leu Met Val Leu
210 215 220
Ile Gly Phe Ala Ser Phe Gln Thr Ile Ala Leu Phe Ile Phe Gly Lys
225 230 235 240
Ile Lys Glu Leu Lys Lys Phe Val Lys Glu Phe Tyr Pro Ala Tyr Leu
245 250 255
Ala Ile Leu Ala Ile Gly Tyr Gly Leu Thr Ile Pro Gly Ile Ala Lys
260 265 270
Ile Gly Gly Phe Ile Lys Phe Ala Phe Glu Val Phe Leu Gly Leu Val
275 280 285
Leu Leu Val Thr Ile Met Leu Tyr Gly Gly Lys Phe Leu Asn Tyr Ser
290 295 300
Asp Lys Lys His Arg Phe Ala Val Val Ala Val Ile Val Leu Leu Gly
305 310 315 320
Phe Ala Gly Ala Tyr Ala Tyr Val Gly Pro Lys Leu Phe Arg Leu Met
325 330 335
Gly Gly Ala Tyr Gln Ser Thr Gln Val Tyr Gln Thr Val Gln Glu Leu
340 345 350
Ala Lys Thr Thr Leu Ser Asp Ile Lys Leu Tyr Tyr Gly Val Glu Gly
355 360 365
Asn Asn Gly Leu Val Phe Phe Leu Ser Ile Pro Gly Phe Leu Ile Ile
370 375 380
Leu Gly Leu Tyr Leu Asn Ala Leu Leu Lys Lys Ser Glu Ser Ser Asn
385 390 395 400
Glu Tyr Met Leu Ser Leu Val Phe Tyr Ile Met Ser Leu Tyr Leu Leu
405 410 415
Ser Leu Ala Val Arg Phe Leu Phe Leu Ala Ser Tyr Ala Ile Ala Leu
420 425 430
Phe Ser Gly Ile Phe Ala Gly Phe Thr Met Glu Val Ile Glu Lys Met
435 440 445
Lys Glu Asn Val Gly Ile Lys Ala Ala Leu Gly Ile Ala Ile Ala Val
450 455 460
Met Ile Leu Met Val Pro Ile Thr His Gly Pro Val Ile Ala Arg Asn
465 470 475 480
Ala Lys Ala Leu Lys Val Ser Glu Ile Glu Thr Thr Gly Trp Glu Gln
485 490 495
Val Leu Lys Trp Leu Asn Glu Asn Thr Ser Lys Tyr Ala Thr Ala Thr
500 505 510
Ser Trp Trp Asp Tyr Gly Tyr Trp Ile Glu Ser Ser Leu Leu Gly His
515 520 525
Arg Arg Ala Ser Ala Asp Gly Gly His Ala Arg Asp Arg Asp His Ile
530 535 540
Leu Ala Leu Phe Leu Ala Arg Asp Gly Asn Val Ser Glu Val Asp Phe
545 550 555 560
Glu Ser Trp Glu Leu Asn Tyr Phe Ile Ile Tyr Leu Asn Asp Trp Ala
565 570 575
Lys Phe Asn Ala Ile Ser Tyr Leu Gly Gly Ala Ile Thr Arg Arg Glu
580 585 590
Tyr Asn Gly Asp Glu Thr Gly Arg Gly Gln Val Thr Thr Ile Leu Pro
595 600 605
Leu Gln Gly Ser Gly Gly Ile Tyr Val Asn Pro Tyr Ala Gly Ile Ser
610 615 620
Val Arg Val Val Gln Ser Asn Thr Thr Ser Lys Val Thr Val Asn Val
625 630 635 640
Arg Gly Arg Ala Glu Cys Ser Pro Ile Tyr Thr Leu Leu Ile Pro Gly
645 650 655
Asn Lys Lys Ile Pro Gly Asn Gly Arg Cys Ser Asp Gly Ser Pro Phe
660 665 670
Pro Tyr Val Leu Tyr Leu Ala Pro Asn Phe Gly Leu Ile Thr Tyr Tyr
675 680 685
Lys Val Ala Thr Ser Asn Phe Ile Lys Leu Ala Phe Asn Ile Pro Ile
690 695 700
Ser Lys Tyr Ser Gly Phe Thr Glu Lys Leu Tyr Ser Asn Phe Val Pro
705 710 715 720
Val Tyr Gly Tyr Gly Asn Val Ile Val Tyr Glu Phe Arg Pro Phe Ala
725 730 735
Ile Tyr Arg Ile Glu Glu Leu Ile Asn Gly Thr Trp Lys Ala Val Asn
740 745 750
Ser Leu Thr Pro Gly Lys His Glu Leu Lys Leu Tyr Ile Ser Ala Phe
755 760 765
Gly Arg Asp Ile Arg Asn Ala Thr Leu Tyr Val Tyr Ala Ile Gly Asn
770 775 780
Lys Thr Glu Lys Ile Lys Ile Gly Glu Ile Glu Tyr Met Asn His Leu
785 790 795 800
Asn Glu Lys Pro Ile Ile Val Asn Val Thr Leu Pro Lys Ala Glu Lys
805 810 815
Tyr Arg Leu Val Leu Val Gln Lys Gly Pro Val Gly Val Leu Thr Gly
820 825 830
Pro Pro Lys Leu Asn Gly Glu Ile Ala Asn Pro Ile Arg Ile Ala Arg
835 840 845
Glu Gly Glu Lys Gly Thr Leu Ser Leu Lys Val Gly Val Asp Lys Asp
850 855 860
Tyr Thr Ala Asp Leu Tyr Leu Arg Ala Thr Phe Ile Tyr Leu Val Arg
865 870 875 880
Lys Glu Gly Lys Ser Asn Asp Asp Tyr Asn Ala Ala Phe Glu Pro His
885 890 895
Met Asp Thr Phe Phe Ile Thr Lys Leu Lys Gly Gly Ile Lys Leu His
900 905 910
Lys Gly Asp Asn Val Val Thr Ala Glu Leu Asn Met Pro Asn Gly Val
915 920 925
Ile Ser Ser Tyr Lys Glu Lys Leu Glu Lys Glu Tyr Gly Asp Lys Leu
930 935 940
Ile Ile Arg Gly Ile Arg Val Glu Pro Val Phe Ile Ala Glu Lys Glu
945 950 955 960
Tyr Val Met Ala Glu Val Arg Ala Ser Ala Pro His His Gly Ser Glu
965 970 975
<210> 17
<211> 972
<212> PRT
<213> 专性嗜压超嗜热火球菌
<400> 17
Met Val Lys Thr Lys Val Lys Arg Glu Lys Arg Glu Glu Lys Ala Pro
1 5 10 15
Glu His Arg Pro Lys Thr Leu Val Val Phe Phe Lys Arg Phe Gly Ile
20 25 30
Pro Leu Ile Val Leu Ala Phe Ala Thr Leu Gly Phe Tyr Ile Arg Tyr
35 40 45
Leu Pro Gly Thr Gly Lys Tyr Phe Ile Asp Pro Asp Thr Tyr Tyr His
50 55 60
Tyr Glu Ile Tyr Lys Leu Val Leu Lys Glu Gly Leu Pro Arg Tyr Tyr
65 70 75 80
Ser Met Ala Glu Ala Pro Phe Gly Ser Leu Ile Gly Glu Pro Leu Gly
85 90 95
Leu Tyr Leu Leu Pro Ala Ile Phe Tyr Lys Leu Ile Ser Ala Phe Gly
100 105 110
Tyr Thr Thr Leu Gln Ala Phe Lys Leu Trp Pro Pro Thr Val Gly Phe
115 120 125
Leu Ser Ile Ile Ala Thr Tyr Leu Leu Ala Arg Lys Ile His Gly Glu
130 135 140
Trp Ala Gly Leu Trp Ser Ala Ala Ile Met Ser Phe Leu Leu Ala His
145 150 155 160
Phe Thr Arg Thr Phe Ser Gly Asn Ala Arg Gly Asp Gly Pro Phe Leu
165 170 175
Met Leu Phe Leu Phe Ala Ser Val Ala Met Leu Tyr Tyr Leu Glu Ala
180 185 190
Lys Asp Val Lys Arg Lys Met Val Tyr Gly Thr Leu Phe Val Ala Leu
195 200 205
Ser Val Ile Ala Leu Ser Ala Trp Asn Gly Ser Pro Phe Ser Leu Met
210 215 220
Val Phe Leu Gly Phe Gly Ala Leu Gln Ala Ile Val Leu Phe Val Phe
225 230 235 240
Gly Arg Ile Glu Glu Leu Arg Glu Phe Ile Lys Leu Tyr Tyr Pro Thr
245 250 255
Tyr Leu Thr Val Leu Leu Leu Gly Tyr Leu Leu Thr Phe Pro Arg Ile
260 265 270
Val Ala Val Lys Gly His Ile Leu Phe Ala Leu Lys Val Phe Leu Gly
275 280 285
Leu Ala Gly Leu Thr Val Leu Met Leu Tyr Gly Gly Lys Trp Leu Asn
290 295 300
Tyr Ser Asp Arg Arg His Arg Phe Ala Val Val Ala Val Val Thr Leu
305 310 315 320
Leu Gly Phe Val Gly Ala Tyr Ile Tyr Val Gly Pro Lys Leu Phe Ser
325 330 335
Leu Met Ala Gly Ala Tyr Gln Ser Thr Gln Val Tyr Glu Thr Val Gln
340 345 350
Glu Leu Ala Lys Thr Thr Leu Gly Asp Ile Lys Ala Tyr Tyr Gly Ile
355 360 365
Lys Gly Thr Asp Gly Ile Val Phe Phe Met Ser Leu Ala Gly Val Leu
370 375 380
Val Leu Leu Tyr Arg Tyr Leu Thr Thr Leu Leu Arg Glu Gly Arg Ser
385 390 395 400
Ser His Glu Tyr Leu Phe Ala Leu Thr Leu Tyr Gly Met Ser Leu Tyr
405 410 415
Leu Val Trp Ser Ala Val Arg Phe Leu Phe Leu Ala Ser Gly Ala Val
420 425 430
Ile Leu Met Ala Gly Val Phe Ala Gly Glu Leu Phe Arg Ile Ile Glu
435 440 445
Asp Met Lys Glu Lys Ala Thr Thr Lys Ile Thr Leu Gly Leu Ala Leu
450 455 460
Thr Val Met Leu Leu Leu Met Pro Val Thr Gly Val Pro Leu Met Ile
465 470 475 480
Asn Thr Ala Lys Ala Met Lys Thr Ser Glu Val Glu Arg Ser Gly Trp
485 490 495
Glu Asp Ala Leu Met Trp Leu Arg Glu Asn Thr Ser Glu Tyr Ala Thr
500 505 510
Ala Thr Ser Trp Trp Asp Tyr Gly Tyr Trp Ile Glu Ser Ser Leu Leu
515 520 525
Gly Asn Arg Arg Ala Ser Ala Asp Gly Gly His Ala Arg Asp Arg Asp
530 535 540
His Ile Leu Ala Leu Phe Leu Ala Arg Asp Gly Asn Val Ser Glu Val
545 550 555 560
Asp Phe Glu Ser Trp Glu Leu Asn Tyr Phe Ile Ala Tyr Met Gln Asp
565 570 575
Trp Arg Lys Phe Asn Ala Ile Ser Tyr Leu Gly Gly Ala Ile Thr Arg
580 585 590
Arg Glu Tyr Lys Gly Asp Glu Ser Gly Arg Gly Gly Val Thr Thr Ile
595 600 605
Val Leu Leu Pro Gly Ala Asn Gly Val Tyr Ser Asn Pro Tyr Met Gly
610 615 620
Leu Thr Leu Arg Val Glu Asn Arg Thr Val Lys Val Asn Gly Tyr Cys
625 630 635 640
Glu Pro Met Glu Ser Val Ile Leu Pro Ser Asn Thr His Ile Lys Gly
645 650 655
Ser Gly Gln Cys Glu Thr Gly Ser Tyr Phe Pro Tyr Val Ala Tyr Val
660 665 670
Thr Pro Thr Phe Ala Val Leu Ala Tyr Tyr Lys Val Ala Thr Ser Asn
675 680 685
Phe Leu Lys Leu Ala Phe Gly Ile Pro Ala Ser Lys Glu Ala Asn Phe
690 695 700
Thr Glu Lys Leu Tyr Ala Asn Phe Glu Leu Val Phe Gln Ser Gly Asp
705 710 715 720
Val Ile Val Tyr Glu Phe Lys Pro Phe Ala Val Tyr Lys Ala Glu Glu
725 730 735
Leu Val Asn Gly Thr Trp Arg Ala Val Glu Thr Leu Thr Pro Gly Glu
740 745 750
His Thr Leu Lys Leu Tyr Ile Ser Ala Phe Gly Arg Asp Val Lys Asn
755 760 765
Ala Thr Leu Tyr Val Glu Ala Leu Lys Asp Gly Lys Val Val Glu Arg
770 775 780
Ile Lys Val Ala Glu Gly Leu Tyr Ile Asp His Leu Asn Glu Lys Pro
785 790 795 800
Ile Glu Val Lys Val Asn Leu Pro Glu Ala Asp Glu Tyr Arg Phe Val
805 810 815
Leu Val Gln Lys Gly Pro Val Gly Val Leu Thr Ser Ala Pro Arg Val
820 825 830
Asn Gly Ser Ile Ala Asn Pro Ile Lys Val Leu Gly Glu Gly Gln Ser
835 840 845
Gly Thr Leu Glu Leu Lys Ala Ala Phe Asp Arg Asp Tyr Thr Ala Asp
850 855 860
Leu Tyr Leu Arg Val Thr Phe Ile Tyr Leu Val Arg Lys Ser Gly Arg
865 870 875 880
Ser Asn Asp Asp Ile Asp Ala Ala Phe Glu Pro His Met Asp Thr Phe
885 890 895
Phe Ala Ala Lys Leu Ala Glu Gly Leu Lys Leu Lys Lys Gly Glu Asp
900 905 910
Thr Ile Thr Val Asn Ala Gly Leu Pro Ala Gly Val Ile Ser Ser Tyr
915 920 925
Glu Glu Lys Leu Lys Ala Leu Tyr Gly Asp Arg Leu Ile Ile Arg Gly
930 935 940
Ile Arg Val Glu Pro Val Phe Ile Ala Asp Lys Ala Tyr Thr Ile Trp
945 950 955 960
Glu Val Arg Ala Ser Ala Pro His His Gly Ser Glu
965 970
<210> 18
<211> 982
<212> PRT
<213> 人工的
<220>
<223> 火球菌属STT3共有序列
<220>
<221> misc_feature
<222> (1)..(1)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (4)..(29)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (31)..(40)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (42)..(48)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (50)..(50)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (52)..(54)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (56)..(56)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (59)..(59)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (64)..(64)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (67)..(67)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (74)..(74)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (80)..(80)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (83)..(86)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (101)..(101)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (105)..(105)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (109)..(110)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (112)..(112)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (116)..(119)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (122)..(123)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (127)..(127)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (132)..(136)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (140)..(140)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (143)..(145)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (148)..(148)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (150)..(150)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (152)..(152)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (154)..(160)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (162)..(162)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (164)..(164)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (178)..(179)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (182)..(182)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (184)..(190)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (193)..(199)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (201)..(203)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (205)..(206)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (209)..(209)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (211)..(216)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (224)..(224)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (228)..(229)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (232)..(234)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (236)..(236)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (238)..(238)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (241)..(241)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (244)..(246)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (249)..(250)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (252)..(255)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (258)..(258)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (260)..(262)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (264)..(266)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (268)..(268)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (271)..(271)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (273)..(282)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (285)..(286)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (292)..(293)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (295)..(297)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (302)..(304)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (310)..(311)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (315)..(315)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (318)..(323)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (326)..(326)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (330)..(330)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (338)..(338)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (341)..(341)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (351)..(351)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (360)..(362)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (364)..(364)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (366)..(366)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (370)..(374)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (376)..(377)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (380)..(383)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (385)..(391)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (394)..(395)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (397)..(412)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (414)..(414)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (417)..(417)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (420)..(422)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (424)..(424)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (432)..(432)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (434)..(435)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (437)..(438)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (440)..(442)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (444)..(449)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (451)..(451)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (455)..(458)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (460)..(461)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (463)..(468)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (470)..(470)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (472)..(473)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (475)..(478)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (480)..(484)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (486)..(491)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (493)..(493)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (495)..(496)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (500)..(501)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (503)..(503)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (506)..(507)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (510)..(511)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (513)..(513)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (532)..(532)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (559)..(559)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (573)..(574)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (576)..(577)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (580)..(580)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (592)..(592)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (594)..(595)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (598)..(598)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (602)..(602)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (606)..(619)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (621)..(621)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (623)..(623)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (625)..(630)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (632)..(649)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (651)..(651)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (653)..(658)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (660)..(665)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (667)..(667)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (669)..(669)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (671)..(672)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (674)..(675)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (678)..(678)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (680)..(683)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (685)..(690)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (697)..(697)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (700)..(700)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (704)..(706)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (708)..(708)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (710)..(713)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (715)..(716)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (719)..(720)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (723)..(724)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (726)..(728)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (730)..(730)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (732)..(733)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (735)..(735)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (737)..(737)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (740)..(741)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (743)..(748)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (751)..(751)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (753)..(757)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (759)..(759)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (762)..(762)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (764)..(764)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (766)..(766)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (776)..(777)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (783)..(784)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (786)..(794)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (796)..(805)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (807)..(807)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (809)..(809)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (812)..(812)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (814)..(814)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (816)..(816)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (819)..(819)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (821)..(822)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (824)..(826)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (828)..(828)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (833)..(833)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (837)..(842)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (845)..(845)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (847)..(848)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (850)..(855)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (857)..(858)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (860)..(862)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (865)..(867)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (869)..(870)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (872)..(872)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (879)..(879)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (888)..(888)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (890)..(891)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (893)..(893)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (895)..(896)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (898)..(898)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (902)..(902)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (905)..(905)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (908)..(916)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (918)..(919)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (921)..(930)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (932)..(934)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (937)..(937)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (939)..(941)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (943)..(946)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (949)..(949)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (963)..(964)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (966)..(966)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (968)..(970)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (973)..(973)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (980)..(982)
<223> Xaa可以是任意天然存在的氨基酸
<400> 18
Xaa Val Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Xaa Xaa
20 25 30
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa
35 40 45
Arg Xaa Leu Xaa Xaa Xaa Gly Xaa Tyr Phe Xaa Asp Pro Asp Thr Xaa
50 55 60
Tyr His Xaa Glu Ile Tyr Lys Leu Val Xaa Lys Glu Gly Leu Pro Xaa
65 70 75 80
Tyr Tyr Xaa Xaa Xaa Xaa Ala Pro Phe Gly Ser Leu Ile Gly Glu Pro
85 90 95
Leu Gly Leu Tyr Xaa Leu Pro Ala Xaa Phe Tyr Lys Xaa Xaa Ser Xaa
100 105 110
Phe Gly Tyr Xaa Xaa Xaa Xaa Ala Phe Xaa Xaa Trp Pro Pro Xaa Val
115 120 125
Gly Phe Leu Xaa Xaa Xaa Xaa Xaa Tyr Leu Leu Xaa Arg Lys Xaa Xaa
130 135 140
Xaa Glu Trp Xaa Gly Xaa Trp Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa
145 150 155 160
Ala Xaa Phe Xaa Arg Thr Phe Ser Gly Asn Ala Arg Gly Asp Gly Pro
165 170 175
Phe Xaa Xaa Leu Phe Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Leu
180 185 190
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys Xaa Xaa Xaa Gly Xaa Xaa Phe Val
195 200 205
Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Ala Trp Asn Gly Ser Pro Phe Xaa
210 215 220
Leu Met Val Xaa Xaa Gly Phe Xaa Xaa Xaa Gln Xaa Ile Xaa Leu Phe
225 230 235 240
Xaa Phe Gly Xaa Xaa Xaa Glu Leu Xaa Xaa Phe Xaa Xaa Xaa Xaa Tyr
245 250 255
Pro Xaa Tyr Xaa Xaa Xaa Leu Xaa Xaa Xaa Tyr Xaa Leu Thr Xaa Pro
260 265 270
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Ala Xaa Xaa Val Phe
275 280 285
Leu Gly Leu Xaa Xaa Leu Xaa Xaa Xaa Met Leu Tyr Gly Xaa Xaa Xaa
290 295 300
Leu Asn Tyr Ser Asp Xaa Xaa His Arg Phe Xaa Val Val Xaa Xaa Xaa
305 310 315 320
Xaa Xaa Xaa Gly Phe Xaa Gly Ala Tyr Xaa Tyr Val Gly Pro Lys Leu
325 330 335
Phe Xaa Leu Met Xaa Gly Ala Tyr Gln Ser Thr Gln Val Tyr Xaa Thr
340 345 350
Val Gln Glu Leu Ala Lys Lys Xaa Xaa Xaa Asp Xaa Lys Xaa Tyr Tyr
355 360 365
Gly Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Phe Phe Xaa Xaa Xaa Xaa Gly
370 375 380
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa
385 390 395 400
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Met Ser
405 410 415
Xaa Tyr Leu Xaa Xaa Xaa Ala Xaa Arg Phe Leu Phe Leu Ala Ser Xaa
420 425 430
Ala Xaa Xaa Leu Xaa Xaa Gly Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa
435 440 445
Xaa Glu Xaa Met Lys Glu Xaa Xaa Xaa Xaa Lys Xaa Xaa Leu Xaa Xaa
450 455 460
Xaa Xaa Xaa Xaa Met Xaa Leu Xaa Xaa Pro Xaa Xaa Xaa Xaa Pro Xaa
465 470 475 480
Xaa Xaa Xaa Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Glu Xaa Glu Xaa Xaa
485 490 495
Gly Trp Glu Xaa Xaa Leu Xaa Trp Leu Xaa Xaa Asn Thr Xaa Xaa Tyr
500 505 510
Xaa Thr Ala Thr Ser Trp Trp Asp Tyr Gly Tyr Trp Ile Glu Ser Ser
515 520 525
Leu Leu Gly Xaa Arg Arg Ala Ser Ala Asp Gly Gly His Ala Arg Asp
530 535 540
Arg Asp His Ile Leu Ala Leu Phe Leu Ala Arg Asp Gly Asn Xaa Ser
545 550 555 560
Glu Val Asp Phe Glu Ser Trp Glu Leu Asn Tyr Phe Xaa Xaa Tyr Xaa
565 570 575
Xaa Asp Trp Xaa Lys Phe Asn Ala Ile Ser Tyr Leu Gly Gly Ala Xaa
580 585 590
Thr Xaa Xaa Glu Tyr Xaa Gly Asp Glu Xaa Gly Arg Gly Xaa Xaa Xaa
595 600 605
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Asn Xaa Tyr
610 615 620
Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
625 630 635 640
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Pro Xaa Xaa Xaa Xaa
645 650 655
Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Gly Xaa Cys Xaa Xaa
660 665 670
Gly Xaa Xaa Phe Pro Xaa Val Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Xaa
675 680 685
Xaa Xaa Tyr Tyr Lys Val Ala Thr Xaa Asn Phe Xaa Lys Leu Ala Xaa
690 695 700
Xaa Xaa Pro Xaa Ser Xaa Xaa Xaa Xaa Phe Xaa Xaa Lys Leu Xaa Xaa
705 710 715 720
Asn Phe Xaa Xaa Val Xaa Xaa Xaa Gly Xaa Val Xaa Xaa Tyr Xaa Phe
725 730 735
Xaa Pro Phe Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Asn Gly Xaa Trp
740 745 750
Xaa Xaa Xaa Xaa Xaa Leu Xaa Pro Gly Xaa His Xaa Leu Xaa Leu Tyr
755 760 765
Ile Ser Ala Phe Gly Arg Asp Xaa Xaa Asn Ala Thr Leu Tyr Xaa Xaa
770 775 780
Ala Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ile Xaa Xaa Xaa Xaa Xaa
785 790 795 800
Xaa Xaa Xaa Xaa Xaa Leu Xaa Glu Xaa Pro Ile Xaa Val Xaa Val Xaa
805 810 815
Leu Pro Xaa Ala Xaa Xaa Tyr Xaa Xaa Xaa Leu Xaa Gln Lys Gly Pro
820 825 830
Xaa Gly Val Leu Xaa Xaa Xaa Xaa Xaa Xaa Asn Gly Xaa Ile Xaa Xaa
835 840 845
Pro Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Gly Xaa Xaa Xaa Leu Lys
850 855 860
Xaa Xaa Xaa Asp Xaa Xaa Tyr Xaa Ala Asp Leu Tyr Leu Arg Xaa Thr
865 870 875 880
Phe Ile Tyr Leu Val Arg Lys Xaa Gly Xaa Xaa Asn Xaa Asp Xaa Xaa
885 890 895
Ala Xaa Phe Glu Pro Xaa Met Asp Xaa Phe Phe Xaa Xaa Xaa Xaa Xaa
900 905 910
Xaa Xaa Xaa Xaa Leu Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
915 920 925
Xaa Xaa Pro Xaa Xaa Xaa Ile Ser Xaa Tyr Xaa Xaa Xaa Leu Xaa Xaa
930 935 940
Xaa Xaa Gly Asp Xaa Leu Ile Ile Arg Gly Ile Arg Val Glu Pro Val
945 950 955 960
Phe Ile Xaa Xaa Lys Xaa Tyr Xaa Xaa Xaa Glu Val Xaa Ala Ser Ala
965 970 975
Pro His His Xaa Xaa Xaa
980
<210> 19
<211> 833
<212> PRT
<213> 硕大利什曼原虫
<400> 19
Met Ala Ala Ala Ser Asn Val Asn Ala Pro Glu Ser Asn Val Met Thr
1 5 10 15
Thr Arg Ser Ala Val Ala Pro Pro Ser Thr Ala Ala Pro Lys Glu Ala
20 25 30
Ser Ser Glu Thr Leu Leu Ile Gly Leu Tyr Lys Met Pro Ser Gln Thr
35 40 45
Arg Ser Leu Ile Tyr Ser Ser Cys Phe Ala Val Ala Met Ala Ile Ala
50 55 60
Leu Pro Ile Ala Tyr Asp Met Arg Val Arg Ser Ile Gly Val Tyr Gly
65 70 75 80
Tyr Leu Phe His Ser Ser Asp Pro Trp Phe Asn Tyr Arg Ala Ala Glu
85 90 95
Tyr Met Ser Thr His Gly Trp Ser Ala Phe Phe Ser Trp Phe Asp Tyr
100 105 110
Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly Ser Thr Thr Tyr Pro
115 120 125
Gly Leu Gln Leu Thr Ala Val Ala Ile His Arg Ala Leu Ala Ala Ala
130 135 140
Gly Met Pro Met Ser Leu Asn Asn Val Cys Val Leu Met Pro Ala Trp
145 150 155 160
Phe Ser Leu Val Ser Ser Ala Met Ala Ala Leu Leu Ala His Glu Met
165 170 175
Ser Gly Asn Met Ala Val Ala Ser Ile Ser Ser Ile Leu Phe Ser Val
180 185 190
Val Pro Ala His Leu Met Arg Ser Met Ala Gly Glu Phe Asp Asn Glu
195 200 205
Cys Ile Ala Val Ala Ala Met Leu Leu Thr Phe Tyr Cys Trp Val Arg
210 215 220
Ser Leu Arg Thr Arg Ser Ser Trp Pro Ile Gly Val Leu Thr Gly Val
225 230 235 240
Ala Tyr Gly Tyr Met Ala Ala Ala Trp Gly Gly Tyr Ile Phe Val Leu
245 250 255
Asn Met Val Ala Met His Ala Gly Ile Ser Ser Met Val Asp Trp Ala
260 265 270
Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala Tyr Thr Leu Phe Tyr
275 280 285
Val Val Gly Thr Ala Ile Ala Val Cys Val Pro Pro Val Gly Met Ser
290 295 300
Pro Phe Lys Ser Leu Glu Gln Leu Gly Ala Leu Leu Val Leu Val Phe
305 310 315 320
Ile Phe Gly Gln Ser Val Cys Glu Ala Gln Arg Arg Arg Leu Gly Ile
325 330 335
Ala Arg Leu Ser Lys Glu Gly Val Ala Leu Leu Ile Arg Ile Asp Ala
340 345 350
Ala Phe Phe Val Gly Ile Val Ala Val Ala Thr Ile Ala Pro Ala Gly
355 360 365
Phe Phe Lys Pro Leu Ser Leu Gln Ala Asn Ala Ile Ile Thr Gly Val
370 375 380
Ser Arg Thr Gly Asn Thr Leu Val Asp Ile Leu Leu Ala Gln Asp Ala
385 390 395 400
Ser Asn Leu Leu Met Val Trp Gln Leu Phe Leu Phe Pro Phe Leu Gly
405 410 415
Trp Val Ala Gly Met Ser Ala Phe Leu Arg Glu Leu Ile Arg Asn Tyr
420 425 430
Thr Tyr Ala Lys Ser Phe Ile Leu Met Tyr Gly Val Val Gly Met Tyr
435 440 445
Phe Ala Ser Gln Ser Val Arg Met Met Val Met Met Ala Pro Val Ala
450 455 460
Cys Ile Phe Thr Ala Leu Leu Phe Arg Trp Ala Leu Asp Tyr Leu Leu
465 470 475 480
Gly Ser Leu Phe Trp Ala Glu Met Pro Pro Ser Phe Asp Thr Asp Ala
485 490 495
Gln Arg Gly Arg Gln Gln Gln Thr Ala Glu Glu Ser Glu Ala Glu Thr
500 505 510
Lys Arg Lys Glu Glu Glu Tyr Asn Thr Met Gln Val Lys Lys Met Ser
515 520 525
Val Arg Met Leu Pro Phe Met Leu Leu Leu Leu Leu Phe Arg Leu Ser
530 535 540
Gly Phe Ile Glu Asp Val Ala Ala Ile Ser Arg Lys Met Glu Ala Pro
545 550 555 560
Gly Ile Val Phe Pro Ser Glu Gln Val Gln Gly Val Ser Glu Lys Lys
565 570 575
Val Asp Asp Tyr Tyr Ala Gly Tyr Leu Tyr Leu Arg Asp Ser Thr Pro
580 585 590
Glu Asp Ala Arg Val Leu Ala Trp Trp Asp Tyr Gly Tyr Gln Ile Thr
595 600 605
Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly Asn Thr Trp Asn His
610 615 620
Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr Ser Pro Val Ala Glu
625 630 635 640
Ala His Ser Leu Val Arg His Met Ala Asp Tyr Val Leu Ile Ser Ala
645 650 655
Gly Asp Thr Tyr Phe Ser Asp Leu Asn Arg Ser Pro Met Met Ala Arg
660 665 670
Ile Gly Asn Ser Val Tyr His Asp Ile Cys Pro Asp Asp Pro Leu Cys
675 680 685
Ser Gln Phe Val Leu Gln Lys Arg Pro Lys Ala Ala Ala Ala Lys Arg
690 695 700
Ser Arg His Val Ser Val Asp Ala Leu Glu Glu Asp Asp Thr Ala Glu
705 710 715 720
His Met Val Tyr Glu Pro Ser Ser Leu Ile Ala Lys Ser Leu Ile Tyr
725 730 735
His Leu His Ser Thr Gly Val Val Thr Gly Val Thr Leu Asn Glu Thr
740 745 750
Leu Phe Gln His Val Phe Thr Ser Pro Gln Gly Leu Met Arg Ile Phe
755 760 765
Lys Val Met Asn Val Ser Thr Glu Ser Lys Lys Trp Val Ala Asp Ser
770 775 780
Ala Asn Arg Val Cys His Pro Pro Gly Ser Trp Ile Cys Pro Gly Gln
785 790 795 800
Tyr Pro Pro Ala Lys Glu Ile Gln Glu Met Leu Ala His Gln His Thr
805 810 815
Asn Phe Lys Asp Leu Leu Asp Pro Arg Thr Thr Trp Ser Gly Ser Arg
820 825 830
Arg
<210> 20
<211> 2502
<212> DNA
<213> 硕大利什曼原虫
<400> 20
atggcggcag cgtcaaacgt gaatgccccc gaaagcaacg tgatgacaac gagaagtgcc 60
gttgcaccac cgtcgacggc tgcacccaaa gaggcttcaa gtgaaacgct gctcattggc 120
ctatacaaga tgccctcgca aactcgtagc ctcatctact cctcctgctt tgcggtggcc 180
atggccattg ccctccctat cgcgtacgac atgcgtgtcc gctccatcgg cgtgtacggg 240
tacctcttcc acagcagtga cccgtggttc aactaccgcg ctgccgagta catgtccacg 300
cacggctggt ccgccttctt cagctggttc gactacatga gctggtaccc gctgggccgc 360
cccgtcggct ccaccacgta cccgggcctg cagctcactg ccgtcgccat tcaccgcgca 420
ctggcggctg ccggcatgcc gatgtctctc aacaacgtgt gcgtgctgat gccagcgtgg 480
ttttcacttg tctcttcagc gatggcggca ctgctggcgc atgagatgag cggcaatatg 540
gcggtagcca gcatctcgtc tatcttattc agtgtggttc cagcccacct gatgcggtcc 600
atggcgggtg agttcgacaa cgagtgtatc gccgtcgcag ccatgctcct caccttctac 660
tgctgggtgc gctcgctgcg cacgcggtcc tcgtggccca tcggtgtcct caccggtgtc 720
gcctacggct acatggcggc ggcgtggggc ggctacattt tcgtgctcaa catggttgcc 780
atgcatgccg gcatatcatc gatggtggac tgggcccgca acacgtacaa cccgtcgctg 840
ctgcgtgcat acacgctgtt ctacgtcgtg ggcaccgcca tcgccgtgtg cgtgccgcca 900
gtggggatgt cgcccttcaa gtcgctggag cagctgggtg cgctgctggt gcttgtcttc 960
attttcggtc agtctgtgtg tgaggcccag cgcagacgat tgggaatcgc gcgcctttca 1020
aaggagggcg tggcgctgct catccgcatc gacgcagcct tcttcgtcgg tatcgttgcc 1080
gtggccacca ttgccccggc tggattcttc aagccgctct ccctgcaagc gaacgcgata 1140
atcactggcg tatctcgtac cggaaacaca ctcgtagaca ttctgcttgc gcaagacgcg 1200
tccaacctac tcatggtgtg gcagcttttt ctctttccct tcttaggttg ggtggcgggc 1260
atgagcgcct tccttagaga gttgatccgg aactacacct acgcgaagag tttcatcctg 1320
atgtacggcg tggtcggtat gtacttcgcc agccagtctg tccgaatgat ggtgatgatg 1380
gcccccgtgg cgtgcatctt tactgccctc ttgttccgct gggcactgga ctacctcctc 1440
gggtctttgt tttgggctga gatgccacct tcctttgaca ccgacgcaca gcgtgggcgg 1500
cagcaacaga ccgccgagga gtcggaggca gagaccaagc gtaaggagga agagtacaac 1560
accatgcagg tcaagaagat gtcggtgcgc atgttgccct tcatgctgtt gctcttactg 1620
tttcgtcttt cggggttcat cgaagatgtg gcggcgatat cgcgcaagat ggaggcgccg 1680
ggtatagttt ttcccagtga acaggtgcaa ggcgtgtcgg agaaaaaggt cgacgactac 1740
tatgcggggt acctgtatct gcgcgacagc acgccagagg acgcgcgcgt tttggcctgg 1800
tgggactacg gctaccagat cacaggcatc ggcaaccgca cctcgctggc cgatggcaac 1860
acctggaacc acgagcacat cgccacgatc ggcaagatgc tgacgtcgcc cgtggcggag 1920
gcgcactcgc tggtgcgcca catggccgac tatgttctga tttctgctgg agacacatat 1980
ttttccgacc tgaatcgctc accgatgatg gcgcgcatcg gcaacagcgt gtaccacgac 2040
atctgccccg acgacccact ttgtagtcag ttcgtgttgc agaaaagacc gaaagctgct 2100
gcagcgaagc gcagtcggca cgtcagcgtt gacgcactag aggaggatga cactgcagag 2160
catatggtat acgagccgtc atcactcata gccaagtcgc tcatatatca cctgcactcc 2220
acaggggtgg tgacgggggt cacgctgaat gagacgctct tccagcacgt cttcacctca 2280
ccgcagggtc tcatgcgcat cttcaaggtc atgaacgtga gcacggagag caaaaagtgg 2340
gttgctgact cggcaaaccg cgtgtgccac ccgcctgggt cgtggatctg ccccgggcag 2400
tacccgccgg cgaaggagat ccaggagatg ctggcacacc aacacaccaa cttcaaggac 2460
cttcttgatc ccagaacgac ttggagcggg agcaggcgct ga 2502
<210> 21
<211> 794
<212> PRT
<213> 杜氏利什曼原虫
<400> 21
Met Ser Ser Gln Thr Arg Ser Ile Ile Tyr Ser Ser Cys Phe Ala Val
1 5 10 15
Ala Met Ala Ile Ala Leu Pro Ile Ala Tyr Asp Met Arg Val Arg Ser
20 25 30
Ile Gly Val Tyr Gly Tyr Leu Phe His Arg Ser Asp Pro Trp Phe Asn
35 40 45
Tyr Arg Ala Ala Glu Tyr Met Ser Thr His Gly Trp Ser Ala Phe Phe
50 55 60
Ser Trp Phe Asp Tyr Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly
65 70 75 80
Ser Thr Thr Tyr Pro Gly Leu Gln Leu Thr Ala Val Ala Ile His Arg
85 90 95
Ala Leu Ala Ala Ala Gly Met Pro Met Ser Leu Asn Asn Val Cys Val
100 105 110
Leu Met Pro Ala Trp Phe Ser Leu Val Ser Ser Ala Met Val Ala Leu
115 120 125
Leu Ala His Glu Leu Ser Gly Asn Met Ala Val Ala Ser Ile Ser Ser
130 135 140
Ile Leu Phe Ser Val Val Pro Ala His Leu Met Arg Ser Met Ala Gly
145 150 155 160
Glu Phe Asp Asn Glu Cys Ile Ala Val Ala Ala Met Leu Leu Thr Phe
165 170 175
Tyr Cys Trp Val Arg Ser Leu Arg Thr Arg Ser Ser Trp Pro Ile Gly
180 185 190
Val Leu Thr Gly Val Ala Tyr Gly Tyr Met Val Ala Ala Trp Gly Gly
195 200 205
Tyr Ile Phe Val Leu Asn Met Val Ala Met His Ala Gly Ile Ser Ser
210 215 220
Met Val Asp Trp Ala Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala
225 230 235 240
Tyr Thr Leu Phe Tyr Val Val Gly Thr Ala Ile Ala Val Cys Val Pro
245 250 255
Pro Val Gly Met Ser Pro Phe Lys Ser Leu Glu Gln Leu Gly Ala Leu
260 265 270
Leu Val Leu Leu Phe Ile Phe Gly Gln Ser Val Cys Glu Ala Gln Arg
275 280 285
Arg Arg Leu Glu Ile Ala Arg Phe Ser Lys Glu Gly Val Ala Leu Leu
290 295 300
Ile Arg Ile Tyr Ala Ala Phe Phe Val Gly Ile Val Ala Val Ala Thr
305 310 315 320
Ile Ala Pro Ala Gly Phe Phe Lys Pro Leu Ser Leu Gln Ala Ser Ala
325 330 335
Ile Ile Thr Gly Val Ser Arg Thr Gly Asn Thr Leu Val Asp Thr Leu
340 345 350
Ile Ala Gln Asp Ala Ser Asn Leu Leu Ile Val Trp Gln Leu Phe Leu
355 360 365
Phe Pro Val Phe Gly Trp Val Ala Gly Met Ser Ala Phe Leu Thr Glu
370 375 380
Leu Val Arg Asn Tyr Thr Tyr Thr Lys Ser Phe Met Leu Met Tyr Gly
385 390 395 400
Val Val Gly Leu Tyr Phe Ala Ser Gln Ser Val Arg Met Met Val Met
405 410 415
Met Ala Pro Val Ala Cys Ile Phe Thr Ala Leu Leu Phe Arg Trp Ala
420 425 430
Leu Asp Tyr Leu Leu Gly Ser Leu Phe Trp Ala Glu Met Pro Pro Cys
435 440 445
Phe Asp Thr Asp Ala Gln Arg Gly Arg Gln Gln Gln Thr Ala Glu Glu
450 455 460
Ala Glu Ala Glu Thr Lys Arg Lys Glu Glu Glu Tyr Asn Thr Met Gln
465 470 475 480
Val Lys Lys Met Thr Thr Arg Met Leu Pro Phe Met Phe Leu Leu Leu
485 490 495
Leu Phe Arg Leu Ser Gly Phe Ile Glu Asp Val Ala Ala Ile Ser Arg
500 505 510
Glu Met Glu Ala Pro Gly Ile Val Phe Pro Ser Gly Gln Val Gln Gly
515 520 525
Val Ser Glu Lys Lys Val Asp Asp Tyr Tyr Ala Gly Tyr Leu Tyr Leu
530 535 540
Arg Asp Asn Thr Pro Glu Asp Ala Arg Ile Leu Ala Trp Trp Asp Tyr
545 550 555 560
Gly Tyr Gln Ile Thr Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly
565 570 575
Asn Thr Trp Asn His Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr
580 585 590
Ser Pro Val Ala Glu Ala His Ser Leu Val Arg His Met Ala Asp Tyr
595 600 605
Val Leu Ile Phe Ala Gly Asp Thr Tyr Phe Ser Asp Leu Asn Arg Ser
610 615 620
Pro His Met Ala Arg Ile Gly Asn Ser Val Tyr Arg Asp Ile Cys Pro
625 630 635 640
His Asp Pro Leu Cys Ser Arg Phe Val Leu Gln Lys Arg Pro Lys Ala
645 650 655
Ala Ala Ala Lys Arg Ser Arg His Val Ser Val Asp Glu Leu Glu Glu
660 665 670
Glu Asp Asn Ala Glu His Val Val Tyr Glu Pro Ser Ser Leu Met Ala
675 680 685
Lys Ser Leu Ile Tyr His Leu His Ser Ala Gly Val Val Thr Gly Val
690 695 700
Thr Leu Asn Glu Thr Leu Phe Gln His Val Phe Thr Ser Ala Gln Gly
705 710 715 720
Leu Ile Arg Ile Phe Lys Val Met Asn Val Ser Glu Glu Ser Lys Lys
725 730 735
Trp Val Ala Asp Pro Ala Asn Arg Val Cys His Pro Pro Gly Ser Trp
740 745 750
Ile Cys Pro Gly Gln Tyr Pro Pro Ala Lys Glu Ile Gln Glu Met Leu
755 760 765
Ala His Gln His Thr Asn Phe Lys Asp Leu Leu Asp Ala Met Asn Asp
770 775 780
Leu Glu Arg Glu Gln Ala Leu Asn Lys Glu
785 790
<210> 22
<211> 794
<212> PRT
<213> 婴儿利什曼原虫
<400> 22
Met Ser Ser Gln Thr Arg Ser Ile Ile Tyr Ser Ser Cys Phe Ala Val
1 5 10 15
Ala Met Ala Ile Ala Leu Pro Ile Ala Tyr Asp Met Arg Val Arg Ser
20 25 30
Ile Gly Val Tyr Gly Tyr Leu Phe His Arg Ser Asp Pro Trp Phe Asn
35 40 45
Tyr Arg Ala Ala Glu Tyr Met Ser Thr His Gly Trp Ser Ala Phe Phe
50 55 60
Ser Trp Phe Asp Tyr Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly
65 70 75 80
Ser Thr Thr Tyr Pro Gly Leu Gln Leu Thr Ala Val Ala Ile His Arg
85 90 95
Ala Leu Ala Ala Ala Gly Met Pro Met Ser Leu Asn Asn Val Cys Val
100 105 110
Leu Met Pro Ala Trp Phe Ser Leu Val Ser Ser Ala Met Val Ala Leu
115 120 125
Leu Ala His Glu Leu Ser Gly Asn Met Ala Val Ala Ser Ile Ser Ser
130 135 140
Ile Leu Phe Ser Val Ile Pro Ala His Leu Met Arg Ser Met Ala Gly
145 150 155 160
Glu Phe Asp Asn Glu Cys Ile Ala Val Ala Ala Met Leu Leu Thr Phe
165 170 175
Tyr Cys Trp Val Arg Ser Leu Arg Thr Arg Ser Ser Trp Pro Ile Gly
180 185 190
Val Leu Thr Gly Val Ala Tyr Gly Tyr Met Val Ala Ala Trp Gly Gly
195 200 205
Tyr Ile Phe Val Leu Asn Met Val Ala Met His Ala Gly Ile Ser Ser
210 215 220
Met Val Asp Trp Ala Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala
225 230 235 240
Tyr Thr Leu Phe Tyr Val Val Gly Thr Ala Ile Ala Val Cys Val Pro
245 250 255
Pro Val Gly Met Ser Pro Phe Lys Ser Leu Glu Gln Leu Gly Ala Leu
260 265 270
Leu Val Leu Leu Phe Ile Phe Gly Gln Ser Val Cys Glu Ala Gln Arg
275 280 285
Arg Arg Leu Glu Ile Ala Arg Phe Ser Lys Glu Gly Val Ala Leu Leu
290 295 300
Ile Arg Ile Tyr Ala Ala Phe Phe Val Gly Ile Val Ala Val Ala Thr
305 310 315 320
Ile Ala Pro Ala Gly Phe Phe Lys Pro Leu Ser Leu Gln Ala Ser Ala
325 330 335
Ile Ile Thr Gly Val Ser Arg Thr Gly Asn Thr Leu Val Asp Thr Leu
340 345 350
Ile Ala Gln Asp Ala Ser Asn Leu Leu Ile Val Trp Gln Leu Phe Leu
355 360 365
Phe Pro Val Phe Gly Trp Val Ala Gly Met Ser Ala Phe Leu Thr Glu
370 375 380
Leu Val Arg Asn Tyr Thr Tyr Thr Lys Ser Phe Met Leu Met Tyr Gly
385 390 395 400
Val Val Gly Leu Tyr Phe Ala Ser Gln Ser Val Arg Met Met Val Met
405 410 415
Met Ala Pro Val Ala Cys Ile Phe Thr Ala Leu Leu Phe Arg Trp Ala
420 425 430
Leu Asp Tyr Leu Leu Gly Ser Leu Phe Trp Ala Glu Met Pro Pro Cys
435 440 445
Phe Asp Thr Asp Ala Gln Arg Gly Arg Gln Gln Gln Thr Ala Glu Glu
450 455 460
Ala Glu Ala Glu Thr Lys Arg Lys Glu Glu Glu Tyr Asn Thr Met Gln
465 470 475 480
Val Lys Lys Met Thr Thr Arg Met Leu Pro Phe Met Phe Leu Leu Leu
485 490 495
Leu Phe Arg Leu Ser Gly Phe Ile Glu Asp Val Ala Ala Ile Ser Arg
500 505 510
Glu Met Glu Ala Pro Gly Ile Val Phe Pro Ser Gly Gln Val Gln Gly
515 520 525
Val Ser Glu Lys Lys Val Asp Asp Tyr Tyr Ser Gly Tyr Leu Tyr Leu
530 535 540
Arg Asp Asn Thr Pro Glu Asp Ala Arg Ile Leu Ala Trp Trp Asp Tyr
545 550 555 560
Gly Tyr Gln Ile Thr Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly
565 570 575
Asn Thr Trp Asn His Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr
580 585 590
Ser Pro Val Ala Glu Ala His Ser Leu Val Arg His Met Ala Asp Tyr
595 600 605
Val Leu Ile Phe Ala Gly Asp Thr Tyr Phe Ser Asp Leu Asn Arg Ser
610 615 620
Pro His Met Ala Arg Ile Gly Asn Ser Val Tyr Arg Asp Ile Cys Pro
625 630 635 640
His Asp Pro Leu Cys Ser Arg Phe Val Leu Gln Lys Arg Pro Lys Ala
645 650 655
Ala Ala Ala Lys Arg Ser Arg His Val Ser Val Asp Glu Leu Glu Glu
660 665 670
Glu Asp Asn Ala Glu His Val Val Tyr Glu Pro Ser Ser Leu Met Ala
675 680 685
Lys Ser Leu Ile Tyr His Leu His Ser Ala Gly Val Val Lys Gly Val
690 695 700
Thr Leu Asn Glu Thr Leu Phe Gln His Val Phe Thr Ser Ala Gln Gly
705 710 715 720
Leu Ile Arg Ile Phe Lys Val Met Asn Val Ser Glu Glu Ser Lys Lys
725 730 735
Trp Val Ala Asp Pro Ala Asn Arg Val Cys His Pro Pro Gly Ser Trp
740 745 750
Ile Cys Pro Gly Gln Tyr Pro Pro Ala Lys Glu Ile Gln Glu Met Leu
755 760 765
Ala His Gln His Thr Asn Phe Lys Asp Leu Leu Asp Ala Met Asn Asp
770 775 780
Leu Glu Arg Glu Gln Ala Leu Asn Lys Glu
785 790
<210> 23
<211> 794
<212> PRT
<213> 墨西哥利什曼原虫
<400> 23
Met Ser Ser Gln Thr Arg Ser Leu Ile Tyr Ser Ser Cys Phe Ala Val
1 5 10 15
Val Met Ala Ile Gly Leu Ser Ile Ala Tyr Asp Met Arg Val Arg Ser
20 25 30
Ile Gly Val Tyr Gly Tyr Leu Phe His Ser Ser Asp Pro Trp Phe Asn
35 40 45
Tyr Arg Ala Ala Glu Tyr Met Ser Thr His Gly Trp Ser Ala Phe Phe
50 55 60
Ser Trp Phe Asp Tyr Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly
65 70 75 80
Ser Thr Thr Tyr Pro Gly Leu Gln Phe Thr Ala Val Ala Ile His Arg
85 90 95
Ala Leu Ala Ala Ala Gly Met Pro Met Ser Leu Asn Asp Val Cys Val
100 105 110
Leu Ile Pro Ala Trp Phe Ser Leu Leu Ser Ser Ala Met Val Ala Leu
115 120 125
Leu Ala His Glu Ile Ser Gly Asn Met Ala Val Ala Ser Val Ser Ser
130 135 140
Ile Leu Phe Ser Val Val Pro Ala His Leu Met Arg Ser Met Ala Gly
145 150 155 160
Glu Phe Asp Asn Glu Cys Ile Ala Val Thr Ala Met Leu Leu Thr Phe
165 170 175
Tyr Cys Trp Val Arg Ser Leu Arg Thr Arg Ser Ser Trp Pro Ile Gly
180 185 190
Val Leu Thr Gly Val Ala Tyr Gly Tyr Met Val Ala Ala Trp Gly Gly
195 200 205
Tyr Ile Phe Val Leu Asn Met Val Ala Met His Ala Gly Ile Ser Ser
210 215 220
Met Val Asp Trp Ala Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala
225 230 235 240
Tyr Thr Leu Phe Tyr Val Val Gly Thr Ala Ile Ala Val Cys Val Pro
245 250 255
Pro Val Gly Met Ser Pro Phe Lys Ser Leu Glu Gln Leu Gly Ala Leu
260 265 270
Leu Val Leu Leu Phe Ile Phe Gly Gln Ala Leu Cys Glu Ala Gln Arg
275 280 285
Ser Arg Leu Gly Ile Glu Arg Phe Ser Lys Glu Gly Val Ala Leu Leu
290 295 300
Ile Arg Ile Tyr Ala Ala Phe Phe Val Gly Ile Val Ala Val Ala Ala
305 310 315 320
Val Ala Pro Ala Gly Phe Phe Lys Pro Leu Ser Leu Gln Ala Thr Ala
325 330 335
Ile Ile Ala Gly Val Ser Arg Thr Gly Asn Thr Leu Val Asp Ile Leu
340 345 350
Ile Ala Gln Asp Ala Ser Asn Leu Leu Ile Val Trp Gln Leu Phe Leu
355 360 365
Phe Pro Leu Leu Gly Trp Val Val Gly Met Ser Leu Phe Leu Thr Glu
370 375 380
Leu Val Arg Asn Phe Thr Tyr Ala Lys Ser Phe Ile Leu Met Tyr Gly
385 390 395 400
Val Val Gly Ile Tyr Phe Ala Ser Gln Ser Val Arg Met Met Val Met
405 410 415
Met Ala Pro Val Ala Cys Ile Phe Thr Ala Leu Leu Phe Arg Trp Thr
420 425 430
Leu Asp Tyr Leu Leu Gly Ser Phe Phe Trp Ala Glu Met Pro Leu Ser
435 440 445
Leu Asp Thr Asp Ala Gln Arg Gly Arg Gln Gln Gln Thr Ala Glu Glu
450 455 460
Ala Glu Ala Glu Thr Lys Arg Lys Glu Glu Glu Tyr Asn Thr Met Gln
465 470 475 480
Val Lys Lys Met Thr Val Arg Met Val Pro Phe Met Ile Leu Leu Leu
485 490 495
Leu Phe Arg Leu Ser Gly Phe Ile Glu Asp Val Ala Ala Ile Ser Arg
500 505 510
Glu Met Glu Ser Pro Gly Ile Ile Phe Pro Arg Gly Gln Val Gln Gly
515 520 525
Met Pro Glu Asp Lys Val Asp Asp Tyr Tyr Ala Gly Tyr Leu Tyr Leu
530 535 540
Arg Glu Asn Thr Pro Glu Asp Ala Arg Ile Leu Ala Trp Trp Asp Tyr
545 550 555 560
Gly Tyr Gln Ile Thr Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly
565 570 575
Asn Thr Trp Asn His Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr
580 585 590
Ser Pro Val Ala Glu Ala His Ser Leu Val Arg His Met Ala Asp Tyr
595 600 605
Val Leu Ile Phe Ser Gly Asp Lys Tyr Phe Ser Asp Leu Asn Arg Ser
610 615 620
Pro Met Met Ala Arg Ile Gly Asn Ser Val Tyr Arg Asp Ile Cys Pro
625 630 635 640
Asn Asp Pro Leu Cys Ser Gln Phe Val Leu Gln Lys Arg Arg Lys Val
645 650 655
Ala Ala Ala Lys Arg Ser Arg His Val Thr Val Asn Glu Gln Glu Glu
660 665 670
Asp Asp Asn Pro Glu Ser Val Val Tyr Glu Pro Ser Ser Leu Met Ala
675 680 685
Lys Ser Leu Ile Tyr His Leu His Ser Thr Gly Val Val Glu Gly Val
690 695 700
Met Leu Asp Glu Thr Leu Phe Gln Asn Val Phe Thr Ser Thr Gln Gly
705 710 715 720
Phe Met Arg Ile Phe Lys Val Met Asn Val Ser Ala Glu Ser Lys Lys
725 730 735
Trp Val Ala Asp Pro Ala Asn Arg Val Cys Arg Pro Pro Gly Ser Trp
740 745 750
Ile Cys Pro Gly Gln Tyr Pro Pro Ala Lys Glu Ile Gln Glu Met Leu
755 760 765
Ala His Gln Asn Thr Asn Phe Lys Asp Leu Leu Asp Ala Met Asn Asp
770 775 780
Leu Glu Gln Ala Gln Ala Leu Asn Lys Val
785 790
<210> 24
<211> 823
<212> PRT
<213> 巴西利什曼原虫
<400> 24
Met Val Thr Glu Arg Gly Ala Ala Thr Pro Ser Thr Ala Ala Ser Gly
1 5 10 15
Glu Ala Pro Ser Glu Thr Leu Leu Leu Gly Glu Tyr Lys Val Ser Leu
20 25 30
His Ala Arg Ser Thr Ile Tyr Thr Ala Cys Phe Ala Val Pro Met Ala
35 40 45
Ile Leu Phe Pro Ile Ala Tyr Lys Met Arg Val Arg Ser Ile Asp Val
50 55 60
Tyr Gly Tyr Leu Phe His Arg Asn Asp Pro Trp Phe Asn Tyr Arg Ala
65 70 75 80
Ala Glu Tyr Met Ser Ala His Gly Trp Ser Ala Phe Phe Ser Trp Phe
85 90 95
Asp Tyr Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly Thr Thr Thr
100 105 110
Tyr Pro Gly Leu Gln Leu Thr Ala Val Ala Ile His Arg Ala Leu Ala
115 120 125
Ala Ala Gly Val Pro Met Ser Leu Asn Asn Val Cys Val Leu Ile Pro
130 135 140
Ala Trp Phe Ser Leu Val Ser Ser Ala Met Val Ala Leu Leu Ala His
145 150 155 160
Glu Met Thr Gly Asn Met Ala Thr Ser Ser Ile Ser Ser Ile Leu Phe
165 170 175
Ser Val Val Pro Ala His Leu Met Arg Ser Met Ala Gly Glu Phe Asp
180 185 190
Asn Glu Cys Ile Ala Val Ala Ala Met Leu Leu Thr Phe Tyr Leu Trp
195 200 205
Val Arg Ser Leu Arg Thr Arg Cys Ser Trp Pro Ile Gly Ile Leu Thr
210 215 220
Gly Ile Ala Tyr Gly Tyr Met Val Ala Ala Trp Gly Gly Tyr Ile Phe
225 230 235 240
Val Leu Asn Met Val Ala Met His Ala Gly Ile Ser Ser Met Val Asp
245 250 255
Trp Ala Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala Tyr Ala Leu
260 265 270
Phe Tyr Val Val Gly Thr Ala Ile Ala Thr Arg Val Pro Pro Val Gly
275 280 285
Met Ser Pro Phe Arg Ser Leu Glu Gln Leu Gly Ala Leu Val Val Leu
290 295 300
Leu Phe Leu Cys Gly Leu Gln Ala Cys Glu Thr Gln Arg Ser Arg Leu
305 310 315 320
Gly Val Glu Arg Phe Ser Thr Glu Gly Val Ser Leu Leu Val Arg Ile
325 330 335
Tyr Ala Ala Phe Phe Val Gly Ile Val Ala Val Val Ala Met Ala Pro
340 345 350
Ala Gly Phe Phe Lys Pro Leu Ser Leu Gln Ala His Ala Met Ile Ala
355 360 365
Gly Ala Gln Pro Thr Gly Asn Thr Leu Val Asp Met Leu Ile Ala Lys
370 375 380
Asp Ala Ser Ser Leu Leu Val Ala Trp Glu Leu Leu Leu Phe Pro Phe
385 390 395 400
Phe Gly Trp Met Val Gly Met Gly Ala Phe Leu Thr Glu Leu Val Gln
405 410 415
Ser Phe Thr Tyr Thr Lys Ser Phe Met Leu Met Tyr Gly Ala Val Gly
420 425 430
Met Tyr Phe Ala Ser Gln Ser Val Arg Met Met Val Met Met Ala Pro
435 440 445
Val Ala Cys Ile Phe Thr Ala Leu Leu Phe Cys Leu Ala Leu Asp Tyr
450 455 460
Ala Leu Gly Ala Leu Phe Trp Ala Glu Ile Pro Pro Ser Ile Asp Ser
465 470 475 480
Asp Ala Gln Gln Glu Leu His Gln Gln Thr Ala Glu Ala Ala Lys Thr
485 490 495
Lys Lys Arg Lys Gln Glu Glu Tyr Thr Thr Met Gln Val Lys Met Ile
500 505 510
Ser Val Arg Met Met Pro Leu Met Leu Leu Val Leu Leu Phe Arg Phe
515 520 525
Ser Gly Phe Ile Glu Asp Val Ala Ala Ile Ser Arg Glu Ile Glu Val
530 535 540
Pro Gly Ile Val Phe Pro Gly Ser Met Val Gln Gly Leu Ser Asp Asp
545 550 555 560
Met Ile Asp Asp Tyr Tyr Ala Gly Tyr Leu Tyr Leu Arg Asp Asn Thr
565 570 575
Pro Ala Asp Ala Arg Val Leu Ser Trp Trp Asp Tyr Gly Tyr Gln Ile
580 585 590
Thr Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly Asn Thr Trp Asn
595 600 605
His Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr Ser Pro Val Ala
610 615 620
Glu Ala His Ser Leu Val Arg His Met Ala Asp Tyr Val Leu Ile Phe
625 630 635 640
Ala Gly Asp Met His Phe Ser Asp Leu Ile Asn Ser Pro Met Met Ala
645 650 655
Arg Ile Gly Asn Ser Val Tyr His Asp Ile Cys Pro Asn Asp Pro Leu
660 665 670
Cys Ser Arg Phe Val Phe Gln Glu Lys Arg Lys Ile Ala Pro Ala Arg
675 680 685
Ser Gly Arg His Ile Asn Leu Ala Lys Leu Gly Asp Asp Glu Glu Glu
690 695 700
Thr Gln Asn Val Glu Tyr Glu Pro Ser Pro Leu Met Ala Lys Ser Leu
705 710 715 720
Ile Tyr His Leu His Ser Ala Gly Val Lys Glu Gly Val Thr Leu Asn
725 730 735
Asp Lys Leu Phe Gln His Val Tyr Thr Ser Ala His Gly Leu Met Arg
740 745 750
Ile Phe Lys Val Met Asn Val Ser Ala Glu Ser Lys Lys Trp Val Ala
755 760 765
Asp Pro Ala Asn Arg Val Cys His Pro Pro Gly Ser Trp Ile Cys Pro
770 775 780
Gly Gln Tyr Pro Pro Ala Lys Glu Ile Gln Glu Met Leu Ala His Arg
785 790 795 800
Tyr Thr Ser Leu Lys Asp Leu Val Asp Ser Met Ser Asp Ser Glu Arg
805 810 815
Glu Gly Thr Leu Asn Gly Glu
820
<210> 25
<211> 795
<212> PRT
<213> 人工的
<220>
<223> 利什曼原虫属STT3共有序列
<220>
<221> misc_feature
<222> (1)..(6)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (9)..(9)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (12)..(13)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (21)..(23)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (27)..(27)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (34)..(34)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (42)..(43)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (57)..(57)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (81)..(81)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (89)..(89)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (103)..(103)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (109)..(109)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (114)..(114)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (121)..(121)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (126)..(126)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (133)..(134)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (139)..(140)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (142)..(142)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (150)..(150)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (170)..(170)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (178)..(178)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (187)..(187)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (193)..(193)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (197)..(197)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (203)..(203)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (242)..(242)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (253)..(254)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (264)..(264)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (273)..(273)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (276)..(276)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (278)..(279)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (281)..(283)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (286)..(286)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (289)..(289)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (292)..(294)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (296)..(296)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (298)..(298)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (302)..(302)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (305)..(305)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (308)..(308)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (319)..(321)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (335)..(335)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (337)..(337)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (339)..(339)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (341)..(343)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (351)..(351)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (353)..(353)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (355)..(355)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (359)..(359)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (362)..(363)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (365)..(365)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (367)..(367)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (371)..(372)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (375)..(376)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (379)..(380)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (383)..(383)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (386)..(389)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (392)..(392)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (396)..(396)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (401)..(401)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (404)..(404)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (430)..(432)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (436)..(436)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (439)..(440)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (445)..(445)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (447)..(449)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (451)..(451)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (455)..(458)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (464)..(469)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (473)..(473)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (477)..(477)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (483)..(486)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (489)..(489)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (491)..(491)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (493)..(493)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (495)..(495)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (500)..(500)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (513)..(514)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (516)..(516)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (520)..(520)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (523)..(525)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (529)..(534)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (539)..(539)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (546)..(547)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (550)..(550)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (554)..(554)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (556)..(556)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (612)..(613)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (616)..(617)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (622)..(623)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (626)..(626)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (636)..(636)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (641)..(641)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (647)..(647)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (650)..(650)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (652)..(654)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (656)..(656)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (658)..(658)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (660)..(662)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (665)..(681)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (686)..(686)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (688)..(688)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (699)..(699)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (702)..(703)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (706)..(706)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (708)..(710)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (714)..(714)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (716)..(716)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (719)..(720)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (722)..(723)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (733)..(733)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (742)..(742)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (748)..(748)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (772)..(773)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (775)..(776)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (780)..(780)
<223> Xaa可以是任意天然存在的氨基酸
<220>
<221> misc_feature
<222> (782)..(795)
<223> Xaa可以是任意天然存在的氨基酸
<400> 25
Xaa Xaa Xaa Xaa Xaa Xaa Ser Thr Xaa Tyr Thr Xaa Xaa Phe Ala Val
1 5 10 15
Pro Met Ala Ile Xaa Xaa Xaa Ile Ala Tyr Xaa Met Arg Val Arg Ser
20 25 30
Ile Xaa Val Tyr Gly Tyr Leu Phe His Xaa Xaa Asp Pro Trp Phe Asn
35 40 45
Tyr Arg Ala Ala Glu Tyr Met Ser Xaa His Gly Trp Ser Ala Phe Phe
50 55 60
Ser Trp Phe Asp Tyr Met Ser Trp Tyr Pro Leu Gly Arg Pro Val Gly
65 70 75 80
Xaa Thr Thr Tyr Pro Gly Leu Gln Xaa Thr Ala Val Ala Ile His Arg
85 90 95
Ala Leu Ala Ala Ala Gly Xaa Pro Met Ser Leu Asn Xaa Val Cys Val
100 105 110
Leu Xaa Pro Ala Trp Phe Ser Leu Xaa Ser Ser Ala Met Xaa Ala Leu
115 120 125
Leu Ala His Glu Xaa Xaa Gly Asn Met Ala Xaa Xaa Ser Xaa Ser Ser
130 135 140
Ile Leu Phe Ser Val Xaa Pro Ala His Leu Met Arg Ser Met Ala Gly
145 150 155 160
Glu Phe Asp Asn Glu Cys Ile Ala Val Xaa Ala Met Leu Leu Thr Phe
165 170 175
Tyr Xaa Trp Val Arg Ser Leu Arg Thr Arg Xaa Ser Trp Pro Ile Gly
180 185 190
Xaa Leu Thr Gly Xaa Ala Tyr Gly Tyr Met Xaa Ala Ala Trp Gly Gly
195 200 205
Tyr Ile Phe Val Leu Asn Met Val Ala Met His Ala Gly Ile Ser Ser
210 215 220
Met Val Asp Trp Ala Arg Asn Thr Tyr Asn Pro Ser Leu Leu Arg Ala
225 230 235 240
Tyr Xaa Leu Phe Tyr Val Val Gly Thr Ala Ile Ala Xaa Xaa Val Pro
245 250 255
Pro Val Gly Met Ser Pro Phe Xaa Arg Ser Leu Glu Gln Leu Gly Ala
260 265 270
Xaa Val Val Xaa Leu Xaa Xaa Cys Xaa Xaa Xaa Ala Cys Xaa Thr Gln
275 280 285
Xaa Ser Arg Xaa Xaa Xaa Glu Xaa Phe Xaa Thr Glu Gly Xaa Ser Leu
290 295 300
Xaa Arg Ile Xaa Ala Ala Phe Phe Val Gly Ile Val Ala Val Xaa Xaa
305 310 315 320
Xaa Ala Pro Ala Gly Phe Phe Lys Pro Leu Ser Leu Gln Ala Xaa Ala
325 330 335
Xaa Ile Xaa Gly Xaa Xaa Xaa Thr Gly Asn Thr Leu Val Asp Xaa Leu
340 345 350
Xaa Ala Xaa Asp Ala Ser Xaa Leu Leu Xaa Xaa Trp Xaa Leu Xaa Leu
355 360 365
Phe Pro Xaa Xaa Gly Trp Xaa Xaa Gly Met Xaa Xaa Phe Leu Xaa Glu
370 375 380
Leu Xaa Xaa Xaa Xaa Thr Tyr Xaa Lys Ser Phe Xaa Leu Met Tyr Gly
385 390 395 400
Xaa Val Gly Xaa Tyr Phe Ala Ser Gln Ser Val Arg Met Met Val Met
405 410 415
Met Ala Pro Val Ala Cys Ile Phe Thr Ala Leu Leu Phe Xaa Xaa Xaa
420 425 430
Leu Asp Tyr Xaa Leu Gly Xaa Xaa Phe Trp Ala Glu Xaa Pro Xaa Xaa
435 440 445
Xaa Asp Xaa Asp Ala Gln Xaa Xaa Xaa Xaa Gln Gln Thr Ala Glu Xaa
450 455 460
Xaa Xaa Xaa Xaa Xaa Lys Arg Lys Xaa Glu Glu Tyr Xaa Thr Met Gln
465 470 475 480
Val Lys Xaa Xaa Xaa Xaa Arg Met Xaa Pro Xaa Met Xaa Leu Xaa Leu
485 490 495
Leu Phe Arg Xaa Ser Gly Phe Ile Glu Asp Val Ala Ala Ile Ser Arg
500 505 510
Xaa Xaa Glu Xaa Pro Gly Ile Xaa Phe Pro Xaa Xaa Xaa Val Gln Gly
515 520 525
Xaa Xaa Xaa Xaa Xaa Xaa Asp Asp Tyr Tyr Xaa Gly Tyr Leu Tyr Leu
530 535 540
Arg Xaa Xaa Thr Pro Xaa Asp Ala Arg Xaa Leu Xaa Trp Trp Asp Tyr
545 550 555 560
Gly Tyr Gln Ile Thr Gly Ile Gly Asn Arg Thr Ser Leu Ala Asp Gly
565 570 575
Asn Thr Trp Asn His Glu His Ile Ala Thr Ile Gly Lys Met Leu Thr
580 585 590
Ser Pro Val Ala Glu Ala His Ser Leu Val Arg His Met Ala Asp Tyr
595 600 605
Val Leu Ile Xaa Xaa Gly Asp Xaa Xaa Phe Ser Asp Leu Xaa Xaa Ser
610 615 620
Pro Xaa Met Ala Arg Ile Gly Asn Ser Val Tyr Xaa Asp Ile Cys Pro
625 630 635 640
Xaa Asp Pro Leu Cys Ser Xaa Phe Val Xaa Gln Xaa Xaa Xaa Lys Xaa
645 650 655
Ala Xaa Ala Xaa Xaa Xaa Arg His Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
660 665 670
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Glu Pro Ser Xaa Leu Xaa
675 680 685
Ala Lys Ser Leu Ile Tyr His Leu His Ser Xaa Gly Val Xaa Xaa Gly
690 695 700
Val Xaa Leu Xaa Xaa Xaa Leu Phe Gln Xaa Val Xaa Thr Ser Xaa Xaa
705 710 715 720
Gly Xaa Xaa Arg Ile Phe Lys Val Met Asn Val Ser Xaa Glu Ser Lys
725 730 735
Lys Trp Val Ala Asp Xaa Ala Asn Arg Val Cys Xaa Pro Pro Gly Ser
740 745 750
Trp Ile Cys Pro Gly Gln Tyr Pro Pro Ala Lys Glu Ile Gln Glu Met
755 760 765
Leu Ala His Xaa Xaa Thr Xaa Xaa Lys Asp Leu Xaa Asp Xaa Xaa Xaa
770 775 780
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
785 790 795
<210> 26
<211> 718
<212> PRT
<213> 酿酒酵母
<400> 26
Met Gly Ser Asp Arg Ser Cys Val Leu Ser Val Phe Gln Thr Ile Leu
1 5 10 15
Lys Leu Val Ile Phe Val Ala Ile Phe Gly Ala Ala Ile Ser Ser Arg
20 25 30
Leu Phe Ala Val Ile Lys Phe Glu Ser Ile Ile His Glu Phe Asp Pro
35 40 45
Trp Phe Asn Tyr Arg Ala Thr Lys Tyr Leu Val Asn Asn Ser Phe Tyr
50 55 60
Lys Phe Leu Asn Trp Phe Asp Asp Arg Thr Trp Tyr Pro Leu Gly Arg
65 70 75 80
Val Thr Gly Gly Thr Leu Tyr Pro Gly Leu Met Thr Thr Ser Ala Phe
85 90 95
Ile Trp His Ala Leu Arg Asn Trp Leu Gly Leu Pro Ile Asp Ile Arg
100 105 110
Asn Val Cys Val Leu Phe Ala Pro Leu Phe Ser Gly Val Thr Ala Trp
115 120 125
Ala Thr Tyr Glu Phe Thr Lys Glu Ile Lys Asp Ala Ser Ala Gly Leu
130 135 140
Leu Ala Ala Gly Phe Ile Ala Ile Val Pro Gly Tyr Ile Ser Arg Ser
145 150 155 160
Val Ala Gly Ser Tyr Asp Asn Glu Ala Ile Ala Ile Thr Leu Leu Met
165 170 175
Val Thr Phe Met Phe Trp Ile Lys Ala Gln Lys Thr Gly Ser Ile Met
180 185 190
His Ala Thr Cys Ala Ala Leu Phe Tyr Phe Tyr Met Val Ser Ala Trp
195 200 205
Gly Gly Tyr Val Phe Ile Thr Asn Leu Ile Pro Leu His Val Phe Leu
210 215 220
Leu Ile Leu Met Gly Arg Tyr Ser Ser Lys Leu Tyr Ser Ala Tyr Thr
225 230 235 240
Thr Trp Tyr Ala Ile Gly Thr Val Ala Ser Met Gln Ile Pro Phe Val
245 250 255
Gly Phe Leu Pro Ile Arg Ser Asn Asp His Met Ala Ala Leu Gly Val
260 265 270
Phe Gly Leu Ile Gln Ile Val Ala Phe Gly Asp Phe Val Lys Gly Gln
275 280 285
Ile Ser Thr Ala Lys Phe Lys Val Ile Met Met Val Ser Leu Phe Leu
290 295 300
Ile Leu Val Leu Gly Val Val Gly Leu Ser Ala Leu Thr Tyr Met Gly
305 310 315 320
Leu Ile Ala Pro Trp Thr Gly Arg Phe Tyr Ser Leu Trp Asp Thr Asn
325 330 335
Tyr Ala Lys Ile His Ile Pro Ile Ile Ala Ser Val Ser Glu His Gln
340 345 350
Pro Val Ser Trp Pro Ala Phe Phe Phe Asp Thr His Phe Leu Ile Trp
355 360 365
Leu Phe Pro Ala Gly Val Phe Leu Leu Phe Leu Asp Leu Lys Asp Glu
370 375 380
His Val Phe Val Ile Ala Tyr Ser Val Leu Cys Ser Tyr Phe Ala Gly
385 390 395 400
Val Met Val Arg Leu Met Leu Thr Leu Thr Pro Val Ile Cys Val Ser
405 410 415
Ala Ala Val Ala Leu Ser Lys Ile Phe Asp Ile Tyr Leu Asp Phe Lys
420 425 430
Thr Ser Asp Arg Lys Tyr Ala Ile Lys Pro Ala Ala Leu Leu Ala Lys
435 440 445
Leu Ile Val Ser Gly Ser Phe Ile Phe Tyr Leu Tyr Leu Phe Val Phe
450 455 460
His Ser Thr Trp Val Thr Arg Thr Ala Tyr Ser Ser Pro Ser Val Val
465 470 475 480
Leu Pro Ser Gln Thr Pro Asp Gly Lys Leu Ala Leu Ile Asp Asp Phe
485 490 495
Arg Glu Ala Tyr Tyr Trp Leu Arg Met Asn Ser Asp Glu Asp Ser Lys
500 505 510
Val Ala Ala Trp Trp Asp Tyr Gly Tyr Gln Ile Gly Gly Met Ala Asp
515 520 525
Arg Thr Thr Leu Val Asp Asn Asn Thr Trp Asn Asn Thr His Ile Ala
530 535 540
Ile Val Gly Lys Ala Met Ala Ser Pro Glu Glu Lys Ser Tyr Glu Ile
545 550 555 560
Leu Lys Glu His Asp Val Asp Tyr Val Leu Val Ile Phe Gly Gly Leu
565 570 575
Ile Gly Phe Gly Gly Asp Asp Ile Asn Lys Phe Leu Trp Met Ile Arg
580 585 590
Ile Ser Glu Gly Ile Trp Pro Glu Glu Ile Lys Glu Arg Tyr Phe Tyr
595 600 605
Thr Ala Glu Gly Glu Tyr Arg Val Asp Ala Arg Ala Ser Glu Thr Met
610 615 620
Arg Asn Ser Leu Leu Tyr Lys Met Ser Tyr Lys Asp Phe Pro Gln Leu
625 630 635 640
Phe Asn Gly Gly Gln Ala Thr Asp Arg Val Arg Gln Gln Met Ile Thr
645 650 655
Pro Leu Asp Val Pro Pro Leu Asp Tyr Phe Asp Glu Val Phe Thr Ser
660 665 670
Glu Asn Trp Met Val Arg Ile Tyr Gln Leu Lys Lys Asp Asp Ala Gln
675 680 685
Gly Arg Thr Leu Arg Asp Val Gly Glu Leu Thr Arg Ser Ser Thr Lys
690 695 700
Thr Arg Arg Ser Ile Lys Arg Pro Glu Leu Gly Leu Arg Val
705 710 715
<210> 27
<211> 2157
<212> DNA
<213> 酿酒酵母
<400> 27
atgggatccg accggtcgtg tgttttgtct gtgtttcaga ccatcctcaa gctcgtcatc 60
ttcgtggcga tttttggggc tgccatatca tcacgtttgt ttgcagtcat caaatttgag 120
tctattatcc atgaattcga cccctggttc aattataggg ctaccaaata tctcgtcaac 180
aattcgtttt acaagttttt gaactggttt gacgaccgta cctggtaccc cctcggaagg 240
gttactggag ggactttata tcctggtttg atgacgacta gtgcgttcat ctggcacgcc 300
ctgcgcaact ggttgggctt gcccattgac atcagaaacg tttgtgtgct atttgcgcca 360
ctattttctg gggtcaccgc ctgggcgact tacgaattta cgaaagagat taaagatgcc 420
agcgctgggc ttttggctgc tggttttata gccattgtcc ccggttatat atctagatca 480
gtggcggggt cctacgataa tgaggccatt gccattacac tattaatggt cactttcatg 540
ttttggatta aggcccaaaa gactggctct atcatgcacg caacgtgtgc agctttattc 600
tacttctaca tggtgtcggc ttggggtgga tacgtgttca tcaccaactt gatcccactc 660
catgtctttt tgctgatttt gatgggcaga tattcgtcca aactgtattc tgcctacacc 720
acttggtacg ctattggaac tgttgcatcc atgcagatcc catttgtcgg tttcctacct 780
atcaggtcta acgaccacat ggccgcattg ggtgttttcg gtttgattca gattgtcgcc 840
ttcggtgact tcgtgaaggg ccaaatcagc acagctaagt ttaaagtcat catgatggtt 900
tctctgtttt tgatcttggt ccttggtgtg gtcggacttt ctgccttgac ctatatgggg 960
ttgattgccc cttggactgg tagattttat tcgttatggg ataccaacta cgcaaagatc 1020
cacattccta tcattgcctc cgtttccgaa catcaacccg tttcgtggcc cgctttcttc 1080
tttgataccc actttttgat ctggctattc cccgccggtg tattcctact attcctcgac 1140
ttgaaagacg agcacgtttt tgtcatcgct tactccgttc tgtgttcgta ctttgccggt 1200
gttatggtta gattgatgtt gactttgaca ccagtcatct gtgtgtccgc cgccgtcgca 1260
ttgtccaaga tatttgacat ctacctggat ttcaagacaa gtgaccgcaa atacgccatc 1320
aaacctgcgg cactactggc caaattgatt gtttccggat cattcatctt ttatttgtat 1380
cttttcgtct tccattctac ttgggtaaca agaactgcat actcttctcc ttctgttgtt 1440
ttgccatcac aaaccccaga tggtaaattg gcgttgatcg acgacttcag ggaagcgtac 1500
tattggttaa gaatgaactc tgatgaggac agtaaggttg cagcgtggtg ggattacggt 1560
taccaaattg gtggcatggc agacagaacc actttagtcg ataacaacac gtggaacaat 1620
actcacatcg ccatcgttgg taaagccatg gcttcccctg aagagaaatc ttacgaaatt 1680
ctaaaagagc atgatgtcga ttatgtcttg gtcatctttg gtggtctaat tgggtttggt 1740
ggtgatgaca tcaacaaatt cttgtggatg atcagaatta gcgagggaat ctggccagaa 1800
gagataaaag agcgttattt ctataccgca gagggagaat acagagtaga tgcaagggct 1860
tctgagacca tgaggaactc gctactttac aagatgtcct acaaagattt cccacaatta 1920
ttcaatggtg gccaagccac tgacagagtg cgtcaacaaa tgatcacacc attagacgtc 1980
ccaccattag actacttcga cgaagttttt acttccgaaa actggatggt tagaatatat 2040
caattgaaga aggatgatgc ccaaggtaga actttgaggg acgttggtga gttaaccagg 2100
tcttctacga aaaccagaag gtccataaag agacctgaat taggcttgag agtctaa 2157
<210> 28
<211> 749
<212> PRT
<213> 粟酒裂殖酵母
<400> 28
Met Ala Asn Ser Ala Thr Ile Thr Ser Lys Lys Gly Val Lys Ser His
1 5 10 15
Gln Lys Asp Trp Lys Ile Pro Leu Lys Val Leu Ile Leu Ile Cys Ile
20 25 30
Ala Val Ala Ser Val Ser Ser Arg Leu Phe Ser Val Ile Arg Tyr Glu
35 40 45
Ser Ile Ile His Glu Phe Asp Pro Trp Phe Asn Phe Arg Ala Ser Lys
50 55 60
Ile Leu Val Glu Gln Gly Phe Tyr Asn Phe Leu Asn Trp Phe Asp Glu
65 70 75 80
Arg Ser Trp Tyr Pro Leu Gly Arg Val Ala Gly Gly Thr Leu Tyr Pro
85 90 95
Gly Leu Met Val Thr Ser Gly Ile Ile Phe Lys Val Leu His Leu Leu
100 105 110
Arg Ile Asn Val Asn Ile Arg Asp Val Cys Val Leu Leu Ala Pro Ala
115 120 125
Phe Ser Gly Ile Thr Ala Ile Ala Thr Tyr Tyr Leu Ala Arg Glu Leu
130 135 140
Lys Ser Asp Ala Cys Gly Leu Leu Ala Ala Ala Phe Met Gly Ile Ala
145 150 155 160
Pro Gly Tyr Thr Ser Arg Ser Val Ala Gly Ser Tyr Asp Asn Glu Ala
165 170 175
Ile Ala Ile Thr Leu Leu Met Ser Thr Phe Ala Leu Trp Ile Lys Ala
180 185 190
Val Lys Ser Gly Ser Ser Phe Trp Gly Ala Cys Thr Gly Leu Leu Tyr
195 200 205
Phe Tyr Met Val Thr Ala Trp Gly Gly Tyr Val Phe Ile Thr Asn Met
210 215 220
Ile Pro Leu His Val Phe Val Leu Leu Leu Met Gly Arg Tyr Thr Ser
225 230 235 240
Lys Leu Tyr Ile Ala Tyr Thr Thr Tyr Tyr Val Ile Gly Thr Leu Ala
245 250 255
Ser Met Gln Val Pro Phe Val Gly Phe Gln Pro Val Ser Thr Ser Glu
260 265 270
His Met Ser Ala Leu Gly Val Phe Gly Leu Leu Gln Leu Phe Ala Phe
275 280 285
Tyr Asn Tyr Val Lys Gly Leu Val Ser Ser Lys Gln Phe Gln Ile Leu
290 295 300
Ile Arg Phe Ala Leu Val Cys Leu Val Gly Leu Ala Thr Val Val Leu
305 310 315 320
Phe Ala Leu Ser Ser Thr Gly Val Ile Ala Pro Trp Thr Gly Arg Phe
325 330 335
Tyr Ser Leu Trp Asp Thr Asn Tyr Ala Lys Ile His Ile Pro Ile Ile
340 345 350
Ala Ser Val Ser Glu His Gln Pro Pro Thr Trp Ser Ser Leu Phe Phe
355 360 365
Asp Leu Gln Phe Leu Ile Trp Leu Leu Pro Val Gly Val Tyr Leu Cys
370 375 380
Phe Lys Glu Leu Arg Asn Glu His Val Phe Ile Ile Ile Tyr Pro Val
385 390 395 400
Leu Gly Thr Tyr Phe Cys Gly Val Met Val Arg Leu Val Leu Thr Leu
405 410 415
Thr Pro Cys Val Cys Ile Ala Ala Ala Val Ala Ile Ser Thr Leu Leu
420 425 430
Asp Thr Tyr Met Gly Pro Glu Val Glu Glu Asp Lys Val Ser Glu Glu
435 440 445
Ala Ala Ser Ala Lys Ser Lys Asn Lys Lys Gly Ile Ser Ser Ile Leu
450 455 460
Ser Phe Phe Thr Ser Gly Ser Lys Asn Ile Gly Ile Tyr Ser Leu Leu
465 470 475 480
Ser Arg Val Leu Val Ile Ser Ser Thr Ala Tyr Phe Leu Ile Met Phe
485 490 495
Val Tyr His Ser Ser Trp Val Thr Ser Asn Ala Tyr Ser Ser Pro Thr
500 505 510
Val Val Leu Ser Thr Val Leu Asn Asp Gly Ser Leu Met Tyr Ile Asp
515 520 525
Asp Phe Arg Glu Ala Tyr Asp Trp Leu Arg Arg Asn Thr Pro Tyr Asp
530 535 540
Thr Lys Val Met Ser Trp Trp Asp Tyr Gly Tyr Gln Ile Ala Gly Met
545 550 555 560
Ala Asp Arg Ile Thr Leu Val Asp Asn Asn Thr Trp Asn Asn Thr His
565 570 575
Ile Ala Thr Val Gly Lys Ala Met Ser Ser Pro Glu Glu Lys Ala Tyr
580 585 590
Pro Ile Leu Arg Lys His Asp Val Asp Tyr Ile Leu Ile Ile Tyr Gly
595 600 605
Gly Thr Leu Gly Tyr Ser Ser Asp Asp Met Asn Lys Phe Leu Trp Met
610 615 620
Ile Arg Ile Ser Gln Gly Leu Trp Pro Asp Glu Ile Val Glu Arg Asn
625 630 635 640
Phe Phe Thr Pro Asn Gly Glu Tyr Arg Thr Asp Asp Ala Ala Thr Pro
645 650 655
Thr Met Arg Glu Ser Leu Leu Tyr Lys Met Ser Tyr His Gly Ala Trp
660 665 670
Lys Leu Phe Pro Pro Asn Gln Gly Tyr Asp Arg Ala Arg Asn Gln Lys
675 680 685
Leu Pro Ser Lys Asp Pro Gln Leu Phe Thr Ile Glu Glu Ala Phe Thr
690 695 700
Thr Val His His Leu Val Arg Leu Tyr Lys Val Lys Lys Pro Asp Thr
705 710 715 720
Leu Gly Arg Asp Leu Lys Gln Val Thr Leu Phe Glu Glu Gly Lys Arg
725 730 735
Lys Lys Ser Ala Val Leu Gln Lys Leu Thr Lys Phe Leu
740 745
<210> 29
<211> 2250
<212> DNA
<213> 粟酒裂殖酵母
<400> 29
atggctaatt ctgctacaat tacgagtaaa aaaggcgtga agtctcatca gaaggactgg 60
aaaattccac ttaaagtgct cattcttata tgtattgctg tggcttctgt ctcttcgagg 120
cttttttctg tcattcgtta cgagtccatt attcatgaat ttgatccttg gttcaatttc 180
cgagcttcca aaatattggt ggaacaaggt ttttataact ttttaaattg gtttgatgaa 240
agaagttggt acccgttggg tcgtgtagcg ggtggtactt tgtacccagg acttatggtc 300
acgtctggta ttattttcaa agttttacat cttttaagaa ttaacgtgaa catccgtgat 360
gtatgtgttt tacttgcccc tgctttctct ggaatcactg cgattgctac ctattatctg 420
gctagagaat tgaaaagtga tgcatgtggc cttttagctg ccgcatttat gggtattgct 480
cctggataca cctcccgttc cgtcgctggt tcttacgata atgaagcaat tgctattacc 540
cttttgatgt caacgtttgc tttgtggatc aaggcagtga agtctggctc ctctttctgg 600
ggtgcctgca caggattgct ctacttctat atggtaactg cgtggggtgg ttatgtattc 660
atcacaaaca tgataccttt acacgtattt gttcttctac ttatgggtcg ctatactagc 720
aaattataca ttgcttacac aacatactat gttattggaa cgctggcttc tatgcaagtt 780
ccgtttgttg gtttccaacc cgtgtcgact agtgagcata tgtccgcttt aggagtgttt 840
ggcctgttac agctttttgc attctacaat tatgttaaag gtctagtttc atccaagcaa 900
ttccaaatac ttattcgttt tgccttggtt tgcttagtgg gtctagcaac agtcgtcctt 960
tttgctttat cttcaacagg tgttatcgct ccttggacag gacgtttcta ttctctttgg 1020
gatacaaact acgccaagat tcatattcct atcattgctt cggtatcaga acatcagcct 1080
cctacttgga gttcgttgtt ctttgatctt caatttttga tttggttatt gccagttggt 1140
gtttacttgt gtttcaagga acttcgtaat gaacatgtct ttattattat atatcctgtc 1200
ttaggaacat atttttgtgg tgtgatggtt cgtttggttt taaccttaac tccttgtgtt 1260
tgcatagctg ctgctgtagc aatttccact cttttagaca catatatggg tcctgaagtt 1320
gaagaggaca aagtgagcga agaagccgct tcagccaaat ctaagaacaa gaaaggtatt 1380
tcctctattc ttagtttctt cacttctggc tcaaaaaata ttggaattta cagtttgctt 1440
tccagagtat tagtcatttc ctctaccgca tatttcctaa taatgtttgt ttatcattcc 1500
agttgggtga cttctaatgc ttactcttcc cctaccgtgg ttttgtctac cgtgttaaac 1560
gatggtagtt taatgtatat tgatgacttc cgtgaagctt atgactggct tcgtagaaac 1620
actccttatg acacaaaggt tatgagttgg tgggattatg gttaccaaat tgctggtatg 1680
gctgatcgta ttactttagt cgacaacaat acgtggaaca acacacatat tgccacagtt 1740
ggaaaagcca tgtcttcacc tgaagaaaaa gcttacccta tcctccgtaa acacgatgtt 1800
gattatattc ttattatata tggtggtact cttggataca gcagcgacga catgaacaag 1860
ttcctttgga tgatccgaat ttctcaggga ttatggcccg atgaaatagt agagcgtaac 1920
ttttttactc ctaatggaga atatcgaact gacgatgcgg ctactcccac tatgcgtgag 1980
tctttattat ataagatgtc atatcacggt gcttggaaac ttttccctcc caatcaagga 2040
tatgaccgtg ctcgcaatca aaaactacca tcgaaagatc ctcaactatt tactatcgaa 2100
gaagcattca ctaccgttca tcatttagtt cgtttgtata aggttaagaa accggataca 2160
cttggacgcg atttgaaaca agtgacatta tttgaagaag gcaaaagaaa gaagtccgcc 2220
gtcctgcaaa aactaacgaa attcctttga 2250
<210> 30
<211> 714
<212> PRT
<213> 盘基网柄菌
<400> 30
Met Lys Arg Ser Glu Lys Ser Ser Thr Ser Val Val Ser Asn Asn Lys
1 5 10 15
Gln Gln Asp Val Asn Ile Ile Ser Ser Asn Glu Val Gly Val Lys Glu
20 25 30
Glu Asn Lys Gly His Gln Glu Phe Leu Leu Lys Val Leu Ile Leu Ser
35 40 45
Val Ile Tyr Val Leu Ala Phe Ser Thr Arg Leu Phe Ser Val Leu Arg
50 55 60
Tyr Glu Ser Val Ile His Glu Phe Asp Pro Tyr Phe Asn Tyr Arg Ser
65 70 75 80
Thr Ile Tyr Leu Val Gln Glu Gly Phe Tyr Asn Phe Leu Asn Trp Phe
85 90 95
Asp Glu Arg Ala Trp Tyr Pro Leu Gly Arg Ile Val Gly Gly Thr Ile
100 105 110
Tyr Pro Gly Leu Met Ala Thr Ala Ser Leu Val His Trp Ser Leu Asn
115 120 125
Ser Leu Asn Ile Thr Val Asn Ile Arg Asn Val Cys Val Leu Leu Ser
130 135 140
Pro Trp Phe Ala Ser Asn Thr Ala Met Val Thr Tyr Lys Phe Ala Lys
145 150 155 160
Glu Val Lys Asp Thr Gln Thr Gly Leu Val Ala Ala Ala Met Ile Ala
165 170 175
Ile Val Pro Gly Tyr Ile Ser Arg Ser Val Ala Gly Ser Phe Asp Asn
180 185 190
Glu Gly Ile Ala Ile Phe Ala Leu Ile Phe Thr Tyr Tyr Cys Trp Ile
195 200 205
Lys Ser Val Asn Thr Gly Ser Leu Met Trp Ala Ala Ile Cys Ser Leu
210 215 220
Ala Tyr Phe Tyr Met Ala Ser Ala Trp Gly Gly Tyr Val Phe Ile Ile
225 230 235 240
Asn Leu Ile Pro Leu His Ala Phe Phe Leu Leu Leu Thr Gly Arg Tyr
245 250 255
Ser His Arg Leu Tyr Ile Ala Tyr Ser Thr Met Phe Val Ile Gly Thr
260 265 270
Ile Leu Ser Met Gln Ile Thr Phe Ile Ser Phe Gln Pro Val Gln Ser
275 280 285
Ser Glu His Leu Ala Ala Ile Gly Ile Phe Gly Leu Leu Gln Leu Tyr
290 295 300
Ala Gly Leu Ser Trp Val Lys Ser His Leu Thr Asn Glu Ala Phe Lys
305 310 315 320
Lys Leu Gln Arg Leu Thr Val Leu Phe Val Leu Ser Cys Ala Ala Ala
325 330 335
Val Leu Val Val Gly Thr Leu Thr Gly Tyr Ile Ser Pro Phe Asn Gly
340 345 350
Arg Phe Tyr Ser Leu Leu Asp Pro Thr Tyr Ala Arg Asp His Ile Pro
355 360 365
Ile Ile Ala Ser Val Ser Glu His Gln Pro Thr Thr Trp Ala Ser Tyr
370 375 380
Phe Phe Asp Leu His Ile Leu Val Phe Leu Phe Pro Ala Gly Leu Tyr
385 390 395 400
Phe Cys Phe Gln Lys Leu Thr Asp Ala Asn Ile Phe Leu Ile Leu Tyr
405 410 415
Gly Val Thr Ser Ile Tyr Phe Ser Gly Val Met Val Arg Leu Met Leu
420 425 430
Val Leu Ala Pro Val Ala Cys Ile Leu Ala Ala Val Ala Val Ser Ala
435 440 445
Thr Leu Thr Thr Tyr Met Lys Lys Leu Lys Ala Pro Ser Ser Pro Ser
450 455 460
Asp Ala Asn Asn Ser Lys Glu Ser Gly Gly Val Met Val Ala Val Leu
465 470 475 480
Thr Val Leu Leu Ile Leu Tyr Ala Phe His Cys Thr Trp Val Thr Ser
485 490 495
Glu Ala Tyr Ser Ser Pro Ser Ile Val Leu Ser Ala Lys Gln Asn Asp
500 505 510
Gly Ser Arg Val Ile Phe Asp Asp Phe Arg Glu Ala Tyr Arg Trp Ile
515 520 525
Gly Gln Asn Thr Ala Asp Asp Ala Arg Ile Met Ser Trp Trp Asp Tyr
530 535 540
Gly Tyr Gln Leu Ser Ala Met Ala Asn Arg Thr Val Leu Val Asp Asn
545 550 555 560
Asn Thr Trp Asn Asn Ser His Ile Ala Gln Val Gly Lys Ala Phe Ala
565 570 575
Ser Thr Glu Glu Asp Ala Tyr Ile Gln Met Lys Ala Leu Asp Val Asp
580 585 590
Tyr Val Leu Val Ile Phe Gly Gly Leu Thr Gly Tyr Ser Ser Asp Asp
595 600 605
Ile Asn Lys Phe Leu Trp Met Val Arg Ile Gly Gly Ser Cys Asp Pro
610 615 620
Asn Ile Lys Glu Gln Asp Tyr Leu Thr Asn Gly Gln Tyr Arg Ile Asp
625 630 635 640
Lys Gly Ala Ser Pro Thr Met Leu Asn Ser Leu Met Tyr Lys Leu Ser
645 650 655
Tyr Tyr Arg Phe Ser Glu Val His Thr Asp Tyr Gln Arg Pro Thr Gly
660 665 670
Phe Asp Arg Val Arg Asn Val Glu Ile Gly Asn Lys Asn Phe Asp Leu
675 680 685
Thr Tyr Leu Glu Glu Ala Phe Thr Ser Val His Trp Leu Val Arg Val
690 695 700
Tyr Lys Val Lys Asp Phe Asp Asn Arg Ala
705 710
<210> 31
<211> 2145
<212> DNA
<213> 盘基网柄菌
<400> 31
atgaaaagat cagaaaaatc aagtacatct gttgttagta ataacaaaca acaagatgta 60
aatatcatca gttcaaatga agttggtgtt aaagaagaaa ataaaggaca tcaagaattc 120
ttattaaaag ttttaattct atcagtcatt tatgttttag cattttcaac tcgtttattc 180
tcagtattac gttatgaaag tgttattcat gaatttgatc catattttaa ttatagatca 240
acaatatatc ttgttcaaga aggtttttat aattttttaa attggtttga tgaaagagca 300
tggtatccat taggacgtat tgtaggtggt acaatttacc caggtttaat ggcaacagca 360
agtttagttc attggtcatt gaattcattg aatattacag ttaatattag aaatgtatgt 420
gtattgttat caccatggtt tgcatcaaat acagcaatgg taacctataa atttgccaaa 480
gaagttaagg atacacaaac tggtttggtt gcagcagcca tgattgcaat tgttccaggt 540
tatatttcac gttcagtagc aggttcattc gataatgaag gtattgcaat ctttgcattg 600
attttcacat attattgttg gattaagtca gtaaacacag gctcattgat gtgggctgcc 660
atctgttcat tggcctactt ttatatggca agtgcctggg gtggttatgt attcatcatt 720
aatttaatcc cattgcatgc ctttttcttg cttttgacag gccgttattc acatcgtctc 780
tacatagcct acagcacaat gtttgtcatt ggtacaatcc tctctatgca aattacattc 840
attagtttcc aaccagttca atcatctgaa catttggctg ccattggtat ctttggtctc 900
ctccaattgt acgctggttt gtcatgggta aagagtcacc tcaccaatga agccttcaag 960
aaacttcaac gtttgacagt gttattcgtt ttatcttgtg ctgctgccgt acttgtcgtt 1020
ggtacattaa ctggttacat ctcaccattc aatggtcgtt tctattcatt gttggatcca 1080
acctatgctc gtgaccacat tccaatcatt gcatcagtat cagagcatca accaaccact 1140
tgggcatcat actttttcga tctccatatc ttggtattcc ttttcccagc cggtttatac 1200
ttttgtttcc aaaaattaac cgatgctaat attttcctca ttctctacgg tgtcacctcc 1260
atttatttct ctggtgtaat ggtacgtctt atgttggttt tagcaccagt tgcatgtatt 1320
ttagccgccg ttgccgtcag tgcaaccctc accacctata tgaagaagtt aaaggctcca 1380
tcatcaccaa gtgatgctaa taattccaaa gagagtggtg gtgttatggt tgcagtctta 1440
actgttcttt taattctcta cgctttccat tgtacttggg tcactagtga agcctactca 1500
tctccatcca ttgtactctc tgccaaacaa aacgatggta gtcgtgtgat tttcgatgat 1560
ttccgtgaag cctaccgttg gattggtcaa aatactgccg acgacgctcg tattatgtct 1620
tggtgggatt atggttatca attatctgca atggccaatc gtaccgtatt ggttgataat 1680
aacacttgga acaatagtca tatcgctcaa gttggtaaag catttgcatc cactgaagaa 1740
gatgcttaca tacaaatgaa agcattggat gtcgattatg ttttagttat ttttggtggt 1800
ttaactggtt acagttctga tgatatcaat aaattccttt ggatggttag aattggtggt 1860
agttgtgatc caaatattaa agaacaagat tatctcacca atggtcaata tagaatagat 1920
aaaggtgcct caccaacaat gttaaattct ctcatgtaca aacttagtta ctatcgtttc 1980
tctgaagttc acactgacta tcaaagacca acaggtttcg atcgtgtaag aaatgttgaa 2040
attggtaata aaaatttcga tttaacttat ttagaagaag ctttcacatc tgttcattgg 2100
ttagttagag tttataaagt taaagatttt gataatagag cttaa 2145
<210> 32
<211> 206
<212> PRT
<213> 铜绿假单胞菌
<400> 32
Met Ser Leu Ala Ser Ser Leu Glu Ser Leu Arg Lys Ile Asp Ile Asn
1 5 10 15
Asp Leu Asp Leu Asn Asn Ile Gly Ser Trp Pro Ala Ala Val Lys Val
20 25 30
Ile Val Cys Val Leu Leu Thr Ala Ala Val Leu Ala Leu Gly Tyr Asn
35 40 45
Phe His Leu Ser Asp Met Gln Ala Gln Leu Glu Gln Gln Ala Ala Glu
50 55 60
Glu Glu Thr Leu Lys Gln Gln Phe Ser Thr Lys Ala Phe Gln Ala Ala
65 70 75 80
Asn Leu Glu Ala Tyr Lys Ala Gln Met Lys Glu Met Glu Glu Ser Phe
85 90 95
Gly Ala Leu Leu Arg Gln Leu Pro Ser Asp Thr Glu Val Pro Gly Leu
100 105 110
Leu Glu Asp Ile Thr Arg Thr Gly Leu Gly Ser Gly Leu Glu Phe Glu
115 120 125
Glu Ile Lys Leu Leu Pro Glu Val Ala Gln Gln Phe Tyr Ile Glu Leu
130 135 140
Pro Ile Gln Ile Ser Val Val Gly Gly Tyr His Asp Leu Ala Thr Val
145 150 155 160
Ser Gly Val Ser Ser Leu Pro Arg Ile Val Thr Leu His Asp Phe Glu
165 170 175
Ile Lys Pro Val Ala Pro Gly Ser Thr Ser Lys Leu Arg Met Ser Ile
180 185 190
Leu Ala Lys Thr Tyr Arg Tyr Asn Asp Lys Gly Leu Lys Lys
195 200 205
<210> 33
<211> 624
<212> DNA
<213> 铜绿假单胞菌
<400> 33
atgagtctgg ccagttccct ggaaagtctg cgcaagatcg atatcaacga tctcgacctg 60
aacaacatcg gttcctggcc ggcggcggtc aaggtcatcg tctgcgtgct gctgaccgcg 120
gcggtcctgg cgctgggcta caacttccat ctgagtgaca tgcaggctca gctcgaacag 180
caggccgcgg aagaggagac gctcaagcag cagttctcca ccaaggcctt ccaggccgcg 240
aacctggaag cctacaaggc acagatgaag gagatggaag agtcctttgg cgccttgctg 300
cggcagttgc ccagcgacac cgaggtaccc gggctgctcg aggacatcac tcgtaccggc 360
ctgggcagcg gcctggagtt cgaggaaatc aagctgcttc ccgaggttgc ccagcagttc 420
tacatcgagc tgccgatcca gatcagcgtg gtcggcggct accacgactt ggcgaccttc 480
gtcagcggcg tgtccagcct gccgcggatc gtcaccctgc atgacttcga gatcaagccg 540
gtcgcgcccg gcagcacgtc caagctgcgc atgagcatcc tggccaagac ctatcgctac 600
aacgacaagg ggctgaagaa atga 624
<210> 34
<211> 604
<212> PRT
<213> 脑膜炎双球菌
<400> 34
Met Pro Ala Glu Thr Thr Val Ser Gly Ala His Pro Ala Ala Lys Leu
1 5 10 15
Pro Ile Tyr Ile Leu Pro Cys Phe Leu Trp Ile Gly Ile Val Pro Phe
20 25 30
Thr Phe Ala Leu Lys Leu Lys Pro Ser Pro Asp Phe Tyr His Asp Ala
35 40 45
Ala Ala Ala Ala Gly Leu Ile Val Leu Leu Phe Leu Thr Ala Gly Lys
50 55 60
Lys Leu Phe Asp Val Lys Ile Pro Ala Ile Ser Phe Leu Leu Phe Ala
65 70 75 80
Met Ala Ala Phe Trp Tyr Leu Gln Ala Arg Leu Met Asn Leu Ile Tyr
85 90 95
Pro Gly Met Asn Asp Ile Val Ser Trp Ile Phe Ile Leu Leu Ala Val
100 105 110
Ser Ala Trp Ala Cys Arg Ser Leu Val Ala His Phe Gly Gln Glu Arg
115 120 125
Ile Val Thr Leu Phe Ala Trp Ser Leu Leu Ile Gly Ser Leu Leu Gln
130 135 140
Ser Cys Ile Val Val Ile Gln Phe Ala Gly Trp Glu Asp Thr Pro Leu
145 150 155 160
Phe Gln Asn Ile Ile Val Tyr Ser Gly Gln Gly Val Ile Gly His Ile
165 170 175
Gly Gln Arg Asn Asn Leu Gly His Tyr Leu Met Trp Gly Ile Leu Ala
180 185 190
Ala Ala Tyr Leu Asn Gly Gln Arg Lys Ile Pro Ala Ala Leu Gly Val
195 200 205
Ile Cys Leu Ile Met Gln Thr Ala Val Leu Gly Leu Val Asn Ser Arg
210 215 220
Thr Ile Leu Thr Tyr Ile Ala Ala Ile Ala Leu Ile Leu Pro Phe Trp
225 230 235 240
Tyr Phe Arg Ser Asp Lys Ser Asn Arg Arg Thr Met Leu Gly Ile Ala
245 250 255
Ala Ala Val Phe Leu Thr Ala Leu Phe Gln Phe Ser Met Asn Thr Ile
260 265 270
Leu Glu Thr Phe Thr Gly Ile Arg Tyr Glu Thr Ala Val Glu Arg Val
275 280 285
Ala Asn Gly Gly Phe Thr Asp Leu Pro Arg Gln Ile Glu Trp Asn Lys
290 295 300
Ala Leu Ala Ala Phe Gln Ser Ala Pro Ile Phe Gly His Gly Trp Asn
305 310 315 320
Ser Phe Ala Gln Gln Thr Phe Leu Ile Asn Ala Glu Gln His Asn Ile
325 330 335
Tyr Asp Asn Leu Leu Ser Asn Leu Phe Thr His Ser His Asn Ile Val
340 345 350
Leu Gln Leu Leu Ala Glu Met Gly Ile Ser Gly Thr Leu Leu Val Ala
355 360 365
Ala Thr Leu Leu Thr Gly Ile Ala Gly Leu Leu Lys Arg Pro Leu Thr
370 375 380
Pro Ala Ser Leu Phe Leu Ile Cys Thr Leu Ala Val Ser Met Cys His
385 390 395 400
Ser Met Leu Glu Tyr Pro Leu Trp Tyr Val Tyr Phe Leu Ile Pro Phe
405 410 415
Gly Leu Met Leu Phe Leu Ser Pro Ala Glu Ala Ser Asp Gly Ile Ala
420 425 430
Phe Lys Lys Ala Ala Asn Leu Gly Ile Leu Thr Ala Ser Ala Ala Ile
435 440 445
Phe Ala Gly Leu Leu His Leu Asp Trp Thr Tyr Thr Arg Leu Val Asn
450 455 460
Ala Phe Ser Pro Ala Thr Asp Asp Ser Ala Lys Thr Leu Asn Arg Lys
465 470 475 480
Ile Asn Glu Leu Arg Tyr Ile Ser Ala Asn Ser Pro Met Leu Ser Phe
485 490 495
Tyr Ala Asp Phe Ser Leu Val Asn Phe Ala Leu Pro Glu Tyr Pro Glu
500 505 510
Thr Gln Thr Trp Ala Glu Glu Ala Thr Leu Lys Ser Leu Lys Tyr Arg
515 520 525
Pro His Ser Ala Thr Tyr Arg Ile Ala Leu Tyr Leu Met Arg Gln Gly
530 535 540
Lys Val Ala Glu Ala Lys Gln Trp Met Arg Ala Thr Gln Ser Tyr Tyr
545 550 555 560
Pro Tyr Leu Met Pro Arg Tyr Ala Asp Glu Ile Arg Lys Leu Pro Val
565 570 575
Trp Ala Pro Leu Leu Pro Glu Leu Leu Lys Asp Cys Lys Ala Phe Ala
580 585 590
Ala Ala Pro Gly His Pro Glu Ala Lys Pro Cys Lys
595 600
<210> 35
<211> 1815
<212> DNA
<213> 脑膜炎双球菌
<400> 35
atgcccgctg aaacgaccgt atccggcgcg caccccgccg ccaaactgcc gatttacatc 60
ctgccctgct tcctttggat aggcatcgtc ccctttacct tcgcgctcaa actgaaaccg 120
tcgcccgact tttaccacga tgccgccgcc gcagccggcc tgattgtcct gttgttcctc 180
acggcaggaa aaaaactgtt tgatgtcaaa atccccgcca tcagcttcct tctgtttgca 240
atggcggcgt tttggtatct tcaggcacgc ctgatgaacc tgatttaccc cggtatgaac 300
gacatcgtct cttggatttt catcttgctc gccgtcagcg cgtgggcctg ccggagcttg 360
gtcgcacact tcggacaaga acgcatcgtg accctgtttg cctggtcgct gcttatcggc 420
tccctgcttc aatcctgcat cgtcgtcatc cagtttgccg gctgggaaga cacccctctg 480
tttcaaaaca tcatcgttta cagcgggcaa ggcgtaatcg gacacatcgg gcagcgcaac 540
aacctcggac actacctcat gtggggcata ctcgccgccg cctacctcaa cggacaacga 600
aaaatccccg ccgccctcgg cgtaatctgc ctgattatgc agaccgccgt tttaggtttg 660
gtcaactcgc gcaccatctt gacctacata gccgccatcg ccctcatcct tcccttctgg 720
tatttccgtt cggacaaatc caacaggcgg acgatgctcg gcatagccgc agccgtattc 780
cttaccgcgc tgttccaatt ttccatgaac accattctgg aaacctttac tggcatccgc 840
tacgaaactg ccgtcgaacg cgtcgccaac ggcggtttca cagacttgcc gcgccaaatc 900
gaatggaata aagcccttgc cgccttccag tccgccccga tattcgggca cggctggaac 960
agttttgccc aacaaacctt cctcatcaat gccgaacagc acaacatata cgacaacctc 1020
ctcagcaact tgttcaccca ttcccacaac atcgtcctcc aactccttgc agagatggga 1080
atcagcggca cgcttctggt tgccgcaacc ctgctgacgg gcattgccgg gctgcttaaa 1140
cgccccctga cccccgcatc gcttttccta atctgcacgc ttgccgtcag tatgtgccac 1200
agtatgctcg aatatccttt gtggtatgtc tatttcctca tccctttcgg actgatgctc 1260
ttcctgtccc ccgcagaggc ttcagacggc atcgccttca aaaaagccgc caatctcggc 1320
atactgaccg cctccgccgc catattcgca ggattgctgc acttggactg gacatacacc 1380
cggctggtta acgccttttc ccccgccact gacgacagtg ccaaaaccct caaccggaaa 1440
atcaacgagt tgcgctatat ttccgcaaac agtccgatgc tgtcctttta tgccgacttc 1500
tccctcgtaa acttcgccct gccggaatac cccgaaaccc agacttgggc ggaagaagca 1560
accctcaaat cactaaaata ccgcccccac tccgccacct accgcatcgc cctctacctg 1620
atgcggcaag gcaaagttgc agaagcaaaa caatggatgc gggcgacaca gtcctattac 1680
ccctacctga tgccccgata cgccgacgaa atccgcaaac tgcccgtatg ggcgccgctg 1740
ctacccgaac tgctcaaaga ctgcaaagcc ttcgccgccg cgcccggtca tccggaagca 1800
aaaccctgca aatga 1815
Claims (18)
1.一种用于生产糖基化蛋白的重组系统,所述系统包括:
适于合成糖蛋白靶点的试剂;
能够在重组系统中将真核生物聚糖由原核生物脂质载体分子转移至糖蛋白靶点的分离的原核生物寡糖转移酶;
一种或多种分离的真核生物聚糖,其中各真核生物聚糖均包含GlcNAc2核并与原核生物脂质载体分子连接;和
糖蛋白靶点,其包括一个或多个聚糖接受体氨基酸残基,或者编码所述糖蛋白靶点的核酸分子,其中所述试剂、所述分离的原核生物寡糖转移酶、所述一种或多种分离的真核生物聚糖和所述重组系统的糖蛋白靶点是无细胞的。
2.根据权利要求1所述的系统,其中所述原核生物寡糖转移酶来源于弯曲杆菌属(Campylobacter)。
3.根据权利要求1所述的系统,其中所述脂质载体分子包括十一碳二烯磷酸酯。
4.根据权利要求1所述的系统,其中所述真核生物聚糖进一步包括至少一个甘露糖残基。
5.根据权利要求1所述的系统,其中所述真核生物聚糖包括选自Man1GlcNAc2、Man2GlcNAc2和Man3GlcNAc2的组分。
6.根据权利要求1所述的系统,其中所述糖蛋白靶点的一个或多个聚糖接受体氨基酸残基是天冬酰胺残基。
7.根据权利要求6所述的系统,其中糖蛋白靶点进一步包括N-X1-S/T或D/E-X1-N-X2-S/T聚糖接受体氨基酸序列基序,其中D是天冬氨酸、E是谷氨酸、X1和X2是脯氨酸以外的任意氨基酸、N是天冬酰胺、S是丝氨酸和T是苏氨酸。
8.根据权利要求1所述的系统,其中所述糖蛋白靶点包括抗体。
9.一种包含重组系统的试剂盒,所述重组系统包括:
适于合成糖蛋白靶点的试剂;
能够在重组系统中将真核生物聚糖由原核生物脂质载体分子转移至糖蛋白靶点的分离的原核生物寡糖转移酶;
一种或多种分离的真核生物聚糖,其中各真核生物聚糖均包含GlcNAc2核并与原核生物脂质载体分子连接,其中所述试剂、所述分离的原核生物寡糖转移酶、所述一种或多种分离的真核生物聚糖和所述重组系统的糖蛋白靶点是无细胞的。
10.一种在重组系统中生产糖基化蛋白的方法,所述方法包括:
提供能够在重组系统中将真核生物聚糖由原核生物脂质载体分子转移至糖蛋白靶点的分离的原核生物寡糖转移酶;
提供一种或多种分离的真核生物聚糖,其中各真核生物聚糖均包含GlcNAc2核并与原核生物脂质载体分子连接,其中所述试剂、所述分离的原核生物寡糖转移酶、所述一种或多种分离的真核生物聚糖和所述重组系统的糖蛋白靶点是无细胞的;
将所述原核生物寡糖转移酶、所述一种或多种分离的真核生物聚糖和糖蛋白靶点组合以形成无细胞糖基化反应混合物;以及
将所述糖蛋白靶点置于使原核生物寡糖转移酶有效地促使真核生物聚糖由原核生物脂质载体分子转移至所述糖蛋白靶点的一个或多个聚糖接受体残基的条件下以产生糖基化蛋白。
11.根据权利要求10所述的方法,其中所述原核生物寡糖转移酶来源于弯曲杆菌属(Campylobacter)。
12.根据权利要求10所述的方法,其中所述脂质载体分子包括十一碳二烯磷酸酯。
13.根据权利要求10所述的方法,其中所述一种或多种真核生物聚糖进一步包括至少一个甘露糖残基。
14.根据权利要求10所述的方法,其中所述一种或多种真核生物聚糖包含选自Man1GlcNAc2、Man2GlcNAc2和Man3GlcNAc2的组分。
15.根据权利要求10所述的方法,其中所述生产糖蛋白靶点包括
提供适于由所述核酸分子合成糖蛋白靶点的试剂,以及
先于所述操作或与所述操作同时地将所述试剂在有效地从所述核酸分子合成所述糖蛋白靶点的条件下与糖基化反应混合。
16.根据权利要求10所述的方法,其中所述糖蛋白靶点的一种或多种聚糖接受体氨基酸残基是天冬酰胺残基。
17.根据权利要求16所述的方法,其中所述糖蛋白靶点进一步包括N-X1-S/T或D/E-X1-N-X2-S/T聚糖接受体氨基酸序列基序,其中D是天冬氨酸、E是谷氨酸、X1和X2是脯氨酸以外的任意氨基酸、N是天冬酰胺、S是丝氨酸和T是苏氨酸。
18.根据权利要求10所述的方法,其中所述蛋白包括抗体。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161555854P | 2011-11-04 | 2011-11-04 | |
US61/555,854 | 2011-11-04 | ||
CN201280066129.1A CN104080921A (zh) | 2011-11-04 | 2012-11-05 | 一种用于糖蛋白合成的基于原核生物的无细胞系统 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201280066129.1A Division CN104080921A (zh) | 2011-11-04 | 2012-11-05 | 一种用于糖蛋白合成的基于原核生物的无细胞系统 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112980907A true CN112980907A (zh) | 2021-06-18 |
Family
ID=48192910
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110180948.XA Pending CN112980907A (zh) | 2011-11-04 | 2012-11-05 | 一种用于糖蛋白合成的基于原核生物的无细胞系统 |
CN201280066129.1A Pending CN104080921A (zh) | 2011-11-04 | 2012-11-05 | 一种用于糖蛋白合成的基于原核生物的无细胞系统 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201280066129.1A Pending CN104080921A (zh) | 2011-11-04 | 2012-11-05 | 一种用于糖蛋白合成的基于原核生物的无细胞系统 |
Country Status (5)
Country | Link |
---|---|
US (2) | US11193154B2 (zh) |
CN (2) | CN112980907A (zh) |
HK (1) | HK1202896A1 (zh) |
IN (1) | IN2014CN04076A (zh) |
WO (1) | WO2013067523A1 (zh) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SG11201508347YA (en) * | 2013-03-14 | 2015-11-27 | Glycobia Inc | Oligosaccharide compositions, glycoproteins and methods to produce the same in prokaryotes |
US20160312312A1 (en) | 2013-12-06 | 2016-10-27 | President And Fellows Of Harvard College | Paper-based synthetic gene networks |
CN106574288A (zh) * | 2014-07-08 | 2017-04-19 | 泰克年研究发展基金会公司 | 用于无细胞转录和翻译的方法和试剂盒 |
WO2016107819A1 (en) | 2014-12-30 | 2016-07-07 | Glycovaxyn Ag | Compositions and methods for protein glycosylation |
US10265391B2 (en) | 2015-02-26 | 2019-04-23 | Vaxnewmo Llc | Acinetobacter O-oligosaccharyltransferases and uses thereof |
CN106478773B (zh) * | 2015-08-25 | 2021-09-14 | 三生国健药业(上海)股份有限公司 | 一种人工合成的新型信号肽 |
WO2017117539A1 (en) | 2015-12-30 | 2017-07-06 | Northwestern University | Cell-free glycoprotein synthesis (cfgps) in prokaryotic cell lysates enriched with components for glycosylation |
US10829795B2 (en) | 2016-07-14 | 2020-11-10 | Northwestern University | Method for rapid in vitro synthesis of glycoproteins via recombinant production of N-glycosylated proteins in prokaryotic cell lysates |
WO2019035916A1 (en) | 2017-08-15 | 2019-02-21 | Northwestern University | DESIGN OF PROTEIN GLYCOSYLATION SITES BY RAPID EXPRESSION AND CHARACTERIZATION OF N-GLYCOSYLTRANSFERASES |
US11530432B2 (en) | 2018-03-19 | 2022-12-20 | Northwestern University | Compositions and methods for rapid in vitro synthesis of bioconjugate vaccines in vitro via production and N-glycosylation of protein carriers in detoxified prokaryotic cell lysates |
US11725224B2 (en) | 2018-04-16 | 2023-08-15 | Northwestern University | Methods for co-activating in vitro non-standard amino acid (nsAA) incorporation and glycosylation in crude cell lysates |
AU2019287659A1 (en) | 2018-06-16 | 2021-01-07 | Vaxnewmo Llc | Glycosylated ComP pilin variants, methods of making and uses thereof |
CA3149430A1 (en) | 2019-08-09 | 2021-02-18 | Glaxosmithkline Biologicals Sa | Mutated pglb oligosaccharyltransferase enzymes |
KR20220088473A (ko) * | 2019-10-25 | 2022-06-27 | 노쓰웨스턴유니버시티 | 막 소포의 농축 및 증가된 당단백질 수율을 위한 무세포 추출물 제조 프로토콜 |
CN115181752B (zh) * | 2022-07-12 | 2024-12-24 | 大连大学 | 一种糖链质粒优化提高修饰蛋白质效率以及蛋白表达量的方法 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101960017A (zh) * | 2008-01-03 | 2011-01-26 | 康乃尔研究基金会有限公司 | 原核生物中的糖基化蛋白表达 |
CN102037004A (zh) * | 2008-01-08 | 2011-04-27 | 生物种属学股份公司 | 使用寡糖基转移酶的多肽的糖缀合 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4061043B2 (ja) | 2000-12-28 | 2008-03-12 | 株式会社ポストゲノム研究所 | invitro転写/翻訳系によるペプチド等の製造方法 |
KR20090110951A (ko) | 2002-03-07 | 2009-10-23 | 아이드게노쉬쉐 테흐니쉐 호흐슐레 쥬리히 | 원핵 숙주에서의 재조합 글리코실화 단백질 생산 방법 및 생산계 |
JP4590249B2 (ja) * | 2004-11-17 | 2010-12-01 | 独立行政法人理化学研究所 | 糖タンパク質合成用の無細胞タンパク質合成システム |
-
2012
- 2012-11-05 CN CN202110180948.XA patent/CN112980907A/zh active Pending
- 2012-11-05 WO PCT/US2012/063590 patent/WO2013067523A1/en active Application Filing
- 2012-11-05 CN CN201280066129.1A patent/CN104080921A/zh active Pending
- 2012-11-05 US US14/356,258 patent/US11193154B2/en active Active
-
2014
- 2014-05-30 IN IN4076CHN2014 patent/IN2014CN04076A/en unknown
-
2015
- 2015-03-31 HK HK15103270.8A patent/HK1202896A1/zh unknown
-
2021
- 2021-12-06 US US17/543,614 patent/US20220340947A1/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101960017A (zh) * | 2008-01-03 | 2011-01-26 | 康乃尔研究基金会有限公司 | 原核生物中的糖基化蛋白表达 |
CN102037004A (zh) * | 2008-01-08 | 2011-04-27 | 生物种属学股份公司 | 使用寡糖基转移酶的多肽的糖缀合 |
Non-Patent Citations (6)
Title |
---|
ADAM C.FISHER等: "Production of Secretory and Extracellular N-Linked Glycoproteins in Escherichia coli", 《APPLIED AND ENVIRONMENTAL MICROBIOLOGY》 * |
MARK M. CHEN等: "From Peptide to Protein: Comparative Analysis of the Substrate Specificity of N-Linked Glycosylation in C. jejuni", 《BIOCHEMISTRY》 * |
MICHAEL KOWARIK等: "Definition of the bacterial N-glycosylation site consensus sequence", 《THE EMBO JOURNAL》 * |
MICHAEL KOWARIK等: "N-Linked Glycosylation of Folded Proteins by the Bacterial Oligosaccharyltransferase", 《SCIENCE》 * |
NOBUO MAITA等: "Comparative Structural Biology of Eubacterial and Archaeal Oligosaccharyltransferases", 《THE JOURNAL OF BIOLOGICAL CHEMISTRY》 * |
王鸿利 等: "《医学实验技术的理论与应用》", 30 November 2004, 上海科技教育出版社 * |
Also Published As
Publication number | Publication date |
---|---|
US11193154B2 (en) | 2021-12-07 |
WO2013067523A1 (en) | 2013-05-10 |
US20220340947A1 (en) | 2022-10-27 |
HK1202896A1 (zh) | 2015-10-09 |
US20140255987A1 (en) | 2014-09-11 |
IN2014CN04076A (zh) | 2015-10-23 |
CN104080921A (zh) | 2014-10-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220340947A1 (en) | Prokaryote-based cell-free system for the synthesis of glycoproteins | |
Zhou et al. | Expression of heparan sulfate sulfotransferases in Kluyveromyces lactis and preparation of 3′-phosphoadenosine-5′-phosphosulfate | |
US9809835B2 (en) | Quantitative control of sialylation | |
US11788108B2 (en) | CMP-dependent sialidase activity | |
EP3017057B1 (en) | Process for the mono- and bi-sialylation of glycoproteins employing n-terminally truncated beta-galactoside alpha-2,6-sialyltransferase mutants | |
EP3017041B1 (en) | N-terminally truncated glycosyltransferases | |
JPWO2008108325A1 (ja) | 新規なβ−ガラクトシド−α2,6−シアル酸転移酵素、それをコードする遺伝子および酵素活性を向上させる方法 | |
US20170204381A1 (en) | Pmst1 mutants for chemoenzymatic synthesis of sialyl lewis x compounds | |
WO2010143713A1 (ja) | 新規タンパク質およびそれをコードする遺伝子 | |
US9783838B2 (en) | PmST3 enzyme for chemoenzymatic synthesis of alpha-2-3-sialosides | |
US9938510B2 (en) | Photobacterium sp. alpha-2-6-sialyltransferase variants | |
TW202330913A (zh) | 突變的磺基轉移酶及其用途 | |
WO2012014980A1 (ja) | 新規酵素タンパク質、当該酵素タンパク質の製造方法及び当該酵素タンパク質をコードする遺伝子 | |
EP4265730A1 (en) | Cell-free enzymatic method for preparation of n-glycans | |
US9102967B2 (en) | PmST2 enzyme for chemoenzymatic synthesis of α-2-3-sialylglycolipids | |
WO2023202991A2 (en) | Cell-free enzymatic method for preparation of n-glycans | |
JP2011223885A (ja) | 新規なシチジン5’−モノホスホシアル酸合成酵素、それをコードする遺伝子およびその製造方法 | |
EP3037527A1 (en) | Sialyltransferase without CMP-dependent sialidase activity | |
Saxena | Corynebacterium glutamicum: A Platform For Studying Actinobacterial Protein-O-Mannosylation And High-Yield Heterologous Protein Production | |
TW202332766A (zh) | 用突變的芳基磺基轉移酶硫酸化受質的用途和方法 | |
JP4977125B2 (ja) | 新規なβ−ガラクトシド−α2,6−シアル酸転移酵素、それをコードする遺伝子およびその製造方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 40055774 Country of ref document: HK |