CA2346218A1 - Tuberculosis vaccine and diagnostic reagents based on antigens from the Mycobacterium tuberculosis cell
- Publication number
- CA2346218A1
- Authority
- CA
- Canada
- Prior art keywords
- ala
- val
- leu
- gly
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
- 239000003153 chemical reaction reagent Substances 0.000 title claims description 26
- 239000000427 antigen Substances 0.000 title description 32
- 108091007433 antigens Proteins 0.000 title description 31
- 102000036639 antigens Human genes 0.000 title description 31
- 241000187479 Mycobacterium tuberculosis Species 0.000 title description 8
- 229960002109 tuberculosis vaccine Drugs 0.000 title description 2
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 270
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 266
- 229920001184 polypeptide Polymers 0.000 claims abstract description 256
- 150000001413 amino acids Chemical class 0.000 claims abstract description 87
- 241000186359 Mycobacterium Species 0.000 claims abstract description 59
- 108091028043 Nucleic acid sequence Proteins 0.000 claims abstract description 25
- 210000001744 T-lymphocyte Anatomy 0.000 claims abstract description 21
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 20
- 239000003814 drug Substances 0.000 claims abstract description 6
- 201000008827 tuberculosis Diseases 0.000 claims description 230
- 108090000623 proteins and genes Proteins 0.000 claims description 158
- 102000004169 proteins and genes Human genes 0.000 claims description 134
- 108020004414 DNA Proteins 0.000 claims description 84
- 230000004044 response Effects 0.000 claims description 57
- 125000003729 nucleotide group Chemical group 0.000 claims description 50
- 210000004027 cell Anatomy 0.000 claims description 49
- 208000015181 infectious disease Diseases 0.000 claims description 49
- 239000002773 nucleotide Substances 0.000 claims description 49
- 238000000034 method Methods 0.000 claims description 48
- 239000000725 suspension Substances 0.000 claims description 44
- 210000000172 cytosol Anatomy 0.000 claims description 40
- 239000000203 mixture Substances 0.000 claims description 34
- 210000003819 peripheral blood mononuclear cell Anatomy 0.000 claims description 34
- 239000006228 supernatant Substances 0.000 claims description 32
- 210000002421 cell wall Anatomy 0.000 claims description 27
- 239000012634 fragment Substances 0.000 claims description 27
- CSCPPACGZOOCGX-UHFFFAOYSA-N Acetone Chemical compound CC(C)=O CSCPPACGZOOCGX-UHFFFAOYSA-N 0.000 claims description 26
- 150000007523 nucleic acids Chemical group 0.000 claims description 25
- 241000894006 Bacteria Species 0.000 claims description 23
- 241001465754 Metazoa Species 0.000 claims description 23
- 239000002671 adjuvant Substances 0.000 claims description 22
- 238000002360 preparation method Methods 0.000 claims description 21
- 230000001681 protective effect Effects 0.000 claims description 19
- 238000002255 vaccination Methods 0.000 claims description 19
- 230000028993 immune response Effects 0.000 claims description 18
- 239000008188 pellet Substances 0.000 claims description 17
- 238000002347 injection Methods 0.000 claims description 16
- 239000007924 injection Substances 0.000 claims description 16
- 210000000170 cell membrane Anatomy 0.000 claims description 15
- 230000036039 immunity Effects 0.000 claims description 15
- 230000006698 induction Effects 0.000 claims description 15
- 238000004519 manufacturing process Methods 0.000 claims description 15
- 239000000126 substance Substances 0.000 claims description 14
- 238000000338 in vitro Methods 0.000 claims description 13
- 210000000952 spleen Anatomy 0.000 claims description 13
- 238000009396 hybridization Methods 0.000 claims description 12
- 239000008280 blood Substances 0.000 claims description 11
- 241000124008 Mammalia Species 0.000 claims description 10
- 210000004369 blood Anatomy 0.000 claims description 10
- 238000003745 diagnosis Methods 0.000 claims description 10
- 238000001262 western blot Methods 0.000 claims description 10
- 238000002965 ELISA Methods 0.000 claims description 9
- 230000000694 effects Effects 0.000 claims description 9
- 244000005700 microbiome Species 0.000 claims description 8
- 229920001213 Polysorbate 20 Polymers 0.000 claims description 7
- 210000003719 b-lymphocyte Anatomy 0.000 claims description 7
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 claims description 7
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 claims description 7
- 230000002147 killing effect Effects 0.000 claims description 6
- 239000000443 aerosol Substances 0.000 claims description 5
- 230000000295 complement effect Effects 0.000 claims description 5
- 239000000902 placebo Substances 0.000 claims description 5
- 229940068196 placebo Drugs 0.000 claims description 5
- 230000001376 precipitating effect Effects 0.000 claims description 5
- 230000005764 inhibitory process Effects 0.000 claims description 4
- 210000000056 organ Anatomy 0.000 claims description 4
- 238000012216 screening Methods 0.000 claims description 4
- 210000003071 memory t lymphocyte Anatomy 0.000 claims description 3
- 230000000069 prophylactic effect Effects 0.000 claims description 3
- 229940124597 therapeutic agent Drugs 0.000 claims description 3
- 230000005875 antibody response Effects 0.000 claims description 2
- 230000000903 blocking effect Effects 0.000 claims description 2
- 238000003018 immunoassay Methods 0.000 claims description 2
- 230000002401 inhibitory effect Effects 0.000 claims description 2
- 230000002452 interceptive effect Effects 0.000 claims description 2
- 230000001323 posttranslational effect Effects 0.000 claims description 2
- 238000012545 processing Methods 0.000 claims description 2
- 230000001105 regulatory effect Effects 0.000 claims description 2
- 230000009870 specific binding Effects 0.000 claims description 2
- 238000013519 translation Methods 0.000 claims description 2
- 239000003981 vehicle Substances 0.000 claims description 2
- 239000000306 component Substances 0.000 claims 2
- 239000003937 drug carrier Substances 0.000 claims 1
- 230000003993 interaction Effects 0.000 claims 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 claims 1
- 238000013518 transcription Methods 0.000 claims 1
- 230000035897 transcription Effects 0.000 claims 1
- 229960005486 vaccine Drugs 0.000 abstract description 34
- 239000000032 diagnostic agent Substances 0.000 abstract 1
- 229940039227 diagnostic agent Drugs 0.000 abstract 1
- 235000018102 proteins Nutrition 0.000 description 131
- 235000001014 amino acid Nutrition 0.000 description 85
- 238000010367 cloning Methods 0.000 description 45
- 108010005233 alanylglutamic acid Proteins 0.000 description 32
- 239000002953 phosphate buffered saline Substances 0.000 description 30
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 30
- 241000699670 Mus sp. Species 0.000 description 29
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 29
- 239000012528 membrane Substances 0.000 description 28
- 238000007792 addition Methods 0.000 description 27
- 239000000523 sample Substances 0.000 description 26
- 108010047495 alanylglycine Proteins 0.000 description 23
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 23
- 239000013598 vector Substances 0.000 description 23
- 108700026244 Open Reading Frames Proteins 0.000 description 22
- 108010049041 glutamylalanine Proteins 0.000 description 22
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 20
- 238000012360 testing method Methods 0.000 description 20
- 238000006243 chemical reaction Methods 0.000 description 19
- 108010050848 glycylleucine Proteins 0.000 description 19
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 18
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 18
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 17
- 241000880493 Leptailurus serval Species 0.000 description 16
- 108010037850 glycylvaline Proteins 0.000 description 16
- 230000001900 immune effect Effects 0.000 description 16
- 210000002966 serum Anatomy 0.000 description 16
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 15
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 15
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 15
- 108010038633 aspartylglutamate Proteins 0.000 description 15
- 239000000499 gel Substances 0.000 description 15
- 238000002649 immunization Methods 0.000 description 15
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 14
- 238000012163 sequencing technique Methods 0.000 description 14
- 230000000638 stimulation Effects 0.000 description 14
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 13
- 241000700198 Cavia Species 0.000 description 13
- 239000002033 PVDF binder Substances 0.000 description 13
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 13
- 108090000695 Cytokines Proteins 0.000 description 12
- 102000004127 Cytokines Human genes 0.000 description 12
- 241000699666 Mus <mouse, genus> Species 0.000 description 12
- 108010041407 alanylaspartic acid Proteins 0.000 description 12
- 108010089804 glycyl-threonine Proteins 0.000 description 12
- 241000894007 species Species 0.000 description 12
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 11
- 241000186367 Mycobacterium avium Species 0.000 description 11
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 11
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 11
- 238000010171 animal model Methods 0.000 description 11
- 108010008355 arginyl-glutamine Proteins 0.000 description 11
- 230000002163 immunogen Effects 0.000 description 11
- 241000282326 Felis catus Species 0.000 description 10
- 108010087924 alanylproline Proteins 0.000 description 10
- 108010068380 arginylarginine Proteins 0.000 description 10
- 108010062796 arginyllysine Proteins 0.000 description 10
- 238000005119 centrifugation Methods 0.000 description 10
- 108010070643 prolylglutamic acid Proteins 0.000 description 10
- 108010029020 prolylglycine Proteins 0.000 description 10
- 108010073969 valyllysine Proteins 0.000 description 10
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 9
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 9
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 9
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 9
- 125000000539 amino acid group Chemical group 0.000 description 9
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 238000003556 assay Methods 0.000 description 9
- 239000000872 buffer Substances 0.000 description 9
- 108010061238 threonyl-glycine Proteins 0.000 description 9
- WDIYWDJLXOCGRW-ACZMJKKPSA-N Ala-Asp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WDIYWDJLXOCGRW-ACZMJKKPSA-N 0.000 description 8
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 8
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 8
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 8
- 208000020545 Exposure to communicable disease Diseases 0.000 description 8
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 8
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 8
- 108010044940 alanylglutamine Proteins 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 8
- 238000001962 electrophoresis Methods 0.000 description 8
- 239000013604 expression vector Substances 0.000 description 8
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 8
- 150000003839 salts Chemical group 0.000 description 8
- IDOQDZANRZQBTP-UHFFFAOYSA-N 2-[2-(2,4,4-trimethylpentan-2-yl)phenoxy]ethanol Chemical compound CC(C)(C)CC(C)(C)C1=CC=CC=C1OCCO IDOQDZANRZQBTP-UHFFFAOYSA-N 0.000 description 7
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 7
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 7
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 7
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 7
- UXJCMQFPDWCHKX-DCAQKATOSA-N Arg-Arg-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UXJCMQFPDWCHKX-DCAQKATOSA-N 0.000 description 7
- 241000588724 Escherichia coli Species 0.000 description 7
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 7
- VYZAGTDAHUIRQA-WHFBIAKZSA-N L-alanyl-L-glutamic acid Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O VYZAGTDAHUIRQA-WHFBIAKZSA-N 0.000 description 7
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 7
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 7
- 241000186362 Mycobacterium leprae Species 0.000 description 7
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 7
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 7
- 229920004929 Triton X-114 Polymers 0.000 description 7
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 7
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 108010060035 arginylproline Proteins 0.000 description 7
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 7
- 108010000761 leucylarginine Proteins 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 108010053725 prolylvaline Proteins 0.000 description 7
- 208000027930 type IV hypersensitivity disease Diseases 0.000 description 7
- 108010020532 tyrosyl-proline Proteins 0.000 description 7
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 6
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 6
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 6
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 6
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 6
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 6
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 6
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 6
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 6
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 6
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 6
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 6
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 6
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 6
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 6
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 6
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 6
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 6
- 108010011559 alanylphenylalanine Proteins 0.000 description 6
- -1 animal model Proteins 0.000 description 6
- 108010077245 asparaginyl-proline Proteins 0.000 description 6
- 108010093581 aspartyl-proline Proteins 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 238000012217 deletion Methods 0.000 description 6
- 230000037430 deletion Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 238000000605 extraction Methods 0.000 description 6
- 108010078144 glutaminyl-glycine Proteins 0.000 description 6
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 6
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 239000000243 solution Substances 0.000 description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 5
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 5
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 5
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 5
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 5
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 5
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 5
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 5
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 5
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 5
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 5
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 5
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 5
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 5
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 5
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 5
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 5
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 5
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 5
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 5
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 5
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 5
- SENJXOPIZNYLHU-IUCAKERBSA-N Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-IUCAKERBSA-N 0.000 description 5
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 5
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 5
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 5
- 241001646725 Mycobacterium tuberculosis H37Rv Species 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 5
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 5
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 5
- NDXSOKGYKCGYKT-VEVYYDQMSA-N Thr-Pro-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O NDXSOKGYKCGYKT-VEVYYDQMSA-N 0.000 description 5
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 5
- XQVRMLRMTAGSFJ-QXEWZRGKSA-N Val-Asp-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XQVRMLRMTAGSFJ-QXEWZRGKSA-N 0.000 description 5
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 5
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 5
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 5
- 239000004480 active ingredient Substances 0.000 description 5
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 5
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 5
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 5
- 108010036533 arginylvaline Proteins 0.000 description 5
- 108010092854 aspartyllysine Proteins 0.000 description 5
- 238000009472 formulation Methods 0.000 description 5
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 5
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 5
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 5
- 108010015792 glycyllysine Proteins 0.000 description 5
- 108010077515 glycylproline Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 210000004072 lung Anatomy 0.000 description 5
- 108010009298 lysylglutamic acid Proteins 0.000 description 5
- 108010084572 phenylalanyl-valine Proteins 0.000 description 5
- 108010012581 phenylalanylglutamate Proteins 0.000 description 5
- 239000008363 phosphate buffer Substances 0.000 description 5
- 239000011347 resin Substances 0.000 description 5
- 229920005989 resin Polymers 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- 230000000392 somatic effect Effects 0.000 description 5
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 4
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 4
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- PIPTUBPKYFRLCP-NHCYSSNCSA-N Ala-Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PIPTUBPKYFRLCP-NHCYSSNCSA-N 0.000 description 4
- SITWEMZOJNKJCH-WDSKDSINSA-N Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SITWEMZOJNKJCH-WDSKDSINSA-N 0.000 description 4
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 4
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 4
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 4
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 4
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 4
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 4
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 4
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 4
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 4
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 4
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 4
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 4
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 4
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 4
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 4
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 4
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 4
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 4
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 4
- KOSRFJWDECSPRO-WDSKDSINSA-N Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(O)=O KOSRFJWDECSPRO-WDSKDSINSA-N 0.000 description 4
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 4
- QCMVGXDELYMZET-GLLZPBPUSA-N Glu-Thr-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QCMVGXDELYMZET-GLLZPBPUSA-N 0.000 description 4
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 4
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 4
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 4
- HKSNHPVETYYJBK-LAEOZQHASA-N Gly-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN HKSNHPVETYYJBK-LAEOZQHASA-N 0.000 description 4
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 4
- DNVDEMWIYLVIQU-RCOVLWMOSA-N Gly-Val-Asp Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O DNVDEMWIYLVIQU-RCOVLWMOSA-N 0.000 description 4
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- BOTVMTSMOUSDRW-GMOBBJLQSA-N Ile-Arg-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O BOTVMTSMOUSDRW-GMOBBJLQSA-N 0.000 description 4
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 4
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 4
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 4
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 4
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 4
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 4
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 4
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 4
- KAKJTZWHIUWTTD-VQVTYTSYSA-N Met-Thr Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)O)C([O-])=O KAKJTZWHIUWTTD-VQVTYTSYSA-N 0.000 description 4
- 239000000020 Nitrocellulose Substances 0.000 description 4
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 4
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 4
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 4
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 4
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 4
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 4
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 4
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 4
- HSRXSKHRSXRCFC-WDSKDSINSA-N Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(O)=O HSRXSKHRSXRCFC-WDSKDSINSA-N 0.000 description 4
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 4
- UPJONISHZRADBH-XPUUQOCRSA-N Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UPJONISHZRADBH-XPUUQOCRSA-N 0.000 description 4
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 4
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 4
- XCTHZFGSVQBHBW-IUCAKERBSA-N Val-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C XCTHZFGSVQBHBW-IUCAKERBSA-N 0.000 description 4
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 4
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 4
- YDVDTCJGBBJGRT-GUBZILKMSA-N Val-Met-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N YDVDTCJGBBJGRT-GUBZILKMSA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 4
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 4
- OWFGFHQMSBTKLX-UFYCRDLUSA-N Val-Tyr-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N OWFGFHQMSBTKLX-UFYCRDLUSA-N 0.000 description 4
- KRNYOVHEKOBTEF-YUMQZZPRSA-N Val-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O KRNYOVHEKOBTEF-YUMQZZPRSA-N 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000008827 biological function Effects 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- 238000010828 elution Methods 0.000 description 4
- 238000002474 experimental method Methods 0.000 description 4
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 4
- 108010079547 glutamylmethionine Proteins 0.000 description 4
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 4
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 4
- 108010010147 glycylglutamine Proteins 0.000 description 4
- 108010081551 glycylphenylalanine Proteins 0.000 description 4
- 108010040030 histidinoalanine Proteins 0.000 description 4
- 108010036413 histidylglycine Proteins 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 238000011534 incubation Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 4
- 230000007774 longterm Effects 0.000 description 4
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 4
- 108010064235 lysylglycine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 230000015654 memory Effects 0.000 description 4
- 238000000386 microscopy Methods 0.000 description 4
- 238000010172 mouse model Methods 0.000 description 4
- 229920001220 nitrocellulos Polymers 0.000 description 4
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 4
- 108010024607 phenylalanylalanine Proteins 0.000 description 4
- 230000035755 proliferation Effects 0.000 description 4
- 108010090894 prolylleucine Proteins 0.000 description 4
- 230000010076 replication Effects 0.000 description 4
- 231100000430 skin reaction Toxicity 0.000 description 4
- 238000006467 substitution reaction Methods 0.000 description 4
- 238000000539 two dimensional gel electrophoresis Methods 0.000 description 4
- 108010036320 valylleucine Proteins 0.000 description 4
- 108010021889 valylvaline Proteins 0.000 description 4
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 3
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 3
- ZEXDYVGDZJBRMO-ACZMJKKPSA-N Ala-Asn-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZEXDYVGDZJBRMO-ACZMJKKPSA-N 0.000 description 3
- XAEWTDMGFGHWFK-IMJSIDKUSA-N Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O XAEWTDMGFGHWFK-IMJSIDKUSA-N 0.000 description 3
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 3
- MKZCBYZBCINNJN-DLOVCJGASA-N Ala-Asp-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MKZCBYZBCINNJN-DLOVCJGASA-N 0.000 description 3
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 3
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 3
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- VWEWCZSUWOEEFM-WDSKDSINSA-N Ala-Gly-Ala-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(O)=O VWEWCZSUWOEEFM-WDSKDSINSA-N 0.000 description 3
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 3
- PNALXAODQKTNLV-JBDRJPRFSA-N Ala-Ile-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O PNALXAODQKTNLV-JBDRJPRFSA-N 0.000 description 3
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 3
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 3
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 3
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 3
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 3
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 3
- GKAZXNDATBWNBI-DCAQKATOSA-N Ala-Met-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)O)N GKAZXNDATBWNBI-DCAQKATOSA-N 0.000 description 3
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 3
- OEVCHROQUIVQFZ-YTLHQDLWSA-N Ala-Thr-Ala Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O OEVCHROQUIVQFZ-YTLHQDLWSA-N 0.000 description 3
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 3
- AENHOIXXHKNIQL-AUTRQRHGSA-N Ala-Tyr-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H]([NH3+])C)CC1=CC=C(O)C=C1 AENHOIXXHKNIQL-AUTRQRHGSA-N 0.000 description 3
- SIFXMYAHXJGAFC-WDSKDSINSA-N Arg-Asp Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O SIFXMYAHXJGAFC-WDSKDSINSA-N 0.000 description 3
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 3
- DXQIQUIQYAGRCC-CIUDSAMLSA-N Arg-Asp-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)CN=C(N)N DXQIQUIQYAGRCC-CIUDSAMLSA-N 0.000 description 3
- PMGDADKJMCOXHX-BQBZGAKWSA-N Arg-Gln Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PMGDADKJMCOXHX-BQBZGAKWSA-N 0.000 description 3
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 3
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 3
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 3
- WYBVBIHNJWOLCJ-IUCAKERBSA-N Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N WYBVBIHNJWOLCJ-IUCAKERBSA-N 0.000 description 3
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 3
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 3
- SJUXYGVRSGTPMC-IMJSIDKUSA-N Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O SJUXYGVRSGTPMC-IMJSIDKUSA-N 0.000 description 3
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 3
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 3
- KLKHFFMNGWULBN-VKHMYHEASA-N Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)NCC(O)=O KLKHFFMNGWULBN-VKHMYHEASA-N 0.000 description 3
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 3
- NCFJQJRLQJEECD-NHCYSSNCSA-N Asn-Leu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O NCFJQJRLQJEECD-NHCYSSNCSA-N 0.000 description 3
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 3
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 3
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 3
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 3
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 3
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 3
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 3
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 3
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 3
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 3
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 3
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 3
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 3
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 3
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 3
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 3
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 3
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 3
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 3
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 3
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 3
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 3
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 3
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 3
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 3
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 3
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 3
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 3
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 3
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 3
- JLXVRFDTDUGQEE-YFKPBYRVSA-N Gly-Arg Chemical compound NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N JLXVRFDTDUGQEE-YFKPBYRVSA-N 0.000 description 3
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 3
- VXKCPBPQEKKERH-IUCAKERBSA-N Gly-Arg-Pro Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N1CCC[C@H]1C(O)=O VXKCPBPQEKKERH-IUCAKERBSA-N 0.000 description 3
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 3
- JMQFHZWESBGPFC-WDSKDSINSA-N Gly-Gln-Asp Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JMQFHZWESBGPFC-WDSKDSINSA-N 0.000 description 3
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 3
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 3
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 3
- OLIFSFOFKGKIRH-WUJLRWPWSA-N Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CN OLIFSFOFKGKIRH-WUJLRWPWSA-N 0.000 description 3
- FOKISINOENBSDM-WLTAIBSBSA-N Gly-Thr-Tyr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FOKISINOENBSDM-WLTAIBSBSA-N 0.000 description 3
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 3
- RYAOJUMWLWUGNW-QMMMGPOBSA-N Gly-Val-Gly Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O RYAOJUMWLWUGNW-QMMMGPOBSA-N 0.000 description 3
- FULZDMOZUZKGQU-ONGXEEELSA-N Gly-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN FULZDMOZUZKGQU-ONGXEEELSA-N 0.000 description 3
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 3
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 3
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 3
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 3
- AQCUAZTZSPQJFF-ZKWXMUAHSA-N Ile-Ala-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O AQCUAZTZSPQJFF-ZKWXMUAHSA-N 0.000 description 3
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 3
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 3
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 3
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 3
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 3
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 3
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 3
- 108010065920 Insulin Lispro Proteins 0.000 description 3
- QOOWRKBDDXQRHC-BQBZGAKWSA-N L-lysyl-L-alanine Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN QOOWRKBDDXQRHC-BQBZGAKWSA-N 0.000 description 3
- ZRLUISBDKUWAIZ-CIUDSAMLSA-N Leu-Ala-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O ZRLUISBDKUWAIZ-CIUDSAMLSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- JYOAXOMPIXKMKK-YUMQZZPRSA-N Leu-Gln Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CCC(N)=O JYOAXOMPIXKMKK-YUMQZZPRSA-N 0.000 description 3
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 3
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 3
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 3
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 3
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 3
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 3
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 3
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 3
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 3
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 3
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 3
- OSZTUONKUMCWEP-XUXIUFHCSA-N Met-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC OSZTUONKUMCWEP-XUXIUFHCSA-N 0.000 description 3
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 3
- 241001467552 Mycobacterium bovis BCG Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 3
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 3
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 3
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 3
- 241000288906 Primates Species 0.000 description 3
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 3
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 3
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 3
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 3
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 3
- PKHDJFHFMGQMPS-RCWTZXSCSA-N Pro-Thr-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKHDJFHFMGQMPS-RCWTZXSCSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 3
- 241000589516 Pseudomonas Species 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 3
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 3
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 3
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 3
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 3
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 3
- LAFKUZYWNCHOHT-WHFBIAKZSA-N Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O LAFKUZYWNCHOHT-WHFBIAKZSA-N 0.000 description 3
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 3
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 3
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 3
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 3
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 3
- 241000187432 Streptomyces coelicolor Species 0.000 description 3
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 3
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 3
- JEDIEMIJYSRUBB-FOHZUACHSA-N Thr-Asp-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O JEDIEMIJYSRUBB-FOHZUACHSA-N 0.000 description 3
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 3
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 3
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 3
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 3
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 3
- DSGIVWSDDRDJIO-ZXXMMSQZSA-N Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DSGIVWSDDRDJIO-ZXXMMSQZSA-N 0.000 description 3
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 3
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 3
- DLZKEQQWXODGGZ-KWQFWETISA-N Tyr-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DLZKEQQWXODGGZ-KWQFWETISA-N 0.000 description 3
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 3
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 3
- JAQGKXUEKGKTKX-HOTGVXAUSA-N Tyr-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 JAQGKXUEKGKTKX-HOTGVXAUSA-N 0.000 description 3
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 3
- PFNZJEPSCBAVGX-CYDGBPFRSA-N Val-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N PFNZJEPSCBAVGX-CYDGBPFRSA-N 0.000 description 3
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 3
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 3
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 3
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 3
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 3
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 3
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 3
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 3
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 3
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 3
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 3
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 3
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 3
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 3
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 3
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 3
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 3
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 3
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 3
- 108010017893 alanyl-alanyl-alanine Proteins 0.000 description 3
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 3
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 3
- 239000003926 antimycobacterial agent Substances 0.000 description 3
- 239000008346 aqueous phase Substances 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 108091006004 biotinylated proteins Proteins 0.000 description 3
- 210000000601 blood cell Anatomy 0.000 description 3
- 239000002299 complementary DNA Substances 0.000 description 3
- 239000012228 culture supernatant Substances 0.000 description 3
- 238000010790 dilution Methods 0.000 description 3
- 239000012895 dilution Substances 0.000 description 3
- 239000002158 endotoxin Substances 0.000 description 3
- 101150079015 esxB gene Proteins 0.000 description 3
- 238000011156 evaluation Methods 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 3
- 108010087823 glycyltyrosine Proteins 0.000 description 3
- STKYPAFSDFAEPH-LURJTMIESA-N glycylvaline Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CN STKYPAFSDFAEPH-LURJTMIESA-N 0.000 description 3
- 108010025306 histidylleucine Proteins 0.000 description 3
- 108010085325 histidylproline Proteins 0.000 description 3
- 210000000987 immune system Anatomy 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 239000006166 lysate Substances 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- 239000000546 pharmaceutical excipient Substances 0.000 description 3
- 239000012071 phase Substances 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 239000002244 precipitate Substances 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 230000009257 reactivity Effects 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 108010071207 serylmethionine Proteins 0.000 description 3
- 108010005652 splenotritin Proteins 0.000 description 3
- 238000007619 statistical method Methods 0.000 description 3
- 239000000829 suppository Substances 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- 108010003137 tyrosyltyrosine Proteins 0.000 description 3
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 3
- LSLXWOCIIFUZCQ-SRVKXCTJSA-N (2S)-2-[[(2S)-2-[[(2S)-2-amino-3-methyl-1-oxobutyl]amino]-3-methyl-1-oxobutyl]amino]-3-methylbutanoic acid Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O LSLXWOCIIFUZCQ-SRVKXCTJSA-N 0.000 description 2
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 2
- RVLOMLVNNBWRSR-KNIFDHDWSA-N (2s)-2-aminopropanoic acid;(2s)-2,6-diaminohexanoic acid Chemical compound C[C@H](N)C(O)=O.NCCCC[C@H](N)C(O)=O RVLOMLVNNBWRSR-KNIFDHDWSA-N 0.000 description 2
- NHBKXEKEPDILRR-UHFFFAOYSA-N 2,3-bis(butanoylsulfanyl)propyl butanoate Chemical compound CCCC(=O)OCC(SC(=O)CCC)CSC(=O)CCC NHBKXEKEPDILRR-UHFFFAOYSA-N 0.000 description 2
- 108010036211 5-HT-moduline Proteins 0.000 description 2
- 101710166488 6 kDa early secretory antigenic target Proteins 0.000 description 2
- 108010044087 AS-I toxin Proteins 0.000 description 2
- 102100020925 Adenosylhomocysteinase Human genes 0.000 description 2
- 108020002202 Adenosylhomocysteinase Proteins 0.000 description 2
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- SSSROGPPPVTHLX-FXQIFTODSA-N Ala-Arg-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSROGPPPVTHLX-FXQIFTODSA-N 0.000 description 2
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 2
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 2
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 2
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 2
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 2
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 2
- 108010040956 Ala-Asp-Glu-Leu Proteins 0.000 description 2
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 2
- RXTBLQVXNIECFP-FXQIFTODSA-N Ala-Gln-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RXTBLQVXNIECFP-FXQIFTODSA-N 0.000 description 2
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 2
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- SIGTYDNEPYEXGK-ZANVPECISA-N Ala-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@@H](N)C)C(O)=O)=CNC2=C1 SIGTYDNEPYEXGK-ZANVPECISA-N 0.000 description 2
- SHKGHIFSEAGTNL-DLOVCJGASA-N Ala-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 SHKGHIFSEAGTNL-DLOVCJGASA-N 0.000 description 2
- ZSOICJZJSRWNHX-ACZMJKKPSA-N Ala-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@H](C)[NH3+] ZSOICJZJSRWNHX-ACZMJKKPSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 2
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 2
- XHNLCGXYBXNRIS-BJDJZHNGSA-N Ala-Lys-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XHNLCGXYBXNRIS-BJDJZHNGSA-N 0.000 description 2
- BLTRAARCJYVJKV-QEJZJMRPSA-N Ala-Lys-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](Cc1ccccc1)C(O)=O BLTRAARCJYVJKV-QEJZJMRPSA-N 0.000 description 2
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 2
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 2
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 2
- IPWKGIFRRBGCJO-IMJSIDKUSA-N Ala-Ser Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](CO)C([O-])=O IPWKGIFRRBGCJO-IMJSIDKUSA-N 0.000 description 2
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 2
- BUQICHWNXBIBOG-LMVFSUKVSA-N Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)N BUQICHWNXBIBOG-LMVFSUKVSA-N 0.000 description 2
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 2
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- DDPKBJZLAXLQGZ-KBIXCLLPSA-N Ala-Val-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DDPKBJZLAXLQGZ-KBIXCLLPSA-N 0.000 description 2
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 2
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 2
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- WVRUNFYJIHNFKD-WDSKDSINSA-N Arg-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N WVRUNFYJIHNFKD-WDSKDSINSA-N 0.000 description 2
- OOBVTWHLKYJFJH-FXQIFTODSA-N Arg-Ala-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O OOBVTWHLKYJFJH-FXQIFTODSA-N 0.000 description 2
- DFCIPNHFKOQAME-FXQIFTODSA-N Arg-Ala-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFCIPNHFKOQAME-FXQIFTODSA-N 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- XPSGESXVBSQZPL-SRVKXCTJSA-N Arg-Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XPSGESXVBSQZPL-SRVKXCTJSA-N 0.000 description 2
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 2
- DCGLNNVKIZXQOJ-FXQIFTODSA-N Arg-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N DCGLNNVKIZXQOJ-FXQIFTODSA-N 0.000 description 2
- IIABBYGHLYWVOS-FXQIFTODSA-N Arg-Asn-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O IIABBYGHLYWVOS-FXQIFTODSA-N 0.000 description 2
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 2
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 2
- XUUXCWCKKCZEAW-YFKPBYRVSA-N Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N XUUXCWCKKCZEAW-YFKPBYRVSA-N 0.000 description 2
- HQIZDMIGUJOSNI-IUCAKERBSA-N Arg-Gly-Arg Chemical compound N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O HQIZDMIGUJOSNI-IUCAKERBSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- QKSAZKCRVQYYGS-UWVGGRQHSA-N Arg-Gly-His Chemical compound N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O QKSAZKCRVQYYGS-UWVGGRQHSA-N 0.000 description 2
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 2
- QYLJIYOGHRGUIH-CIUDSAMLSA-N Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N QYLJIYOGHRGUIH-CIUDSAMLSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- DNUKXVMPARLPFN-XUXIUFHCSA-N Arg-Leu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DNUKXVMPARLPFN-XUXIUFHCSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- JQFZHHSQMKZLRU-IUCAKERBSA-N Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N JQFZHHSQMKZLRU-IUCAKERBSA-N 0.000 description 2
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 2
- NGTYEHIRESTSRX-UWVGGRQHSA-N Arg-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NGTYEHIRESTSRX-UWVGGRQHSA-N 0.000 description 2
- PQBHGSGQZSOLIR-RYUDHWBXSA-N Arg-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PQBHGSGQZSOLIR-RYUDHWBXSA-N 0.000 description 2
- YTMKMRSYXHBGER-IHRRRGAJSA-N Arg-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YTMKMRSYXHBGER-IHRRRGAJSA-N 0.000 description 2
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 2
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 2
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- AIFHRTPABBBHKU-RCWTZXSCSA-N Arg-Thr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AIFHRTPABBBHKU-RCWTZXSCSA-N 0.000 description 2
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 2
- POZKLUIXMHIULG-FDARSICLSA-N Arg-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCCN=C(N)N)N POZKLUIXMHIULG-FDARSICLSA-N 0.000 description 2
- NVPHRWNWTKYIST-BPNCWPANSA-N Arg-Tyr-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=C(O)C=C1 NVPHRWNWTKYIST-BPNCWPANSA-N 0.000 description 2
- DAQIJMOLTMGJLO-YUMQZZPRSA-N Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCNC(N)=N DAQIJMOLTMGJLO-YUMQZZPRSA-N 0.000 description 2
- KEZVOBAKAXHMOF-GUBZILKMSA-N Arg-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N KEZVOBAKAXHMOF-GUBZILKMSA-N 0.000 description 2
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 2
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 2
- JJGRJMKUOYXZRA-LPEHRKFASA-N Asn-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O JJGRJMKUOYXZRA-LPEHRKFASA-N 0.000 description 2
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 2
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 2
- UPALZCBCKAMGIY-PEFMBERDSA-N Asn-Gln-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UPALZCBCKAMGIY-PEFMBERDSA-N 0.000 description 2
- MSBDSTRUMZFSEU-PEFMBERDSA-N Asn-Glu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MSBDSTRUMZFSEU-PEFMBERDSA-N 0.000 description 2
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 2
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 2
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 2
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 2
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 2
- DVUFTQLHHHJEMK-IMJSIDKUSA-N Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O DVUFTQLHHHJEMK-IMJSIDKUSA-N 0.000 description 2
- KDFQZBWWPYQBEN-ZLUOBGJFSA-N Asp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N KDFQZBWWPYQBEN-ZLUOBGJFSA-N 0.000 description 2
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 2
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 2
- FRYULLIZUDQONW-IMJSIDKUSA-N Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(O)=O FRYULLIZUDQONW-IMJSIDKUSA-N 0.000 description 2
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 2
- IJHUZMGJRGNXIW-CIUDSAMLSA-N Asp-Glu-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IJHUZMGJRGNXIW-CIUDSAMLSA-N 0.000 description 2
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 2
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 2
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 2
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 2
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 2
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 2
- BSWHERGFUNMWGS-UHFFFAOYSA-N Asp-Ile Chemical compound CCC(C)C(C(O)=O)NC(=O)C(N)CC(O)=O BSWHERGFUNMWGS-UHFFFAOYSA-N 0.000 description 2
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 2
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 2
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 2
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 2
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 2
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 2
- ALMIMUZAWTUNIO-BZSNNMDCSA-N Asp-Tyr-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ALMIMUZAWTUNIO-BZSNNMDCSA-N 0.000 description 2
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- 108090001008 Avidin Proteins 0.000 description 2
- 244000063299 Bacillus subtilis Species 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 238000011752 CBA/J (JAX™ mouse strain) Methods 0.000 description 2
- 241000282994 Cervidae Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- DCXGXDGGXVZVMY-GHCJXIJMSA-N Cys-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CS DCXGXDGGXVZVMY-GHCJXIJMSA-N 0.000 description 2
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 2
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 2
- 229920002271 DEAE-Sepharose Polymers 0.000 description 2
- 108010041986 DNA Vaccines Proteins 0.000 description 2
- 229940021995 DNA vaccine Drugs 0.000 description 2
- 102000016911 Deoxyribonucleases Human genes 0.000 description 2
- 108010053770 Deoxyribonucleases Proteins 0.000 description 2
- 206010015150 Erythema Diseases 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 2
- JSYULGSPLTZDHM-NRPADANISA-N Gln-Ala-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O JSYULGSPLTZDHM-NRPADANISA-N 0.000 description 2
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 2
- SXIJQMBEVYWAQT-GUBZILKMSA-N Gln-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXIJQMBEVYWAQT-GUBZILKMSA-N 0.000 description 2
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 2
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- SWDSRANUCKNBLA-AVGNSLFASA-N Gln-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SWDSRANUCKNBLA-AVGNSLFASA-N 0.000 description 2
- DRNMNLKUUKKPIA-HTUGSXCWSA-N Gln-Phe-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)CCC(N)=O)C(O)=O DRNMNLKUUKKPIA-HTUGSXCWSA-N 0.000 description 2
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 2
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 2
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 2
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 2
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 2
- QPRZKNOOOBWXSU-CIUDSAMLSA-N Glu-Asp-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N QPRZKNOOOBWXSU-CIUDSAMLSA-N 0.000 description 2
- JVSBYEDSSRZQGV-GUBZILKMSA-N Glu-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O JVSBYEDSSRZQGV-GUBZILKMSA-N 0.000 description 2
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- LSPKYLAFTPBWIL-BYPYZUCNSA-N Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(O)=O LSPKYLAFTPBWIL-BYPYZUCNSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- VXQOONWNIWFOCS-HGNGGELXSA-N Glu-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N VXQOONWNIWFOCS-HGNGGELXSA-N 0.000 description 2
- SNFUTDLOCQQRQD-ZKWXMUAHSA-N Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SNFUTDLOCQQRQD-ZKWXMUAHSA-N 0.000 description 2
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 2
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 2
- YBAFDPFAUTYYRW-YUMQZZPRSA-N Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O YBAFDPFAUTYYRW-YUMQZZPRSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 2
- UMHRCVCZUPBBQW-GARJFASQSA-N Glu-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UMHRCVCZUPBBQW-GARJFASQSA-N 0.000 description 2
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 2
- WXONSNSSBYQGNN-AVGNSLFASA-N Glu-Ser-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O WXONSNSSBYQGNN-AVGNSLFASA-N 0.000 description 2
- BDISFWMLMNBTGP-NUMRIWBASA-N Glu-Thr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O BDISFWMLMNBTGP-NUMRIWBASA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 2
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 2
- SITLTJHOQZFJGG-XPUUQOCRSA-N Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SITLTJHOQZFJGG-XPUUQOCRSA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 2
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 2
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 2
- OCDLPQDYTJPWNG-YUMQZZPRSA-N Gly-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN OCDLPQDYTJPWNG-YUMQZZPRSA-N 0.000 description 2
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 2
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 2
- IEFJWDNGDZAYNZ-BYPYZUCNSA-N Gly-Glu Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(O)=O IEFJWDNGDZAYNZ-BYPYZUCNSA-N 0.000 description 2
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 2
- HFXJIZNEXNIZIJ-BQBZGAKWSA-N Gly-Glu-Gln Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFXJIZNEXNIZIJ-BQBZGAKWSA-N 0.000 description 2
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 2
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 2
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- ZWRDOVYMQAAISL-UWVGGRQHSA-N Gly-Met-Lys Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCCN ZWRDOVYMQAAISL-UWVGGRQHSA-N 0.000 description 2
- GAFKBWKVXNERFA-QWRGUYRKSA-N Gly-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 GAFKBWKVXNERFA-QWRGUYRKSA-N 0.000 description 2
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 2
- 108010009504 Gly-Phe-Leu-Gly Proteins 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 2
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 2
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 2
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 2
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 2
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 2
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 2
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 2
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 2
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 2
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 2
- FMRKUXFLLPKVPG-JYJNAYRXSA-N His-Gln-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O FMRKUXFLLPKVPG-JYJNAYRXSA-N 0.000 description 2
- TXLQHACKRLWYCM-DCAQKATOSA-N His-Glu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O TXLQHACKRLWYCM-DCAQKATOSA-N 0.000 description 2
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 2
- YADRBUZBKHHDAO-XPUUQOCRSA-N His-Gly-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C)C(O)=O YADRBUZBKHHDAO-XPUUQOCRSA-N 0.000 description 2
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 2
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 2
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 2
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 2
- YAJQKIBLYPFAET-NAZCDGGXSA-N His-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O YAJQKIBLYPFAET-NAZCDGGXSA-N 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 2
- CYHYBSGMHMHKOA-CIQUZCHMSA-N Ile-Ala-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CYHYBSGMHMHKOA-CIQUZCHMSA-N 0.000 description 2
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 2
- WKXVAXOSIPTXEC-HAFWLYHUSA-N Ile-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O WKXVAXOSIPTXEC-HAFWLYHUSA-N 0.000 description 2
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 2
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 2
- UCGDDTHMMVWVMV-FSPLSTOPSA-N Ile-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(O)=O UCGDDTHMMVWVMV-FSPLSTOPSA-N 0.000 description 2
- LPFBXFILACZHIB-LAEOZQHASA-N Ile-Gly-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)O)C(=O)O)N LPFBXFILACZHIB-LAEOZQHASA-N 0.000 description 2
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 2
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- UWBDLNOCIDGPQE-GUBZILKMSA-N Ile-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN UWBDLNOCIDGPQE-GUBZILKMSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 2
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 2
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 2
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 2
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 2
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 2
- BCISUQVFDGYZBO-QSFUFRPTSA-N Ile-Val-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O BCISUQVFDGYZBO-QSFUFRPTSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- 108010074328 Interferon-gamma Proteins 0.000 description 2
- 108010065805 Interleukin-12 Proteins 0.000 description 2
- 108010002350 Interleukin-2 Proteins 0.000 description 2
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 2
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 2
- DEFJQIDDEAULHB-IMJSIDKUSA-N L-alanyl-L-alanine Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(O)=O DEFJQIDDEAULHB-IMJSIDKUSA-N 0.000 description 2
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 2
- YUGVQABRIJXYNQ-UHFFFAOYSA-N Leu-Ala-Ala Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C)C(O)=O YUGVQABRIJXYNQ-UHFFFAOYSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 2
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 2
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 2
- PJYSOYLLTJKZHC-GUBZILKMSA-N Leu-Asp-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O PJYSOYLLTJKZHC-GUBZILKMSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- ZTLGVASZOIKNIX-DCAQKATOSA-N Leu-Gln-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZTLGVASZOIKNIX-DCAQKATOSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 2
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 2
- LESXFEZIFXFIQR-LURJTMIESA-N Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(O)=O LESXFEZIFXFIQR-LURJTMIESA-N 0.000 description 2
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 2
- BKTXKJMNTSMJDQ-AVGNSLFASA-N Leu-His-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BKTXKJMNTSMJDQ-AVGNSLFASA-N 0.000 description 2
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- LCPYQJIKPJDLLB-UWVGGRQHSA-N Leu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(C)C LCPYQJIKPJDLLB-UWVGGRQHSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- FOBUGKUBUJOWAD-IHPCNDPISA-N Leu-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 FOBUGKUBUJOWAD-IHPCNDPISA-N 0.000 description 2
- OTXBNHIUIHNGAO-UWVGGRQHSA-N Leu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN OTXBNHIUIHNGAO-UWVGGRQHSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 2
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 2
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 2
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 2
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 2
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 2
- 108010074338 Lymphokines Proteins 0.000 description 2
- 102000008072 Lymphokines Human genes 0.000 description 2
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 2
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 2
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 2
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 2
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 2
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 2
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 2
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 2
- OIQSIMFSVLLWBX-VOAKCMCISA-N Lys-Leu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OIQSIMFSVLLWBX-VOAKCMCISA-N 0.000 description 2
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 2
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 2
- MEQLGHAMAUPOSJ-DCAQKATOSA-N Lys-Ser-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O MEQLGHAMAUPOSJ-DCAQKATOSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 2
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 2
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 2
- 241000282346 Meles meles Species 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- IYXDSYWCVVXSKB-CIUDSAMLSA-N Met-Asn-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IYXDSYWCVVXSKB-CIUDSAMLSA-N 0.000 description 2
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 2
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 2
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 2
- UNPGTBHYKJOCCZ-DCAQKATOSA-N Met-Lys-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O UNPGTBHYKJOCCZ-DCAQKATOSA-N 0.000 description 2
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 2
- MIAZEQZXAFTCCG-UBHSHLNASA-N Met-Phe-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 MIAZEQZXAFTCCG-UBHSHLNASA-N 0.000 description 2
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 2
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 2
- 241000186366 Mycobacterium bovis Species 0.000 description 2
- 101100291912 Mycobacterium bovis (strain ATCC BAA-935 / AF2122/97) mpb64 gene Proteins 0.000 description 2
- 241000187480 Mycobacterium smegmatis Species 0.000 description 2
- 101000856404 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) Carboxylesterase Culp1 Proteins 0.000 description 2
- 101000941208 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) Probable carboxylesterase Culp2 Proteins 0.000 description 2
- 101100310308 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) sigH gene Proteins 0.000 description 2
- 101100369576 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) sseA gene Proteins 0.000 description 2
- 241000863422 Myxococcus xanthus Species 0.000 description 2
- SEQKRHFRPICQDD-UHFFFAOYSA-N N-tris(hydroxymethyl)methylglycine Chemical compound OCC(CO)(CO)[NH2+]CC([O-])=O SEQKRHFRPICQDD-UHFFFAOYSA-N 0.000 description 2
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 108010038807 Oligopeptides Proteins 0.000 description 2
- 102000015636 Oligopeptides Human genes 0.000 description 2
- 229910019142 PO4 Inorganic materials 0.000 description 2
- YRKFKTQRVBJYLT-CQDKDKBSSA-N Phe-Ala-His Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 YRKFKTQRVBJYLT-CQDKDKBSSA-N 0.000 description 2
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 2
- LGBVMDMZZFYSFW-HJWJTTGWSA-N Phe-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N LGBVMDMZZFYSFW-HJWJTTGWSA-N 0.000 description 2
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 2
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 2
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 2
- WIVCOAKLPICYGY-KKUMJFAQSA-N Phe-Asp-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N WIVCOAKLPICYGY-KKUMJFAQSA-N 0.000 description 2
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 2
- JXWLMUIXUXLIJR-QWRGUYRKSA-N Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JXWLMUIXUXLIJR-QWRGUYRKSA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- WFHRXJOZEXUKLV-IRXDYDNUSA-N Phe-Gly-Tyr Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 WFHRXJOZEXUKLV-IRXDYDNUSA-N 0.000 description 2
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 2
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 2
- IWZRODDWOSIXPZ-IRXDYDNUSA-N Phe-Phe-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 IWZRODDWOSIXPZ-IRXDYDNUSA-N 0.000 description 2
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 2
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 2
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 2
- QSKCKTUQPICLSO-AVGNSLFASA-N Pro-Arg-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O QSKCKTUQPICLSO-AVGNSLFASA-N 0.000 description 2
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 2
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 2
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 2
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 2
- FISHYTLIMUYTQY-GUBZILKMSA-N Pro-Gln-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 FISHYTLIMUYTQY-GUBZILKMSA-N 0.000 description 2
- CMOIIANLNNYUTP-SRVKXCTJSA-N Pro-Gln-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CMOIIANLNNYUTP-SRVKXCTJSA-N 0.000 description 2
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 2
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 2
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 2
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 2
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 2
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- QKDIHFHGHBYTKB-IHRRRGAJSA-N Pro-Ser-Phe Chemical compound N([C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 QKDIHFHGHBYTKB-IHRRRGAJSA-N 0.000 description 2
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 2
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 2
- UIUWGMRJTWHIJZ-ULQDDVLXSA-N Pro-Tyr-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O UIUWGMRJTWHIJZ-ULQDDVLXSA-N 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 2
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 2
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 2
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 2
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- YIUWWXVTYLANCJ-NAKRPEOUSA-N Ser-Ile-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YIUWWXVTYLANCJ-NAKRPEOUSA-N 0.000 description 2
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 2
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 2
- WBAXJMCUFIXCNI-WDSKDSINSA-N Ser-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WBAXJMCUFIXCNI-WDSKDSINSA-N 0.000 description 2
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 2
- XZKQVQKUZMAADP-IMJSIDKUSA-N Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(O)=O XZKQVQKUZMAADP-IMJSIDKUSA-N 0.000 description 2
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 2
- NERYDXBVARJIQS-JYBASQMISA-N Ser-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N)O NERYDXBVARJIQS-JYBASQMISA-N 0.000 description 2
- MFQMZDPAZRZAPV-NAKRPEOUSA-N Ser-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CO)N MFQMZDPAZRZAPV-NAKRPEOUSA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 2
- 206010040914 Skin reaction Diseases 0.000 description 2
- NHUHCSRWZMLRLA-UHFFFAOYSA-N Sulfisoxazole Chemical compound CC1=NOC(NS(=O)(=O)C=2C=CC(N)=CC=2)=C1C NHUHCSRWZMLRLA-UHFFFAOYSA-N 0.000 description 2
- 241000282898 Sus scrofa Species 0.000 description 2
- 230000005867 T cell response Effects 0.000 description 2
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 2
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- HYLXOQURIOCKIH-VQVTYTSYSA-N Thr-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N HYLXOQURIOCKIH-VQVTYTSYSA-N 0.000 description 2
- CAGTXGDOIFXLPC-KZVJFYERSA-N Thr-Arg-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CCCN=C(N)N CAGTXGDOIFXLPC-KZVJFYERSA-N 0.000 description 2
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- KRPKYGOFYUNIGM-XVSYOHENSA-N Thr-Asp-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O KRPKYGOFYUNIGM-XVSYOHENSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 2
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 2
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 2
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 2
- NCGUQWSJUKYCIT-SZZJOZGLSA-N Thr-His-Trp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NCGUQWSJUKYCIT-SZZJOZGLSA-N 0.000 description 2
- LUMXICQAOKVQOB-YWIQKCBGSA-N Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O LUMXICQAOKVQOB-YWIQKCBGSA-N 0.000 description 2
- XTCNBOBTROGWMW-RWRJDSDZSA-N Thr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XTCNBOBTROGWMW-RWRJDSDZSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- BDGBHYCAZJPLHX-HJGDQZAQSA-N Thr-Lys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BDGBHYCAZJPLHX-HJGDQZAQSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 2
- QOLYAJSZHIJCTO-VQVTYTSYSA-N Thr-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O QOLYAJSZHIJCTO-VQVTYTSYSA-N 0.000 description 2
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 2
- CKHWEVXPLJBEOZ-VQVTYTSYSA-N Thr-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])[C@@H](C)O CKHWEVXPLJBEOZ-VQVTYTSYSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 2
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- YEGMNOHLZNGOCG-UBHSHLNASA-N Trp-Asn-Asn Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YEGMNOHLZNGOCG-UBHSHLNASA-N 0.000 description 2
- WLBZWXXGSOLJBA-HOCLYGCPSA-N Trp-Gly-Lys Chemical compound C1=CC=C2C(C[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O)=CNC2=C1 WLBZWXXGSOLJBA-HOCLYGCPSA-N 0.000 description 2
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 2
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 2
- YTYHAYZPOARHAP-HOCLYGCPSA-N Trp-Lys-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)O)N YTYHAYZPOARHAP-HOCLYGCPSA-N 0.000 description 2
- CSOBBJWWODOYGW-ILWGZMRPSA-N Trp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O CSOBBJWWODOYGW-ILWGZMRPSA-N 0.000 description 2
- WMIUTJPFHMMUGY-ZFWWWQNUSA-N Trp-Pro-Gly Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)NCC(=O)O WMIUTJPFHMMUGY-ZFWWWQNUSA-N 0.000 description 2
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 2
- CUHBVKUVJIXRFK-DVXDUOKCSA-N Trp-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC=3C4=CC=CC=C4NC=3)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CUHBVKUVJIXRFK-DVXDUOKCSA-N 0.000 description 2
- 206010053613 Type IV hypersensitivity reaction Diseases 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 2
- KLGFILUOTCBNLJ-IHRRRGAJSA-N Tyr-Cys-Arg Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O KLGFILUOTCBNLJ-IHRRRGAJSA-N 0.000 description 2
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 2
- PDSLRCZINIDLMU-QWRGUYRKSA-N Tyr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PDSLRCZINIDLMU-QWRGUYRKSA-N 0.000 description 2
- JWGXUKHIKXZWNG-RYUDHWBXSA-N Tyr-Gly-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JWGXUKHIKXZWNG-RYUDHWBXSA-N 0.000 description 2
- CTDPLKMBVALCGN-JSGCOSHPSA-N Tyr-Gly-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O CTDPLKMBVALCGN-JSGCOSHPSA-N 0.000 description 2
- BSCBBPKDVOZICB-KKUMJFAQSA-N Tyr-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BSCBBPKDVOZICB-KKUMJFAQSA-N 0.000 description 2
- SINRIKQYQJRGDQ-MEYUZBJRSA-N Tyr-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SINRIKQYQJRGDQ-MEYUZBJRSA-N 0.000 description 2
- RGYCVIZZTUBSSG-JYJNAYRXSA-N Tyr-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O RGYCVIZZTUBSSG-JYJNAYRXSA-N 0.000 description 2
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 2
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 2
- UEOOXDLMQZBPFR-ZKWXMUAHSA-N Val-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N UEOOXDLMQZBPFR-ZKWXMUAHSA-N 0.000 description 2
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 2
- LABUITCFCAABSV-BPNCWPANSA-N Val-Ala-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-BPNCWPANSA-N 0.000 description 2
- LABUITCFCAABSV-UHFFFAOYSA-N Val-Ala-Tyr Natural products CC(C)C(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LABUITCFCAABSV-UHFFFAOYSA-N 0.000 description 2
- NMANTMWGQZASQN-QXEWZRGKSA-N Val-Arg-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N NMANTMWGQZASQN-QXEWZRGKSA-N 0.000 description 2
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 2
- CVUDMNSZAIZFAE-TUAOUCFPSA-N Val-Arg-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N CVUDMNSZAIZFAE-TUAOUCFPSA-N 0.000 description 2
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 2
- ZSZFTYVFQLUWBF-QXEWZRGKSA-N Val-Asp-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O)N ZSZFTYVFQLUWBF-QXEWZRGKSA-N 0.000 description 2
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- UEHRGZCNLSWGHK-DLOVCJGASA-N Val-Glu-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UEHRGZCNLSWGHK-DLOVCJGASA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 2
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 2
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 2
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 2
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 2
- GVRKWABULJAONN-VQVTYTSYSA-N Val-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVRKWABULJAONN-VQVTYTSYSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 2
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 2
- 108010056243 alanylalanine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- ZVDPYSVOZFINEE-BQBZGAKWSA-N alpha-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O ZVDPYSVOZFINEE-BQBZGAKWSA-N 0.000 description 2
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 2
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 2
- 239000001166 ammonium sulphate Substances 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 229940034014 antimycobacterial agent Drugs 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 108010094001 arginyl-tryptophyl-arginine Proteins 0.000 description 2
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000004202 carbamide Substances 0.000 description 2
- WOWHHFRSBJGXCM-UHFFFAOYSA-M cetyltrimethylammonium chloride Chemical compound [Cl-].CCCCCCCCCCCCCCCC[N+](C)(C)C WOWHHFRSBJGXCM-UHFFFAOYSA-M 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 239000003599 detergent Substances 0.000 description 2
- 239000013024 dilution buffer Substances 0.000 description 2
- 201000010099 disease Diseases 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 239000000839 emulsion Substances 0.000 description 2
- 231100000321 erythema Toxicity 0.000 description 2
- 101150069551 esxH gene Proteins 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 2
- 108010010096 glycyl-glycyl-tyrosine Proteins 0.000 description 2
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 230000005847 immunogenicity Effects 0.000 description 2
- 230000028709 inflammatory response Effects 0.000 description 2
- 239000002198 insoluble material Substances 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 229960003350 isoniazid Drugs 0.000 description 2
- QRXWMOHMRWLFEY-UHFFFAOYSA-N isoniazide Chemical compound NNC(=O)C1=CC=NC=C1 QRXWMOHMRWLFEY-UHFFFAOYSA-N 0.000 description 2
- 108010053037 kyotorphin Proteins 0.000 description 2
- 239000008101 lactose Substances 0.000 description 2
- 231100000636 lethal dose Toxicity 0.000 description 2
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 2
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010091798 leucylleucine Proteins 0.000 description 2
- 108010012058 leucyltyrosine Proteins 0.000 description 2
- 229920006008 lipopolysaccharide Polymers 0.000 description 2
- 239000007791 liquid phase Substances 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 230000002934 lysing effect Effects 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 230000000813 microbial effect Effects 0.000 description 2
- 101150014428 mpt64 gene Proteins 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 235000021317 phosphate Nutrition 0.000 description 2
- 108010025488 pinealon Proteins 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 239000012460 protein solution Substances 0.000 description 2
- 230000008707 rearrangement Effects 0.000 description 2
- 238000010188 recombinant method Methods 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 229910052709 silver Inorganic materials 0.000 description 2
- 239000004332 silver Substances 0.000 description 2
- 230000035483 skin reaction Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 210000004989 spleen cell Anatomy 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 2
- GETQZCLCWQTVFV-UHFFFAOYSA-N trimethylamine Chemical compound CN(C)C GETQZCLCWQTVFV-UHFFFAOYSA-N 0.000 description 2
- 108010080629 tryptophan-leucine Proteins 0.000 description 2
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 2
- 108010084932 tryptophyl-proline Proteins 0.000 description 2
- 230000005951 type IV hypersensitivity Effects 0.000 description 2
- 108010029599 tyrosyl-glutamyl-tryptophan Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010021199 valyl-valyl-valine Proteins 0.000 description 2
- 210000003462 vein Anatomy 0.000 description 2
- 230000001018 virulence Effects 0.000 description 2
- 108010027345 wheylin-1 peptide Proteins 0.000 description 2
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 1
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 1
- VWWKKDNCCLAGRM-GVXVVHGQSA-N (2s)-2-[[2-[[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]propanoyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VWWKKDNCCLAGRM-GVXVVHGQSA-N 0.000 description 1
- JFOWDKWFHZIMTR-RUCXOUQFSA-N (2s)-2-aminopentanedioic acid;(2s)-2,5-diamino-5-oxopentanoic acid Chemical compound OC(=O)[C@@H](N)CCC(N)=O.OC(=O)[C@@H](N)CCC(O)=O JFOWDKWFHZIMTR-RUCXOUQFSA-N 0.000 description 1
- ARNGIGOPGOEJCH-KKUMJFAQSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1s)-1-carboxy-2-phenylethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ARNGIGOPGOEJCH-KKUMJFAQSA-N 0.000 description 1
- QRXMUCSWCMTJGU-UHFFFAOYSA-L (5-bromo-4-chloro-1h-indol-3-yl) phosphate Chemical compound C1=C(Br)C(Cl)=C2C(OP([O-])(=O)[O-])=CNC2=C1 QRXMUCSWCMTJGU-UHFFFAOYSA-L 0.000 description 1
- QZCJOXAIQXPLNS-UHFFFAOYSA-N 1,1,2,2,3,3,4,4,4a,5,5,6,6,7,7,8,8,8a-octadecafluoronaphthalene 4-(2-aminoethyl)benzene-1,2-diol Chemical compound NCCc1ccc(O)c(O)c1.FC1(F)C(F)(F)C(F)(F)C2(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C2(F)C1(F)F QZCJOXAIQXPLNS-UHFFFAOYSA-N 0.000 description 1
- WURBVZBTWMNKQT-UHFFFAOYSA-N 1-(4-chlorophenoxy)-3,3-dimethyl-1-(1,2,4-triazol-1-yl)butan-2-one Chemical compound C1=NC=NN1C(C(=O)C(C)(C)C)OC1=CC=C(Cl)C=C1 WURBVZBTWMNKQT-UHFFFAOYSA-N 0.000 description 1
- MIJDSYMOBYNHOT-UHFFFAOYSA-N 2-(ethylamino)ethanol Chemical compound CCNCCO MIJDSYMOBYNHOT-UHFFFAOYSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- WEZDRVHTDXTVLT-GJZGRUSLSA-N 2-[[(2s)-2-[[(2s)-2-[(2-aminoacetyl)amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]acetic acid Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 WEZDRVHTDXTVLT-GJZGRUSLSA-N 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- RCLDHCIEAUJSBD-UHFFFAOYSA-N 6-(6-sulfonaphthalen-2-yl)oxynaphthalene-2-sulfonic acid Chemical compound C1=C(S(O)(=O)=O)C=CC2=CC(OC3=CC4=CC=C(C=C4C=C3)S(=O)(=O)O)=CC=C21 RCLDHCIEAUJSBD-UHFFFAOYSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 101710146995 Acyl carrier protein Proteins 0.000 description 1
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 1
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 1
- DWINFPQUSSHSFS-UVBJJODRSA-N Ala-Arg-Trp Chemical compound N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C12)C(=O)O DWINFPQUSSHSFS-UVBJJODRSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- LZRNYBIJOSKKRJ-XVYDVKMFSA-N Ala-Asp-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N LZRNYBIJOSKKRJ-XVYDVKMFSA-N 0.000 description 1
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 1
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- NFDVJAKFMXHJEQ-HERUPUMHSA-N Ala-Asp-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N NFDVJAKFMXHJEQ-HERUPUMHSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- HJCMDXDYPOUFDY-WHFBIAKZSA-N Ala-Gln Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O HJCMDXDYPOUFDY-WHFBIAKZSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- CXISPYVYMQWFLE-VKHMYHEASA-N Ala-Gly Chemical compound C[C@H]([NH3+])C(=O)NCC([O-])=O CXISPYVYMQWFLE-VKHMYHEASA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- JDIQCVUDDFENPU-ZKWXMUAHSA-N Ala-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CNC=N1 JDIQCVUDDFENPU-ZKWXMUAHSA-N 0.000 description 1
- IVKWMMGFLAMMKJ-XVYDVKMFSA-N Ala-His-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N IVKWMMGFLAMMKJ-XVYDVKMFSA-N 0.000 description 1
- ANGAOPNEPIDLPO-XVYDVKMFSA-N Ala-His-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N ANGAOPNEPIDLPO-XVYDVKMFSA-N 0.000 description 1
- OKEWAFFWMHBGPT-XPUUQOCRSA-N Ala-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 OKEWAFFWMHBGPT-XPUUQOCRSA-N 0.000 description 1
- JEPNLGMEZMCFEX-QSFUFRPTSA-N Ala-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N JEPNLGMEZMCFEX-QSFUFRPTSA-N 0.000 description 1
- AAXVGJXZKHQQHD-LSJOCFKGSA-N Ala-His-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N AAXVGJXZKHQQHD-LSJOCFKGSA-N 0.000 description 1
- LBFXVAXPDOBRKU-LKTVYLICSA-N Ala-His-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LBFXVAXPDOBRKU-LKTVYLICSA-N 0.000 description 1
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 1
- CFPQUJZTLUQUTJ-HTFCKZLJSA-N Ala-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@H](C)N CFPQUJZTLUQUTJ-HTFCKZLJSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- UJJUHXAJSRHWFZ-DCAQKATOSA-N Ala-Leu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O UJJUHXAJSRHWFZ-DCAQKATOSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- SUHLZMHFRALVSY-YUMQZZPRSA-N Ala-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O SUHLZMHFRALVSY-YUMQZZPRSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- VEAPAYQQLSEKEM-GUBZILKMSA-N Ala-Met-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(O)=O VEAPAYQQLSEKEM-GUBZILKMSA-N 0.000 description 1
- OMNVYXHOSHNURL-WPRPVWTQSA-N Ala-Phe Chemical compound C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OMNVYXHOSHNURL-WPRPVWTQSA-N 0.000 description 1
- PEIBBAXIKUAYGN-UBHSHLNASA-N Ala-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 PEIBBAXIKUAYGN-UBHSHLNASA-N 0.000 description 1
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- WPWUFUBLGADILS-WDSKDSINSA-N Ala-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O WPWUFUBLGADILS-WDSKDSINSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- LTTLSZVJTDSACD-OWLDWWDNSA-N Ala-Thr-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O LTTLSZVJTDSACD-OWLDWWDNSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 1
- TVUFMYKTYXTRPY-HERUPUMHSA-N Ala-Trp-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O TVUFMYKTYXTRPY-HERUPUMHSA-N 0.000 description 1
- XPBVBZPVNFIHOA-UVBJJODRSA-N Ala-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 XPBVBZPVNFIHOA-UVBJJODRSA-N 0.000 description 1
- XAXMJQUMRJAFCH-CQDKDKBSSA-N Ala-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 XAXMJQUMRJAFCH-CQDKDKBSSA-N 0.000 description 1
- LIWMQSWFLXEGMA-WDSKDSINSA-N Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)N LIWMQSWFLXEGMA-WDSKDSINSA-N 0.000 description 1
- SOTXLXCVCZAKFI-FXQIFTODSA-N Ala-Val-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O SOTXLXCVCZAKFI-FXQIFTODSA-N 0.000 description 1
- XSLGWYYNOSUMRM-ZKWXMUAHSA-N Ala-Val-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XSLGWYYNOSUMRM-ZKWXMUAHSA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- ZDILXFDENZVOTL-BPNCWPANSA-N Ala-Val-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDILXFDENZVOTL-BPNCWPANSA-N 0.000 description 1
- 108010088751 Albumins Proteins 0.000 description 1
- 102000009027 Albumins Human genes 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- QGZKDVFQNNGYKY-UHFFFAOYSA-O Ammonium Chemical compound [NH4+] QGZKDVFQNNGYKY-UHFFFAOYSA-O 0.000 description 1
- 101000634115 Arabidopsis thaliana RNA polymerase sigma factor sigE, chloroplastic/mitochondrial Proteins 0.000 description 1
- 241000205042 Archaeoglobus fulgidus Species 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- YYOVLDPHIJAOSY-DCAQKATOSA-N Arg-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N YYOVLDPHIJAOSY-DCAQKATOSA-N 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- XEPSCVXTCUUHDT-AVGNSLFASA-N Arg-Arg-Leu Natural products CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCN=C(N)N XEPSCVXTCUUHDT-AVGNSLFASA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 1
- NONSEUUPKITYQT-BQBZGAKWSA-N Arg-Asn-Gly Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N)CN=C(N)N NONSEUUPKITYQT-BQBZGAKWSA-N 0.000 description 1
- KWTVWJPNHAOREN-IHRRRGAJSA-N Arg-Asn-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KWTVWJPNHAOREN-IHRRRGAJSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- KBBKCNHWCDJPGN-GUBZILKMSA-N Arg-Gln-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KBBKCNHWCDJPGN-GUBZILKMSA-N 0.000 description 1
- OBFTYSPXDRROQO-SRVKXCTJSA-N Arg-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCN=C(N)N OBFTYSPXDRROQO-SRVKXCTJSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 1
- OHYQKYUTLIPFOX-ZPFDUUQYSA-N Arg-Glu-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OHYQKYUTLIPFOX-ZPFDUUQYSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- JAYIQMNQDMOBFY-KKUMJFAQSA-N Arg-Glu-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JAYIQMNQDMOBFY-KKUMJFAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 1
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- FLYANDHDFRGGTM-PYJNHQTQSA-N Arg-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FLYANDHDFRGGTM-PYJNHQTQSA-N 0.000 description 1
- FFEUXEAKYRCACT-PEDHHIEDSA-N Arg-Ile-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)CC)C(O)=O FFEUXEAKYRCACT-PEDHHIEDSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- FSNVAJOPUDVQAR-AVGNSLFASA-N Arg-Lys-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FSNVAJOPUDVQAR-AVGNSLFASA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 1
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 1
- IJYZHIOOBGIINM-WDSKDSINSA-N Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N IJYZHIOOBGIINM-WDSKDSINSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- QUBKBPZGMZWOKQ-SZMVWBNQSA-N Arg-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 QUBKBPZGMZWOKQ-SZMVWBNQSA-N 0.000 description 1
- JBQORRNSZGTLCV-WDSOQIARSA-N Arg-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 JBQORRNSZGTLCV-WDSOQIARSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- XTWSWDJMIKUJDQ-RYUDHWBXSA-N Arg-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XTWSWDJMIKUJDQ-RYUDHWBXSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 1
- QEYJFBMTSMLPKZ-ZKWXMUAHSA-N Asn-Ala-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O QEYJFBMTSMLPKZ-ZKWXMUAHSA-N 0.000 description 1
- NPDLYUOYAGBHFB-WDSKDSINSA-N Asn-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NPDLYUOYAGBHFB-WDSKDSINSA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- JEPNYDRDYNSFIU-QXEWZRGKSA-N Asn-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(N)=O)C(O)=O JEPNYDRDYNSFIU-QXEWZRGKSA-N 0.000 description 1
- BHQQRVARKXWXPP-ACZMJKKPSA-N Asn-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BHQQRVARKXWXPP-ACZMJKKPSA-N 0.000 description 1
- QISZHYWZHJRDAO-CIUDSAMLSA-N Asn-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N QISZHYWZHJRDAO-CIUDSAMLSA-N 0.000 description 1
- ZDOQDYFZNGASEY-BIIVOSGPSA-N Asn-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O ZDOQDYFZNGASEY-BIIVOSGPSA-N 0.000 description 1
- QCWJKJLNCFEVPQ-WHFBIAKZSA-N Asn-Gln Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O QCWJKJLNCFEVPQ-WHFBIAKZSA-N 0.000 description 1
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- HXWUJJADFMXNKA-BQBZGAKWSA-N Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O HXWUJJADFMXNKA-BQBZGAKWSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- GADKFYNESXNRLC-WDSKDSINSA-N Asn-Pro Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GADKFYNESXNRLC-WDSKDSINSA-N 0.000 description 1
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 1
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- SONUFGRSSMFHFN-IMJSIDKUSA-N Asn-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O SONUFGRSSMFHFN-IMJSIDKUSA-N 0.000 description 1
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- SKQTXVZTCGSRJS-SRVKXCTJSA-N Asn-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O SKQTXVZTCGSRJS-SRVKXCTJSA-N 0.000 description 1
- KWBQPGIYEZKDEG-FSPLSTOPSA-N Asn-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(N)=O KWBQPGIYEZKDEG-FSPLSTOPSA-N 0.000 description 1
- CBWCQCANJSGUOH-ZKWXMUAHSA-N Asn-Val-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O CBWCQCANJSGUOH-ZKWXMUAHSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- CBHVAFXKOYAHOY-NHCYSSNCSA-N Asn-Val-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O CBHVAFXKOYAHOY-NHCYSSNCSA-N 0.000 description 1
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- PSZNHSNIGMJYOZ-WDSKDSINSA-N Asp-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PSZNHSNIGMJYOZ-WDSKDSINSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 1
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- VGRHZPNRCLAHQA-IMJSIDKUSA-N Asp-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(O)=O VGRHZPNRCLAHQA-IMJSIDKUSA-N 0.000 description 1
- CASGONAXMZPHCK-FXQIFTODSA-N Asp-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N CASGONAXMZPHCK-FXQIFTODSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- FRSGNOZCTWDVFZ-ACZMJKKPSA-N Asp-Asp-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O FRSGNOZCTWDVFZ-ACZMJKKPSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- PXLNPFOJZQMXAT-BYULHYEWSA-N Asp-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O PXLNPFOJZQMXAT-BYULHYEWSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 1
- CSEJMKNZDCJYGJ-XHNCKOQMSA-N Asp-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O CSEJMKNZDCJYGJ-XHNCKOQMSA-N 0.000 description 1
- DXQOQMCLWWADMU-ACZMJKKPSA-N Asp-Gln-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DXQOQMCLWWADMU-ACZMJKKPSA-N 0.000 description 1
- CKAJHWFHHFSCDT-WHFBIAKZSA-N Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O CKAJHWFHHFSCDT-WHFBIAKZSA-N 0.000 description 1
- HSWYMWGDMPLTTH-FXQIFTODSA-N Asp-Glu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HSWYMWGDMPLTTH-FXQIFTODSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 1
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- SEMWSADZTMJELF-BYULHYEWSA-N Asp-Ile-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O SEMWSADZTMJELF-BYULHYEWSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- XLILXFRAKOYEJX-GUBZILKMSA-N Asp-Leu-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLILXFRAKOYEJX-GUBZILKMSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- JTRDJYIZIKCIRC-AJNGGQMLSA-N Asp-Leu-Leu-Gln Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JTRDJYIZIKCIRC-AJNGGQMLSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- WNGZKSVJFDZICU-XIRDDKMYSA-N Asp-Leu-Trp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N WNGZKSVJFDZICU-XIRDDKMYSA-N 0.000 description 1
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 1
- LBOVBQONZJRWPV-YUMQZZPRSA-N Asp-Lys-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LBOVBQONZJRWPV-YUMQZZPRSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- ZXRQJQCXPSMNMR-XIRDDKMYSA-N Asp-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N ZXRQJQCXPSMNMR-XIRDDKMYSA-N 0.000 description 1
- DYDKXJWQCIVTMR-WDSKDSINSA-N Asp-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O DYDKXJWQCIVTMR-WDSKDSINSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- GXHDGYOXPNQCKM-XVSYOHENSA-N Asp-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GXHDGYOXPNQCKM-XVSYOHENSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- ZARXTZFGQZBYFO-JQWIXIFHSA-N Asp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(O)=O)=CNC2=C1 ZARXTZFGQZBYFO-JQWIXIFHSA-N 0.000 description 1
- LLRJPYJQNBMOOO-QEJZJMRPSA-N Asp-Trp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N LLRJPYJQNBMOOO-QEJZJMRPSA-N 0.000 description 1
- IHZFGJLKDYINPV-XIRDDKMYSA-N Asp-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(O)=O)N)C(O)=O)C1=CN=CN1 IHZFGJLKDYINPV-XIRDDKMYSA-N 0.000 description 1
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- OTKUAVXGMREHRX-CFMVVWHZSA-N Asp-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 OTKUAVXGMREHRX-CFMVVWHZSA-N 0.000 description 1
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 1
- 208000031504 Asymptomatic Infections Diseases 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000282461 Canis lupus Species 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 241000905957 Channa melasoma Species 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 208000003322 Coinfection Diseases 0.000 description 1
- 241000037164 Collema parvum Species 0.000 description 1
- 238000011537 Coomassie blue staining Methods 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- SMEYEQDCCBHTEF-FXQIFTODSA-N Cys-Pro-Ala Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O SMEYEQDCCBHTEF-FXQIFTODSA-N 0.000 description 1
- YXQDRIRSAHTJKM-IMJSIDKUSA-N Cys-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(O)=O YXQDRIRSAHTJKM-IMJSIDKUSA-N 0.000 description 1
- GGRDJANMZPGMNS-CIUDSAMLSA-N Cys-Ser-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O GGRDJANMZPGMNS-CIUDSAMLSA-N 0.000 description 1
- DSTWKJOBKSMVCV-UWVGGRQHSA-N Cys-Tyr Chemical compound SC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DSTWKJOBKSMVCV-UWVGGRQHSA-N 0.000 description 1
- ZOMMHASZJQRLFS-IHRRRGAJSA-N Cys-Tyr-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CS)N ZOMMHASZJQRLFS-IHRRRGAJSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 238000011763 DBA/1J (JAX™ mouse strain) Methods 0.000 description 1
- 238000011767 DBA/2J (JAX™ mouse strain) Methods 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108020005199 Dehydrogenases Proteins 0.000 description 1
- 102100030695 Electron transfer flavoprotein subunit alpha, mitochondrial Human genes 0.000 description 1
- 241000196324 Embryophyta Species 0.000 description 1
- 102000011426 Enoyl-CoA hydratase Human genes 0.000 description 1
- 108010023922 Enoyl-CoA hydratase Proteins 0.000 description 1
- 241000588921 Enterobacteriaceae Species 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241001522878 Escherichia coli B Species 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241001302584 Escherichia coli str. K-12 substr. W3110 Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- POPZASPRNPGIPZ-UHFFFAOYSA-N Gln Gln Ala Pro Chemical compound NC(=O)CCC(N)C(=O)NC(CCC(N)=O)C(=O)NC(C)C(=O)N1CCCC1C(O)=O POPZASPRNPGIPZ-UHFFFAOYSA-N 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- OVQXQLWWJSNYFV-XEGUGMAKSA-N Gln-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(N)=O)C)C(O)=O)=CNC2=C1 OVQXQLWWJSNYFV-XEGUGMAKSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 1
- ULXXDWZMMSQBDC-ACZMJKKPSA-N Gln-Asp-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ULXXDWZMMSQBDC-ACZMJKKPSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 1
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 1
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 1
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 1
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- CLPQUWHBWXFJOX-BQBZGAKWSA-N Gln-Gly-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O CLPQUWHBWXFJOX-BQBZGAKWSA-N 0.000 description 1
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 1
- JXFLPKSDLDEOQK-JHEQGTHGSA-N Gln-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O JXFLPKSDLDEOQK-JHEQGTHGSA-N 0.000 description 1
- DWDBJWAXPXXYLP-SRVKXCTJSA-N Gln-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DWDBJWAXPXXYLP-SRVKXCTJSA-N 0.000 description 1
- XITLYYAIPBBHPX-ZKWXMUAHSA-N Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(N)=O XITLYYAIPBBHPX-ZKWXMUAHSA-N 0.000 description 1
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 1
- OAOOXBSVCJEIFY-QAETUUGQSA-N Gln-Leu-Leu-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O OAOOXBSVCJEIFY-QAETUUGQSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- OKQLXOYFUPVEHI-CIUDSAMLSA-N Gln-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N OKQLXOYFUPVEHI-CIUDSAMLSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- SGVGIVDZLSHSEN-RYUDHWBXSA-N Gln-Tyr-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O SGVGIVDZLSHSEN-RYUDHWBXSA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- MPZWMIIOPAPAKE-BQBZGAKWSA-N Glu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N MPZWMIIOPAPAKE-BQBZGAKWSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 1
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- SBYVDRJAXWSXQL-AVGNSLFASA-N Glu-Asn-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SBYVDRJAXWSXQL-AVGNSLFASA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- XXCDTYBVGMPIOA-FXQIFTODSA-N Glu-Asp-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XXCDTYBVGMPIOA-FXQIFTODSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 1
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 1
- VOORMNJKNBGYGK-YUMQZZPRSA-N Glu-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N VOORMNJKNBGYGK-YUMQZZPRSA-N 0.000 description 1
- ZWQVYZXPYSYPJD-RYUDHWBXSA-N Glu-Gly-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZWQVYZXPYSYPJD-RYUDHWBXSA-N 0.000 description 1
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 1
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 1
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- PDLGMYVCPJOYAR-DKIMLUQUSA-N Glu-Leu-Phe-Ala Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 PDLGMYVCPJOYAR-DKIMLUQUSA-N 0.000 description 1
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- ZGEJRLJEAMPEDV-SRVKXCTJSA-N Glu-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)O)N ZGEJRLJEAMPEDV-SRVKXCTJSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- SXGAGTVDWKQYCX-BQBZGAKWSA-N Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O SXGAGTVDWKQYCX-BQBZGAKWSA-N 0.000 description 1
- XMBSYZWANAQXEV-QWRGUYRKSA-N Glu-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-QWRGUYRKSA-N 0.000 description 1
- KJBGAZSLZAQDPV-KKUMJFAQSA-N Glu-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N KJBGAZSLZAQDPV-KKUMJFAQSA-N 0.000 description 1
- LHIPZASLKPYDPI-AVGNSLFASA-N Glu-Phe-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LHIPZASLKPYDPI-AVGNSLFASA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- YBTCBQBIJKGSJP-BQBZGAKWSA-N Glu-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O YBTCBQBIJKGSJP-BQBZGAKWSA-N 0.000 description 1
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 1
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- ZAPFAWQHBOHWLL-GUBZILKMSA-N Glu-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N ZAPFAWQHBOHWLL-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- ZGXGVBYEJGVJMV-HJGDQZAQSA-N Glu-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O ZGXGVBYEJGVJMV-HJGDQZAQSA-N 0.000 description 1
- CQGBSALYGOXQPE-HTUGSXCWSA-N Glu-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O CQGBSALYGOXQPE-HTUGSXCWSA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- OLTHVCNYJAALPL-BHYGNILZSA-N Glu-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OLTHVCNYJAALPL-BHYGNILZSA-N 0.000 description 1
- YSWHPLCDIMUKFE-QWRGUYRKSA-N Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YSWHPLCDIMUKFE-QWRGUYRKSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 1
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- BRFJMRSRMOMIMU-WHFBIAKZSA-N Gly-Ala-Asn Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O BRFJMRSRMOMIMU-WHFBIAKZSA-N 0.000 description 1
- GQGAFTPXAPKSCF-WHFBIAKZSA-N Gly-Ala-Cys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O GQGAFTPXAPKSCF-WHFBIAKZSA-N 0.000 description 1
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- YMUFWNJHVPQNQD-ZKWXMUAHSA-N Gly-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN YMUFWNJHVPQNQD-ZKWXMUAHSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 1
- LERGJIVJIIODPZ-ZANVPECISA-N Gly-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)C)C(O)=O)=CNC2=C1 LERGJIVJIIODPZ-ZANVPECISA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- KRRMJKMGWWXWDW-STQMWFEESA-N Gly-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KRRMJKMGWWXWDW-STQMWFEESA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- SCCPDJAQCXWPTF-VKHMYHEASA-N Gly-Asp Chemical compound NCC(=O)N[C@H](C(O)=O)CC(O)=O SCCPDJAQCXWPTF-VKHMYHEASA-N 0.000 description 1
- LLXVQPKEQQCISF-YUMQZZPRSA-N Gly-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN LLXVQPKEQQCISF-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 1
- MFBYPDKTAJXHNI-VKHMYHEASA-N Gly-Cys Chemical compound [NH3+]CC(=O)N[C@@H](CS)C([O-])=O MFBYPDKTAJXHNI-VKHMYHEASA-N 0.000 description 1
- XXGQRGQPGFYECI-WDSKDSINSA-N Gly-Cys-Glu Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(O)=O XXGQRGQPGFYECI-WDSKDSINSA-N 0.000 description 1
- PNMUAGGSDZXTHX-BYPYZUCNSA-N Gly-Gln Chemical compound NCC(=O)N[C@H](C(O)=O)CCC(N)=O PNMUAGGSDZXTHX-BYPYZUCNSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- LHRXAHLCRMQBGJ-RYUDHWBXSA-N Gly-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN LHRXAHLCRMQBGJ-RYUDHWBXSA-N 0.000 description 1
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 1
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 1
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- TVDHVLGFJSHPAX-UWVGGRQHSA-N Gly-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 TVDHVLGFJSHPAX-UWVGGRQHSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- KGVHCTWYMPWEGN-FSPLSTOPSA-N Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CN KGVHCTWYMPWEGN-FSPLSTOPSA-N 0.000 description 1
- SWQALSGKVLYKDT-ZKWXMUAHSA-N Gly-Ile-Ala Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SWQALSGKVLYKDT-ZKWXMUAHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 1
- DKEXFJVMVGETOO-LURJTMIESA-N Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CN DKEXFJVMVGETOO-LURJTMIESA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- IKAIKUBBJHFNBZ-LURJTMIESA-N Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CN IKAIKUBBJHFNBZ-LURJTMIESA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- QGDOOCIPHSSADO-STQMWFEESA-N Gly-Met-Phe Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGDOOCIPHSSADO-STQMWFEESA-N 0.000 description 1
- JBCLFWXMTIKCCB-VIFPVBQESA-N Gly-Phe Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-VIFPVBQESA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 1
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 1
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- BCCRXDTUTZHDEU-VKHMYHEASA-N Gly-Ser Chemical compound NCC(=O)N[C@@H](CO)C(O)=O BCCRXDTUTZHDEU-VKHMYHEASA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- WSLHFAFASQFMSK-SFTDATJTSA-N Gly-Trp-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)CN)C(O)=O)=CNC2=C1 WSLHFAFASQFMSK-SFTDATJTSA-N 0.000 description 1
- XBGGUPMXALFZOT-VIFPVBQESA-N Gly-Tyr Chemical compound NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-VIFPVBQESA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- DUAWRXXTOQOECJ-JSGCOSHPSA-N Gly-Tyr-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O DUAWRXXTOQOECJ-JSGCOSHPSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 1
- VPZXBVLAVMBEQI-VKHMYHEASA-N Glycyl-alanine Chemical compound OC(=O)[C@H](C)NC(=O)CN VPZXBVLAVMBEQI-VKHMYHEASA-N 0.000 description 1
- 206010018691 Granuloma Diseases 0.000 description 1
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- XINDHUAGVGCNSF-QSFUFRPTSA-N His-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XINDHUAGVGCNSF-QSFUFRPTSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- WYWBYSPRCFADBM-GARJFASQSA-N His-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O WYWBYSPRCFADBM-GARJFASQSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 1
- LYCVKHSJGDMDLM-LURJTMIESA-N His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 LYCVKHSJGDMDLM-LURJTMIESA-N 0.000 description 1
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 1
- FDQYIRHBVVUTJF-ZETCQYMHSA-N His-Gly-Gly Chemical compound [O-]C(=O)CNC(=O)CNC(=O)[C@@H]([NH3+])CC1=CN=CN1 FDQYIRHBVVUTJF-ZETCQYMHSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- IDXZDKMBEXLFMB-HGNGGELXSA-N His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 IDXZDKMBEXLFMB-HGNGGELXSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 1
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 1
- XKIYNCLILDLGRS-QWRGUYRKSA-N His-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CN=CN1 XKIYNCLILDLGRS-QWRGUYRKSA-N 0.000 description 1
- AYIZHKDZYOSOGY-IUCAKERBSA-N His-Met Chemical compound CSCC[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 AYIZHKDZYOSOGY-IUCAKERBSA-N 0.000 description 1
- WSEITRHJRVDTRX-QTKMDUPCSA-N His-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N)O WSEITRHJRVDTRX-QTKMDUPCSA-N 0.000 description 1
- RLAOTFTXBFQJDV-KKUMJFAQSA-N His-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CN=CN1 RLAOTFTXBFQJDV-KKUMJFAQSA-N 0.000 description 1
- SOYCWSKCUVDLMC-AVGNSLFASA-N His-Pro-Arg Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CCCNC(=N)N)C(=O)O SOYCWSKCUVDLMC-AVGNSLFASA-N 0.000 description 1
- PYNPBMCLAKTHJL-SRVKXCTJSA-N His-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O PYNPBMCLAKTHJL-SRVKXCTJSA-N 0.000 description 1
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 1
- KRBMQYPTDYSENE-BQBZGAKWSA-N His-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CNC=N1 KRBMQYPTDYSENE-BQBZGAKWSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- 101001010541 Homo sapiens Electron transfer flavoprotein subunit alpha, mitochondrial Proteins 0.000 description 1
- 101000802660 Homo sapiens Histo-blood group ABO system transferase Proteins 0.000 description 1
- RCFDOSNHHZGBOY-ACZMJKKPSA-N Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(O)=O RCFDOSNHHZGBOY-ACZMJKKPSA-N 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- DPTBVFUDCPINIP-JURCDPSOSA-N Ile-Ala-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DPTBVFUDCPINIP-JURCDPSOSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 1
- HLYBGMZJVDHJEO-CYDGBPFRSA-N Ile-Arg-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N HLYBGMZJVDHJEO-CYDGBPFRSA-N 0.000 description 1
- SACHLUOUHCVIKI-GMOBBJLQSA-N Ile-Arg-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SACHLUOUHCVIKI-GMOBBJLQSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- HZYHBDVRCBDJJV-HAFWLYHUSA-N Ile-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O HZYHBDVRCBDJJV-HAFWLYHUSA-N 0.000 description 1
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 1
- HZMLFETXHFHGBB-UGYAYLCHSA-N Ile-Asn-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZMLFETXHFHGBB-UGYAYLCHSA-N 0.000 description 1
- UKTUOMWSJPXODT-GUDRVLHUSA-N Ile-Asn-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N UKTUOMWSJPXODT-GUDRVLHUSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- HGNUKGZQASSBKQ-PCBIJLKTSA-N Ile-Asp-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HGNUKGZQASSBKQ-PCBIJLKTSA-N 0.000 description 1
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 1
- CNPNWGHRMBQHBZ-ZKWXMUAHSA-N Ile-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O CNPNWGHRMBQHBZ-ZKWXMUAHSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- KTGFOCFYOZQVRJ-ZKWXMUAHSA-N Ile-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O KTGFOCFYOZQVRJ-ZKWXMUAHSA-N 0.000 description 1
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 1
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 1
- IGJWJGIHUFQANP-LAEOZQHASA-N Ile-Gly-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N IGJWJGIHUFQANP-LAEOZQHASA-N 0.000 description 1
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- VOBYAKCXGQQFLR-LSJOCFKGSA-N Ile-Gly-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O VOBYAKCXGQQFLR-LSJOCFKGSA-N 0.000 description 1
- WIZPFZKOFZXDQG-HTFCKZLJSA-N Ile-Ile-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O WIZPFZKOFZXDQG-HTFCKZLJSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- WMDZARSFSMZOQO-DRZSPHRISA-N Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WMDZARSFSMZOQO-DRZSPHRISA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- XMYURPUVJSKTMC-KBIXCLLPSA-N Ile-Ser-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XMYURPUVJSKTMC-KBIXCLLPSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 1
- CNMOKANDJMLAIF-CIQUZCHMSA-N Ile-Thr-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O CNMOKANDJMLAIF-CIQUZCHMSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- NAFIFZNBSPWYOO-RWRJDSDZSA-N Ile-Thr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NAFIFZNBSPWYOO-RWRJDSDZSA-N 0.000 description 1
- COWHUQXTSYTKQC-RWRJDSDZSA-N Ile-Thr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N COWHUQXTSYTKQC-RWRJDSDZSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- XVUAQNRNFMVWBR-BLMTYFJBSA-N Ile-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N XVUAQNRNFMVWBR-BLMTYFJBSA-N 0.000 description 1
- MUFXDFWAJSPHIQ-XDTLVQLUSA-N Ile-Tyr Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 MUFXDFWAJSPHIQ-XDTLVQLUSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- YWCJXQKATPNPOE-UKJIMTQDSA-N Ile-Val-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YWCJXQKATPNPOE-UKJIMTQDSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- DGAQECJNVWCQMB-PUAWFVPOSA-M Ilexoside XXIX Chemical compound C[C@@H]1CC[C@@]2(CC[C@@]3(C(=CC[C@H]4[C@]3(CC[C@@H]5[C@@]4(CC[C@@H](C5(C)C)OS(=O)(=O)[O-])C)C)[C@@H]2[C@]1(C)O)C)C(=O)O[C@H]6[C@@H]([C@H]([C@@H]([C@H](O6)CO)O)O)O.[Na+] DGAQECJNVWCQMB-PUAWFVPOSA-M 0.000 description 1
- 102100037850 Interferon gamma Human genes 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108090000174 Interleukin-10 Proteins 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 1
- QLROSWPKSBORFJ-BQBZGAKWSA-N L-Prolyl-L-glutamic acid Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 QLROSWPKSBORFJ-BQBZGAKWSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- HSQGMTRYSIHDAC-BQBZGAKWSA-N Leu-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(O)=O HSQGMTRYSIHDAC-BQBZGAKWSA-N 0.000 description 1
- YUGVQABRIJXYNQ-CIUDSAMLSA-N Leu-Ala-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YUGVQABRIJXYNQ-CIUDSAMLSA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 1
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- VIWUBXKCYJGNCL-SRVKXCTJSA-N Leu-Asn-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 VIWUBXKCYJGNCL-SRVKXCTJSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 1
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 1
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 1
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 1
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 1
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 1
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- MPSBSKHOWJQHBS-IHRRRGAJSA-N Leu-His-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N MPSBSKHOWJQHBS-IHRRRGAJSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- ZALAVHVPPOHAOL-XUXIUFHCSA-N Leu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N ZALAVHVPPOHAOL-XUXIUFHCSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- VUZMPNMNJBGOKE-IHRRRGAJSA-N Leu-Leu-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VUZMPNMNJBGOKE-IHRRRGAJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- VTJUNIYRYIAIHF-IUCAKERBSA-N Leu-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O VTJUNIYRYIAIHF-IUCAKERBSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 1
- GOFJOGXGMPHOGL-DCAQKATOSA-N Leu-Ser-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(C)C GOFJOGXGMPHOGL-DCAQKATOSA-N 0.000 description 1
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- HQBOMRTVKVKFMN-WDSOQIARSA-N Leu-Trp-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O HQBOMRTVKVKFMN-WDSOQIARSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- MDSUKZSLOATHMH-IUCAKERBSA-N Leu-Val Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C([O-])=O MDSUKZSLOATHMH-IUCAKERBSA-N 0.000 description 1
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 1
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 1
- FMFNIDICDKEMOE-XUXIUFHCSA-N Leu-Val-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMFNIDICDKEMOE-XUXIUFHCSA-N 0.000 description 1
- FPPCCQGECVKLDY-IHRRRGAJSA-N Leu-Val-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C FPPCCQGECVKLDY-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- KVSBQLNBMUPADA-AVGNSLFASA-N Leu-Val-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KVSBQLNBMUPADA-AVGNSLFASA-N 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 1
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 1
- BYPMOIFBQPEWOH-CIUDSAMLSA-N Lys-Asn-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BYPMOIFBQPEWOH-CIUDSAMLSA-N 0.000 description 1
- CIOWSLJGLSUOME-BQBZGAKWSA-N Lys-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O CIOWSLJGLSUOME-BQBZGAKWSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 1
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 1
- OAPNERBWQWUPTI-YUMQZZPRSA-N Lys-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O OAPNERBWQWUPTI-YUMQZZPRSA-N 0.000 description 1
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 1
- VSRXPEHZMHSFKU-IUCAKERBSA-N Lys-Gln-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O VSRXPEHZMHSFKU-IUCAKERBSA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- UGTZHPSKYRIGRJ-YUMQZZPRSA-N Lys-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O UGTZHPSKYRIGRJ-YUMQZZPRSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- VQXAVLQBQJMENB-SRVKXCTJSA-N Lys-Glu-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O VQXAVLQBQJMENB-SRVKXCTJSA-N 0.000 description 1
- ODUQLUADRKMHOZ-JYJNAYRXSA-N Lys-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)O ODUQLUADRKMHOZ-JYJNAYRXSA-N 0.000 description 1
- HGNRJCINZYHNOU-LURJTMIESA-N Lys-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(O)=O HGNRJCINZYHNOU-LURJTMIESA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 1
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- ATIPDCIQTUXABX-UWVGGRQHSA-N Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ATIPDCIQTUXABX-UWVGGRQHSA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- ZCWWVXAXWUAEPZ-SRVKXCTJSA-N Lys-Met-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZCWWVXAXWUAEPZ-SRVKXCTJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- AIXUQKMMBQJZCU-IUCAKERBSA-N Lys-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O AIXUQKMMBQJZCU-IUCAKERBSA-N 0.000 description 1
- AFLBTVGQCQLOFJ-AVGNSLFASA-N Lys-Pro-Arg Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O AFLBTVGQCQLOFJ-AVGNSLFASA-N 0.000 description 1
- SVSQSPICRKBMSZ-SRVKXCTJSA-N Lys-Pro-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O SVSQSPICRKBMSZ-SRVKXCTJSA-N 0.000 description 1
- LUTDBHBIHHREDC-IHRRRGAJSA-N Lys-Pro-Lys Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O LUTDBHBIHHREDC-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- ZOKVLMBYDSIDKG-CSMHCCOUSA-N Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN ZOKVLMBYDSIDKG-CSMHCCOUSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- RVKIPWVMZANZLI-ZFWWWQNUSA-N Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-ZFWWWQNUSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- YQAIUOWPSUOINN-IUCAKERBSA-N Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCCCN YQAIUOWPSUOINN-IUCAKERBSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 239000004472 Lysine Substances 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- HUKLXYYPZWPXCC-KZVJFYERSA-N Met-Ala-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HUKLXYYPZWPXCC-KZVJFYERSA-N 0.000 description 1
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 1
- QTZXSYBVOSXBEJ-WDSKDSINSA-N Met-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O QTZXSYBVOSXBEJ-WDSKDSINSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 1
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 1
- UOENBSHXYCHSAU-YUMQZZPRSA-N Met-Gln-Gly Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UOENBSHXYCHSAU-YUMQZZPRSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- ADHNYKZHPOEULM-BQBZGAKWSA-N Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(O)=O ADHNYKZHPOEULM-BQBZGAKWSA-N 0.000 description 1
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 1
- VZBXCMCHIHEPBL-SRVKXCTJSA-N Met-Glu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN VZBXCMCHIHEPBL-SRVKXCTJSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- MWAYJIAKVUBKKP-IUCAKERBSA-N Met-His Chemical compound CSCC[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 MWAYJIAKVUBKKP-IUCAKERBSA-N 0.000 description 1
- XPCLRYNQMZOOFB-ULQDDVLXSA-N Met-His-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N XPCLRYNQMZOOFB-ULQDDVLXSA-N 0.000 description 1
- GETCJHFFECHWHI-QXEWZRGKSA-N Met-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCSC)N GETCJHFFECHWHI-QXEWZRGKSA-N 0.000 description 1
- WPTDJKDGICUFCP-XUXIUFHCSA-N Met-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCSC)N WPTDJKDGICUFCP-XUXIUFHCSA-N 0.000 description 1
- AFFKUNVPPLQUGA-DCAQKATOSA-N Met-Leu-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O AFFKUNVPPLQUGA-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- KMSMNUFBNCHMII-IHRRRGAJSA-N Met-Leu-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN KMSMNUFBNCHMII-IHRRRGAJSA-N 0.000 description 1
- LBNFTWKGISQVEE-AVGNSLFASA-N Met-Leu-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCSC LBNFTWKGISQVEE-AVGNSLFASA-N 0.000 description 1
- XDGFFEZAZHRZFR-RHYQMDGZSA-N Met-Leu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDGFFEZAZHRZFR-RHYQMDGZSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- IMTUWVJPCQPJEE-IUCAKERBSA-N Met-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN IMTUWVJPCQPJEE-IUCAKERBSA-N 0.000 description 1
- AXHNAGAYRGCDLG-UWVGGRQHSA-N Met-Lys-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AXHNAGAYRGCDLG-UWVGGRQHSA-N 0.000 description 1
- KKXGLCPUAWODHF-GUBZILKMSA-N Met-Met-Cys Chemical compound N[C@@H](CCSC)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(O)=O KKXGLCPUAWODHF-GUBZILKMSA-N 0.000 description 1
- UDOYVQQKQHZYMB-DCAQKATOSA-N Met-Met-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDOYVQQKQHZYMB-DCAQKATOSA-N 0.000 description 1
- DZMGFGQBRYWJOR-YUMQZZPRSA-N Met-Pro Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(O)=O DZMGFGQBRYWJOR-YUMQZZPRSA-N 0.000 description 1
- WYDFQSJOARJAMM-GUBZILKMSA-N Met-Pro-Asp Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WYDFQSJOARJAMM-GUBZILKMSA-N 0.000 description 1
- CAEZLMGDJMEBKP-AVGNSLFASA-N Met-Pro-His Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC=N1 CAEZLMGDJMEBKP-AVGNSLFASA-N 0.000 description 1
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 1
- SOAYQFDWEIWPPR-IHRRRGAJSA-N Met-Ser-Tyr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O SOAYQFDWEIWPPR-IHRRRGAJSA-N 0.000 description 1
- KSIPKXNIQOWMIC-RCWTZXSCSA-N Met-Thr-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KSIPKXNIQOWMIC-RCWTZXSCSA-N 0.000 description 1
- FXBKQTOGURNXSL-HJGDQZAQSA-N Met-Thr-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O FXBKQTOGURNXSL-HJGDQZAQSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 1
- NSMXRFMGZYTFEX-KJEVXHAQSA-N Met-Thr-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCSC)N)O NSMXRFMGZYTFEX-KJEVXHAQSA-N 0.000 description 1
- FSTWDRPCQQUJIT-NHCYSSNCSA-N Met-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N FSTWDRPCQQUJIT-NHCYSSNCSA-N 0.000 description 1
- 241001302042 Methanothermobacter thermautotrophicus Species 0.000 description 1
- HDAJUGGARUFROU-JSUDGWJLSA-L MoO2-molybdopterin cofactor Chemical compound O([C@H]1NC=2N=C(NC(=O)C=2N[C@H]11)N)[C@H](COP(O)(O)=O)C2=C1S[Mo](=O)(=O)S2 HDAJUGGARUFROU-JSUDGWJLSA-L 0.000 description 1
- 108010006519 Molecular Chaperones Proteins 0.000 description 1
- 102000005431 Molecular Chaperones Human genes 0.000 description 1
- 206010062207 Mycobacterial infection Diseases 0.000 description 1
- 241001467553 Mycobacterium africanum Species 0.000 description 1
- 101100000702 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) acpM gene Proteins 0.000 description 1
- 101100309462 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) ahcY gene Proteins 0.000 description 1
- 101100092304 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) frr gene Proteins 0.000 description 1
- 101100348252 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) ndkA gene Proteins 0.000 description 1
- 101100529128 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) rpoA gene Proteins 0.000 description 1
- MDSUKZSLOATHMH-UHFFFAOYSA-N N-L-leucyl-L-valine Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(O)=O MDSUKZSLOATHMH-UHFFFAOYSA-N 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 1
- 241001644525 Nastus productus Species 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 108090000417 Oxygenases Proteins 0.000 description 1
- 102000004020 Oxygenases Human genes 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 108010087702 Penicillinase Proteins 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- MIDZLCFIAINOQN-WPRPVWTQSA-N Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 MIDZLCFIAINOQN-WPRPVWTQSA-N 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- AJOKKVTWEMXZHC-DRZSPHRISA-N Phe-Ala-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 AJOKKVTWEMXZHC-DRZSPHRISA-N 0.000 description 1
- CYZBFPYMSJGBRL-DRZSPHRISA-N Phe-Ala-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CYZBFPYMSJGBRL-DRZSPHRISA-N 0.000 description 1
- ULECEJGNDHWSKD-QEJZJMRPSA-N Phe-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 ULECEJGNDHWSKD-QEJZJMRPSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- VHWOBXIWBDWZHK-IHRRRGAJSA-N Phe-Arg-Asp Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHWOBXIWBDWZHK-IHRRRGAJSA-N 0.000 description 1
- MQWISMJKHOUEMW-ULQDDVLXSA-N Phe-Arg-His Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 MQWISMJKHOUEMW-ULQDDVLXSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- YQNBKXUTWBRQCS-BVSLBCMMSA-N Phe-Arg-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 YQNBKXUTWBRQCS-BVSLBCMMSA-N 0.000 description 1
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 1
- HWMGTNOVUDIKRE-UWVGGRQHSA-N Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 HWMGTNOVUDIKRE-UWVGGRQHSA-N 0.000 description 1
- DDYIRGBOZVKRFR-AVGNSLFASA-N Phe-Asp-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DDYIRGBOZVKRFR-AVGNSLFASA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 1
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 1
- GLUBLISJVJFHQS-VIFPVBQESA-N Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 GLUBLISJVJFHQS-VIFPVBQESA-N 0.000 description 1
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical class [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- XEXSSIBQYNKFBX-KBPBESRZSA-N Phe-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1N=CNC=1)C(O)=O)C1=CC=CC=C1 XEXSSIBQYNKFBX-KBPBESRZSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- SWCOXQLDICUYOL-ULQDDVLXSA-N Phe-His-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SWCOXQLDICUYOL-ULQDDVLXSA-N 0.000 description 1
- BEEVXUYVEHXWRQ-YESZJQIVSA-N Phe-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O BEEVXUYVEHXWRQ-YESZJQIVSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- JKJSIYKSGIDHPM-WBAXXEDZSA-N Phe-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O JKJSIYKSGIDHPM-WBAXXEDZSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- MVIJMIZJPHQGEN-IHRRRGAJSA-N Phe-Ser-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@H](CO)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 MVIJMIZJPHQGEN-IHRRRGAJSA-N 0.000 description 1
- NYQBYASWHVRESG-MIMYLULJSA-N Phe-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 NYQBYASWHVRESG-MIMYLULJSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- MHNBYYFXWDUGBW-RPTUDFQQSA-N Phe-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O MHNBYYFXWDUGBW-RPTUDFQQSA-N 0.000 description 1
- IEHDJWSAXBGJIP-RYUDHWBXSA-N Phe-Val Chemical compound CC(C)[C@@H](C([O-])=O)NC(=O)[C@@H]([NH3+])CC1=CC=CC=C1 IEHDJWSAXBGJIP-RYUDHWBXSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108091036414 Polyinosinic:polycytidylic acid Proteins 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- ZLMJMSJWJFRBEC-UHFFFAOYSA-N Potassium Chemical compound [K] ZLMJMSJWJFRBEC-UHFFFAOYSA-N 0.000 description 1
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 1
- GLEOIKLQBZNKJZ-WDSKDSINSA-N Pro-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GLEOIKLQBZNKJZ-WDSKDSINSA-N 0.000 description 1
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 1
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- XKHCJJPNXFBADI-DCAQKATOSA-N Pro-Asp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O XKHCJJPNXFBADI-DCAQKATOSA-N 0.000 description 1
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 1
- SHAQGFGGJSLLHE-BQBZGAKWSA-N Pro-Gln Chemical compound NC(=O)CC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 SHAQGFGGJSLLHE-BQBZGAKWSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- ZTVCLZLGHZXLOT-ULQDDVLXSA-N Pro-Glu-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O ZTVCLZLGHZXLOT-ULQDDVLXSA-N 0.000 description 1
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEPSEIDIPBMIOS-QXEWZRGKSA-N Pro-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEPSEIDIPBMIOS-QXEWZRGKSA-N 0.000 description 1
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 1
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 1
- VWXGFAIZUQBBBG-UWVGGRQHSA-N Pro-His-Gly Chemical compound C([C@@H](C(=O)NCC(=O)[O-])NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 VWXGFAIZUQBBBG-UWVGGRQHSA-N 0.000 description 1
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 1
- OCYROESYHWUPBP-CIUDSAMLSA-N Pro-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 OCYROESYHWUPBP-CIUDSAMLSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- AQGUSRZKDZYGGV-GMOBBJLQSA-N Pro-Ile-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O AQGUSRZKDZYGGV-GMOBBJLQSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- ZKQOUHVVXABNDG-IUCAKERBSA-N Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 ZKQOUHVVXABNDG-IUCAKERBSA-N 0.000 description 1
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 1
- RVQDZELMXZRSSI-IUCAKERBSA-N Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 RVQDZELMXZRSSI-IUCAKERBSA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- CDGABSWLRMECHC-IHRRRGAJSA-N Pro-Lys-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O CDGABSWLRMECHC-IHRRRGAJSA-N 0.000 description 1
- ULWBBFKQBDNGOY-RWMBFGLXSA-N Pro-Lys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N2CCC[C@@H]2C(=O)O ULWBBFKQBDNGOY-RWMBFGLXSA-N 0.000 description 1
- KLOQCCRTPHPIFN-DCAQKATOSA-N Pro-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 KLOQCCRTPHPIFN-DCAQKATOSA-N 0.000 description 1
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- IWIANZLCJVYEFX-RYUDHWBXSA-N Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 IWIANZLCJVYEFX-RYUDHWBXSA-N 0.000 description 1
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- RWCOTTLHDJWHRS-YUMQZZPRSA-N Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RWCOTTLHDJWHRS-YUMQZZPRSA-N 0.000 description 1
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- AFWBWPCXSWUCLB-WDSKDSINSA-N Pro-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H]1CCC[NH2+]1 AFWBWPCXSWUCLB-WDSKDSINSA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- GVUVRRPYYDHHGK-VQVTYTSYSA-N Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 GVUVRRPYYDHHGK-VQVTYTSYSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- IURWWZYKYPEANQ-HJGDQZAQSA-N Pro-Thr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IURWWZYKYPEANQ-HJGDQZAQSA-N 0.000 description 1
- DMNANGOFEUVBRV-GJZGRUSLSA-N Pro-Trp-Gly Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)O)C(=O)[C@@H]1CCCN1 DMNANGOFEUVBRV-GJZGRUSLSA-N 0.000 description 1
- OIDKVWTWGDWMHY-RYUDHWBXSA-N Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 OIDKVWTWGDWMHY-RYUDHWBXSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- AWJGUZSYVIVZGP-YUMQZZPRSA-N Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1 AWJGUZSYVIVZGP-YUMQZZPRSA-N 0.000 description 1
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102100032116 Putative nucleoside diphosphate kinase Human genes 0.000 description 1
- 101710205590 Putative nucleoside diphosphate kinase Proteins 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 208000035415 Reinfection Diseases 0.000 description 1
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 1
- 241000187559 Saccharopolyspora erythraea Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 206010070834 Sensitisation Diseases 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- SSJMZMUVNKEENT-IMJSIDKUSA-N Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CO SSJMZMUVNKEENT-IMJSIDKUSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 1
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- PZZJMBYSYAKYPK-UWJYBYFXSA-N Ser-Ala-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PZZJMBYSYAKYPK-UWJYBYFXSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- RZEQTVHJZCIUBT-WDSKDSINSA-N Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RZEQTVHJZCIUBT-WDSKDSINSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- CTLVSHXLRVEILB-UBHSHLNASA-N Ser-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N CTLVSHXLRVEILB-UBHSHLNASA-N 0.000 description 1
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- UJTZHGHXJKIAOS-WHFBIAKZSA-N Ser-Gln Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O UJTZHGHXJKIAOS-WHFBIAKZSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 1
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 1
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 1
- BRIZMMZEYSAKJX-QEJZJMRPSA-N Ser-Glu-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N BRIZMMZEYSAKJX-QEJZJMRPSA-N 0.000 description 1
- WOUIMBGNEUWXQG-VKHMYHEASA-N Ser-Gly Chemical compound OC[C@H](N)C(=O)NCC(O)=O WOUIMBGNEUWXQG-VKHMYHEASA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- AEGUWTFAQQWVLC-BQBZGAKWSA-N Ser-Gly-Arg Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEGUWTFAQQWVLC-BQBZGAKWSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- BXLYSRPHVMCOPS-ACZMJKKPSA-N Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO BXLYSRPHVMCOPS-ACZMJKKPSA-N 0.000 description 1
- BKZYBLLIBOBOOW-GHCJXIJMSA-N Ser-Ile-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O BKZYBLLIBOBOOW-GHCJXIJMSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- LWMQRHDTXHQQOV-MXAVVETBSA-N Ser-Ile-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LWMQRHDTXHQQOV-MXAVVETBSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- HEUVHBXOVZONPU-BJDJZHNGSA-N Ser-Leu-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HEUVHBXOVZONPU-BJDJZHNGSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- RQXDSYQXBCRXBT-GUBZILKMSA-N Ser-Met-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RQXDSYQXBCRXBT-GUBZILKMSA-N 0.000 description 1
- XNXRTQZTFVMJIJ-DCAQKATOSA-N Ser-Met-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNXRTQZTFVMJIJ-DCAQKATOSA-N 0.000 description 1
- PPQRSMGDOHLTBE-UWVGGRQHSA-N Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PPQRSMGDOHLTBE-UWVGGRQHSA-N 0.000 description 1
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 1
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 1
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- QPPYAWVLAVXISR-DCAQKATOSA-N Ser-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QPPYAWVLAVXISR-DCAQKATOSA-N 0.000 description 1
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- PURRNJBBXDDWLX-ZDLURKLDSA-N Ser-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CO)N)O PURRNJBBXDDWLX-ZDLURKLDSA-N 0.000 description 1
- FLMYSKVSDVHLEW-SVSWQMSJSA-N Ser-Thr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLMYSKVSDVHLEW-SVSWQMSJSA-N 0.000 description 1
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 1
- FGBLCMLXHRPVOF-IHRRRGAJSA-N Ser-Tyr-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FGBLCMLXHRPVOF-IHRRRGAJSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 1
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 1
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 1
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 1
- RCOUFINCYASMDN-GUBZILKMSA-N Ser-Val-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O RCOUFINCYASMDN-GUBZILKMSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- 241000607720 Serratia Species 0.000 description 1
- 241000589196 Sinorhizobium meliloti Species 0.000 description 1
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 1
- 229920002125 Sokalan® Polymers 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 108010090804 Streptavidin Proteins 0.000 description 1
- 102000004385 Sulfurtransferases Human genes 0.000 description 1
- 108090000984 Sulfurtransferases Proteins 0.000 description 1
- 241000192707 Synechococcus Species 0.000 description 1
- UZMAPBJVXOGOFT-UHFFFAOYSA-N Syringetin Natural products COC1=C(O)C(OC)=CC(C2=C(C(=O)C3=C(O)C=C(O)C=C3O2)O)=C1 UZMAPBJVXOGOFT-UHFFFAOYSA-N 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 1
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 1
- PKXHGEXFMIZSER-QTKMDUPCSA-N Thr-Arg-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O PKXHGEXFMIZSER-QTKMDUPCSA-N 0.000 description 1
- VFEHSAJCWWHDBH-RHYQMDGZSA-N Thr-Arg-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VFEHSAJCWWHDBH-RHYQMDGZSA-N 0.000 description 1
- VOGXLRKCWFLJBY-HSHDSVGOSA-N Thr-Arg-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VOGXLRKCWFLJBY-HSHDSVGOSA-N 0.000 description 1
- YLXAMFZYJTZXFH-OLHMAJIHSA-N Thr-Asn-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YLXAMFZYJTZXFH-OLHMAJIHSA-N 0.000 description 1
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 1
- YOSLMIPKOUAHKI-OLHMAJIHSA-N Thr-Asp-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O YOSLMIPKOUAHKI-OLHMAJIHSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 1
- XXNLGZRRSKPSGF-HTUGSXCWSA-N Thr-Gln-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O XXNLGZRRSKPSGF-HTUGSXCWSA-N 0.000 description 1
- LIXBDERDAGNVAV-XKBZYTNZSA-N Thr-Gln-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O LIXBDERDAGNVAV-XKBZYTNZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- AYCQVUUPIJHJTA-IXOXFDKPSA-N Thr-His-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O AYCQVUUPIJHJTA-IXOXFDKPSA-N 0.000 description 1
- CRZNCABIJLRFKZ-IUKAMOBKSA-N Thr-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N CRZNCABIJLRFKZ-IUKAMOBKSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 1
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- IJVNLNRVDUTWDD-MEYUZBJRSA-N Thr-Leu-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IJVNLNRVDUTWDD-MEYUZBJRSA-N 0.000 description 1
- QNCFWHZVRNXAKW-OEAJRASXSA-N Thr-Lys-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNCFWHZVRNXAKW-OEAJRASXSA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- LHNNQVXITHUCAB-QTKMDUPCSA-N Thr-Met-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O LHNNQVXITHUCAB-QTKMDUPCSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 1
- GVMXJJAJLIEASL-ZJDVBMNYSA-N Thr-Pro-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O GVMXJJAJLIEASL-ZJDVBMNYSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- PRTHQBSMXILLPC-XGEHTFHBSA-N Thr-Ser-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PRTHQBSMXILLPC-XGEHTFHBSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- QYDKSNXSBXZPFK-ZJDVBMNYSA-N Thr-Thr-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYDKSNXSBXZPFK-ZJDVBMNYSA-N 0.000 description 1
- MFMGPEKYBXFIRF-SUSMZKCASA-N Thr-Thr-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFMGPEKYBXFIRF-SUSMZKCASA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- OGOYMQWIWHGTGH-KZVJFYERSA-N Thr-Val-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O OGOYMQWIWHGTGH-KZVJFYERSA-N 0.000 description 1
- CURFABYITJVKEW-QTKMDUPCSA-N Thr-Val-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O CURFABYITJVKEW-QTKMDUPCSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- 239000007997 Tricine buffer Substances 0.000 description 1
- 101710154918 Trigger factor Proteins 0.000 description 1
- 244000250129 Trigonella foenum graecum Species 0.000 description 1
- 235000001484 Trigonella foenum graecum Nutrition 0.000 description 1
- 244000098338 Triticum aestivum Species 0.000 description 1
- MQVGIFJSFFVGFW-XEGUGMAKSA-N Trp-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MQVGIFJSFFVGFW-XEGUGMAKSA-N 0.000 description 1
- MJBBMTOGSOSAKJ-HJXMPXNTSA-N Trp-Ala-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MJBBMTOGSOSAKJ-HJXMPXNTSA-N 0.000 description 1
- AVYVKJMBNLPWRX-WFBYXXMGSA-N Trp-Ala-Ser Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O)=CNC2=C1 AVYVKJMBNLPWRX-WFBYXXMGSA-N 0.000 description 1
- IXEGQBJZDIRRIV-QEJZJMRPSA-N Trp-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IXEGQBJZDIRRIV-QEJZJMRPSA-N 0.000 description 1
- DQDXHYIEITXNJY-BPUTZDHNSA-N Trp-Gln-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N DQDXHYIEITXNJY-BPUTZDHNSA-N 0.000 description 1
- VTHNLRXALGUDBS-BPUTZDHNSA-N Trp-Gln-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VTHNLRXALGUDBS-BPUTZDHNSA-N 0.000 description 1
- FNOQJVHFVLVMOS-AAEUAGOBSA-N Trp-Gly-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N FNOQJVHFVLVMOS-AAEUAGOBSA-N 0.000 description 1
- NOBINHCGDUHOBV-NAZCDGGXSA-N Trp-His-Thr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NOBINHCGDUHOBV-NAZCDGGXSA-N 0.000 description 1
- SAKLWFSRZTZQAJ-GQGQLFGLSA-N Trp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N SAKLWFSRZTZQAJ-GQGQLFGLSA-N 0.000 description 1
- CXPJPTFWKXNDKV-NUTKFTJISA-N Trp-Leu-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CXPJPTFWKXNDKV-NUTKFTJISA-N 0.000 description 1
- VPRHDRKAPYZMHL-SZMVWBNQSA-N Trp-Leu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 VPRHDRKAPYZMHL-SZMVWBNQSA-N 0.000 description 1
- XGFOXYJQBRTJPO-PJODQICGSA-N Trp-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XGFOXYJQBRTJPO-PJODQICGSA-N 0.000 description 1
- GQNCRIFNDVFRNF-BPUTZDHNSA-N Trp-Pro-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O GQNCRIFNDVFRNF-BPUTZDHNSA-N 0.000 description 1
- IVBJBFSWJDNQFW-XIRDDKMYSA-N Trp-Pro-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IVBJBFSWJDNQFW-XIRDDKMYSA-N 0.000 description 1
- RNDWCRUOGGQDKN-UBHSHLNASA-N Trp-Ser-Asp Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RNDWCRUOGGQDKN-UBHSHLNASA-N 0.000 description 1
- YBRHKUNWEYBZGT-WLTAIBSBSA-N Trp-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(O)=O)=CNC2=C1 YBRHKUNWEYBZGT-WLTAIBSBSA-N 0.000 description 1
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 1
- DYIXEGROAOVQPK-VFAJRCTISA-N Trp-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O DYIXEGROAOVQPK-VFAJRCTISA-N 0.000 description 1
- YXSSXUIBUJGHJY-SFJXLCSZSA-N Trp-Thr-Phe Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)[C@H](O)C)C(O)=O)C1=CC=CC=C1 YXSSXUIBUJGHJY-SFJXLCSZSA-N 0.000 description 1
- LWFWZRANSFAJDR-JSGCOSHPSA-N Trp-Val Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 LWFWZRANSFAJDR-JSGCOSHPSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 1
- 102100040247 Tumor necrosis factor Human genes 0.000 description 1
- XLMDWQNAOKLKCP-XDTLVQLUSA-N Tyr-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N XLMDWQNAOKLKCP-XDTLVQLUSA-N 0.000 description 1
- IELISNUVHBKYBX-XDTLVQLUSA-N Tyr-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 IELISNUVHBKYBX-XDTLVQLUSA-N 0.000 description 1
- SDNVRAKIJVKAGS-LKTVYLICSA-N Tyr-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N SDNVRAKIJVKAGS-LKTVYLICSA-N 0.000 description 1
- ADBDQGBDNUTRDB-ULQDDVLXSA-N Tyr-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O ADBDQGBDNUTRDB-ULQDDVLXSA-N 0.000 description 1
- DYEGCOJHFNJBKB-UFYCRDLUSA-N Tyr-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 DYEGCOJHFNJBKB-UFYCRDLUSA-N 0.000 description 1
- GAYLGYUVTDMLKC-UWJYBYFXSA-N Tyr-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GAYLGYUVTDMLKC-UWJYBYFXSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- WZQZUVWEPMGIMM-JYJNAYRXSA-N Tyr-Gln-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O WZQZUVWEPMGIMM-JYJNAYRXSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- HIINQLBHPIQYHN-JTQLQIEISA-N Tyr-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HIINQLBHPIQYHN-JTQLQIEISA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- MVYRJYISVJWKSX-KBPBESRZSA-N Tyr-His-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)NCC(=O)O)N)O MVYRJYISVJWKSX-KBPBESRZSA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- KSCVLGXNQXKUAR-JYJNAYRXSA-N Tyr-Leu-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KSCVLGXNQXKUAR-JYJNAYRXSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 1
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 1
- VXFXIBCCVLJCJT-JYJNAYRXSA-N Tyr-Pro-Pro Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N1CCC[C@H]1C(O)=O VXFXIBCCVLJCJT-JYJNAYRXSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 1
- KLOZTPOXVVRVAQ-DZKIICNBSA-N Tyr-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KLOZTPOXVVRVAQ-DZKIICNBSA-N 0.000 description 1
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- REJBPZVUHYNMEN-LSJOCFKGSA-N Val-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N REJBPZVUHYNMEN-LSJOCFKGSA-N 0.000 description 1
- UXBZYLSMYOATLH-DCAQKATOSA-N Val-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)C(C)C UXBZYLSMYOATLH-DCAQKATOSA-N 0.000 description 1
- AZSHAZJLOZQYAY-FXQIFTODSA-N Val-Ala-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O AZSHAZJLOZQYAY-FXQIFTODSA-N 0.000 description 1
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- WITCOKQIPFWQQD-FSPLSTOPSA-N Val-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CC(N)=O WITCOKQIPFWQQD-FSPLSTOPSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- AUMNPAUHKUNHHN-BYULHYEWSA-N Val-Asn-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N AUMNPAUHKUNHHN-BYULHYEWSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- XXDVDTMEVBYRPK-XPUUQOCRSA-N Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCC(N)=O XXDVDTMEVBYRPK-XPUUQOCRSA-N 0.000 description 1
- OUUBKKIJQIAPRI-LAEOZQHASA-N Val-Gln-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OUUBKKIJQIAPRI-LAEOZQHASA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- NYTKXWLZSNRILS-IFFSRLJSSA-N Val-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N)O NYTKXWLZSNRILS-IFFSRLJSSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 1
- CELJCNRXKZPTCX-XPUUQOCRSA-N Val-Gly-Ala Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O CELJCNRXKZPTCX-XPUUQOCRSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- BNQVUHQWZGTIBX-IUCAKERBSA-N Val-His Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CN=CN1 BNQVUHQWZGTIBX-IUCAKERBSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- OACSGBOREVRSME-NHCYSSNCSA-N Val-His-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CC(N)=O)C(O)=O OACSGBOREVRSME-NHCYSSNCSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- DLMNFMXSNGTSNJ-PYJNHQTQSA-N Val-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N DLMNFMXSNGTSNJ-PYJNHQTQSA-N 0.000 description 1
- PNVLWFYAPWAQMU-CIUDSAMLSA-N Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](N)C(C)C PNVLWFYAPWAQMU-CIUDSAMLSA-N 0.000 description 1
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 1
- VHRLUTIMTDOVCG-PEDHHIEDSA-N Val-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](C(C)C)N VHRLUTIMTDOVCG-PEDHHIEDSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- JZWZACGUZVCQPS-RNJOBUHISA-N Val-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N JZWZACGUZVCQPS-RNJOBUHISA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 1
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 1
- BZOSBRIDWSSTFN-AVGNSLFASA-N Val-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N BZOSBRIDWSSTFN-AVGNSLFASA-N 0.000 description 1
- JKHXYJKMNSSFFL-IUCAKERBSA-N Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(O)=O)CCCCN JKHXYJKMNSSFFL-IUCAKERBSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- IEBGHUMBJXIXHM-AVGNSLFASA-N Val-Lys-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)O)N IEBGHUMBJXIXHM-AVGNSLFASA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 1
- GJNDXQBALKCYSZ-RYUDHWBXSA-N Val-Phe Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 GJNDXQBALKCYSZ-RYUDHWBXSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 1
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 1
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- DVLWZWNAQUBZBC-ZNSHCXBVSA-N Val-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N)O DVLWZWNAQUBZBC-ZNSHCXBVSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- UEXPMFIAZZHEAD-HSHDSVGOSA-N Val-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N)O UEXPMFIAZZHEAD-HSHDSVGOSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- IWADHXDXSQONEL-GUBZILKMSA-N Val-Val-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O IWADHXDXSQONEL-GUBZILKMSA-N 0.000 description 1
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 1
- WBPFYNYTYASCQP-CYDGBPFRSA-N Val-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N WBPFYNYTYASCQP-CYDGBPFRSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- STTYIMSDIYISRG-UHFFFAOYSA-N Valyl-Serine Chemical compound CC(C)C(N)C(=O)NC(CO)C(O)=O STTYIMSDIYISRG-UHFFFAOYSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 241000588902 Zymomonas mobilis Species 0.000 description 1
- LUXUAZKGQZPOBZ-SAXJAHGMSA-N [(3S,4S,5S,6R)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl] (Z)-octadec-9-enoate Chemical compound CCCCCCCC\C=C/CCCCCCCC(=O)OC1O[C@H](CO)[C@@H](O)[C@H](O)[C@@H]1O LUXUAZKGQZPOBZ-SAXJAHGMSA-N 0.000 description 1
- ZWBTYMGEBZUQTK-PVLSIAFMSA-N [(7S,9E,11S,12R,13S,14R,15R,16R,17S,18S,19E,21Z)-2,15,17,32-tetrahydroxy-11-methoxy-3,7,12,14,16,18,22-heptamethyl-1'-(2-methylpropyl)-6,23-dioxospiro[8,33-dioxa-24,27,29-triazapentacyclo[23.6.1.14,7.05,31.026,30]tritriaconta-1(32),2,4,9,19,21,24,26,30-nonaene-28,4'-piperidine]-13-yl] acetate Chemical compound CO[C@H]1\C=C\O[C@@]2(C)Oc3c(C2=O)c2c4NC5(CCN(CC(C)C)CC5)N=c4c(=NC(=O)\C(C)=C/C=C/[C@H](C)[C@H](O)[C@@H](C)[C@@H](O)[C@@H](C)[C@H](OC(C)=O)[C@@H]1C)c(O)c2c(O)c3C ZWBTYMGEBZUQTK-PVLSIAFMSA-N 0.000 description 1
- 206010000269 abscess Diseases 0.000 description 1
- 230000000240 adjuvant effect Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 229940037003 alum Drugs 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 125000003277 amino group Chemical group 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000001355 anti-mycobacterial effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010080488 arginyl-arginyl-leucine Proteins 0.000 description 1
- 108010057412 arginyl-glycyl-aspartyl-phenylalanine Proteins 0.000 description 1
- 108010066988 asparaginyl-alanyl-glycyl-alanine Proteins 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 244000052616 bacterial pathogen Species 0.000 description 1
- 238000002869 basic local alignment search tool Methods 0.000 description 1
- 235000013405 beer Nutrition 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 239000012148 binding buffer Substances 0.000 description 1
- 230000006287 biotinylation Effects 0.000 description 1
- 238000007413 biotinylation Methods 0.000 description 1
- 210000004900 c-terminal fragment Anatomy 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000002775 capsule Substances 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 210000004970 cd4 cell Anatomy 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000004663 cell proliferation Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000002512 chemotherapy Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 210000000349 chromosome Anatomy 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 239000011247 coating layer Substances 0.000 description 1
- 230000001332 colony forming effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000012258 culturing Methods 0.000 description 1
- WZHCOOQXZCIUNC-UHFFFAOYSA-N cyclandelate Chemical compound C1C(C)(C)CC(C)CC1OC(=O)C(O)C1=CC=CC=C1 WZHCOOQXZCIUNC-UHFFFAOYSA-N 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 210000001151 cytotoxic T lymphocyte Anatomy 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 231100000517 death Toxicity 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- KCFYHBSOLOXZIF-UHFFFAOYSA-N dihydrochrysin Natural products COC1=C(O)C(OC)=CC(C2OC3=CC(O)=CC(O)=C3C(=O)C2)=C1 KCFYHBSOLOXZIF-UHFFFAOYSA-N 0.000 description 1
- PSLWZOIUBRXAQW-UHFFFAOYSA-M dimethyl(dioctadecyl)azanium;bromide Chemical compound [Br-].CCCCCCCCCCCCCCCCCC[N+](C)(C)CCCCCCCCCCCCCCCCCC PSLWZOIUBRXAQW-UHFFFAOYSA-M 0.000 description 1
- 230000003292 diminished effect Effects 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 239000003651 drinking water Substances 0.000 description 1
- 235000020188 drinking water Nutrition 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000003797 essential amino acid Substances 0.000 description 1
- 235000020776 essential amino acid Nutrition 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 230000001747 exhibiting effect Effects 0.000 description 1
- 239000000706 filtrate Substances 0.000 description 1
- 101150059052 fixB gene Proteins 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 102000054766 genetic haplotypes Human genes 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- 230000005182 global health Effects 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010084760 glycyl-tyrosyl-glycyl-aspartate Proteins 0.000 description 1
- YMAWOPBAYDPSLA-UHFFFAOYSA-N glycylglycine Chemical compound [NH3+]CC(=O)NCC([O-])=O YMAWOPBAYDPSLA-UHFFFAOYSA-N 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010033706 glycylserine Proteins 0.000 description 1
- 238000011554 guinea pig model Methods 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010079916 homocysteinase Proteins 0.000 description 1
- 102000056538 human ABO Human genes 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 235000011167 hydrochloric acid Nutrition 0.000 description 1
- 150000004679 hydroxides Chemical class 0.000 description 1
- 230000002519 immonomodulatory effect Effects 0.000 description 1
- 230000008073 immune recognition Effects 0.000 description 1
- 230000003053 immunization Effects 0.000 description 1
- 230000006054 immunological memory Effects 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 230000036512 infertility Effects 0.000 description 1
- 239000004615 ingredient Substances 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 229960003130 interferon gamma Drugs 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000010255 intramuscular injection Methods 0.000 description 1
- 239000007927 intramuscular injection Substances 0.000 description 1
- 238000007912 intraperitoneal administration Methods 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000001155 isoelectric focusing Methods 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- JJWLVOIRVHMVIS-UHFFFAOYSA-N isopropylamine Chemical compound CC(C)N JJWLVOIRVHMVIS-UHFFFAOYSA-N 0.000 description 1
- DVCSNHXRZUVYAM-BQBZGAKWSA-N leu-asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC(O)=O DVCSNHXRZUVYAM-BQBZGAKWSA-N 0.000 description 1
- 108010071185 leucyl-alanine Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010091871 leucylmethionine Proteins 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 239000006193 liquid solution Substances 0.000 description 1
- 239000006194 liquid suspension Substances 0.000 description 1
- 230000005923 long-lasting effect Effects 0.000 description 1
- 230000007787 long-term memory Effects 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 210000002540 macrophage Anatomy 0.000 description 1
- ZLNQQNXFFQJAID-UHFFFAOYSA-L magnesium carbonate Chemical compound [Mg+2].[O-]C([O-])=O ZLNQQNXFFQJAID-UHFFFAOYSA-L 0.000 description 1
- 239000001095 magnesium carbonate Substances 0.000 description 1
- 229910000021 magnesium carbonate Inorganic materials 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 210000004962 mammalian cell Anatomy 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 210000004379 membrane Anatomy 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 108010056582 methionylglutamic acid Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 108010068488 methionylphenylalanine Proteins 0.000 description 1
- KXKVLQRXCPHEJC-UHFFFAOYSA-N methyl acetate Chemical compound COC(C)=O KXKVLQRXCPHEJC-UHFFFAOYSA-N 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 101150109609 moaCB gene Proteins 0.000 description 1
- 108010046778 molybdenum cofactor Proteins 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000005087 mononuclear cell Anatomy 0.000 description 1
- 210000000663 muscle cell Anatomy 0.000 description 1
- 208000027531 mycobacterial infectious disease Diseases 0.000 description 1
- 230000017074 necrotic cell death Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- JPXMTWWFLBLUCD-UHFFFAOYSA-N nitro blue tetrazolium(2+) Chemical compound COC1=CC(C=2C=C(OC)C(=CC=2)[N+]=2N(N=C(N=2)C=2C=CC=CC=2)C=2C=CC(=CC=2)[N+]([O-])=O)=CC=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=C([N+]([O-])=O)C=C1 JPXMTWWFLBLUCD-UHFFFAOYSA-N 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 239000002777 nucleoside Substances 0.000 description 1
- 150000003833 nucleoside derivatives Chemical class 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 239000006179 pH buffering agent Substances 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 229950009506 penicillinase Drugs 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 210000004976 peripheral blood cell Anatomy 0.000 description 1
- 102000013415 peroxidase activity proteins Human genes 0.000 description 1
- 108040007629 peroxidase activity proteins Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- 235000011007 phosphoric acid Nutrition 0.000 description 1
- 150000003013 phosphoric acid derivatives Chemical class 0.000 description 1
- 150000003016 phosphoric acids Chemical class 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 238000007747 plating Methods 0.000 description 1
- 229940115272 polyinosinic:polycytidylic acid Drugs 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 239000011591 potassium Substances 0.000 description 1
- 229910052700 potassium Inorganic materials 0.000 description 1
- 230000003389 potentiating effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- MFDFERRIHVXMIY-UHFFFAOYSA-N procaine Chemical compound CCN(CC)CCOC(=O)C1=CC=C(N)C=C1 MFDFERRIHVXMIY-UHFFFAOYSA-N 0.000 description 1
- 229960004919 procaine Drugs 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 108010077112 prolyl-proline Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 238000004064 recycling Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 229960000885 rifabutin Drugs 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 239000012723 sample buffer Substances 0.000 description 1
- 238000003118 sandwich ELISA Methods 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000008313 sensitization Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 229910052708 sodium Inorganic materials 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- YEENEYXBHNNNGV-XEHWZWQGSA-M sodium;3-acetamido-5-[acetyl(methyl)amino]-2,4,6-triiodobenzoate;(2r,3r,4s,5s,6r)-2-[(2r,3s,4s,5r)-3,4-dihydroxy-2,5-bis(hydroxymethyl)oxolan-2-yl]oxy-6-(hydroxymethyl)oxane-3,4,5-triol Chemical compound [Na+].CC(=O)N(C)C1=C(I)C(NC(C)=O)=C(I)C(C([O-])=O)=C1I.O[C@H]1[C@H](O)[C@@H](CO)O[C@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 YEENEYXBHNNNGV-XEHWZWQGSA-M 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 230000003393 splenic effect Effects 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 238000007447 staining method Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 230000004936 stimulating effect Effects 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 235000000346 sugar Nutrition 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 238000013268 sustained release Methods 0.000 description 1
- 239000012730 sustained-release form Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 239000003826 tablet Substances 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- TXEYQDLBPFQVAA-UHFFFAOYSA-N tetrafluoromethane Chemical compound FC(F)(F)F TXEYQDLBPFQVAA-UHFFFAOYSA-N 0.000 description 1
- DHCDFWKWKRSZHF-UHFFFAOYSA-L thiosulfate(2-) Chemical compound [O-]S([S-])(=O)=O DHCDFWKWKRSZHF-UHFFFAOYSA-L 0.000 description 1
- 239000003053 toxin Substances 0.000 description 1
- 231100000765 toxin Toxicity 0.000 description 1
- 108700012359 toxins Proteins 0.000 description 1
- 230000005029 transcription elongation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 150000003626 triacylglycerols Chemical class 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 229960001005 tuberculin Drugs 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 229940125575 vaccine candidate Drugs 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 108010000998 wheylin-2 peptide Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/35—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Mycobacteriaceae (F)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Biophysics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Genetics & Genomics (AREA)
- Medicinal Chemistry (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Peptides Or Proteins (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
The present invention relates to substantially pure polypeptides which have a sequence identity of at least 80% to a disclosed amino acid sequence, or which are subsequences of at least 6 amino acids thereof, preferably a B- or T-cell epitope of the disclosed polypeptides. The polypeptide or the subsequence thereof has at least one of nine properties. The use of the disclosed polypeptides in medicine is disclosed, preferably as vaccines or diagnostic agents relating to virulent Mycobacterium. The invention further relates to the disclosed nucleotide sequences and to nucleotide sequences encoding the disclosed polypeptides. Medical and non-medical uses of the nucleotide sequences are disclosed.
Description
TB vaccine and diagnostic based on antigens from the M. tuberculosis cell
BACKGROUND OF THE INVENTION
Human tuberculosis (TB) caused by Mycobacterium tuberculosis is a serious global health problem responsible for approximately 3 million deaths annually, according to the WHO. The world-wide incidence of new tuberculosis cases had been falling progressively over the last decade, but recent years have markedly changed this trend due to the advent of AIDS and the appearance of multidrug resistant strains of Mycobacterium tuberculosis.
The only vaccine presently available for clinical use is BCG, a vaccine whose efficacy remains a matter of controversy. BCG generally induces a high level of acquired resistance in animal models of tuberculosis, but several human trials in developing countries have failed to demonstrate significant protection. Notably, BCG is not approved by the FDA for use in the United States because BCG vaccination impairs the specificity of the Tuberculin skin test for diagnosis of TB infection.
This makes the development of a new and improved vaccine against tuberculosis an urgent matter which has been given a very high priority by the WHO. Many attempts have been made to define the protective Mycobacterial substances and a series of experiments were conducted to compare the protective efficacy of vaccination with live versus killed preparations of M. tuberculosis (Orme IM. Infect. Immun. 1988; 56:3310-12).
The conclusion of these studies was that vaccination of mice with dead M. tuberculosis administered without adjuvants only induced short term protection against TB, whereas live M. tuberculosis vaccines induced efficient immunological memory. This information was the background for the further search for protective substances focused on antigens actively secreted from the live Mycobacteria (Andersen P. Infect. Immun. 1994; 62:2536-44, Horwitz et al. Proc. Natl Acad. Sci. USA 1995; 92:1530-4, Pal PG et al. Infect. Immun. 1992; 60:4781-92).
DETAILED DISCLOSURE OF THE INVENTION
The present inventors conducted a study comparing the long term protection against TB after vaccination three times with killed M. tuberculosis administered with DDA as an adjuvant with the long term protection obtained with ST-CF. Surprisingly, levels of long term protection in the group receiving killed bacteria were similar to those in the group vaccinated with ST-CF/DDA (figure 1).
This leads to the conclusion that protective components can be found also among the components of the cell wall, cell membrane or cytosol derived from a preparation of dead virulent Mycobacteria.
It is thus an object of the present invention to provide a composition for the generation or determination of an immune response against a virulent Mycobacterium such as a vaccine for immunising a mammal, including a human being, against disease caused by a virulent Mycobacterium and a diagnostic reagent for the diagnosis of an infection with a virulent Mycobacterium.
By the terms "somatic protein" or "protein derived from the cell wall, the cell membrane or the cytosol", or by the abbreviation "SPE" is understood a polypeptide or a protein extract obtainable from a cell or a part thereof. A preferred method to obtain a somatic protein is described in the examples, especially examples 2, 3, 4, and 5.
By the term "virulent Mycobacterium" is understood a bacterium capable of causing the tuberculosis disease in a mammal including a human being. Examples of virulent Mycobacteria are M. tuberculosis, M. africanum, and M. bovis.
By "a TB patient" is understood an individual with culture or microscopically proven infection with virulent Mycobacteria, and/or an individual clinically diagnosed with TB and who is responsive to anti-TB chemotherapy. Culture, microscopy and clinical diagnosis of TB are well known to the person skilled in the art.
A significant decrease or increase is defined as a decrease or increase which is significant at the 95% level by comparison of immunised and placebo-treated groups using an appropriate statistical analysis such as a Student's two-tailed t-test.
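The significance criterion above can be illustrated with a minimal sketch. It assumes two independent groups with equal variances (the pooled-variance form of Student's test) and uses made-up readings; the function names and the numerical integration of the t density are illustrative choices, not part of the disclosure.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t, df, steps=2000):
    """Two-tailed p-value: 1 minus twice the area under the density on [0, |t|].

    The area is computed with Simpson's rule; steps must be even.
    """
    a = abs(t)
    if a == 0:
        return 1.0
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return max(0.0, 1 - 2 * area)


def student_t_test(group_a, group_b):
    """Pooled-variance (Student's) t statistic and two-tailed p-value."""
    n1, n2 = len(group_a), len(group_b)
    df = n1 + n2 - 2
    pooled = ((n1 - 1) * variance(group_a) + (n2 - 1) * variance(group_b)) / df
    t = (mean(group_a) - mean(group_b)) / math.sqrt(pooled * (1 / n1 + 1 / n2))
    return t, two_tailed_p(t, df)


def is_significant(group_a, group_b, alpha=0.05):
    """True if the difference is significant at the (1 - alpha) level."""
    _, p = student_t_test(group_a, group_b)
    return p < alpha


# Hypothetical readings for an immunised and a placebo-treated group.
immunised = [4.1, 4.3, 3.9, 4.0, 4.2]
placebo = [5.6, 5.8, 5.5, 5.9, 5.7]
print(is_significant(immunised, placebo))
```

The sketch does not handle groups with zero variance, and `math.gamma` overflows for very large group sizes; a statistics package would normally be used in practice.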
By the term "PPD positive individual" is understood an individual with a positive Mantoux test or an individual where PPD induces an increase in in vitro recall response determined by release of IFN-γ of at least 1,000 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) or whole blood, the induction being performed by the addition of 2.5 to 5 µg PPD/ml to a suspension comprising about 1.0 to 2.5 x 10^5 PBMC, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 5 days after the addition of PPD to the suspension compared to the release of IFN-γ without the addition of PPD.
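The definition of a PPD positive individual can be read as a simple decision rule. The sketch below assumes that the "increase" is the difference between IFN-γ release with and without PPD, both measured in pg/ml; the function and parameter names are hypothetical.

```python
def is_ppd_positive(mantoux_positive, ifn_with_ppd=None, ifn_without_ppd=None,
                    threshold_pg_ml=1000.0):
    """Classify an individual as PPD positive.

    Positive if the Mantoux test is positive, or if PPD raises IFN-gamma
    release by at least threshold_pg_ml over the unstimulated control.
    Interpreting the increase as a plain difference is an assumption.
    """
    if mantoux_positive:
        return True
    if ifn_with_ppd is None or ifn_without_ppd is None:
        return False  # no in vitro measurement available
    return (ifn_with_ppd - ifn_without_ppd) >= threshold_pg_ml


# Hypothetical measurements in pg/ml.
print(is_ppd_positive(False, ifn_with_ppd=1450.0, ifn_without_ppd=200.0))
print(is_ppd_positive(False, ifn_with_ppd=900.0, ifn_without_ppd=200.0))
```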
By the term "delayed type hypersensitivity reaction" is understood a T-cell mediated inflammatory response elicited after the injection of a polypeptide into or application to the skin, said inflammatory response appearing 72-96 hours after the polypeptide injection or application.
By the term "IFN-γ" is understood interferon-gamma.
Throughout this specification, unless the context requires otherwise, the word "comprise", or variations thereof such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element or integer or group of elements or integers but not the exclusion of any other element or integer or group of elements or integers.
By the term "a polypeptide" in the present application is generally understood a polypeptide of the invention, as will be described later. It is also within the meaning of "a polypeptide" that several polypeptides can be used, i.e. in the present context "a" means "at least one" unless explicitly indicated otherwise. The term "polypeptide" is used to refer to short peptides with a length of at least two amino acid residues and at most 10 amino acid residues, oligopeptides (11-100 amino acid residues), and longer peptides (the usual interpretation of "polypeptide", i.e. more than 100 amino acid residues in length) as well as proteins (the functional entity comprising at least one peptide, oligopeptide, or polypeptide which may be chemically modified by being phosphorylated, glycosylated, by being lipidated, or by comprising prosthetic groups). The definition of polypeptides comprises native forms of peptides/proteins in Mycobacteria as well as recombinant proteins or peptides in any type of expression vectors transforming any kind of host, and also chemically synthesised polypeptides. Within the scope of the invention is a polypeptide which is at least 6 amino acids long, preferably 7, such as 8, 9, 10, 11, 12, 13, 14 amino acids long, preferably at least 15 amino acids, such as 15, 16, 17, 18, 19, 20 amino acids long. However, also longer polypeptides having a length of e.g. 25, 50, 75, 100, 125, 150, 175 or 200 amino acids are within the scope of the present invention.
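The length ranges in this definition can be summarised in a short sketch. The function names and the one-letter example sequences are illustrative assumptions; the boundaries (2-10, 11-100, more than 100 residues, and the minimum of 6 residues for the scope of the invention) are taken from the paragraph above.

```python
def classify_by_length(n_residues):
    """Map a residue count to the length class named in the definition."""
    if n_residues < 2:
        raise ValueError("a peptide has at least two amino acid residues")
    if n_residues <= 10:
        return "short peptide"
    if n_residues <= 100:
        return "oligopeptide"
    return "polypeptide"


def within_scope(sequence, min_length=6):
    """True if the sequence meets the minimum length of 6 amino acids."""
    return len(sequence) >= min_length


# Hypothetical one-letter sequences, for illustration only.
for seq in ["AVLG", "AVLGEAVLG", "AVLGE" * 5]:
    print(len(seq), classify_by_length(len(seq)), within_scope(seq))
```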
By the term "delayed type hypersensitivity reaction" is understood a T-cell mediated inflammatory response elicited after the injection of a polypeptide into or application to the skin, said inflammatory response appearing 72-96 hours after the polypeptide injection or application.
By the term "IFN-y" is understood interferon-gamma.
Throughout this specification, unless the context requires otherwise, the word "comprise", or variations thereof such as "comprises" or "comprising", will be understood to imply the inclusion of a stated element or integer or group of elements or integers but not the exclusion of any other element or integer or group of elements or integers.
By the term "a polypeptide" in the present application is generally understood a polypeptide of the invention, as will be described later. It is also within the meaning of "a polypeptide" that several polypeptides can be used, i.e. in the present context "a" means "at least one" unless explicitly indicated otherwise. The term "polypeptide" is used to refer to short peptides with a length of at least two amino acid residues and at most 10 amino acid residues, oligopeptides (11-100 amino acid residues), and longer peptides (the usual interpretation of "polypeptide", i.e. more than 100 amino acid residues in length) as well as proteins (the functional entity comprising at least one peptide, oligopeptide, or polypeptide which may be chemically modified by being phosphorylated, glycosylated, by being lipidated, or by comprising prosthetic groups). The definition of polypeptides comprises native forms of peptides/proteins in Mycobacteria as well as recombinant proteins or peptides in any type of expression vectors transforming any kind of host, and also chemically synthesised polypeptides. Within the scope of the invention is a polypeptide which is at least 6 amino acids long, preferably 7, such as 8, 9, 10, 11, 12, 13, 14 amino acids long, preferably at least 15 amino acids, such as 15, 16, 17, 18, 19, 20 amino acids long. However, also longer polypeptides having a length of e.g. 25, 50, 75, 100, 125, 150, 175 or 200 amino acids are within the scope of the present invention.
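The length classes named in the definition above can be illustrated with a minimal Python sketch. The function name and the returned labels are hypothetical and chosen only for illustration; the residue boundaries (2-10, 11-100, more than 100) and the minimum length of 6 are taken from the definition.

```python
# Illustrative sketch only: the function names and labels are assumptions,
# the numeric boundaries come from the definition of "polypeptide" above.

MIN_LENGTH_IN_SCOPE = 6  # "at least 6 amino acids long"


def classify_by_length(n_residues: int) -> str:
    """Return the length class for a chain of n_residues amino acids."""
    if n_residues < 2:
        raise ValueError("a peptide needs at least two amino acid residues")
    if n_residues <= 10:
        return "short peptide"   # 2-10 residues
    if n_residues <= 100:
        return "oligopeptide"    # 11-100 residues
    return "polypeptide"         # more than 100 residues


def is_within_stated_length(n_residues: int) -> bool:
    """True if the chain meets the stated minimum length of 6 residues."""
    return n_residues >= MIN_LENGTH_IN_SCOPE


if __name__ == "__main__":
    for n in (6, 15, 200):
        print(n, classify_by_length(n), is_within_stated_length(n))
```

Note that all three classes fall under the term "polypeptide" as used in this specification; the sketch only separates them by the residue counts given.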
In the present context the term "purified polypeptide" means a polypeptide preparation which contains at most 5% by weight of other polypeptide material with which it is natively associated (lower percentages of other polypeptide material are preferred, e.g. at most 4%, at most 3%, at most 2%, at most 1%, and at most ½%). It is preferred that the substantially pure polypeptide is at least 96% pure, i.e. that the polypeptide constitutes at least 96% by weight of total polypeptide material present in the preparation, and higher percentages are preferred, such as at least 97%, at least 98%, at least 99%, at least 99.25%, at least 99.5%, and at least 99.75%. It is especially preferred that the polypeptide is in "essentially pure form", i.e. that the polypeptide is essentially free of any other antigen with which it is natively associated, i.e. free of any other antigen from bacteria belonging to the tuberculosis complex. This can be accomplished by preparing the polypeptide by means of recombinant methods in a non-mycobacterial host cell as will be described in detail below, or by synthesising the polypeptide by the well-known methods of solid or liquid phase peptide synthesis, e.g. by the method described by Merrifield or variations thereof.
By the term "non-naturally occurring polypeptide" is understood a polypeptide that does not occur naturally. This means that the polypeptide is substantially pure, and/or that the polypeptide has been synthesised in the laboratory, and/or that the polypeptide has been produced by means of recombinant technology.
By the terms "analogue" and "subsequence" when used in connection with polypeptides is meant any polypeptide having the same immunological characteristics as the polypeptides of the invention described above with respect to the ability to confer increased resistance to infection with virulent Mycobacteria. Thus, included is also a polypeptide from a different source, such as from another bacterium or even from a eukaryotic cell.
The term "sequence identity" indicates a quantitative measure of the degree of homology between two amino acid sequences of equal length or between two nucleotide sequences of equal length. If the two sequences to be compared are not of equal length, they must be aligned to best possible fit. The sequence identity can be calculated as $\frac{(N_{ref} - N_{dif}) \cdot 100}{N_{ref}}$, wherein $N_{dif}$ is the total number of non-identical residues in the two sequences when aligned and wherein $N_{ref}$ is the number of residues in one of the sequences. Hence, the DNA sequence AGTCAGTC will have a sequence identity of 75% with the sequence AATCAATC ($N_{dif}=2$ and $N_{ref}=8$). A gap is counted as non-identity of the specific residue(s), i.e. the DNA sequence AGTGTC will have a sequence identity of 75% with the DNA sequence AGTCAGTC ($N_{dif}=2$ and $N_{ref}=8$). Sequence identity can alternatively be calculated by the BLAST program e.g. the BLASTP program or the BLASTN program
(Pearson W.R. and D.J. Lipman (1988) PNAS USA 85:2444-2448) (www.ncbi.nlm.nih.gov/BLAST). In one aspect of the invention, alignment is performed with the global align algorithm with default parameters as described by X. Huang and W. Miller, Adv. Appl. Math. (1991) 12:337-357, available at http://www.ch.embnet.org/software/LALIGN_form.html.
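The sequence identity calculation defined above, (N_ref - N_dif) x 100 / N_ref, can be sketched in Python as follows. This is a minimal illustration that assumes the two sequences have already been aligned (by any alignment tool) and that gaps are written as "-"; the function name and the choice of the first argument as the reference sequence are assumptions made for the example, not part of the specification.

```python
# Minimal sketch of the identity formula (N_ref - N_dif) * 100 / N_ref.
# Assumptions: inputs are pre-aligned strings of equal length, gaps are "-",
# and the first argument is treated as the reference sequence.

def sequence_identity(aligned_ref: str, aligned_other: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences."""
    if len(aligned_ref) != len(aligned_other):
        raise ValueError("aligned sequences must have equal length")
    # N_ref: number of residues (non-gap characters) in the reference sequence
    n_ref = len(aligned_ref.replace(gap, ""))
    if n_ref == 0:
        raise ValueError("reference sequence contains no residues")
    # N_dif: aligned positions that are not identical; a gap opposite a
    # residue counts as a non-identity
    n_dif = sum(1 for a, b in zip(aligned_ref, aligned_other) if a != b)
    return (n_ref - n_dif) * 100 / n_ref


if __name__ == "__main__":
    # Substitution example from the text: N_dif = 2, N_ref = 8
    print(sequence_identity("AGTCAGTC", "AATCAATC"))
    # Gap example from the text: AGTGTC aligned against AGTCAGTC
    print(sequence_identity("AGTCAGTC", "AGT--GTC"))
```

Both calls reproduce the 75% figure given in the two worked examples. The gapped form "AGT--GTC" is one possible best-fit alignment of AGTGTC against AGTCAGTC and is supplied by hand here.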
When the term nucleotide is used in the following, it should be understood in the broadest sense. That is, most often the nucleotide should be considered as DNA. However, when DNA can be substituted with RNA, the term nucleotide should be read to include RNA embodiments which will be apparent to the person skilled in the art. For the purposes of hybridisation, PNA or LNA may be used instead of DNA. PNA has been shown to exhibit a very dynamic hybridisation profile and is described in Nielsen P E et al., 1991, Science 254: 1497-1500. LNA (Locked Nucleic Acids) is a recently introduced oligonucleotide analogue containing bicyclo nucleoside monomers (Koshkin et al., 1998, 54, 3630; Nielsen, N.K. et al. J.Am.Chem.Soc 1998, 120, 5458-5463).
It is surprisingly demonstrated herein that the SPE comprising polypeptides isolated from the cell wall, cell membrane and cytosol induces protective immunity against infection with M. tuberculosis in an animal model, when injected with an adjuvant. It is contemplated that these polypeptides, either alone or in combination, can be used as vaccine components.
It is further demonstrated that several polypeptides isolated from the cell wall, cell membrane or cytosol are recognised by human tuberculosis antisera. Therefore it is considered likely that these polypeptides, either alone or in combination, can be useful as diagnostic reagents in the diagnosis of tuberculosis.
One embodiment of the invention relates to a method for producing a polypeptide in an immunological composition comprising the steps of:
a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a);
c) resuspending the pellet of b) in PBS;
d) centrifugating the suspension of c);
e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane from the supernatant of d) with SDS;
f) centrifugating the extract of e);
g) precipitating the supernatant of f) in cold acetone;
h) resuspending the precipitate of g) in PBS;
i) applying the resuspension of h) to 2 dimensional electrophoresis;
j) blotting the gel of i) to a PVDF membrane;
k) subjecting the spots on j) to N-terminal sequencing;
l) searching a database for homology with the sequence of k) to identify the nucleotide sequence;
m) cloning the nucleotide sequence of l) into an expression system;
n) isolating and purifying the polypeptide expressed in m); and
o) formulating the polypeptide of n) with an adjuvant substance in an immunological composition.
Another embodiment is a method of producing a polypeptide originating from the cell wall in an immunological composition, said method comprising the steps of:
a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a);
c) resuspending the pellet of b) in PBS supplemented with EDTA and phenylmethylsulfonyl fluoride and sonicating for 15 min;
d) lysing the suspension of c);
e) centrifugating the lysed suspension of d);
f) resuspending the pellet of e) in homogenising buffer;
g) incubating the suspension of f) with RNase and DNase overnight;
h) incubating the suspension of g) with SDS;
i) centrifugating the incubated suspension of h);
j) incubating the supernatant of i) with SDS;
k) precipitating the incubated supernatant of j) with acetone;
l) resuspending the precipitate of k) in PBS;
m) subjecting the suspension of l) to a Triton X-114 extraction;
n) applying the resuspension of m) to 2 dimensional electrophoresis;
o) blotting the gel of n) to a PVDF membrane;
p) subjecting the spots on o) to N-terminal sequencing;
q) searching a database for homology with the sequence of p) to identify the nucleotide sequence;
r) cloning the nucleotide sequence of q) into an expression system;
s) isolating and purifying the polypeptide expressed in r); and
t) formulating the polypeptide of s) with an adjuvant substance in an immunological composition.
A third embodiment is a method of producing a polypeptide originating from the cell membrane in an immunological composition, said method comprising the steps of:
a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a);
c) resuspending the pellet of b) in PBS supplemented with EDTA and phenylmethylsulfonyl fluoride and sonicating for 15 min;
d) lysing the suspension of c);
e) centrifugating the lysed suspension of d);
f) ultracentrifugating the supernatant of e);
g) resuspending the pellet of f) in PBS;
h) subjecting the suspension of g) to a Triton X-114 extraction;
i) applying the resuspension of h) to 2 dimensional electrophoresis;
j) blotting the gel of i) to a PVDF membrane;
k) subjecting the spots on j) to N-terminal sequencing;
l) searching a database for homology with the sequence of k) to identify the nucleotide sequence;
m) cloning the nucleotide sequence of l) into an expression system;
n) isolating and purifying the polypeptide expressed in m); and
o) formulating the polypeptide of n) with an adjuvant substance in an immunological composition.
A fourth embodiment is a method of producing a polypeptide originating from the cytosol in an immunological composition comprising the steps of:
a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a);
c) resuspending the pellet of b) in PBS supplemented with EDTA and phenylmethylsulfonyl fluoride and sonicating for 15 min;
d) lysing the suspension of c);
e) centrifugating the lysed suspension of d);
f) ultracentrifugating the supernatant of e);
g) precipitating the supernatant of f) with acetone;
h) resuspending the precipitate of g) in PBS;
i) applying the resuspension of h) to 2 dimensional electrophoresis;
j) blotting the gel of i) to a PVDF membrane;
k) subjecting the spots on j) to N-terminal sequencing;
l) searching a database for homology with the sequence of k) to identify the nucleotide sequence;
m) cloning the nucleotide sequence of l) into an expression system;
n) isolating and purifying the polypeptide expressed in m); and
o) formulating the polypeptide of n) with an adjuvant substance in an immunological composition.
In particular, the invention relates to a polypeptide obtainable by a method as described above which polypeptide has at least one of the following properties:
i) it induces an in vitro recall response determined by a release of IFN-γ of at least 1,500 pg/ml from reactivated memory T-lymphocytes withdrawn from a C57BL/6J mouse within 4 days after the mouse has been rechallenged with 1 × 10⁶ virulent Mycobacteria, the induction being performed by the addition of the polypeptide to a suspension comprising about 2 × 10⁵ cells isolated from the spleen of said mouse, the addition of the polypeptide resulting in a concentration of the polypeptide of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 3 days after the addition of the polypeptide to the suspension,
ii) it induces an in vitro response during primary infection with virulent Mycobacteria, determined by release of IFN-γ of at least 1,500 pg/ml from T-lymphocytes withdrawn from a mouse within 28 days after the mouse has been infected with 5 × 10⁴ virulent Mycobacteria, the induction being performed by the addition of the polypeptide to a suspension comprising about 2 × 10⁵ cells isolated from the spleen, the addition of the polypeptide resulting in a concentration of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 3 days after the addition of the polypeptide to the suspension,
iii) it induces a protective immunity determined by vaccinating an animal model with the polypeptide and an adjuvant a total of three times at two-week intervals starting at 6-8 weeks of age, challenging 6 weeks after the last vaccination with 5 × 10⁶ virulent Mycobacteria/ml by aerosol, and determining a significant decrease in the number of bacteria recoverable from the lung 6 weeks after the animal has been challenged, compared to the number recovered from the same organ in a mammal given placebo treatment,
iv) it induces an in vitro recall response determined by release of IFN-γ of at least 1,000 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) or whole blood withdrawn from TB patients 0-6 months after diagnosis, or from a PPD positive individual, the induction being performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 × 10⁵ PBMC or whole blood cells, the addition of the polypeptide resulting in a concentration of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 5 days after the addition of the polypeptide to the suspension,
v) it induces a specific antibody response in a TB patient as determined by an ELISA technique or a western blot when the whole blood is diluted 1:20 in PBS and stimulated with the polypeptide in a concentration of at the most 20 µg/ml and induces an OD of at least 0.1 in ELISA, or a visual response in western blot,
vi) it induces a positive in vitro response determined by release of IFN-γ of at least 500 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an individual who is clinically or subclinically infected with a virulent Mycobacterium, the induction being performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 × 10⁵ PBMC, the addition of the polypeptide resulting in a concentration of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 5 days after the addition of the polypeptide to the suspension, and preferably does not induce such an IFN-γ release in an individual not infected with a virulent Mycobacterium,
vii) it induces a positive in vitro response determined by release of IFN-γ of at least 500 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an individual clinically or subclinically infected with a virulent Mycobacterium, the induction being performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 × 10⁵ PBMC, the addition of the polypeptide resulting in a concentration of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 5 days after the addition of the polypeptide to the suspension, and preferably does not induce such an IFN-γ release in an individual not infected with a virulent Mycobacterium,
viii) it induces a positive DTH response determined by intradermal injection or local
application patch of at most 100 µg of the polypeptide to an individual who is clinically or subclinically infected with a virulent Mycobacterium, a positive response having a diameter of at least 10 mm 72-96 hours after the injection or application,
ix) it induces a positive DTH response determined by intradermal injection or local application patch of at most 100 µg of the polypeptide to an individual who is clinically or subclinically infected with a virulent Mycobacterium, a positive response having a diameter of at least 10 mm 72-96 hours after the injection, and preferably does not induce such a response in an individual who has a cleared infection with a virulent Mycobacterium.
Any polypeptide fulfilling one or more of the above properties and which is obtainable from either the cell wall, cell membrane or the cytosol is within the scope of the present invention.
The property described in i) will also be satisfied if the release of IFN-γ from reactivated memory T-lymphocytes is 2,000 pg/ml, such as 3,000 pg/ml. In an alternative embodiment of the invention, the immunological effect of the polypeptide could be determined by comparing the IFN-γ release as described with the IFN-γ release from a similar assay, wherein the polypeptide is not added, a significant increase being indicative of an immunologically effective polypeptide. In a preferred embodiment of the invention, the addition of the polypeptide results in a concentration of not more than 20 µg per ml suspension, such as 15 µg, 10 µg, 5 µg, 3 µg, 2 µg, or 1 µg polypeptide per ml suspension.
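The numeric criteria of property i) can be expressed as a simple check. The following Python sketch is illustrative only: the function and constant names are hypothetical, and it covers just the two stated thresholds (an IFN-γ release of at least 1,500 pg/ml at a polypeptide concentration of not more than 20 µg per ml), not the assay conditions themselves.

```python
# Illustrative sketch only: names are assumptions; the two thresholds are
# the ones stated for property i).

IFN_GAMMA_MIN_PG_PER_ML = 1500.0    # minimum IFN-gamma release, pg/ml
MAX_POLYPEPTIDE_UG_PER_ML = 20.0    # maximum polypeptide concentration, ug/ml


def meets_property_i(ifn_gamma_pg_per_ml: float, polypeptide_ug_per_ml: float) -> bool:
    """True if a measurement meets the numeric thresholds of property i)."""
    concentration_ok = 0 < polypeptide_ug_per_ml <= MAX_POLYPEPTIDE_UG_PER_ML
    release_ok = ifn_gamma_pg_per_ml >= IFN_GAMMA_MIN_PG_PER_ML
    return concentration_ok and release_ok


if __name__ == "__main__":
    print(meets_property_i(2000.0, 10.0))   # release above threshold, dose in range
    print(meets_property_i(1200.0, 10.0))   # release below threshold
    print(meets_property_i(3000.0, 25.0))   # dose above the 20 ug/ml limit
```

A release of 2,000 or 3,000 pg/ml, as mentioned above, passes the same check because the criterion is a lower bound.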
The property mentions as an example the mouse strain C57BL/6J as the animal model. As will be known by a person skilled in the art, due to genetic variation, different strains may react with immune responses of varying strength to the same polypeptide. It is presently unknown which strains of mice will give the best predictability of immunogenic reactivity in which human population. Therefore, it is important to test other mouse strains, such as C3H/HeN, CBA (preferably CBA/J), DBA (preferably DBA/2J), A/J, AKR/N, DBA/1J, FVB/N, SJL/N, 129/SvJ, C3H/HeJ-Lps or BALB mice (preferably BALB/cA, BALB/cJ).
It is presently contemplated that also a similar test performed in another animal model such as a guinea pig or a rat will have clinical predictability. In order to obtain good clinical predictability to humans, it is contemplated that any farm animal, such as a cow, pig, or deer, or any primate will have clinical predictability and thus serve as an animal model.
It should be noted, moreover, that tuberculosis disease also affects a number of different animal species such as cows, primates, guinea pigs, badgers, possums, and deer. A polypeptide which has proven effective in any of the models mentioned above may be of interest for animal treatment even if it is not effective in a human being.
It is proposed to measure the release of IFN-γ from reactivated T lymphocytes withdrawn from a C57BL/6J mouse within 4 days after the mouse has been rechallenged with virulent Mycobacteria. This is due to the fact that when an immune host mounts a protective immune response, the specific T-cells responsible for the early recognition of the infected macrophage stimulate a powerful bactericidal activity through their production of IFN-γ (Rook, G.A.W. (1990) Res. Microbiol. 141:253-256; Flesch, I. and S.H.E. Kaufmann (1987) J. Immunol. 138(12):4408-13). However, other cytokines could be relevant when monitoring the immunological response to the polypeptide, such as IL-12, TNF-α, IL-4, IL-5, IL-10, IL-6, TGF-β. Usually one or more cytokines will be measured utilising for example the PCR technique or ELISA. It will be appreciated by the person skilled in the art that a significant increase or decrease in the amount of any of these cytokines induced by a specific polypeptide can be used in evaluation of the immunological efficacy of the polypeptide. The ability of a polypeptide to induce an IFN-γ response is presently believed to be the most relevant correlate of protective immunity as mice with a disruption of the gene coding for IFN-γ are unable to control a mycobacterial infection and die very rapidly with widespread dissemination, caseous necrosis and large abscesses (Flynn et al (1993) J.Exp.Med 178: 2249-2254, Cooper et al (1993) J.Exp.Med. 178:2243-2248). A specific model for obtaining information regarding the antigenic targets of a protective
12 immunity in the memory model was originally developed by Lefford (Lefford et al (1973) Immunology 25:703) and has been used extensively in the recent years (Orme et al (1988). infect.lmmun. 140:3589, P.Andersen and I. Heron (1993) J.Immunol.154:3359).
The property described in ii) will also be satisfied if the release of IFN-γ from T-lymphocytes withdrawn during primary infection is 2,000 pg/ml, such as 3,000 pg/ml. The comments on property i) regarding a significant increase in IFN-γ, concentration of polypeptide, animal model, and other cytokines are equally relevant to property ii), and vice versa.
The property described in iii) will also be satisfied if the protective immunity is determined by challenging the mouse more than 6 weeks after the last vaccination challenge, such as 7 weeks, preferably 8 weeks, 9 weeks, 10 weeks, 11 weeks, 12 weeks or 15 weeks. In one embodiment of the invention the bacteria are recovered from the spleen more than 6 weeks after the last vaccination challenge, such as 7 weeks, preferably 8 weeks, 9 weeks, 10 weeks, 11 weeks, 12 weeks or 15 weeks. In another embodiment of the invention, the last vaccination challenge is given subcutaneously with 5x10^4 virulent Mycobacteria. As will be known by the person skilled in the art, the number of viable bacteria in the lung is presently considered to be relevant to the degree of bacterial infection of the animal. An equally important measure is the determination of the number of viable bacteria in the spleen, lymph node, or blood.
The amount of polypeptide and adjuvant used for vaccinating will depend on the animal model used, e.g. the mouse strain. When a mouse model is used it is preferred that the amount of polypeptide used for vaccinating the mouse is between 2 and 20 µg, such as between 5 and 15 µg, preferably 10 µg. For larger animals such as guinea pigs, deer, cows, primates, badgers, and possums, higher doses such as 5 to 50 µg of a single polypeptide are preferred.
The comments on property i) regarding concentration of polypeptide and animal model are equally relevant to property iii), and vice versa.
In another aspect of property iii), the mice, or other animal model, are given the standard lethal dose of virulent Mycobacteria. The standard lethal dose varies from around 3x10^5 to around 5x10^6 virulent Mycobacteria depending on the specific strain of virulent
Mycobacteria and strain of mice. The mortality in the mice is then monitored and compared to a placebo-vaccinated control group. A significant decrease in mortality, measured as the mean survival time, will be indicative of an immunologically effective polypeptide. A very recent paper shows that there is good correlation between mortality of the individual animals and the bacterial counts in the same animals (S. Baldwin (1998) Infect. Immun. 66:2951-2959).
The property described in iv) will also be satisfied if the release of IFN-γ from PBMC is determined in PBMC withdrawn from TB patients or PPD-positive individuals more than 6 months after diagnosis, such as 9 months, 1 year, 2 years, 5 years, or 10 years after diagnosis.
The comments on property i) regarding significant increase in IFN-γ, concentration of polypeptide, and other cytokines are equally relevant to property iv).
The property described in v) will in particular be satisfied if the ELISA is performed as follows: the polypeptide of interest at a concentration of 1 to 10 µg/ml is coated on a 96-well polystyrene plate (NUNC, Denmark), and after a washing step with phosphate buffer, pH 7.3, containing 0.37 M NaCl and 0.5% Tween-20, the serum or plasma from a TB patient is applied in dilutions from 1:10 to 1:1000 in PBS with 1% Tween-20. Binding of an antibody to the polypeptide is determined by addition of a labeled (e.g. peroxidase-labeled) secondary antibody, and the reaction is thereafter visualized by the use of OPD and H2O2 as described by the manufacturer (DAKO, Denmark). The OD value in each well is determined using an appropriate ELISA reader.
In a preferred embodiment the western blot is performed as follows: The polypeptide is applied in amounts from 1 to 40 µg to an SDS-PAGE gel, and after electrophoresis the polypeptide is transferred to a membrane, e.g. nitrocellulose or PVDF. The membrane is thereafter washed in phosphate buffer, pH 7.3, containing 0.37 M NaCl and 0.5% Tween-20 for 30 min. The sera obtained from one or more TB patients are diluted 1:10 to 1:1000 in phosphate buffer, pH 7.3, containing 0.37 M NaCl. The membrane is thereafter washed four times for five minutes in binding buffer and incubated with peroxidase- or phosphatase-labeled secondary antibody. The reaction is then visualized using the staining method recommended by the manufacturer (DAKO, Denmark).
The property described in vi) will in particular be satisfied if the polypeptide does not induce such an IFN-γ release in an individual not infected with a virulent Mycobacterium, i.e. an individual who has been BCG vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis Mycobacterium (NTM). The comments on property i) regarding significant increase in IFN-γ, concentration of polypeptide, and other cytokines are equally relevant to property vi).
The property described in vii) will in particular be satisfied if the polypeptide does not induce such an IFN-γ release in an individual cleared of an infection with a virulent Mycobacterium, i.e. an individual who does not have any positive culture or microscopically or clinically proven ongoing infection with virulent Mycobacterium. The comments on property i) regarding significant increase in IFN-γ, concentration of polypeptide, and other cytokines are equally relevant to property vii).
The property described in viii) will in particular be satisfied if the polypeptide does not induce such a response in an individual not infected with a virulent Mycobacterium, i.e. an individual who has been BCG vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis Mycobacterium. In a preferred embodiment the amount of polypeptide intradermally injected or applied is 90 µg, such as 80 µg, 70 µg, 60 µg, 50 µg, 40 µg, or 30 µg. In another embodiment of the invention, the diameter of the positive response is at least 11 mm, such as 12 mm, 13 mm, 14 mm, or 15 mm. In a preferred embodiment the induration or erythema, or both, could be determined after administration of the polypeptide by intradermal injection, patch test or multipuncture. The reaction diameter could be positive after more than 48 hours, such as 72 or 96 hours.
The property described in ix) will in particular be satisfied if the polypeptide does not induce such a response in an individual cleared of an infection with a virulent Mycobacterium, i.e. an individual who does not have any positive culture or microscopically proven ongoing infection with virulent Mycobacterium. The comments on property viii) regarding the amount of polypeptide intradermally injected or applied and the diameter of the positive response are equally relevant to property ix).
Preferred embodiments of the invention are the specific polypeptides which have been identified and analogues and subsequences thereof. It has been noted that none of the identified polypeptides in the examples includes a signal sequence.
Until the present invention was made, it was unknown that the polypeptides with the amino acid sequences disclosed in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 are expressed in live virulent Mycobacterium. These polypeptides in purified form, or non-naturally occurring, i.e. recombinantly or synthetically produced, are considered part of the invention. It is understood that a polypeptide which has any of the properties i) - ix) and has a sequence identity of at least 80% with any of the amino acid sequences shown in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 or has a sequence identity of at least 80% to any subsequence thereof is considered part of the invention. In a preferred embodiment the sequence identity is at least 80%, such as 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 99.5%. Furthermore, any T cell epitope of the polypeptides disclosed in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 is
considered part of the invention. Also, any B-cell epitope of the polypeptides disclosed in SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 is considered part of the invention.
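As an illustration of the 80% sequence identity threshold referred to above, the following minimal Python sketch computes the percent identity of two sequences that have already been aligned to equal length. The function names, the gap symbol "-", and the choice to divide by the number of residues in the reference are assumptions made for this example only; the peptides are invented and are not any SEQ ID NO of the invention.

```python
def percent_identity(reference, candidate, gap="-"):
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: identity = matching positions / residues in the reference
    (gap positions in the reference are excluded from the denominator).
    """
    if len(reference) != len(candidate):
        raise ValueError("sequences must be aligned to the same length")
    ref_len = sum(1 for r in reference if r != gap)
    if ref_len == 0:
        raise ValueError("reference contains no residues")
    matches = sum(
        1 for r, c in zip(reference, candidate) if r == c and r != gap
    )
    return 100.0 * matches / ref_len


def meets_identity_threshold(reference, candidate, threshold=80.0):
    """True if the candidate reaches the chosen identity threshold."""
    return percent_identity(reference, candidate) >= threshold


# Hypothetical 10-residue peptides differing at two positions.
ref = "MAVLGEAVLG"
cand = "MAVLGDAVIG"
print(percent_identity(ref, cand))
print(meets_identity_threshold(ref, cand))
```

In practice the alignment itself would be produced by a dedicated alignment program; this sketch only covers the counting step once an alignment is available.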
Although the minimum length of a T-cell epitope has been shown to be at least 6 amino acids, it is normal that such epitopes are constituted of longer stretches of amino acids.
Hence it is preferred that the polypeptide fragment of the invention has a length of at least 7 amino acid residues, such as at least 8, at least 9, at least 10, at least 12, at least 14, at least 16, at least 18, at least 20, at least 22, at least 24, or at least 30 amino acid residues.
In both immunodiagnostics and vaccine preparation, it is often possible and practical to prepare antigens from segments of a known immunogenic protein or polypeptide. Certain epitopic regions may be used to produce responses similar to those produced by the entire antigenic polypeptide. Potential antigenic or immunogenic regions may be identified by any of a number of approaches, e.g., Jameson-Wolf or Kyte-Doolittle antigenicity analyses or Hopp and Woods (Hopp and Woods, (1981), Proc Natl Acad Sci USA 78/6:3824-8) hydrophobicity analysis (see, e.g., Jameson and Wolf, (1988) Comput Appl Biosci, 4(1):181-6; Kyte and Doolittle, (1982) J Mol Biol, 157(1):105-32; or U.S. Patent No. 4,554,101). Hydrophobicity analysis assigns average hydrophilicity values to each amino acid residue; from these values average hydrophilicities can be calculated
and regions of greatest hydrophilicity determined. Using one or more of these methods, regions of predicted antigenicity may be derived from the amino acid sequence assigned to the polypeptides of the invention. Alternatively, in order to identify relevant T-cell epitopes which are recognised during an immune response, it is also possible to use a "brute force" method: Since T-cell epitopes are linear, deletion mutants of polypeptides will, if constructed systematically, reveal what regions of the polypeptide are essential in immune recognition, e.g. by subjecting these deletion mutants to the IFN-γ assay described herein. A presently preferred method utilises overlapping oligomers (preferably synthetic ones having a length of e.g. 20 amino acid residues) derived from the polypeptide. Some of these will give a positive response in the IFN-γ assay whereas others will not. A preferred T-cell epitope is a T-helper cell epitope or a cytotoxic T-cell epitope.
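The averaging step of the hydrophilicity analysis mentioned above can be sketched in Python as a sliding-window mean over per-residue values. The values below are those commonly tabulated for the Hopp and Woods scale and should be checked against the cited reference before any real use; the window of six residues, the function names and the toy sequence are assumptions for illustration.

```python
# Per-residue hydrophilicity values as commonly tabulated for the
# Hopp and Woods scale (assumption: verify against the cited paper).
HOPP_WOODS = {
    "R": 3.0, "D": 3.0, "E": 3.0, "K": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "P": 0.0, "T": -0.4, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "I": -1.8, "L": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def window_hydrophilicity(sequence, window=6):
    """Return (start_index, mean hydrophilicity) for every window."""
    seq = sequence.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    values = [HOPP_WOODS[aa] for aa in seq]
    return [
        (i, sum(values[i:i + window]) / window)
        for i in range(len(seq) - window + 1)
    ]


def most_hydrophilic_region(sequence, window=6):
    """Return (start_index, segment, mean score) of the top-scoring window."""
    start, score = max(window_hydrophilicity(sequence, window),
                       key=lambda pair: pair[1])
    return start, sequence[start:start + window], score


# Invented toy sequence with a charged stretch flanked by apolar residues.
print(most_hydrophilic_region("LLVVAAKDEERKLLVV"))
```

A high-scoring window is only a prediction of a potentially antigenic region; it would still have to be confirmed experimentally, for example in the assays described herein.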
B-cell epitopes may be linear or spatial. The three-dimensional structure of a protein is often such that amino acids, which are located distant from each other in the one-dimensional structure, are located near to each other in the folded protein. Within the meaning of the present context, the expression epitope is intended to comprise the one- and three-dimensional structure as well as mimics thereof. The term is further intended to include discontinuous B-cell epitopes. The linear B-cell epitopes can be identified in a similar manner as described for the T-cell epitopes above. However, when identifying B-cell epitopes the assay should be an ELISA using overlapping oligomers derived from the polypeptide as the coating layer on a microtiter plate as described elsewhere.
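The overlapping oligomers used for mapping both T-cell and B-cell epitopes can be derived from a polypeptide sequence as sketched below. The peptide length of 20 residues follows the example given above; the step of 10 residues (i.e. a 10-residue overlap), the function name and the toy sequence are assumptions made for illustration.

```python
def overlapping_peptides(sequence, length=20, step=10):
    """Return (start_index, peptide) pairs tiling the whole sequence.

    Peptides start every `step` residues. A final peptide anchored at the
    C-terminus is added when needed so that every residue is covered.
    """
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    if len(sequence) <= length:
        return [(0, sequence)]
    last_start = len(sequence) - length
    starts = list(range(0, last_start + 1, step))
    if starts[-1] != last_start:
        starts.append(last_start)
    return [(s, sequence[s:s + length]) for s in starts]


# Invented 45-residue toy sequence, not a sequence of the invention.
toy = "ACDEFGHIKLMNPQRSTVWY" * 2 + "ACDEF"
for start, peptide in overlapping_peptides(toy):
    print(start, peptide)
```

Each peptide in the resulting panel would then be tested individually, in the IFN-γ assay for T-cell epitopes or as the coating layer in an ELISA for linear B-cell epitopes.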
A non-naturally occurring polypeptide, an analogue, a subsequence, a T-cell epitope and/or a B-cell epitope of any of the described polypeptides are defined as any non-naturally occurring polypeptide, analogue, subsequence, T-cell epitope and/or B-cell epitope of any of the polypeptides having any of the properties i)-ix).
Table 1 lists the antigens of the invention.
Table 1 The antigens of the invention by the names used herein as well as by reference to relevant SEQ ID NOs of N-terminal sequences, full amino acid sequences and sequences of nucleotides encoding the antigens

Antigen    N-terminal sequence    Nucleotide sequence    Amino acid sequence
           SEQ ID NO:             SEQ ID NO:             SEQ ID NO:
TB12.5     80                     74                     75
TB20.6     81                     76                     77
TB40.8     82                     78                     79

Each of the polypeptides may be characterised by specific amino acid and nucleic acid sequences. It will be understood that such sequences include analogues and variants produced by recombinant methods wherein such nucleic acid and polypeptide sequences have been modified by substitution, insertion, addition and/or deletion of one or more nucleotides in said nucleic acid sequences to cause the substitution, insertion, addition or deletion of one or more amino acid residues in the recombinant polypeptide. A preferred nucleotide sequence encoding a polypeptide of the invention is a nucleotide sequence which
1) is a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 and 78 or an analogue of said sequence which hybridises with any of the nucleotide sequences shown in SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78 or a nucleotide sequence complementary thereto, or a specific part thereof, preferably under stringent hybridisation conditions. By stringent conditions is understood, as defined in the art, 5-10°C under the melting point Tm, cf. Sambrook et al, 1989, pages 11.45-11.49, and/or
2) encodes a polypeptide, the amino acid sequence of which has an 80% sequence identity with an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 and/or
3) constitutes a subsequence of any of the above mentioned nucleotide sequences, and/or
4) constitutes a subsequence of any of the above mentioned polypeptide sequences.
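To make the stringency definition above concrete (5-10°C under the melting point Tm), the following Python sketch estimates Tm for a long DNA duplex and derives the corresponding temperature window. The formula used is one widely quoted empirical approximation; whether it is the exact expression given on the cited pages of Sambrook et al. is not confirmed here, and the function names and example values are assumptions.

```python
import math


def estimate_tm(gc_percent, length, na_molar=1.0, formamide_percent=0.0):
    """Estimate Tm (in °C) of a long DNA duplex.

    Assumption: the commonly quoted empirical approximation
    Tm = 81.5 + 16.6*log10[Na+] + 0.41*(%GC) - 0.63*(%formamide) - 600/N.
    """
    if length <= 0 or na_molar <= 0:
        raise ValueError("length and salt concentration must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length)


def stringent_window(tm, low_offset=5.0, high_offset=10.0):
    """Return (min, max) hybridisation temperatures 5-10 °C under Tm."""
    return tm - high_offset, tm - low_offset


# Hypothetical probe: 100 bp, 50% GC, 1 M Na+, no formamide.
tm = estimate_tm(gc_percent=50.0, length=100)
print(round(tm, 1), stringent_window(tm))
```

The estimate is sensitive to salt and formamide concentration, so the same probe can have quite different stringent temperature windows in different hybridisation buffers.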
The terms "analogue" or "subsequence" when used in connection with the nucleotide fragments of the invention are thus intended to indicate a nucleotide sequence which encodes a polypeptide exhibiting identical or substantially identical immunological properties to a polypeptide encoded by the nucleotide fragment of the invention shown in any of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78, allowing for minor variations which do not have an adverse effect on the ligand binding properties and/or biological function and/or immunogenicity as compared to any of the polypeptides of the invention or which give interesting and useful novel binding properties or biological functions and immunogenicities etc. of the analogue and/or subsequence. The analogous nucleotide fragment or nucleotide sequence may be derived from a bacterium, a mammal, or a human or may be partially or completely of synthetic origin. The analogue and/or subsequence may also be derived through the use of recombinant nucleotide techniques.
Furthermore, the terms "analogue" and "subsequence" are intended to allow for variations in the sequence such as substitution, insertion (including introns), addition, deletion and rearrangement of one or more nucleotides, which variations do not have any substantial effect on the polypeptide encoded by a nucleotide fragment or a subsequence thereof. The term "substitution" is intended to mean the replacement of one or more nucleotides in the full nucleotide sequence with one or more different nucleotides, "addition" is understood to mean the addition of one or more nucleotides at either end of the full nucleotide sequence, "insertion" is intended to mean the introduction of one or more nucleotides within the full nucleotide sequence, "deletion" is intended to indicate that one or more nucleotides have been deleted from the full nucleotide sequence whether at either end of the sequence or at any suitable point within it, and "re-arrangement" is intended to mean that two or more nucleotide residues have been exchanged with each other.
It is well known that the same amino acid may be encoded by various codons, the codon usage being related, inter alia, to the preference of the organisms in question expressing the nucleotide sequence. Thus, at least one nucleotide or codon of a nucleotide fragment of the invention may be exchanged for others which, when expressed, result in a polypeptide identical or substantially identical to the polypeptide encoded by the nucleotide fragment in question.
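The codon degeneracy described above can be illustrated with a short Python sketch that translates two different nucleotide sequences into the same peptide using the standard genetic code. The function names and both nucleotide sequences are invented for this example and do not correspond to any SEQ ID NO of the invention.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence (length divisible by 3)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonymous_codons(amino_acid):
    """All codons encoding the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


# Two hypothetical coding sequences that differ in every codon.
seq_a = "GCTGTTCTGGGTGAA"
seq_b = "GCCGTGTTAGGCGAG"
print(translate(seq_a), translate(seq_b))
print(synonymous_codons("L"))
```

Choosing among synonymous codons according to the preference of the expression host is the basis of codon optimisation, which leaves the encoded polypeptide unchanged.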
The term "subsequence" when used in connection with the nucleic acid fragments of the invention is intended to indicate a continuous stretch of at least 10 nucleotides which exhibits the above hybridization pattern. Normally this will require a minimum sequence identity of at least 70% with a subsequence of the hybridization partner having SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78. It is preferred that the nucleic acid fragment is longer than 10 nucleotides, such as at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, or at least 80 nucleotides long, and the sequence identity should preferably also be higher than 70%, such as at least 75%, at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, or at least 98%. It is most preferred that the sequence identity is 100%. Such fragments may be readily prepared by, for example, directly synthesizing the fragment by chemical means, by application of nucleic acid reproduction technology, such as the PCR technology of U.S. Patent 4,603,102, or by introducing selected sequences into recombinant vectors for recombinant production.
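The length and identity criteria for a nucleic acid subsequence given above can be screened computationally before any hybridisation experiment. The Python sketch below slides a fragment along a target without gaps and reports the best percent identity; it is only a rough proxy for hybridisation behaviour, and the function names, the ungapped comparison and the toy sequences are assumptions made for illustration.

```python
def best_ungapped_identity(fragment, target):
    """Slide `fragment` along `target` (no gaps allowed).

    Returns (best_offset, percent_identity) for the best-matching position.
    """
    frag, tgt = fragment.upper(), target.upper()
    n = len(frag)
    if n == 0 or n > len(tgt):
        raise ValueError("fragment must be non-empty and fit in the target")
    best_offset, best_matches = 0, -1
    for offset in range(len(tgt) - n + 1):
        window = tgt[offset:offset + n]
        matches = sum(1 for a, b in zip(frag, window) if a == b)
        if matches > best_matches:
            best_offset, best_matches = offset, matches
    return best_offset, 100.0 * best_matches / n


def qualifies_as_subsequence(fragment, target,
                             min_length=10, min_identity=70.0):
    """Apply the length (at least 10 nt) and identity (at least 70%) screen."""
    if len(fragment) < min_length:
        return False
    _, identity = best_ungapped_identity(fragment, target)
    return identity >= min_identity


# Hypothetical 21-nt target and a 12-nt fragment with one mismatch.
target = "ATGGCCGTTCTGGGTGAAACC"
fragment = "GCCGTTATGGGT"
print(best_ungapped_identity(fragment, target))
print(qualifies_as_subsequence(fragment, target))
```

The stricter preferred embodiments can be screened by raising `min_length` and `min_identity`, for example to 20 nucleotides and 90%.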
The nucleotide sequence to be modified may be of cDNA or genomic origin as discussed above, but may also be of synthetic origin. Furthermore, the sequence may be of mixed cDNA and genomic, mixed cDNA and synthetic or genomic and synthetic origin as discussed above. The sequence may have been modified, e.g. by site-directed mutagenesis, to result in the desired nucleic acid fragment encoding the desired polypeptide.
The invention also relates to a replicable expression vector which comprises a nucleic acid fragment defined above, especially a vector which comprises a nucleic acid fragment encoding a polypeptide fragment of the invention. The vector may be any vector which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced. Thus, the vector may be an autonomously replicating vector, i.e. a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication; examples of such a vector are a plasmid, phage, cosmid, mini-chromosome and virus. Alternatively, the vector may be one which, when introduced in a host cell, is integrated in the host cell genome and replicated together with the chromosome(s) into which it has been integrated.
Expression vectors may be constructed to include any of the DNA segments disclosed herein. Such DNA might encode an antigenic protein specific for virulent strains of mycobacteria or even hybridization probes for detecting mycobacteria nucleic acids in samples. Longer or shorter DNA segments could be used, depending on the antigenic protein desired. Epitopic regions of the proteins expressed or encoded by the disclosed DNA could be included as relatively short segments of DNA. A wide variety of expression vectors is possible including, for example, DNA segments encoding reporter gene products useful for identification of heterologous gene products and/or resistance genes such as antibiotic resistance genes which may be useful in identifying transformed cells.
The vector of the invention may be used to transform cells so as to allow propagation of the nucleic acid fragments of the invention or so as to allow expression of the polypeptide fragments of the invention. Hence, the invention also pertains to a transformed cell harbouring at least one such vector according to the invention, said cell being one which does not natively harbour the vector and/or the nucleic acid fragment of the invention contained therein. Such a transformed cell (which is also a part of the invention) may be any suitable bacterial host cell or any other type of cell such as a unicellular eukaryotic organism, a fungus or yeast, or a cell derived from a multicellular organism, e.g. an animal or a plant. It is especially in cases where glycosylation is desired that a mammalian cell is used, although glycosylation of proteins is a rare event in prokaryotes. Normally, however, a prokaryotic cell is preferred, such as a bacterium belonging to the genera Mycobacterium, Salmonella, Pseudomonas, Bacillus and Escherichia. It is preferred that the transformed cell is an E. coli, B. subtilis, or M. bovis BCG cell, and it is especially preferred that the transformed cell expresses a polypeptide according to the invention.
The latter opens the possibility of producing the polypeptide of the invention by simply recovering it from the culture containing the transformed cell. In the most preferred embodiment of this part of the invention the transformed cell is Mycobacterium bovis BCG strain: Danish 1331, which is the Mycobacterium bovis strain Copenhagen from the Copenhagen BCG Laboratory, Statens Seruminstitut, Denmark.
The nucleic acid fragments of the invention allow for the recombinant production of the polypeptide fragments of the invention. However, isolation from the natural source is also a way of providing the polypeptide fragments, as is peptide synthesis.
Therefore, the invention also pertains to a method for the preparation of a polypeptide fragment of the invention, said method comprising inserting a nucleic acid fragment as described in the present application into a vector which is able to replicate in a host cell, introducing the resulting recombinant vector into the host cell (transformed cells may be selected using various techniques, including screening by differential hybridization, identification of fused reporter gene products, resistance markers, anti-antigen antibodies and the like), culturing the host cell in a culture medium under conditions sufficient to effect expression of the polypeptide (of course the cell may be cultivated under conditions appropriate to the circumstances, and if DNA is desired, replication conditions are used), and recovering the polypeptide from the host cell or culture medium; or isolating the polypeptide from a short-term culture filtrate; or isolating the polypeptide from whole mycobacteria of the tuberculosis complex or from lysates or fractions thereof, e.g. cell wall containing fractions, or synthesizing the polypeptide by solid or liquid phase peptide synthesis.
The medium used to grow the transformed cells may be any conventional medium suitable for the purpose. A suitable vector may be any of the vectors described above, and an appropriate host cell may be any of the cell types listed above. The methods employed to construct the vector and effect introduction thereof into the host cell may be any methods known for such purposes within the field of recombinant DNA. In the following a more detailed description of the possibilities will be given:
In general, of course, prokaryotes are preferred for the initial cloning of nucleic acid sequences of the invention and for constructing the vectors useful in the invention. For example, in addition to the particular strains mentioned in the more specific disclosure below, one may mention strains such as E. coli K12 strain (ATCC No. 31446), E. coli B, and E. coli χ1776 (ATCC No. 31537). These examples are, of course, intended to be illustrative and not limiting.
Prokaryotes are also preferred for expression. The aforementioned strains, as well as E. coli W3110 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli such as Bacillus subtilis, other enterobacteriaceae such as Salmonella typhimurium or Serratia marcescens, and various Pseudomonas species may be used. Especially interesting are rapid-growing mycobacteria, e.g. M. smegmatis, as these bacteria closely resemble mycobacteria of the tuberculosis complex and therefore stand a good chance of reducing the need for performing post-translational modifications of the expression product.
In general, plasmid vectors containing replicon and control sequences which are derived from species compatible with the host cell are used in connection with these hosts. The vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells. For example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species (see, e.g., Bolivar et al., 1977, Gene 2: 95). The pBR322 plasmid contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells.
The pBR plasmid, or other microbial plasmids or phages must also contain, or be modified to contain, promoters which can be used by the microorganism for expression.
Those promoters most commonly used in recombinant DNA construction include the β-lactamase (penicillinase) and lactose promoter systems (Chang et al., (1978), Nature, 35:515; Itakura et al., (1977), Science 198:1056; Goeddel et al., (1979), Nature 281:544) and a tryptophan (trp) promoter system (Goeddel et al., (1979) Nature 281:544; EPO Appl. Publ. No. 0036776). While these are the most commonly used, other microbial promoters have been discovered and utilized, and details concerning their nucleotide sequences have been published, enabling a skilled worker to ligate them functionally with plasmid vectors (Siebenlist et al., (1980), Cell, 20:269). Certain genes from prokaryotes may be expressed efficiently in E. coli from their own promoter sequences, precluding the need for addition of another promoter by artificial means.
After the recombinant preparation of the polypeptide according to the invention, the isolation of the polypeptide may for instance be carried out by affinity chromatography (or other conventional biochemical procedures based on chromatography), using a monoclonal antibody which substantially specifically binds the polypeptide according to the invention. Another possibility is to employ the simultaneous electroelution technique described by Andersen et al. in J. Immunol. Methods 161: 29-39.
According to the invention the post-translational modifications involve lipidation, glycosylation, cleavage, or elongation of the polypeptide.
In certain aspects, the DNA sequence information provided by this invention allows for the preparation of relatively short DNA (or RNA or PNA) sequences having the ability to specifically hybridize to mycobacterial gene sequences. In these aspects, nucleic acid probes of an appropriate length are prepared based on a consideration of the relevant sequence. The ability of such nucleic acid probes to specifically hybridize to the mycobacterial gene sequences lends them particular utility in a variety of embodiments.
Most importantly, the probes can be used in a variety of diagnostic assays for detecting the presence of pathogenic organisms in a given sample. However, other uses are envisioned, including the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic constructs.
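As an illustration of how a short hybridization probe might be derived from a known target sequence, the following minimal Python sketch computes the reverse complement of a target and a rough melting temperature. The function names and the 15-nucleotide target are hypothetical and are not taken from the sequences of the invention; the Wallace rule used here is only a coarse approximation suitable for short oligonucleotides.

```python
# Hypothetical helpers for illustration; not part of the disclosed methods.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    seq = seq.upper()
    if set(seq) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    return seq.translate(COMPLEMENT)[::-1]


def wallace_tm(seq: str) -> int:
    """Rough melting temperature in deg C: 2*(A+T) + 4*(G+C) (Wallace rule)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Made-up 15-nt target strand, used only to exercise the helpers.
target = "ATGACCGAGCGTACC"
probe = reverse_complement(target)
print(probe, wallace_tm(probe))
```

A probe obtained this way is complementary to the target strand, which is the property that allows specific hybridization; in practice probe length and stringency would be chosen for the assay at hand.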
Apart from their use as starting points for the synthesis of polypeptides of the invention and for hybridization probes (useful for direct hybridization assays or as primers in e.g. PCR or other molecular amplification methods) the nucleic acid fragments of the invention may be used for effecting in vivo expression of antigens, i.e. the nucleic acid fragments may be used in so-called DNA vaccines. Recent research has revealed that a DNA fragment cloned in a vector which is non-replicative in eukaryotic cells may be introduced into an animal (including a human being) by e.g. intramuscular injection or percutaneous administration (the so-called "gene gun" approach). The DNA is taken up by e.g. muscle cells and the gene of interest is expressed by a promoter which is functioning in eukaryotes, e.g. a viral promoter, and the gene product thereafter stimulates the immune system. These newly discovered methods are reviewed in Ulmer et al., (1993), Curr. Opin. Invest. Drugs, 2:983-989 which hereby is included by reference.
Hence, the invention also relates to a vaccine comprising a nucleic acid fragment according to the invention, the vaccine effecting in vivo expression of antigen by an animal, including a human being, to whom the vaccine has been administered, the amount of expressed antigen being effective to confer substantially increased resistance to infections with mycobacteria of the tuberculosis complex in an animal, including a human being.
The efficacy of such a "DNA vaccine" can possibly be enhanced by administering the gene encoding the expression product together with a DNA fragment encoding a polypeptide which has the capability of modulating an immune response. For instance, a gene encoding lymphokine precursors or lymphokines (e.g. IFN-γ, IL-2, or IL-12) could be administered together with the gene encoding the immunogenic protein, either by administering two separate DNA fragments or by administering both DNA fragments included in the same vector. It is also possible to administer DNA fragments comprising a multitude of nucleotide sequences which each encode relevant epitopes of the polypeptides disclosed herein so as to effect a continuous sensitization of the immune system with a broad spectrum of these epitopes.
In one embodiment of the invention, any of the above mentioned polypeptides is used in the manufacture of an immunogenic composition to be used for induction of an immune response in a mammal against an infection with a virulent Mycobacterium.
Preferably, the immunogenic composition is used as a vaccine.
The preparation of vaccines which contain peptide sequences as active ingredients is generally well understood in the art, as exemplified by U.S. Patents 4,608,251; 4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorporated herein by reference. Typically, such vaccines are prepared as injectables either as liquid solutions or suspensions; solid forms suitable for solution in liquid or suspension in liquid prior to injection may also be prepared. The preparation may also be emulsified. The active immunogenic ingredient is often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the effectiveness of the vaccines.
In one embodiment the composition used for vaccination comprises at least one, but preferably at least 2, such as at least 3, 4, 5, 10, 15 or at least 20 different polypeptides of the invention.
In another embodiment the composition to be used for vaccination comprises, together with at least one polypeptide of the invention, at least one, but preferably at least 2, such as at least 3, 4, 5, 10, 15 or at least 20 polypeptides which are not polypeptides of the present invention but are derived from a virulent Mycobacterium, such as a polypeptide belonging to the group of ST-CF (Elhay MJ and Andersen P, Immunology and Cell Biology (1997) 75, 595-603), ESAT-6, CFP7, CFP10 (EMBL accession number: AL022120), CFP17, CFP21, CFP25, CFP29, MPB59, MPT59, MPB64, and MPT64.
The vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1-2%.
Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 10-95% of active ingredient, preferably 25-70%.
The proteins may be formulated into the vaccine as neutral or salt forms.
Pharmaceutically acceptable salts include acid addition salts (formed with the free amino groups of the peptide) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.
The vaccines are administered in a manner compatible with the dosage formulation, and in such amount as will be therapeutically effective and immunogenic. The quantity to be administered depends on the subject to be treated, including, e.g., the capacity of the individual's immune system to mount an immune response, and the degree of protection desired. Suitable dosage ranges are of the order of several hundred micrograms of active ingredient per vaccination with a preferred range from about 0.1 µg to 1000 µg, such as in the range from about 1 µg to 300 µg, and especially in the range from about 10 µg to 50 µg. Suitable regimes for initial administration and booster shots are also variable but are typified by an initial administration followed by subsequent inoculations or other administrations.
The manner of application may be varied widely. Any of the conventional methods for administration of a vaccine are applicable. Preferred routes of administration are the parenteral route such as the intravenous, intraperitoneal, intramuscular, subcutaneous or intradermal routes; the oral (on a solid physiologically acceptable base or in a physiologically acceptable dispersion), buccal, sublingual, nasal, rectal or transdermal routes. The dosage of the vaccine will depend on the route of administration and will vary according to the age of the person to be vaccinated and, to a lesser degree, the weight of the person to be vaccinated.
Some of the polypeptides of the vaccine are sufficiently immunogenic in a vaccine, but for some of the others the immune response will be enhanced if the vaccine further comprises an adjuvant substance.
Various methods of achieving adjuvant effect for the vaccine include use of agents such as aluminum hydroxide or phosphate (alum), commonly used as a 0.05 to 0.1 percent solution in phosphate buffered saline, admixture with synthetic polymers of sugars (Carbopol) used as a 0.25 percent solution, and aggregation of the protein in the vaccine by heat treatment with temperatures ranging between 70° and 101°C for 30-second to 2-minute periods, respectively. Aggregation by reactivating with pepsin-treated (Fab) antibodies to albumin, mixture with bacterial cells such as C. parvum or endotoxins or lipopolysaccharide components of gram-negative bacteria, emulsion in physiologically acceptable oil vehicles such as mannide mono-oleate (Aracel A), or emulsion with a 20 percent solution of a perfluorocarbon (Fluosol-DA) used as a blood substitute may also be employed. According to the invention DDA (dimethyldioctadecylammonium bromide) is an interesting candidate for an adjuvant, but also Freund's complete and incomplete adjuvants as well as QuilA and RIBI adjuvants are interesting possibilities.
Other possibilities to enhance the immunogenic effect involve the use of immune modulating substances such as lymphokines (e.g. IFN-γ, IL-2 and IL-12) or synthetic IFN-γ inducers such as poly I:C in combination with the above-mentioned adjuvants.
In many instances, it will be necessary to have multiple administrations of the vaccine, usually not exceeding six vaccinations, more usually not exceeding four vaccinations and preferably one or more, usually at least about three vaccinations. The vaccinations will normally be at from two to twelve week intervals, more usually from three to five week intervals. Periodic boosters at intervals of 1-25 years, such as 20 years, preferably 15 or 10 years, more preferably 1-5 years, usually three years, will be desirable to maintain the desired levels of protective immunity.
In one embodiment of the invention a composition is produced comprising as the effective component a micro-organism, the micro-organism is a bacterium such as Mycobacterium, Salmonella, Pseudomonas and Escherichia, preferably Mycobacterium bovis BCG wherein at least one, such as at least 2 copies, such as at least 5 copies of a nucleotide fragment comprising a nucleotide sequence encoding a polypeptide of the invention has been incorporated into the genome of the micro-organism or introduced as a part of an expression vector in a manner allowing the micro-organism to express and optionally secrete the polypeptide. In a preferred embodiment, the composition comprises at least 2 different nucleotide sequences encoding at least 2 different polypeptides of the invention. In a much preferred embodiment, the composition comprises at least different nucleotide sequences encoding at least one polypeptide of the invention and at least one polypeptide belonging to the group of ST-CF (Elhay MJ and Andersen P, Immunology and cell Biology (1997) 75, 595-603) such as ESAT-6, CFP7, CFP10, CFP17, CFP21, CFP25, CFP29, MPB59, MPT59, MPB64, and MPT64.
Individuals infected with virulent Mycobacteria can generally be divided into two groups. The first group has an infection with a virulent Mycobacterium, e.g. contacts of TB patients. The virulent Mycobacterium may have established colonies in the lungs, but the individual has, as yet, no symptoms of TB. The second group has clinical symptoms of TB, i.e. TB patients.
In one embodiment of the invention, any of the above mentioned polypeptides are used for the manufacture of a diagnostic reagent that preferably distinguishes a subclinically or clinically infected individual (group I and group II) from an individual who has been BCG vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis Mycobacterium (NTM), and may distinguish a subclinically or clinically infected individual from an individual who has cleared a previous infection with a virulent Mycobacterium. It is most likely that specific polypeptides derived from SPE will distinguish group I and/or group II from individuals not infected with virulent Mycobacteria in the same way as ESAT-6 and CFP10 (P. Ravn et al., (1998), J. Infectious Disease 179:637-45).
In one embodiment of the invention, any of the above discussed polypeptides are used for the manufacture of a diagnostic reagent for the diagnosis of an infection with a virulent Mycobacterium. One embodiment of the invention provides a diagnostic reagent for differentiating an individual who is clinically or subclinically infected with a virulent Mycobacterium from an individual not infected with virulent Mycobacterium, i.e. an individual who has been BCG vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis Mycobacterium (NTM). Such a diagnostic reagent will distinguish between an individual in group I and/or II of the infection stages above, from an individual who has been vaccinated against TB. Another embodiment of the invention provides a diagnostic reagent for differentiating an individual who is clinically or subclinically infected with a virulent Mycobacterium from an individual who has a cleared infection with a virulent Mycobacterium. Such a diagnostic reagent will distinguish between an individual in group I and/or II of the infection stages above, from an individual who has cleared the infection.
Determination of an infection with virulent Mycobacterium will be instrumental in the, still very laborious, diagnostic process of tuberculosis. A number of possible diagnostic assays and methods can be envisaged (some more specifically described in the examples and the list of properties): a sample comprising whole blood or mononuclear cells (i.a. T-lymphocytes) from a patient could be contacted with a sample of one or more polypeptides of the invention. This contacting can be performed in vitro and a positive reaction could e.g. be proliferation of the T-cells or release of cytokines such as IFN-γ into the extracellular phase (e.g. into a culture supernatant).
Alternatively, a sample of a possibly infected organ may be contacted with an antibody raised against a polypeptide of the invention. The demonstration of the reaction by means of methods well-known in the art between the sample and the antibody will be indicative of ongoing infection and could be used to monitor treatment effect by reduction in responses. It is of course also a possibility to demonstrate the presence of anti-Mycobacterial antibodies in serum by contacting a serum sample from a subject with at least one of the polypeptide fragments of the invention and using well-known methods for visualising the reaction between the antibody and antigen such as ELISA, Western blot, precipitation assays.
Also a method of determining the presence of virulent Mycobacterium nucleic acids in a mammal, including a human being, or in a sample, comprising incubating the sample with a nucleic acid sequence of the invention or a nucleic acid sequence complementary thereto, and detecting the presence of hybridised nucleic acids resulting from the incubation (by using the hybridisation assays which are well-known in the art), is included in the invention. Such a method of diagnosing TB might involve the use of a composition comprising at least a part of a nucleotide sequence as defined above and detecting the presence of nucleotide sequences in a sample from the animal or human being to be tested which hybridises with the nucleic acid sequence (or a complementary sequence) by the use of PCR techniques.
The invention also relates to a method of diagnosing infection caused by a virulent Mycobacterium in a mammal, including a human being, comprising locally applying (patch test) or intradermally injecting (Mantoux test) a polypeptide of the invention. Both tests rely on a delayed-type hypersensitivity (DTH) reaction. A positive skin response at the location of injection or application is indicative of the mammal, including a human being, being infected with a virulent Mycobacterium, and a negative skin response at the location of injection or application is indicative of the mammal, including a human being, not having TB. A positive response is a skin reaction having a diameter of at least 5 mm larger than background, but larger reactions are preferred, such as at least 1 cm, 1.5 cm, and at least 2 cm in diameter. A skin reaction is here to mean erythema or induration of the skin, as directly measured. The composition used as the skin test reagent can be prepared in the same manner as described for the vaccines above.
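The reading of such a skin test can be expressed as a simple threshold rule. The following Python sketch is an illustration only: the function name and parameters are hypothetical, and the 5 mm default is the minimum difference over background stated above.

```python
def classify_skin_test(reaction_mm: float,
                       background_mm: float = 0.0,
                       threshold_mm: float = 5.0) -> str:
    """Classify a skin reaction by its diameter relative to background.

    A reaction is called positive when its diameter exceeds the background
    by at least `threshold_mm` (5 mm by default, per the text above).
    """
    if reaction_mm < 0 or background_mm < 0:
        raise ValueError("diameters must be non-negative")
    excess = reaction_mm - background_mm
    return "positive" if excess >= threshold_mm else "negative"


# Example readings in millimetres (made-up values).
print(classify_skin_test(12.0, background_mm=2.0))
print(classify_skin_test(5.0, background_mm=2.0))
```

Raising `threshold_mm` to 10, 15 or 20 corresponds to the larger, preferred reaction sizes mentioned in the text.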
In human volunteers, the generation of a significant immune response can alternatively be defined as the ability of the reagent being tested to stimulate an in vitro recall response by peripheral blood cells from at least 30% of PPD positive individuals previously vaccinated with that reagent or infected with a virulent Mycobacterium, said recall response being defined as proliferation of T cells or the production of cytokine(s) which is higher than the responses generated by cells from unimmunised or uninfected control individuals, with a 95% confidence interval as defined by an appropriate statistical analysis such as a Student's two-tailed T test.
Alternatively, a significant immune response could be detected in vivo by a test such as the generation of delayed type hypersensitivity in the skin in response to exposure to the immunising reagent, such response being significantly larger (with a 95% confidence interval as defined by appropriate statistical analysis such as a Student's two-tailed T test) in at least 30% of vaccinated or infected individuals than in placebo-treated or uninfected individuals.
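The two criteria named above, a Student's two-tailed T test against controls and a responder rate of at least 30%, can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the test shown is the classical pooled-variance (equal variance) two-sample form, and the resulting statistic must still be compared with the two-tailed 5% critical value for the returned degrees of freedom, which is not computed here.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t_statistic(treated, control):
    """Return (t, degrees_of_freedom) for Student's two-sample t test.

    Uses the pooled-variance form, which assumes equal variances in the
    two groups. Each group needs at least two observations.
    """
    n1, n2 = len(treated), len(control)
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least two observations")
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * variance(treated)
                  + (n2 - 1) * variance(control)) / df
    if pooled_var == 0:
        raise ValueError("pooled variance is zero; t is undefined")
    t = (mean(treated) - mean(control)) / sqrt(pooled_var * (1 / n1 + 1 / n2))
    return t, df


def meets_responder_threshold(n_responders, n_tested, minimum=0.30):
    """True when at least `minimum` (30% by default) of tested individuals respond."""
    if n_tested <= 0:
        raise ValueError("n_tested must be positive")
    return n_responders / n_tested >= minimum


# Made-up response values, used only to exercise the functions.
t, df = pooled_t_statistic([2.0, 4.0, 6.0], [1.0, 2.0, 3.0])
print(round(t, 3), df, meets_responder_threshold(4, 10))
```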
The polypeptides according to the invention may be potential drug targets. Once a particularly interesting polypeptide has been identified, the biological function of that polypeptide may be tested. The polypeptides may constitute receptor molecules or toxins which facilitate the infection by the Mycobacterium, and if such functionality is blocked, the infectivity of the virulent Mycobacterium will be diminished.
The biological function of particularly interesting polypeptides may be tested by studying the effect of inhibiting the expression of the polypeptides on the virulence of the virulent Mycobacterium. This inhibition may be performed at the gene level, such as by blocking the expression using antisense nucleic acid, PNA or LNA or by interfering with regulatory sequences, or the inhibition may be at the level of translation or post-translational processing of the polypeptide.
Once a particular polypeptide according to the invention is identified as critical for virulence, an anti-mycobacterial agent might be designed to inhibit the expression of that polypeptide. Such anti-mycobacterial agent might be used as a prophylactic or therapeutic agent. For instance, antibodies or fragments thereof, such as Fab and (Fab')2 fragments, can be prepared against such critical polypeptides by methods known in the art and thereafter used as prophylactic or therapeutic agents.
A presently preferred embodiment is an extract of polypeptides obtainable by a method comprising the steps of
a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a) at 2,000 g for 40 minutes;
c) resuspending the pellet of b) in PBS and 0.5% Tween 20 and sonicating with rounds of 90 seconds;
d) centrifugating the suspension of c) at 5,000 g for 30 minutes;
e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane components from the supernatant of d) with 10% SDS;
f) centrifugating the extract of e) at 20,000 g for 30 minutes;
g) precipitating the supernatant of f) with 8 volumes of cold acetone;
with an adjuvant substance.
In other words, the invention relates to use of an extract of polypeptides with an adjuvant substance for the preparation of a composition for the generation or determination of an immune response against a virulent Mycobacterium.
Finally, a monoclonal or polyclonal antibody, which is specifically reacting with a polypeptide of the invention in an immunoassay, or a specific binding fragment of said antibody, is also a part of the invention. The production of such polyclonal antibodies requires that a suitable animal be immunized with the polypeptide and that these antibodies are subsequently isolated, suitably by immune affinity chromatography.
The production of monoclonals can be effected by methods well-known in the art, since the present invention provides for adequate amounts of antigen for both immunization and screening of positive hybridomas.
Examples
EXAMPLE 1: Total extraction of proteins from dead M. tuberculosis bacteria.
M. tuberculosis at 1.5 × 10^9 bacteria/ml was heat treated at 55°C for 1.5 hours and checked for sterility. 10 ml of these heat-killed bacteria was centrifuged at 2,000 g for 40 min; the supernatant was discarded and the pellet was resuspended in PBS containing 0.5% Tween 20 and used as the antigen source. The pellet was sonicated with 20 rounds of 90 seconds and centrifuged for 30 min at 5,000 g to remove unbroken cells. The supernatant, containing soluble proteins as well as cell wall and cell membrane components, was extracted twice with 10% SDS to release proteins inserted in the cell wall and membrane compartments. After centrifugation at 20,000 g for 30 min the supernatant was precipitated with 8 volumes of cold acetone and resuspended in PBS at a protein concentration of 5 mg/ml and named Somatic Proteins Extract (SPE).
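The final precipitation and resuspension steps involve two simple volume calculations, sketched below in Python. The function names are hypothetical; the 8 volumes of acetone and the 5 mg/ml target concentration are the values stated in this example, while the sample volume and protein mass in the usage lines are made up.

```python
def acetone_volume_ml(supernatant_ml: float, volumes: float = 8.0) -> float:
    """Volume of cold acetone for precipitation (8 volumes per volume of supernatant)."""
    if supernatant_ml <= 0:
        raise ValueError("supernatant volume must be positive")
    return supernatant_ml * volumes


def resuspension_volume_ml(protein_mg: float, target_mg_per_ml: float = 5.0) -> float:
    """Volume of PBS needed to reach the target protein concentration (5 mg/ml)."""
    if protein_mg <= 0 or target_mg_per_ml <= 0:
        raise ValueError("protein mass and concentration must be positive")
    return protein_mg / target_mg_per_ml


# Made-up inputs: 2.5 ml of supernatant containing 12.5 mg of protein.
print(acetone_volume_ml(2.5), resuspension_volume_ml(12.5))
```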
Analysis of protective immune response for tuberculosis after immunisation with different M.tuberculosis protein preparations.
The protective efficacy of SPE was evaluated in a vaccination experiment and compared to the two vaccines ST-CF and BCG, known to induce protection against TB.
Five groups of 6-8 week old female C57BL/6J mice (Bomholtgaard, Denmark) were immunised subcutaneously at the base of the tail with vaccines of the following composition:
Group 1: BCG
Group 2: 1 × 10^7 heat-killed M. tuberculosis/DDA (250 µg DDA)
Group 3: 50 µg ST-CF/DDA (250 µg)
Group 4: 50 µg SPE/DDA (250 µg)
Group 5: Adjuvant control: DDA (250 µg) in NaCl
The animals were injected with a volume of 0.2 ml. The mice of groups 2, 3 and 4 were boosted twice at two-week intervals.
Four weeks after the last immunisation three mice/group were sacrificed and the spleens removed. The immune response induced in the spleen cells was monitored by release of IFN-γ into the culture supernatants when stimulated in vitro with relevant antigens (Table 2). ST-CF and SPE induced a similar immune response while only a very low IFN-γ release was observed after immunisation with BCG and stimulation with ST-CF.
Table 2. Recognition of protein preparations after immunisation, presented as IFN-γ release (pg/ml) after restimulation.

Immunogen    No antigen    ST-CF         SPE
ST-CF        <200          6752 ± 591    8431 ± 459
SPE          <200          6621 ± 203    11079 ± 178
BCG          <200          469 ± 32      ND
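To compare the responses in Table 2 numerically, the mean values can be held in a small lookup structure and expressed as fold differences. The sketch below is illustrative only: the dictionary layout and function name are hypothetical, only the mean IFN-γ values from the table are used (the spread values are omitted), and "ND" is represented as None.

```python
# Mean IFN-gamma release (pg/ml) from Table 2, keyed by immunogen and then
# by the antigen used for in vitro restimulation. None marks "ND".
IFN_GAMMA_PG_ML = {
    "ST-CF": {"ST-CF": 6752, "SPE": 8431},
    "SPE": {"ST-CF": 6621, "SPE": 11079},
    "BCG": {"ST-CF": 469, "SPE": None},
}


def fold_difference(immunogen: str, reference: str, antigen: str):
    """Ratio of mean IFN-gamma release for two immunogens under one antigen.

    Returns None when either value was not determined.
    """
    value = IFN_GAMMA_PG_ML[immunogen][antigen]
    baseline = IFN_GAMMA_PG_ML[reference][antigen]
    if value is None or baseline is None:
        return None
    return value / baseline


# ST-CF-immunised versus BCG-immunised mice, both restimulated with ST-CF.
print(round(fold_difference("ST-CF", "BCG", "ST-CF"), 1))
```

The ratio makes concrete the observation that BCG immunisation gave only a very low response to ST-CF restimulation compared with ST-CF or SPE immunisation.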
Seven weeks after the final immunisation the mice received a primary infection with 5 × 10^5 H37Rv in 0.1 ml i.v., and two weeks later the mice were sacrificed and the spleens were isolated for bacterial enumeration (figure 2).
BCG induced a high level of protection in the spleen as expected, but so did the killed H37Rv, ST-CF and SPE; all preparations induced protection at almost the same level, with SPE as the most potent of these preparations.
These data demonstrate that there are components to be found among the somatic proteins of H37Rv which in an animal model protect against tuberculosis at the same level as BCG.
EXAMPLE 2: Subcellular fractionation of Mycobacterium tuberculosis
1.5 × 10^9 colony forming units (CFU/ml) of M. tuberculosis H37Rv were inactivated by heat-killing at 60°C for 1.5 hours. The heat-killed Mycobacteria were centrifuged at 3,000 x g for 20 min; the supernatant was discarded and the pellet was resuspended in cold PBS. This step was repeated twice. After the final wash, the pellet was resuspended in a homogenising buffer consisting of PBS supplemented with 10 mM EDTA and 1 mM of phenylmethylsulfonyl fluoride in a ratio of 1 ml buffer per 0.5 g of heat-killed Mycobacteria. The sample was sonicated on ice for 15 min (1 min pulser on/10 sec pulser off) and subsequently lysed three times with a French Pressure Cell at 12,000 lb/in². The lysate was centrifuged at 27,000 x g for 20 min; the pellet was washed in homogenising buffer and recentrifuged. The pooled supernatants contained a mixture of cytosol and membrane components, while the pellet represented the crude cell wall.
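Two of the settings in this step lend themselves to a quick calculation: the effective sonication time under a pulsed regime and the French press pressure in SI units. The Python sketch below is an illustration under stated assumptions: the function names are hypothetical, the pulse setting is read as 1 min on and 10 s off, the pressure is read as 12,000 lb/in² (psi), and the 15 min is assumed to be total elapsed time, which the text does not specify.

```python
PA_PER_PSI = 6894.757293168  # pascals per pound-force per square inch


def psi_to_mpa(psi: float) -> float:
    """Convert a pressure from lb/in^2 (psi) to megapascals."""
    return psi * PA_PER_PSI / 1e6


def sonication_on_time_s(total_s: int, on_s: int, off_s: int) -> int:
    """Seconds of active sonication within `total_s` of elapsed time.

    Assumes the run starts with an 'on' pulse and alternates on/off.
    """
    if total_s < 0 or on_s <= 0 or off_s < 0:
        raise ValueError("invalid timing parameters")
    full_cycles, remainder = divmod(total_s, on_s + off_s)
    return full_cycles * on_s + min(remainder, on_s)


# 15 min elapsed with 60 s on / 10 s off, and 12,000 psi in MPa.
print(sonication_on_time_s(15 * 60, 60, 10), round(psi_to_mpa(12000), 1))
```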
Preparation of cell wall:
RNase and DNase were added to the cell wall pellet, resuspended in homogenising buffer, to a final concentration of 1 mg/ml, and the suspension was incubated overnight at 4°C. The cell wall was washed twice in homogenising buffer, twice in homogenising buffer saturated with KCl, and twice with PBS. Soluble proteins were extracted from the cell wall by a 2 hour incubation with 2% SDS at 6°C. The insoluble cell wall core was removed by centrifugation at 27,000 x g for 20 min and the SDS-extraction was repeated. Finally, the pooled supernatants were precipitated with 6 volumes of chilled acetone and resuspended in PBS.
Preparation of cytosol and membrane:
To separate the cytosol and the membrane fraction, the pooled supernatants were ultracentrifugated at 100,000 x g for 2 hours at 5°C. The cytosol proteins in the supernatant were precipitated with acetone and resuspended in PBS. The pellet, representing the membrane fraction, was washed in PBS, ultracentrifugated, and finally resuspended in PBS.
Triton X-114 extraction of cell wall and membrane:
To prepare protein fractions largely devoid of lipoarabinomannan, the cell wall and the membrane fraction were subjected to extraction with precondensed Triton X-114. Triton X-114 was added to the protein sample at a final concentration of 4%. The solution was mixed on ice for 60 min and centrifuged at 20,000 x g for 15 min at 4°C. The pellet containing residual insoluble material was extracted once more (membrane) or twice (cell wall), while the supernatant was warmed to 37°C to condense the Triton X-114. After centrifugation of the supernatant at 12,000 x g for 15 min, the aqueous phase and detergent phase were separated. The aqueous phase and detergent phase were washed twice with Triton X-114 and PBS, respectively. The combined aqueous phases and residual insoluble material containing the majority of proteins were pooled, precipitated with acetone, and resuspended in PBS.
The specificity of the human T-cell response in TB patients was investigated by stimulating PBMCs with panels of narrow molecular mass fractions from membrane, cell wall, and cytosol obtained by the multi-elution technique described by Andersen et al. (1993) J. Immunol. Methods 161:29-39. The technique resulted in 30 sharply defined fractions and allowed an identification of immunologically active regions with potential as either diagnostic reagents or vaccine components.
The study demonstrated that multiple targets within the cell wall, membrane, and cytosol were recognised by the donors and initiated IFN-γ release as well as cellular proliferation (unpublished results). The broad cellular response was directed towards both the low molecular mass fractions and some of the higher molecular mass fractions.
These experiments suggest the existence of numerous target antigens among the cell wall, membrane, and cytosol fractions and it is therefore likely that some of these will have a potential as a protective or diagnostic reagent.
EXAMPLE 3: Identification of proteins from the cytosolic fraction
Use of patient sera to identify M. tuberculosis antigens
This example illustrates the identification of antigens from the cytosol fraction by screening with serum from M. tuberculosis infected individuals in western blot. The reaction with serum was used as an indication that the proteins are recognised immunologically.
The cytosol was precipitated with ammonium sulphate at 80% saturation. The non-precipitated proteins were removed by centrifugation and precipitated proteins were resuspended in 20 mM imidazole pH 7.0. The protein solution was applied to a DEAE Sepharose 6B column, equilibrated with 20 mM imidazole pH 7.0. Bound protein was eluted from the column using a salt gradient from 0 to 1 M NaCl in 20 mM imidazole pH 7.0. Fractions collected during elution were analysed on a silver stained 10-20% SDS-PAGE and on 2 dimensional electrophoresis.
For use in western blot a pool of serum from 5 TB patients was made. These patients ranged from minimal to severe TB. Nitrocellulose membranes were blocked with phosphate buffer, pH 7.3, containing 0.37 M NaCl and 0.5% Tween-20, for 30 min. The serum pool was diluted in phosphate buffer, pH 7.3, containing 0.37 M NaCl. The blots were incubated in the serum dilution overnight at room temperature on a shaker.
Membranes were washed four times for 5 min in the dilution buffer, and incubated with 1:1,000 diluted peroxidase-labelled swine anti-human IgG (P214, Dako) for 1 hour at room temperature on a shaker. Blots were then washed four times for 5 min in the dilution buffer and stained with DONS/TMB.
N-terminal sequencing and amino acid analysis
Proteins of the fractions containing bands reactive with serum from TB patients in western blot were separated by 2D electrophoresis. Gels were blotted to PVDF membranes and spots subjected to N-terminal sequencing on a Procise sequencer (Applied Biosystems).
The following N-terminal sequences were obtained:
For TB15: TERTAVLIKPDGIER (SEQ ID NO: 39)
For TB18: TDTQVTWLTQESHDR (SEQ ID NO: 40)
For TB21: MIDEALFDAEEKMEK (SEQ ID NO: 41)
For TB33: PLPADPSTDLSAYAQ (SEQ ID NO: 42)
For TB38: MLISQRPTLSEDVLT (SEQ ID NO: 43)
For TB54: TGNLVTKNSLTPDVR (SEQ ID NO: 44)
Sequence identity searches
The N-terminal sequences obtained were used for an identity search using the blast program of the Sanger M. tuberculosis database:
http://www.sanger.ac.uk/Projects/M_tuberculosis/blast_server.shtml
In addition, the GenEMBL database was searched using the BLASTP program (Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J. Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10.), to reveal proteins with homology to the full amino acid sequences obtained from the Sanger database.
Thereby, the following information was obtained:
For the 15 determined N-terminal amino acids for TB15 a 93% identical sequence was found in MTV008.01c. Amino acid 5 of the determined N-terminal sequence (A) is an L in the sequence MTV008.01c.
Within the open reading frame the translated protein is 136 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 136 amino acids, which corresponds to a theoretical molecular mass of 14 509 Da and a theoretical pI of 5.36. The observed mass in SDS-PAGE is 14 kDa.
TB15 has 80% sequence identity in a 139 amino acid overlap to a protein of M. smegmatis. It is homologous to putative nucleoside diphosphate kinases from several species, e.g. 59% sequence identity to a 151 amino acid protein of Archaeoglobus fulgidus and 57% sequence identity to a 149 amino acid protein of Bacillus subtilis.
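The 93% figure reported for TB15 corresponds to one mismatch in 15 aligned residues. The following is a minimal Python sketch of that arithmetic, not the BLAST scoring itself. The function name percent_identity is hypothetical, and the database segment is an assumption rebuilt from the text (the determined sequence with the A at position 5 replaced by L), not a sequence copied from MTV008.01c.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of identical residues over an ungapped, equal-length alignment."""
    if len(query) != len(subject):
        raise ValueError("sequences must have equal length")
    matches = sum(q == s for q, s in zip(query, subject))
    return 100.0 * matches / len(query)


determined = "TERTAVLIKPDGIER"  # SEQ ID NO: 39, determined by Edman sequencing

# Assumption: ORF segment reconstructed from the description above
# (position 5 is L in the database sequence instead of A).
orf_segment = determined[:4] + "L" + determined[5:]

identity = percent_identity(determined, orf_segment)
print(f"{identity:.0f}% identity over {len(determined)} residues")
```

The same function applied to the fully matching sequences (for example TB18 or TB21) would return 100.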
For the 15 determined N-terminal amino acids for TB18 a 100% identical sequence was found in MTCY017.33c.
Within the open reading frame the translated protein is 164 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 164 amino acids, which corresponds to a theoretical molecular mass of 17 855 Da and a theoretical pI of 4.81. The observed mass in SDS-PAGE is 20 kDa.
TB18 has 94% sequence identity, in a 164 amino acid overlap, to a protein from M. leprae. In addition, it is homologous to transcription elongation factors from several species, e.g. 32% sequence identity in a 114 amino acid overlap, to a protein from Zymomonas mobilis.
For the 15 determined N-terminal amino acids for TB21 a 100% identical sequence was found in MTCY274.13c.
Within the open reading frame the translated protein is 185 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This corresponds to a theoretical molecular mass of 20 829 Da and a theoretical pI of 5.81. The observed mass in SDS-PAGE is 22 kDa.
TB21 has 90% sequence identity in a 185 amino acid overlap to a protein from M. leprae. In addition, it is homologous to ribosome recycling factors from several species, e.g. 63% in a 185 amino acid overlap to a protein from Streptomyces coelicolor.
For the 15 determined N-terminal amino acids for TB33 an 85% identical sequence was found in MTCY71.23. Amino acids 8 and 9 of the determined N-terminal sequence (T and D) are a P and a T in MTCY71.23, respectively.
Within the open reading frame the translated protein is 297 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 297 amino acids, which corresponds to a theoretical molecular mass of 33 323 Da and a theoretical pI of 4.91. The observed mass in SDS-PAGE is 35 kDa.
TB33 has 83% sequence identity in a 296 amino acid overlap to a protein from M. leprae. In addition, it is homologous to thiosulphate sulfurtransferases (rhodanese) from several species, e.g. 48% in a 131 amino acid overlap to rhodanese from Saccharopolyspora erythraea.
For the 15 determined N-terminal amino acids for TB38 a 100% identical sequence was found in MTCY13E12.10c.
Within the open reading frame the translated protein is 347 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This corresponds to a theoretical molecular mass of 37 710 Da and a theoretical pI of 4.53. The observed mass in SDS-PAGE is 38 kDa.
TB38 is homologous to DNA-directed RNA polymerase alpha-chains from several species, e.g. 79% in a 321 amino acid overlap to a protein from Streptomyces coelicolor.
For the 15 determined N-terminal amino acids for TB54 a 100% identical sequence was found in MTCY20B11.23c.
Within the open reading frame the translated protein is 495 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 495 amino acids, which corresponds to a theoretical molecular mass of 54 329 Da and a theoretical pI of 5.00. The observed mass in SDS-PAGE is 60 kDa.
TB54 is homologous to adenosylhomocysteinases from several species, e.g. 73% in a 90 amino acid overlap to S-adenosyl-L-homocysteine hydrolase from Triticum aestivum.
It contains an S-adenosyl-L-homocysteine hydrolase signature (PS00739).
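For the six serum-reactive proteins above, the theoretical masses and the masses observed in SDS-PAGE can be compared directly. The sketch below only restates the values reported in this example and computes the signed deviation of the observed from the theoretical mass; the names masses and deviation_percent are hypothetical, and no claim is made about the cause of any deviation.

```python
# Theoretical mass (Da) and observed SDS-PAGE mass (kDa), as reported above.
masses = {
    "TB15": (14509, 14),
    "TB18": (17855, 20),
    "TB21": (20829, 22),
    "TB33": (33323, 35),
    "TB38": (37710, 38),
    "TB54": (54329, 60),
}


def deviation_percent(theoretical_da: float, observed_kda: float) -> float:
    """Signed deviation of the observed mass from the theoretical mass, in percent."""
    return 100.0 * (observed_kda * 1000.0 - theoretical_da) / theoretical_da


deviations = {name: deviation_percent(t, o) for name, (t, o) in masses.items()}
for name, dev in deviations.items():
    print(f"{name}: {dev:+.1f}%")
```

With these figures, TB15 is the only protein observed slightly below its theoretical mass, and TB18 shows the largest relative deviation.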
Example 3a: Use of patient sera to identify M. tuberculosis cytosol antigens.
Anion exchange chromatography of the cytosol proteins and western blot experiments with a pool of sera from TB patients were performed as described in Example 3.
N-terminal sequencing
Proteins of the fractions containing TB12.5, TB20.6, and TB40.8 were separated by 2D electrophoresis. Gels were blotted to PVDF membranes and spots subjected to N-terminal sequencing on a Procise sequencer (Applied Biosystems).
The following N-terminal sequences were obtained:
For TB12.5: ALKVEMVTFDXSDPA (SEQ ID NO: 80)
For TB20.6: ADADTTDFDVDAEAP (SEQ ID NO: 81)
For TB40.8: SKTVLILGAGVGGLT (SEQ ID NO: 82)
Sequence identity searches were performed as described in Example 3.
Thereby, the following information was obtained:
TB12.5
For the 15 determined N-terminal amino acids of TB12.5 a 93% identical sequence was found in Rv0801. The X in position 11 is a cysteine.
Within the open reading frame the translated protein is 115 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 115 amino acids, which corresponds to a theoretical molecular mass of 12 512 Da and a theoretical pI of 4.91. The observed mass in SDS-PAGE is 14 kDa.
No homology was found to TB12.5.
TB20.6
For the 15 determined N-terminal amino acids of TB20.6 a 100% identical sequence was found in Rv3920c.
Within the open reading frame the translated protein is 187 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This gives a protein of 187 amino acids, which corresponds to a theoretical molecular mass of 20 559 Da and a theoretical pI of 4.14. The observed mass in SDS-PAGE is 24 kDa.
TB20.6 has 73% homology to a 193 amino acid protein of M. leprae. It has 59% homology in a 184 amino acid overlap to a Jag-like protein from Streptomyces coelicolor.
TB40.8
For the 15 determined N-terminal amino acids of TB40.8 a 100% identical sequence was found in Rv0331.
Within the open reading frame the translated protein is 388 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 388 amino acids, which corresponds to a theoretical molecular mass of 40 792 Da and a theoretical pI of 5.06. The observed mass in SDS-PAGE is 44 kDa.
No homology was found to TB40.8.
Identification of abundant proteins
As immunity to tuberculosis is not B-cell but T-cell mediated, reactivity with serum from TB patients was not the only selection criterion used to identify proteins from the cytosol. Further proteins were selected by virtue of their abundance in the cytosol.
The cytosol was precipitated with ammonium sulphate at 80% saturation. The non-precipitated proteins were removed by centrifugation and precipitated proteins were resuspended in 20 mM imidazole, pH 7.0. The protein solution was applied to a DEAE Sepharose 6B column equilibrated with 20 mM imidazole. Bound protein was eluted from the column using a salt gradient from 0 to 1 M NaCl in 20 mM imidazole.
Fractions collected during elution were analysed by silver-stained 10-20% SDS-PAGE and by two-dimensional electrophoresis. Fractions containing well separated bands were selected for 2D electrophoresis and blotted to PVDF, after which spots, visualised by staining with Coomassie Blue, were selected for N-terminal sequencing.
The following N-terminal sequences were obtained:
For TB10C: MEVKIGITDSPRELV (SEQ ID NO: 45)
For TB15A: SAYKTVVVGTDDXSX (SEQ ID NO: 46)
For TB17: MEQRAELVVGRALVV (SEQ ID NO: 47)
For TB24: ADIDGVTGSAGL(N)PA (SEQ ID NO: 48)
For TB27B: TYETILVERDQRVGI (SEQ ID NO: 49)
No sequence identity was found when searching the Sanger database using the blast program. However, when the blast program at Swiss-blast was used, a sequence was obtained.
For the 15 determined N-terminal amino acids for TB10C a 93% identical sequence was obtained. The first amino acid of the N-terminal sequence (M) is a V in the sequence found, corresponding to GTG being used as a start codon instead of ATG.
Within the open reading frame the translated protein is 90 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid 1.
This corresponds to a theoretical molecular mass of 9 433 Da and a theoretical pI of 4.93. The observed mass in SDS-PAGE is 10 kDa.
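The M/V discrepancy for TB10C can be illustrated with the standard genetic code: GTG encodes Val at internal positions, but a GTG used as initiation codon is decoded as Met, which is what Edman sequencing of the mature protein shows. The sketch below translates the coding part of the forward primer TB10C-F listed in Example 5 (SEQ ID NO: 58). The function translate_orf is hypothetical, and treating the first codon unconditionally as Met is a simplifying assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (NCBI table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_orf(dna: str) -> str:
    """Translate an ORF; the first codon is assumed to be an initiation codon."""
    usable = len(dna) - len(dna) % 3
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return "M" + protein[1:]  # initiator (ATG or GTG) is decoded as Met


# Primer TB10C-F; the gene-specific part follows the BglII site AGATCT.
primer = "CTGAGATCTGTGGAGGTCAAGATCGGT"
coding = primer[primer.index("AGATCT") + 6:]

print("GTG as internal codon:", CODON_TABLE["GTG"])
print("Translated from GTG start:", translate_orf(coding))
```

The translated residues agree with the first six residues of the determined TB10C sequence, MEVKIGITDSPRELV.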
For the determined N-terminal sequence of TB15A a 78% identical sequence was found in MTCY0182.28. The X at position 13 of the determined N-terminal sequence corresponds to a G in MTCY0182.28 and the X at position 15 to a D.
Within the open reading frame the translated protein is 146 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 146 amino acids, which corresponds to a theoretical molecular mass of 15 313 Da and a theoretical pI of 5.60. The observed mass in SDS-PAGE is 16 kDa.
The highest sequence identity, 32% in a 34 amino acid overlap, was found to a conserved protein of Methanobacterium thermoautotrophicum.
For the 15 determined N-terminal amino acids for TB17 a 100% identical sequence was found in MTV044.12.
Within the open reading frame the translated protein is 165 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid 1.
This gives a protein of 165 amino acids, which corresponds to a theoretical molecular mass of 16 793 Da and a theoretical pI of 4.22. The observed mass in SDS-PAGE is 18 kDa.
TB17 is homologous to putative molybdenum cofactor biosynthesis proteins from several species, e.g. 34% in a 103 amino acid overlap to moaCB from Synechococcus spp.
For the 15 determined N-terminal amino acids for TB24 a 92% identical sequence was found in MTCY07D11.03. The tentative N in position 13 of the determined amino acid sequence is a Q in MTCY07D11.03, and the A at position 15 is a G.
Within the open reading frame the translated protein is 216 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 216 amino acids, which corresponds to a theoretical molecular mass of 24 227 Da and a theoretical pI of 4.91. The observed mass in SDS-PAGE is 28 kDa.
TB24 is homologous to RNA polymerase sigma-E factors from several species, e.g. 55% in a 72 amino acid overlap to ECF sigma factor RpoE1 from Myxococcus xanthus.
For the 15 determined N-terminal amino acids for TB27B a 100% identical sequence was found in MTCY017.23c.
Within the open reading frame the translated protein is 257 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 257 amino acids, which corresponds to a theoretical molecular mass of 27 276 Da and a theoretical pI of 4.82. The observed mass in SDS-PAGE is 28 kDa.
TB27B has 86% sequence identity in a 257 amino acid overlap to a protein from M. leprae. In addition, it is homologous to enoyl-CoA hydratases from several species, e.g. 66% in a 257 amino acid overlap to a protein from Rhizobium meliloti.
Identification of TB13A
One protein spot was selected by its reaction with the monoclonal antibody ST-3 in western blot. N-terminal sequencing of the spot on the PVDF membrane corresponding to the ST-3 spot yielded the following result:
For TB13A: PVTQEEIIAGIAEII (SEQ ID NO: 50)
Sequence identity search on the TB13A N-terminal sequence gave the following results:
For the 15 determined N-terminal amino acids for TB13A a 100% identical sequence was found in MTCY427.25.
Within the open reading frame the translated protein is 115 amino acids long.
The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 115 amino acids, which corresponds to a theoretical molecular mass of 12 524 Da and a theoretical pI of 3.87. The observed mass in SDS-PAGE is 10 kDa.
TB13A has 94% sequence identity to a 115 amino acid protein of M. leprae. It is homologous to putative acyl carrier proteins from several species, e.g. 59% sequence identity to a 78 amino acid protein of Myxococcus xanthus and 56% to an 82 amino acid protein from Streptomyces coelicolor.
Identification of TB64
Biotinylated proteins were purified from the cytosol fraction in the following way: 12 mg of the cytosol fraction was added to 100 µl of TetraLink Tetrameric Avidin Resin (Promega) in PBS, pH 7.4, in an Eppendorf tube. After incubation overnight at 4°C, centrifugation (1000 g for 5 min) was performed and the resin was washed five times with PBS, pH 7.4, each time followed by centrifugation and collection of the supernatant.
Thereafter, 100 µl of 4 times concentrated SDS-PAGE sample buffer (0.08 M Tris-HCl, 8% SDS, 16% glycerol, 24 mM EDTA, pH 8.0) was added to the resin and it was boiled for 20 minutes.
After centrifugation the supernatant was collected and analysed for the presence of biotinylated proteins: the sample was analysed on SDS-PAGE followed by semi-dry blotting to nitrocellulose. The nitrocellulose membranes were incubated with alkaline phosphatase labelled streptavidin (D396, DAKO, Glostrup, Denmark). Nitro-blue tetrazolium/5-bromo-4-chloro-3-indolyl phosphate was used as substrate.
N-terminal sequencing
The eluate from the TetraLink Tetrameric Avidin Resin was loaded on a precast 10-20% Tricine SDS-PAGE gel (Novex, San Diego, USA). After electrophoresis the gel was blotted to Problott PVDF membrane (Applied Biosystems, Foster City, CA) by semidry electroblotting in 10 mM CAPS, 10% methanol, pH 11. The PVDF membrane was stained with 0.1% Coomassie R-250 in 40% methanol, 1% acetic acid, and destained in 50% methanol. A band of 10 kDa which was identified as a biotinylated protein as described above was excised and subjected to N-terminal sequence analysis by automated Edman degradation using a Procise 494 sequencer (Applied Biosystems) as described by the manufacturer.
The following sequence was obtained:
VIRRKPKPRXR (SEQ ID NO: 57)
Submission of this sequence to the Sanger Centre M. tuberculosis blast server identified the open reading frame Rv3285 (91% identity in 11 amino acids) encoding a protein of 600 amino acids. The determined sequence showed identity to amino acids 511 to suggesting that the identified peptide is a C-terminal fragment of the protein. As expected, the pattern for biotinylation of a lysine was identified in the C-terminal part of the protein: GDLVVVLEAMKMENPVTA (residues 556-573, PROSITE pattern PS00188).
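The position of the lysine within the quoted biotinylation segment can be derived from the residue numbering given above. The sketch below is illustrative only. It assumes that the lysine inside the Ala-Met-Lys-Met motif, which is commonly conserved in biotin-carrying domains, is the attachment site; the text itself does not name the modified residue, and the variable names are hypothetical.

```python
segment = "GDLVVVLEAMKMENPVTA"  # residues 556-573 of Rv3285, as quoted above
first_residue = 556

# Consistency check: 18 residues starting at 556 end at 573.
last_residue = first_residue + len(segment) - 1

# Assumption: the Lys of the Ala-Met-Lys-Met motif is the biotin attachment site.
motif = "AMKM"
offset = segment.index(motif) + motif.index("K")
lysine_position = first_residue + offset

print(f"Segment spans residues {first_residue}-{last_residue}")
print(f"Putative biotinyl-lysine: K{lysine_position}")
```

Under that assumption the candidate residue lies well inside the C-terminal part of the 600 amino acid protein, in line with the 10 kDa fragment retained on the avidin resin.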
EXAMPLE 4: Identification of proteins from the cell wall.
Identification of TB11B, TB16, TB16A, TB32, TB32A, and TB51.
Proteins contained in the cell wall fraction were separated by 2-D electrophoresis. A sample containing 120 mg protein was subjected to isoelectric focusing in a pH gradient from 4 to 7. The second dimension separation (SDS-PAGE) was carried out in a 10-20% acrylamide gradient. After blotting onto a PVDF membrane, proteins could be visualised by Coomassie blue staining.
N-terminal sequencing.
The relevant spots were excised from the PVDF membrane and subjected to N-terminal sequencing using a Procise sequencer (Applied Biosystems). The following N-terminal sequences were obtained:
TB11B: PVVKINAIEVPAGA (SEQ ID NO: 51)
TB16: ADKTTQTIYIDADPG (SEQ ID NO: 52)
TB16A: PVLSKTVEVTADAAS (SEQ ID NO: 53)
TB32: SGNSSLGIIVGIDD (SEQ ID NO: 54)
TB32A: AEVLVLVEHAEGALK (SEQ ID NO: 55)
TB51: MKSTVEQLSPTRVRI (SEQ ID NO: 56)
N-terminal sequence identity searching and identification of the corresponding genes.
The N-terminal amino acid sequence from each of the proteins identified was used for a sequence identity search using the tblastn program at NCBI:
http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-blast?Jform=0
The following information was obtained:
TB11B:
The 14 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid SCY06F7.
The identity is found within an open reading frame of 105 amino acids length corresponding to a theoretical molecular mass of 11 185 Da and a pI of 6.18.
The apparent molecular mass in an SDS-PAGE gel is 12 kDa.
The amino acid sequence shows some low level similarity to oxygenases and hypothetical proteins.
TB16:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTV021.
The identity is found within an open reading frame of 144 amino acids length corresponding to a theoretical molecular mass of 16 294 Da and a pI of 4.64.
The apparent molecular mass in an SDS-PAGE gel is 17 kDa.
The amino acid sequence shows some similarity to other hypothetical mycobacterial proteins.
TB16A:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid 128.
The identity is found within an open reading frame of 146 amino acids length corresponding to a theoretical molecular mass of 16 060 Da and a pI of 4.44.
The apparent molecular mass in an SDS-PAGE gel is 14 kDa.
TB32:
The 14 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTCY1A10.
The identity is found within an open reading frame of 297 amino acids length corresponding to a theoretical molecular mass of 31 654 Da and a pI of 5.55.
The apparent molecular mass in an SDS-PAGE gel is 33 kDa.
The amino acid sequence shows some similarity to other hypothetical mycobacterial proteins.
TB32A:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTV012.
The identity is found within an open reading frame of 318 amino acids length corresponding to a theoretical molecular mass of 31 694 Da and a pI of 4.61.
The apparent molecular mass in an SDS-PAGE gel is 32 kDa.
The amino acid sequence reveals high sequence identity to the fixB gene product from several organisms. Probable electron transfer flavoprotein alpha subunit for various dehydrogenases. Equivalent to Mycobacterium leprae FixB.
TB51:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTV008.
The identity is found within an open reading frame of 466 amino acids length corresponding to a theoretical molecular mass of 50 587 Da and a pI of 4.3. The apparent molecular mass in an SDS-PAGE gel is 56 kDa.
The amino acid sequence shows similarities to trigger factor from several organisms. Possible chaperone protein.
EXAMPLE 5: Cloning of the genes encoding TB10C, TB13A, TB17, TB11B, TB16, TB16A, TB32, TB51
The genes encoding TB10C, TB13A, TB17, TB11B, TB16, TB16A, TB32, TB51 were all cloned into the E. coli expression vector pMCT3, by PCR amplification with gene specific primers.
Each PCR reaction contained 10 ng of M. tuberculosis chromosomal DNA in 1x low salt Taq+ buffer (Stratagene) supplemented with 250 µM of each of the four nucleotides (Boehringer Mannheim), 0.5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer, and 0.5 unit Taq+ DNA polymerase (Stratagene) in 10 µl reaction volume.
Reactions were initially heated to 94°C for 25 sec. and run for 30 cycles according to the following program: 94°C for 10 sec., 55°C for 10 sec., and 72°C for 90 sec., using thermocycler equipment from Idaho Technology.
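The programmed hold times of this PCR protocol can be summed to estimate the minimum run time. This is a small arithmetic sketch using only the times stated above; it ignores temperature ramping between steps, which depends on the instrument and is not given in the text.

```python
initial_denaturation_s = 25  # 94°C for 25 s before cycling
cycles = 30
steps_s = {
    "denaturation, 94°C": 10,
    "annealing, 55°C": 10,
    "extension, 72°C": 90,
}

cycle_s = sum(steps_s.values())
total_s = initial_denaturation_s + cycles * cycle_s
minutes, seconds = divmod(total_s, 60)

print(f"{cycle_s} s per cycle; {minutes} min {seconds} s of programmed hold time")
```

Extension accounts for 90 of the 110 s in each cycle, so the hold time is dominated by the 72°C step.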
The PCR fragment was ligated with the TA cloning vector pCR 2.1 (Invitrogen) and transformed into E. coli. Plasmid DNA was thereafter prepared from clones harbouring the desired fragment, digested with suitable restriction enzymes and subcloned into the expression vector pMCT3 in frame with 6 histidine residues which are added to the N-terminus of the expressed proteins. The resulting clones were thereafter sequenced by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
Expression and metal affinity purification of recombinant proteins was undertaken essentially as described by the manufacturers. For each protein, 1 l of LB medium containing 100 µg/ml ampicillin was inoculated with 10 ml of an overnight culture of XL1-Blue cells harbouring recombinant pMCT3 plasmids. Cultures were shaken at 37°C until they reached a density of OD600 = 0.4 - 0.6. IPTG was thereafter added to a final concentration of 1 mM and the cultures were further incubated for 4 - 16 hours. Cells were harvested, resuspended in 1x sonication buffer + 8 M urea and sonicated 5 x 30 sec. with 30 sec. pausing between the pulses.
After centrifugation, the lysate was applied to a column containing 10 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers.
After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein concentrations were estimated at OD280 nm. Fractions containing recombinant protein were pooled and dialysed against 3 M urea in 10 mM Tris-HCl, pH 8.5. The dialysed protein was further purified by FPLC (Pharmacia, Sweden) using 1 ml HiTrap columns (Pharmacia, Sweden) eluted with a linear salt gradient from 0 - 1 M NaCl.
Fractions were analysed by SDS-PAGE and protein concentrations were estimated at OD280 nm.
Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
Finally, the protein concentration and the LPS content were determined by the BCA (Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively.
For cloning of the individual proteins, the following gene specific primers were used:
TB10C: Primers used for cloning of TB10C:
TB10C-F: CTG AGA TCT GTG GAG GTC AAG ATC GGT (SEQ ID NO: 58)
TB10C-R: CTC CCA TGG CTAC TTA CCC GCT CGT AGC AAC (SEQ ID NO: 59)
TB10C-F and TB10C-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB13A: Primers used for cloning of TB13A:
TB13A-F: CTG AGA TCT CCT GTC ACT CAG GAA GAA (SEQ ID NO: 60)
TB13A-R: CTC CCA TGG GAA ACC GCC ATT AGC GGT (SEQ ID NO: 61)
TB13A-F and TB13A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB17: Primers used for cloning of TB17:
TB17-F: CCC AAG CTT ATG GAA CAG CGT GCG GAG (SEQ ID NO: 62)
TB17-R: CTC CCA TGG CGA CAC TCG ATC CGG ATT (SEQ ID NO: 63)
TB17-F and TB17-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB11B: Primers used for cloning of TB11B:
TB11B-F: CTG AGA TCT ATG CCA GTG GTG AAG ATC (SEQ ID NO: 64)
TB11B-R: CTC CCA TGG TTA TGC AGT CTT GCC GGT (SEQ ID NO: 65)
TB11B-F and TB11B-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB16: Primers used for cloning of TB16:
TB16-F: CTG AGA TCT GCG GAC AAG ACG ACA CAG (SEQ ID NO: 66)
TB16-R: CTC CCA TGG TAC CGG AAT CAC TCA GCC (SEQ ID NO: 67)
TB16-F and TB16-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB16A: Primers used for cloning of TB16A:
TB16A-F: CTG AGA TCT CCA GTT TTG AGC AAG ACC (SEQ ID NO: 68)
TB16A-R: CTC CCA TGG GCA CAT GCC TTA GCT GGC (SEQ ID NO: 69)
TB16A-F and TB16A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB32: Primers used for cloning of TB32:
TB32-F: CTG AGA TCT ATG TCA TCG GGC AAT TCA (SEQ ID NO: 70)
TB32-R: CTC CCA TGG CTAC CTA AGT CAG CGA CTC GCG (SEQ ID NO: 71)
TB32-F and TB32-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB51: Primers used for cloning of TB51:
TB51-F: CTG AGA TCT GTG AAG AGC ACC GTC GAG (SEQ ID NO: 72)
TB51-R: CTC CCA TGG GTC ATA CGG TCA CGT TGT (SEQ ID NO: 73)
TB51-F and TB51-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB15A: Primers used for cloning of TB15A:
TB15A-F: CTG CCA TGG CTA GGT GGT GTG CAC GAT C (SEQ ID NO: 89)
TB15A-R: CTG AAG CTT ATG AGC GCC TAT AAG ACC (SEQ ID NO: 90)
TB15A-F and TB15A-R create NcoI and HindIII sites, respectively, used for the cloning in pMCT3.
TB21: Primers used for cloning of TB21:
TB21-F: CTG AGA TCT ATG ATT GAT GAG GCT CTC (SEQ ID NO: 91)
TB21-R: CTC CCA TGG AGC GGC CGC TAG ACC TCC (SEQ ID NO: 92)
TB21-F and TB21-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB24: Primers used for cloning of TB24:
TB24-F: GGCTGAGACTC ATG GCC GAC ATC GAT GGT G (SEQ ID NO: 93)
TB24-R: CGTACCATGG TCA TGA CGA CAC CCC CTC GTG (SEQ ID NO: 94)
TB24-F and TB24-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB32A: Primers used for cloning of TB32A:
TB32A-F: GGCTGAGACTC ATG GCT GAA GTA CTG GTG C (SEQ ID NO: 95)
TB32A-R: CGTACCATGGCTA GCC GGC GAC CGC CGG TTC (SEQ ID NO: 96)
TB32A-F and TB32A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB14: Primers used for cloning of TB14:
TB14-F: 5'-GTG ACC GAA CGG ACT CTG GT-3' (SEQ ID NO: 97)
TB14-R: 5'-CTA GGC GCC GGG AAA CCA GAG-3' (SEQ ID NO: 98)
TB18: Primers used for cloning of TB18:
TB18-F: 5'-ATG ACG GAT ACT CAA GTC ACC TG-3' (SEQ ID NO: 99)
TB18-R: 5'-GGA GTG GTA CGG CTC GGC GC-3' (SEQ ID NO: 100)
TB27: Primers used for cloning of TB27:
TB27-F: 5'-ATG ACG TAC GAA ACC ATC CT-3' (SEQ ID NO: 101)
TB27-R: 5'-TCA TCG GTG GGT GAA CTG GGG-3' (SEQ ID NO: 102)
TB33: Primers used for cloning of TB33:
TB33-F: 5'-ATG CCG CTT CCC GCA GAC CCT AG-3' (SEQ ID NO: 103)
TB33-R: 5'-TAC GAC GGG TAC CAC TCC TGG-3' (SEQ ID NO: 104)
TB38: Primers used for cloning of TB38:
TB38-F: 5'-ATG CTG ATC TCA CAG CGC CCC A-3' (SEQ ID NO: 105)
TB38-R: 5'-AAG CTG TTC GGT TTC GGC GTA G-3' (SEQ ID NO: 106)
TB54: Primers used for cloning of TB54:
TB54-F: 5'-ATG ACC GGA AAT TTG GTG AC-3' (SEQ ID NO: 107)
TB54-R: 5'-TCA GTA GCG GTA GTG GTC CGG-3' (SEQ ID NO: 108)
TB14, TB18, TB27, TB33, TB38 and TB54 will be cloned in the expression vector pBAD-TOPO (Invitrogen).
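The restriction sites introduced by the pMCT3 cloning primers can be checked by searching each primer for the recognition sequence of the enzyme. The sketch below does this for four of the primers listed above. The recognition sequences of BglII (AGATCT), NcoI (CCATGG) and HindIII (AAGCTT) are standard; the function find_sites and the selection of primers are illustrative assumptions, and the check says nothing about reading frame or cutting efficiency near primer ends.

```python
# Recognition sequences of the enzymes named in this example.
SITES = {"BglII": "AGATCT", "NcoI": "CCATGG", "HindIII": "AAGCTT"}

# A subset of the primers listed above, written 5' to 3'.
primers = {
    "TB10C-F": "CTG AGA TCT GTG GAG GTC AAG ATC GGT",       # SEQ ID NO: 58
    "TB10C-R": "CTC CCA TGG CTAC TTA CCC GCT CGT AGC AAC",  # SEQ ID NO: 59
    "TB15A-F": "CTG CCA TGG CTA GGT GGT GTG CAC GAT C",     # SEQ ID NO: 89
    "TB15A-R": "CTG AAG CTT ATG AGC GCC TAT AAG ACC",       # SEQ ID NO: 90
}


def find_sites(primer: str) -> list[str]:
    """Return the names of all recognition sequences present in the primer."""
    seq = primer.replace(" ", "").upper()
    return [name for name, site in SITES.items() if site in seq]


for name, seq in primers.items():
    print(f"{name}: {', '.join(find_sites(seq)) or 'none found'}")
```

Running the same check over the remaining primers is a quick way to confirm that each primer carries the site stated for it.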
Example 5a: Cloning of the genes encoding TB12.5, TB20.6, and TB40.8
The genes encoding TB12.5, TB20.6, and TB40.8 were all cloned into the E. coli expression vector pMCT3 as described in Example 5.
For cloning of the individual genes, the following gene specific primers were used:
TB12.5: Primers used for cloning of TB12.5:
TB12.5-F: CTG AGA TCT ATG GCA CTC AAG GTA GAG (SEQ ID NO: 83)
TB12.5-R: CTC CCA TGG TTA TTG ACC CGC CAC GCA (SEQ ID NO: 84)
TB12.5-F and TB12.5-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB20.6: Primers used for cloning of TB20.6:
TB20.6-F: CTG AGA TCT ATG GCC GAC GCT GAC ACC (SEQ ID NO: 85)
TB20.6-R: CTC CCA TGG CTA GTC GCG GAG CAC AAC (SEQ ID NO: 86)
TB20.6-F and TB20.6-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB40.8: Primers used for cloning of TB40.8:
TB40.8-F: CTG AGA TCT ATG AGC AAG ACG GTT CTC (SEQ ID NO: 87)
TB40.8-R: CTC CCA TGG TCA CGT CTT CCA GCG GGT (SEQ ID NO: 88)
TB40.8-F and TB40.8-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
Expression/purification of recombinant proteins was performed as described in Example 5.
EXAMPLE 6: Evaluation of immunological activity of identified somatic proteins.
Each of the proteins identified in either the cell wall, cytosol or the cell membrane derived from M. tuberculosis will be evaluated for immunological recognition in M. tuberculosis infected animals or in TB patients.
IFN-γ induction in the mouse model of TB infection
The recognition of an antigen by IFN-γ producing T cells in M. tuberculosis infected animals or in TB patients is presently believed to be the most relevant correlate of protective immunity.
We will therefore evaluate the ability of the polypeptides of the invention to induce IFN-γ production in mice of four different haplotypes during a primary infection: 8-12 week old female C57BL/6j (H-2b), CBA/J (H-2k), DBA/2 (H-2d) and A.SW (H-2s) mice (Bomholtgaard, Ry, Denmark) will be infected i.v. via the lateral tail vein with an inoculum of 5 x 10^4 M. tuberculosis suspended in PBS in a vol. of 0.1 ml. 14 days postinfection the animals will be sacrificed and spleen cells isolated and tested for proliferation and IFN-γ release in response to stimulation with the recombinantly produced proteins.
As a specific model we will analyse the recognition of the purified polypeptides of the invention in the mouse model of memory immunity to TB: a group of efficiently protected mice will be generated by infecting 8-12 week old female C57BL/6j mice with 5 x 10^4 M. tuberculosis i.v. After 30 days of infection the mice will be subjected to 60 days of antibiotic treatment with isoniazid (Merck and Co., Rahway, NJ) and rifabutin (Farmitalia Carlo Erba, Milano, Italy), then left for 200-240 days to ensure the establishment of resting long-term memory immunity. Such memory immune mice are very efficiently protected against a secondary infection (Orme, Andersen, Boom 1993, J. Infect. Dis. 167:1497). Long lasting immunity in this model is mediated by a population of highly reactive CD4 cells recruited to the site of infection and triggered to produce large amounts of IFN-γ in response to M. tuberculosis antigens.
This model will be used to identify single antigens recognised by protectiveT
cells.
Memory immune mice will be reinfected with 1 x 106 M.tuberculosis i.v and splenic lymphocytes harvested at day 4-6 of reinfection and proliferation and the amount of IFN-y produced in response to any of the recombinantly produced proteins will be evaluated.
IFN-γ induction in humans during infection with virulent Mycobacteria.
IFN-γ is currently believed to be the best marker of protective immunity in humans. In patients with limited tuberculosis, high levels of IFN-γ can be induced, in contrast to patients with severe TB, who often respond with low levels of IFN-γ (Boesen et al (1995), Human T-cell response to secreted antigen fractions of M. tuberculosis. Infection and Immunity 63(4):1491-1497). Furthermore, IFN-γ release has been shown to correlate inversely with the severity of disease as determined by X-ray findings (Sodhi A, et al (1997) Clinical correlates of IFN-gamma production in patients with Tuberculosis, Clinical Infectious Diseases 25: 617-620). Healthy exposed contacts of sputum positive TB patients also produce very high levels of IFN-γ in response to mycobacterial antigens (unpublished, manuscript in preparation), indicative of early, subclinical infection.
Together these findings indicate that those individuals who are relatively protected (i.e. minimal TB patients) respond with high levels of IFN-γ. The ability of the polypeptides to induce IFN-γ release in cultures of PBMC or whole blood from 20 PPD responsive patients with microscopy or culture proven TB (0-6 months after diagnosis), exposed household contacts, or BCG vaccinated individuals from different geographical regions will be evaluated. Evaluation of donors from different geographical regions will enable us to take into account the influence of, e.g., exposure to virulent Mycobacterium or NTM (Non-Tuberculous Mycobacteria) and different genetic backgrounds. The most important selection criterion for vaccine candidates is that the polypeptide is recognised by >30% of the donors with a level of IFN-γ >30% of that induced by a crude antigen preparation such as ST-CF, PPD or SPE.
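The donor-frequency criterion above can be illustrated with a short sketch. It is not the applicants' procedure: it assumes that each donor's response to a polypeptide is compared with that same donor's response to the crude preparation, which the text does not state explicitly. The function names and the example values are hypothetical.

```python
from typing import Sequence


def recognition_frequency(
    polypeptide_ifn: Sequence[float],
    crude_ifn: Sequence[float],
    relative_cutoff: float = 0.30,
) -> float:
    """Fraction of donors whose IFN-gamma response (pg/ml) to the polypeptide
    exceeds relative_cutoff times their response to the crude preparation.

    Assumption: the comparison is made donor by donor.
    """
    if not polypeptide_ifn or len(polypeptide_ifn) != len(crude_ifn):
        raise ValueError("one paired measurement per donor is required")
    recognised = sum(
        1
        for poly, crude in zip(polypeptide_ifn, crude_ifn)
        if crude > 0 and poly > relative_cutoff * crude
    )
    return recognised / len(polypeptide_ifn)


def is_vaccine_candidate(
    polypeptide_ifn: Sequence[float],
    crude_ifn: Sequence[float],
    relative_cutoff: float = 0.30,
    min_donor_fraction: float = 0.30,
) -> bool:
    """Apply both >30% thresholds named in the text."""
    frequency = recognition_frequency(polypeptide_ifn, crude_ifn, relative_cutoff)
    return frequency > min_donor_fraction


# Hypothetical IFN-gamma values (pg/ml) for five donors, for illustration only.
example_polypeptide = [1200.0, 100.0, 2500.0, 0.0, 900.0]
example_crude = [3000.0, 4000.0, 5000.0, 2000.0, 6000.0]
print(recognition_frequency(example_polypeptide, example_crude))
print(is_vaccine_candidate(example_polypeptide, example_crude))
```

Both thresholds are kept as parameters so that a different crude preparation or a stricter cut-off can be tried without changing the logic.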
Cultures will be established with 1 to 2 x 10^5 PBMC in 200 µl in microtiter plates (Nunc, Roskilde, Denmark) or with 1 ml of serum or plasma stimulated with the identified polypeptide, and the IFN-γ release will be measured by ELISA.
Frequently recognised polypeptides of the invention will be preferred.
The use of polypeptides as diagnostic reagents:
A polypeptide has diagnostic potential in humans when it induces significantly higher responses in patients with microscopy or culture positive tuberculosis than in PPD positive or PPD negative individuals with no known history of TB infection or exposure to M. tuberculosis, who may or may not have received a prior BCG vaccination, have been exposed to non-tuberculous mycobacteria (NTM), or be actively infected with M. avium. To identify polypeptides capable of discriminating between the above mentioned groups, the level of response and the frequency of positive responders to the polypeptide are compared. A positive responder is defined by i) in vitro IFN-γ release by PBMC or whole blood stimulated with the polypeptide of at least 3-500 pg/ml above background, or another cut-off relating to the specific test kit used, ii) reactivity of human serum or plasma from TB patients with the polypeptide using conventional antibody ELISA/Western blot, or iii) an in vivo delayed type hypersensitivity response to the polypeptide which is at least 5 mm higher than the response induced by a control material.
The diagnostic potential of polypeptides will initially be evaluated in 10 individuals with TB infection and 10 individuals with no known exposure to virulent Mycobacteria.
High specificity, >80%, will be the most important selection criterion for these polypeptides. A sensitivity >80% is desirable, but a sensitivity >30% is acceptable, as combinations of several specific antigens recognised by different individuals may be preferred in a cocktail of diagnostic reagents.
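As a minimal sketch of how the specificity and sensitivity thresholds above could be applied to the initial panel of 10 TB patients and 10 unexposed controls, consider the following. The function names and the example calls are hypothetical. The IFN-γ cut-off is passed in explicitly because the text ties it to the test kit used.

```python
from typing import Sequence


def is_ifn_positive(stimulated_pg_ml: float, background_pg_ml: float,
                    cutoff_pg_ml: float) -> bool:
    """Positive when release above background reaches the chosen cut-off."""
    return stimulated_pg_ml - background_pg_ml >= cutoff_pg_ml


def sensitivity(tb_patient_calls: Sequence[bool]) -> float:
    """Fraction of TB patients called positive."""
    return sum(1 for call in tb_patient_calls if call) / len(tb_patient_calls)


def specificity(control_calls: Sequence[bool]) -> float:
    """Fraction of unexposed controls called negative."""
    return sum(1 for call in control_calls if not call) / len(control_calls)


def passes_initial_screen(
    tb_patient_calls: Sequence[bool],
    control_calls: Sequence[bool],
    min_specificity: float = 0.80,
    min_sensitivity: float = 0.30,
) -> bool:
    """Specificity >80% is required; sensitivity >30% is the acceptable floor."""
    return (specificity(control_calls) > min_specificity
            and sensitivity(tb_patient_calls) > min_sensitivity)


# Hypothetical calls: 4 of 10 patients positive, 1 of 10 controls positive.
tb_calls = [True] * 4 + [False] * 6
control_calls = [False] * 9 + [True]
print(sensitivity(tb_calls), specificity(control_calls))
print(passes_initial_screen(tb_calls, control_calls))
```

With only 10 individuals per group each estimate moves in steps of 10%, so a polypeptide near a threshold would need a larger panel before any conclusion is drawn.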
Skin test reaction in TB infected guinea pigs
To identify polypeptides with potential as TB diagnostic reagents, the ability of the proteins to induce a skin test response will be evaluated in the guinea pig model, where groups of guinea pigs have been infected with either M. tuberculosis or M. avium or vaccinated with BCG.
To evaluate the response in M. tuberculosis infected guinea pigs, female outbred guinea pigs will be infected via an ear vein with 1 x 10^4 CFU of M. tuberculosis H37Rv in 0.2 ml of PBS or aerosol infected (in an exposure chamber of a Middlebrook Aerosol Generation device) with 1 x 10^5 CFU/ml of M. tuberculosis Erdman, giving rise to 10-15 granulomas per animal in the lung. After 4 weeks a skin test will be performed with the polypeptides diluted in 0.1 ml of PBS, and the reaction diameter will be measured 24 hours after the injection.
To evaluate the response in M. avium infected guinea pigs, female outbred guinea pigs will be infected intradermally with 2 x 10^6 CFU of a clinical isolate of M. avium (Atyp.1443; Statens Serum Institut, Denmark). Skin tests will be performed 4 weeks later with the polypeptides diluted in 0.1 ml of PBS, and the reaction diameter will be measured 24 hours after the injection.
To evaluate the response in BCG vaccinated guinea pigs, female outbred guinea pigs will be sensitized intradermally with 2 x 10^6 CFU of BCG (BCG Danish 1331; Statens Serum Institut). Skin tests will be performed 4 weeks later with the polypeptides diluted in 0.1 ml of PBS, and the reaction diameter will be measured 24 hours after the injection.
If a polypeptide induces a significant reaction in animals infected with M. tuberculosis but not in BCG vaccinated guinea pigs, this polypeptide may have potential as a diagnostic reagent to differentiate between BCG vaccinated and M. tuberculosis infected individuals, which will thereafter be evaluated in the human population.
If a polypeptide induces a reaction in M. tuberculosis infected guinea pigs but not in guinea pigs infected with M. avium, this polypeptide may have potential as a diagnostic reagent to differentiate between an individual infected with M. tuberculosis and an individual infected with Mycobacteria not belonging to the tuberculosis complex.
The polypeptide may also have potential as a diagnostic reagent to differentiate between an M. avium infected and an M. tuberculosis infected individual.
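The decision logic of the guinea pig skin test comparison can be written out as a small sketch. The text does not define what counts as a "significant reaction" in guinea pigs, so the threshold is a required parameter here, and the function name and the example diameters are hypothetical.

```python
from typing import List


def discriminating_uses(
    mtb_mm: float,
    bcg_mm: float,
    mavium_mm: float,
    significant_mm: float,
) -> List[str]:
    """List the discriminations a polypeptide may support, given the mean
    skin test reaction diameter (mm) in each guinea pig group.

    significant_mm is an assumed threshold chosen by the user.
    """
    uses: List[str] = []
    reacts_in_mtb = mtb_mm >= significant_mm
    if reacts_in_mtb and bcg_mm < significant_mm:
        uses.append("M. tuberculosis infection vs BCG vaccination")
    if reacts_in_mtb and mavium_mm < significant_mm:
        uses.append("M. tuberculosis infection vs M. avium infection")
    return uses


# Hypothetical mean diameters (mm), for illustration only.
print(discriminating_uses(12.0, 2.0, 3.0, significant_mm=5.0))
print(discriminating_uses(12.0, 10.0, 2.0, significant_mm=5.0))
```

A polypeptide that reacts in all three groups returns an empty list, which corresponds to an antigen that does not discriminate between them.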
Induction of protective immunity by the recombinant proteins in the mouse model.
The recombinant polypeptides will be evaluated as immunological compositions in mice.
Female C57BL/6j mice, 6-8 weeks old (Bomholtgaard, Denmark), will be immunised subcutaneously at the base of the tail with the recombinantly produced polypeptides with DDA as adjuvant. The mice will be vaccinated with a volume of 0.2 ml a total of three times, with a two-week interval between immunisations. One week after the last immunisation the mice will be bled and the blood cells isolated. The immune response induced will be monitored by release of IFN-γ into the culture supernatant when the cells are stimulated in vitro with the homologous proteins.
6 weeks after the last immunisation the mice will be aerosol challenged with 5.5 ml of 5 x 10^6 viable M. tuberculosis/ml. After 6 weeks of infection the mice will be killed and the number of viable bacteria in lung and spleen determined by plating serial 3-fold dilutions of organ homogenates on 7H11 plates. Colonies will be counted after 2-3 weeks of incubation and the level of protection induced by each single polypeptide will be determined.
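The arithmetic behind counting colonies from serial 3-fold dilutions can be sketched as follows. The plated volume, the homogenate volume and the colony counts are hypothetical, since the text does not give them. Expressing protection as a log10 reduction relative to unvaccinated controls is also an assumption, because the text does not state how the level of protection is calculated.

```python
import math


def cfu_per_organ(
    colonies: int,
    dilution_step: int,
    plated_volume_ml: float,
    homogenate_volume_ml: float,
    fold: float = 3.0,
) -> float:
    """Back-calculate viable bacteria per organ from one counted plate.

    dilution_step 0 is the undiluted homogenate; each further step dilutes
    the previous one `fold` times (3-fold in the text).
    """
    if colonies < 0 or dilution_step < 0:
        raise ValueError("colonies and dilution_step must be non-negative")
    dilution_factor = fold ** dilution_step
    cfu_per_ml = colonies * dilution_factor / plated_volume_ml
    return cfu_per_ml * homogenate_volume_ml


def log10_reduction(control_cfu: float, vaccinated_cfu: float) -> float:
    """Assumed protection measure: log10 CFU in controls minus vaccinated."""
    return math.log10(control_cfu) - math.log10(vaccinated_cfu)


# Hypothetical plate: 50 colonies at the 4th 3-fold dilution,
# 0.1 ml plated from a 3 ml organ homogenate.
print(cfu_per_organ(50, 4, plated_volume_ml=0.1, homogenate_volume_ml=3.0))
print(log10_reduction(control_cfu=1e6, vaccinated_cfu=1e5))
```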
Example 6a: Interferon-γ induction in human TB patients and BCG vaccinated
Human donors: PBMC were obtained from healthy BCG vaccinated donors with no known exposure to M. tuberculosis and from patients with culture or microscopy proven infection with TB. Blood samples were drawn from the TB patients 0-6 months after diagnosis of tuberculosis, and from the BCG vaccinated donors 20 months to 40 years after BCG vaccination.
Lymphocyte preparations and cell culture: PBMC were freshly isolated by gradient centrifugation of heparinized blood on Lymphoprep (Nycomed, Oslo, Norway) and stored in liquid nitrogen until use. The cells were resuspended in complete RPMI 1640 medium (Gibco, Grand Island, N.Y.) supplemented with 1 % penicillin/streptomycin (Gibco BRL, Life Technologies), 1 % non-essential amino acids (FLOW, ICN Biomedicals, CA, USA), and 10% normal human ABO serum (NHS) from the local blood bank. The number and the viability of the cells were determined by Nigrosin staining. Cultures were established with 1.25 x 10^5 PBMCs in 100 µl in microtitre plates (Nunc, Roskilde, Denmark) and stimulated with ST-CF (5 µg/ml), or with TB13A, TB15A, TB17, TB18, TB33, TB11B, TB16A, TB16, TB32, and TB51 at a final concentration of 10 µg/ml. No antigen and phytohaemagglutinin (PHA) were used as negative and positive controls, respectively.
Supernatants for the detection of cytokines were harvested after 5 days of culture, pooled, and stored at -80°C until used.
Cytokine analysis: Interferon-γ (IFN-γ) was detected with a standard sandwich ELISA technique using a commercially available pair of monoclonal antibodies (Endogen), used according to the manufacturer's instructions. Recombinant IFN-γ (Endogen) was used as a standard. All data are means of duplicate wells, and the variation between wells did not exceed 10 % of the mean. Cytokine levels below 50 pg/ml were considered negative.
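The handling of duplicate wells described above can be sketched as follows. The sketch assumes that "variation between wells" means the absolute difference between the two wells relative to their mean; the text does not define it more precisely. The class and function names and the example readings are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class WellSummary:
    mean_pg_ml: float
    within_tolerance: bool
    positive: bool


def summarise_duplicate_wells(
    well_a: float,
    well_b: float,
    negative_below: float = 50.0,
    max_relative_variation: float = 0.10,
) -> WellSummary:
    """Average two duplicate ELISA wells (pg IFN-gamma/ml), check that they
    agree within 10 % of the mean, and apply the 50 pg/ml negativity cut-off.
    """
    mean = (well_a + well_b) / 2.0
    if mean > 0:
        within = abs(well_a - well_b) / mean <= max_relative_variation
    else:
        within = well_a == well_b
    return WellSummary(mean, within, mean >= negative_below)


# Hypothetical readings, for illustration only.
print(summarise_duplicate_wells(1000.0, 1050.0))
print(summarise_duplicate_wells(30.0, 32.0))
print(summarise_duplicate_wells(100.0, 200.0))
```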
Responses of 10 individual donors are shown in TABLE 3.
As shown in Tables 3 to 12, a marked release of IFN-γ is observed after stimulation with some of the recombinant proteins. For 50% of the donors, stimulation with TB18, TB32, and TB51 gives rise to high IFN-γ responses (> 1,000 pg/ml). Less than 1/3 of the donors recognised TB15A and TB11B at this level. Between 30 and 70% of the donors show an intermediate IFN-γ response (> 500 pg/ml) when stimulated with TB17 and TB16A, whereas only a limited response was obtained with TB13A, TB33, and TB16. However, TB13A, TB33 and TB16 may still be of immunological importance and meet some of the other properties of the present invention, as demonstrated e.g. for TB33, which is recognised by a pool of sera from human TB patients.
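The response levels used in this summary can be expressed as a simple classification. The thresholds for "high" (> 1,000 pg/ml), "intermediate" (> 500 pg/ml) and negative (< 50 pg/ml) are taken from the text. Labelling everything between 50 and 500 pg/ml as "limited" is an assumption made for this sketch, and the donor values are hypothetical.

```python
from collections import Counter
from typing import Sequence


def classify_ifn_response(ifn_pg_ml: float) -> str:
    """Map an IFN-gamma level (pg/ml) to a response category."""
    if ifn_pg_ml > 1000:
        return "high"
    if ifn_pg_ml > 500:
        return "intermediate"
    if ifn_pg_ml >= 50:
        return "limited"  # assumed label for 50-500 pg/ml
    return "negative"


def fraction_above(values: Sequence[float], threshold_pg_ml: float) -> float:
    """Fraction of donors whose response exceeds the threshold."""
    return sum(1 for value in values if value > threshold_pg_ml) / len(values)


# Hypothetical responses of six donors to one polypeptide (pg/ml).
donor_values = [1500.0, 700.0, 40.0, 2500.0, 300.0, 1200.0]
print(Counter(classify_ifn_response(value) for value in donor_values))
print(fraction_above(donor_values, 1000.0))
```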
Table 3. Stimulation of PBMCs from 6 healthy BCG vaccinated and 4 TB patients with recombinant TB13A. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB13A (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB13A (10 µg/ml)
2       51      10058           64$9              0

Table 4. Stimulation of PBMCs from 6 healthy BCG vaccinated and 5 TB patients with recombinant TB15A. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB15A (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB15A (10 µg/ml)

Table 5. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB17. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB17 (10 µg/ml)

Table 6. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with recombinant TB18. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB18 (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB18 (10 µg/ml)

Table 7. Stimulation of PBMCs from 5 healthy BCG vaccinated and 6 TB patients with recombinant TB33. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB33 (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB33 (10 µg/ml)

Table 8. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with recombinant TB11B. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB11B (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB11B (10 µg/ml)

Table 9. Stimulation of PBMCs from 2 healthy BCG vaccinated and 5 TB patients with recombinant TB16A. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB16A (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB16A (10 µg/ml)

Table 10. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB16. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB16 (10 µg/ml)

Table 11. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with recombinant TB32. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB32 (10 µg/ml)

TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB32 (10 µg/ml)

Table 12. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB51. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.

BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB51 (10 µg/ml)
Figure legends:
Figure 1:
Long term protection against TB can be induced by immunisation with dead M. tuberculosis.
Mice received either: three immunisations with 1x10' CFU of dead M. tuberculosis H37Rv (squares); three immunisations with 50 µg of ST-CF (triangles); or one immunisation with 5 x 10^4 CFU of live M. tuberculosis H37Rv (circles), after which they were cleared of the infection by administration of isoniazid in the drinking water. At 3, 6 and 12 months after the last immunisation the mice received an infection with M. tuberculosis H37Rv, and two weeks later the bacterial load and the resistance against TB in the spleens were determined.
Figure 2:
Mice received three immunisations with 50 µg of one of the three vaccines (heat killed H37Rv, SPE or ST-CF) or received a vaccination with BCG. Two weeks after a primary infection the bacterial load in the spleen was used to determine the resistance against TB.
SEQUENCE LISTING
<110> Statens Serum Institute
<120> TB vaccine and diagnostic based antigens from the M.tuberculosis cell
<130> 21868PC1
<160> 108
<170> FastSEQ for Windows Version 3.0

<210> 1
<211> 273
<212> DNA
<213> M.Tuberculosis

<220>
<221> CDS
<222> (1)...(270)

<400> 1
gtg gag gtc aag atc ggt atc acg gac agt ccg cgc gag ctg gtg ttc       48
Val Glu Val Lys Ile Gly Ile Thr Asp Ser Pro Arg Glu Leu Val Phe
tcc agt gcg cag acg ccc agt gag gta gaa gaa ctc gtc agc aac gcg       96
Ser Ser Ala Gln Thr Pro Ser Glu Val Glu Glu Leu Val Ser Asn Ala
ctg cgc gac gac tct ggt ttg ctg acc ctg acc gac gag cgg ggc cgt      144
Leu Arg Asp Asp Ser Gly Leu Leu Thr Leu Thr Asp Glu Arg Gly Arg
cgc ttc cta att cac acc gcc agg atc gcc tat gtc gag atc ggt gtc      192
Arg Phe Leu Ile His Thr Ala Arg Ile Ala Tyr Val Glu Ile Gly Val
gca gac gcc cgc cgg gtg ggc ttc ggc gtc ggg gtg gac gcc gca gct      240
Ala Asp Ala Arg Arg Val Gly Phe Gly Val Gly Val Asp Ala Ala Ala
ggg tcc gcc gga aag gtt gct acg agc ggg taa                          273
Gly Ser Ala Gly Lys Val Ala Thr Ser Gly

<210> 2
<211> 90
<212> PRT
<213> M.Tuberculosis

<400> 2
Met Glu Val Lys Ile Gly Ile Thr Asp Ser Pro Arg Glu Leu Val Phe
Ser Ser Ala Gln Thr Pro Ser Glu Val Glu Glu Leu Val Ser Asn Ala
Leu Arg Asp Asp Ser Gly Leu Leu Thr Leu Thr Asp Glu Arg Gly Arg
Arg Phe Leu Ile His Thr Ala Arg Ile Ala Tyr Val Glu Ile Gly Val
Ala Asp Ala Arg Arg Val Gly Phe Gly Val Gly Val Asp Ala Ala Ala
Gly Ser Ala Gly Lys Val Ala Thr Ser Gly

<210> 3
<211> 348
<212> DNA
<213> M.Tuberculosis

<220>
<221> CDS
<222> (1)...(345)

<400> 3
gtg cct gtc act cag gaa gaa atc att gcc ggt atc gcc gag atc atc       48
Val Pro Val Thr Gln Glu Glu Ile Ile Ala Gly Ile Ala Glu Ile Ile
gaa gag gta acc ggt atc gag ccg tcc gag atc acc ccg gag aag tcg       96
Glu Glu Val Thr Gly Ile Glu Pro Ser Glu Ile Thr Pro Glu Lys Ser
ttc gtc gac gac ctg gac atc gac tcg ctg tcg atg gtc gag atc gcc      144
Phe Val Asp Asp Leu Asp Ile Asp Ser Leu Ser Met Val Glu Ile Ala
gtg cag acc gag gac aag tac ggc gtc aag atc ccc gac gag gac ctc      192
Val Gln Thr Glu Asp Lys Tyr Gly Val Lys Ile Pro Asp Glu Asp Leu
gcc ggt ctg cgt acc gtc ggt gac gtt gtc gcc tac atc cag aag ctc      240
Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala Tyr Ile Gln Lys Leu
gag gaa gaa aac ccg gag gcg gct cag gcg ttg cgc gcg aag att gag      288
Glu Glu Glu Asn Pro Glu Ala Ala Gln Ala Leu Arg Ala Lys Ile Glu
tcg gag aac ccc gat gcc gtt gcc aac gtt cag gcg agg ctt gag gcc      336
Ser Glu Asn Pro Asp Ala Val Ala Asn Val Gln Ala Arg Leu Glu Ala
gag tcc aag tga                                                      348
Glu Ser Lys

<210> 4
<211> 115
<212> PRT
<213> M.Tuberculosis

<400> 4
Met Pro Val Thr Gln Glu Glu Ile Ile Ala Gly Ile Ala Glu Ile Ile
Glu Glu Val Thr Gly Ile Glu Pro Ser Glu Ile Thr Pro Glu Lys Ser
Phe Val Asp Asp Leu Asp Ile Asp Ser Leu Ser Met Val Glu Ile Ala
Val Gln Thr Glu Asp Lys Tyr Gly Val Lys Ile Pro Asp Glu Asp Leu
Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala Tyr Ile Gln Lys Leu
Glu Glu Glu Asn Pro Glu Ala Ala Gln Ala Leu Arg Ala Lys Ile Glu
Ser Glu Asn Pro Asp Ala Val Ala Asn Val Gln Ala Arg Leu Glu Ala
Glu Ser Lys

<210> 5
<211> 411
<212> DNA
<213> M.Tuberculosis

<220>
<221> CDS
<222> (1)...(408)

<400> 5
gtg acc gaa cgg act ctg gta ctg atc aag ccg gat ggc atc gaa agg       48
Val Thr Glu Arg Thr Leu Val Leu Ile Lys Pro Asp Gly Ile Glu Arg
cag ctg atc ggc gag atc atc agc cgc atc gag cgc aaa ggc ctc acc       96
Gln Leu Ile Gly Glu Ile Ile Ser Arg Ile Glu Arg Lys Gly Leu Thr
atc gct gcg ctg cag ctc agg acc gtc agc gcg gag ttg gcc agc cag      144
Ile Ala Ala Leu Gln Leu Arg Thr Val Ser Ala Glu Leu Ala Ser Gln
cac tac gcc gaa cat gaa ggc aaa cca ttc ttt gga tcg ttg ctg gag      192
His Tyr Ala Glu His Glu Gly Lys Pro Phe Phe Gly Ser Leu Leu Glu
ttc atc acg tcg ggt ccg gtg gta gcg gcg atc gtg gag gga acc cga      240
Phe Ile Thr Ser Gly Pro Val Val Ala Ala Ile Val Glu Gly Thr Arg
gcc atc gcg gcg gtt cgc caa ctc gcc ggc ggc acc gac ccg gtg cag      288
Ala Ile Ala Ala Val Arg Gln Leu Ala Gly Gly Thr Asp Pro Val Gln
gcg gcg gcg ccc ggc aca atc cgg ggc gac ttc gct cta gag acg cag      336
Ala Ala Ala Pro Gly Thr Ile Arg Gly Asp Phe Ala Leu Glu Thr Gln
ttc aac ctg gtg cac ggg tct gat tcg gcc gaa tcc gcg cag cgc gaa      384
Phe Asn Leu Val His Gly Ser Asp Ser Ala Glu Ser Ala Gln Arg Glu
atc gcg ctc tgg ttt ccc ggc gcc tag                                  411
Ile Ala Leu Trp Phe Pro Gly Ala

<210> 6
<211> 136
<212> PRT
<213> M.Tuberculosis

<400> 6
Met Thr Glu Arg Thr Leu Val Leu Ile Lys Pro Asp Gly Ile Glu Arg
Gln Leu Ile Gly Glu Ile Ile Ser Arg Ile Glu Arg Lys Gly Leu Thr
Ile Ala Ala Leu Gln Leu Arg Thr Val Ser Ala Glu Leu Ala Ser Gln
His Tyr Ala Glu His Glu Gly Lys Pro Phe Phe Gly Ser Leu Leu Glu
Phe Ile Thr Ser Gly Pro Val Val Ala Ala Ile Val Glu Gly Thr Arg
Ala Ile Ala Ala Val Arg Gln Leu Ala Gly Gly Thr Asp Pro Val Gln
Ala Ala Ala Pro Gly Thr Ile Arg Gly Asp Phe Ala Leu Glu Thr Gln
Phe Asn Leu Val His Gly Ser Asp Ser Ala Glu Ser Ala Gln Arg Glu
Ile Ala Leu Trp Phe Pro Gly Ala

<210> 7
<211> 441
<212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(438) <400> 7 atgagcgcc tataagacc gtggtggta ggaaccgac ggttcggac tcg 48 MetSerAla TyrLysThr ValValVal GlyThrAsp GlySerAsp Ser tcgatgcga gcggtagat cgcgetgcc cagatcgcc ggcgcagac gcc 96 SerMetArg AlaValAsp ArgAlaAla GlnIleAla GlyAlaAsp Ala aagttgatc atcgcctcg gcataccta cctcagcac gaggacget cgc 144 LysLeuIle IleAlaSer AlaTyrLeu ProGlnHis GluAspAla Arg gccgccgac attctgaag gacgaaagc tacaaggtg acgggcacc gcc 192 AlaAlaAsp IleLeuLys AspGluSer TyrLysVal ThrGlyThr Ala ccgatctac gagatcttg cacgacgcc aaggaacga gcgcacaac gcc 240 ProIleTyr GluIleLeu HisAspAla LysGluArg AlaHisAsn Ala ggtgcgaaa aacgtcgag gaacggccg atcgtcggc gccccggtc gac 288 GlyAlaLys AsnValGlu GluArgPro IleValGly AlaProVal Asp gcgttggtg aacctggcc gatgaggag aaggcggac ctgctggtc gtc 336 AlaLeuVal AsnLeuAla AspGluGlu LysAlaAsp LeuLeuVal Val ggc aat gtc ggt ctg agc acg atc gcg ggt cgg ctg ctc gga tcg gta 384 Gly Asn Val Gly Leu Ser Thr Ile Ala Gly Arg Leu Leu Gly Ser Val ccg gcc aat gtg tca cgc cgg gcc aag gtc gac gtg ctg atc gtg cac 432 Pro Ala Asn Val Ser Arg Arg Ala Lys Val Asp Val Leu Ile Val His acc acc tag 441 Thr Thr <210> 8 <211> 146 <212> PRT
<213> M.Tuberculosis <900> 8 Met Ser Ala Tyr Lys Thr Val Val Val Gly Thr Asp Gly Ser Asp Ser Ser Met Arg Ala Val Asp Arg Ala Ala Gln Ile Ala Gly Ala Asp Ala Lys Leu Ile Ile Ala Ser Ala Tyr Leu Pro Gln His Glu Asp Ala Arg Ala Ala Asp Ile Leu Lys Asp Glu Ser Tyr Lys Val Thr Gly Thr Ala Pro Ile Tyr Glu Ile Leu His Asp AIa Lys Glu Arg Ala His Asn Ala Gly Ala Lys Asn Val Glu Glu Arg Pro Ile Val Gly Ala Pro Val Asp Ala Leu Val Asn Leu Ala Asp Glu Glu Lys Ala Asp Leu Leu Val Val Gly Asn Val Gly Leu Ser Thr Ile Ala Gly Arg Leu Leu Gly Ser Val Pro Ala Asn Val Ser Arg Arg Ala Lys Val Asp Val Leu Ile Val His Thr Thr <210> 9 <211> 998 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(495) <400> 9 atg gaa cag cgt gcg gag ttg gtg gtt ggc cgg gca ctt gtc gtc gtc 48 Met Glu Gln Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val Val gtt gac gat cgc acg gcg cac ggc gat gaa gac cac agc ggg ccg ctt 96 Val Asp Asp Arg Thr Ala His Gly Asp Glu Asp His Ser Gly Pro Leu gtc acc gag ctg ctc acc gag gcc ggg ttt gtt gtc gac ggc gtg gtg 149 Val Thr Glu Leu Leu Thr Glu Ala Gly Phe Val Val Asp Gly Val Val gcg gtg tcg gcc gac gag gtc gag atc cga aat gcg ctg aac aca gcg 192 Ala Val Ser Ala Asp Glu Val Glu Ile Arg Asn Ala Leu Asn Thr Ala gtg atc ggc ggg gtg gac ctg gtg gtg tcg gtc ggc ggg acc ggg gtg 240 Val Ile Gly Gly Val Asp Leu Val Val Ser Val Gly Gly Thr Gly Val acg cctcgcgat gtcaccccg gaagccacc cgcgac attctggaccgc 288 Thr ProArgAsp ValThrPro GluAlaThr ArgAsp IleLeuAspArg gag atcctcggt atcgccgag gccatccgc gcgtcc gggctgtccgcg 336 Glu IleLeuGly IleAlaGlu AlaIleArg AlaSer GlyLeuSerAla gga atcgtcgac gccgggttg tcgcgcggc ctggcg ggtgtctccggc 384 Gly IleValAsp AlaGlyLeu SerArgGly LeuAl.aGlyValSerGly agc acgctggtg gtcaacctc gcgggttcg cgttat gcggtgcgcgat 432 Ser ThrLeuVal ValAsnLeu AlaGlySer ArgTyr AlaValArgAsp gga atggcgacg ctgaatccg ctagcggca cagat:catcgggcagttg 480 Gly MetAlaThr LeuAsnPro LeuAlaAla GlnIle IleGlyGlnLeu tcg agcttggag atctga 498 Ser SerLeuGlu Ile <210> 10 <211> 165 <212> PRT
<213> M.Tuberculosis <400> 10 Met Glu Gln Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val Val Val Asp Asp Arg Thr Ala His Gly Asp Glu Asp H:is Ser Gly Pro Leu Val Thr Glu Leu Leu Thr Glu Ala Gly Phe Val Val Asp Gly Val Val Ala Val Ser Ala Asp Glu Val Glu Ile Arg Asn Ala Leu Asn Thr Ala Val Ile Gly Gly Val Asp Leu Val Val Ser Val Gly Gly Thr Gly Val Thr Pro Arg Asp Val Thr Pro Glu Ala Thr Arg Asp Ile Leu Asp Arg Glu Ile Leu Gly Ile Ala Glu Ala Ile Arg Ala Ser Gly Leu Ser Ala Gly Ile Val Asp Ala Gly Leu Ser Arg Gly Leu Ala Gly Val Ser Gly Ser Thr Leu Val Val Asn Leu Ala Gly Ser Arg Tyr Ala Val Arg Asp Gly Met Ala Thr Leu Asn Pro Leu Ala Ala Gln Ile Ile Gly Gln Leu SerSer Glu Ile Leu <210> 11 <211> 495 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(992) <400> 11 atgacg act caagtcacc tggttgacc caagagtca catgac cga 48 gat MetThr Thr GlnValThr TrpLeuThr GlnGluSer HisAsp Arg Asp ctcaaa gag ctcgaccag ctgattgcg aatcgcccg gtcatc gcc 96 gca LeuLys Glu LeuAspGln LeuIleAla AsnArgPro ValIle Ala Ala gccgaa aac gaccgccgc gaagaaggc gacctgcgc gagaac ggc 199 atc AlaGlu Asn AspArgArg GluGluGly AspLeuArg GluAsn Gly Ile ggatac gcc gcccgcgag gagcagggc cagcaggag gcccgc att 192 cac GlyTyr Ala AlaArgGlu GluGlnGly GlnGl.nGlu AlaArg Ile His cgccag cag gacttgctc agcaacgca aaggttggc gaggca ccc 240 ctg ArgGln Gln AspLeuLeu SerAsnAla LysValGly GluAla Pro Leu aagcaa ggc gtcgcatta cccggttct gtggtcaag gtgtac tac 288 tcc LysGln Gly ValAlaLeu ProGlySer ValValLys ValTyr Tyr Ser aacggc aag tcggacagc gaaacgttc ctcat:cgcc acccgc cag 336 gac AsnGly Lys SerAspSer GluThrPhe LeuIleAla ThrArg Gln Asp gagggc agc gacggcaag ctcgaggtc tactcgccg aattca ccg 384 gtc GluGly Ser AspGlyLys LeuGluVal TyrSerPro AsnSer Pro Val ctcggt gcc ctgatcgac gccaaggtc ggcgagacc cgcagc tac 932 ggg LeuGly Ala LeuIleAsp AlaLysVal GlyGluThr ArgSer Tyr Gly acggtg aac ggcagcacc gtgtcggtg accctagtc agcgcc gag 480 ccc ThrVal Asn GlySerThr ValSerVal ThrLeuVal SerAla Glu Pro ccgtac tcc tag 495 cac ProTyr Ser His <210> 12 <211> 164 <212> PRT
<213> M.Tuberculosis <400> 12 MetThr AspThrGln ValThrTrp LeuThrGln GluSerHis AspArg LeuLys AlaGluLeu AspGlnLeu IleAlaAsn ArgProVal IleAla AlaGlu IleAsnAsp ArgArgGlu GluGlyAsp LeuArgGlu AsnGly GlyTyr HisAlaAla ArgGluGlu GlnGlyGln GlnGluAla ArgIle ArgGln LeuGlnAsp LeuLeuSer AsnAlaLys ValGlyGlu AlaPro LysGln SerGlyVal AlaLeuPro Gly5erVal ValLysVal TyrTyr AsnGly AspLysSer AspSerGlu ThrPheLeu IleAlaThr ArgGln GluGly ValSerAsp GlyLysLeu GluValTyr SerProAsn SerPro LeuGly GlyAlaLeu IleAspAla LysValGly GluThrArg SerTyr ThrVal ProAsnGly SerThrVal SerValThr LeuValSer AlaGlu ProTyr HisSer <210> 13 <211> 558 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(555) <400> 13 atgatt gatgagget ctcttcgac gccgaa gagaaaatg gagaagget 48 MetIle AspGluAla LeuPheAsp AlaGlu GluLysMet GluLysAla gtggcg gtggcacgt gacgacctg tcaact atccgtacc ggccgcgcc 96 ValAla ValAlaArg AspAspLeu SerThr IleArgThr GlyArgAla aaccct ggcatgttc tctcggatc accatc gactactac ggtgcggcc 144 AsnPro GlyMetPhe SerArgIle ThrIle AspTyrTyr GlyAlaAla accccg atcacgcaa ctggccagc atcaat gtccccgag gcgcggcta 192 ThrPro IleThrGln LeuAlaSer IleAsn ValProGlu AlaArgLeu gtcgtg ataaagccg tatgaagcc aatcag ttgcgcget atcgagact 240 ValVal IleLysFro TyrGluAla AsnGln LeuArgAla IleGluThr gcaatt cgcaactcc gaccttgga gtgaat cccaccaac gacggcgcc 288 AlaIle ArgAsnSer AspLeuGly ValAsn ProThrAsn AspGlyAla cttatt cgcgtggcc gtaccgcag ctcacc gaagaacgt cggcgagag 336 Leu Ile Arg Val Ala Val Pro Gln Leu Thr Glu Glu Arg Arg Arg Glu ctggtcaaacag gcaaagcat aagggggag gaggccaag gtttcg gtg 384 LeuValLysGln AlaLysHis LysGlyGlu GluAlaLys ValSer Val cgtaatatccgt cgcaaagcg atggaggaa ctccatcgc atccgt aag 432 ArgAsnIleArg ArgLysAla MetGluGlu LeuHisArg IleArg Lys gaaggcgaggcc ggcgaggat gaggtcggt cgcgcagaa aaggat ctc 980 GluGlyGluAla GlyGluAsp GluValGly ArgAlaGlu LysAsp Leu gacaagaccacg caccaatac gtcacccaa attgatgag ctggtt aaa 528 AspLysThrThr HisGlnTyr ValThrGln IleAspGlu LeuVal Lys cacaaagaaggc gagctgctg gaggtctag 558 HisLysGluGly GluLeuLeu GluVal <210> 14 <211> 185 <212> PRT
<213> M.Tuberculosis <400> 19 Met Ile Asp Glu A1a Leu Phe Asp Ala Glu Glu Lys Met Glu Lys Ala Val Ala Val Ala Arg Asp Asp Leu Ser Thr Ile Arg Thr Gly Arg Ala Asn Pro Gly Met Phe Ser Arg Ile Thr Ile Asp Tyr Tyr Gly Ala Ala Thr Pro Ile Thr Gln Leu Ala Ser Ile Asn Val Pro Glu Ala Arg Leu Val Val Ile Lys Pro Tyr Glu Ala Asn Gln Leu Arg Ala Ile Glu Thr Ala Ile Arg Asn Ser Asp Leu Gly Val Asn Pro Thr Asn Asp Gly Ala Leu Ile Arg Val Ala Val Pro Gln Leu Thr Glu Glu Arg Arg Arg Glu Leu Val Lys Gln Ala Lys His Lys Gly Glu Glu Ala Lys Val Ser Val Arg Asn Ile Arg Arg Lys Ala Met Glu Glu Leu His Arg Ile Arg Lys Glu Gly Glu Ala Gly Glu Asp Glu Val Gly Arg Ala Glu Lys Asp Leu Asp Lys Thr Thr His Gln Tyr Val Thr Gln Ile Asp Glu Leu Val Lys His Lys Glu Gly Glu Leu Leu Glu Val <210> 15 <211> 651 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(648) <400> 15 atggccgac atcgatggt gtaaccggt tcggcgggt ctgcag cctggg 98 MetAlaAsp IleAspGly ValThrGly SerAlaGly LeuGln ProGly ccgtctgag gagacagac gaggagttg accgcgcgt ttcgag cgcgac 96 ProSerGlu GluThrAsp GluGluLeu ThrAlaArg PheGlu ArgAsp gcgattccc ctgttggac cagctgtac ggcggtgcg ctgcgg atgacg 144 AlaIlePro LeuLeuAsp GlnLeuTyr GlyGlyA1<~LeuArg MetThr cgcaatccg gccgacgcc gaggacttg ctccaggag acgatg gtgaag 192 ArgAsnPro AlaAspAla GluAspLeu LeuGlnGlu ThrMet ValLys gcctatgcg ggatttcgt tcgttccgg cacggtacc aatctc aaggcc 240 AlaTyrAla GlyPheArg SerPheArg HisGlyThr_AsnLeu LysAla tggctctac cggatactg accaacacc tacatcaac agctat cgcaag 288 TrpLeuTyr ArgIleLeu ThrAsnThr TyrIleAsn SerTyr ArgLys aaacagcgg caaccggcg gagtatccg accgagcag atcacc gattgg 336 LysGlnArg GlnProAla GluTyrPro ThrGluG1I1IleThr AspTrp caactggcg tccaacgcc gagcattcc tcgaccggc3ctgcgc tcgget 389 GlnLeuAla SerAsnAla GluHisSer SerThrGly LeuArg SerAla gaagtcgaa gcgttagaa gcgttgccg gacaccgag atcaaa gaggcg 432 GluValGlu AlaLeuGlu AlaLeuPro AspThrGlu IleLys GluAla ctgcaggca ttgccggaa gagttccgg atggcggtc tactac gccgat 480 LeuGlnAla LeuProGlu GluPheArg MetAlaVal TyrTyr AlaAsp gtcgaaggt ttcccctac aaggagatc gccgagatc atggat actccg 528 ValGluGly PheProTyr LysGluIle AlaGluIllsMetAsp ThrPro atcggcacc gtgatgtcg aggcttcat cgcggccga cgtcag ttgcgc 576 IleGlyThr ValMetSer ArgLeuHis ArgGlyArg ArgGln LeuArg ggtctttta gccgatgtg gccagggat cgggggttt gccagg ggcgag 624 GlyLeuLeu AlaAspVal AlaArgAsp ArgGlyPhe AlaArg GlyGlu caggcgcac gagggggtg tcgtcatga 651 GlnAlaHis GluGlyVal SerSer <210> 16 <211> 216 <212> PRT
<213> M.Tuberculosis <900> 16 Met Ala Asp Ile Asp Gly Val Thr Gly Ser Ala Gly Leu Gln Pro Giy Pro Ser Glu Glu Thr Asp Glu Glu Leu Thr Ala Arg Phe Glu Arg Asp Ala Ile Pro Leu Leu Asp Gln Leu Tyr Gly Gly Ala Leu Arg Met Thr Arg Asn Pro Ala Asp Ala Glu Asp Leu Leu Gln Glu Thr Met Val Lys Ala Tyr Ala Gly Phe Arg Ser Phe Arg His Gly Thr Asn Leu Lys Ala Trp Leu Tyr Arg Ile Leu Thr Asn Thr Tyr Ile Asn Ser Tyr Arg Lys Lys Gln Arg Gln Pro Ala Glu Tyr Pro Thr Glu Gln Ile Thr Asp Trp Gln Leu Ala Ser Asn Ala Glu His Ser Ser Thr Gly Leu Arg Ser Ala Glu Val Glu Ala Leu Glu Ala Leu Pro Asp Thr Glu Ile Lys Glu Ala Leu Gln Ala Leu Pro Glu Glu Phe Arg Met Ala Val Tyr Tyr Ala Asp Val Glu Gly Phe Pro Tyr Lys Glu Ile Ala Glu Ile Met Asp Thr Pro Ile Gly Thr Val Met Ser Arg Leu His Arg Gly Arg Arg Gln Leu Arg Gly Leu Leu Ala Asp Val Ala Arg Asp Arg Gly Phe Ala Arg Gly Glu Gln Ala His Glu Gly Val Ser Ser <210> 17 <211> 779 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(771) <900> 17 atg acg tac gaa acc atc ctg gtc gag cgc gat cag cga gtt ggc att 48 Met Thr Tyr Glu Thr Ile Leu Val Glu Arg Asp Gln Arg Val Gly Ile atc acg ctg aac cgt ccc cag gca ctg aac gcg ctc aac agc cag gtg 96 Ile Thr Leu Asn Arg Pro Gln Ala Leu Asn Ala Leu Asn Ser Gln Val atg aac gag gtc acc agc get gca acc gaa ctg gac gat gac ccg gac 144 Met Asn Glu Val Thr Ser Ala Ala Thr Glu Leu Asp Asp Asp Pro Asp att ggg gcg atc atc atc acc ggt tcg gcc aaa gcg ttt gcc gcc gga 192 Ile Gly Ala Ile Ile Ile Thr Gly Ser Ala Lys Ala Phe Ala Ala Gly gccgacatc aaagaaatg gccgacctg acgttcgcc gacgcgttc acc 290 AlaAspIle LysGluMet AlaAspLeu ThrPheAla AspAlaPhe Thr gccgacttc ttcgccacc tggggcaag ctggccgcc gtgcgcacc ccg 288 AlaAspPhe PheAlaThr TrpGlyLys LeuAlaAla ValArgThr Pro acgatcgcc gcggtggcg ggatacgcg ctcggcggt ggctgcgag ctg 336 ThrIleAla AlaValAla GlyTyrAla LeuGlyGly GlyCysGlu Leu gcgatgatg tgcgacgtg ctgatcgcc gccgacacc gcgaagttc gga 384 AlaMetMet CysAspVal LeuIleAla AlaAspThr AlaLysPhe Gly cagcccgag ataaagctg ggcgtgctg ccaggcatg ggcggctcc cag 432 GlnProGlu IleLysLeu GlyValLeu ProGlyMet GlyGlySer Gln cggctgacccgg getatc ggcaagget aaggcgatg gacctcatc ctg 480 ArgLeuThrArg AlaIle GlyLysAla LysAlaMet AspLeuIle Leu accgggcgcacc atggac gccgccgag gccgagcgc:agcggtctg gtt 528 ThrGlyArgThr MetAsp AlaAlaGlu AlaGluArg SerGlyLeu Val tcacgggtggtg ccggcc gacgacttg ctgaccgaa gccagggcc act 576 SerArgValVal ProAla AspAspLeu LeuThrGlu AlaArgAla Thr gccacgaccatt tcgcag atgtcggcc tcggcggcc:cggatggcc aag 629 AlaThrThrIle SerGln MetSerAla SerAlaAla ArgMetAla Lys gaggcc aaccggget ttcgaatcc agtttgtcc:gaggggctg ctc 672 gtc GluAla AsnArgAla PheGluSer SerLeuSer GluGlyLeu Leu Val tacgaa cggcttttc cattcgget ttcgcgacc gaagaccaa tcc 720 cgc TyrGlu ArgLeuPhe HisSerAla PheAlaThr GluAspGln Ser Arg gaaggt gcagcgttc atcgagaaa cgcgetccc cagttcacc cac 768 atg GluGly AlaAlaPhe IleGluLys ArgAlaPro GlnPheThr His Met cgatga 774 Arg <210> 18 <211> 257 <212> PRT
<213> M.Tuberculosis <400> 18 MetThr GluThrIle LeuValGlu ArgAspGln ArgValGly Ile Tyr Ile Thr Leu Asn Arg Pro Gln Ala Leu Asn Ala Leu Asn Ser Gln Val Met Asn Glu Val Thr Ser Ala Ala Thr Glu Leu Asp Asp Asp Pro Asp Ile Gly Ala Ile Ile Ile Thr Gly Ser Ala Lys Ala Phe Ala Ala Gly Ala Asp Ile Lys Glu Met Ala Asp Leu Thr Phe Ala Asp Ala Phe Thr 65 70 75 g0 Ala Asp Phe Phe Ala Thr Trp Gly Lys Leu Ala Ala Val Arg Thr Pro Thr Ile Ala Ala Val Ala Gly Tyr Ala Leu Gly Gly Gly Cys Glu Leu Ala Met Met Cys Asp Val Leu Ile Ala Ala Asp Thr Ala Lys Phe Gly Gln Pro Glu Ile Lys Leu Gly Val Leu Pro Gly Met: Gly Gly Ser Gln Arg Leu Thr Arg Ala Ile Gly Lys Ala Lys Ala Met Asp Leu Ile Leu Thr Gly Arg Thr Met Asp Ala Ala Glu Ala Glu Arg Ser Gly Leu Val Ser Arg Val Val Pro Ala Asp Asp Leu Leu Thr Glu Ala Arg Ala Thr Ala Thr Thr Ile Ser Gln Met Ser Ala Ser Ala Ala Arg Met Ala Lys Glu Ala Val Asn Arg Ala Phe Glu Ser Ser Leu Ser Glu Gly Leu Leu Tyr Glu Arg Arg Leu Phe His Ser Ala Phe Ala Thr Glu Asp Gln Ser Glu Gly Met Ala Ala Phe Ile Glu Lys Arg Ala Pro Gln Phe Thr His Arg <210> 19 <211> 894 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(891) <400> 19 gtgccgcttccc gcagaccct agccccacc ttgtcggcc tacgcc cat 48 ValProLeuPro AlaAspPro SerProThr LeuSerAla TyrAla His cccgaacggctc gtgaccgcc gactggttg tcggcacac atgggc gcg 96 ProGluArgLeu ValThrA1a AspTrpLeu SerAlaHis MetGly Ala ccgggcctggcg atcgtcgaa tccgacgag gacgtc_ttg ctctac gac 144 ProGlyLeuAla IleValG1u SerAspGlu AspValLeu LeuTyr Asp gtcggccatatt cccggcgcc gtcaagatc gactggcac accgac ctc 192 ValGlyHisIle ProGlyAla ValLysIle AspTrpHis ThrAsp Leu aac gac cca cgg gtg cgc gac tac atc aac ggc gag cag ttc gcc gaa 290 Asn Asp Pro Arg Val Arg Asp Tyr Ile Asn Gly Glu Gln Phe Ala Glu 65 70 75 g0 ttgatggac cgcaagggc atcgcccgc gatgacacc gtggtg atctat 288 LeuMetAsp ArgLysGly :CleAlaArg AspAspThr ValVal IleTyr ggcgacaag agcaattgg tgggccgcc tatgcgttg tgggtg ttcacg 336 GlyAspLys SerAsnTrp TrpAlaAla TyrAlaLeu TrpVal PheThr ctgttcggt cacgccgac gtgcgactc ctcaacggc ggccgt gacctc 389 LeuPheGly HisAlaAsp ValArgLeu LeuAsnGly GlyArg AspLeu tggctcgcc gagcgccgg gaaaccacc ttggacgtc ccgacc aagacc 432 TrpLeuAla GluArgArg GluThrThr LeuAspVal ProThr LysThr tgcaccggt tatcccgtc gtgcagcgc aacgatgca cccatc cgcgca 980 CysThrGly TyrProVal ValGlnArg AsnAspAla ProIle ArgAla ttcagagac gacgtgctg gccatcctg ggcgetcag ccgctg atcgac 528 PheArgAsp AspValLeu AlaIleLeu GlyAlaGln ProLeu IleAsp gtacgctct cccgaggag tacaccggc aagcgcacc catatg cccgat 576 ValArgSer ProGluGlu T'yrThrGly LysArgThr HisMet ProAsp taccccgag gaaggggcg ctgcgggcc ggtcacatc cccacg gcggtg 629 TyrProGlu GluGlyAla heuArgAla GlyHisIleaProThr AlaVal cacattccg tgggggaag gccgccgac gaaagtgga cggttt cgcagc 672 HisIlePro TrpGlyLys AlaAlaAsp GluSerGly ArgPhe ArgSer cgcgaggaa ttggaacgg ctctatgac ttcataaac ccggacgac caa 720 ArgGluGlu LeuGluArg LeuTyrAsp PheIleAsn ProAspAsp Gln accgtcgtc tattgccgc atcggtgaa cgctccagc:catacctgg ttc 768 ThrValVal TyrCysArg IleGlyGlu ArgSerSer HisThrTrp Phe gtgctcaca cacctgctg ggcaaggca gatgtacgg aactacgac ggc 816 ValLeuThr HisLeuLeu GlyLysAla AspValArg AsnTyrAsp Gly tcgtggacc gagtggggc aacgccgtg 
cgagtgccg atcgtcgcg ggc 864 SerTrpThr GluTrpGly AsnAlaVal ArgValPro IleValAla Gly gaagaacca ggagtggta cccgtcgta tga g9q GluGluPro GlyValVal ProValVal <210> 20 <211> 297 <212> PRT
<213> M.Tuberculosis <400> 20 Met Pro Leu Pro Ala Asp Pro Ser Pro Thr Leu Ser Ala Tyr Ala His Pro Glu Arg Leu Val Thr Ala Asp Trp Leu 5er Ala His Met Gly Ala Pro Gly Leu Ala Ile Val Glu Ser Asp Glu Asp Val Leu Leu Tyr Asp Val Gly His Ile Pro Gly Ala Val Lys Ile Asp Trp His Thr Asp Leu Asn Asp Pro Arg Val Arg Asp Tyr Ile Asn Gly Glu Gln Phe Ala Glu Leu Met Asp Arg Lys Gly Ile Ala Arg Asp Asp Th:r Val Val Ile Tyr Gly Asp Lys Ser Asn Trp Trp Ala Ala Tyr Ala Leu Trp Val Phe Thr Leu Phe Gly His Ala Asp Val Arg Leu Leu Asn Gly Gly Arg Asp Leu Trp Leu Ala Glu Arg Arg Glu Thr Thr Leu Asp Val Pro Thr Lys Thr Cys Thr Gly Tyr Pro Val Val Gln Arg Asn Asp Ala Pro Ile Arg Ala Phe Arg Asp Asp Val Leu Ala Ile Leu Gly Ala Gln Pro Leu Ile Asp Val Arg Ser Pro Glu Glu Tyr Thr Gly Lys Arg Thr His Met Pro Asp Tyr Pro Glu Glu Gly Ala Leu Arg Ala Gly His Ile: Pro Thr Ala Val His Ile Pro Trp Gly Lys Ala Ala Asp Glu Ser Gly Arg Phe Arg Ser Arg Glu Glu Leu Glu Arg Leu Tyr Asp Phe Ile Asn Pro Asp Asp Gln Thr Val Val Tyr Cys Arg Ile Gly Glu Arg Ser Ser His Thr Trp Phe Val Leu Thr His Leu Leu Gly Lys Ala Asp Val Arg Asn Tyr Asp Gly Ser Trp Thr Glu Trp Gly Asn Ala Val Arg Val Pro Ile Val Ala Gly Glu Glu Pro Gly Val Val Pro Val Val <210> 21 <211> 1094 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1041) <400> 21 atg ctg atc tca cag cgc ccc acc ctg tcc gag gac gtc ctc acc gac 48 Met Leu Ile Ser Gln Arg Pro Thr Leu Ser Glu Asp Val Leu Thr Asp aac cga tcc cag ttc gtg atc gaa ccg ctg gag ccg gga ttc ggc tac 96 Asn Arg Ser Gln Phe Val I1e Glu Pro Leu Glu Pro Gly Phe Gly Tyr acc ctg ggc aat tcg ctg cgt cgc acc ctg ctg tcg tcg att ccc gga 199 Thr Leu Gly Asn Ser Leu Arg Arg Thr Leu Leu Ser Ser Ile Pro Gly gcg gcc gtc acc agc att cgc atc gat ggt gta ctg cac gaa ttc acc 192 Ala Ala Val Thr Ser Ile Arg Ile Asp Gly Val Leu His Glu Phe Thr acggtgcccggg gtcaaagaa gatgtcacc gagatc atcctgaat ctc 240 ThrValProGly ValLysGlu AspValThr GluIle IleLeuAsn Leu aagagcctggtg gtgtcctcg gaggaggac gagccg gtcaccatg tac 288 LysSerLeuVal ValSerSer GluGluAsp GluPro ValThrMet Tyr ctacgcaagcag ggtccgggt gaggttacc gccggc gacatcgtg ccg 336 LeuArgLysGln GlyProGly GluValThr AlaGly AspIleVal Pro ccggccggcgtc accgtgcac aaccccggc atgcac atcgccacg ctg 384 ProAlaGlyVal ThrValHis AsnProGly MetHis IleAlaThr Leu aacgataag ggcaagctggaa gtcgag ctcgtcgtc gagcgtggc cgc 432 AsnAspLys GlyLysLeuGlu ValGlu LeuValVal GluArgGly Arg ggctatgtc ccggcggtgcaa aaccgg gettcgggt gccgaaatt ggg 480 GlyTyrVal ProAlaValGln AsnArg AlaSerGly AlaGluIle Gly cgcattcca gtcgattccatc tactca ccggtgctc aaagtgacc tac 528 ArgIlePro ValAspSerIle TyrSer ProValLeu LysValThr Tyr aaggtggac gccacccgggtc gagcag cgcaccgac ttcgacaag ctg 576 LysValAsp AlaThrArgVal GluGln ArgThrAsp PheAspLys Leu atcctggac gtggagaccaag aattca atcagcccg cgcgacgcg ctg 624 IleLeuAsp ValGluThrLys AsnSer IleSerPro ArgAspAla Leu gcgtcgget ggcaagacgctg gtcgag ttgttcggc:ctggcacgg gaa 672 AlaSerAla GlyLysThrLeu ValGlu LeuPheGly LeuAlaArg Glu ctcaacgtc gaggccgaaggc atcgag atcgggccg tcgccggcc gag 720 LeuAsnVal GluAlaGluGly IleGlu IleGlyPro SerProAla Glu gccgatcac attgcgtcattc gccctg ccgatcgac gacctggat ctg 768 AlaAspHis IleAlaSerPhe AlaLeu ProIleAsp AspLeuAsp Leu acggtgcgg tcctacaactgc ctcaag cgcgagggg gtgcacacc gtg 816 ThrValArg SerTyrAsnCys LeuLys ArgGluGly 
ValHisThr Val ggc gaa ctg gtg gcg cgc acc gaa tcc gac ctg ctt gac atc cgc aac 864
B-cell epitopes may be linear or spatial. The three-dimensional structure of a protein is often such that amino acids, which are located distant from each other in the one-dimensional structure, are located near to each other in the folded protein.
Within the meaning of the present context, the expression epitope is intended to comprise the one- and three-dimensional structure as well as mimics thereof. The term is further intended to include discontinuous B-cell epitopes. The linear B-cell epitopes can be identified in a similar manner as described for the T-cell epitopes above. However, when identifying B-cell epitopes the assay should be an ELISA using overlapping oligomers derived from the polypeptide as the coating layer on a microtiter plate as described elsewhere.
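By way of illustration only, a set of overlapping oligomers for such an ELISA could be derived as in the following minimal Python sketch. The peptide length of 15 residues, the offset of 5 residues and the function name overlapping_peptides are assumptions made for the example and are not values given herein; the example sequence is the first 32 residues of SEQ ID NO: 20 as they appear in the sequence listing.

```python
def overlapping_peptides(sequence, length=15, step=5):
    """Return (start_position, peptide) pairs that cover the whole sequence.

    length and step are illustrative assumptions, not values from the text.
    start_position is 1-based, as in a sequence listing.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    if len(sequence) <= length:
        return [(1, sequence)]
    last_start = len(sequence) - length
    starts = list(range(0, last_start + 1, step))
    # Add a final peptide so that the C-terminal residues are always covered.
    if starts[-1] != last_start:
        starts.append(last_start)
    return [(s + 1, sequence[s:s + length]) for s in starts]


# First 32 residues of SEQ ID NO: 20 in one-letter code.
example = "MPLPADPSPTLSAYAHPERLVTADWLSAHMGA"
for position, peptide in overlapping_peptides(example):
    print(position, peptide)
```

Each peptide would then be coated in a separate well, so that a positive well points to the stretch of the polypeptide carrying a linear epitope.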
A non-naturally occurring polypeptide, an analogue, a subsequence, a T-cell epitope and/or a B-cell epitope of any of the described polypeptides are defined as any non-naturally occurring polypeptide, analogue, subsequence, T-cell epitope and/or B-cell epitope of any of the polypeptides having any of the properties i)-ix).
Table 1 lists the antigens of the invention.
Table 1. The antigens of the invention by the names used herein as well as by reference to relevant SEQ ID NOs of N-terminal sequences, full amino acid sequences and sequences of nucleotides encoding the antigens

Antigen   N-terminal sequence   Nucleotide sequence   Amino acid sequence
          SEQ ID NO:            SEQ ID NO:            SEQ ID NO:
TB12.5    80                    74                    75
TB20.6    81                    76                    77
TB40.8    82                    78                    79

Each of the polypeptides may be characterised by specific amino acid and nucleic acid sequences. It will be understood that such sequences include analogues and variants produced by recombinant methods wherein such nucleic acid and polypeptide sequences have been modified by substitution, insertion, addition and/or deletion of one or more nucleotides in said nucleic acid sequences to cause the substitution, insertion, addition or deletion of one or more amino acid residues in the recombinant polypeptide. A
preferred nucleotide sequence encoding a polypeptide of the invention is a nucleotide sequence which
1) is a nucleotide sequence selected from the group consisting of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 and 78 or an analogue of said sequence which hybridises with any of the nucleotide sequences shown in SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78 or a nucleotide sequence complementary thereto, or a specific part thereof, preferably under stringent hybridisation conditions. By stringent conditions is understood, as defined in the art, 5-10°C under the melting point Tm, cf. Sambrook et al, 1989, pages 11.45-11.49, and/or
2) encodes a polypeptide, the amino acid sequence of which has an 80% sequence identity with an amino acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 75, 77 and 79 and/or
3) constitutes a subsequence of any of the above mentioned nucleotide sequences, and/or
4) constitutes a subsequence of any of the above mentioned polypeptide sequences.
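The stringency window of 5-10°C under Tm mentioned in item 1) can be made concrete with a short Python sketch. The sketch uses the Wallace rule of thumb for short oligonucleotides, Tm = 2(A+T) + 4(G+C) °C, which is a simplification substituted here for illustration and is not the procedure of the cited Sambrook pages; the function names are likewise assumptions. The example oligonucleotide is the first 18 nucleotides of SEQ ID NO: 17.

```python
def wallace_tm(oligo):
    """Approximate melting point (deg C) of a short oligonucleotide.

    Wallace rule of thumb: 2 deg C per A/T and 4 deg C per G/C.
    This is a rough estimate intended for oligos of roughly 14-20 nt.
    """
    seq = oligo.upper().replace(" ", "")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("sequence must contain only A, C, G and T")
    return 2 * at + 4 * gc


def stringent_window(tm):
    """Return the (lower, upper) temperature range 5-10 deg C under Tm."""
    return (tm - 10, tm - 5)


probe = "atg acg tac gaa acc atc"  # first 18 nt of SEQ ID NO: 17
tm = wallace_tm(probe)
print(tm, stringent_window(tm))  # prints: 52 (42, 47)
```

For longer fragments the salt concentration, formamide content and fragment length all shift Tm, so an estimate of this kind would only be a starting point for choosing wash conditions.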
The terms "analogue" or "subsequence" when used in connection with the nucleotide fragments of the invention are thus intended to indicate a nucleotide sequence which encodes a polypeptide exhibiting identical or substantially identical immunological properties to a polypeptide encoded by the nucleotide fragment of the invention shown in any of SEQ ID NOs: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78, allowing for minor variations which do not have an adverse effect on the ligand binding properties and/or biological function and/or immunogenicity as compared to any of the polypeptides of the invention or which give interesting and useful novel binding properties or biological functions and immunogenicities etc. of the analogue and/or subsequence. The analogous nucleotide fragment or nucleotide sequence may be derived from a bacterium, a mammal, or a human or may be partially or completely of synthetic origin. The analogue and/or subsequence may also be derived through the use of recombinant nucleotide techniques.
Furthermore, the terms "analogue" and "subsequence" are intended to allow for variations in the sequence such as substitution, insertion (including introns), addition, deletion and rearrangement of one or more nucleotides, which variations do not have any substantial effect on the polypeptide encoded by a nucleotide fragment or a subsequence thereof. The term "substitution" is intended to mean the replacement of one or more nucleotides in the full nucleotide sequence with one or more different nucleotides, "addition" is understood to mean the addition of one or more nucleotides at either end of the full nucleotide sequence, "insertion" is intended to mean the introduction of one or more nucleotides within the full nucleotide sequence, "deletion" is intended to indicate that one or more nucleotides have been deleted from the full nucleotide sequence whether at either end of the sequence or at any suitable point within it, and "re-arrangement" is intended to mean that two or more nucleotide residues have been exchanged with each other.
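The five kinds of variation defined above can be illustrated on a toy sequence. The following Python sketch is an assumption-laden illustration only: the function names and the zero-based index convention are hypothetical, and the six-nucleotide example is the start of SEQ ID NO: 17.

```python
def substitution(seq, start, end, new):
    """Replace the nucleotides seq[start:end] with different nucleotides."""
    return seq[:start] + new + seq[end:]


def addition(seq, extra, at_start=False):
    """Add nucleotides at either end of the full sequence."""
    return extra + seq if at_start else seq + extra


def insertion(seq, index, extra):
    """Introduce nucleotides within the sequence (not at an end)."""
    if not 0 < index < len(seq):
        raise ValueError("insertion point must lie inside the sequence")
    return seq[:index] + extra + seq[index:]


def deletion(seq, start, end):
    """Delete the nucleotides seq[start:end], at an end or internally."""
    return seq[:start] + seq[end:]


def rearrangement(seq, i, j):
    """Exchange the nucleotide residues at positions i and j."""
    residues = list(seq)
    residues[i], residues[j] = residues[j], residues[i]
    return "".join(residues)


seq = "ATGACG"
print(substitution(seq, 3, 4, "G"))  # ATGGCG
print(addition(seq, "TT"))           # ATGACGTT
print(insertion(seq, 3, "CC"))       # ATGCCACG
print(deletion(seq, 0, 3))           # ACG
print(rearrangement(seq, 0, 5))      # GTGACA
```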
It is well known that the same amino acid may be encoded by various codons, the codon usage being related, inter alia, to the preference of the organisms in question expressing the nucleotide sequence. Thus, at least one nucleotide or codon of a nucleotide fragment of the invention may be exchanged for others which, when expressed, result in a polypeptide identical or substantially identical to the polypeptide encoded by the nucleotide fragment in question.
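The degeneracy of the genetic code can be shown with a short Python sketch that translates two different nucleotide sequences into the same peptide. The first sequence is the first eight codons of SEQ ID NO: 17; the second is a hypothetical variant constructed for this example by exchanging synonymous codons. The function name translate is an assumption, and the table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    seq = dna.upper().replace(" ", "")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


original = "atg acg tac gaa acc atc ctg gtc"  # start of SEQ ID NO: 17
variant = "atg acc tat gag aca att tta gtg"   # hypothetical synonymous codons

print(translate(original))  # MTYETILV
print(translate(variant))   # MTYETILV
```

Seven of the eight codons differ between the two sequences, yet both encode Met-Thr-Tyr-Glu-Thr-Ile-Leu-Val.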
The term "subsequence" when used in connection with the nucleic acid fragments of the invention is intended to indicate a continuous stretch of at least 10 nucleotides which exhibits the above hybridization pattern. Normally this will require a minimum sequence identity of at least 70% with a subsequence of the hybridization partner having SEQ ID NO: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 74, 76 or 78. It is preferred that the nucleic acid fragment is longer than 10 nucleotides, such as at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, and at least 80 nucleotides long, and the sequence identity should preferably also be higher than 70%, such as at least 75%, at least 80%, at least 85%, at least 90%, at least 92%, at least 94%, at least 96%, and at least 98%. It is most preferred that the sequence identity is 100%. Such fragments may be readily prepared by, for example, directly synthesizing the fragment by chemical means, by application of nucleic acid reproduction technology, such as the PCR technology of U.S. Patent 4,603,102, or by introducing selected sequences into recombinant vectors for recombinant production.
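The length and identity thresholds above can be checked with a minimal Python sketch. It assumes an ungapped, position-by-position comparison of the fragment against every window of the hybridization partner, which is a simplification (alignment programs that allow gaps may report different values); the function names are hypothetical. The target is the first 24 nucleotides of SEQ ID NO: 17 and the fragment is an invented 10-mer with one mismatch.

```python
def percent_identity(a, b):
    """Percentage of identical positions between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def best_ungapped_identity(fragment, target):
    """Highest identity of the fragment against any window of the target."""
    n = len(fragment)
    if n > len(target):
        raise ValueError("fragment is longer than target")
    return max(
        percent_identity(fragment, target[i:i + n])
        for i in range(len(target) - n + 1)
    )


def is_subsequence_candidate(fragment, target, min_length=10, min_identity=70.0):
    """Apply the minimum length (10 nt) and identity (70%) stated in the text."""
    return (
        len(fragment) >= min_length
        and best_ungapped_identity(fragment, target) >= min_identity
    )


target = "ATGACGTACGAAACCATCCTGGTC"  # first 24 nt of SEQ ID NO: 17
fragment = "ACGTACGAAT"              # invented 10-mer, one mismatch
print(best_ungapped_identity(fragment, target))  # 90.0
print(is_subsequence_candidate(fragment, target))  # True
```

Raising min_length and min_identity towards the preferred values (for example 20 nucleotides and 90%) narrows the set of fragments that qualify.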
The nucleotide sequence to be modified may be of cDNA or genomic origin as discussed above, but may also be of synthetic origin. Furthermore, the sequence may be of mixed cDNA and genomic, mixed cDNA and synthetic or genomic and synthetic origin as discussed above. The sequence may have been modified, e.g. by site-directed mutagenesis, to result in the desired nucleic acid fragment encoding the desired polypeptide.
The invention also relates to a replicable expression vector which comprises a nucleic acid fragment defined above, especially a vector which comprises a nucleic acid fragment encoding a polypeptide fragment of the invention. The vector may be any vector which may conveniently be subjected to recombinant DNA procedures, and the choice of vector will often depend on the host cell into which it is to be introduced. Thus, the vector may be an autonomously replicating vector, i.e. a vector which exists as an extrachromosomal entity, the replication of which is independent of chromosomal replication; examples of such a vector are a plasmid, phage, cosmid, mini-chromosome and virus. Alternatively, the vector may be one which, when introduced in a host cell, is integrated in the host cell genome and replicated together with the chromosome(s) into which it has been integrated.
Expression vectors may be constructed to include any of the DNA segments disclosed herein. Such DNA might encode an antigenic protein specific for virulent strains of mycobacteria or even hybridization probes for detecting mycobacteria nucleic acids in samples. Longer or shorter DNA segments could be used, depending on the antigenic protein desired. Epitopic regions of the proteins expressed or encoded by the disclosed DNA could be included as relatively short segments of DNA. A wide variety of expression vectors is possible including, for example, DNA segments encoding reporter gene products useful for identification of heterologous gene products and/or resistance genes such as antibiotic resistance genes which may be useful in identifying transformed cells.
The vector of the invention may be used to transform cells so as to allow propagation of the nucleic acid fragments of the invention or so as to allow expression of the polypeptide fragments of the invention. Hence, the invention also pertains to a transformed cell harbouring at least one such vector according to the invention, said cell being one which does not natively harbour the vector and/or the nucleic acid fragment of the invention contained therein. Such a transformed cell (which is also a part of the invention) may be any suitable bacterial host cell or any other type of cell such as a unicellular eukaryotic organism, a fungus or yeast, or a cell derived from a multicellular organism, e.g. an animal or a plant. It is especially in cases where glycosylation is desired that a mammalian cell is used, since glycosylation of proteins is a rare event in prokaryotes. Normally, however, a prokaryotic cell is preferred such as a bacterium belonging to the genera Mycobacterium, Salmonella, Pseudomonas, Bacillus and Escherichia. It is preferred that the transformed cell is an E. coli, B. subtilis, or M. bovis BCG cell, and it is especially preferred that the transformed cell expresses a polypeptide according to the invention.
The latter opens the possibility of producing the polypeptide of the invention by simply recovering it from the culture containing the transformed cell. In the most preferred embodiment of this part of the invention the transformed cell is Mycobacterium bovis BCG strain: Danish 1331, which is the Mycobacterium bovis strain Copenhagen from the Copenhagen BCG Laboratory, Statens Seruminstitut, Denmark.
The nucleic acid fragments of the invention allow for the recombinant production of the polypeptide fragments of the invention. However, isolation from the natural source is also a way of providing the polypeptide fragments, as is peptide synthesis.
Therefore, the invention also pertains to a method for the preparation of a polypeptide fragment of the invention, said method comprising inserting a nucleic acid fragment as described in the present application into a vector which is able to replicate in a host cell, introducing the resulting recombinant vector into the host cell (transformed cells may be selected using various techniques, including screening by differential hybridization, identification of fused reporter gene products, resistance markers, anti-antigen antibodies and the like), culturing the host cell in a culture medium under conditions sufficient to effect expression of the polypeptide (of course the cell may be cultivated under conditions appropriate to the circumstances, and if DNA is desired, replication conditions are used), and recovering the polypeptide from the host cell or culture medium; or isolating the polypeptide from a short-term culture filtrate; or isolating the polypeptide from whole mycobacteria of the tuberculosis complex or from lysates or fractions thereof, e.g. cell wall containing fractions, or synthesizing the polypeptide by solid or liquid phase peptide synthesis.
The medium used to grow the transformed cells may be any conventional medium suitable for the purpose. A suitable vector may be any of the vectors described above, and an appropriate host cell may be any of the cell types listed above. The methods employed to construct the vector and effect introduction thereof into the host cell may be any methods known for such purposes within the field of recombinant DNA. In the following a more detailed description of the possibilities will be given:
In general, of course, prokaryotes are preferred for the initial cloning of nucleic acid sequences of the invention and constructing the vectors useful in the invention. For example, in addition to the particular strains mentioned in the more specific disclosure below, one may mention strains such as E. coli K12 strain (ATCC No. 31446), E. coli B, and E. coli X 1776 (ATCC No. 31537). These examples are, of course, intended to be illustrative and not limiting.
Prokaryotes are also preferred for expression. The aforementioned strains, as well as E. coli W3110 (F-, lambda-, prototrophic, ATCC No. 273325), bacilli such as Bacillus subtilis, or other Enterobacteriaceae such as Salmonella typhimurium or Serratia marcescens, and various Pseudomonas species may be used. Especially interesting are rapid-growing mycobacteria, e.g. M. smegmatis, as these bacteria have a high degree of resemblance with mycobacteria of the tuberculosis complex and therefore stand a good chance of reducing the need of performing post-translational modifications of the expression product.
In general, plasmid vectors containing replicon and control sequences which are derived from species compatible with the host cell are used in connection with these hosts. The vector ordinarily carries a replication site, as well as marking sequences which are capable of providing phenotypic selection in transformed cells. For example, E. coli is typically transformed using pBR322, a plasmid derived from an E. coli species (see, e.g., Bolivar et al., 1977, Gene 2: 95). The pBR322 plasmid contains genes for ampicillin and tetracycline resistance and thus provides easy means for identifying transformed cells.
The pBR plasmid, or other microbial plasmids or phages must also contain, or be modified to contain, promoters which can be used by the microorganism for expression.
Those promoters most commonly used in recombinant DNA construction include the β-lactamase (penicillinase) and lactose promoter systems (Chang et al., (1978), Nature, 35:515; Itakura et al., (1977), Science 198:1056; Goeddel et al., (1979), Nature 281:544) and a tryptophan (trp) promoter system (Goeddel et al., (1979) Nature 281:544; EPO Appl. Publ. No. 0036776). While these are the most commonly used, other microbial promoters have been discovered and utilized, and details concerning their nucleotide sequences have been published, enabling a skilled worker to ligate them functionally with plasmid vectors (Siebenlist et al., (1980), Cell, 20:269). Certain genes from prokaryotes may be expressed efficiently in E. coli from their own promoter sequences, precluding the need for addition of another promoter by artificial means.
After the recombinant preparation of the polypeptide according to the invention, the isolation of the polypeptide may for instance be carried out by affinity chromatography (or other conventional biochemical procedures based on chromatography), using a monoclonal antibody which substantially specifically binds the polypeptide according to the invention. Another possibility is to employ the simultaneous electroelution technique described by Andersen et al. in J. Immunol. Methods 161: 29-39.
According to the invention the post-translational modifications involve lipidation, glycosylation, cleavage, or elongation of the polypeptide.
In certain aspects, the DNA sequence information provided by this invention allows for the preparation of relatively short DNA (or RNA or PNA) sequences having the ability to specifically hybridize to mycobacterial gene sequences. In these aspects, nucleic acid probes of an appropriate length are prepared based on a consideration of the relevant sequence. The ability of such nucleic acid probes to specifically hybridize to the mycobacterial gene sequences lends them particular utility in a variety of embodiments.
Most importantly, the probes can be used in a variety of diagnostic assays for detecting the presence of pathogenic organisms in a given sample. However, other uses are envisioned, including the use of the sequence information for the preparation of mutant species primers, or primers for use in preparing other genetic constructs.
Apart from their use as starting points for the synthesis of polypeptides of the invention and for hybridization probes (useful for direct hybridization assays or as primers in e.g. PCR or other molecular amplification methods) the nucleic acid fragments of the invention may be used for effecting in vivo expression of antigens, i.e. the nucleic acid fragments may be used in so-called DNA vaccines. Recent research has revealed that a DNA fragment cloned in a vector which is non-replicative in eukaryotic cells may be introduced into an animal (including a human being) by e.g. intramuscular injection or percutaneous administration (the so-called "gene gun" approach). The DNA is taken up by e.g. muscle cells and the gene of interest is expressed by a promoter which is functioning in eukaryotes, e.g. a viral promoter, and the gene product thereafter stimulates the immune system. These newly discovered methods are reviewed in Ulmer et al., (1993), Curr. Opin. Invest. Drugs, 2:983-989, which is hereby incorporated by reference.
Hence, the invention also relates to a vaccine comprising a nucleic acid fragment according to the invention, the vaccine effecting in vivo expression of antigen by an animal, including a human being, to whom the vaccine has been administered, the amount of expressed antigen being effective to confer substantially increased resistance to infections with mycobacteria of the tuberculosis complex in an animal, including a human being.
The efficacy of such a "DNA vaccine" can possibly be enhanced by administering the gene encoding the expression product together with a DNA fragment encoding a polypeptide which has the capability of modulating an immune response. For instance, a gene encoding lymphokine precursors or lymphokines (e.g. IFN-γ, IL-2, or IL-12) could be administered together with the gene encoding the immunogenic protein, either by administering two separate DNA fragments or by administering both DNA fragments included in the same vector. It also is a possibility to administer DNA fragments comprising a multitude of nucleotide sequences which each encode relevant epitopes of the polypeptides disclosed herein so as to effect a continuous sensitization of the immune system with a broad spectrum of these epitopes.
In one embodiment of the invention, any of the above mentioned polypeptides is used in the manufacture of an immunogenic composition to be used for induction of an immune response in a mammal against an infection with a virulent Mycobacterium.
Preferably, the immunogenic composition is used as a vaccine.
The preparation of vaccines which contain peptide sequences as active ingredients is generally well understood in the art, as exemplified by U.S. Patents 4,608,251;
4,601,903; 4,599,231; 4,599,230; 4,596,792; and 4,578,770, all incorporated herein by reference. Typically, such vaccines are prepared as injectables either as liquid solutions or suspensions; solid forms suitable for solution in liquid or suspension in liquid prior to injection may also be prepared. The preparation may also be emulsified. The active immunogenic ingredient is often mixed with excipients which are pharmaceutically acceptable and compatible with the active ingredient. Suitable excipients are, for example, water, saline, dextrose, glycerol, ethanol, or the like, and combinations thereof. In addition, if desired, the vaccine may contain minor amounts of auxiliary substances such as wetting or emulsifying agents, pH buffering agents, or adjuvants which enhance the effectiveness of the vaccines.
In one embodiment the composition used for vaccination comprises at least one, but preferably at least 2, such as at least 3, 4, 5, 10, 15 or at least 20 different polypeptides of the invention.
In another embodiment the composition to be used for vaccination comprises, together with at least one polypeptide of the invention, at least one, but preferably at least 2, such as at least 3, 4, 5, 10, 15 or at least 20 polypeptides which are not polypeptides of the present invention but are derived from a virulent Mycobacterium such as a polypeptide belonging to the group of ST-CF (Elhay MJ and Andersen P, Immunology and cell Biology (1997) 75, 595-603), such as ESAT-6, CFP7, CFP10 (EMBL accession number: AL022120), CFP17, CFP21, CFP25, CFP29, MPB59, MPT59, MPB64, and MPT64.
The vaccines are conventionally administered parenterally, by injection, for example, either subcutaneously or intramuscularly. Additional formulations which are suitable for other modes of administration include suppositories and, in some cases, oral formulations. For suppositories, traditional binders and carriers may include, for example, polyalkylene glycols or triglycerides; such suppositories may be formed from mixtures containing the active ingredient in the range of 0.5% to 10%, preferably 1-2%.
Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like. These compositions take the form of solutions, suspensions, tablets, pills, capsules, sustained release formulations or powders and contain 10-95% of active ingredient, preferably 25-70%.
The proteins may be formulated into the vaccine as neutral or salt forms.
Pharmaceutically acceptable salts include acid addition salts (formed with the free amino groups of the peptide) and which are formed with inorganic acids such as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups may also be derived from inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino ethanol, histidine, procaine, and the like.
The vaccines are administered in a manner compatible with the dosage formulation, and in such amount as will be therapeutically effective and immunogenic. The quantity to be administered depends on the subject to be treated, including, e.g., the capacity of the individual's immune system to mount an immune response, and the degree of protection desired. Suitable dosage ranges are of the order of several hundred micrograms of active ingredient per vaccination with a preferred range from about 0.1 µg to 1000 µg, such as in the range from about 1 µg to 300 µg, and especially in the range from about 10 µg to 50 µg. Suitable regimes for initial administration and booster shots are also variable but are typified by an initial administration followed by subsequent inoculations or other administrations.
The manner of application may be varied widely. Any of the conventional methods for administration of a vaccine are applicable. Preferred routes of administration are the parenteral route such as the intravenous, intraperitoneal, intramuscular, subcutaneous or intradermal routes; the oral (on a solid physiologically acceptable base or in a physiologically acceptable dispersion), buccal, sublingual, nasal, rectal or transdermal routes. The dosage of the vaccine will depend on the route of administration and will vary according to the age of the person to be vaccinated and, to a lesser degree, the weight of the person to be vaccinated.
Some of the polypeptides of the vaccine are sufficiently immunogenic in a vaccine, but for some of the others the immune response will be enhanced if the vaccine further comprises an adjuvant substance.
Various methods of achieving adjuvant effect for the vaccine include use of agents such as aluminum hydroxide or phosphate (alum), commonly used as a 0.05 to 0.1 percent solution in phosphate buffered saline, admixture with synthetic polymers of sugars (Carbopol) used as a 0.25 percent solution, aggregation of the protein in the vaccine by heat treatment with temperatures ranging between 70° and 101°C for 30 second to 2 minute periods respectively. Aggregation by reactivating with pepsin treated (Fab) antibodies to albumin, mixture with bacterial cells such as C. parvum or endotoxins or lipopolysaccharide components of gram-negative bacteria, emulsion in physiologically acceptable oil vehicles such as mannide mono-oleate (Arlacel A) or emulsion with 20 percent solution of a perfluorocarbon (Fluosol-DA) used as a blood substitute may also be employed. According to the invention DDA (dimethyldioctadecylammonium bromide) is an interesting candidate for an adjuvant, but also Freund's complete and incomplete adjuvants as well as QuilA and RIBI adjuvants are interesting possibilities.
Other possibilities to enhance the immunogenic effect involve the use of immune modulating substances such as lymphokines (e.g. IFN-γ, IL-2 and IL-12) or synthetic IFN-γ inducers such as poly I:C in combination with the above-mentioned adjuvants.
In many instances, it will be necessary to have multiple administrations of the vaccine, usually not exceeding six vaccinations, more usually not exceeding four vaccinations and preferably one or more, usually at least about three vaccinations. The vaccinations will normally be at from two to twelve week intervals, more usually from three to five week intervals. Periodic boosters at intervals of 1-25 years, such as 20 years, preferably 15 or 10 years, more preferably 1-5 years, usually three years, will be desirable to maintain the desired levels of protective immunity.
In one embodiment of the invention a composition is produced comprising as the effective component a micro-organism, wherein the micro-organism is a bacterium such as Mycobacterium, Salmonella, Pseudomonas and Escherichia, preferably Mycobacterium bovis BCG, and wherein at least one copy, such as at least 2 copies, such as at least 5 copies, of a nucleotide fragment comprising a nucleotide sequence encoding a polypeptide of the invention has been incorporated into the genome of the micro-organism or introduced as a part of an expression vector in a manner allowing the micro-organism to express and optionally secrete the polypeptide. In a preferred embodiment, the composition comprises at least 2 different nucleotide sequences encoding at least 2 different polypeptides of the invention. In a much preferred embodiment, the composition comprises at least different nucleotide sequences encoding at least one polypeptide of the invention and at least one polypeptide belonging to the group of ST-CF (Elhay MJ and Andersen P, Immunology and cell Biology (1997) 75, 595-603) such as ESAT-6, CFP7, CFP10, CFP17, CFP21, CFP25, CFP29, MPB59, MPT59, MPB64, and MPT64.
Individuals infected with virulent Mycobacteria can generally be divided into two groups. The first group has an infection with a virulent Mycobacterium, e.g. contacts of TB patients. The virulent Mycobacterium may have established colonies in the lungs, but the individual has, as yet, no symptoms of TB. The second group has clinical symptoms of TB, as a TB patient.
In one embodiment of the invention, any of the above mentioned polypeptides are used for the manufacture of a diagnostic reagent that preferably distinguishes a subclinically or clinically infected individual (group I and group II) from an individual who has been BCG vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis Mycobacterium (NTM), and may distinguish a subclinically or clinically infected individual from an individual who has cleared a previous infection with a virulent Mycobacterium. It is most likely that specific polypeptides derived from SPE will identify group I and/or group II from individuals not infected with virulent Mycobacteria in the same way as ESAT-6 and CFP10 (P. Ravn et al., (1998), J. Infectious Disease 179:637-45).
In one embodiment of the invention, any of the above discussed polypeptides are used for the manufacture of a diagnostic reagent for the diagnosis of an infection with a virulent Mycobacterium. One embodiment of the invention provides a diagnostic reagent for differentiating an individual who is clinically or subclinically infected with a virulent Mycobacterium from an individual not infected with virulent Mycobacterium, i.e. an individual who has been BCG vaccinated or infected with Mycobacterium avium or sensitised by non-tuberculosis Mycobacterium (NTM). Such a diagnostic reagent will distinguish between an individual in group I and/or II of the infection stages above, from an individual who has been vaccinated against TB. Another embodiment of the invention provides a diagnostic reagent for differentiating an individual who is clinically or subclinically infected with a virulent Mycobacterium from an individual who has a cleared infection with a virulent Mycobacterium. Such a diagnostic reagent will distinguish between an individual in group I and/or II of the infection stages above, from an individual who has cleared the infection.
Determination of an infection with virulent Mycobacterium will be instrumental in the still very laborious diagnostic process of tuberculosis. A number of possible diagnostic assays and methods can be envisaged (some more specifically described in the examples and the list of properties): a sample comprising whole blood or mononuclear cells (i.a. T-lymphocytes) from a patient could be contacted with a sample of one or more polypeptides of the invention. This contacting can be performed in vitro and a positive reaction could e.g. be proliferation of the T-cells or release of cytokines such as IFN-γ into the extracellular phase (e.g. into a culture supernatant).
Alternatively, a sample of a possibly infected organ may be contacted with an antibody raised against a polypeptide of the invention. The demonstration of the reaction by means of methods well-known in the art between the sample and the antibody will be indicative of ongoing infection and could be used to monitor treatment effect by reduction in responses. It is of course also a possibility to demonstrate the presence of anti-Mycobacterial antibodies in serum by contacting a serum sample from a subject with at least one of the polypeptide fragments of the invention and using well-known methods for visualising the reaction between the antibody and antigen such as ELISA, Western blot, precipitation assays.
Also a method of determining the presence of virulent Mycobacterium nucleic acids in a mammal, including a human being, or in a sample, comprising incubating the sample with a nucleic acid sequence of the invention or a nucleic acid sequence complementary thereto, and detecting the presence of hybridised nucleic acids resulting from the incubation (by using the hybridisation assays which are well-known in the art), is included in the invention. Such a method of diagnosing TB might involve the use of a composition comprising at least a part of a nucleotide sequence as defined above and detecting the presence of nucleotide sequences in a sample from the animal or human being to be tested which hybridises with the nucleic acid sequence (or a complementary sequence) by the use of PCR techniques.
The invention also relates to a method of diagnosing infection caused by a virulent Mycobacterium in a mammal, including a human being, comprising locally applying (patch test) or intradermally injecting (Mantoux test) a polypeptide of the invention. Both tests rely on a delayed-type hypersensitivity (DTH) reaction. A positive skin response at the location of injection or application is indicative of the mammal, including a human being, being infected with a virulent Mycobacterium, and a negative skin response at the location of injection or application is indicative of the mammal, including a human being, not having TB. A positive response is a skin reaction having a diameter of at least 5 mm larger than background, but larger reactions are preferred, such as at least 1 cm, 1.5 cm, and at least 2 cm in diameter. A skin reaction is here to mean erythema or induration of the skin, as directly measured. The composition used as the skin test reagent can be prepared in the same manner as described for the vaccines above.
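The read-out rule for the skin test can be sketched in a few lines of Python. This is only an illustration of the stated 5 mm criterion; the function name, the default background of 0 mm, and the example measurements are assumptions, not part of the specification.

```python
def classify_dth(reaction_mm: float, background_mm: float = 0.0,
                 threshold_mm: float = 5.0) -> str:
    """Classify a skin reaction as positive or negative.

    A reaction counts as positive when its diameter exceeds the background
    by at least `threshold_mm` (5 mm in the text above).
    """
    excess = reaction_mm - background_mm
    return "positive" if excess >= threshold_mm else "negative"


# Hypothetical measurements (mm): (reaction, background)
readings = [(12.0, 2.0), (6.0, 2.0), (7.0, 2.0)]
results = [classify_dth(r, b) for r, b in readings]
print(results)
```

Larger preferred cut-offs (for example 10, 15 or 20 mm) can be explored by passing a different `threshold_mm`.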
In human volunteers, the generation of a significant immune response can alternatively be defined as the ability of the reagent being tested to stimulate an in vitro recall response by peripheral blood cells from at least 30% of PPD positive individuals previously vaccinated with that reagent or infected with a virulent Mycobacterium, said recall response being defined as proliferation of T cells or the production of cytokine(s) which is higher than the responses generated by cells from unimmunised or uninfected control individuals, with a 95% confidence interval as defined by an appropriate statistical analysis such as a Student's two-tailed T test.
Alternatively, a significant immune response could be detected in vivo by a test such as the generation of delayed type hypersensitivity in the skin in response to exposure to the immunising reagent, such response being significantly larger (with a 95% confidence interval as defined by appropriate statistical analysis such as a Student's two-tailed T test) in at least 30% of vaccinated or infected individuals than in placebo-treated or uninfected individuals.
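The two ingredients of this definition, a Student's t comparison between groups and a responder fraction of at least 30%, can be sketched with the Python standard library. The sample values and the responder cut-off below are hypothetical, and the sketch stops at the t statistic and degrees of freedom: turning these into a two-tailed p-value needs a t-distribution table or a statistics package.

```python
from math import sqrt
from statistics import mean, variance


def student_t(sample_a, sample_b):
    """Return (t statistic, degrees of freedom) for Student's pooled-variance t test."""
    n_a, n_b = len(sample_a), len(sample_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(sample_a)
                  + (n_b - 1) * variance(sample_b)) / df
    std_err = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(sample_a) - mean(sample_b)) / std_err, df


def responder_fraction(responses, cutoff):
    """Fraction of individuals whose response exceeds the cutoff."""
    return sum(r > cutoff for r in responses) / len(responses)


# Hypothetical read-outs (arbitrary units) for illustration only
vaccinated = [4, 5, 6]
controls = [1, 2, 3]
t_stat, dof = student_t(vaccinated, controls)
fraction = responder_fraction(vaccinated, cutoff=max(controls))
print(f"t = {t_stat:.3f}, df = {dof}, responders = {fraction:.0%}")
```

Taking the highest control value as the responder cut-off is one possible convention among several; the text itself does not prescribe one.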
The polypeptides according to the invention may be potential drug targets.
Once a particularly interesting polypeptide has been identified, the biological function of that polypeptide may be tested. The polypeptides may constitute receptor molecules or toxins which facilitate the infection by the Mycobacterium, and if such functionality is blocked, the infectivity of the virulent Mycobacterium will be diminished.
The biological function of particularly interesting polypeptides may be tested by studying the effect of inhibiting the expression of the polypeptides on the virulence of the virulent Mycobacterium. This inhibition may be performed at the gene level, such as by blocking the expression using antisense nucleic acid, PNA or LNA or by interfering with regulatory sequences, or the inhibition may be at the level of translation or post-translational processing of the polypeptide.
Once a particular polypeptide according to the invention is identified as critical for virulence, an anti-mycobacterial agent might be designed to inhibit the expression of that polypeptide. Such anti-mycobacterial agent might be used as a prophylactic or therapeutic agent. For instance, antibodies or fragments thereof, such as Fab and (Fab')2 fragments, can be prepared against such critical polypeptides by methods known in the art and thereafter used as prophylactic or therapeutic agents.
A presently preferred embodiment is an extract of polypeptides obtainable by a method comprising the steps of
a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a) at 2,000g for 40 minutes;
c) resuspending the pellet of b) in PBS and 0.5% Tween 20 and sonicating with rounds of 90 seconds;
d) centrifugating the suspension of c) at 5,000g for 30 minutes;
e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane components from the supernatant of d) with 10% SDS;
f) centrifugating the extract of e) at 20,000g for 30 minutes;
g) precipitating the supernatant of f) with 8 volumes of cold acetone;
with an adjuvant substance.
In other words, the invention relates to use of an extract of polypeptides with an adjuvant substance for the preparation of a composition for the generation or determination of an immune response against a virulent Mycobacterium.
Finally, a monoclonal or polyclonal antibody, which is specifically reacting with a polypeptide of the invention in an immuno assay, or a specific binding fragment of said antibody, is also a part of the invention. The production of such polyclonal antibodies requires that a suitable animal be immunized with the polypeptide and that these antibodies are subsequently isolated, suitably by immune affinity chromatography.
The production of monoclonals can be effected by methods well-known in the art, since the present invention provides for adequate amounts of antigen for both immunization and screening of positive hybridomas.
Examples
EXAMPLE 1: Total extraction of proteins from dead M.tuberculosis bacteria.
1.5 x 10⁹ bacteria/ml M.tuberculosis was heat treated at 55°C for 1.5 hours and checked for sterility. 10 ml of these heat killed bacteria was centrifuged at 2000 g for 40 min; the supernatant was discarded and the pellet resuspended in PBS containing 0.5% Tween 20 and used as the antigen source. The pellet was sonicated with 20 rounds of 90 seconds and centrifuged 30 min at 5000 g to remove unbroken cells. The supernatant containing soluble proteins as well as cell wall and cell membrane components was extracted twice with 10% SDS to release proteins inserted in the cell wall and membrane compartments. After a centrifugation at 20,000 g for 30 min the supernatant was precipitated with 8 volumes of cold acetone and resuspended in PBS at a protein concentration of 5 mg/ml and named: Somatic Proteins Extract (SPE).
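The centrifugation steps of Example 1 are given as relative centrifugal force (g). A minimal Python sketch, assuming a hypothetical rotor radius of 10 cm (the text does not name a rotor), converts these values to rotor speed with the standard relation RCF = 1.118 x 10⁻⁵ x r(cm) x rpm². The step labels are paraphrased from the example.

```python
from math import sqrt

RCF_CONSTANT = 1.118e-5  # standard factor for radius in cm and speed in rpm


def rpm_for_rcf(rcf_g: float, radius_cm: float) -> float:
    """Rotor speed (rpm) needed to reach a given relative centrifugal force."""
    return sqrt(rcf_g / (RCF_CONSTANT * radius_cm))


def rcf_for_rpm(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (x g) produced at a given rotor speed."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


# Forces taken from Example 1; the 10 cm radius is an assumption
ASSUMED_RADIUS_CM = 10.0
steps = {
    "pellet heat-killed bacteria": 2000,
    "remove unbroken cells": 5000,
    "clear SDS extract": 20000,
}
for label, rcf in steps.items():
    print(f"{label}: {rcf} g is about {rpm_for_rcf(rcf, ASSUMED_RADIUS_CM):.0f} rpm")
```

Because the required speed scales with the inverse square root of the radius, the actual rotor radius should be substituted before using any of these figures.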
Analysis of protective immune response for tuberculosis after immunisation with different M.tuberculosis protein preparations.
The protective efficacy of SPE was evaluated in a vaccination experiment and compared to the two vaccines ST-CF and BCG, known to induce protection against TB.
Five groups of 6-8 weeks old, female C57BL/6J mice (Bomholtgaard, Denmark) were immunised subcutaneously at the base of the tail with vaccines of the following composition:
Group 1: BCG
Group 2: 1 x 10⁷ heat killed M.tuberculosis/DDA (250 µg DDA)
Group 3: 50 µg ST-CF/DDA (250 µg)
Group 4: 50 µg SPE/DDA (250 µg)
Group 5: Adjuvant control: DDA (250 µg) in NaCl
The animals were injected with a volume of 0.2 ml. The mice of groups 2, 3 and 4 were boosted twice at two-week intervals.
Four weeks after the last immunisation three mice/group were sacrificed and the spleens removed. The immune response induced in the spleen cells was monitored by release of IFN-γ into the culture supernatants when stimulated in vitro with relevant antigens (Table 2). ST-CF and SPE induced a similar immune response while only a very low IFN-γ release was observed after immunisation with BCG and stimulation with ST-CF.
Table 2. Recognition of protein preparations after immunisation, presented as IFN-γ release (pg/ml) after restimulation.
Immunogen    No antigen    ST-CF         SPE
ST-CF        <200          6752 ± 591    8431 ± 459
SPE          <200          6621 ± 203    11079 ± 178
BCG          <200          469 ± 32      ND
Seven weeks after the final immunisation the mice received a primary infection with 5 x 10⁵ H37Rv in 0.1 ml i.v., and two weeks later the mice were sacrificed and the spleens were isolated for bacterial enumeration (figure 2).
BCG induced a high level of protection in the spleen as expected, but so did the killed H37Rv, ST-CF and SPE. All preparations induced protection at almost the same level, with SPE as the most potent of these preparations.
These data demonstrate that there are components to be found among the somatic proteins of H37Rv which in an animal model protect against tuberculosis at the same level as BCG.
EXAMPLE 2: Subcellular fractionation of Mycobacterium tuberculosis
1.5 x 10⁹ colony forming units (CFU/ml) of M. tuberculosis H37Rv were inactivated by heat-killing at 60°C for 1.5 hours. The heat-killed Mycobacteria were centrifuged at 3,000 x g for 20 min; the supernatant was discarded and the pellet was resuspended in cold PBS. This step was repeated twice. After the final wash, the pellet was resuspended in a homogenising buffer consisting of PBS supplemented with 10 mM EDTA and 1 mM of phenylmethylsulfonyl fluoride in a ratio of 1 ml buffer per 0.5 g of heat-killed Mycobacteria. The sample was sonicated on ice for 15 min (1-min pulser on/10-sec pulser off) and subsequently lysed three times with a French Pressure Cell at 12,000 lb/in². The lysate was centrifuged at 27,000 x g for 20 min; the pellet was washed in homogenising buffer and recentrifuged. The pooled supernatants contained a mixture of cytosol and membrane components, while the pellet represented the crude cell wall.
Preparation of cell wall
RNase and DNase were added to the cell wall pellet, resuspended in homogenising buffer, to a final concentration of 1 mg/ml, and the mixture was incubated overnight at 4°C. The cell wall was washed twice in homogenising buffer, twice in homogenising buffer saturated with KCl, and twice with PBS. Soluble proteins were extracted from the cell wall by a 2 hour incubation with 2% SDS at 6°C. The insoluble cell wall core was removed by a centrifugation at 27,000 x g for 20 min and the SDS-extraction was repeated. Finally, the pooled supernatants were precipitated with 6 volumes of chilled acetone and resuspended in PBS.
Preparation of cytosol and membrane:
To separate the cytosol and the membrane fraction, the pooled supernatants were ultracentrifugated at 100,000 x g for 2 hours at 5°C. The cytosol proteins in the supernatant were precipitated with acetone and resuspended in PBS. The pellet, representing the membrane fraction, was washed in PBS, ultracentrifugated, and finally resuspended in PBS.
Triton X-114 extraction of cell wall and membrane:
To prepare protein fractions largely devoid of lipoarabinomannan, the cell wall and the membrane fraction were subjected to extraction with precondensed Triton X-114.
Triton X-114 was added to the protein sample at a final concentration of 4%. The solution was mixed on ice for 60 min and centrifuged at 20,000 x g for 15 min at 4°C. The pellet containing residual insoluble material was extracted once more (membrane) or twice (cell wall), while the supernatant was warmed to 37°C to condense the Triton X-114. After centrifugation of the supernatant at 12,000 x g for 15 min, the aqueous phase and detergent phase were separated. The aqueous phase and detergent phase were washed twice with Triton X-114 and PBS, respectively. The combined aqueous phases and residual insoluble material containing the majority of proteins were pooled, precipitated with acetone, and resuspended in PBS.
The specificity of the human T-cell response in TB patients was investigated by stimulating PBMCs with panels of narrow molecular mass fractions from membrane, cell wall, and cytosol obtained by the multi-elution technique described by Andersen et al. (1993) J. Immunol. Methods 161:29-39. The technique resulted in 30 sharply defined fractions and allowed an identification of immunologically active regions with potential as either diagnostic reagents or vaccine components.
The study demonstrated that multiple targets within the cell wall, membrane, and cytosol were recognised by the donors and initiated IFN-γ release as well as cellular proliferation (unpublished results). The broad cellular response was directed towards both the low molecular mass fractions and some of the higher molecular mass fractions.
These experiments suggest the existence of numerous target antigens among the cell wall, membrane, and cytosol fractions and it is therefore likely that some of these will have a potential as a protective or diagnostic reagent.
EXAMPLE 3: Identification of proteins from the cytosolic fraction
Use of patient sera to identify M. tuberculosis antigens
This example illustrates the identification of antigens from the cytosol fraction by screening with serum from M. tuberculosis infected individuals in western blot. The reaction with serum was used as an indication that the proteins are recognised immunologically.
The cytosol was precipitated with ammonium sulphate at 80% saturation. The non-precipitated proteins were removed by centrifugation and precipitated proteins were resuspended in 20 mM imidazole pH 7.0. The protein solution was applied to a DEAE Sepharose 6B column, equilibrated with 20 mM imidazole pH 7.0. Bound protein was eluted from the column using a salt gradient from 0 to 1 M NaCl, in 20 mM imidazole pH 7.0. Fractions collected during elution were analysed on a silver stained 10-20% SDS-PAGE and on 2 dimensional electrophoresis.
For use in western blot a pool of serum from 5 TB patients was made. These patients ranged from minimal to severe TB. Nitrocellulose membranes were blocked with phosphate buffer, pH 7.3, containing 0.37 M NaCl and 0.5% Tween-20, for 30 min. The serum pool was diluted in phosphate buffer pH 7.3 containing 0.37 M NaCl. The blots were incubated in serum dilution overnight at room temperature on a shaker.
Membranes were washed four times for 5 min in the dilution buffer, and incubated with 1:1,000 diluted peroxidase-labelled swine anti human-IgG (P214, Dako) for 1 hour at room temperature on a shaker. Blots were then washed four times for 5 min in the dilution buffer and stained with DONS/TMB.
N-terminal sequencing and amino acid analysis
Proteins of the fractions containing bands reactive with serum from TB patients in Western blot were separated by 2D electrophoresis. Gels were blotted to PVDF membranes and spots subjected to N-terminal sequencing on a Procise sequencer (Applied Biosystems).
The following N-terminal sequences were obtained:
For TB15: TERTAVLIKPDGIER (SEQ ID NO: 39)
For TB18: TDTQVTWLTQESHDR (SEQ ID NO: 40)
For TB21: MIDEALFDAEEKMEK (SEQ ID NO: 41)
For TB33: PLPADPSTDLSAYAQ (SEQ ID NO: 42)
For TB38: MLISQRPTLSEDVLT (SEQ ID NO: 43)
For TB54: TGNLVTKNSLTPDVR (SEQ ID NO: 44)
Sequence identity searches
The N-terminal sequences obtained were used for an identity search using the blast program of the Sanger M. tuberculosis database:
http://www.sanger.ac.uk/Projects/M_tuberculosis/blast_server.shtml
In addition, the GenEMBL database was searched using the BLASTP program (Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J. Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10), to reveal proteins with homology to the full amino acid sequences obtained from the Sanger database.
Thereby, the following information was obtained:
For the 15 determined N-terminal amino acids for TB15 a 93% identical sequence was found in MTV008.01c. Amino acid 5 of the determined N-terminal sequence (A) is an L in the sequence MTV008.01c.
Within the open reading frame the translated protein is 136 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 136 amino acids, which corresponds to a theoretical molecular mass of 14 509 Da and a theoretical pI of 5.36. The observed mass in SDS-PAGE is 14 kDa.
TB15 has 80% sequence identity in a 139 amino acid overlap to a protein of M. smegmatis. It is homologous to putative nucleoside diphosphate kinases from several species, e.g. 59% sequence identity to a 151 amino acid protein of Archaeoglobus fulgidus and 57% sequence identity to a 149 amino acid protein of Bacillus subtilis.
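The 93% identity reported for the TB15 N-terminus corresponds to one mismatch in 15 residues. The short Python sketch below illustrates that arithmetic. The reference string is reconstructed here from the statement that residue 5 (A) is an L in MTV008.01c; it is not copied from the database entry, and the function name is illustrative.

```python
def percent_identity(determined: str, reference: str) -> float:
    """Percentage of positions at which two equal-length sequences agree."""
    if len(determined) != len(reference):
        raise ValueError("sequences must have the same length")
    matches = sum(a == b for a, b in zip(determined, reference))
    return 100.0 * matches / len(determined)


# Determined N-terminal sequence of TB15 (SEQ ID NO: 39)
tb15_determined = "TERTAVLIKPDGIER"
# Reconstructed reference: same sequence with residue 5 changed from A to L
tb15_reference = tb15_determined[:4] + "L" + tb15_determined[5:]

identity = percent_identity(tb15_determined, tb15_reference)
print(f"{identity:.0f}% identity over {len(tb15_determined)} residues")
```

This is a plain position-by-position comparison without gaps, which is adequate for a 15-residue N-terminal read but is not a substitute for a BLAST alignment.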
For the 15 determined N-terminal amino acids for TB18 a 100% identical sequence was found in MTCY017.33c.
Within the open reading frame the translated protein is 164 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 164 amino acids, which corresponds to a theoretical molecular mass of 17 855 Da and a theoretical pI of 4.81. The observed mass in SDS-PAGE is 20 kDa.
TB18 has 94% sequence identity, in a 164 amino acid overlap, to a protein from M. leprae. In addition, it is homologous to transcription elongation factors from several species, e.g. 32% sequence identity in a 114 amino acid overlap, to a protein from Zymomonas mobilis.
For the 15 determined N-terminal amino acids for TB21 a 100% identical sequence was found in MTCY274.13c.
Within the open reading frame the translated protein is 185 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This corresponds to a theoretical molecular mass of 20 829 Da and a theoretical pI of 5.81. The observed mass in SDS-PAGE is 22 kDa.
TB21 has 90% sequence identity in a 185 amino acid overlap to a protein from M. leprae. In addition, it is homologous to ribosome recycling factors from several species, e.g. 63% in a 185 amino acid overlap to a protein from Streptomyces coelicolor.
For the 15 determined N-terminal amino acids for TB33 an 85% identical sequence was found in MTCY71.23. Amino acids 8 and 9 of the determined N-terminal sequence (T and D) are a P and a T in MTCY71.23, respectively.
Within the open reading frame the translated protein is 297 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 297 amino acids, which corresponds to a theoretical molecular mass of 33 323 Da and a theoretical pI of 4.91. The observed mass in SDS-PAGE is 35 kDa.
TB33 has 83% sequence identity in a 296 amino acid overlap to a protein from M. leprae. In addition, it is homologous to thiosulphate sulfurtransferases (rhodanese) from several species, e.g. 48% in a 131 amino acid overlap to rhodanese from Saccharopolyspora erythraea.
For the 15 determined N-terminal amino acids for TB38 a 100% identical sequence was found in MTCY13E12.10c.
Within the open reading frame the translated protein is 347 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This corresponds to a theoretical molecular mass of 37 710 Da and a theoretical pI of 4.53. The observed mass in SDS-PAGE is 38 kDa.
TB38 is homologous to DNA-directed RNA polymerase alpha-chains from several species, e.g. 79% in a 321 amino acid overlap to a protein from Streptomyces coelicolor.
For the 15 determined N-terminal amino acids for TB54 a 100% identical sequence was found in MTCY20B11.23c.
Within the open reading frame the translated protein is 495 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 495 amino acids, which corresponds to a theoretical molecular mass of 54 329 Da and a theoretical pI of 5.00. The observed mass in SDS-PAGE is 60 kDa.
TB54 is homologous to adenosylhomocysteinases from several species, e.g. 73% in a 90 amino acid overlap to S-adenosyl-L-homocysteine hydrolase from Triticum aestivum. It contains an S-adenosyl-L-homocysteine hydrolase signature (PS00739).
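For each protein in Example 3 the text gives a theoretical molecular mass and an apparent mass from SDS-PAGE. As an illustrative sketch only, the Python snippet below tabulates how far the apparent mass deviates from the theoretical value; the mass figures are taken from the text, while the layout and the percentage measure are choices made here.

```python
# (theoretical mass in Da, observed SDS-PAGE mass in kDa), as quoted in Example 3
masses = {
    "TB15": (14509, 14),
    "TB18": (17855, 20),
    "TB21": (20829, 22),
    "TB33": (33323, 35),
    "TB38": (37710, 38),
    "TB54": (54329, 60),
}


def percent_deviation(theoretical_da: float, observed_kda: float) -> float:
    """Deviation of the observed mass from the theoretical mass, in percent."""
    return 100.0 * (observed_kda * 1000.0 - theoretical_da) / theoretical_da


deviations = {name: percent_deviation(*pair) for name, pair in masses.items()}
for name, dev in deviations.items():
    print(f"{name}: {dev:+.1f}%")
```

SDS-PAGE mobility gives only an approximate mass, so deviations of this size are not by themselves evidence against the assignments.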
Example 3a: Use of patient sera to identify M. tuberculosis cytosol antigens.
Anion exchange chromatography of the cytosol proteins and Western blot experiments with a pool of sera from TB patients were performed as described in Example 3.
N-terminal sequencing
Proteins of the fractions containing TB12.5, TB20.6, and TB40.8 were separated by 2D electrophoresis. Gels were blotted to PVDF membranes and spots subjected to N-terminal sequencing on a Procise sequencer (Applied Biosystems).
The following N-terminal sequences were obtained:
For TB12.5: ALKVEMVTFDXSDPA (SEQ ID NO: 80)
For TB20.6: ADADTTDFDVDAEAP (SEQ ID NO: 81)
For TB40.8: SKTVLILGAGVGGLT (SEQ ID NO: 82)
Sequence identity searches were performed as described in Example 3.
Thereby, the following information was obtained:
TB12.5
For the 15 determined N-terminal amino acids of TB12.5 a 93% identical sequence was found in Rv0801. The X in position 11 is a cysteine.
Within the open reading frame the translated protein is 115 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 115 amino acids, which corresponds to a theoretical molecular mass of 12 512 Da and a theoretical pI of 4.91. The observed mass in SDS-PAGE is 14 kDa.
No homology was found to TB12.5.
TB20.6
For the 15 determined N-terminal amino acids of TB20.6 a 100% identical sequence was found in Rv3920c.
Within the open reading frame the translated protein is 187 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This gives a protein of 187 amino acids, which corresponds to a theoretical molecular mass of 20 559 Da and a theoretical pI of 4.14. The observed mass in SDS-PAGE is 24 kDa.
TB20.6 has 73% homology to a 193 amino acid protein of M. leprae. It has 59% homology in a 184 amino acid overlap to a Jag-like protein from Streptomyces coelicolor.
TB40.8
For the 15 determined N-terminal amino acids of TB40.8 a 100% identical sequence was found in Rv0331.
Within the open reading frame the translated protein is 388 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 388 amino acids, which corresponds to a theoretical molecular mass of 40 792 Da and a theoretical pI of 5.06. The observed mass in SDS-PAGE is 44 kDa.
No homology was found to TB40.8.
Identification of abundant proteins
As immunity to tuberculosis is not B-cell but T-cell mediated, reactivity with serum from TB patients was not the only selection criterion used to identify proteins from the cytosol. Further proteins were selected by virtue of their abundance in the cytosol.
The cytosol was precipitated with ammonium sulphate at 80% saturation. The non-precipitated proteins were removed by centrifugation and precipitated proteins were resuspended in 20 mM imidazole, pH 7.0. The protein solution was applied to a DEAE Sepharose 6B column, equilibrated with 20 mM imidazole. Bound protein was eluted from the column using a salt gradient from 0 to 1 M NaCl, in 20 mM imidazole. Fractions collected during elution were analyzed on a silver stained 10-20% SDS-PAGE and on 2 dimensional electrophoresis. Fractions containing well separated bands were selected for 2D electrophoresis and blotted to PVDF, after which spots, visualised by staining with Coomassie Blue, were selected for N-terminal sequencing.
The following N-terminal sequences were obtained:
For TB10C: MEVKIGITDSPRELV (SEQ ID NO: 45)
For TB15A: SAYKTVVVGTDDXSX (SEQ ID NO: 46)
For TB17: MEQRAELVVGRALVV (SEQ ID NO: 47)
For TB24: ADIDGVTGSAGL(N)PA (SEQ ID NO: 48)
For TB27B: TYETILVERDQRVGI (SEQ ID NO: 49)
No sequence identity was found when searching the Sanger database using the blast program. However, when the blast program at Swiss-blast was used, a sequence was obtained.
For the 15 determined N-terminal amino acids for TB10C a 93% identical sequence was obtained. The first amino acid of the N-terminal sequence (M) is a V in the sequence found, corresponding to GTG being used as a start codon, instead of ATG.
Within the open reading frame the translated protein is 90 amino acids. The N-terminal sequence of the protein identified in the cytosol starts at amino acid 1.
This corresponds to a theoretical molecular mass of 9 433 Da and a theoretical pI of 4.93. The observed mass in SDS-PAGE is 10 kDa.
For the determined N-terminal sequence of TB15A a 78% identical sequence was found in MTCY0182.28. The X at position 13 of the determined N-terminal sequence corresponds to a G in MTCY0182.28 and the X at position 15 to a D.
Within the open reading frame the translated protein is 146 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 146 amino acids, which corresponds to a theoretical molecular mass of 15 313 Da and a theoretical pI of 5.60. The observed mass in SDS-PAGE is 16 kDa.
The highest sequence identity, 32% in a 34 amino acid overlap, was found to a conserved protein of Methanobacterium thermoautotrophicum.
For the 15 determined N-terminal amino acids for TB17 a 100% identical sequence was found in MTV044.12.
Within the open reading frame the translated protein is 165 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 1.
This gives a protein of 165 amino acids, which corresponds to a theoretical molecular mass of 16 793 Da and a theoretical pI of 4.22. The observed mass in SDS-PAGE is 18 kDa.
TB17 is homologous to putative molybdenum cofactor biosynthesis proteins from several species, e.g. 34% in a 103 amino acid overlap to moaCB from Synechococcus spp.
For the 15 determined N-terminal amino acids for TB24 a 92% identical sequence was found in MTCY07D11.03. The tentative N in position 13 of the determined amino acid sequence is a Q in MTCY07D11.03, and the A at position 15 is a G.
Within the open reading frame the translated protein is 216 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 216 amino acids, which corresponds to a theoretical molecular mass of 24 227 Da and a theoretical pI of 4.91. The observed mass in SDS-PAGE is 28 kDa.
TB24 is homologous to RNA polymerase sigma-E factors from several species, e.g. 55% in a 72 amino acid overlap to ECF sigma factor RpoE1 from Myxococcus xanthus.
For the 15 determined N-terminal amino acids for TB27B a 100% identical sequence was found in MTCY017.23c.
Within the open reading frame the translated protein is 257 amino acids long. The N-terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 257 amino acids, which corresponds to a theoretical molecular mass of 27 276 Da and a theoretical pI of 4.82. The observed mass in SDS-PAGE is 28 kDa.
TB27B has 86% sequence identity in a 257 amino acid overlap to a protein from M. leprae. In addition, it is homologous to enoyl-CoA hydratases from several species, e.g. 66% in a 257 amino acid overlap to a protein from Rhizobium meliloti.
Identification of TB13A
One protein spot was selected by its reaction with the monoclonal antibody ST-3 in western blot. N-terminal sequencing of the spot on the PVDF membrane corresponding to the ST-3 spot yielded the following results ForTB13A :PVTQEEIIAGIAEII
(SEQ ID NO: 50) Sequence identity search on the TB13A N-terminal sequence gave the following results:
For the 15 determined N-terminal amino acids for TB13A a 100% identical sequence was found in MTCY427.25.
Within the open reading frame the translated protein is 115 amino acids long.
The N
terminal sequence of the protein identified in the cytosol starts at amino acid no 2, with the N-terminal Met cleaved off.
This gives a protein of 115 amino acids, which corresponds to a theoretical molecular mass of 12 524 Da and a theoretical pl of 3.87. The observed mass in SDS-PAGE
is 10 kDa.
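The theoretical molecular masses quoted in this example follow from the amino acid composition of each translated open reading frame. A minimal sketch of that arithmetic is given below, assuming average residue masses and the addition of one water molecule per chain; the function name and the `remove_initial_met` flag are illustrative and not part of the original procedure, and the tool actually used to obtain the quoted values is not stated.

```python
# Average residue masses (Da) of the 20 standard amino acids.
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water per polypeptide chain (free N- and C-termini)


def theoretical_mass(sequence, remove_initial_met=False):
    """Return the average molecular mass (Da) of a one-letter protein sequence.

    remove_initial_met is a hypothetical convenience flag that models cleavage
    of the N-terminal Met before the mass is summed.
    """
    seq = sequence.upper()
    if remove_initial_met and seq.startswith("M"):
        seq = seq[1:]
    return sum(AVERAGE_RESIDUE_MASS[residue] for residue in seq) + WATER_MASS


# Example on the sequenced N-terminal fragment of TB13A only (not the full protein).
fragment_mass = theoretical_mass("PVTQEEIIAGIAEII")
```

Applied to a full translated sequence, the same sum gives a value comparable to the theoretical masses listed above; the apparent mass in SDS-PAGE can differ from it, as the text notes.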
TB13A has 94% sequence identity to a 115 amino acid protein of M. leprae. It is homologous to putative acyl carrier proteins from several species, e.g. 59%
sequence identity to a 78 amino acid protein of Myxococcus xanthus and 56% to a 82 amino acid protein from Streptomyces coelicolor.
Identification of TB64
Biotinylated proteins were purified from the cytosol fraction in the following way: 12 mg of the cytosol fraction was added to 100 µl of TetraLink Tetrameric Avidin Resin (Promega) in PBS, pH 7.4 in an Eppendorf tube. After incubation overnight at 4°C, centrifugation (1000 g for 5 min) was performed and the resin was washed five times with PBS, pH 7.4, each time followed by centrifugation and collection of the supernatant.
Thereafter, 100 µl of 4 times concentrated SDS-PAGE sample buffer (0.08 M Tris-HCl, 8% SDS, 16% glycerol, 24 mM EDTA, pH 8.0) was added to the resin and it was boiled for 20 minutes.
After centrifugation the supernatant was collected and analysed for the presence of biotinylated proteins: the sample was analysed on SDS-PAGE followed by semi-dry blotting to nitrocellulose. The nitrocellulose membranes were incubated with alkaline phosphatase labeled streptavidin (D396, DAKO, Glostrup, Denmark). Nitro-blue tetrazolium/5-bromo-4-chloro-3-indolyl phosphate was used as substrate.
N-terminal sequencing
The eluate from the TetraLink Tetrameric Avidin Resin was loaded on a precast 10-20% Tricine SDS-PAGE gel (Novex, San Diego, USA). After electrophoresis the gel was blotted to Problott PVDF membrane (Applied Biosystems, Foster City, CA) by semidry electroblotting in 10 mM CAPS, 10% methanol, pH 11. The PVDF membrane was stained with 0.1% Coomassie R-250 in 40% methanol, 1% acetic acid, and destained in 50% methanol. A band of 10 kDa, which was identified as a biotinylated protein as described above, was excised and subjected to N-terminal sequence analysis by automated Edman degradation using a Procise 494 sequencer (Applied Biosystems) as described by the manufacturer.
The following sequence was obtained:
VIRRKPKPRXR (SEQ ID NO: 57)
Submission of this sequence to the Sanger Centre M. tuberculosis blast server identified the open reading frame Rv3285 (91% identity in 11 amino acids) encoding a protein of 600 amino acids. The determined sequence showed identity to amino acids 511 to suggesting that the identified peptide is a C-terminal fragment of the protein. As expected, the pattern for biotinylation of a lysine was identified in the C-terminal part of the protein: GDLVVVLEAMKMENPVTA (residues 556-573, PROSITE pattern PS00188).
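The search for a biotinylation motif described above can be sketched as a regular-expression scan. The expression below is written to resemble the PROSITE biotin attachment site pattern and is an assumption; the authoritative pattern should be taken from the PROSITE entry PS00188 itself. The function name and the `first_residue` parameter are illustrative.

```python
import re

# Regular expression modelled on the biotin attachment site motif (assumed form;
# verify against the PROSITE entry before relying on it).
BIOTIN_SITE = re.compile(
    r"[GDN][DEQTR].[LIVMFY].{2}[LIVM].[AIV]MK[LVMAT].{3}[LIVM].[SATV]"
)
LYSINE_OFFSET = 10  # position of the K in "MK", counted from the motif start


def find_biotin_sites(protein, first_residue=1):
    """Return (motif start, matched segment, lysine position) for each hit.

    first_residue is the residue number of protein[0], so that a fragment can
    be scanned while keeping the numbering of the full-length protein.
    """
    hits = []
    for match in BIOTIN_SITE.finditer(protein.upper()):
        start = first_residue + match.start()
        hits.append((start, match.group(0), start + LYSINE_OFFSET))
    return hits


# The segment quoted in the text, numbered from residue 556.
sites = find_biotin_sites("GDLVVVLEAMKMENPVTA", first_residue=556)
```

With the numbering given in the text, the lysine in the matched segment falls ten positions after residue 556.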
EXAMPLE 4: Identification of proteins from the cell wall.
Identification of TB11B, TB16, TB16A, TB32, TB32A, and TB51.
Proteins contained in the cell wall fraction were separated by 2-D electrophoresis. A sample containing 120 mg protein was subjected to isoelectric focusing in a pH gradient from 4 to 7. The second dimension separation (SDS-PAGE) was carried out in a 10-20% acrylamide gradient. After blotting onto a PVDF membrane, proteins could be visualised by Coomassie blue staining.
N-terminal sequencing.
The relevant spots were excised from the PVDF membrane and subjected to N-terminal sequencing using a Procise sequencer (Applied Biosystems). The following N-terminal sequences were obtained:
TB11B: PVVKINAIEVPAGA (SEQ ID NO: 51)
TB16: ADKTTQTIYIDADPG (SEQ ID NO: 52)
TB16A: PVLSKTVEVTADAAS (SEQ ID NO: 53)
TB32: SGNSSLGIIVGIDD (SEQ ID NO: 54)
TB32A: AEVLVLVEHAEGALK (SEQ ID NO: 55)
TB51: MKSTVEQLSPTRVRI (SEQ ID NO: 56)
N-terminal sequence identity searching and identification of the corresponding genes.
The N-terminal amino acid sequence from each of the proteins identified was used for a sequence identity search using the tblastn program at NCBI:
http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-blast?Jform=0
The following information was obtained:
TB11B:
The 14 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid SCY06F7.
The identity is found within an open reading frame of 105 amino acids length corresponding to a theoretical molecular mass of 11 185 Da and a pI of 6.18.
The apparent molecular mass in an SDS-PAGE gel is 12 kDa.
The amino acid sequence shows some low level similarity to oxygenases and hypothetical proteins.
TB16:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTV021.
The identity is found within an open reading frame of 144 amino acids length corresponding to a theoretical molecular mass of 16 294 Da and a pI of 4.64.
The apparent molecular mass in an SDS-PAGE gel is 17 kDa.
The amino acid sequence shows some similarity to other hypothetical Mycobacterial proteins.
TB16A:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found on cosmid 128.
The identity is found within an open reading frame of 146 amino acids length corresponding to a theoretical molecular mass of 16 060 Da and a pI of 4.44.
The apparent molecular mass in an SDS-PAGE gel is 14 kDa.
TB32:
The 14 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTCY1A10.
The identity is found within an open reading frame of 297 amino acids length corresponding to a theoretical molecular mass of 31 654 Da and a pI of 5.55.
The apparent molecular mass in an SDS-PAGE gel is 33 kDa.
The amino acid sequence shows some similarity to other hypothetical Mycobacterial proteins.
TB32A:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTV012.
The identity is found within an open reading frame of 318 amino acids length corresponding to a theoretical molecular mass of 31 694 Da and a pI of 4.61.
The apparent molecular mass in an SDS-PAGE gel is 32 kDa.
The amino acid sequence reveals high sequence identity to the fixB gene product from several organisms. Probable electron transfer flavoprotein alpha subunit for various dehydrogenases. Equivalent to Mycobacterium leprae FixB.
TB51:
The 15 aa N-terminal sequence was found to be 100% identical to a sequence found within the Mycobacterium tuberculosis sequence MTV008.
The identity is found within an open reading frame of 466 amino acids length corresponding to a theoretical molecular mass of 50 587 Da and a pI of 4.3. The apparent molecular mass in an SDS-PAGE gel is 56 kDa.
The amino acid sequence shows similarities to trigger factor from several organisms.
Possible chaperone protein.
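The identity figures quoted in Examples 3 and 4 (for instance 100% over 15 amino acids, or 91% over 11 amino acids) are simple ratios over an ungapped overlap. A minimal sketch follows, assuming that an undetermined residue X in the sequenced peptide is counted as a mismatch; the subject string used in the example is hypothetical and is not taken from the Rv3285 sequence.

```python
def percent_identity(query, subject):
    """Percent identical positions over an ungapped overlap of equal length.

    An undetermined residue 'X' in the query is counted as a mismatch
    (an assumption made for this sketch).
    """
    if len(query) != len(subject):
        raise ValueError("sequences must cover the same overlap length")
    identical = sum(
        1
        for q, s in zip(query.upper(), subject.upper())
        if q == s and q != "X"
    )
    return 100.0 * identical / len(query)


# Sequenced peptide with one undetermined position against a hypothetical
# database segment that agrees at the ten determined positions.
identity = percent_identity("VIRRKPKPRXR", "VIRRKPKPRAR")
```

Ten identical positions out of eleven round to the 91% reported for the TB64 peptide.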
EXAMPLE 5: Cloning of the genes encoding TB10C, TB13A, TB17, TB11B, TB16, TB16A, TB32, TB51
The genes encoding TB10C, TB13A, TB17, TB11B, TB16, TB16A, TB32, TB51 were all cloned into the E. coli expression vector pMCT3, by PCR amplification with gene specific primers.
Each PCR reaction contained 10 ng of M. tuberculosis chromosomal DNA in 1x low salt Taq+ buffer (Stratagene) supplemented with 250 µM of each of the four nucleotides (Boehringer Mannheim), 0.5 mg/ml BSA (IgG technology), 1% DMSO (Merck), 5 pmoles of each primer, and 0.5 unit Taq+ DNA polymerase (Stratagene) in 10 µl reaction volume.
Reactions were initially heated to 94°C for 25 sec. and run for 30 cycles according to the following program: 94°C for 10 sec., 55°C for 10 sec., and 72°C for 90 sec., using thermocycler equipment from Idaho Technology.
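The cycling program above can be written down as a small configuration, which also makes the total hold time explicit. This is a sketch only: the `Step` class and function name are illustrative, and ramp times between temperatures, which depend on the instrument, are not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # hold temperature in degrees Celsius
    seconds: int   # hold time in seconds


INITIAL_DENATURATION = Step(94, 25)
CYCLE = (Step(94, 10), Step(55, 10), Step(72, 90))
N_CYCLES = 30


def total_hold_seconds(initial, cycle, n_cycles):
    """Sum of programmed hold times, excluding temperature ramps."""
    return initial.seconds + n_cycles * sum(step.seconds for step in cycle)


hold_seconds = total_hold_seconds(INITIAL_DENATURATION, CYCLE, N_CYCLES)
```

The programmed holds add up to 25 + 30 x 110 = 3325 seconds, i.e. a little over 55 minutes before ramping is taken into account.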
The PCR fragment was ligated with TA cloning vector pCR 2.1 (Invitrogen) and transformed into E. coli. Plasmid DNA was thereafter prepared from clones harbouring the desired fragment, digested with suitable restriction enzymes and subcloned into the expression vector pMCT3 in frame with 6 histidine residues which are added to the N-terminus of the expressed proteins. The resulting clones were thereafter sequenced by cycle sequencing using the Dye Terminator system in combination with an automated gel reader (model 373A; Applied Biosystems) according to the instructions provided. Both strands of the DNA were sequenced.
Expression and metal affinity purification of recombinant proteins was undertaken essentially as described by the manufacturers. For each protein, 1 l LB medium containing 100 µg/ml ampicillin was inoculated with 10 ml of an overnight culture of XL1-Blue cells harbouring recombinant pMCT3 plasmids. Cultures were shaken at 37°C until they reached a density of OD600 = 0.4 - 0.6. IPTG was thereafter added to a final concentration of 1 mM and the cultures were further incubated for 4 - 16 hours. Cells were harvested, resuspended in 1x sonication buffer + 8 M urea and sonicated 5 x 30 sec. with 30 sec. pausing between the pulses.
After centrifugation, the lysate was applied to a column containing 10 ml of resuspended Talon resin (Clontech, Palo Alto, USA). The column was washed and eluted as described by the manufacturers.
After elution, all fractions (1.5 ml each) were subjected to analysis by SDS-PAGE using the Mighty Small (Hoefer Scientific Instruments, USA) system and the protein concentrations were estimated at OD280 nm. Fractions containing recombinant protein were pooled and dialysed against 3 M urea in 10 mM Tris-HCl, pH 8.5. The dialysed protein was further purified by FPLC (Pharmacia, Sweden) using 1 ml HiTrap columns (Pharmacia, Sweden) eluted with a linear salt gradient from 0 - 1 M NaCl.
Fractions were analysed by SDS-PAGE and protein concentrations were estimated at OD280 nm.
Fractions containing protein were pooled and dialysed against 25 mM Hepes buffer, pH 8.5.
Finally, the protein concentration and the LPS content were determined by the BCA (Pierce, Holland) and LAL (Endosafe, Charleston, USA) tests, respectively.
For cloning of the individual proteins, the following gene specific primers were used:
TB10C: Primers used for cloning of TB10C:
TB10C-F: CTG AGA TCT GTG GAG GTC AAG ATC GGT (SEQ ID NO: 58)
TB10C-R: CTC CCA TGG CTAC TTA CCC GCT CGT AGC AAC (SEQ ID NO: 59)
TB10C-F and TB10C-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB13A: Primers used for cloning of TB13A:
TB13A-F: CTG AGA TCT CCT GTC ACT CAG GAA GAA (SEQ ID NO: 60)
TB13A-R: CTC CCA TGG GAA ACC GCC ATT AGC GGT (SEQ ID NO: 61)
TB13A-F and TB13A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB17: Primers used for cloning of TB17:
TB17-F: CCC AAG CTT ATG GAA CAG CGT GCG GAG (SEQ ID NO: 62)
TB17-R: CTC CCA TGG CGA CAC TCG ATC CGG ATT (SEQ ID NO: 63)
TB17-F and TB17-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB11B: Primers used for cloning of TB11B:
TB11B-F: CTG AGA TCT ATG CCA GTG GTG AAG ATC (SEQ ID NO: 64)
TB11B-R: CTC CCA TGG TTA TGC AGT CTT GCC GGT (SEQ ID NO: 65)
TB11B-F and TB11B-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB16: Primers used for cloning of TB16:
TB16-F: CTG AGA TCT GCG GAC AAG ACG ACA CAG (SEQ ID NO: 66)
TB16-R: CTC CCA TGG TAC CGG AAT CAC TCA GCC (SEQ ID NO: 67)
TB16-F and TB16-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB16A: Primers used for cloning of TB16A:
TB16A-F: CTG AGA TCT CCA GTT TTG AGC AAG ACC (SEQ ID NO: 68)
TB16A-R: CTC CCA TGG GCA CAT GCC TTA GCT GGC (SEQ ID NO: 69)
TB16A-F and TB16A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB32: Primers used for cloning of TB32:
TB32-F: CTG AGA TCT ATG TCA TCG GGC AAT TCA (SEQ ID NO: 70)
TB32-R: CTC CCA TGG CTAC CTA AGT CAG CGA CTC GCG (SEQ ID NO: 71)
TB32-F and TB32-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB51: Primers used for cloning of TB51:
TB51-F: CTG AGA TCT GTG AAG AGC ACC GTC GAG (SEQ ID NO: 72)
TB51-R: CTC CCA TGG GTC ATA CGG TCA CGT TGT (SEQ ID NO: 73)
TB51-F and TB51-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB15A: Primers used for cloning of TB15A:
TB15A-F: CTG CCA TGG CTA GGT GGT GTG CAC GAT C (SEQ ID NO: 89)
TB15A-R: CTG AAG CTT ATG AGC GCC TAT AAG ACC (SEQ ID NO: 90)
TB15-F and TB15-R create NcoI and HindIII sites, respectively, used for the cloning in pMCT3.
TB21: Primers used for cloning of TB21:
TB21-F: CTG AGA TCT ATG ATT GAT GAG GCT CTC (SEQ ID NO: 91)
TB21-R: CTC CCA TGG AGC GGC CGC TAG ACC TCC (SEQ ID NO: 92)
TB21-F and TB21-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB24: Primers used for cloning of TB24:
TB24-F: GGCTGAGACTC ATG GCC GAC ATC GAT GGT G (SEQ ID NO: 93)
TB24-R: CGTACCATGG TCA TGA CGA CAC CCC CTC GTG (SEQ ID NO: 94)
TB24-F and TB24-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB32A: Primers used for cloning of TB32A:
TB32A-F: GGCTGAGACTC ATG GCT GAA GTA CTG GTG C (SEQ ID NO: 95)
TB32A-R: CGTACCATGGCTA GCC GGC GAC CGC CGG TTC (SEQ ID NO: 96)
TB32A-F and TB32A-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB14: Primers used for cloning of TB14:
TB14-F: 5'-GTG ACC GAA CGG ACT CTG GT-3' (SEQ ID NO: 97)
TB14-R: 5'-CTA GGC GCC GGG AAA CCA GAG-3' (SEQ ID NO: 98)
TB18: Primers used for cloning of TB18:
TB18-F: 5'-ATG ACG GAT ACT CAA GTC ACC TG-3' (SEQ ID NO: 99)
TB18-R: 5'-GGA GTG GTA CGG CTC GGC GC-3' (SEQ ID NO: 100)
TB27: Primers used for cloning of TB27:
TB27-F: 5'-ATG ACG TAC GAA ACC ATC CT-3' (SEQ ID NO: 101)
TB27-R: 5'-TCA TCG GTG GGT GAA CTG GGG-3' (SEQ ID NO: 102)
TB33: Primers used for cloning of TB33:
TB33-F: 5'-ATG CCG CTT CCC GCA GAC CCT AG-3' (SEQ ID NO: 103)
TB33-R: 5'-TAC GAC GGG TAC CAC TCC TGG-3' (SEQ ID NO: 104)
TB38: Primers used for cloning of TB38:
TB38-F: 5'-ATG CTG ATC TCA CAG CGC CCC A-3' (SEQ ID NO: 105)
TB38-R: 5'-AAG CTG TTC GGT TTC GGC GTA G-3' (SEQ ID NO: 106)
TB54: Primers used for cloning of TB54:
TB54-F: 5'-ATG ACC GGA AAT TTG GTG AC-3' (SEQ ID NO: 107)
TB54-R: 5'-TCA GTA GCG GTA GTG GTC CGG-3' (SEQ ID NO: 108)
TB14, TB18, TB27, TB33, TB38 and TB54 will be cloned in expression vector pBAD-TOPO (Invitrogen).
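A primer of the kind listed above can be checked in two ways: that it carries the intended restriction site, and that its gene specific part encodes the N-terminal residues determined by sequencing. The sketch below illustrates both checks on the TB13A forward primer, assuming the standard genetic code and the commonly cited recognition sequences of BglII (AGATCT), NcoI (CCATGG) and HindIII (AAGCTT); all function names are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
RESTRICTION_SITES = {"BglII": "AGATCT", "NcoI": "CCATGG", "HindIII": "AAGCTT"}


def clean(primer):
    """Remove spacing from a primer written in codon triplets."""
    return "".join(primer.split()).upper()


def sites_in(primer):
    """Names of the listed restriction sites present in the primer."""
    seq = clean(primer)
    return sorted(name for name, site in RESTRICTION_SITES.items() if site in seq)


def translate(dna):
    """Translate complete codons of a DNA string, reading from its first base."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


tb13a_f = "CTG AGA TCT CCT GTC ACT CAG GAA GAA"
found_sites = sites_in(tb13a_f)
# The gene specific part follows the 9-base 5' extension (CTG AGA TCT).
encoded = translate(clean(tb13a_f)[9:])
matches_n_terminus = "PVTQEEIIAGIAEII".startswith(encoded)
```

For this primer the gene specific part translates to PVTQEE, which agrees with the first six residues of the TB13A N-terminal sequence (SEQ ID NO: 50).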
Example 5a: Cloning of the genes encoding TB12.5, TB20.6, and TB40.8
The genes encoding TB12.5, TB20.6, and TB40.8 were all cloned into the E. coli expression vector pMCT3 as described in Example 5.
For cloning of the individual genes, the following gene specific primers were used:
TB12.5: Primers used for cloning of TB12.5:
TB12.5-F: CTG AGA TCT ATG GCA CTC AAG GTA GAG (SEQ ID NO: 83)
TB12.5-R: CTC CCA TGG TTA TTG ACC CGC CAC GCA (SEQ ID NO: 84)
TB12.5-F and TB12.5-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB20.6: Primers used for cloning of TB20.6:
TB20.6-F: CTG AGA TCT ATG GCC GAC GCT GAC ACC (SEQ ID NO: 85)
TB20.6-R: CTC CCA TGG CTA GTC GCG GAG CAC AAC (SEQ ID NO: 86)
TB20.6-F and TB20.6-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
TB40.8: Primers used for cloning of TB40.8:
TB40.8-F: CTG AGA TCT ATG AGC AAG ACG GTT CTC (SEQ ID NO: 87)
TB40.8-R: CTC CCA TGG TCA CGT CTT CCA GCG GGT (SEQ ID NO: 88)
TB40.8-F and TB40.8-R create BglII and NcoI sites, respectively, used for the cloning in pMCT3.
Expression/purification of recombinant proteins was performed as described in Example 5.
EXAMPLE 6: Evaluation of immunological activity of identified somatic proteins.
Each of the proteins identified in either the cell wall, cytosol or the cell membrane derived from M. tuberculosis will be evaluated for immunological recognition in M. tuberculosis infected animals or in TB patients.
IFN-γ induction in the mouse model of TB infection
The recognition of an antigen by IFN-γ producing T cells in M. tuberculosis infected animals or in TB patients is presently believed to be the most relevant correlate of protective immunity.
We will therefore evaluate the ability of the polypeptides of the invention to induce IFN-γ production in mice of four different haplotypes during a primary infection: 8-12 weeks old female C57BL/6j (H-2b), CBA/J (H-2k), DBA.2 (H-2d) and A.SW (H-2g) mice (Bomholtgaard, Ry, Denmark) will be infected i.v. via the lateral tail vein with an inoculum of 5 x 10^4 M. tuberculosis suspended in PBS in a vol. of 0.1 ml. 14 days postinfection the animals will be sacrificed and spleen cells isolated and tested for proliferation and IFN-γ release in response to stimulation with the recombinantly produced proteins.
As a specific model we will analyse the recognition of the purified polypeptides of the invention in the mouse model of memory immunity to TB: A group of efficiently protected mice will be generated by infecting 8-12 weeks old female C57BL/6j mice with 5 x 10^4 M. tuberculosis i.v. After 30 days of infection the mice will be subjected to 60 days of antibiotic treatment with isoniazid (Merck and Co., Rahway, NJ) and rifabutin (Farmitalia Carlo Erba, Milano, Italy) and then left for 200-240 days to ensure the establishment of resting long-term memory immunity. Such memory immune mice are very efficiently protected against a secondary infection (Orme; Andersen, Boom 1993, J. Infect. Dis. 167: 1497). Long lasting immunity in this model is mediated by a population of highly reactive CD4 cells recruited to the site of infection and triggered to produce large amounts of IFN-γ in response to M. tuberculosis antigens.
This model will be used to identify single antigens recognised by protective T cells.
Memory immune mice will be reinfected with 1 x 10^6 M. tuberculosis i.v. and splenic lymphocytes harvested at day 4-6 of reinfection, and proliferation and the amount of IFN-γ produced in response to any of the recombinantly produced proteins will be evaluated.
IFN-γ induction in humans during infection with virulent Mycobacteria.
IFN-γ is currently believed to be the best marker of protective immunity in humans. In patients with limited tuberculosis, high levels of IFN-γ can be induced, in contrast to patients with severe TB who often respond with low levels of IFN-γ (Boesen et al (1995), Human T-cell response to secreted antigen fractions of M. tuberculosis. Infection and Immunity 63(4):1491-1497). Furthermore, IFN-γ release has been shown to correlate inversely with the severity of disease as determined by X-ray findings (Sodhi A, et al (1997) Clinical correlates of IFN-gamma production in patients with Tuberculosis, Clinical Infectious disease. 25; 617-620). Healthy exposed contacts of sputum positive TB patients also produce very high levels of IFN-γ in response to mycobacterial antigens (unpublished, manus in prep) indicative of early, subclinical infection.
Together these findings indicate that those individuals who are relatively protected (i.e. minimal TB patients) respond with high levels of IFN-γ. The ability of the polypeptides to induce IFN-γ release in cultures of PBMC or whole blood from 20 PPD responsive patients with microscopy or culture proven TB (0-6 months after diagnosis), exposed household contacts, or BCG vaccinated individuals from different geographical regions will be evaluated. Evaluation of donors from different geographical regions will enable us to take into account the influence of e.g. exposure to virulent Mycobacterium or NTM (Non-Tuberculous Mycobacteria) and different genetic background. The most important selection criterion for vaccine candidates is recognition of the polypeptide by >30% of the donors with a level of IFN-γ >30% of that induced by a crude antigen preparation like ST-CF, PPD and SPE.
Cultures will be established with 1 to 2 x 10^5 PBMC in 200 µl in microtiter plates (Nunc, Roskilde, Denmark) or with 1 ml of serum or plasma stimulated with the identified polypeptide and the IFN-γ release measured by ELISA.
Polypeptides of the invention frequently recognised will be preferred.
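The selection criterion for vaccine candidates stated above (recognition by more than 30% of donors at more than 30% of the response to a crude antigen preparation) can be sketched as follows. The function names and the donor values in the example are hypothetical and serve only to illustrate the calculation.

```python
def fraction_recognising(antigen_ifn, reference_ifn, relative_cutoff=0.30):
    """Fraction of donors whose IFN-gamma response to the polypeptide exceeds
    relative_cutoff times their response to the crude reference antigen."""
    pairs = list(zip(antigen_ifn, reference_ifn))
    if not pairs:
        raise ValueError("no donors supplied")
    recognising = sum(1 for ag, ref in pairs if ag > relative_cutoff * ref)
    return recognising / len(pairs)


def meets_vaccine_criterion(antigen_ifn, reference_ifn,
                            donor_cutoff=0.30, relative_cutoff=0.30):
    """True when more than donor_cutoff of the donors recognise the polypeptide."""
    fraction = fraction_recognising(antigen_ifn, reference_ifn, relative_cutoff)
    return fraction > donor_cutoff


# Hypothetical IFN-gamma values (pg/ml) for four donors.
polypeptide = [400, 100, 900, 50]
crude_prep = [1000, 1000, 2000, 1000]
fraction = fraction_recognising(polypeptide, crude_prep)
selected = meets_vaccine_criterion(polypeptide, crude_prep)
```

In this hypothetical panel two of four donors exceed 30% of their reference response, so the polypeptide would pass the criterion.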
The use of polypeptides as diagnostic reagents:
A polypeptide has diagnostic potential in humans when it induces significantly higher responses in patients with microscopy or culture positive tuberculosis compared to PPD positive or PPD negative individuals with no known history of TB infection or exposure to M. tuberculosis but who may or may not have received a prior BCG vaccination, have been exposed to non-tuberculous mycobacteria (NTM), or be actively infected with M. avium. To identify polypeptides capable of discriminating between the above mentioned groups, the level of response and the frequency of positive responders to the polypeptide are compared. By positive responders are meant i) in vitro IFN-γ release by PBMC or whole blood stimulated with the polypeptide of at least 3-500 pg/ml above background or another cut off relating to the specific test kit used, ii) reactivity by human serum or plasma from TB patients with the polypeptide using conventional antibody ELISA/Western blot or iii) in vivo delayed type hypersensitivity response to the polypeptide which is at least 5 mm higher than the response induced by a control material.
The diagnostic potential of polypeptides will initially be evaluated in 10 individuals with TB infection and 10 individuals with no known exposure to virulent Mycobacteria.
High specificity, >80%, will be the most important selection criterion for these polypeptides; a sensitivity >80% is desirable but a sensitivity >30% is acceptable, as combinations of several specific antigens may be preferred in a cocktail of diagnostic reagents recognised by different individuals.
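The specificity and sensitivity thresholds above can be evaluated from the number of positive responders in each group. A minimal sketch, assuming sensitivity is the fraction of TB patients who respond and specificity is the fraction of unexposed controls who do not; the responder counts in the example are hypothetical.

```python
def sensitivity(positive_tb, total_tb):
    """Fraction of TB patients classified as positive responders."""
    return positive_tb / total_tb


def specificity(positive_controls, total_controls):
    """Fraction of unexposed controls classified as non-responders."""
    return (total_controls - positive_controls) / total_controls


def acceptable_diagnostic(sens, spec, min_spec=0.80, min_sens=0.30):
    """Apply the thresholds stated in the text: specificity >80%, sensitivity >30%."""
    return spec > min_spec and sens > min_sens


# Hypothetical outcome in the initial panel of 10 TB patients and 10 controls.
sens = sensitivity(positive_tb=4, total_tb=10)
spec = specificity(positive_controls=1, total_controls=10)
accepted = acceptable_diagnostic(sens, spec)
```

A polypeptide with these hypothetical figures would be acceptable as one component of a cocktail, although its sensitivity is below the desirable 80%.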
Skin test reaction in TB infected guinea pigs
To identify polypeptides as antigens with potential as TB diagnostic reagents, the ability of the proteins to induce a skin test response will be evaluated in the guinea pig model where groups of guinea pigs have been infected with either M. tuberculosis or M. avium or vaccinated with BCG.
To evaluate the response in M. tuberculosis infected guinea pigs, female outbred guinea pigs will be infected via an ear vein with 1 x 10^4 CFU of M. tuberculosis H37Rv in 0.2 ml of PBS or aerosol infected (in an exposure chamber of a Middlebrook Aerosol Generation device) with 1 x 10^5 CFU/ml of M. tuberculosis Erdman, giving rise to 10-15 granulomas per animal in the lung. After 4 weeks a skin test will be performed with the polypeptides diluted in 0.1 ml of PBS, and 24 hours after the injection the reaction diameter is measured.
To evaluate the response in M. avium infected guinea pigs, female outbred guinea pigs will be infected intradermally with 2 x 10^6 CFU of a clinical isolate of M. avium (Atyp.1443; Statens Serum Institut, Denmark). Skin tests are performed 4 weeks later with the polypeptides diluted in 0.1 ml of PBS, and 24 hours after the injection the reaction diameter is measured.
To evaluate the response in BCG vaccinated guinea pigs, female outbred guinea pigs will be sensitized intradermally with 2 x 10^6 CFU of BCG (BCG Danish 1331; Statens Serum Institut). Skin tests are performed 4 weeks later with the polypeptides diluted in 0.1 ml of PBS, and 24 hours after the injection the reaction diameter is measured.
If a polypeptide induces a significant reaction in animals infected with M. tuberculosis but not in BCG vaccinated guinea pigs, this polypeptide may have potential as a diagnostic reagent to differentiate between BCG vaccinated and M. tuberculosis infected individuals, which will thereafter be evaluated in the human population.
If a polypeptide induces a reaction in M. tuberculosis infected guinea pigs but not in guinea pigs infected with M. avium, this polypeptide may have potential as a diagnostic reagent to differentiate between an individual infected with M. tuberculosis and an individual infected with Mycobacteria not belonging to the tuberculosis complex.
The polypeptide may also have potential as a diagnostic reagent to differentiate between an M. avium and an M. tuberculosis infected individual.
Induction of protective immunity by the recombinant proteins in the mouse model.
The recombinant polypeptides will be evaluated as immunological compositions in mice.
Female C57BL/6j mice, 6-8 weeks old (Bomholtgaard, Denmark), will be immunised subcutaneously at the base of the tail with the recombinantly produced polypeptides with DDA as adjuvant. The mice will be vaccinated with a volume of 0.2 ml a total of three times with two weeks' interval between each immunisation. One week after the last immunisation the mice will be bled and the blood cells isolated. The immune response induced will be monitored by release of IFN-γ into the culture supernatant when stimulated in vitro with the homologous proteins.
6 weeks after the last immunisation the mice will be aerosol challenged with 5.5 ml of 5 x 10^6 viable M. tuberculosis/ml. After 6 weeks of infection the mice will be killed and the number of viable bacteria in lung and spleen determined by plating serial 3-fold dilutions of organ homogenates on 7H11 plates. Colonies will be counted after 2-3 weeks of incubation and the level of protection induced by each single polypeptide will be determined.
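The conversion from colony counts on serial 3-fold dilution plates to bacterial load, and from bacterial load to a level of protection, is not spelled out in the text. The sketch below shows one common way to do this and is an assumption throughout: the plated and homogenate volumes, the function names, and the expression of protection as a log10 reduction relative to unvaccinated controls are all illustrative.

```python
import math


def cfu_per_organ(colonies, dilution_step, plated_volume_ml,
                  homogenate_volume_ml, fold=3):
    """Back-calculate CFU per organ from one countable plate.

    dilution_step is the number of serial `fold`-fold dilutions applied to the
    homogenate before plating (0 = undiluted homogenate).
    """
    dilution_factor = fold ** dilution_step
    cfu_per_ml = colonies * dilution_factor / plated_volume_ml
    return cfu_per_ml * homogenate_volume_ml


def log10_protection(control_cfu, vaccinated_cfu):
    """Reduction in bacterial load, in log10 units, relative to controls."""
    return math.log10(control_cfu) - math.log10(vaccinated_cfu)


# Hypothetical plate: 50 colonies at the 4th 3-fold dilution, 0.1 ml plated,
# organ homogenised in 3 ml.
load = cfu_per_organ(colonies=50, dilution_step=4,
                     plated_volume_ml=0.1, homogenate_volume_ml=3.0)
protection = log10_protection(control_cfu=1e6, vaccinated_cfu=1e5)
```

With these hypothetical numbers the plate corresponds to about 1.2 x 10^5 CFU per organ, and a tenfold lower load in vaccinated mice corresponds to one log10 unit of protection.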
Example 6a: Interferon-γ induction in human TB patients and BCG vaccinated
Human donors: PBMC were obtained from healthy BCG vaccinated donors with no known exposure to M. tuberculosis and from patients with culture or microscopy proven infection with TB. Blood samples were drawn from the TB patients 0-6 months after diagnosis of tuberculosis, and 20 months to 40 years after BCG vaccination.
Lymphocyte preparations and cell culture: PBMC were freshly isolated by gradient centrifugation of heparinized blood on Lymphoprep (Nycomed, Oslo, Norway) and stored in liquid nitrogen until use. The cells were resuspended in complete RPMI 1640 medium (Gibco, Grand Island, N.Y.) supplemented with 1% penicillin/streptomycin (Gibco BRL, Life Technologies), 1% non-essential amino acids (FLOW, ICN Biomedicals, CA, USA), and 10% normal human ABO serum (NHS) from the local blood bank. The number and the viability of the cells were determined by Nigrosin staining. Cultures were established with 1.25 x 10^5 PBMCs in 100 µl in microtitre plates (Nunc, Roskilde, Denmark) and stimulated with ST-CF (5 µg/ml), TB13A, TB15A, TB17, TB18, TB33, TB11B, TB16A, TB16, TB32, and TB51 in a final concentration of 10 µg/ml. No antigen and phytohaemagglutinin (PHA) were used as negative and positive control, respectively.
Supernatants for the detection of cytokines were harvested after 5 days of culture, pooled, and stored at -80°C until used.
Cytokine analysis: Interferon-γ (IFN-γ) was detected with a standard sandwich ELISA technique using a commercially available pair of monoclonal antibodies (Endogen) used according to the manufacturer's instructions. Recombinant IFN-γ (Endogen) was used as a standard. All data are means of duplicate wells and the variation between wells did not exceed 10% of the mean. Cytokine levels below 50 pg/ml were considered negative.
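The handling of duplicate wells and the 50 pg/ml negative cut-off described above can be sketched as follows. How the variation between wells was computed is not stated; the sketch assumes it is the deviation of each well from the duplicate mean, and the function name and returned keys are illustrative.

```python
def summarise_duplicates(well_a, well_b, negative_below=50.0, max_variation=0.10):
    """Summarise one pair of duplicate ELISA wells (values in pg IFN-gamma/ml).

    Variation is taken as the deviation of each well from the duplicate mean
    (an assumption made for this sketch).
    """
    mean = (well_a + well_b) / 2
    deviation = abs(well_a - well_b) / 2
    within_limit = mean == 0 or deviation <= max_variation * mean
    return {
        "mean": mean,
        "within_variation_limit": within_limit,
        "positive": mean >= negative_below,
    }


# Hypothetical duplicate readings.
strong = summarise_duplicates(1000.0, 1100.0)
weak = summarise_duplicates(20.0, 30.0)
```

In the hypothetical readings, the first pair gives a positive mean of 1050 pg/ml within the variation limit, and the second pair falls below the 50 pg/ml cut-off and is scored negative.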
Responses of 10 individual donors are shown in TABLE 3.
As shown in Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, and Table 12, a marked release of IFN-γ is observed after stimulation with some of the recombinant proteins. For 50% of the donors, stimulation with TB18, TB32, and TB51 gives rise to high IFN-γ responses (> 1,000 pg/ml). Less than 1/3 of the donors recognised TB15A and TB11B at this level. Between 30 and 70% of the donors show an intermediate IFN-γ response (> 500 pg/ml) when stimulated with TB17 and TB16A, whereas only a limited response was obtained with TB13A, TB33, and TB16. However, TB13A, TB33 and TB16 may still be of immunological importance and meet some of the other properties of the present invention, e.g. as demonstrated for TB33, which is recognised by a pool of sera from human TB patients.
Table 3. Stimulation of PBMCs from 6 healthy BCG vaccinated and 4 TB patients with recombinant TB13A. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB13A (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB13A (10 µg/ml)
2   51   10058   64$9   0

Table 4. Stimulation of PBMCs from 6 healthy BCG vaccinated and 5 TB patients with recombinant TB15A. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB15A (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB15A (10 µg/ml)

Table 5. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB17. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB17 (10 µg/ml)

Table 6. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with recombinant TB18. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB18 (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB18 (10 µg/ml)

Table 7. Stimulation of PBMCs from 5 healthy BCG vaccinated and 6 TB patients with recombinant TB33. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB33 (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB33 (10 µg/ml)

Table 8. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with recombinant TB11B. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB11B (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB11B (10 µg/ml)

Table 9. Stimulation of PBMCs from 2 healthy BCG vaccinated and 5 TB patients with recombinant TB16A. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB16A (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB16A (10 µg/ml)

Table 10. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB16. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB16 (10 µg/ml)

Table 11. Stimulation of PBMCs from 3 healthy BCG vaccinated and 3 TB patients with recombinant TB32. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB32 (10 µg/ml)
TB patients
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB32 (10 µg/ml)

Table 12. Stimulation of PBMCs from 6 healthy BCG vaccinated with recombinant TB51. Responses to ST-CF and PHA are shown for comparison. Results are given as pg IFN-γ/ml.
BCG vaccinated control donors, no known TB exposure
Donor   No ag   PHA (1 µg/ml)   ST-CF (5 µg/ml)   TB51 (10 µg/ml)
Figure legends:
Figure 1:
Long term protection against TB can be induced by immunisation with dead M. tuberculosis.
Mice received either: three immunisations with 1x10' CFU of dead M. tuberculosis H37Rv (squares); three immunisations with 50 µg of ST-CF (triangles); or one immunisation with 5 x 10^4 CFU of live M. tuberculosis H37Rv (circle), after which the infection was cleared by administration of isoniazid in the drinking water. At 3, 6 and 12 months after the last immunisation the mice received an infection with M. tuberculosis H37Rv and two weeks later the bacterial load and the resistance against TB in the spleens were determined.
Figure 2:
Mice received three immunisations with 50 µg of either of the three vaccines: heat killed H37Rv, SPE or ST-CF, or received a vaccination with BCG. Two weeks after a primary infection the bacterial load in the spleen was used to determine the resistance against TB.
SEQ(JENCE LISTING
<110> Statens Serum Institute <120> TB vaccine and diagnostic based antigens from the M.tuberculosis cell <130> 21868PC1 <160> 108 <170> FastSEQ for Windows Version 3.0 <210> 1 <211> 273 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(270)
<400> 1
gtggag gtcaagatcggt atcacg gacagtccg cgcgagctg gtgttc 48
ValGlu ValLysIleGly IleThr AspSerPro ArgGluLeu ValPhe
tccagt gcgcagacgccc agtgag gtagaagaa ctcgtcagc aacgcg 96
SerSer AlaGlnThrPro SerGlu ValGluGlu LeuValSer AsnAla
ctgcgc gacgactctggt ttgctg accctgacc gacgagcgg ggccgt 144
LeuArg AspAspSerGly LeuLeu ThrLeuThr AspGluArg GlyArg
cgcttc ctaattcacacc gccagg atcgcctat gtcgagatc ggtgtc 192
ArgPhe LeuIleHisThr AlaArg IleAlaTyr ValGluIle GlyVal
gcagac gcccgccgggtg ggcttc ggcgtcggg gtggacgcc gcagct 240
AlaAsp AlaArgArgVal GlyPhe GlyValGly ValAspAla AlaAla
gggtcc gccggaaaggtt gctacg agcgggtaa 273
GlySer AlaGlyLysVal AlaThr SerGly
<210> 2
<211> 90
<212> PRT
<213> M.Tuberculosis
<400> 2
Met Glu Val Lys Ile Gly Ile Thr Asp Ser Pro Arg Glu Leu Val Phe
Ser Ser Ala Gln Thr Pro Ser Glu Val Glu Glu Leu Val Ser Asn Ala
Leu Arg Asp Asp Ser Gly Leu Leu Thr Leu Thr Asp Glu Arg Gly Arg
Arg Phe Leu Ile His Thr Ala Arg Ile Ala Tyr Val Glu Ile Gly Val
Ala Asp Ala Arg Arg Val Gly Phe Gly Val Gly Val Asp Ala Ala Ala
Gly Ser Ala Gly Lys Val Ala Thr Ser Gly
<210> 3
<211> 348
<212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(345)
<400> 3
gtgcct gtc actcaggaa gaaatcatt gccggtatc gccgag atcatc 48
ValPro Val ThrGlnGlu GluIleIle AlaGlyIle AlaGlu IleIle
gaagag gta accggtatc gagccgtcc gagatcacc ccggag aagtcg 96
GluGlu Val ThrGlyIle GluProSer GluIleThr ProGlu LysSer
ttcgtc gac gacctggac atcgactcg ctgtcgatg gtcgag atcgcc 144
PheVal Asp AspLeuAsp IleAspSer LeuSerMet ValGlu IleAla
gtgcag acc gaggacaag tacggcgtc aagatcccc gacgag gacctc 192
ValGln Thr GluAspLys TyrGlyVal LysIlePro AspGlu AspLeu
gccggt ctg cgtaccgtc ggtgacgtt gtcgcctac atccag aagctc 240
AlaGly Leu ArgThrVal GlyAspVal ValAlaTyr IleGln LysLeu
gaggaa gaa aacccggag gcggctcag gcgttgcgc gcgaag attgag 288
GluGlu Glu AsnProGlu AlaAlaGln AlaLeuArg AlaLys IleGlu
tcggag aac cccgatgcc gttgccaac gttcaggcg aggctt gaggcc 336
SerGlu Asn ProAspAla ValAlaAsn ValGlnAla ArgLeu GluAla
gagtcc aag tga 348
GluSer Lys
<210> 4
<211> 115
<212> PRT
<213> M.Tuberculosis
<400> 4
Met Pro Val Thr Gln Glu Glu Ile Ile Ala Gly Ile Ala Glu Ile Ile
Glu Glu Val Thr Gly Ile Glu Pro Ser Glu Ile Thr Pro Glu Lys Ser
Phe Val Asp Asp Leu Asp Ile Asp Ser Leu Ser Met Val Glu Ile Ala
Val Gln Thr Glu Asp Lys Tyr Gly Val Lys Ile Pro Asp Glu Asp Leu
Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala Tyr Ile Gln Lys Leu
Glu Glu Glu Asn Pro Glu Ala Ala Gln Ala Leu Arg Ala Lys Ile Glu
Ser Glu Asn Pro Asp Ala Val Ala Asn Val Gln Ala Arg Leu Glu Ala
Glu Ser Lys
<210> 5
<211> 411
<212> DNA
<213> M.Tuberculosis
<220>
<221> CDS
<222> (1)...(408)
<400> 5
gtgaccgaa cggactctg gtactgatc aagccggat ggcatcgaa agg 48
ValThrGlu ArgThrLeu ValLeuIle LysProAsp GlyIleGlu Arg
cagctgatc ggcgagatc atcagccgc atcgagcgc aaaggcctc acc 96
GlnLeuIle GlyGluIle IleSerArg IleGluArg LysGlyLeu Thr
atcgctgcg ctgcagctc aggaccgtc agcgcggag ttggccagc cag 144
IleAlaAla LeuGlnLeu ArgThrVal SerAlaGlu LeuAlaSer Gln
cactacgcc gaacatgaa ggcaaacca ttctttgga tcgttgctg gag 192
HisTyrAla GluHisGlu GlyLysPro PhePheGly SerLeuLeu Glu
ttcatcacg tcgggtccg gtggtagcg gcgatcgtg gagggaacc cga 240
PheIleThr SerGlyPro ValValAla AlaIleVal GluGlyThr Arg
gccatcgcg gcggttcgc caactcgcc ggcggcacc gacccggtg cag 288
AlaIleAla AlaValArg GlnLeuAla GlyGlyThr AspProVal Gln
gcggcggcg cccggcaca atccggggc gacttcgct ctagagacg cag 336
AlaAlaAla ProGlyThr IleArgGly AspPheAla LeuGluThr Gln
ttcaacctg gtgcacggg tctgattcg gccgaatcc gcgcagcgc gaa 384
PheAsnLeu ValHisGly SerAspSer AlaGluSer AlaGlnArg Glu
atcgcgctc tggtttccc ggcgcctag 411
IleAlaLeu TrpPhePro GlyAla
<210> 6
<211> 136
<212> PRT
<213> M.Tuberculosis
<400> 6
Met Thr Glu Arg Thr Leu Val Leu Ile Lys Pro Asp Gly Ile Glu Arg
Gln Leu Ile Gly Glu Ile Ile Ser Arg Ile Glu Arg Lys Gly Leu Thr
Ile Ala Ala Leu Gln Leu Arg Thr Val Ser Ala Glu Leu Ala Ser Gln
His Tyr Ala Glu His Glu Gly Lys Pro Phe Phe Gly Ser Leu Leu Glu
Phe Ile Thr Ser Gly Pro Val Val Ala Ala Ile Val Glu Gly Thr Arg
Ala Ile Ala Ala Val Arg Gln Leu Ala Gly Gly Thr Asp Pro Val Gln
Ala Ala Ala Pro Gly Thr Ile Arg Gly Asp Phe Ala Leu Glu Thr Gln
Phe Asn Leu Val His Gly Ser Asp Ser Ala Glu Ser Ala Gln Arg Glu
Ile Ala Leu Trp Phe Pro Gly Ala
<210> 7
<211> 441
<212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(438)
<400> 7
atgagcgcc tataagacc gtggtggta ggaaccgac ggttcggac tcg 48
MetSerAla TyrLysThr ValValVal GlyThrAsp GlySerAsp Ser
tcgatgcga gcggtagat cgcgctgcc cagatcgcc ggcgcagac gcc 96
SerMetArg AlaValAsp ArgAlaAla GlnIleAla GlyAlaAsp Ala
aagttgatc atcgcctcg gcataccta cctcagcac gaggacgct cgc 144
LysLeuIle IleAlaSer AlaTyrLeu ProGlnHis GluAspAla Arg
gccgccgac attctgaag gacgaaagc tacaaggtg acgggcacc gcc 192
AlaAlaAsp IleLeuLys AspGluSer TyrLysVal ThrGlyThr Ala
ccgatctac gagatcttg cacgacgcc aaggaacga gcgcacaac gcc 240
ProIleTyr GluIleLeu HisAspAla LysGluArg AlaHisAsn Ala
ggtgcgaaa aacgtcgag gaacggccg atcgtcggc gccccggtc gac 288
GlyAlaLys AsnValGlu GluArgPro IleValGly AlaProVal Asp
gcgttggtg aacctggcc gatgaggag aaggcggac ctgctggtc gtc 336
AlaLeuVal AsnLeuAla AspGluGlu LysAlaAsp LeuLeuVal Val
ggc aat gtc ggt ctg agc acg atc gcg ggt cgg ctg ctc gga tcg gta 384
Gly Asn Val Gly Leu Ser Thr Ile Ala Gly Arg Leu Leu Gly Ser Val
ccg gcc aat gtg tca cgc cgg gcc aag gtc gac gtg ctg atc gtg cac 432
Pro Ala Asn Val Ser Arg Arg Ala Lys Val Asp Val Leu Ile Val His
acc acc tag 441
Thr Thr
<210> 8
<211> 146
<212> PRT
<213> M.Tuberculosis
<400> 8
Met Ser Ala Tyr Lys Thr Val Val Val Gly Thr Asp Gly Ser Asp Ser
Ser Met Arg Ala Val Asp Arg Ala Ala Gln Ile Ala Gly Ala Asp Ala
Lys Leu Ile Ile Ala Ser Ala Tyr Leu Pro Gln His Glu Asp Ala Arg
Ala Ala Asp Ile Leu Lys Asp Glu Ser Tyr Lys Val Thr Gly Thr Ala
Pro Ile Tyr Glu Ile Leu His Asp Ala Lys Glu Arg Ala His Asn Ala
Gly Ala Lys Asn Val Glu Glu Arg Pro Ile Val Gly Ala Pro Val Asp
Ala Leu Val Asn Leu Ala Asp Glu Glu Lys Ala Asp Leu Leu Val Val
Gly Asn Val Gly Leu Ser Thr Ile Ala Gly Arg Leu Leu Gly Ser Val
Pro Ala Asn Val Ser Arg Arg Ala Lys Val Asp Val Leu Ile Val His
Thr Thr
<210> 9
<211> 498
<212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(495) <400> 9 atg gaa cag cgt gcg gag ttg gtg gtt ggc cgg gca ctt gtc gtc gtc 48 Met Glu Gln Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val Val gtt gac gat cgc acg gcg cac ggc gat gaa gac cac agc ggg ccg ctt 96 Val Asp Asp Arg Thr Ala His Gly Asp Glu Asp His Ser Gly Pro Leu gtc acc gag ctg ctc acc gag gcc ggg ttt gtt gtc gac ggc gtg gtg 149 Val Thr Glu Leu Leu Thr Glu Ala Gly Phe Val Val Asp Gly Val Val gcg gtg tcg gcc gac gag gtc gag atc cga aat gcg ctg aac aca gcg 192 Ala Val Ser Ala Asp Glu Val Glu Ile Arg Asn Ala Leu Asn Thr Ala gtg atc ggc ggg gtg gac ctg gtg gtg tcg gtc ggc ggg acc ggg gtg 240 Val Ile Gly Gly Val Asp Leu Val Val Ser Val Gly Gly Thr Gly Val acg cctcgcgat gtcaccccg gaagccacc cgcgac attctggaccgc 288 Thr ProArgAsp ValThrPro GluAlaThr ArgAsp IleLeuAspArg gag atcctcggt atcgccgag gccatccgc gcgtcc gggctgtccgcg 336 Glu IleLeuGly IleAlaGlu AlaIleArg AlaSer GlyLeuSerAla gga atcgtcgac gccgggttg tcgcgcggc ctggcg ggtgtctccggc 384 Gly IleValAsp AlaGlyLeu SerArgGly LeuAl.aGlyValSerGly agc acgctggtg gtcaacctc gcgggttcg cgttat gcggtgcgcgat 432 Ser ThrLeuVal ValAsnLeu AlaGlySer ArgTyr AlaValArgAsp gga atggcgacg ctgaatccg ctagcggca cagat:catcgggcagttg 480 Gly MetAlaThr LeuAsnPro LeuAlaAla GlnIle IleGlyGlnLeu tcg agcttggag atctga 498 Ser SerLeuGlu Ile <210> 10 <211> 165 <212> PRT
<213> M.Tuberculosis <400> 10 Met Glu Gln Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val Val Val Asp Asp Arg Thr Ala His Gly Asp Glu Asp H:is Ser Gly Pro Leu Val Thr Glu Leu Leu Thr Glu Ala Gly Phe Val Val Asp Gly Val Val Ala Val Ser Ala Asp Glu Val Glu Ile Arg Asn Ala Leu Asn Thr Ala Val Ile Gly Gly Val Asp Leu Val Val Ser Val Gly Gly Thr Gly Val Thr Pro Arg Asp Val Thr Pro Glu Ala Thr Arg Asp Ile Leu Asp Arg Glu Ile Leu Gly Ile Ala Glu Ala Ile Arg Ala Ser Gly Leu Ser Ala Gly Ile Val Asp Ala Gly Leu Ser Arg Gly Leu Ala Gly Val Ser Gly Ser Thr Leu Val Val Asn Leu Ala Gly Ser Arg Tyr Ala Val Arg Asp Gly Met Ala Thr Leu Asn Pro Leu Ala Ala Gln Ile Ile Gly Gln Leu SerSer Glu Ile Leu <210> 11 <211> 495 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(492)
<400> 11
atgacg gat act caagtcacc tggttgacc caagagtca catgac cga 48
MetThr Asp Thr GlnValThr TrpLeuThr GlnGluSer HisAsp Arg
ctcaaa gca gag ctcgaccag ctgattgcg aatcgcccg gtcatc gcc 96
LeuLys Ala Glu LeuAspGln LeuIleAla AsnArgPro ValIle Ala
gccgaa atc aac gaccgccgc gaagaaggc gacctgcgc gagaac ggc 144
AlaGlu Ile Asn AspArgArg GluGluGly AspLeuArg GluAsn Gly
ggatac cac gcc gcccgcgag gagcagggc cagcaggag gcccgc att 192
GlyTyr His Ala AlaArgGlu GluGlnGly GlnGlnGlu AlaArg Ile
cgccag ctg cag gacttgctc agcaacgca aaggttggc gaggca ccc 240
ArgGln Leu Gln AspLeuLeu SerAsnAla LysValGly GluAla Pro
aagcaa tcc ggc gtcgcatta cccggttct gtggtcaag gtgtac tac 288
LysGln Ser Gly ValAlaLeu ProGlySer ValValLys ValTyr Tyr
aacggc gac aag tcggacagc gaaacgttc ctcatcgcc acccgc cag 336
AsnGly Asp Lys SerAspSer GluThrPhe LeuIleAla ThrArg Gln
gagggc gtc agc gacggcaag ctcgaggtc tactcgccg aattca ccg 384
GluGly Val Ser AspGlyLys LeuGluVal TyrSerPro AsnSer Pro
ctcggt ggg gcc ctgatcgac gccaaggtc ggcgagacc cgcagc tac 432
LeuGly Gly Ala LeuIleAsp AlaLysVal GlyGluThr ArgSer Tyr
acggtg ccc aac ggcagcacc gtgtcggtg accctagtc agcgcc gag 480
ThrVal Pro Asn GlySerThr ValSerVal ThrLeuVal SerAla Glu
ccgtac cac tcc tag 495
ProTyr His Ser
<210> 12
<211> 164
<212> PRT
<213> M.Tuberculosis
<400> 12
MetThr AspThrGln ValThrTrp LeuThrGln GluSerHis AspArg
LeuLys AlaGluLeu AspGlnLeu IleAlaAsn ArgProVal IleAla
AlaGlu IleAsnAsp ArgArgGlu GluGlyAsp LeuArgGlu AsnGly
GlyTyr HisAlaAla ArgGluGlu GlnGlyGln GlnGluAla ArgIle
ArgGln LeuGlnAsp LeuLeuSer AsnAlaLys ValGlyGlu AlaPro
LysGln SerGlyVal AlaLeuPro GlySerVal ValLysVal TyrTyr
AsnGly AspLysSer AspSerGlu ThrPheLeu IleAlaThr ArgGln
GluGly ValSerAsp GlyLysLeu GluValTyr SerProAsn SerPro
LeuGly GlyAlaLeu IleAspAla LysValGly GluThrArg SerTyr
ThrVal ProAsnGly SerThrVal SerValThr LeuValSer AlaGlu
ProTyr HisSer
<210> 13
<211> 558
<212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(555) <400> 13 atgatt gatgagget ctcttcgac gccgaa gagaaaatg gagaagget 48 MetIle AspGluAla LeuPheAsp AlaGlu GluLysMet GluLysAla gtggcg gtggcacgt gacgacctg tcaact atccgtacc ggccgcgcc 96 ValAla ValAlaArg AspAspLeu SerThr IleArgThr GlyArgAla aaccct ggcatgttc tctcggatc accatc gactactac ggtgcggcc 144 AsnPro GlyMetPhe SerArgIle ThrIle AspTyrTyr GlyAlaAla accccg atcacgcaa ctggccagc atcaat gtccccgag gcgcggcta 192 ThrPro IleThrGln LeuAlaSer IleAsn ValProGlu AlaArgLeu gtcgtg ataaagccg tatgaagcc aatcag ttgcgcget atcgagact 240 ValVal IleLysFro TyrGluAla AsnGln LeuArgAla IleGluThr gcaatt cgcaactcc gaccttgga gtgaat cccaccaac gacggcgcc 288 AlaIle ArgAsnSer AspLeuGly ValAsn ProThrAsn AspGlyAla cttatt cgcgtggcc gtaccgcag ctcacc gaagaacgt cggcgagag 336 Leu Ile Arg Val Ala Val Pro Gln Leu Thr Glu Glu Arg Arg Arg Glu ctggtcaaacag gcaaagcat aagggggag gaggccaag gtttcg gtg 384 LeuValLysGln AlaLysHis LysGlyGlu GluAlaLys ValSer Val cgtaatatccgt cgcaaagcg atggaggaa ctccatcgc atccgt aag 432 ArgAsnIleArg ArgLysAla MetGluGlu LeuHisArg IleArg Lys gaaggcgaggcc ggcgaggat gaggtcggt cgcgcagaa aaggat ctc 980 GluGlyGluAla GlyGluAsp GluValGly ArgAlaGlu LysAsp Leu gacaagaccacg caccaatac gtcacccaa attgatgag ctggtt aaa 528 AspLysThrThr HisGlnTyr ValThrGln IleAspGlu LeuVal Lys cacaaagaaggc gagctgctg gaggtctag 558 HisLysGluGly GluLeuLeu GluVal <210> 14 <211> 185 <212> PRT
<213> M.Tuberculosis <400> 19 Met Ile Asp Glu A1a Leu Phe Asp Ala Glu Glu Lys Met Glu Lys Ala Val Ala Val Ala Arg Asp Asp Leu Ser Thr Ile Arg Thr Gly Arg Ala Asn Pro Gly Met Phe Ser Arg Ile Thr Ile Asp Tyr Tyr Gly Ala Ala Thr Pro Ile Thr Gln Leu Ala Ser Ile Asn Val Pro Glu Ala Arg Leu Val Val Ile Lys Pro Tyr Glu Ala Asn Gln Leu Arg Ala Ile Glu Thr Ala Ile Arg Asn Ser Asp Leu Gly Val Asn Pro Thr Asn Asp Gly Ala Leu Ile Arg Val Ala Val Pro Gln Leu Thr Glu Glu Arg Arg Arg Glu Leu Val Lys Gln Ala Lys His Lys Gly Glu Glu Ala Lys Val Ser Val Arg Asn Ile Arg Arg Lys Ala Met Glu Glu Leu His Arg Ile Arg Lys Glu Gly Glu Ala Gly Glu Asp Glu Val Gly Arg Ala Glu Lys Asp Leu Asp Lys Thr Thr His Gln Tyr Val Thr Gln Ile Asp Glu Leu Val Lys His Lys Glu Gly Glu Leu Leu Glu Val <210> 15 <211> 651 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(648) <400> 15 atggccgac atcgatggt gtaaccggt tcggcgggt ctgcag cctggg 98 MetAlaAsp IleAspGly ValThrGly SerAlaGly LeuGln ProGly ccgtctgag gagacagac gaggagttg accgcgcgt ttcgag cgcgac 96 ProSerGlu GluThrAsp GluGluLeu ThrAlaArg PheGlu ArgAsp gcgattccc ctgttggac cagctgtac ggcggtgcg ctgcgg atgacg 144 AlaIlePro LeuLeuAsp GlnLeuTyr GlyGlyA1<~LeuArg MetThr cgcaatccg gccgacgcc gaggacttg ctccaggag acgatg gtgaag 192 ArgAsnPro AlaAspAla GluAspLeu LeuGlnGlu ThrMet ValLys gcctatgcg ggatttcgt tcgttccgg cacggtacc aatctc aaggcc 240 AlaTyrAla GlyPheArg SerPheArg HisGlyThr_AsnLeu LysAla tggctctac cggatactg accaacacc tacatcaac agctat cgcaag 288 TrpLeuTyr ArgIleLeu ThrAsnThr TyrIleAsn SerTyr ArgLys aaacagcgg caaccggcg gagtatccg accgagcag atcacc gattgg 336 LysGlnArg GlnProAla GluTyrPro ThrGluG1I1IleThr AspTrp caactggcg tccaacgcc gagcattcc tcgaccggc3ctgcgc tcgget 389 GlnLeuAla SerAsnAla GluHisSer SerThrGly LeuArg SerAla gaagtcgaa gcgttagaa gcgttgccg gacaccgag atcaaa gaggcg 432 GluValGlu AlaLeuGlu AlaLeuPro AspThrGlu IleLys GluAla ctgcaggca ttgccggaa gagttccgg atggcggtc tactac gccgat 480 LeuGlnAla LeuProGlu GluPheArg MetAlaVal TyrTyr AlaAsp gtcgaaggt ttcccctac aaggagatc gccgagatc atggat actccg 528 ValGluGly PheProTyr LysGluIle AlaGluIllsMetAsp ThrPro atcggcacc gtgatgtcg aggcttcat cgcggccga cgtcag ttgcgc 576 IleGlyThr ValMetSer ArgLeuHis ArgGlyArg ArgGln LeuArg ggtctttta gccgatgtg gccagggat cgggggttt gccagg ggcgag 624 GlyLeuLeu AlaAspVal AlaArgAsp ArgGlyPhe AlaArg GlyGlu caggcgcac gagggggtg tcgtcatga 651 GlnAlaHis GluGlyVal SerSer <210> 16 <211> 216 <212> PRT
<213> M.Tuberculosis <900> 16 Met Ala Asp Ile Asp Gly Val Thr Gly Ser Ala Gly Leu Gln Pro Giy Pro Ser Glu Glu Thr Asp Glu Glu Leu Thr Ala Arg Phe Glu Arg Asp Ala Ile Pro Leu Leu Asp Gln Leu Tyr Gly Gly Ala Leu Arg Met Thr Arg Asn Pro Ala Asp Ala Glu Asp Leu Leu Gln Glu Thr Met Val Lys Ala Tyr Ala Gly Phe Arg Ser Phe Arg His Gly Thr Asn Leu Lys Ala Trp Leu Tyr Arg Ile Leu Thr Asn Thr Tyr Ile Asn Ser Tyr Arg Lys Lys Gln Arg Gln Pro Ala Glu Tyr Pro Thr Glu Gln Ile Thr Asp Trp Gln Leu Ala Ser Asn Ala Glu His Ser Ser Thr Gly Leu Arg Ser Ala Glu Val Glu Ala Leu Glu Ala Leu Pro Asp Thr Glu Ile Lys Glu Ala Leu Gln Ala Leu Pro Glu Glu Phe Arg Met Ala Val Tyr Tyr Ala Asp Val Glu Gly Phe Pro Tyr Lys Glu Ile Ala Glu Ile Met Asp Thr Pro Ile Gly Thr Val Met Ser Arg Leu His Arg Gly Arg Arg Gln Leu Arg Gly Leu Leu Ala Asp Val Ala Arg Asp Arg Gly Phe Ala Arg Gly Glu Gln Ala His Glu Gly Val Ser Ser <210> 17 <211> 779 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(771) <900> 17 atg acg tac gaa acc atc ctg gtc gag cgc gat cag cga gtt ggc att 48 Met Thr Tyr Glu Thr Ile Leu Val Glu Arg Asp Gln Arg Val Gly Ile atc acg ctg aac cgt ccc cag gca ctg aac gcg ctc aac agc cag gtg 96 Ile Thr Leu Asn Arg Pro Gln Ala Leu Asn Ala Leu Asn Ser Gln Val atg aac gag gtc acc agc get gca acc gaa ctg gac gat gac ccg gac 144 Met Asn Glu Val Thr Ser Ala Ala Thr Glu Leu Asp Asp Asp Pro Asp att ggg gcg atc atc atc acc ggt tcg gcc aaa gcg ttt gcc gcc gga 192 Ile Gly Ala Ile Ile Ile Thr Gly Ser Ala Lys Ala Phe Ala Ala Gly gccgacatc aaagaaatg gccgacctg acgttcgcc gacgcgttc acc 290 AlaAspIle LysGluMet AlaAspLeu ThrPheAla AspAlaPhe Thr gccgacttc ttcgccacc tggggcaag ctggccgcc gtgcgcacc ccg 288 AlaAspPhe PheAlaThr TrpGlyLys LeuAlaAla ValArgThr Pro acgatcgcc gcggtggcg ggatacgcg ctcggcggt ggctgcgag ctg 336 ThrIleAla AlaValAla GlyTyrAla LeuGlyGly GlyCysGlu Leu gcgatgatg tgcgacgtg ctgatcgcc gccgacacc gcgaagttc gga 384 AlaMetMet CysAspVal LeuIleAla AlaAspThr AlaLysPhe Gly cagcccgag ataaagctg ggcgtgctg ccaggcatg ggcggctcc cag 432 GlnProGlu IleLysLeu GlyValLeu ProGlyMet GlyGlySer Gln cggctgacccgg getatc ggcaagget aaggcgatg gacctcatc ctg 480 ArgLeuThrArg AlaIle GlyLysAla LysAlaMet AspLeuIle Leu accgggcgcacc atggac gccgccgag gccgagcgc:agcggtctg gtt 528 ThrGlyArgThr MetAsp AlaAlaGlu AlaGluArg SerGlyLeu Val tcacgggtggtg ccggcc gacgacttg ctgaccgaa gccagggcc act 576 SerArgValVal ProAla AspAspLeu LeuThrGlu AlaArgAla Thr gccacgaccatt tcgcag atgtcggcc tcggcggcc:cggatggcc aag 629 AlaThrThrIle SerGln MetSerAla SerAlaAla ArgMetAla Lys gaggcc aaccggget ttcgaatcc agtttgtcc:gaggggctg ctc 672 gtc GluAla AsnArgAla PheGluSer SerLeuSer GluGlyLeu Leu Val tacgaa cggcttttc cattcgget ttcgcgacc gaagaccaa tcc 720 cgc TyrGlu ArgLeuPhe HisSerAla PheAlaThr GluAspGln Ser Arg gaaggt gcagcgttc atcgagaaa cgcgetccc cagttcacc cac 768 atg GluGly AlaAlaPhe IleGluLys ArgAlaPro GlnPheThr His Met cgatga 774 Arg <210> 18 <211> 257 <212> PRT
<213> M.Tuberculosis <400> 18 MetThr GluThrIle LeuValGlu ArgAspGln ArgValGly Ile Tyr Ile Thr Leu Asn Arg Pro Gln Ala Leu Asn Ala Leu Asn Ser Gln Val Met Asn Glu Val Thr Ser Ala Ala Thr Glu Leu Asp Asp Asp Pro Asp Ile Gly Ala Ile Ile Ile Thr Gly Ser Ala Lys Ala Phe Ala Ala Gly Ala Asp Ile Lys Glu Met Ala Asp Leu Thr Phe Ala Asp Ala Phe Thr 65 70 75 g0 Ala Asp Phe Phe Ala Thr Trp Gly Lys Leu Ala Ala Val Arg Thr Pro Thr Ile Ala Ala Val Ala Gly Tyr Ala Leu Gly Gly Gly Cys Glu Leu Ala Met Met Cys Asp Val Leu Ile Ala Ala Asp Thr Ala Lys Phe Gly Gln Pro Glu Ile Lys Leu Gly Val Leu Pro Gly Met: Gly Gly Ser Gln Arg Leu Thr Arg Ala Ile Gly Lys Ala Lys Ala Met Asp Leu Ile Leu Thr Gly Arg Thr Met Asp Ala Ala Glu Ala Glu Arg Ser Gly Leu Val Ser Arg Val Val Pro Ala Asp Asp Leu Leu Thr Glu Ala Arg Ala Thr Ala Thr Thr Ile Ser Gln Met Ser Ala Ser Ala Ala Arg Met Ala Lys Glu Ala Val Asn Arg Ala Phe Glu Ser Ser Leu Ser Glu Gly Leu Leu Tyr Glu Arg Arg Leu Phe His Ser Ala Phe Ala Thr Glu Asp Gln Ser Glu Gly Met Ala Ala Phe Ile Glu Lys Arg Ala Pro Gln Phe Thr His Arg <210> 19 <211> 894 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(891) <400> 19 gtgccgcttccc gcagaccct agccccacc ttgtcggcc tacgcc cat 48 ValProLeuPro AlaAspPro SerProThr LeuSerAla TyrAla His cccgaacggctc gtgaccgcc gactggttg tcggcacac atgggc gcg 96 ProGluArgLeu ValThrA1a AspTrpLeu SerAlaHis MetGly Ala ccgggcctggcg atcgtcgaa tccgacgag gacgtc_ttg ctctac gac 144 ProGlyLeuAla IleValG1u SerAspGlu AspValLeu LeuTyr Asp gtcggccatatt cccggcgcc gtcaagatc gactggcac accgac ctc 192 ValGlyHisIle ProGlyAla ValLysIle AspTrpHis ThrAsp Leu aac gac cca cgg gtg cgc gac tac atc aac ggc gag cag ttc gcc gaa 290 Asn Asp Pro Arg Val Arg Asp Tyr Ile Asn Gly Glu Gln Phe Ala Glu 65 70 75 g0 ttgatggac cgcaagggc atcgcccgc gatgacacc gtggtg atctat 288 LeuMetAsp ArgLysGly :CleAlaArg AspAspThr ValVal IleTyr ggcgacaag agcaattgg tgggccgcc tatgcgttg tgggtg ttcacg 336 GlyAspLys SerAsnTrp TrpAlaAla TyrAlaLeu TrpVal PheThr ctgttcggt cacgccgac gtgcgactc ctcaacggc ggccgt gacctc 389 LeuPheGly HisAlaAsp ValArgLeu LeuAsnGly GlyArg AspLeu tggctcgcc gagcgccgg gaaaccacc ttggacgtc ccgacc aagacc 432 TrpLeuAla GluArgArg GluThrThr LeuAspVal ProThr LysThr tgcaccggt tatcccgtc gtgcagcgc aacgatgca cccatc cgcgca 980 CysThrGly TyrProVal ValGlnArg AsnAspAla ProIle ArgAla ttcagagac gacgtgctg gccatcctg ggcgetcag ccgctg atcgac 528 PheArgAsp AspValLeu AlaIleLeu GlyAlaGln ProLeu IleAsp gtacgctct cccgaggag tacaccggc aagcgcacc catatg cccgat 576 ValArgSer ProGluGlu T'yrThrGly LysArgThr HisMet ProAsp taccccgag gaaggggcg ctgcgggcc ggtcacatc cccacg gcggtg 629 TyrProGlu GluGlyAla heuArgAla GlyHisIleaProThr AlaVal cacattccg tgggggaag gccgccgac gaaagtgga cggttt cgcagc 672 HisIlePro TrpGlyLys AlaAlaAsp GluSerGly ArgPhe ArgSer cgcgaggaa ttggaacgg ctctatgac ttcataaac ccggacgac caa 720 ArgGluGlu LeuGluArg LeuTyrAsp PheIleAsn ProAspAsp Gln accgtcgtc tattgccgc atcggtgaa cgctccagc:catacctgg ttc 768 ThrValVal TyrCysArg IleGlyGlu ArgSerSer HisThrTrp Phe gtgctcaca cacctgctg ggcaaggca gatgtacgg aactacgac ggc 816 ValLeuThr HisLeuLeu GlyLysAla AspValArg AsnTyrAsp Gly tcgtggacc gagtggggc aacgccgtg 
cgagtgccg atcgtcgcg ggc 864 SerTrpThr GluTrpGly AsnAlaVal ArgValPro IleValAla Gly gaagaacca ggagtggta cccgtcgta tga g9q GluGluPro GlyValVal ProValVal <210> 20 <211> 297 <212> PRT
<213> M.Tuberculosis <400> 20 Met Pro Leu Pro Ala Asp Pro Ser Pro Thr Leu Ser Ala Tyr Ala His Pro Glu Arg Leu Val Thr Ala Asp Trp Leu 5er Ala His Met Gly Ala Pro Gly Leu Ala Ile Val Glu Ser Asp Glu Asp Val Leu Leu Tyr Asp Val Gly His Ile Pro Gly Ala Val Lys Ile Asp Trp His Thr Asp Leu Asn Asp Pro Arg Val Arg Asp Tyr Ile Asn Gly Glu Gln Phe Ala Glu Leu Met Asp Arg Lys Gly Ile Ala Arg Asp Asp Th:r Val Val Ile Tyr Gly Asp Lys Ser Asn Trp Trp Ala Ala Tyr Ala Leu Trp Val Phe Thr Leu Phe Gly His Ala Asp Val Arg Leu Leu Asn Gly Gly Arg Asp Leu Trp Leu Ala Glu Arg Arg Glu Thr Thr Leu Asp Val Pro Thr Lys Thr Cys Thr Gly Tyr Pro Val Val Gln Arg Asn Asp Ala Pro Ile Arg Ala Phe Arg Asp Asp Val Leu Ala Ile Leu Gly Ala Gln Pro Leu Ile Asp Val Arg Ser Pro Glu Glu Tyr Thr Gly Lys Arg Thr His Met Pro Asp Tyr Pro Glu Glu Gly Ala Leu Arg Ala Gly His Ile: Pro Thr Ala Val His Ile Pro Trp Gly Lys Ala Ala Asp Glu Ser Gly Arg Phe Arg Ser Arg Glu Glu Leu Glu Arg Leu Tyr Asp Phe Ile Asn Pro Asp Asp Gln Thr Val Val Tyr Cys Arg Ile Gly Glu Arg Ser Ser His Thr Trp Phe Val Leu Thr His Leu Leu Gly Lys Ala Asp Val Arg Asn Tyr Asp Gly Ser Trp Thr Glu Trp Gly Asn Ala Val Arg Val Pro Ile Val Ala Gly Glu Glu Pro Gly Val Val Pro Val Val <210> 21 <211> 1094 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1041) <400> 21 atg ctg atc tca cag cgc ccc acc ctg tcc gag gac gtc ctc acc gac 48 Met Leu Ile Ser Gln Arg Pro Thr Leu Ser Glu Asp Val Leu Thr Asp aac cga tcc cag ttc gtg atc gaa ccg ctg gag ccg gga ttc ggc tac 96 Asn Arg Ser Gln Phe Val I1e Glu Pro Leu Glu Pro Gly Phe Gly Tyr acc ctg ggc aat tcg ctg cgt cgc acc ctg ctg tcg tcg att ccc gga 199 Thr Leu Gly Asn Ser Leu Arg Arg Thr Leu Leu Ser Ser Ile Pro Gly gcg gcc gtc acc agc att cgc atc gat ggt gta ctg cac gaa ttc acc 192 Ala Ala Val Thr Ser Ile Arg Ile Asp Gly Val Leu His Glu Phe Thr acggtgcccggg gtcaaagaa gatgtcacc gagatc atcctgaat ctc 240 ThrValProGly ValLysGlu AspValThr GluIle IleLeuAsn Leu aagagcctggtg gtgtcctcg gaggaggac gagccg gtcaccatg tac 288 LysSerLeuVal ValSerSer GluGluAsp GluPro ValThrMet Tyr ctacgcaagcag ggtccgggt gaggttacc gccggc gacatcgtg ccg 336 LeuArgLysGln GlyProGly GluValThr AlaGly AspIleVal Pro ccggccggcgtc accgtgcac aaccccggc atgcac atcgccacg ctg 384 ProAlaGlyVal ThrValHis AsnProGly MetHis IleAlaThr Leu aacgataag ggcaagctggaa gtcgag ctcgtcgtc gagcgtggc cgc 432 AsnAspLys GlyLysLeuGlu ValGlu LeuValVal GluArgGly Arg ggctatgtc ccggcggtgcaa aaccgg gettcgggt gccgaaatt ggg 480 GlyTyrVal ProAlaValGln AsnArg AlaSerGly AlaGluIle Gly cgcattcca gtcgattccatc tactca ccggtgctc aaagtgacc tac 528 ArgIlePro ValAspSerIle TyrSer ProValLeu LysValThr Tyr aaggtggac gccacccgggtc gagcag cgcaccgac ttcgacaag ctg 576 LysValAsp AlaThrArgVal GluGln ArgThrAsp PheAspLys Leu atcctggac gtggagaccaag aattca atcagcccg cgcgacgcg ctg 624 IleLeuAsp ValGluThrLys AsnSer IleSerPro ArgAspAla Leu gcgtcgget ggcaagacgctg gtcgag ttgttcggc:ctggcacgg gaa 672 AlaSerAla GlyLysThrLeu ValGlu LeuPheGly LeuAlaArg Glu ctcaacgtc gaggccgaaggc atcgag atcgggccg tcgccggcc gag 720 LeuAsnVal GluAlaGluGly IleGlu IleGlyPro SerProAla Glu gccgatcac attgcgtcattc gccctg ccgatcgac gacctggat ctg 768 AlaAspHis IleAlaSerPhe AlaLeu ProIleAsp AspLeuAsp Leu acggtgcgg tcctacaactgc ctcaag cgcgagggg gtgcacacc gtg 816 ThrValArg SerTyrAsnCys LeuLys ArgGluGly 
ValHisThr Val ggc gaa ctg gtg gcg cgc acc gaa tcc gac ctg ctt gac atc cgc aac 864
Gly Glu Leu Val Ala Arg Thr Glu Ser Asp Leu Leu Asp Ile Arg Asn
ttc ggt cag aag tcc atc gac gag gtg aag atc aag ctg cac cag ctg 912
Phe Gly Gln Lys Ser Ile Asp Glu Val Lys Ile Lys Leu His Gln Leu
ggc ctg tca ctc aag gac agc ccg ccg agc ttc gac ccc tcg gag gtc 960
Gly Leu Ser Leu Lys Asp Ser Pro Pro Ser Phe Asp Pro Ser Glu Val
gcg ggc tac gac gtc gcc acc ggc acc tgg tcg acc gag ggc gcg tac 1008
Ala Gly Tyr Asp Val Ala Thr Gly Thr Trp Ser Thr Glu Gly Ala Tyr
gac gag cag gac tac gcc gaa acc gaa cag ctt tag 1044
Asp Glu Gln Asp Tyr Ala Glu Thr Glu Gln Leu
<210> 22
<211> 347
<212> PRT
<213> M.Tuberculosis
<400> 22
Met Leu Ile Ser Gln Arg Pro Thr Leu Ser Glu Asp Val Leu Thr Asp
Asn Arg Ser Gln Phe Val Ile Glu Pro Leu Glu Pro Gly Phe Gly Tyr
Thr Leu Gly Asn Ser Leu Arg Arg Thr Leu Leu Ser Ser Ile Pro Gly
Ala Ala Val Thr Ser Ile Arg Ile Asp Gly Val Leu His Glu Phe Thr
Thr Val Pro Gly Val Lys Glu Asp Val Thr Glu Ile Ile Leu Asn Leu
Lys Ser Leu Val Val Ser Ser Glu Glu Asp Glu Pro Val Thr Met Tyr
Leu Arg Lys Gln Gly Pro Gly Glu Val Thr Ala Gly Asp Ile Val Pro
Pro Ala Gly Val Thr Val His Asn Pro Gly Met His Ile Ala Thr Leu
Asn Asp Lys Gly Lys Leu Glu Val Glu Leu Val Val Glu Arg Gly Arg
Gly Tyr Val Pro Ala Val Gln Asn Arg Ala Ser Gly Ala Glu Ile Gly
Arg Ile Pro Val Asp Ser Ile Tyr Ser Pro Val Leu Lys Val Thr Tyr
Lys Val Asp Ala Thr Arg Val Glu Gln Arg Thr Asp Phe Asp Lys Leu
Ile Leu Asp Val Glu Thr Lys Asn Ser Ile Ser Pro Arg Asp Ala Leu
Ala Ser Ala Gly Lys Thr Leu Val Glu Leu Phe Gly Leu Ala Arg Glu
Leu Asn Val Glu Ala Glu Gly Ile Glu Ile Gly Pro Ser Pro Ala Glu
Ala Asp His Ile Ala Ser Phe Ala Leu Pro Ile Asp Asp Leu Asp Leu
Thr Val Arg Ser Tyr Asn Cys Leu Lys Arg Glu Gly Val His Thr Val
Gly Glu Leu Val Ala Arg Thr Glu Ser Asp Leu Leu Asp Ile Arg Asn
Phe Gly Gln Lys Ser Ile Asp Glu Val Lys Ile Lys Leu His Gln Leu
Gly Leu Ser Leu Lys Asp Ser Pro Pro Ser Phe Asp Pro Ser Glu Val
Ala Gly Tyr Asp Val Ala Thr Gly Thr Trp Ser Thr Glu Gly Ala Tyr
Asp Glu Gln Asp Tyr Ala Glu Thr Glu Gln Leu
<210> 23
<211> 1488
<212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1485) <400> 23 atgacc aatttggtg accaaaaat tcgctgacc cctgacgtt cgt 48 gga MetThr AsnLeuVal ThrLysAsn SerLeuThr ProAspVal Arg Gly aacggc gactttaag atcgccgac ctgtcacta gcggatttc ggc 96 atc AsnGly AspPheLys IleAlaAsp LeuSerLeu AlaAspPhe Gly Ile cgcaaa ctccggatc gccgagcac gagatgccc ggcctgatg tcg 144 gaa ArgLys LeuArgIle AlaGluHis GluMetPro GlyLeuMet Ser Glu ctgcgg gagtatgcc gaggtgcaa cccctgaag ggggcccgg atc 192 cgc LeuArg GluTyrAla GluValGln ProLeuLys GlyAlaArg Ile Arg tcgggttcg ctgcacatg acggtgcag accgcggtg ttgatcgaa acc 240 SerGlySer LeuHisMet ThrValGln ThrAlaVal LeuIleGlu Thr ctcaccgcg ctgggcgcc gaagtccgc tgggcctcg tgcaacatc ttc 288 LeuThrAla LeuGlyAla GluValArg TrpAlaSer CysAsnIle Phe tccacccag gatcacgcc gccgccgcc gtcgtggtc:ggcccgcac ggc 336 SerThrGln AspHisAla AlaAlaAla ValValVal GlyProHis Gly acccccgac gagcccaag ggtgtcccg gtgttcgcg tggaagggc gag 389 ThrProAsp GluProLys GlyValPro ValPheAla TrpLysGly Glu acgctcgaa gagtactgg tgggccgcc gagcagatg ctcacctgg ccg 932 ThrLeuGlu GluTyrTrp TrpAlaAla GluGlnMet:LeuThrTrp Pro gaccccgac aagccggcc aacatgatc ctcgatgac ggcggtgac gcc 480 AspProAsp LysProAla AsnMetIle LeuAspAsp GlyGlyAsp Ala acc atg ttg gtg ctg cgc ggc atg cag tat gag aag gcc ggc gtg gtg 528
19 ThrMet LeuValLeu ArgGly MetGlnTyrGlu LysAla GlyValVal ccgccc gccgaggag gacgac cccgccgagtgg aaggtc ttcctgaac 576 ProPro AlaGluGlu AspAsp ProAlaGluTrp LysVal PheLeuAsn ctgcta cggacccgc ttcgag accgacaaggac aagtgg accaagata 624 LeuLeu ArgThrArg PheGlu ThrAspLysAsp LysTrp ThrLysIle gccgag tcggtcaag ggcgtc accgaggagacc accacc ggcgtgctg 672 AlaGlu SerValLys GlyVal ThrGluGluThr ThrThr GlyValLeu cggctc taccaattc gccgcg gccggggatctg gccttc ccggcgatc 720 ArgLeu TyrGlnPhe AlaAla AlaGlyAspLeu AlaPhe ProAlaIle aacgtc aacgactcg gtgacc aagtccaaattc gacaac aagtacggc 768 AsnVal AsnAspSer ValThr LysSerLysPhe AspAsn LysTyrGly actcgg cactccctg atcgac ggcatcaaccgc ggcacc gacgcgctg 816 ThrArg HisSerLeu IleAsp GlyIleAsnArg GlyThr AspAlaLeu atcggc ggtaagaag gtcctc atctgcggctac ggcgac gtcggtaag 864 IleGly GlyLysLys ValLeu IleCysGlyTyr GlyAsp ValGlyLys ggctgt gcggaggcg atgaag ggccagggagcg cgggtc tccgtcacc 912 GlyCys AlaGluAla MetLys GlyGlnGlyAla ArgVal SerValThr gagatc gacccgatc aacgcg ctgcaggccatg atggag ggcttcgac 960 GluIle AspProIle AsnAla LeuGlnAlaMet MetGlu GlyPheAsp gtggtc accgtcgag gaggcc atcggggacgcc gacatc gtcgtaacc 1008 ValVal ThrValGlu GluAla IleGlyAspAla AspIle ValValThr gcgacc ggcaacaaa gacatc atcatgctcgag cacatt aaggcgatg 1056 AlaThr GlyAsnLys AspIle IleMetLeuGlu HisIle LysAlaMet aaggac cacgcgatc ctggga aatatcggccac ttcgac aacgagatc 1109 LysAsp HisAlaIle LeuGly AsnIleGlyHis PheAsp AsnGluIle gacatg gccgggctg gagcgc tccggggcgaca cgggtc aacgtcaag 1152 AspMet AlaGlyLeu GluArg SerGlyAlaThr ArgVal AsnValLys cctcag gtcgacctg tggacc tttggcgacacg ggccgc tcgatcatc 1200 ProGln ValAspLeu TrpThr PheGlyAspThr GlyArg SerIleIle gtgctg tccgagggg cggctg ctgaacctgggc aatgcc accgggcac 1298 ValLeu SerGluGly ArgLeu LeuAsnLeuGly AsnAla ThrGlyHis ccc tcg ttc gtg atg agc aac agc ttc get aac cag acg atc gcc cag 1296 Pro Ser Phe Val Met Ser Asn Ser Phe Ala Asn Gln Thr Ile Ala Gln atc gag ctg tgg acc aag aac gac gag tac gac aac gag gtg tac cgg 1349 Ile Glu Leu Trp Thr Lys Asn Asp Glu Tyr Asp Asn Glu 
Val Tyr Arg
ctg ccc aag cac ctc gac gag aag gtg gct cga atc cat gtc gag gcc 1392
Leu Pro Lys His Leu Asp Glu Lys Val Ala Arg Ile His Val Glu Ala
ctt ggc ggt cac ctg acc aag ctg acc aag gag cag gcc gaa tac ctc 1440
Leu Gly Gly His Leu Thr Lys Leu Thr Lys Glu Gln Ala Glu Tyr Leu
ggc gtc gac gtc gag ggt ccc tac aag ccg gac cac tac cgc tac 1485
Gly Val Asp Val Glu Gly Pro Tyr Lys Pro Asp His Tyr Arg Tyr
tga 1488
<210> 24
<211> 495
<212> PRT
<213> M.Tuberculosis <400> 24 Met Thr Gly Asn Leu Val Thr Lys Asn Ser Leu Thr Pro Asp Val Arg Asn Gly Ile Asp Phe Lys Ile Ala Asp Leu Ser Leu Ala Asp Phe Gly
Arg Lys Glu Leu Arg Ile Ala Glu His Glu Met Pro Gly Leu Met Ser
Leu Arg Arg Glu Tyr Ala Glu Val Gln Pro Leu Lys Gly Ala Arg Ile
Ser Gly Ser Leu His Met Thr Val Gln Thr Ala Val Leu Ile Glu Thr
Leu Thr Ala Leu Gly Ala Glu Val Arg Trp Ala Ser Cys Asn Ile Phe
Ser Thr Gln Asp His Ala Ala Ala Ala Val Val Val Gly Pro His Gly
Thr Pro Asp Glu Pro Lys Gly Val Pro Val Phe Ala Trp Lys Gly Glu
Thr Leu Glu Glu Tyr Trp Trp Ala Ala Glu Gln Met Leu Thr Trp Pro
Asp Pro Asp Lys Pro Ala Asn Met Ile Leu Asp Asp Gly Gly Asp Ala
Thr Met Leu Val Leu Arg Gly Met Gln Tyr Glu Lys Ala Gly Val Val
Pro Pro Ala Glu Glu Asp Asp Pro Ala Glu Trp Lys Val Phe Leu Asn
Leu Leu Arg Thr Arg Phe Glu Thr Asp Lys Asp Lys Trp Thr Lys Ile
Ala Glu Ser Val Lys Gly Val Thr Glu Glu Thr Thr Thr Gly Val Leu
Arg Leu Tyr Gln Phe Ala Ala Ala Gly Asp Leu Ala Phe Pro Ala Ile
Asn Val Asn Asp Ser Val Thr Lys Ser Lys Phe Asp Asn Lys Tyr Gly
Thr Arg His Ser Leu Ile Asp Gly Ile Asn Arg Gly Thr Asp Ala Leu Ile Gly Gly Lys Lys Val Leu Ile Cys Gly Tyr Gly Asp Val Gly Lys Gly Cys Ala Glu Ala Met Lys Gly Gln Gly Ala Arg Val Ser Val Thr Glu Ile Asp Pro Ile Asn Ala Leu Gln Ala Met Met Glu Gly Phe Asp Val Val Thr Val Glu Glu Ala Ile Gly Asp Ala Asp Ile Val Val Thr Ala Thr Gly Asn Lys Asp Ile Ile Met Leu Glu His Ile Lys Ala Met Lys Asp His Ala Ile Leu Gly Asn Ile Gly His Phe Asp Asn Glu Ile Asp Met Ala Gly Leu Glu Arg Ser Gly Ala Thr Arg Val Asn Val Lys Pro Gln Val Asp Leu Trp Thr Phe Gly Asp Thr Gly Arg Ser Ile Ile Val Leu Ser Glu Gly Arg Leu Leu Asn Leu Gly Asn Ala Thr Gly His Pro Ser Phe Val Met Ser Asn Ser Phe Ala Asn Gln Thr Ile Ala Gln Ile Glu Leu Trp Thr Lys Asn Asp Glu Tyr Asp Asn Glu Val Tyr Arg Leu Pro Lys His Leu Asp Glu Lys Val Ala Arg Ile His Val Glu Ala Leu Gly Gly His Leu Thr Lys Leu Thr Lys Glu Gln Ala Glu Tyr Leu Gly Val Asp Val Glu Gly Pro Tyr Lys Pro Asp His Tyr Arg Tyr <210> 25 <211> 1803 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1800) <400> 25 gtg gct agt cac gcc ggc tcg agg atc gct cgg atc tct aag gtt ctc 48 Val Ala Ser His Ala Gly Ser Arg Ile Ala Arg Ile Ser Lys Val Leu gtc gcc aat cgc ggc gag atc gca gtg cgg gtg atc cgg gcg gcc cgc 96 Val Ala Asn Arg Gly Glu Ile Ala Val Arg Val Ile Arg Ala Ala Arg gac gcc ggc ctg ccc agc gtg gcg gtg tac gcc gaa ccc gac gcc gag 144 Asp Ala Gly Leu Pro Ser Val Ala Val Tyr Ala Glu Pro Asp Ala Glu tcc ccg cat gtt cgg ctg gcc gac gag gcg ttc gcg ctg ggc ggc cag 192 Ser Pro His Val Arg Leu Ala Asp Glu Ala Phe Ala Leu Gly Gly Gln acc tcg gcg gag tcc tat ctg gac ttc gcc aag atc ctc gac gcg gca 240 Thr Ser Ala Glu Ser Tyr Leu Asp Phe Ala Lys Ile Leu Asp Ala Ala
gcc aag tcc ggg gcc aac gcc atc cac ccc ggc tac ggc ttc cta gcg 288 Ala Lys Ser Gly Ala Asn Ala Ile His Pro Gly Tyr Gly Phe Leu Ala gaa aat gcc gac ttc gcc cag gcg gtg atc gac gcc ggc ctg atc tgg 336 Glu Asn Ala Asp Phe Ala Gln Ala Val Ile Asp Ala Gly Leu Ile Trp atc ggc ccc agc ccg cag tcg atc cgc gac ctg ggc gac aag gtc acg 384 Ile Gly Pro Ser Pro Gln Ser Ile Arg Asp Leu Gly Asp Lys Val Thr gcc cgt cac atc gcg gcc cgc gct cag gcg ccc ctg gtg ccg ggt acc 432 Ala Arg His Ile Ala Ala Arg Ala Gln Ala Pro Leu Val Pro Gly Thr ccc gat ccg gtc aaa ggc gcc gac gag gtg gtg gca ttc gcc gag gag 480 Pro Asp Pro Val Lys Gly Ala Asp Glu Val Val Ala Phe Ala Glu Glu tac ggc ctg ccg atc gcg atc aag gcc gcc cac ggc ggc ggc ggc aag 528 Tyr Gly Leu Pro Ile Ala Ile Lys Ala Ala His Gly Gly Gly Gly Lys ggc atg aag gtg gcc cgc acc atc gac gag att ccg gag ctg tac gag 576 Gly Met Lys Val Ala Arg Thr Ile Asp Glu Ile Pro Glu Leu Tyr Glu tcg gcg gtg cgc gag gcc acg gcc gcg ttc ggc cgc ggt gag tgc tac 624 Ser Ala Val Arg Glu Ala Thr Ala Ala Phe Gly Arg Gly Glu Cys Tyr gtg gag cgc tat ctc gac aag ccg cgc cac gtc gaa gca cag gtg atc 672 Val Glu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Ala Gln Val Ile gcc gac cag cac ggc aac gtc gtc gtc gcc ggc acc cgg gac tgc tcg 720 Ala Asp Gln His Gly Asn Val Val Val Ala Gly Thr Arg Asp Cys Ser ctg cag cgc cgc tac cag aag ctg gtc gag gag gcg ccc gca ccg ttc 768 Leu Gln Arg Arg Tyr Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe ctg acc gac ttt caa cgc aaa gag atc cac gac tcg gcc aaa cgg att 816 Leu Thr Asp Phe Gln Arg Lys Glu Ile His Asp Ser Ala Lys Arg Ile tgc aaa gag gcc cat tac cac ggc gcc ggc acc gtc gaa tac ctg gtc 864 Cys Lys Glu Ala His Tyr His Gly Ala Gly Thr Val Glu Tyr Leu Val ggt cag gac ggc ttg atc tcg ttc ttg gag gtc aac acg cgc ctt cag 912 Gly Gln Asp Gly Leu Ile Ser Phe Leu Glu Val Asn Thr Arg Leu Gln gta gaa cac ccg gtc acc gag gaa acc gcg ggc atc gac ttg gtg ctg 960 Val Glu His Pro Val Thr Glu Glu Thr Ala Gly Ile Asp Leu Val Leu
cag caa ttc cgg atc gcc aac ggc gaa aag ctg gac atc acc gag gat 1008 Gln Gln Phe Arg Ile Ala Asn Gly Glu Lys Leu Asp Ile Thr Glu Asp ccc acc ccg cgc ggg cac gcc atc gaa ttc cgg atc aac ggc gag gac 1056 Pro Thr Pro Arg Gly His Ala Ile Glu Phe Arg Ile Asn Gly Glu Asp gcg ggg cgt aac ttc cta ccg gcg ccc ggg ccg gtg aca aag ttc cac 1104 Ala Gly Arg Asn Phe Leu Pro Ala Pro Gly Pro Val Thr Lys Phe His ccg ccg tcc ggc ccc ggt gtg cgg gtg gac tcc ggt gtc gag acc ggc 1152 Pro Pro Ser Gly Pro Gly Val Arg Val Asp Ser Gly Val Glu Thr Gly tcg gtg atc ggc ggc cag ttc gac tcg atg ctg gcc aag ctg atc gtg 1200 Ser Val Ile Gly Gly Gln Phe Asp Ser Met Leu Ala Lys Leu Ile Val cac ggt gcc gac cgc gcc gag gcg ctg gcg cgg gcc cgg cgc gcg ctg 1248 His Gly Ala Asp Arg Ala Glu Ala Leu Ala Arg Ala Arg Arg Ala Leu aac gag ttc ggt gtc gaa ggc ctg gcg acg gtc atc ccg ttt cac cgc 1296 Asn Glu Phe Gly Val Glu Gly Leu Ala Thr Val Ile Pro Phe His Arg gcc gtg gtg tcc gac ccg gca ttc atc ggc gac gcg aac ggc ttt tcg 1344 Ala Val Val Ser Asp Pro Ala Phe Ile Gly Asp Ala Asn Gly Phe Ser gta cat acc cgc tgg atc gag acc gag tgg aat aac acc atc gag ccc 1392 Val His Thr Arg Trp Ile Glu Thr Glu Trp Asn Asn Thr Ile Glu Pro ttt acc gac ggc gaa cct ctc gac gag gac gcc cgg ccg cgt cag aag 1440 Phe Thr Asp Gly Glu Pro Leu Asp Glu Asp Ala Arg Pro Arg Gln Lys gtg gtc gtc gaa atc gac ggt cgc cgc gtc gaa gtc tcg ctg ccg gct 1488 Val Val Val Glu Ile Asp Gly Arg Arg Val Glu Val Ser Leu Pro Ala gat ctc gcg ctg tcc aat ggc ggc ggt tgc gac ccg gtc ggt gtc atc 1536 Asp Leu Ala Leu Ser Asn Gly Gly Gly Cys Asp Pro Val Gly Val Ile cgg cgc aag ccc aag ccg cgc aag cgg ggt gcg cac acc ggc gcg gcg 1584 Arg Arg Lys Pro Lys Pro Arg Lys Arg Gly Ala His Thr Gly Ala Ala gcc tcc ggt gac gcg gtg acc gcg cct atg cag ggc acc gta gtt aag 1632 Ala Ser Gly Asp Ala Val Thr Ala Pro Met Gln Gly Thr Val Val Lys ttc gcg gtc gaa gaa ggg caa gag gtc gtg gcc ggc gac cta gtg gtg 1680 Phe Ala Val Glu Glu Gly Gln Glu Val Val Ala Gly Asp Leu Val
Val gtc ctc gag gcg atg aag atg gaa aac ccg gtc acc gcg cat aag gat 1728
Val Leu Glu Ala Met Lys Met Glu Asn Pro Val Thr Ala His Lys Asp ggc acc atc acc ggg ctg gcg gtc gag gcg ggc gcg gcc atc acc cag 1776 Gly Thr Ile Thr Gly Leu Ala Val Glu Ala Gly Ala Ala Ile Thr Gln ggc acg gtg ctc gcc gag atc aag taa 1803 Gly Thr Val Leu Ala Glu Ile Lys <210> 26 <211> 600 <212> PRT
<213> M.Tuberculosis <400> 26 Val Ala Ser His Ala Gly Ser Arg Ile Ala Arg Ile Ser Lys Val Leu Val Ala Asn Arg Gly Glu Ile Ala Val Arg Val Ile Arg Ala Ala Arg Asp Ala Gly Leu Pro Ser Val Ala Val Tyr Ala Glu Pro Asp Ala Glu Ser Pro His Val Arg Leu Ala Asp Glu Ala Phe Ala Leu Gly Gly Gln Thr Ser Ala Glu Ser Tyr Leu Asp Phe Ala Lys Ile Leu Asp Ala Ala Ala Lys Ser Gly Ala Asn Ala Ile His Pro Gly Tyr Gly Phe Leu Ala Glu Asn Ala Asp Phe Ala Gln Ala Val Ile Asp Ala Gly Leu Ile Trp Ile Gly Pro Ser Pro Gln Ser Ile Arg Asp Leu Gly Asp Lys Val Thr Ala Arg His Ile Ala Ala Arg Ala Gln Ala Pro Leu Val Pro Gly Thr Pro Asp Pro Val Lys Gly Ala Asp Glu Val Val Ala Phe Ala Glu Glu Tyr Gly Leu Pro Ile Ala Ile Lys Ala Ala His Gly Gly Gly Gly Lys Gly Met Lys Val Ala Arg Thr Ile Asp Glu Ile Pro Glu Leu Tyr Glu Ser Ala Val Arg Glu Ala Thr Ala Ala Phe Gly Arg Gly Glu Cys Tyr Val Glu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Ala Gln Val Ile Ala Asp Gln His Gly Asn Val Val Val Ala Gly Thr Arg Asp Cys Ser Leu Gln Arg Arg Tyr Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe Leu Thr Asp Phe Gln Arg Lys Glu Ile His Asp Ser Ala Lys Arg Ile Cys Lys Glu Ala His Tyr His Gly Ala Gly Thr Val Glu Tyr Leu Val Gly Gln Asp Gly Leu Ile Ser Phe Leu Glu Val Asn Thr Arg Leu Gln Val Glu His Pro Val Thr Glu Glu Thr Ala Gly Ile Asp Leu Val Leu Gln Gln Phe Arg Ile Ala Asn Gly Glu Lys Leu Asp Ile Thr Glu Asp Pro Thr Pro Arg Gly His Ala Ile Glu Phe Arg Ile Asn Gly Glu Asp Ala Gly Arg Asn Phe Leu Pro Ala Pro Gly Pro Val Thr Lys Phe His Pro Pro Ser Gly Pro Gly Val Arg Val Asp Ser Gly Val Glu Thr Gly Ser Val Ile Gly Gly Gln Phe Asp Ser Met Leu Ala Lys Leu Ile Val His Gly Ala Asp Arg Ala Glu Ala Leu Ala Arg Ala Arg Arg Ala Leu Asn Glu Phe Gly Val Glu Gly Leu Ala Thr Val Ile Pro Phe His Arg Ala Val Val Ser Asp Pro Ala Phe Ile Gly Asp Ala Asn Gly Phe Ser Val His Thr Arg Trp Ile Glu Thr Glu Trp Asn Asn Thr Ile Glu Pro Phe Thr Asp Gly Glu Pro Leu Asp Glu Asp Ala Arg Pro Arg Gln Lys Val Val Val Glu Ile Asp Gly
Arg Arg Val Glu Val Ser Leu Pro Ala Asp Leu Ala Leu Ser Asn Gly Gly Gly Cys Asp Pro Val Gly Val Ile Arg Arg Lys Pro Lys Pro Arg Lys Arg Gly Ala His Thr Gly Ala Ala Ala Ser Gly Asp Ala Val Thr Ala Pro Met Gln Gly Thr Val Val Lys Phe Ala Val Glu Glu Gly Gln Glu Val Val Ala Gly Asp Leu Val Val Val Leu Glu Ala Met Lys Met Glu Asn Pro Val Thr Ala His Lys Asp Gly Thr Ile Thr Gly Leu Ala Val Glu Ala Gly Ala Ala Ile Thr Gln Gly Thr Val Leu Ala Glu Ile Lys <210> 27 <211> 318 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(315) <400> 27 atg cca gtg gtg aag atc aac gca atc gag gtg ccc gcc ggc gct ggc 48 Met Pro Val Val Lys Ile Asn Ala Ile Glu Val Pro Ala Gly Ala Gly ccc gag ctg gag aag cgg ttc gct cac cgc gcg cac gcg gtc gag aac 96 Pro Glu Leu Glu Lys Arg Phe Ala His Arg Ala His Ala Val Glu Asn tcc ccg ggt ttc ctc ggc ttt cag ctg tta cgt ccg gtc aag ggt gaa 144 Ser Pro Gly Phe Leu Gly Phe Gln Leu Leu Arg Pro Val Lys Gly Glu gaa cgc tac ttc gtg gtg aca cac tgg gag tcc gat gaa gca ttc cag 192 Glu Arg Tyr Phe Val Val Thr His Trp Glu Ser Asp Glu Ala Phe Gln gcg tgg gca aac ggg ccc gcc atc gca gcc cat gcc gga cac cgg gcc 240 Ala Trp Ala Asn Gly Pro Ala Ile Ala Ala His Ala Gly His Arg Ala aac ccc gtg gcg acc ggt gct tcg ctg ctg gaa ttc gag gtc gtg ctt 288 Asn Pro Val Ala Thr Gly Ala Ser Leu Leu Glu Phe Glu Val Val Leu gac gtc ggt ggg acc ggc aag act gca taa 318 Asp Val Gly Gly Thr Gly Lys Thr Ala <210> 28 <211> 105 <212> PRT
<213> M.Tuberculosis <400> 28 Met Pro Val Val Lys Ile Asn Ala Ile Glu Val Pro Ala Gly Ala Gly Pro Glu Leu Glu Lys Arg Phe Ala His Arg Ala His Ala Val Glu Asn Ser Pro Gly Phe Leu Gly Phe Gln Leu Leu Arg Pro Val Lys Gly Glu Glu Arg Tyr Phe Val Val Thr His Trp Glu Ser Asp Glu Ala Phe Gln Ala Trp Ala Asn Gly Pro Ala Ile Ala Ala His Ala Gly His Arg Ala Asn Pro Val Ala Thr Gly Ala Ser Leu Leu Glu Phe Glu Val Val Leu Asp Val Gly Gly Thr Gly Lys Thr Ala <210> 29 <211> 435 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(435) <400> 29 gtg gcg gac aag acg aca cag acg att tac atc gac gcg gat cca ggc 48 Val Ala Asp Lys Thr Thr Gln Thr Ile Tyr Ile Asp Ala Asp Pro Gly gag gtg atg aag gcg atc gcc gac atc gaa gcc tac ccg caa tgg att 96 Glu Val Met Lys Ala Ile Ala Asp Ile Glu Ala Tyr Pro Gln Trp Ile tcg gag tat aag gaa gtc gag atc cta gag gcc gac gac gag ggc tac 144 Ser Glu Tyr Lys Glu Val Glu Ile Leu Glu Ala Asp Asp Glu Gly Tyr ccg aaa cga gcg cga atg ttg atg gac gca gcc atc ttc aaa gac acc 192 Pro Lys Arg Ala Arg Met Leu Met Asp Ala Ala Ile Phe Lys Asp Thr ttg atc atg tcc tac gag tgg ccg gaa gac cgc caa tcg ctt agc tgg 240 Leu Ile Met Ser Tyr Glu Trp Pro Glu Asp Arg Gln Ser Leu Ser Trp act ctc gaa tcc agc tcg ctg cta aag tcc ctc gaa ggc acg tat cgc 288 Thr Leu Glu Ser Ser Ser Leu Leu Lys Ser Leu Glu Gly Thr Tyr Arg ttg gcg ccc aag ggt tct ggc act gag gtc acc tac gag ctt gcc gtc 336 Leu Ala Pro Lys Gly Ser Gly Thr Glu Val Thr Tyr Glu Leu Ala Val gac ctt gct gtc ccc atg atc ggg atg ctc aag cgt aag gcg gaa cgc 384 Asp Leu Ala Val Pro Met Ile Gly Met Leu Lys Arg Lys Ala Glu Arg agg ttg ata gac ggc gcg ttg aag gat ctg aag aaa cga gtc gag ggc 432 Arg Leu Ile Asp Gly Ala Leu Lys Asp Leu Lys Lys Arg Val Glu Gly tga 435 <210> 30 <211> 144 <212> PRT
<213> M.Tuberculosis <400> 30 Met Ala Asp Lys Thr Thr Gln Thr Ile Tyr Ile Asp Ala Asp Pro Gly Glu Val Met Lys Ala Ile Ala Asp Ile Glu Ala Tyr Pro Gln Trp Ile Ser Glu Tyr Lys Glu Val Glu Ile Leu Glu Ala Asp Asp Glu Gly Tyr Pro Lys Arg Ala Arg Met Leu Met Asp Ala Ala Ile Phe Lys Asp Thr Leu Ile Met Ser Tyr Glu Trp Pro Glu Asp Arg Gln Ser Leu Ser Trp Thr Leu Glu Ser Ser Ser Leu Leu Lys Ser Leu Glu Gly Thr Tyr Arg Leu Ala Pro Lys Gly Ser Gly Thr Glu Val Thr Tyr Glu Leu Ala Val Asp Leu Ala Val Pro Met Ile Gly Met Leu Lys Arg Lys Ala Glu Arg Arg Leu Ile Asp Gly Ala Leu Lys Asp Leu Lys Lys Arg Val Glu Gly <210> 31 <211> 441 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(438) <400> 31 atg cca gtt ttg agc aag acc gtc gag gtc acc gcc gac gcc gca tcg 48 Met Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser atc atg gcc atc gtt gcc gat atc gag cgc tac cca gag tgg aat gaa 96 Ile Met Ala Ile Val Ala Asp Ile Glu Arg Tyr Pro Glu Trp Asn Glu ggg gtc aag ggc gca tgg gtg ctc gct cgc tac gat gac ggg cgt ccc 144 Gly Val Lys Gly Ala Trp Val Leu Ala Arg Tyr Asp Asp Gly Arg Pro agc cag gtg cgg ctc gac acc gct gtt caa ggc atc gag ggc acc tat 192 Ser Gln Val Arg Leu Asp Thr Ala Val Gln Gly Ile Glu Gly Thr Tyr atc cac gcc gtg tac tac cca ggc gaa aac cag att caa acc gtc atg 240 Ile His Ala Val Tyr Tyr Pro Gly Glu Asn Gln Ile Gln Thr Val Met cag cag ggt gaa ctg ttt gcc aag cag gag cag ctg ttc agt gtg gtg 288 Gln Gln Gly Glu Leu Phe Ala Lys Gln Glu Gln Leu Phe Ser Val Val gca acc ggc gcc gcg agc ttg ctc acg gtg gac atg gac gtc cag gtc 336 Ala Thr Gly Ala Ala Ser Leu Leu Thr Val Asp Met Asp Val Gln Val acc atg ccg gtg ccc gag ccg atg gtg aag atg ctg ctc aac aac gtc 384 Thr Met Pro Val Pro Glu Pro Met Val Lys Met Leu Leu Asn Asn Val ctg gag cat ctc gcc gaa aat ctc aag cag cgc gcc gag cag ctg gcg 432 Leu Glu His Leu Ala Glu Asn Leu Lys Gln Arg Ala Glu Gln Leu Ala gcc agc taa 441 Ala Ser <210> 32 <211> 146 <212> PRT
<213> M.Tuberculosis <400> 32 Met Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser Ile Met Ala Ile Val Ala Asp Ile Glu Arg Tyr Pro Glu Trp Asn Glu Gly Val Lys Gly Ala Trp Val Leu Ala Arg Tyr Asp Asp Gly Arg Pro Ser Gln Val Arg Leu Asp Thr Ala Val Gln Gly Ile Glu Gly Thr Tyr Ile His Ala Val Tyr Tyr Pro Gly Glu Asn Gln Ile Gln Thr Val Met Gln Gln Gly Glu Leu Phe Ala Lys Gln Glu Gln Leu Phe Ser Val Val Ala Thr Gly Ala Ala Ser Leu Leu Thr Val Asp Met Asp Val Gln Val Thr Met Pro Val Pro Glu Pro Met Val Lys Met Leu Leu Asn Asn Val Leu Glu His Leu Ala Glu Asn Leu Lys Gln Arg Ala Glu Gln Leu Ala Ala Ser <210> 33 <211> 894 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(891) <400> 33 atg tca tcg ggc aat tca tct ctg gga att atc gtc ggg atc gac gat 48 Met Ser Ser Gly Asn Ser Ser Leu Gly Ile Ile Val Gly Ile Asp Asp tca ccg gcc gca cag gtt gcg gtg cgg tgg gca gct cgg gat gcg gag 96 Ser Pro Ala Ala Gln Val Ala Val Arg Trp Ala Ala Arg Asp Ala Glu ttg cga aaa atc cct ctg acg ctc gtg cac gcg gtg tcg ccg gaa gta 144 Leu Arg Lys Ile Pro Leu Thr Leu Val His Ala Val Ser Pro Glu Val gcc acc tgg ctg gag gtg cca ctg ccg ccg ggc gtg ctg cga tgg cag 192 Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gln cag gat cac ggg cgc cac ctg atc gac gac gca ctc aag gtg gtt gaa 240 Gln Asp His Gly Arg His Leu Ile Asp Asp Ala Leu Lys Val Val Glu cag gct tcg ctg cgc gct ggt ccc ccc acg gtc cac agt gaa atc gtt 288 Gln Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu Ile Val ccg gcg gca gcc gtt ccc aca ttg gtc gac atg tcc aaa gac gca gtg 336 Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val ctg atg gtc gtg ggt tgt ctc gga agt ggg cgg tgg ccg ggc cgg ctg 384 Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu ctc ggt tcg gtc agt tcc ggc ctg ctc cgc cac gcg cac tgt ccg gtc 432 Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val gtg atc atc cac gac gaa gat tcg gtg atg ccg cat ccc cag caa gcg 480 Val Ile Ile His Asp Glu Asp Ser Val Met Pro His Pro Gln Gln Ala ccg gtg cta gtt ggc gtt gac ggc tcg tcg gcc tcc gag ctg gcg acc 528 Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr gca atc gca ttc gac gaa gcg tcg cgg cga aac gtg gac ctg gtg gcg 576 Ala Ile Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala ctg cac gca tgg agc gac gtc gat gtg tcg gag tgg ccc gga atc gat 624 Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly Ile Asp tgg ccg gca act cag tcg atg gcc gag cag gtg ctg gcc gag cgg ttg 672 Trp Pro Ala Thr Gln Ser Met Ala Glu Gln Val Leu Ala Glu Arg Leu gcg ggt tgg cag gag cgg tat ccc aac gta gcc ata acc cgc gtg gtg 720 Ala Gly Trp Gln Glu Arg Tyr Pro Asn Val Ala Ile Thr Arg
Val Val gtg cgc gat cag ccg gcc cgc cag ctc gtc caa cgc tcc gag gaa gcc 768 Val Arg Asp Gln Pro Ala Arg Gln Leu Val Gln Arg Ser Glu Glu Ala cag ctg gtc gtg gtc ggc agc cgg ggc cgc ggc ggc tac gcc gga atg 816 Gln Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met ctg gtg ggg tcg gta ggc gaa acc gtt gct cag ctg gcg cgg acg ccg 864 Leu Val Gly Ser Val Gly Glu Thr Val Ala Gln Leu Ala Arg Thr Pro gtc atc gtg gca cgc gag tcg ctg act tag 894 Val Ile Val Ala Arg Glu Ser Leu Thr <210> 34 <211> 297 <212> PRT
<213> M.Tuberculosis <400> 34 Met Ser Ser Gly Asn Ser Ser Leu Gly Ile Ile Val Gly Ile Asp Asp Ser Pro Ala Ala Gln Val Ala Val Arg Trp Ala Ala Arg Asp Ala Glu Leu Arg Lys Ile Pro Leu Thr Leu Val His Ala Val Ser Pro Glu Val Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gln Gln Asp His Gly Arg His Leu Ile Asp Asp Ala Leu Lys Val Val Glu Gln Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu Ile Val Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val Val Ile Ile His Asp Glu Asp Ser Val Met Pro His Pro Gln Gln Ala Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr Ala Ile Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly Ile Asp Trp Pro Ala Thr Gln Ser Met Ala Glu Gln Val Leu Ala Glu Arg Leu Ala Gly Trp Gln Glu Arg Tyr Pro Asn Val Ala Ile Thr Arg Val Val Val Arg Asp Gln Pro Ala Arg Gln Leu Val Gln Arg Ser Glu Glu Ala Gln Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met Leu Val Gly Ser Val Gly Glu Thr Val Ala Gln Leu Ala Arg Thr Pro Val Ile Val Ala Arg Glu Ser Leu Thr <210> 35 <211> 957 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(954) <400> 35 atg gct gaa gta ctg gtg ctc gtt gag cac gct gaa ggc gcg tta aag 48 Met Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys aag gtc agc gcc gaa ttg atc acc gcc gcc cgc gcc ttg ggc gaa cca 96 Lys Val Ser Ala Glu Leu Ile Thr Ala Ala Arg Ala Leu Gly Glu Pro gcc gcc gtc gtc gtc ggt gtg ccg ggg acg gcc gcg ccg ctg gtg gac 144 Ala Ala Val Val Val Gly Val Pro Gly Thr Ala Ala Pro Leu Val Asp ggg ctt aag gcg gct ggt gcc gcc aag atc tac gtc gcc gag tcc gac 192 Gly Leu Lys Ala Ala Gly Ala Ala Lys Ile Tyr Val Ala Glu Ser Asp ctt gtc gac aaa tac ctg atc acc ccg gcg gtc gac gtg ctg gcc ggg 240 Leu Val Asp Lys Tyr Leu Ile Thr Pro Ala Val Asp Val Leu Ala Gly ctg gcc gag tcc tcg gcc cct gcc ggc gta cta atc gcc gcc acc gcg 288 Leu Ala Glu Ser Ser Ala Pro Ala Gly Val Leu Ile Ala Ala Thr Ala gac ggc aag gag atc gcc ggc cga ctt gcg gct cgg atc ggc tcg ggt 336 Asp Gly Lys Glu Ile Ala Gly Arg Leu Ala Ala Arg Ile Gly Ser Gly ctg ctg gtc gac gtg gtc gac gtg aga gaa ggt gga gtg ggt gtc cac 384 Leu Leu Val Asp Val Val Asp Val Arg Glu Gly Gly Val Gly Val His agc atc ttc ggt ggg gcg ttc acc gtc gaa gcg cag gcc aac ggc gac 432 Ser Ile Phe Gly Gly Ala Phe Thr Val Glu Ala Gln Ala Asn Gly Asp acc ccg gtg atc acc gtg cgc gca gga gcc gtg gag gcg gag ccg gcc 480 Thr Pro Val Ile Thr Val Arg Ala Gly Ala Val Glu Ala Glu Pro Ala gcc ggc gcc ggt gag cag gtc agc gtg gaa gtg ccg gct gcg gcg gag 528 Ala Gly Ala Gly Glu Gln Val Ser Val Glu Val Pro Ala Ala Ala Glu aac gcc gcc agg atc acc gcg cgc gaa ccg gcg gtc gcc ggc gac cgg 576 Asn Ala Ala Arg Ile Thr Ala Arg Glu Pro Ala Val Ala Gly Asp Arg ccg gag ctg acc gag gcg acc att gtg gtg gcc ggt ggc cgt ggt gtc 624 Pro Glu Leu Thr Glu Ala Thr Ile Val Val Ala Gly Gly Arg Gly Val ggc agc gcg gag aac ttc agc gtg gtc gag gcg ctg gcc gac tcg ctg 672 Gly Ser Ala Glu Asn Phe Ser Val Val Glu Ala Leu Ala Asp Ser Leu ggc gcc gcg gtc ggg gcc tcg cgt gcc gca gtc gac tcc ggc tac tac 720 Gly Ala Ala Val Gly Ala Ser Arg Ala Ala Val Asp Ser
Gly Tyr Tyr ccg ggc cag ttc cag gtc ggc cag acc ggc aag acg gtg tcg ccc cag 768 Pro Gly Gln Phe Gln Val Gly Gln Thr Gly Lys Thr Val Ser Pro Gln ctc tac att gcc ctg ggc atc tcc ggg gcg atc cag cac cgc gct ggc 816 Leu Tyr Ile Ala Leu Gly Ile Ser Gly Ala Ile Gln His Arg Ala Gly atg cag acg tcc aag acc atc gtc gcg gtc aac aag gac gaa gag gcg 864 Met Gln Thr Ser Lys Thr Ile Val Ala Val Asn Lys Asp Glu Glu Ala ccg atc ttt gag atc gcc gac tac ggg gtg gtg gga gac ctg ttc aag 912 Pro Ile Phe Glu Ile Ala Asp Tyr Gly Val Val Gly Asp Leu Phe Lys gtc gct ccg cag ctg acc gag gcc atc aag gcc cgc aag ggc 954 Val Ala Pro Gln Leu Thr Glu Ala Ile Lys Ala Arg Lys Gly tag 957 <210> 36 <211> 318 <212> PRT
<213> M.Tuberculosis <400> 36 Met Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys Lys Val Ser Ala Glu Leu Ile Thr Ala Ala Arg Ala Leu Gly Glu Pro Ala Ala Val Val Val Gly Val Pro Gly Thr Ala Ala Pro Leu Val Asp Gly Leu Lys Ala Ala Gly Ala Ala Lys Ile Tyr Val Ala Glu Ser Asp Leu Val Asp Lys Tyr Leu Ile Thr Pro Ala Val Asp Val Leu Ala Gly Leu Ala Glu Ser Ser Ala Pro Ala Gly Val Leu Ile Ala Ala Thr Ala Asp Gly Lys Glu Ile Ala Gly Arg Leu Ala Ala Arg Ile Gly Ser Gly Leu Leu Val Asp Val Val Asp Val Arg Glu Gly Gly Val Gly Val His Ser Ile Phe Gly Gly Ala Phe Thr Val Glu Ala Gln Ala Asn Gly Asp Thr Pro Val Ile Thr Val Arg Ala Gly Ala Val Glu Ala Glu Pro Ala Ala Gly Ala Gly Glu Gln Val Ser Val Glu Val Pro Ala Ala Ala Glu Asn Ala Ala Arg Ile Thr Ala Arg Glu Pro Ala Val Ala Gly Asp Arg Pro Glu Leu Thr Glu Ala Thr Ile Val Val Ala Gly Gly Arg Gly Val Gly Ser Ala Glu Asn Phe Ser Val Val Glu Ala Leu Ala Asp Ser Leu Gly Ala Ala Val Gly Ala Ser Arg Ala Ala Val Asp Ser Gly Tyr Tyr Pro Gly Gln Phe Gln Val Gly Gln Thr Gly Lys Thr Val Ser Pro Gln Leu Tyr Ile Ala Leu Gly Ile Ser Gly Ala Ile Gln His Arg Ala Gly Met Gln Thr Ser Lys Thr Ile Val Ala Val Asn Lys Asp Glu Glu Ala Pro Ile Phe Glu Ile Ala Asp Tyr Gly Val Val Gly Asp Leu Phe Lys Val Ala Pro Gln Leu Thr Glu Ala Ile Lys Ala Arg Lys Gly <210> 37 <211> 1401 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1398) <400> 37 gtg aag agc acc gtc gag cag ttg agc ccc acc cgg gtt cgt atc aac 48 Val Lys Ser Thr Val Glu Gln Leu Ser Pro Thr Arg Val Arg Ile Asn gtg gag gtg cca ttc gcc gag ctt gag ccg gat ttc cag cgg gcc tac 96 Val Glu Val Pro Phe Ala Glu Leu Glu Pro Asp Phe Gln Arg Ala Tyr aaa gag ctg gcc aaa cag gtg cgg ctg ccc ggc ttc cgg ccc ggg aag 144 Lys Glu Leu Ala Lys Gln Val Arg Leu Pro Gly Phe Arg Pro Gly Lys gcg ccg gcc aaa cta ctc gaa gcc cgc atc ggc cgg gag gcc atg ctg 192 Ala Pro Ala Lys Leu Leu Glu Ala Arg Ile Gly Arg Glu Ala Met Leu gat caa atc gtc aac gat gcg ctg ccc agc cgg tac gga cag gcg gtg 240 Asp Gln Ile Val Asn Asp Ala Leu Pro Ser Arg Tyr Gly Gln Ala Val gcc gag tcg gat gtc caa ccg ctc ggc cgg ccc aac atc gag gtg acc 288 Ala Glu Ser Asp Val Gln Pro Leu Gly Arg Pro Asn Ile Glu Val Thr aag aag gag tac ggc cag gac ctg caa ttc acc gcc gag gtc gac atc 336 Lys Lys Glu Tyr Gly Gln Asp Leu Gln Phe Thr Ala Glu Val Asp Ile cgc ccg aag atc agt ccc ccg gac ctg agc gcg ctg acg gtc tcg gtg 384 Arg Pro Lys Ile Ser Pro Pro Asp Leu Ser Ala Leu Thr Val Ser Val gat ccg atc gaa atc ggt gag gac gac gtc gac gcc gaa ctg cag tcg 432 Asp Pro Ile Glu Ile Gly Glu Asp Asp Val Asp Ala Glu Leu Gln Ser tta cgt acc cgg ttc ggc acc ctg acc gcg gtg gac cgg ccg gtg gcc 480 Leu Arg Thr Arg Phe Gly Thr Leu Thr Ala Val Asp Arg Pro Val Ala gtc ggc gac gtc gtc tcg atc gac ttg tct gcc acg gtc gac gga gag 528 Val Gly Asp Val Val Ser Ile Asp Leu Ser Ala Thr Val Asp Gly Glu gac ata ccg aac gca gcc gct gag gga ctc tcc cac gag gtc ggc tcc 576 Asp Ile Pro Asn Ala Ala Ala Glu Gly Leu Ser His Glu Val Gly Ser ggc cgg ctc atc gca ggt ctc gac gac gcg gtt gtt ggt ctg tcc gcc 624 Gly Arg Leu Ile Ala Gly Leu Asp Asp Ala Val Val Gly Leu Ser Ala gac gag tcc cgg gtc ttc acc gcc aag ctg gca gcc ggc gag cac gcc 672 Asp Glu Ser Arg Val Phe Thr Ala Lys Leu Ala Ala Gly Glu His Ala ggg cag gaa gct cag gtt acc gtc acg gtc agg tcg gtt aag gag cgc 720 Gly Gln Glu Ala Gln Val Thr Val Thr Val Arg Ser Val Lys
Glu Arg gaa cta cca gag ccc gac gac gaa ttc gcg cag tta gcc agc gag ttc 768 Glu Leu Pro Glu Pro Asp Asp Glu Phe Ala Gln Leu Ala Ser Glu Phe gac agc atc gac gaa ttg cgg gcc agc ctc agc gac cag gtg cgc cag 816 Asp Ser Ile Asp Glu Leu Arg Ala Ser Leu Ser Asp Gln Val Arg Gln gcc aag cgc gcc cag cag gcc gag cag att cga aac gcc acc atc gat 864 Ala Lys Arg Ala Gln Gln Ala Glu Gln Ile Arg Asn Ala Thr Ile Asp gcg cta ctc gaa cag gtc gac gtg ccg ttg ccg gag tcg tat gtg cag 912 Ala Leu Leu Glu Gln Val Asp Val Pro Leu Pro Glu Ser Tyr Val Gln gcc caa ttc gac agc gtg ctg cac agc gcg ctc agc ggt ctt aat cac 960 Ala Gln Phe Asp Ser Val Leu His Ser Ala Leu Ser Gly Leu Asn His gac gaa gcc cgg ttc aat gag ttg ctc gtc gag caa ggc tcg tca cgc 1008 Asp Glu Ala Arg Phe Asn Glu Leu Leu Val Glu Gln Gly Ser Ser Arg gcg gcg ttc gat gcc gag gcg cgc acc gcc tca gaa aag gac gtc aag 1056 Ala Ala Phe Asp Ala Glu Ala Arg Thr Ala Ser Glu Lys Asp Val Lys agg cag ctg ttg cta gac gcc ctg gcc gat gag ctg cag gtc caa gtt 1104 Arg Gln Leu Leu Leu Asp Ala Leu Ala Asp Glu Leu Gln Val Gln Val ggc cag gat gat ctg acc gaa cga ctg gtg acg acg tct cgg caa tac 1152 Gly Gln Asp Asp Leu Thr Glu Arg Leu Val Thr Thr Ser Arg Gln Tyr ggc atc gag ccg cag cag ctg ttc ggc tac ctc caa gag cgc aac cag 1200 Gly Ile Glu Pro Gln Gln Leu Phe Gly Tyr Leu Gln Glu Arg Asn Gln ctg ccg acc atg ttc gct gac gtg cgg cgc gag ctg gcg atc agg gcc 1248 Leu Pro Thr Met Phe Ala Asp Val Arg Arg Glu Leu Ala Ile Arg Ala gca gtg gag gcg gcg acg gtc acc gac agt gac gga aac acg atc gat 1296 Ala Val Glu Ala Ala Thr Val Thr Asp Ser Asp Gly Asn Thr Ile Asp acc agt gag ttc ttc ggc aag cgt gtg tcg gcc ggt gag gct gag gag 1344 Thr Ser Glu Phe Phe Gly Lys Arg Val Ser Ala Gly Glu Ala Glu Glu gcc gaa ccg gca gac gag ggt gcc gcg cgg gcg gcg tcc gac gaa gcg 1392 Ala Glu Pro Ala Asp Glu Gly Ala Ala Arg Ala Ala Ser Asp Glu Ala aca acg tga 1401 Thr Thr <210> 38 <211> 466 <212> PRT
<213> M.Tuberculosis <400> 38 Met Lys Ser Thr Val Glu Gln Leu Ser Pro Thr Arg Val Arg Ile Asn Val Glu Val Pro Phe Ala Glu Leu Glu Pro Asp Phe Gln Arg Ala Tyr Lys Glu Leu Ala Lys Gln Val Arg Leu Pro Gly Phe Arg Pro Gly Lys Ala Pro Ala Lys Leu Leu Glu Ala Arg Ile Gly Arg Glu Ala Met Leu Asp Gln Ile Val Asn Asp Ala Leu Pro Ser Arg Tyr Gly Gln Ala Val Ala Glu Ser Asp Val Gln Pro Leu Gly Arg Pro Asn Ile Glu Val Thr Lys Lys Glu Tyr Gly Gln Asp Leu Gln Phe Thr Ala Glu Val Asp Ile Arg Pro Lys Ile Ser Pro Pro Asp Leu Ser Ala Leu Thr Val Ser Val Asp Pro Ile Glu Ile Gly Glu Asp Asp Val Asp Ala Glu Leu Gln Ser Leu Arg Thr Arg Phe Gly Thr Leu Thr Ala Val Asp Arg Pro Val Ala Val Gly Asp Val Val Ser Ile Asp Leu Ser Ala Thr Val Asp Gly Glu Asp Ile Pro Asn Ala Ala Ala Glu Gly Leu Ser His Glu Val Gly Ser Gly Arg Leu Ile Ala Gly Leu Asp Asp Ala Val Val Gly Leu Ser Ala Asp Glu Ser Arg Val Phe Thr Ala Lys Leu Ala Ala Gly Glu His Ala Gly Gln Glu Ala Gln Val Thr Val Thr Val Arg Ser Val Lys Glu Arg Glu Leu Pro Glu Pro Asp Asp Glu Phe Ala Gln Leu Ala Ser Glu Phe Asp Ser Ile Asp Glu Leu Arg Ala Ser Leu Ser Asp Gln Val Arg Gln Ala Lys Arg Ala Gln Gln Ala Glu Gln Ile Arg Asn Ala Thr Ile Asp Ala Leu Leu Glu Gln Val Asp Val Pro Leu Pro Glu Ser Tyr Val Gln Ala Gln Phe Asp Ser Val Leu His Ser Ala Leu Ser Gly Leu Asn His Asp Glu Ala Arg Phe Asn Glu Leu Leu Val Glu Gln Gly Ser Ser Arg Ala Ala Phe Asp Ala Glu Ala Arg Thr Ala Ser Glu Lys Asp Val Lys Arg Gln Leu Leu Leu Asp Ala Leu Ala Asp Glu Leu Gln Val Gln Val Gly Gln Asp Asp Leu Thr Glu Arg Leu Val Thr Thr Ser Arg Gln Tyr Gly Ile Glu Pro Gln Gln Leu Phe Gly Tyr Leu Gln Glu Arg Asn Gln Leu Pro Thr Met Phe Ala Asp Val Arg Arg Glu Leu Ala Ile Arg Ala Ala Val Glu Ala Ala Thr Val Thr Asp Ser Asp Gly Asn Thr Ile Asp Thr Ser Glu Phe Phe Gly Lys Arg Val Ser Ala Gly Glu Ala Glu Glu Ala Glu Pro Ala Asp Glu Gly Ala Ala Arg Ala Ala Ser Asp Glu Ala Thr Thr <210> 39 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 39 Thr Glu Arg Thr Ala Val Leu Ile Lys Pro Asp Gly Ile Glu Arg <210> 40 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 40 Thr Asp Thr Gln Val Thr Trp Leu Thr Gln Glu Ser His Asp Arg <210> 41 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 41 Met Ile Asp Glu Ala Leu Phe Asp Ala Glu Glu Lys Met Glu Lys <210> 42 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 42 Pro Leu Pro Ala Asp Pro Ser Thr Asp Leu Ser Ala Tyr Ala Gln <210> 43 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 43 Met Leu Ile Ser Gln Arg Pro Thr Leu Ser Glu Asp Val Leu Thr <210> 44 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 44 Thr Gly Asn Leu Val Thr Lys Asn Ser Leu Thr Pro Asp Val Arg <210> 45 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 45 Met Glu Val Lys Ile Gly Ile Thr Asp Ser Pro Arg Glu Leu Val <210> 46 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 46 Ser Ala Tyr Lys Thr Val Val Val Gly Thr Asp Asp Xaa Ser Xaa <210> 47 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 47 Met Glu Gln Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val <210> 48 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 48 Ala Asp Ile Asp Gly Val Thr Gly Ser Ala Gly Leu Asn Pro Ala <210> 49 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 49 Thr Tyr Glu Thr Ile Leu Val Glu Arg Asp Gln Arg Val Gly Ile <210> 50 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 50 Pro Val Thr Gln Glu Glu Ile Ile Ala Gly Ile Ala Glu Ile Ile <210> 51 <211> 14 <212> PRT
<213> M.Tuberculosis <400> 51 Pro Val Val Lys Ile Asn Ala Ile Glu Val Pro Ala Gly Ala <210> 52 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 52 Ala Asp Lys Thr Thr Gln Thr Ile Tyr Ile Asp Ala Asp Pro Gly <210> 53 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 53 Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser <210> 54 <211> 14 <212> PRT
<213> M.Tuberculosis <400> 54 Ser Gly Asn Ser Ser Leu Gly Ile Ile Val Gly Ile Asp Asp <210> 55 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 55 Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys <210> 56 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 56 Met Lys Ser Thr Val Glu Gln Leu Ser Pro Thr Arg Val Arg Ile <210> 57 <211> 11 <212> PRT
<213> M.Tuberculosis <400> 57 Val Ile Arg Arg Lys Pro Lys Pro Arg Xaa Arg <210> 58 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 58 ctgagatctg tggaggtcaa gatcggt 27 <210> 59 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 59 ctcccatggc tacttacccg ctcgtagcaa c 31 <210> 60 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 60 ctgagatctc ctgtcactca ggaagaa 27 <210> 61 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 61 ctcccatggg aaaccgccat tagcggt 27 <210> 62 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 62 cccaagctta tggaacagcg tgcggag 27 <210> 63 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 63 ctcccatggc gacactcgat ccggatt 27 <210> 64 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 64 ctgagatcta tgccagtggt gaagatc 27 <210> 65 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 65 ctcccatggt tatgcagtct tgccggt 27 <210> 66 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 66 ctgagatctg cggacaagac gacacag 27 <210> 67 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 67 ctcccatggt accggaatca ctcagcc 27 <210> 68 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 68 ctgagatctc cagttttgag caagacc 27 <210> 69 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 69 ctcccatggg cacatgcctt agctggc 27 <210> 70 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 70 ctgagatcta tgtcatcggg caattca 27 <210> 71 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 71 ctcccatggc tacctaagtc agcgactcgc g 31 <210> 72 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 72 ctgagatctg tgaagagcac cgtcgag 27 <210> 73 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 73 ctcccatggg tcatacggtc acgttgt 27 <210> 74 <211> 348 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(348) <400> 74 atg gca ctc aag gta gag atg gtc act ttc gac tgc agc gac cct gcg 48 Met Ala Leu Lys Val Glu Met Val Thr Phe Asp Cys Ser Asp Pro Ala aag ctt gcc ggc tgg tgg gcc gag cag ttc gat ggc acg acg cgt gaa 96 Lys Leu Ala Gly Trp Trp Ala Glu Gln Phe Asp Gly Thr Thr Arg Glu ctg ctg ccc ggc gaa ttc gtc gtg gtc gcc cgg acc gat gga ccg cgg 144 Leu Leu Pro Gly Glu Phe Val Val Val Ala Arg Thr Asp Gly Pro Arg ttg gga ttc cag aag gtg ccc gat ccc gcc cct ggg aaa aac cgc gtg 192 Leu Gly Phe Gln Lys Val Pro Asp Pro Ala Pro Gly Lys Asn Arg Val cac ctc gac ttc acg acc aag gac ctg gat gcc gag gtg ttg cgc ctg 240 His Leu Asp Phe Thr Thr Lys Asp Leu Asp Ala Glu Val Leu Arg Leu gtc gcc gcc gga gcc agt gag gtc ggg cgg cat cag gtc ggc gag agc 288 Val Ala Ala Gly Ala Ser Glu Val Gly Arg His Gln Val Gly Glu Ser ttt cgc tgg gtg gtg ctg gct gac ccc gaa ggc aac gct ttt tgc gtg 336 Phe Arg Trp Val Val Leu Ala Asp Pro Glu Gly Asn Ala Phe Cys Val gcg ggt caa taa 348 Ala Gly Gln <210> 75 <211> 115 <212> PRT
<213> M.Tuberculosis <400> 75 Met Ala Leu Lys Val Glu Met Val Thr Phe Asp Cys Ser Asp Pro Ala Lys Leu Ala Gly Trp Trp Ala Glu Gln Phe Asp Gly Thr Thr Arg Glu Leu Leu Pro Gly Glu Phe Val Val Val Ala Arg Thr Asp Gly Pro Arg Leu Gly Phe Gln Lys Val Pro Asp Pro Ala Pro Gly Lys Asn Arg Val His Leu Asp Phe Thr Thr Lys Asp Leu Asp Ala Glu Val Leu Arg Leu Val Ala Ala Gly Ala Ser Glu Val Gly Arg His Gln Val Gly Glu Ser Phe Arg Trp Val Val Leu Ala Asp Pro Glu Gly Asn Ala Phe Cys Val Ala Gly Gln <210> 76 <211> 564 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(564) <400> 76 atg gcc gac gct gac acc acc gac ttc gac gtc gac gca gaa gca ccg 48 Met Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro ggt gga ggc gtc cgg gag gac acg gcg acg gat gct gac gag gcc gac 96 Gly Gly Gly Val Arg Glu Asp Thr Ala Thr Asp Ala Asp Glu Ala Asp gat caa gaa gag aga ttg gtc gcc gag ggc gag att gca ggc gac tac 144 Asp Gln Glu Glu Arg Leu Val Ala Glu Gly Glu Ile Ala Gly Asp Tyr ctg gaa gag tta ttg gac gtg ttg gac ttc gat ggc gac atc gac ctc 192 Leu Glu Glu Leu Leu Asp Val Leu Asp Phe Asp Gly Asp Ile Asp Leu gat gtc gaa ggc aat cgt gcg gtg gtg agc atc gac ggc agt gac gac 240 Asp Val Glu Gly Asn Arg Ala Val Val Ser Ile Asp Gly Ser Asp Asp ctg aac aag ttg gtc ggg cgc ggg ggc gag gtg ctc gac gct ctg cag 288 Leu Asn Lys Leu Val Gly Arg Gly Gly Glu Val Leu Asp Ala Leu Gln gaa ctc acc cgg ttg gcg gtg cat cag aag acc ggt gtg cgg agc cgg 336 Glu Leu Thr Arg Leu Ala Val His Gln Lys Thr Gly Val Arg Ser Arg ttg atg cta gac atc gcg agg tgg cga cgg cgg cgc cgg gag gaa ttg 384 Leu Met Leu Asp Ile Ala Arg Trp Arg Arg Arg Arg Arg Glu Glu Leu gcg gcg ctg gcc gac gag gtg gcg cgg cga gtg gcc gaa acc ggt gac 432 Ala Ala Leu Ala Asp Glu Val Ala Arg Arg Val Ala Glu Thr Gly Asp cgc gag gaa ctc gtt cca atg acg ccg ttc gaa cgg aag atc gtc cac 480 Arg Glu Glu Leu Val Pro Met Thr Pro Phe Glu Arg Lys Ile Val His gat gcg gtt gca gcg gtg cca ggt gtg cac agc gaa agc gaa ggc gtg 528 Asp Ala Val Ala Ala Val Pro Gly Val His Ser Glu Ser Glu Gly Val gag cca gaa cgc cga gtc gtt gtg ctc cgc gac tag 564 Glu Pro Glu Arg Arg Val Val Val Leu Arg Asp <210> 77 <211> 187 <212> PRT
<213> M.Tuberculosis <400> 77 Met Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro Gly Gly Gly Val Arg Glu Asp Thr Ala Thr Asp Ala Asp Glu Ala Asp Asp Gln Glu Glu Arg Leu Val Ala Glu Gly Glu Ile Ala Gly Asp Tyr Leu Glu Glu Leu Leu Asp Val Leu Asp Phe Asp Gly Asp Ile Asp Leu Asp Val Glu Gly Asn Arg Ala Val Val Ser Ile Asp Gly Ser Asp Asp Leu Asn Lys Leu Val Gly Arg Gly Gly Glu Val Leu Asp Ala Leu Gln Glu Leu Thr Arg Leu Ala Val His Gln Lys Thr Gly Val Arg Ser Arg Leu Met Leu Asp Ile Ala Arg Trp Arg Arg Arg Arg Arg Glu Glu Leu Ala Ala Leu Ala Asp Glu Val Ala Arg Arg Val Ala Glu Thr Gly Asp Arg Glu Glu Leu Val Pro Met Thr Pro Phe Glu Arg Lys Ile Val His Asp Ala Val Ala Ala Val Pro Gly Val His Ser Glu Ser Glu Gly Val Glu Pro Glu Arg Arg Val Val Val Leu Arg Asp <210> 78 <211> 1167 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1167) <400> 78 atg agc aag acg gtt ctc atc ctt ggc gcg ggt gtc ggc ggc ctg acc 48 Met Ser Lys Thr Val Leu Ile Leu Gly Ala Gly Val Gly Gly Leu Thr acc gcc gac acc ctc cgt caa ctg cta cca cct gag gat cga atc ata 96 Thr Ala Asp Thr Leu Arg Gln Leu Leu Pro Pro Glu Asp Arg Ile Ile ttg gtg gac agg agc ttt gac ggg acg ctg ggc ttg tcg ttg cta tgg 144 Leu Val Asp Arg Ser Phe Asp Gly Thr Leu Gly Leu Ser Leu Leu Trp gtg ttg cgg ggc tgg cgg cgg cct gac gac gtc cgc gtc cgc ccc acc 192 Val Leu Arg Gly Trp Arg Arg Pro Asp Asp Val Arg Val Arg Pro Thr gcg gcg tcg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gcc cac 240 Ala Ala Ser Leu Pro Gly Val Glu Met Val Thr Ala Thr Val Ala His att gac atc gcg gcc cag gta gtg cac acc gac aac agc gtc atc ggc 288 Ile Asp Ile Ala Ala Gln Val Val His Thr Asp Asn Ser Val Ile Gly tat gac gcg ttg gtg atc gca tta ggt gcg gcg ctg aac acc gac gcc 336 Tyr Asp Ala Leu Val Ile Ala Leu Gly Ala Ala Leu Asn Thr Asp Ala gtt ccc gga ctg tcg gac gcg ctc gac gcc gac gtc gcg ggc cag ttc 384 Val Pro Gly Leu Ser Asp Ala Leu Asp Ala Asp Val Ala Gly Gln Phe tac acc ctg gac ggc gcg gct gag ctg cgt gcg aag gtc gag gcg ctc 432 Tyr Thr Leu Asp Gly Ala Ala Glu Leu Arg Ala Lys Val Glu Ala Leu gag cat ggc cgg atc gct gtg gct atc gcc ggg gtg ccg ttc aaa tgc 480 Glu His Gly Arg Ile Ala Val Ala Ile Ala Gly Val Pro Phe Lys Cys cca gcc gca ccg ttc gaa gcg gcg ttt ctg atc gcc gcc caa ctc ggt 528 Pro Ala Ala Pro Phe Glu Ala Ala Phe Leu Ile Ala Ala Gln Leu Gly gac cgc tac gcc acc gga acc gta cag atc gac acg ttc acg cct gac 576 Asp Arg Tyr Ala Thr Gly Thr Val Gln Ile Asp Thr Phe Thr Pro Asp ccg ctg ccg atg ccc gtt gca ggt ccc gag gtc ggc gag gct ttg gtc 624 Pro Leu Pro Met Pro Val Ala Gly Pro Glu Val Gly Glu Ala Leu Val tcg atg ctc aag gat cac ggt gtc ggc ttc cat cct cgc aag gcc cta 672 Ser Met Leu Lys Asp His Gly Val Gly Phe His Pro Arg Lys Ala Leu gct cgc gtc gat gag gcc gca agg acg atg cac ttc ggt gac ggc acg 720 Ala Arg Val Asp Glu Ala Ala Arg Thr Met His Phe Gly Asp Gly Thr tcc gaa ccg ttc gat ctg ctt gcc gtg gtc ccc ccg cac gtg ccc tcc 768 Ser Glu Pro Phe Asp Leu Leu Ala Val Val Pro Pro His Val Pro Ser gcc gcg gcg cgg tca gcg ggt ctc agc gaa tcc ggg tgg ata ccc gtg 816 Ala Ala Ala Arg Ser Ala Gly Leu Ser Glu Ser Gly Trp Ile Pro Val gac ccg cgc acc ctg tcc act agc gcc gac aac gtg tgg gcc atc ggc 864 Asp Pro Arg Thr Leu Ser Thr Ser Ala Asp Asn Val Trp Ala Ile Gly gat gcg acc gtg ctg acg ctg ccg aat ggc aaa ccg ctg ccc aag gct 912 Asp Ala Thr Val Leu Thr Leu Pro Asn Gly Lys Pro Leu Pro Lys Ala gcc gtg ttc gcc gaa gcc cag gcc gca gtt gtc gcc cac ggc gtc gcc 960 Ala Val Phe Ala Glu Ala Gln Ala Ala Val Val Ala His Gly Val Ala cgc cat ctc ggt tac gac gta gct gag cgc cac ttc acc ggc acg ggc 1008 Arg His Leu Gly Tyr Asp Val Ala Glu Arg His Phe Thr Gly Thr Gly gcc tgc tac gtc gag acc ggt gat cac cag gca gcc aag ggc gac ggc 1056 Ala Cys Tyr Val Glu Thr Gly Asp His Gln Ala Ala Lys Gly Asp Gly gat ttc ttc gct ccg tcg gcg ccc tcg gtg acg ctg tac ccg ccg tcg 1104 Asp Phe Phe Ala Pro Ser Ala Pro Ser Val Thr Leu Tyr Pro Pro Ser cgg gag ttt cac gag gag aag gtc gca caa gaa ctg gcc tgg ctg acc 1152 Arg Glu Phe His Glu Glu Lys Val Ala Gln Glu Leu Ala Trp Leu Thr cgc tgg aag acg tga 1167 Arg Trp Lys Thr <210> 79 <211> 388 <212> PRT
<213> M.Tuberculosis <400> 79 Met Ser Lys Thr Val Leu Ile Leu Gly Ala Gly Val Gly Gly Leu Thr Thr Ala Asp Thr Leu Arg Gln Leu Leu Pro Pro Glu Asp Arg Ile Ile Leu Val Asp Arg Ser Phe Asp Gly Thr Leu Gly Leu Ser Leu Leu Trp Val Leu Arg Gly Trp Arg Arg Pro Asp Asp Val Arg Val Arg Pro Thr Ala Ala Ser Leu Pro Gly Val Glu Met Val Thr Ala Thr Val Ala His Ile Asp Ile Ala Ala Gln Val Val His Thr Asp Asn Ser Val Ile Gly Tyr Asp Ala Leu Val Ile Ala Leu Gly Ala Ala Leu Asn Thr Asp Ala Val Pro Gly Leu Ser Asp Ala Leu Asp Ala Asp Val Ala Gly Gln Phe Tyr Thr Leu Asp Gly Ala Ala Glu Leu Arg Ala Lys Val Glu Ala Leu Glu His Gly Arg Ile Ala Val Ala Ile Ala Gly Val Pro Phe Lys Cys Pro Ala Ala Pro Phe Glu Ala Ala Phe Leu Ile Ala Ala Gln Leu Gly Asp Arg Tyr Ala Thr Gly Thr Val Gln Ile Asp Thr Phe Thr Pro Asp Pro Leu Pro Met Pro Val Ala Gly Pro Glu Val Gly Glu Ala Leu Val Ser Met Leu Lys Asp His Gly Val Gly Phe His Pro Arg Lys Ala Leu Ala Arg Val Asp Glu Ala Ala Arg Thr Met His Phe Gly Asp Gly Thr Ser Glu Pro Phe Asp Leu Leu Ala Val Val Pro Pro His Val Pro Ser Ala Ala Ala Arg Ser Ala Gly Leu Ser Glu Ser Gly Trp Ile Pro Val Asp Pro Arg Thr Leu Ser Thr Ser Ala Asp Asn Val Trp Ala Ile Gly Asp Ala Thr Val Leu Thr Leu Pro Asn Gly Lys Pro Leu Pro Lys Ala Ala Val Phe Ala Glu Ala Gln Ala Ala Val Val Ala His Gly Val Ala Arg His Leu Gly Tyr Asp Val Ala Glu Arg His Phe Thr Gly Thr Gly Ala Cys Tyr Val Glu Thr Gly Asp His Gln Ala Ala Lys Gly Asp Gly Asp Phe Phe Ala Pro Ser Ala Pro Ser Val Thr Leu Tyr Pro Pro Ser Arg Glu Phe His Glu Glu Lys Val Ala Gln Glu Leu Ala Trp Leu Thr Arg Trp Lys Thr <210> 80 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 80 Ala Leu Lys Val Glu Met Val Thr Phe Asp Xaa Ser Asp Pro Ala <210> 81 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 81 Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro <210> 82 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 82 Ser Lys Thr Val Leu Ile Leu Gly Ala Gly Val Gly Gly Leu Thr <210> 83 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 83 ctgagatcta tggcactcaa ggtagag 27 <210> 84 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 84 ctcccatggt tattgacccg ccacgca 27 <210> 85 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 85 ctgagatcta tggccgacgc tgacacc 27 <210> 86 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 86 ctcccatggc tagtcgcgga gcacaac 27 <210> 87 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 87 ctgagatcta tgagcaagac ggttctc 27 <210> 88 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 88 ctcccatggt cacgtcttcc agcgggt 27 <210> 89 <211> 28 <212> DNA
<213> M.Tuberculosis <400> 89 ctgccatggc taggtggtgt gcacgatc 28 <210> 90 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 90 ctgaagctta tgagcgccta taagacc 27 <210> 91 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 91 ctgagatcta tgattgatga ggctctc 27 <210> 92 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 92 ctcccatgga gcggccgcta gacctcc 27 <210> 93 <211> 30 <212> DNA
<213> M.Tuberculosis <400> 93 ggctgagact catggccgac atcgatggtg 30 <210> 94 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 94 cgtaccatgg tcatgacgac accccctcgt g 31 <210> 95 <211> 30 <212> DNA
<213> M.Tuberculosis <400> 95 ggctgagact catggctgaa gtactggtgc 30 <210> 96 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 96 cgtaccatgg ctagccggcg accgccggtt c 31 <210> 97 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 97 gtgaccgaac ggactctggt 20 <210> 98 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 98 ctaggcgccg ggaaaccaga g 21 <210> 99 <211> 23 <212> DNA
<213> M.Tuberculosis <400> 99 atgacggata ctcaagtcac ctg 23 <210> 100 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 100 ggagtggtac ggctcggcgc 20 <210> 101 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 101 atgacgtacg aaaccatcct 20 <210> 102 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 102 tcatcggtgg gtgaactggg g 21 <210> 103 <211> 23 <212> DNA
<213> M.Tuberculosis <400> 103 atgccgcttc ccgcagaccc tag 23 <210> 104 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 104 tacgacgggt accactcctg g 21 <210> 105 <211> 22 <212> DNA
<213> M.Tuberculosis <400> 105 atgctgatct cacagcgccc ca 22 <210> 106 <211> 22 <212> DNA
<213> M.Tuberculosis <400> 106 aagctgttcg gtttcggcgt ag 22 <210> 107 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 107 atgaccggaa atttggtgac 20 <210> 108 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 108 tcagtagcgg tagtggtccg g 21
<213> M.Tuberculosis <400> 26 Val Ala Ser His Ala Gly Ser Arg Ile Ala Arg Ile Ser Lys Val Leu Val Ala Asn Arg Gly Glu Ile Ala Val Arg Val Ile Arg Ala Ala Arg Asp Ala Gly Leu Pro Ser Val Ala Val Tyr Ala Glu Pro Asp Ala Glu Ser Pro His Val Arg Leu Ala Asp Glu Ala Phe Ala Leu Gly Gly Gln Thr Ser Ala Glu Ser Tyr Leu Asp Phe Ala Lys Ile Leu Asp Ala Ala Ala Lys Ser Gly Ala Asn Ala Ile His Pro Gly Tyr Gly Phe Leu Ala Glu Asn Ala Asp Phe Ala Gln Ala Val Ile Asp Ala Gly Leu Ile Trp Ile Gly Pro Ser Pro Gln Ser Ile Arg Asp Leu Gly Asp Lys Val Thr Ala Arg His Ile Ala Ala Arg Ala Gln Ala Pro Leu Val Pro Gly Thr Pro Asp Pro Val Lys Gly Ala Asp Glu Val Val Ala Phe Ala Glu Glu Tyr Gly Leu Pro Ile Ala Ile Lys Ala Ala His Gly Gly Gly Gly Lys Gly Met Lys Val Ala Arg Thr Ile Asp Glu Ile Pro Glu Leu Tyr Glu Ser Ala Val Arg Glu Ala Thr Ala Ala Phe Gly Arg Gly Glu Cys Tyr Val Glu Arg Tyr Leu Asp Lys Pro Arg His Val Glu Ala Gln Val Ile Ala Asp Gln His Gly Asn Val Val Val Ala Gly Thr Arg Asp Cys Ser Leu Gln Arg Arg Tyr Gln Lys Leu Val Glu Glu Ala Pro Ala Pro Phe Leu Thr Asp Phe Gln Arg Lys Glu Ile His Asp Ser Ala Lys Arg Ile Cys Lys Glu Ala His Tyr His Gly Ala Gly Thr Val Glu Tyr Leu Val Gly Gln Asp Gly Leu Ile Ser Phe Leu Glu Val Asn Thr Arg Leu Gln Val Glu His Pro Val Thr Glu Glu Thr Ala Gly Ile Asp Leu Val Leu Gln Gln Phe Arg Ile Ala Asn Gly Glu Lys Leu Asp Ile Thr Glu Asp Pro Thr Pro Arg Gly His Ala Ile Glu Phe Arg Ile Asn Gly Glu Asp Ala Gly Arg Asn Phe Leu Pro Ala Pro Gly Pro Val Thr Lys Phe His Pro Pro Ser Gly Pro Gly Val Arg Val Asp Ser Gly Val Glu Thr Gly Ser Val Ile Gly Gly Gln Phe Asp Ser Met Leu Ala Lys Leu Ile Val His Gly Ala Asp Arg Ala Glu Ala Leu Ala Arg Ala Arg Arg Ala Leu Asn Glu Phe Gly Val Glu Gly Leu Ala Thr Val Ile Pro Phe His Arg Ala Val Val Ser Asp Pro Ala Phe Ile Gly Asp Ala Asn Gly Phe Ser Val His Thr Arg Trp Ile Glu Thr Glu Trp Asn Asn Thr Ile Glu Pro Phe Thr Asp Gly Glu Pro Leu Asp Glu Asp Ala Arg Pro Arg Gln Lys Val Val Val Glu Ile Asp Gly Arg Arg Val Glu Val Ser Leu Pro Ala Asp Leu Ala Leu Ser Asn Gly Gly Gly Cys Asp Pro Val Gly Val Ile Arg Arg Lys Pro Lys Pro Arg Lys Arg Gly Ala His Thr Gly Ala Ala Ala Ser Gly Asp Ala Val Thr Ala Pro Met Gln Gly Thr Val Val Lys Phe Ala Val Glu Glu Gly Gln Glu Val Val Ala Gly Asp Leu Val Val Val Leu Glu Ala Met Lys Met Glu Asn Pro Val Thr Ala His Lys Asp Gly Thr Ile Thr Gly Leu Ala Val Glu Ala Gly Ala Ala Ile Thr Gln Gly Thr Val Leu Ala Glu Ile Lys <210> 27 <211> 318 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(315) <400> 27 atg cca gtg gtg aag atc aac gca atc gag gtg ccc gcc ggc gct ggc 48 Met Pro Val Val Lys Ile Asn Ala Ile Glu Val Pro Ala Gly Ala Gly ccc gag ctg gag aag cgg ttc gct cac cgc gcg cac gcg gtc gag aac 96 Pro Glu Leu Glu Lys Arg Phe Ala His Arg Ala His Ala Val Glu Asn tcc ccg ggt ttc ctc ggc ttt cag ctg tta cgt ccg gtc aag ggt gaa 144 Ser Pro Gly Phe Leu Gly Phe Gln Leu Leu Arg Pro Val Lys Gly Glu gaa cgc tac ttc gtg gtg aca cac tgg gag tcc gat gaa gca ttc cag 192 Glu Arg Tyr Phe Val Val Thr His Trp Glu Ser Asp Glu Ala Phe Gln gcg tgg gca aac ggg ccc gcc atc gca gcc cat gcc gga cac cgg gcc 240 Ala Trp Ala Asn Gly Pro Ala Ile Ala Ala His Ala Gly His Arg Ala aac ccc gtg gcg acc ggt gct tcg ctg ctg gaa ttc gag gtc gtg ctt 288 Asn Pro Val Ala Thr Gly Ala Ser Leu Leu Glu Phe Glu Val Val Leu gac gtc ggt ggg acc ggc aag act gca taa 318 Asp Val Gly Gly Thr Gly Lys Thr Ala <210> 28 <211> 105 <212> PRT
<213> M.Tuberculosis <400> 28 Met Pro Val Val Lys Ile Asn Ala Ile Glu Val Pro Ala Gly Ala Gly Pro Glu Leu Glu Lys Arg Phe Ala His Arg Ala His Ala Val Glu Asn Ser Pro Gly Phe Leu Gly Phe Gln Leu Leu Arg Pro Val Lys Gly Glu Glu Arg Tyr Phe Val Val Thr His Trp Glu Ser Asp Glu Ala Phe Gln Ala Trp Ala Asn Gly Pro Ala Ile Ala Ala His Ala Gly His Arg Ala Asn Pro Val Ala Thr Gly Ala Ser Leu Leu Glu Phe Glu Val Val Leu Asp Val Gly Gly Thr Gly Lys Thr Ala <210> 29 <211> 435 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(435) <400> 29 gtg gcg gac aag acg aca cag acg att tac atc gac gcg gat cca ggc 48 Val Ala Asp Lys Thr Thr Gln Thr Ile Tyr Ile Asp Ala Asp Pro Gly gag gtg atg aag gcg atc gcc gac atc gaa gcc tac ccg caa tgg att 96 Glu Val Met Lys Ala Ile Ala Asp Ile Glu Ala Tyr Pro Gln Trp Ile tcg gag tat aag gaa gtc gag atc cta gag gcc gac gac gag ggc tac 144 Ser Glu Tyr Lys Glu Val Glu Ile Leu Glu Ala Asp Asp Glu Gly Tyr ccg aaa cga gcg cga atg ttg atg gac gca gcc atc ttc aaa gac acc 192 Pro Lys Arg Ala Arg Met Leu Met Asp Ala Ala Ile Phe Lys Asp Thr ttg atc atg tcc tac gag tgg ccg gaa gac cgc caa tcg ctt agc tgg 240 Leu Ile Met Ser Tyr Glu Trp Pro Glu Asp Arg Gln Ser Leu Ser Trp act ctc gaa tcc agc tcg ctg cta aag tcc ctc gaa ggc acg tat cgc 288 Thr Leu Glu Ser Ser Ser Leu Leu Lys Ser Leu Glu Gly Thr Tyr Arg ttg gcg ccc aag ggt tct ggc act gag gtc acc tac gag ctt gcc gtc 336 Leu Ala Pro Lys Gly Ser Gly Thr Glu Val Thr Tyr Glu Leu Ala Val gac ctt gct gtc ccc atg atc ggg atg ctc aag cgt aag gcg gaa cgc 384 Asp Leu Ala Val Pro Met Ile Gly Met Leu Lys Arg Lys Ala Glu Arg agg ttg ata gac ggc gcg ttg aag gat ctg aag aaa cga gtc gag ggc 432 Arg Leu Ile Asp Gly Ala Leu Lys Asp Leu Lys Lys Arg Val Glu Gly tga 435 <210> 30 <211> 144 <212> PRT
<213> M.Tuberculosis <400> 30 Met Ala Asp Lys Thr Thr Gln Thr Ile Tyr Ile Asp Ala Asp Pro Gly Glu Val Met Lys Ala Ile Ala Asp Ile Glu Ala Tyr Pro Gln Trp Ile Ser Glu Tyr Lys Glu Val Glu Ile Leu Glu Ala Asp Asp Glu Gly Tyr Pro Lys Arg Ala Arg Met Leu Met Asp Ala Ala Ile Phe Lys Asp Thr Leu Ile Met Ser Tyr Glu Trp Pro Glu Asp Arg Gln Ser Leu Ser Trp Thr Leu Glu Ser Ser Ser Leu Leu Lys Ser Leu Glu Gly Thr Tyr Arg Leu Ala Pro Lys Gly Ser Gly Thr Glu Val Thr Tyr Glu Leu Ala Val Asp Leu Ala Val Pro Met Ile Gly Met Leu Lys Arg Lys Ala Glu Arg Arg Leu Ile Asp Gly Ala Leu Lys Asp Leu Lys Lys Arg Val Glu Gly <210> 31 <211> 441 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(438) <400> 31 atg cca gtt ttg agc aag acc gtc gag gtc acc gcc gac gcc gca tcg 48 Met Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser atc atg gcc atc gtt gcc gat atc gag cgc tac cca gag tgg aat gaa 96 Ile Met Ala Ile Val Ala Asp Ile Glu Arg Tyr Pro Glu Trp Asn Glu ggg gtc aag ggc gca tgg gtg ctc gct cgc tac gat gac ggg cgt ccc 144 Gly Val Lys Gly Ala Trp Val Leu Ala Arg Tyr Asp Asp Gly Arg Pro agc cag gtg cgg ctc gac acc gct gtt caa ggc atc gag ggc acc tat 192 Ser Gln Val Arg Leu Asp Thr Ala Val Gln Gly Ile Glu Gly Thr Tyr atc cac gcc gtg tac tac cca ggc gaa aac cag att caa acc gtc atg 240 Ile His Ala Val Tyr Tyr Pro Gly Glu Asn Gln Ile Gln Thr Val Met cag cag ggt gaa ctg ttt gcc aag cag gag cag ctg ttc agt gtg gtg 288 Gln Gln Gly Glu Leu Phe Ala Lys Gln Glu Gln Leu Phe Ser Val Val gca acc ggc gcc gcg agc ttg ctc acg gtg gac atg gac gtc cag gtc 336 Ala Thr Gly Ala Ala Ser Leu Leu Thr Val Asp Met Asp Val Gln Val acc atg ccg gtg ccc gag ccg atg gtg aag atg ctg ctc aac aac gtc 384 Thr Met Pro Val Pro Glu Pro Met Val Lys Met Leu Leu Asn Asn Val ctg gag cat ctc gcc gaa aat ctc aag cag cgc gcc gag cag ctg gcg 432 Leu Glu His Leu Ala Glu Asn Leu Lys Gln Arg Ala Glu Gln Leu Ala gcc agc taa 441 Ala Ser <210> 32 <211> 146 <212> PRT
<213> M.Tuberculosis <400> 32 Met Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser Ile Met Ala Ile Val Ala Asp Ile Glu Arg Tyr Pro Glu Trp Asn Glu Gly Val Lys Gly Ala Trp Val Leu Ala Arg Tyr Asp Asp Gly Arg Pro Ser Gln Val Arg Leu Asp Thr Ala Val Gln Gly Ile Glu Gly Thr Tyr Ile His Ala Val Tyr Tyr Pro Gly Glu Asn Gln Ile Gln Thr Val Met Gln Gln Gly Glu Leu Phe Ala Lys Gln Glu Gln Leu Phe Ser Val Val Ala Thr Gly Ala Ala Ser Leu Leu Thr Val Asp Met Asp Val Gln Val Thr Met Pro Val Pro Glu Pro Met Val Lys Met Leu Leu Asn Asn Val Leu Glu His Leu Ala Glu Asn Leu Lys Gln Arg Ala Glu Gln Leu Ala Ala Ser <210> 33 <211> 894 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(891) <400> 33 atg tca tcg ggc aat tca tct ctg gga att atc gtc ggg atc gac gat 48 Met Ser Ser Gly Asn Ser Ser Leu Gly Ile Ile Val Gly Ile Asp Asp tca ccg gcc gca cag gtt gcg gtg cgg tgg gca gct cgg gat gcg gag 96 Ser Pro Ala Ala Gln Val Ala Val Arg Trp Ala Ala Arg Asp Ala Glu ttg cga aaa atc cct ctg acg ctc gtg cac gcg gtg tcg ccg gaa gta 144 Leu Arg Lys Ile Pro Leu Thr Leu Val His Ala Val Ser Pro Glu Val gcc acc tgg ctg gag gtg cca ctg ccg ccg ggc gtg ctg cga tgg cag 192 Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gln cag gat cac ggg cgc cac ctg atc gac gac gca ctc aag gtg gtt gaa 240 Gln Asp His Gly Arg His Leu Ile Asp Asp Ala Leu Lys Val Val Glu cag gct tcg ctg cgc gct ggt ccc ccc acg gtc cac agt gaa atc gtt 288 Gln Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu Ile Val ccg gcg gca gcc gtt ccc aca ttg gtc gac atg tcc aaa gac gca gtg 336 Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val ctg atg gtc gtg ggt tgt ctc gga agt ggg cgg tgg ccg ggc cgg ctg 384 Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu ctc ggt tcg gtc agt tcc ggc ctg ctc cgc cac gcg cac tgt ccg gtc 432 Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val gtg atc atc cac gac gaa gat tcg gtg atg ccg cat ccc cag caa gcg 480 Val Ile Ile His Asp Glu Asp Ser Val Met Pro His Pro Gln Gln Ala ccg gtg cta gtt ggc gtt gac ggc tcg tcg gcc tcc gag ctg gcg acc 528 Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr gca atc gca ttc gac gaa gcg tcg cgg cga aac gtg gac ctg gtg gcg 576 Ala Ile Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala ctg cac gca tgg agc gac gtc gat gtg tcg gag tgg ccc gga atc gat 624 Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly Ile Asp tgg ccg gca act cag tcg atg gcc gag cag gtg ctg gcc gag cgg ttg 672 Trp Pro Ala Thr Gln Ser Met Ala Glu Gln Val Leu Ala Glu Arg Leu gcg ggt tgg cag gag cgg tat ccc aac gta gcc ata acc cgc gtg gtg 720 Ala Gly Trp Gln Glu Arg Tyr Pro Asn Val Ala Ile Thr Arg Val Val gtg cgc gat cag ccg gcc cgc cag ctc gtc caa cgc tcc gag gaa gcc 768 Val Arg Asp Gln Pro Ala Arg Gln Leu Val Gln Arg Ser Glu Glu Ala cag ctg gtc gtg gtc ggc agc cgg ggc cgc ggc ggc tac gcc gga atg 816 Gln Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met ctg gtg ggg tcg gta ggc gaa acc gtt gct cag ctg gcg cgg acg ccg 864 Leu Val Gly Ser Val Gly Glu Thr Val Ala Gln Leu Ala Arg Thr Pro gtc atc gtg gca cgc gag tcg ctg act tag 894 Val Ile Val Ala Arg Glu Ser Leu Thr <210> 34 <211> 297 <212> PRT
<213> M.Tuberculosis <400> 34 Met Ser Ser Gly Asn Ser Ser Leu Gly Ile Ile Val Gly Ile Asp Asp Ser Pro Ala Ala Gln Val Ala Val Arg Trp Ala Ala Arg Asp Ala Glu Leu Arg Lys Ile Pro Leu Thr Leu Val His Ala Val Ser Pro Glu Val Ala Thr Trp Leu Glu Val Pro Leu Pro Pro Gly Val Leu Arg Trp Gln Gln Asp His Gly Arg His Leu Ile Asp Asp Ala Leu Lys Val Val Glu Gln Ala Ser Leu Arg Ala Gly Pro Pro Thr Val His Ser Glu Ile Val Pro Ala Ala Ala Val Pro Thr Leu Val Asp Met Ser Lys Asp Ala Val Leu Met Val Val Gly Cys Leu Gly Ser Gly Arg Trp Pro Gly Arg Leu Leu Gly Ser Val Ser Ser Gly Leu Leu Arg His Ala His Cys Pro Val Val Ile Ile His Asp Glu Asp Ser Val Met Pro His Pro Gln Gln Ala Pro Val Leu Val Gly Val Asp Gly Ser Ser Ala Ser Glu Leu Ala Thr Ala Ile Ala Phe Asp Glu Ala Ser Arg Arg Asn Val Asp Leu Val Ala Leu His Ala Trp Ser Asp Val Asp Val Ser Glu Trp Pro Gly Ile Asp Trp Pro Ala Thr Gln Ser Met Ala Glu Gln Val Leu Ala Glu Arg Leu Ala Gly Trp Gln Glu Arg Tyr Pro Asn Val Ala Ile Thr Arg Val Val Val Arg Asp Gln Pro Ala Arg Gln Leu Val Gln Arg Ser Glu Glu Ala Gln Leu Val Val Val Gly Ser Arg Gly Arg Gly Gly Tyr Ala Gly Met Leu Val Gly Ser Val Gly Glu Thr Val Ala Gln Leu Ala Arg Thr Pro Val Ile Val Ala Arg Glu Ser Leu Thr <210> 35 <211> 957 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(954) <400> 35 atg gct gaa gta ctg gtg ctc gtt gag cac gct gaa ggc gcg tta aag 48 Met Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys aag gtc agc gcc gaa ttg atc acc gcc gcc cgc gcc ttg ggc gaa cca 96 Lys Val Ser Ala Glu Leu Ile Thr Ala Ala Arg Ala Leu Gly Glu Pro gcc gcc gtc gtc gtc ggt gtg ccg ggg acg gcc gcg ccg ctg gtg gac 144 Ala Ala Val Val Val Gly Val Pro Gly Thr Ala Ala Pro Leu Val Asp ggg ctt aag gcg gct ggt gcc gcc aag atc tac gtc gcc gag tcc gac 192 Gly Leu Lys Ala Ala Gly Ala Ala Lys Ile Tyr Val Ala Glu Ser Asp ctt gtc gac aaa tac ctg atc acc ccg gcg gtc gac gtg ctg gcc ggg 240 Leu Val Asp Lys Tyr Leu Ile Thr Pro Ala Val Asp Val Leu Ala Gly ctg gcc gag tcc tcg gcc cct gcc ggc gta cta atc gcc gcc acc gcg 288 Leu Ala Glu Ser Ser Ala Pro Ala Gly Val Leu Ile Ala Ala Thr Ala gac ggc aag gag atc gcc ggc cga ctt gcg gct cgg atc ggc tcg ggt 336 Asp Gly Lys Glu Ile Ala Gly Arg Leu Ala Ala Arg Ile Gly Ser Gly ctg ctg gtc gac gtg gtc gac gtg aga gaa ggt gga gtg ggt gtc cac 384 Leu Leu Val Asp Val Val Asp Val Arg Glu Gly Gly Val Gly Val His agc atc ttc ggt ggg gcg ttc acc gtc gaa gcg cag gcc aac ggc gac 432 Ser Ile Phe Gly Gly Ala Phe Thr Val Glu Ala Gln Ala Asn Gly Asp acc ccg gtg atc acc gtg cgc gca gga gcc gtg gag gcg gag ccg gcc 480 Thr Pro Val Ile Thr Val Arg Ala Gly Ala Val Glu Ala Glu Pro Ala gcc ggc gcc ggt gag cag gtc agc gtg gaa gtg ccg gct gcg gcg gag 528 Ala Gly Ala Gly Glu Gln Val Ser Val Glu Val Pro Ala Ala Ala Glu aac gcc gcc agg atc acc gcg cgc gaa ccg gcg gtc gcc ggc gac cgg 576 Asn Ala Ala Arg Ile Thr Ala Arg Glu Pro Ala Val Ala Gly Asp Arg ccg gag ctg acc gag gcg acc att gtg gtg gcc ggt ggc cgt ggt gtc 624 Pro Glu Leu Thr Glu Ala Thr Ile Val Val Ala Gly Gly Arg Gly Val ggc agc gcg gag aac ttc agc gtg gtc gag gcg ctg gcc gac tcg ctg 672 Gly Ser Ala Glu Asn Phe Ser Val Val Glu Ala Leu Ala Asp Ser Leu ggc gcc gcg gtc ggg gcc tcg cgt gcc gca gtc gac tcc ggc tac tac 720 Gly Ala Ala Val Gly Ala Ser Arg Ala Ala Val Asp Ser Gly Tyr Tyr ccg ggc cag ttc cag gtc ggc cag acc ggc aag acg gtg tcg ccc cag 768 Pro Gly Gln Phe Gln Val Gly Gln Thr Gly Lys Thr Val Ser Pro Gln ctc tac att gcc ctg ggc atc tcc ggg gcg atc cag cac cgc gct ggc 816 Leu Tyr Ile Ala Leu Gly Ile Ser Gly Ala Ile Gln His Arg Ala Gly atg cag acg tcc aag acc atc gtc gcg gtc aac aag gac gaa gag gcg 864 Met Gln Thr Ser Lys Thr Ile Val Ala Val Asn Lys Asp Glu Glu Ala ccg atc ttt gag atc gcc gac tac ggg gtg gtg gga gac ctg ttc aag 912 Pro Ile Phe Glu Ile Ala Asp Tyr Gly Val Val Gly Asp Leu Phe Lys gtc gct ccg cag ctg acc gag gcc atc aag gcc cgc aag ggc 954 Val Ala Pro Gln Leu Thr Glu Ala Ile Lys Ala Arg Lys Gly tag 957 <210> 36 <211> 318 <212> PRT
<213> M.Tuberculosis <400> 36 Met Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys Lys Val Ser Ala Glu Leu Ile Thr Ala Ala Arg Ala Leu Gly Glu Pro Ala Ala Val Val Val Gly Val Pro Gly Thr Ala Ala Pro Leu Val Asp Gly Leu Lys Ala Ala Gly Ala Ala Lys Ile Tyr Val Ala Glu Ser Asp Leu Val Asp Lys Tyr Leu Ile Thr Pro Ala Val Asp Val Leu Ala Gly Leu Ala Glu Ser Ser Ala Pro Ala Gly Val Leu Ile Ala Ala Thr Ala Asp Gly Lys Glu Ile Ala Gly Arg Leu Ala Ala Arg Ile Gly Ser Gly Leu Leu Val Asp Val Val Asp Val Arg Glu Gly Gly Val Gly Val His Ser Ile Phe Gly Gly Ala Phe Thr Val Glu Ala Gln Ala Asn Gly Asp Thr Pro Val Ile Thr Val Arg Ala Gly Ala Val Glu Ala Glu Pro Ala Ala Gly Ala Gly Glu Gln Val Ser Val Glu Val Pro Ala Ala Ala Glu Asn Ala Ala Arg Ile Thr Ala Arg Glu Pro Ala Val Ala Gly Asp Arg Pro Glu Leu Thr Glu Ala Thr Ile Val Val Ala Gly Gly Arg Gly Val Gly Ser Ala Glu Asn Phe Ser Val Val Glu Ala Leu Ala Asp Ser Leu Gly Ala Ala Val Gly Ala Ser Arg Ala Ala Val Asp Ser Gly Tyr Tyr Pro Gly Gln Phe Gln Val Gly Gln Thr Gly Lys Thr Val Ser Pro Gln Leu Tyr Ile Ala Leu Gly Ile Ser Gly Ala Ile Gln His Arg Ala Gly Met Gln Thr Ser Lys Thr Ile Val Ala Val Asn Lys Asp Glu Glu Ala Pro Ile Phe Glu Ile Ala Asp Tyr Gly Val Val Gly Asp Leu Phe Lys Val Ala Pro Gln Leu Thr Glu Ala Ile Lys Ala Arg Lys Gly <210> 37 <211> 1401 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1398) <400> 37 gtg aag agc acc gtc gag cag ttg agc ccc acc cgg gtt cgt atc aac 48 Val Lys Ser Thr Val Glu Gln Leu Ser Pro Thr Arg Val Arg Ile Asn gtg gag gtg cca ttc gcc gag ctt gag ccg gat ttc cag cgg gcc tac 96 Val Glu Val Pro Phe Ala Glu Leu Glu Pro Asp Phe Gln Arg Ala Tyr aaa gag ctg gcc aaa cag gtg cgg ctg ccc ggc ttc cgg ccc ggg aag 144 Lys Glu Leu Ala Lys Gln Val Arg Leu Pro Gly Phe Arg Pro Gly Lys gcg ccg gcc aaa cta ctc gaa gcc cgc atc ggc cgg gag gcc atg ctg 192 Ala Pro Ala Lys Leu Leu Glu Ala Arg Ile Gly Arg Glu Ala Met Leu gat caa atc gtc aac gat gcg ctg ccc agc cgg tac gga cag gcg gtg 240 Asp Gln Ile Val Asn Asp Ala Leu Pro Ser Arg Tyr Gly Gln Ala Val gcc gag tcg gat gtc caa ccg ctc ggc cgg ccc aac atc gag gtg acc 288 Ala Glu Ser Asp Val Gln Pro Leu Gly Arg Pro Asn Ile Glu Val Thr aag aag gag tac ggc cag gac ctg caa ttc acc gcc gag gtc gac atc 336 Lys Lys Glu Tyr Gly Gln Asp Leu Gln Phe Thr Ala Glu Val Asp Ile cgc ccg aag atc agt ccc ccg gac ctg agc gcg ctg acg gtc tcg gtg 384 Arg Pro Lys Ile Ser Pro Pro Asp Leu Ser Ala Leu Thr Val Ser Val gat ccg atc gaa atc ggt gag gac gac gtc gac gcc gaa ctg cag tcg 432 Asp Pro Ile Glu Ile Gly Glu Asp Asp Val Asp Ala Glu Leu Gln Ser tta cgt acc cgg ttc ggc acc ctg acc gcg gtg gac cgg ccg gtg gcc 480 Leu Arg Thr Arg Phe Gly Thr Leu Thr Ala Val Asp Arg Pro Val Ala gtc ggc gac gtc gtc tcg atc gac ttg tct gcc acg gtc gac gga gag 528 Val Gly Asp Val Val Ser Ile Asp Leu Ser Ala Thr Val Asp Gly Glu gac ata ccg aac gca gcc gct gag gga ctc tcc cac gag gtc ggc tcc 576 Asp Ile Pro Asn Ala Ala Ala Glu Gly Leu Ser His Glu Val Gly Ser ggc cgg ctc atc gca ggt ctc gac gac gcg gtt gtt ggt ctg tcc gcc 624 Gly Arg Leu Ile Ala Gly Leu Asp Asp Ala Val Val Gly Leu Ser Ala gac gag tcc cgg gtc ttc acc gcc aag ctg gca gcc ggc gag cac gcc 672 Asp Glu Ser Arg Val Phe Thr Ala Lys Leu Ala Ala Gly Glu His Ala ggg cag gaa gct cag gtt acc gtc acg gtc agg tcg gtt aag gag cgc 720 Gly Gln Glu Ala Gln Val Thr Val Thr Val Arg Ser Val Lys Glu Arg gaa cta cca gag ccc gac gac gaa ttc gcg cag tta gcc agc gag ttc 768 Glu Leu Pro Glu Pro Asp Asp Glu Phe Ala Gln Leu Ala Ser Glu Phe gac agc atc gac gaa ttg cgg gcc agc ctc agc gac cag gtg cgc cag 816 Asp Ser Ile Asp Glu Leu Arg Ala Ser Leu Ser Asp Gln Val Arg Gln gcc aag cgc gcc cag cag gcc gag cag att cga aac gcc acc atc gat 864 Ala Lys Arg Ala Gln Gln Ala Glu Gln Ile Arg Asn Ala Thr Ile Asp gcg cta ctc gaa cag gtc gac gtg ccg ttg ccg gag tcg tat gtg cag 912 Ala Leu Leu Glu Gln Val Asp Val Pro Leu Pro Glu Ser Tyr Val Gln gcc caa ttc gac agc gtg ctg cac agc gcg ctc agc ggt ctt aat cac 960 Ala Gln Phe Asp Ser Val Leu His Ser Ala Leu Ser Gly Leu Asn His gac gaa gcc cgg ttc aat gag ttg ctc gtc gag caa ggc tcg tca cgc 1008 Asp Glu Ala Arg Phe Asn Glu Leu Leu Val Glu Gln Gly Ser Ser Arg gcg gcg ttc gat gcc gag gcg cgc acc gcc tca gaa aag gac gtc aag 1056 Ala Ala Phe Asp Ala Glu Ala Arg Thr Ala Ser Glu Lys Asp Val Lys agg cag ctg ttg cta gac gcc ctg gcc gat gag ctg cag gtc caa gtt 1104 Arg Gln Leu Leu Leu Asp Ala Leu Ala Asp Glu Leu Gln Val Gln Val ggc cag gat gat ctg acc gaa cga ctg gtg acg acg tct cgg caa tac 1152 Gly Gln Asp Asp Leu Thr Glu Arg Leu Val Thr Thr Ser Arg Gln Tyr ggc atc gag ccg cag cag ctg ttc ggc tac ctc caa gag cgc aac cag 1200 Gly Ile Glu Pro Gln Gln Leu Phe Gly Tyr Leu Gln Glu Arg Asn Gln ctg ccg acc atg ttc gct gac gtg cgg cgc gag ctg gcg atc agg gcc 1248 Leu Pro Thr Met Phe Ala Asp Val Arg Arg Glu Leu Ala Ile Arg Ala gca gtg gag gcg gcg acg gtc acc gac agt gac gga aac acg atc gat 1296 Ala Val Glu Ala Ala Thr Val Thr Asp Ser Asp Gly Asn Thr Ile Asp acc agt gag ttc ttc ggc aag cgt gtg tcg gcc ggt gag gct gag gag 1344 Thr Ser Glu Phe Phe Gly Lys Arg Val Ser Ala Gly Glu Ala Glu Glu gcc gaa ccg gca gac gag ggt gcc gcg cgg gcg gcg tcc gac gaa gcg 1392 Ala Glu Pro Ala Asp Glu Gly Ala Ala Arg Ala Ala Ser Asp Glu Ala aca acg tga 1401 Thr Thr <210> 38 <211> 466 <212> PRT
<213> M.Tuberculosis <400> 38 Met Lys Ser Thr Val Glu Gln Leu Ser Pro Thr Arg Val Arg Ile Asn Val Glu Val Pro Phe Ala Glu Leu Glu Pro Asp Phe Gln Arg Ala Tyr Lys Glu Leu Ala Lys Gln Val Arg Leu Pro Gly Phe Arg Pro Gly Lys Ala Pro Ala Lys Leu Leu Glu Ala Arg Ile Gly Arg Glu Ala Met Leu Asp Gln Ile Val Asn Asp Ala Leu Pro Ser Arg Tyr Gly Gln Ala Val Ala Glu Ser Asp Val Gln Pro Leu Gly Arg Pro Asn Ile Glu Val Thr Lys Lys Glu Tyr Gly Gln Asp Leu Gln Phe Thr Ala Glu Val Asp Ile Arg Pro Lys Ile Ser Pro Pro Asp Leu Ser Ala Leu Thr Val Ser Val Asp Pro Ile Glu Ile Gly Glu Asp Asp Val Asp Ala Glu Leu Gln Ser Leu Arg Thr Arg Phe Gly Thr Leu Thr Ala Val Asp Arg Pro Val Ala Val Gly Asp Val Val Ser Ile Asp Leu Ser Ala Thr Val Asp Gly Glu Asp Ile Pro Asn Ala Ala Ala Glu Gly Leu Ser His Glu Val Gly Ser Gly Arg Leu Ile Ala Gly Leu Asp Asp Ala Val Val Gly Leu Ser Ala Asp Glu Ser Arg Val Phe Thr Ala Lys Leu Ala Ala Gly Glu His Ala Gly Gln Glu Ala Gln Val Thr Val Thr Val Arg Ser Val Lys Glu Arg Glu Leu Pro Glu Pro Asp Asp Glu Phe Ala Gln Leu Ala Ser Glu Phe Asp Ser Ile Asp Glu Leu Arg Ala Ser Leu Ser Asp Gln Val Arg Gln Ala Lys Arg Ala Gln Gln Ala Glu Gln Ile Arg Asn Ala Thr Ile Asp Ala Leu Leu Glu Gln Val Asp Val Pro Leu Pro Glu Ser Tyr Val Gln Ala Gln Phe Asp Ser Val Leu His Ser Ala Leu Ser Gly Leu Asn His Asp Glu Ala Arg Phe Asn Glu Leu Leu Val Glu Gln Gly Ser Ser Arg Ala Ala Phe Asp Ala Glu Ala Arg Thr Ala Ser Glu Lys Asp Val Lys Arg Gln Leu Leu Leu Asp Ala Leu Ala Asp Glu Leu Gln Val Gln Val Gly Gln Asp Asp Leu Thr Glu Arg Leu Val Thr Thr Ser Arg Gln Tyr Gly Ile Glu Pro Gln Gln Leu Phe Gly Tyr Leu Gln Glu Arg Asn Gln Leu Pro Thr Met Phe Ala Asp Val Arg Arg Glu Leu Ala Ile Arg Ala Ala Val Glu Ala Ala Thr Val Thr Asp Ser Asp Gly Asn Thr Ile Asp Thr Ser Glu Phe Phe Gly Lys Arg Val Ser Ala Gly Glu Ala Glu Glu Ala Glu Pro Ala Asp Glu Gly Ala Ala Arg Ala Ala Ser Asp Glu Ala Thr Thr <210> 39 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 39 Thr Glu Arg Thr Ala Val Leu Ile Lys Pro Asp Gly Ile Glu Arg <210> 40 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 40 Thr Asp Thr Gln Val Thr Trp Leu Thr Gln Glu Ser His Asp Arg <210> 41 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 41 Met Ile Asp Glu Ala Leu Phe Asp Ala Glu Glu Lys Met Glu Lys <210> 42 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 42 Pro Leu Pro Ala Asp Pro Ser Thr Asp Leu Ser Ala Tyr Ala Gln <210> 43 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 43 Met Leu Ile Ser Gln Arg Pro Thr Leu Ser Glu Asp Val Leu Thr <210> 44 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 44 Thr Gly Asn Leu Val Thr Lys Asn Ser Leu Thr Pro Asp Val Arg <210> 45 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 45 Met Glu Val Lys Ile Gly Ile Thr Asp Ser Pro Arg Glu Leu Val <210> 46 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 46 Ser Ala Tyr Lys Thr Val Val Val Gly Thr Asp Asp Xaa Ser Xaa <210> 47 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 47 Met Glu Gln Arg Ala Glu Leu Val Val Gly Arg Ala Leu Val Val <210> 48 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 48 Ala Asp Ile Asp Gly Val Thr Gly Ser Ala Gly Leu Asn Pro Ala <210> 49 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 49 Thr Tyr Glu Thr Ile Leu Val Glu Arg Asp Gln Arg Val Gly Ile <210> 50 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 50 Pro Val Thr Gln Glu Glu Ile Ile Ala Gly Ile Ala Glu Ile Ile <210> 51 <211> 14 <212> PRT
<213> M.Tuberculosis <400> 51 Pro Val Val Lys Ile Asn Ala Ile Glu Val Pro Ala Gly Ala <210> 52 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 52 Ala Asp Lys Thr Thr Gln Thr Ile Tyr Ile Asp Ala Asp Pro Gly <210> 53 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 53 Pro Val Leu Ser Lys Thr Val Glu Val Thr Ala Asp Ala Ala Ser <210> 54 <211> 14 <212> PRT
<213> M.Tuberculosis <400> 54 Ser Gly Asn Ser Ser Leu Gly Ile Ile Val Gly Ile Asp Asp <210> 55 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 55 Ala Glu Val Leu Val Leu Val Glu His Ala Glu Gly Ala Leu Lys <210> 56 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 56 Met Lys Ser Thr Val Glu Gln Leu Ser Pro Thr Arg Val Arg Ile <210> 57 <211> 11 <212> PRT
<213> M.Tuberculosis <400> 57 Val Ile Arg Arg Lys Pro Lys Pro Arg Xaa Arg <210> 58 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 58 ctgagatctg tggaggtcaa gatcggt 27 <210> 59 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 59 ctcccatggc tacttacccg ctcgtagcaa c 31 <210> 60 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 60 ctgagatctc ctgtcactca ggaagaa 27 <210> 61 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 61 ctcccatggg aaaccgccat tagcggt 27 <210> 62 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 62 cccaagctta tggaacagcg tgcggag 27 <210> 63 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 63 ctcccatggc gacactcgat ccggatt 27 <210> 64 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 64 ctgagatcta tgccagtggt gaagatc 27 <210> 65 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 65 ctcccatggt tatgcagtct tgccggt 27 <210> 66 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 66 ctgagatctg cggacaagac gacacag 27 <210> 67 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 67 ctcccatggt accggaatca ctcagcc 27 <210> 68 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 68 ctgagatctc cagttttgag caagacc 27 <210> 69 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 69 ctcccatggg cacatgcctt agctggc 27 <210> 70 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 70 ctgagatcta tgtcatcggg caattca 27 <210> 71 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 71 ctcccatggc tacctaagtc agcgactcgc g 31 <210> 72 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 72 ctgagatctg tgaagagcac cgtcgag 27 <210> 73 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 73 ctcccatggg tcatacggtc acgttgt 27 <210> 74 <211> 348 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(348) <400> 74 atg gca ctc aag gta gag atg gtc act ttc gac tgc agc gac cct gcg 48 Met Ala Leu Lys Val Glu Met Val Thr Phe Asp Cys Ser Asp Pro Ala aag ctt gcc ggc tgg tgg gcc gag cag ttc gat ggc acg acg cgt gaa 96 Lys Leu Ala Gly Trp Trp Ala Glu Gln Phe Asp Gly Thr Thr Arg Glu ctg ctg ccc ggc gaa ttc gtc gtg gtc gcc cgg acc gat gga ccg cgg 144 Leu Leu Pro Gly Glu Phe Val Val Val Ala Arg Thr Asp Gly Pro Arg ttg gga ttc cag aag gtg ccc gat ccc gcc cct ggg aaa aac cgc gtg 192 Leu Gly Phe Gln Lys Val Pro Asp Pro Ala Pro Gly Lys Asn Arg Val cac ctc gac ttc acg acc aag gac ctg gat gcc gag gtg ttg cgc ctg 240 His Leu Asp Phe Thr Thr Lys Asp Leu Asp Ala Glu Val Leu Arg Leu gtc gcc gcc gga gcc agt gag gtc ggg cgg cat cag gtc ggc gag agc 288 Val Ala Ala Gly Ala Ser Glu Val Gly Arg His Gln Val Gly Glu Ser ttt cgc tgg gtg gtg ctg gct gac ccc gaa ggc aac gct ttt tgc gtg 336 Phe Arg Trp Val Val Leu Ala Asp Pro Glu Gly Asn Ala Phe Cys Val gcg ggt caa taa 348 Ala Gly Gln <210> 75 <211> 115 <212> PRT
<213> M.Tuberculosis <400> 75 Met Ala Leu Lys Val Glu Met Val Thr Phe Asp Cys Ser Asp Pro Ala Lys Leu Ala Gly Trp Trp Ala Glu Gln Phe Asp Gly Thr Thr Arg Glu Leu Leu Pro Gly Glu Phe Val Val Val Ala Arg Thr Asp Gly Pro Arg Leu Gly Phe Gln Lys Val Pro Asp Pro Ala Pro Gly Lys Asn Arg Val His Leu Asp Phe Thr Thr Lys Asp Leu Asp Ala Glu Val Leu Arg Leu Val Ala Ala Gly Ala Ser Glu Val Gly Arg His Gln Val Gly Glu Ser Phe Arg Trp Val Val Leu Ala Asp Pro Glu Gly Asn Ala Phe Cys Val Ala Gly Gln <210> 76 <211> 564 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(564) <400> 76 atg gcc gac gct gac acc acc gac ttc gac gtc gac gca gaa gca ccg 48 Met Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro ggt gga ggc gtc cgg gag gac acg gcg acg gat gct gac gag gcc gac 96 Gly Gly Gly Val Arg Glu Asp Thr Ala Thr Asp Ala Asp Glu Ala Asp gat caa gaa gag aga ttg gtc gcc gag ggc gag att gca ggc gac tac 144 Asp Gln Glu Glu Arg Leu Val Ala Glu Gly Glu Ile Ala Gly Asp Tyr ctg gaa gag tta ttg gac gtg ttg gac ttc gat ggc gac atc gac ctc 192 Leu Glu Glu Leu Leu Asp Val Leu Asp Phe Asp Gly Asp Ile Asp Leu gat gtc gaa ggc aat cgt gcg gtg gtg agc atc gac ggc agt gac gac 240 Asp Val Glu Gly Asn Arg Ala Val Val Ser Ile Asp Gly Ser Asp Asp ctg aac aag ttg gtc ggg cgc ggg ggc gag gtg ctc gac gct ctg cag 288 Leu Asn Lys Leu Val Gly Arg Gly Gly Glu Val Leu Asp Ala Leu Gln gaa ctc acc cgg ttg gcg gtg cat cag aag acc ggt gtg cgg agc cgg 336 Glu Leu Thr Arg Leu Ala Val His Gln Lys Thr Gly Val Arg Ser Arg ttg atg cta gac atc gcg agg tgg cga cgg cgg cgc cgg gag gaa ttg 384 Leu Met Leu Asp Ile Ala Arg Trp Arg Arg Arg Arg Arg Glu Glu Leu gcg gcg ctg gcc gac gag gtg gcg cgg cga gtg gcc gaa acc ggt gac 432 Ala Ala Leu Ala Asp Glu Val Ala Arg Arg Val Ala Glu Thr Gly Asp cgc gag gaa ctc gtt cca atg acg ccg ttc gaa cgg aag atc gtc cac 480 Arg Glu Glu Leu Val Pro Met Thr Pro Phe Glu Arg Lys Ile Val His gat gcg gtt gca gcg gtg cca ggt gtg cac agc gaa agc gaa ggc gtg 528 Asp Ala Val Ala Ala Val Pro Gly Val His Ser Glu Ser Glu Gly Val gag cca gaa cgc cga gtc gtt gtg ctc cgc gac tag 564 Glu Pro Glu Arg Arg Val Val Val Leu Arg Asp <210> 77 <211> 187 <212> PRT
<213> M.Tuberculosis <400> 77 Met Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro Gly Gly Gly Val Arg Glu Asp Thr Ala Thr Asp Ala Asp Glu Ala Asp Asp Gln Glu Glu Arg Leu Val Ala Glu Gly Glu Ile Ala Gly Asp Tyr Leu Glu Glu Leu Leu Asp Val Leu Asp Phe Asp Gly Asp Ile Asp Leu Asp Val Glu Gly Asn Arg Ala Val Val Ser Ile Asp Gly Ser Asp Asp Leu Asn Lys Leu Val Gly Arg Gly Gly Glu Val Leu Asp Ala Leu Gln Glu Leu Thr Arg Leu Ala Val His Gln Lys Thr Gly Val Arg Ser Arg Leu Met Leu Asp Ile Ala Arg Trp Arg Arg Arg Arg Arg Glu Glu Leu Ala Ala Leu Ala Asp Glu Val Ala Arg Arg Val Ala Glu Thr Gly Asp Arg Glu Glu Leu Val Pro Met Thr Pro Phe Glu Arg Lys Ile Val His Asp Ala Val Ala Ala Val Pro Gly Val His Ser Glu Ser Glu Gly Val Glu Pro Glu Arg Arg Val Val Val Leu Arg Asp <210> 78 <211> 1167 <212> DNA
<213> M.Tuberculosis <220>
<221> CDS
<222> (1)...(1167) <400> 78 atg agc aag acg gtt ctc atc ctt ggc gcg ggt gtc ggc ggc ctg acc 48 Met Ser Lys Thr Val Leu Ile Leu Gly Ala Gly Val Gly Gly Leu Thr acc gcc gac acc ctc cgt caa ctg cta cca cct gag gat cga atc ata 96 Thr Ala Asp Thr Leu Arg Gln Leu Leu Pro Pro Glu Asp Arg Ile Ile ttg gtg gac agg agc ttt gac ggg acg ctg ggc ttg tcg ttg cta tgg 144 Leu Val Asp Arg Ser Phe Asp Gly Thr Leu Gly Leu Ser Leu Leu Trp gtg ttg cgg ggc tgg cgg cgg cct gac gac gtc cgc gtc cgc ccc acc 192 Val Leu Arg Gly Trp Arg Arg Pro Asp Asp Val Arg Val Arg Pro Thr gcg gcg tcg ctg ccc ggt gtg gaa atg gtt act gca acc gtc gcc cac 240 Ala Ala Ser Leu Pro Gly Val Glu Met Val Thr Ala Thr Val Ala His att gac atc gcg gcc cag gta gtg cac acc gac aac agc gtc atc ggc 288 Ile Asp Ile Ala Ala Gln Val Val His Thr Asp Asn Ser Val Ile Gly tat gac gcg ttg gtg atc gca tta ggt gcg gcg ctg aac acc gac gcc 336 Tyr Asp Ala Leu Val Ile Ala Leu Gly Ala Ala Leu Asn Thr Asp Ala gtt ccc gga ctg tcg gac gcg ctc gac gcc gac gtc gcg ggc cag ttc 384 Val Pro Gly Leu Ser Asp Ala Leu Asp Ala Asp Val Ala Gly Gln Phe tac acc ctg gac ggc gcg gct gag ctg cgt gcg aag gtc gag gcg ctc 432 Tyr Thr Leu Asp Gly Ala Ala Glu Leu Arg Ala Lys Val Glu Ala Leu gag cat ggc cgg atc gct gtg gct atc gcc ggg gtg ccg ttc aaa tgc 480 Glu His Gly Arg Ile Ala Val Ala Ile Ala Gly Val Pro Phe Lys Cys cca gcc gca ccg ttc gaa gcg gcg ttt ctg atc gcc gcc caa ctc ggt 528 Pro Ala Ala Pro Phe Glu Ala Ala Phe Leu Ile Ala Ala Gln Leu Gly gac cgc tac gcc acc gga acc gta cag atc gac acg ttc acg cct gac 576 Asp Arg Tyr Ala Thr Gly Thr Val Gln Ile Asp Thr Phe Thr Pro Asp ccg ctg ccg atg ccc gtt gca ggt ccc gag gtc ggc gag gct ttg gtc 624 Pro Leu Pro Met Pro Val Ala Gly Pro Glu Val Gly Glu Ala Leu Val tcg atg ctc aag gat cac ggt gtc ggc ttc cat cct cgc aag gcc cta 672 Ser Met Leu Lys Asp His Gly Val Gly Phe His Pro Arg Lys Ala Leu gct cgc gtc gat gag gcc gca agg acg atg cac ttc ggt gac ggc acg 720 Ala Arg Val Asp Glu Ala Ala Arg Thr Met His Phe Gly Asp Gly Thr tcc gaa ccg ttc gat ctg ctt gcc gtg gtc ccc ccg cac gtg ccc tcc 768 Ser Glu Pro Phe Asp Leu Leu Ala Val Val Pro Pro His Val Pro Ser
gcc gcg gcg cgg tca gcg ggt ctc agc gaa tcc ggg tgg ata ccc gtg 816 Ala Ala Ala Arg Ser Ala Gly Leu Ser Glu Ser Gly Trp Ile Pro Val gac ccg cgc acc ctg tcc act agc gcc gac aac gtg tgg gcc atc ggc 864 Asp Pro Arg Thr Leu Ser Thr Ser Ala Asp Asn Val Trp Ala Ile Gly gat gcg acc gtg ctg acg ctg ccg aat ggc aaa ccg ctg ccc aag gct 912 Asp Ala Thr Val Leu Thr Leu Pro Asn Gly Lys Pro Leu Pro Lys Ala gcc gtg ttc gcc gaa gcc cag gcc gca gtt gtc gcc cac ggc gtc gcc 960 Ala Val Phe Ala Glu Ala Gln Ala Ala Val Val Ala His Gly Val Ala cgc cat ctc ggt tac gac gta gct gag cgc cac ttc acc ggc acg ggc 1008 Arg His Leu Gly Tyr Asp Val Ala Glu Arg His Phe Thr Gly Thr Gly gcc tgc tac gtc gag acc ggt gat cac cag gca gcc aag ggc gac ggc 1056 Ala Cys Tyr Val Glu Thr Gly Asp His Gln Ala Ala Lys Gly Asp Gly gat ttc ttc gct ccg tcg gcg ccc tcg gtg acg ctg tac ccg ccg tcg 1104 Asp Phe Phe Ala Pro Ser Ala Pro Ser Val Thr Leu Tyr Pro Pro Ser cgg gag ttt cac gag gag aag gtc gca caa gaa ctg gcc tgg ctg acc 1152 Arg Glu Phe His Glu Glu Lys Val Ala Gln Glu Leu Ala Trp Leu Thr cgc tgg aag acg tga 1167 Arg Trp Lys Thr <210> 79 <211> 388 <212> PRT
<213> M.Tuberculosis <400> 79 Met Ser Lys Thr Val Leu Ile Leu Gly Ala Gly Val Gly Gly Leu Thr Thr Ala Asp Thr Leu Arg Gln Leu Leu Pro Pro Glu Asp Arg Ile Ile Leu Val Asp Arg Ser Phe Asp Gly Thr Leu Gly Leu Ser Leu Leu Trp Val Leu Arg Gly Trp Arg Arg Pro Asp Asp Val Arg Val Arg Pro Thr Ala Ala Ser Leu Pro Gly Val Glu Met Val Thr Ala Thr Val Ala His Ile Asp Ile Ala Ala Gln Val Val His Thr Asp Asn Ser Val Ile Gly Tyr Asp Ala Leu Val Ile Ala Leu Gly Ala Ala Leu Asn Thr Asp Ala Val Pro Gly Leu Ser Asp Ala Leu Asp Ala Asp Val Ala Gly Gln Phe Tyr Thr Leu Asp Gly Ala Ala Glu Leu Arg Ala Lys Val Glu Ala Leu Glu His Gly Arg Ile Ala Val Ala Ile Ala Gly Val Pro Phe Lys Cys Pro Ala Ala Pro Phe Glu Ala Ala Phe Leu Ile Ala Ala Gln Leu Gly Asp Arg Tyr Ala Thr Gly Thr Val Gln Ile Asp Thr Phe Thr Pro Asp Pro Leu Pro Met Pro Val Ala Gly Pro Glu Val Gly Glu Ala Leu Val Ser Met Leu Lys Asp His Gly Val Gly Phe His Pro Arg Lys Ala Leu Ala Arg Val Asp Glu Ala Ala Arg Thr Met His Phe Gly Asp Gly Thr Ser Glu Pro Phe Asp Leu Leu Ala Val Val Pro Pro His Val Pro Ser Ala Ala Ala Arg Ser Ala Gly Leu Ser Glu Ser Gly Trp Ile Pro Val Asp Pro Arg Thr Leu Ser Thr Ser Ala Asp Asn Val Trp Ala Ile Gly Asp Ala Thr Val Leu Thr Leu Pro Asn Gly Lys Pro Leu Pro Lys Ala Ala Val Phe Ala Glu Ala Gln Ala Ala Val Val Ala His Gly Val Ala Arg His Leu Gly Tyr Asp Val Ala Glu Arg His Phe Thr Gly Thr Gly Ala Cys Tyr Val Glu Thr Gly Asp His Gln Ala Ala Lys Gly Asp Gly Asp Phe Phe Ala Pro Ser Ala Pro Ser Val Thr Leu Tyr Pro Pro Ser Arg Glu Phe His Glu Glu Lys Val Ala Gln Glu Leu Ala Trp Leu Thr Arg Trp Lys Thr <210> 80 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 80 Ala Leu Lys Val Glu Met Val Thr Phe Asp Xaa Ser Asp Pro Ala <210> 81 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 81 Ala Asp Ala Asp Thr Thr Asp Phe Asp Val Asp Ala Glu Ala Pro <210> 82 <211> 15 <212> PRT
<213> M.Tuberculosis <400> 82 Ser Lys Thr Val Leu Ile Leu Gly Ala Gly Val Gly Gly Leu Thr <210> 83 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 83 ctgagatcta tggcactcaa ggtagag 27 <210> 84 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 84 ctcccatggt tattgacccg ccacgca 27 <210> 85 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 85 ctgagatcta tggccgacgc tgacacc 27 <210> 86 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 86 ctcccatggc tagtcgcgga gcacaac 27 <210> 87 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 87 ctgagatcta tgagcaagac ggttctc 27 <210> 88 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 88 ctcccatggt cacgtcttcc agcgggt 27 <210> 89 <211> 28 <212> DNA
<213> M.Tuberculosis <400> 89 ctgccatggc taggtggtgt gcacgatc 28 <210> 90 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 90 ctgaagctta tgagcgccta taagacc 27 <210> 91 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 91 ctgagatcta tgattgatga ggctctc 27 <210> 92 <211> 27 <212> DNA
<213> M.Tuberculosis <400> 92 ctcccatgga gcggccgcta gacctcc 27 <210> 93 <211> 30 <212> DNA
<213> M.Tuberculosis <400> 93 ggctgagact catggccgac atcgatggtg 30 <210> 94 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 94 cgtaccatgg tcatgacgac accccctcgt g 31 <210> 95 <211> 30 <212> DNA
<213> M.Tuberculosis <400> 95 ggctgagact catggctgaa gtactggtgc 30 <210> 96 <211> 31 <212> DNA
<213> M.Tuberculosis <400> 96 cgtaccatgg ctagccggcg accgccggtt c 31 <210> 97 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 97 gtgaccgaac ggactctggt 20 <210> 98 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 98 ctaggcgccg ggaaaccaga g 21 <210> 99 <211> 23 <212> DNA
<213> M.Tuberculosis <400> 99 atgacggata ctcaagtcac ctg 23 <210> 100 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 100 ggagtggtac ggctcggcgc 20 <210> 101 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 101 atgacgtacg aaaccatcct 20 <210> 102 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 102 tcatcggtgg gtgaactggg g 21 <210> 103 <211> 23 <212> DNA
<213> M.Tuberculosis <400> 103 atgccgcttc ccgcagaccc tag 23 <210> 104 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 104 tacgacgggt accactcctg g 21 <210> 105 <211> 22 <212> DNA
<213> M.Tuberculosis <400> 105 atgctgatct cacagcgccc ca 22 <210> 106 <211> 22 <212> DNA
<213> M.Tuberculosis <400> 106 aagctgttcg gtttcggcgt ag 22 <210> 107 <211> 20 <212> DNA
<213> M.Tuberculosis <400> 107 atgaccggaa atttggtgac 20 <210> 108 <211> 21 <212> DNA
<213> M.Tuberculosis <400> 108 tcagtagcgg tagtggtccg g 21
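The 27-nucleotide primers in the listing appear to share a pattern: a 9-nucleotide 5' tail (ctgagatct or ctcccatgg, which contain the agatct and ccatgg motifs) followed by 18 nucleotides taken from one end of a coding sequence. The following is a minimal sketch of how that reading can be checked for SEQ ID NO: 83 and 84 against the coding sequence of SEQ ID NO: 74. It is an illustration only and not part of the patent; the 18-nucleotide window and the helper names are assumptions made for this example.

```python
# Illustrative check only; helper names and the 18-nt window are assumptions.
# Sequences are copied from the listing (SEQ ID NO: 74, 83 and 84).

CDS_START = "atggcactcaaggtagag"   # first 18 nt of SEQ ID NO: 74
CDS_END = "tgcgtggcgggtcaataa"     # last 18 nt of SEQ ID NO: 74, including the taa stop
PRIMER_83 = "ctgagatctatggcactcaaggtagag"
PRIMER_84 = "ctcccatggttattgacccgccacgca"

COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def three_prime_matches(primer: str, target: str) -> bool:
    """True if the 3' end of the primer is identical to the target string."""
    return primer.endswith(target)


# The forward primer should end with the start of the coding strand.
forward_ok = three_prime_matches(PRIMER_83, CDS_START)
# The reverse primer should end with the reverse complement of the CDS end.
reverse_ok = three_prime_matches(PRIMER_84, reverse_complement(CDS_END))

print(forward_ok, reverse_ok)
```

Under these assumptions both checks hold for this primer pair; the same comparison could be repeated for the other primer pairs and coding sequences in the listing.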
Claims (23)
1. A substantially pure polypeptide, which has a sequence identity of at least 80% to SEQ ID NO: 34 or a subsequence of at least 8 amino acids thereof, wherein the polypeptide or the subsequence thereof has at least one of the following properties:
i) the polypeptide induces an in vitro recall response determined by a release of IFN-γ of at least 1,500 pg/ml from reactivated memory T-lymphocytes withdrawn from a mouse within 4 days after the mouse has been rechallenged with 1 x 10^6 virulent Mycobacteria, the induction being performed by the addition of the polypeptide to a suspension comprising about 2 x 10^6 cells isolated from the spleen of said mouse, the addition of the polypeptide resulting in a concentration of the polypeptide of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 3 days after the addition of the polypeptide to the suspension,
ii) the polypeptide induces an in vitro response during primary infection with virulent Mycobacteria, determined by release of IFN-γ of at least 1,500 pg/ml from T-lymphocytes withdrawn from a mouse within 28 days after the mouse has been infected with 5 x 10^4 virulent Mycobacteria, the induction being performed by the addition of the polypeptide to a suspension comprising about 2 x 10^5 cells isolated from the spleen, the addition of the polypeptide resulting in a concentration of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 3 days after the addition of the polypeptide to the suspension,
iii) the polypeptide induces a protective immunity determined by vaccinating an animal with the polypeptide and an adjuvant a total of three times at two-week intervals starting at 6-8 weeks of age, challenging 6 weeks after the last vaccination with 5 x 10^6 virulent Mycobacteria/ml by aerosol, and determining a significant decrease in the number of bacteria recoverable from the spleen 6 weeks after the animal has been challenged, compared to the number recovered from the same organ in an animal given placebo treatment,
v) the polypeptide induces a specific antibody response in a TB patient as determined by an ELISA technique or a western blot when the whole blood is diluted 1:20 in PBS and stimulated with the polypeptide in a concentration of not more than 20 µg/ml,
vi) the polypeptide induces a positive in vitro response determined by release of IFN-γ of at least 500 pg/ml from Peripheral Blood Mononuclear Cells (PBMC) withdrawn from an individual who is clinically or subclinically infected with a virulent Mycobacterium, the induction being performed by the addition of the polypeptide to a suspension comprising about 1.0 to 2.5 x 10^5 PBMC, the addition of the polypeptide resulting in a concentration of not more than 20 µg per ml suspension, the release of IFN-γ being assessable by determination of IFN-γ in supernatant harvested 5 days after the addition of the polypeptide to the suspension, and does not induce such an IFN-γ release in an individual not infected with a virulent Mycobacterium,
viii) the polypeptide induces a positive DTH response determined by intradermal injection of at most 100 µg of the polypeptide to an individual who is clinically or subclinically infected with a virulent Mycobacterium, a positive response having a diameter of at least 10 mm 72 hours after the injection, and does not induce such a response in an individual not infected with a virulent Mycobacterium.
2. A substantially pure polypeptide which comprises an amino acid sequence consisting of SEQ ID NO: 34.
3. A polypeptide according to any of claims 1 or 2, which comprises an amino acid sequence which has a sequence identity of at least 80% to an amino acid sequence consisting of SEQ ID NO: 34 and/or is a subsequence thereof.
4. A purified or non-naturally occurring polypeptide as defined in any of claims 1-3 which comprises a T cell epitope.
5. A purified or non-naturally occurring polypeptide as defined in any of claims 1-4 which comprises a B cell epitope.
6. A polypeptide according to any of claims 1-5, wherein the polypeptide is encodable by a nucleic acid sequence, which sequence 1) is the DNA sequence consisting of SEQ ID NO: 33 or an analogue of said sequence which hybridises with any of the DNA sequences shown in SEQ ID NO: 33 or a DNA sequence complementary thereto, or a specific part thereof, under stringent hybridization conditions, and/or 2) encodes a polypeptide, the amino acid sequence of which has an 80% sequence identity with an amino acid sequence consisting of SEQ ID NO: 34 and/or 3) constitutes a subsequence of any of the above mentioned DNA sequences, and/or 4) constitutes a subsequence of any of the above mentioned polypeptide sequences.
7. A polypeptide as defined in any of claims 1-6 for use in medicine.
8. Use of a polypeptide as defined in any of claims 1-6 for the manufacture of a diagnostic reagent for the diagnosis of an infection with a virulent Mycobacterium.
9. Use of a polypeptide as defined in any of claims 1-6 for the manufacture of a composition for induction of a protective immune response in a mammal against infection with a virulent Mycobacterium.
10. A composition comprising a polypeptide as defined in any of claims 1-7, further comprising at least one other polypeptide derived from a virulent Mycobacterium.
11. A composition comprising, as the effective component, a recombinant micro-organism, wherein at least one copy of a DNA sequence comprising a DNA sequence encoding a polypeptide as defined in any of claims 1-6 has been incorporated into the genome of the micro-organism in a manner allowing the micro-organism to express and secrete the polypeptide.
12. A diagnostic reagent for diagnosing an infection with a virulent Mycobacterium comprising a polypeptide as defined in any of claims 1-7, optionally in combination with a pharmaceutically acceptable carrier or vehicle.
13. A diagnostic reagent according to claim 12 for differentiating an individual who is clinically or subclinically infected with a virulent Mycobacterium from an individual not infected with virulent Mycobacterium.
14. A diagnostic reagent according to claim 12 for differentiating an individual who is clinically or subclinically infected with a virulent Mycobacterium from an individual who has a cleared infection with a virulent Mycobacterium.
15. A diagnostic reagent according to claim 12 for diagnosing an infection with Mycobacterium tuberculosis.
16. An extract of polypeptides obtainable by a method comprising the steps of a) killing a sample of virulent Mycobacteria;
b) centrifugating the sample of a) at 2,000g for 40 minutes;
c) resuspending the pellet of b) in PBS and 0.5% Tween 20 and sonicating with 20 rounds of 90 seconds;
d) centrifugating the suspension of c) at 5,000g for 30 minutes;
e) extracting soluble proteins from the cytosol as well as cell wall and cell membrane components from the supernatant of d) with 10% SDS;
f) centrifugating the extract of e) at 20,000g for 30 minutes;
g) precipitating the supernatant of f) with 8 volumes of cold acetone;
with an adjuvant substance.
17. Use of an extract of polypeptides with an adjuvant substance according to claim 16 for the preparation of a composition for the generation of an immune response against a virulent Mycobacterium.
18. A method of screening for inhibition of the infectivity of a virulent Mycobacterium belonging to the tuberculosis complex, said method comprising a) inhibiting the expression of one or more of the polypeptides according to the invention, and b) observing the effect, if any, on the infectivity of the bacteria.
19. A method according to claim 18 wherein the expression is inhibited by blocking the transcription of the polypeptides or by interfering with regulatory sequences.
20. A method according to claim 19, wherein the inhibition is at the level of translation or post-translational processing of the polypeptides or by direct interaction with the polypeptides.
21. A method of using the polypeptides having a significant effect on the infectivity of a virulent Mycobacterium as tested in any of claims 18-20 for designing a prophylactic or therapeutic agent.
22. A nucleotide sequence which is a nucleotide sequence consisting of SEQ ID NO: 33 or an analogue of said sequence which hybridises with any of the nucleotide sequences shown in SEQ ID NO: 33 or a nucleotide sequence complementary thereto, or a specific part or subsequence thereof, under stringent hybridisation conditions.
23. A monoclonal or polyclonal antibody, which is specifically reacting with a polypeptide according to any of claims 1-7 in an immunoassay, or a specific binding fragment of said antibody.
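Several claims rely on a sequence identity threshold of at least 80%. The sketch below shows one simple way such a percentage could be computed. It assumes the two sequences are already aligned and of equal length, and it assumes identity is counted as (N_ref - N_dif) x 100 / N_ref, where N_dif is the number of non-identical positions and N_ref is the number of residues in the reference; that counting rule and the function names are assumptions for illustration and are not taken from the claims. The worked example compares residues 2-16 of SEQ ID NO: 75 with the peptide of SEQ ID NO: 80, treating the undetermined residue Xaa as a mismatch.

```python
# Illustrative sketch only; the counting rule and names are assumptions.

def parse_three_letter(seq: str) -> list:
    """Split a space-separated three-letter amino acid string into residues."""
    return seq.split()


def percent_identity(reference: list, candidate: list) -> float:
    """Percent identity of two pre-aligned, equal-length residue lists."""
    if len(reference) != len(candidate):
        raise ValueError("sequences must be pre-aligned to equal length")
    n_ref = len(reference)
    n_dif = sum(1 for a, b in zip(reference, candidate) if a != b)
    return (n_ref - n_dif) * 100 / n_ref


# Residues 2-16 of SEQ ID NO: 75 and the 15-residue peptide of SEQ ID NO: 80.
SEQ75_2_16 = parse_three_letter(
    "Ala Leu Lys Val Glu Met Val Thr Phe Asp Cys Ser Asp Pro Ala"
)
SEQ80 = parse_three_letter(
    "Ala Leu Lys Val Glu Met Val Thr Phe Asp Xaa Ser Asp Pro Ala"
)

identity = percent_identity(SEQ75_2_16, SEQ80)
meets_threshold = identity >= 80.0
print(f"{identity:.1f}% identical; at least 80%: {meets_threshold}")
```

Real comparisons would normally start from an alignment step that handles gaps and unequal lengths; that step is deliberately left out of this sketch.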
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DKPA199801281 | 1998-10-08 | ||
DKPA199801281 | 1998-10-08 | ||
US11667399P | 1999-01-21 | 1999-01-21 | |
US60/116,673 | 1999-01-21 | ||
PCT/DK1999/000538 WO2000021983A2 (en) | 1998-10-08 | 1999-10-08 | Tuberculosis vaccine and diagnostic reagents based on antigens from the mycobacterium tuberculosis cell |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2346218A1 (en) | 2000-04-20 |
Family
ID=26065520
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002346218A Abandoned CA2346218A1 (en) | 1998-10-08 | 1999-10-08 | Tuberculosis vaccine and diagnostic reagents based on antigens from the mycobacterium tuberculosis cell |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1117683A2 (en) |
AU (1) | AU766093B2 (en) |
CA (1) | CA2346218A1 (en) |
WO (1) | WO2000021983A2 (en) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB0030368D0 (en) * | 2000-12-13 | 2001-01-24 | Inst Of Molecul & Cell Biology | Dormancy-induced mycobacterium proteins |
ES2231037B1 (en) * | 2003-10-31 | 2005-12-16 | Archivel Technologies, Sl | USEFUL IMMUNOTHERAPIC AGENT FOR THE COMBINED TREATMENT OF TUBERCULOSIS IN ASSOCIATION WITH OTHER PHARMACOS. |
EP1812580B1 (en) | 2004-11-16 | 2014-12-17 | Crucell Holland B.V. | Multivalent vaccines comprising recombinant viral vectors |
US7608277B2 (en) * | 2004-12-01 | 2009-10-27 | Gene Therapy Systems, Inc. | Tuberculosis nucleic acids, polypeptides and immunogenic compositions |
WO2007108829A2 (en) * | 2005-10-26 | 2007-09-27 | Gene Therapy Systems, Inc. | Tuberculosis nucleic acids, polypeptides and immunogenic compositions |
ES2307402B1 (en) * | 2006-10-30 | 2009-09-30 | Archivel Farma, S.L. | PROFILACTIC VACCINE AGAINST TUBERCULOSIS. |
US10414819B2 (en) | 2013-08-30 | 2019-09-17 | Longhorn Vaccines And Diagnostics, Llc | Monoclonal antibodies that modulate immunity to MTB and enhance immune clearance |
US10370437B2 (en) | 2013-08-30 | 2019-08-06 | Longhorn Vaccines And Diagnostics, Llc | Antibodies that modulate immunity to drug resistant and latent MTB infections |
CA2922431C (en) * | 2013-08-30 | 2022-01-11 | Longhorn Vaccines And Diagnostics, Llc | Enhancing immunity to tuberculosis |
CN106248934B (en) * | 2016-08-25 | 2018-04-06 | 中国疾病预防控制中心传染病预防控制所 | Antigen of mycobacterium tuberculosis albumen Rv0446c and its t cell epitope peptide application |
CN106248935B (en) * | 2016-08-31 | 2018-04-06 | 中国疾病预防控制中心传染病预防控制所 | Antigen of mycobacterium tuberculosis albumen Rv1798 and its t cell epitope peptide application |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6290969B1 (en) * | 1995-09-01 | 2001-09-18 | Corixa Corporation | Compounds and methods for immunotherapy and diagnosis of tuberculosis |
-
1999
- 1999-10-08 AU AU60784/99A patent/AU766093B2/en not_active Ceased
- 1999-10-08 CA CA002346218A patent/CA2346218A1/en not_active Abandoned
- 1999-10-08 WO PCT/DK1999/000538 patent/WO2000021983A2/en not_active Application Discontinuation
- 1999-10-08 EP EP99947257A patent/EP1117683A2/en not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
WO2000021983A2 (en) | 2000-04-20 |
AU766093B2 (en) | 2003-10-09 |
EP1117683A2 (en) | 2001-07-25 |
WO2000021983A3 (en) | 2000-11-23 |
AU6078499A (en) | 2000-05-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2008224343B2 (en) | Tuberculosis vaccine and diagnostics based on the Mycobacterium tuberculosis esat-6 gene family | |
JP4759011B2 (en) | Compounds for immunotherapy and diagnosis of tuberculosis and methods of their use | |
US8076469B2 (en) | TB diagnostic based on antigens from M. tuberculosis | |
ES2229220T3 (en) | VACCINE AGAINST TUBERCULOSIS. | |
AU750173B2 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
US20130149324A1 (en) | Therapeutic tb vaccine | |
AU740545B2 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
CA2405247A1 (en) | Tuberculosis antigens and methods of use thereof | |
JP2003510018A5 (en) | ||
CA2346218A1 (en) | Tuberculosis vaccine and diagnostic reagents based on antigens from the mycobacterium tuberculosis cell | |
US20040013685A1 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
AU2012202486B2 (en) | Tuberculosis vaccine and diagnostics based on the Mycobacterium tuberculosis esat-6 gene family | |
JP5075969B2 (en) | Mycobacterium tuberculosis esat-6 gene family based tuberculosis vaccine and diagnostic method | |
US7041295B2 (en) | Compounds for treatment of infectious and immune system disorders and methods for their use | |
EP1787994A1 (en) | TB vaccine and diagnostic based on antigens from M. tuberculosis cell | |
ANDERSEN et al. | Patent 2378763 Summary | |
AU2006252186A1 (en) | Nucleic acid fragments and polypeptide fragments derived from M. tuberculosis | |
NZ502423A (en) | Polynucleotide sequences, designated GS, in pathogenic mycobacteria and their use in vaccines |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
FZDE | Discontinued |