CA2341765A1 - tRNA binding domain - Google Patents
tRNA binding domain
- Publication number
- CA2341765A1
- Authority
- CA
- Canada
- Prior art keywords
- peptide
- trna
- loop
- leu
- val
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 102000000885 tRNA-binding domains Human genes 0.000 title description 4
- 108050007916 tRNA-binding domains Proteins 0.000 title description 4
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 212
- 108020004566 Transfer RNA Proteins 0.000 claims abstract description 133
- 108020005098 Anticodon Proteins 0.000 claims abstract description 92
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 66
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 66
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 66
- 150000001413 amino acids Chemical class 0.000 claims abstract description 45
- 238000000034 method Methods 0.000 claims abstract description 41
- 150000001875 compounds Chemical class 0.000 claims abstract description 39
- 230000003993 interaction Effects 0.000 claims abstract description 37
- 230000027455 binding Effects 0.000 claims abstract description 34
- 239000000126 substance Substances 0.000 claims abstract description 31
- 239000000203 mixture Substances 0.000 claims abstract description 28
- 210000003705 ribosome Anatomy 0.000 claims abstract description 27
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 claims abstract description 23
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 claims abstract description 23
- 238000012360 testing method Methods 0.000 claims abstract description 23
- 108010000605 Ribosomal Proteins Proteins 0.000 claims abstract description 22
- 102000002278 Ribosomal Proteins Human genes 0.000 claims abstract description 22
- 239000000556 agonist Substances 0.000 claims abstract description 14
- 239000005557 antagonist Substances 0.000 claims abstract description 12
- 239000008194 pharmaceutical composition Substances 0.000 claims abstract description 7
- 230000002401 inhibitory effect Effects 0.000 claims abstract description 4
- 229940024606 amino acid Drugs 0.000 claims description 45
- 235000001014 amino acid Nutrition 0.000 claims description 45
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 claims description 43
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 claims description 40
- 235000014705 isoleucine Nutrition 0.000 claims description 40
- 229960000310 isoleucine Drugs 0.000 claims description 40
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 claims description 40
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 claims description 36
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical group CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 claims description 35
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Chemical group CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 claims description 35
- 235000005772 leucine Nutrition 0.000 claims description 35
- 235000014393 valine Nutrition 0.000 claims description 35
- 239000004474 valine Substances 0.000 claims description 35
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 claims description 34
- 229960003136 leucine Drugs 0.000 claims description 34
- 235000006109 methionine Nutrition 0.000 claims description 22
- 229930182817 methionine Natural products 0.000 claims description 22
- 239000004471 Glycine Substances 0.000 claims description 21
- 239000007787 solid Substances 0.000 claims description 21
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 20
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 claims description 18
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 claims description 17
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 claims description 17
- 235000004400 serine Nutrition 0.000 claims description 17
- 235000004279 alanine Nutrition 0.000 claims description 16
- 239000004475 Arginine Substances 0.000 claims description 15
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 claims description 15
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 15
- 235000009697 arginine Nutrition 0.000 claims description 15
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 claims description 14
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 claims description 13
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 claims description 13
- 239000004472 Lysine Substances 0.000 claims description 13
- 235000009582 asparagine Nutrition 0.000 claims description 13
- 229960001230 asparagine Drugs 0.000 claims description 13
- 235000018977 lysine Nutrition 0.000 claims description 13
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 claims description 12
- 230000014616 translation Effects 0.000 claims description 11
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 10
- 229960005190 phenylalanine Drugs 0.000 claims description 10
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 claims description 9
- 235000018417 cysteine Nutrition 0.000 claims description 9
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 claims description 9
- 235000008729 phenylalanine Nutrition 0.000 claims description 9
- 229960004441 tyrosine Drugs 0.000 claims description 9
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 claims description 8
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 claims description 8
- 239000004473 Threonine Substances 0.000 claims description 8
- 235000008521 threonine Nutrition 0.000 claims description 8
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 8
- 235000002374 tyrosine Nutrition 0.000 claims description 8
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 claims description 7
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 claims description 7
- 235000003704 aspartic acid Nutrition 0.000 claims description 7
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 claims description 7
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 claims description 7
- 235000014304 histidine Nutrition 0.000 claims description 7
- 235000013930 proline Nutrition 0.000 claims description 7
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 claims description 6
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 claims description 6
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 claims description 6
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 claims description 6
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 6
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 claims description 6
- 235000004554 glutamine Nutrition 0.000 claims description 6
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 claims description 5
- 231100000742 Plant toxin Toxicity 0.000 claims description 5
- 239000003443 antiviral agent Substances 0.000 claims description 5
- 239000002596 immunotoxin Substances 0.000 claims description 5
- 231100000608 immunotoxin Toxicity 0.000 claims description 5
- 230000002637 immunotoxin Effects 0.000 claims description 5
- 229940051026 immunotoxin Drugs 0.000 claims description 5
- 239000003123 plant toxin Substances 0.000 claims description 5
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 4
- 239000003242 anti bacterial agent Substances 0.000 claims description 4
- 230000015572 biosynthetic process Effects 0.000 claims description 4
- 235000013922 glutamic acid Nutrition 0.000 claims description 4
- 239000004220 glutamic acid Substances 0.000 claims description 4
- 238000001243 protein synthesis Methods 0.000 claims description 4
- 238000002360 preparation method Methods 0.000 claims description 3
- 230000002452 interceptive effect Effects 0.000 claims description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 claims description 2
- XTHOIFAGDPGJPZ-PQQJDVFMSA-N (e,2r,3r,4s,5r)-n-(2,3-dihydro-1h-inden-2-yl)-3,4,5-trihydroxy-2-methoxy-8,8-dimethylnon-6-enamide Chemical compound C1=CC=C2CC(NC(=O)[C@@H]([C@H](O)[C@@H](O)[C@H](O)\C=C\C(C)(C)C)OC)CC2=C1 XTHOIFAGDPGJPZ-PQQJDVFMSA-N 0.000 claims 1
- GARHCDOTUULBOQ-PKNBQFBNSA-N 4-{(1e)-3-oxo-3-[(2-phenylethyl)amino]prop-1-en-1-yl}-1,2-phenylene diacetate Chemical compound C1=C(OC(C)=O)C(OC(=O)C)=CC=C1\C=C\C(=O)NCCC1=CC=CC=C1 GARHCDOTUULBOQ-PKNBQFBNSA-N 0.000 claims 1
- VSHUQLRHTJOKTA-XBXARRHUSA-N N-cis-Caffeoyltyramine Chemical compound C1=CC(O)=CC=C1CCNC(=O)\C=C\C1=CC=C(O)C(O)=C1 VSHUQLRHTJOKTA-XBXARRHUSA-N 0.000 claims 1
- 239000003937 drug carrier Substances 0.000 claims 1
- HBPXFHNNLMCUPA-UHFFFAOYSA-M molport-023-277-200 Chemical compound [Br-].C1N(C2)CN3CN2C[N+]1(CCO)C3 HBPXFHNNLMCUPA-UHFFFAOYSA-M 0.000 claims 1
- 238000005406 washing Methods 0.000 claims 1
- 102000004196 processed proteins & peptides Human genes 0.000 description 75
- 108090000623 proteins and genes Proteins 0.000 description 74
- 235000018102 proteins Nutrition 0.000 description 69
- 102000004169 proteins and genes Human genes 0.000 description 69
- 101710146427 Probable tyrosine-tRNA ligase, cytoplasmic Proteins 0.000 description 39
- 101710107268 Tyrosine-tRNA ligase, mitochondrial Proteins 0.000 description 39
- 102000018378 Tyrosine-tRNA ligase Human genes 0.000 description 38
- 210000004027 cell Anatomy 0.000 description 21
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 17
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 14
- 108010008355 arginyl-glutamine Proteins 0.000 description 14
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 13
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 12
- 230000000694 effects Effects 0.000 description 12
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 10
- 210000003470 mitochondria Anatomy 0.000 description 10
- 241000193385 Geobacillus stearothermophilus Species 0.000 description 9
- 239000013604 expression vector Substances 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- 241000894006 Bacteria Species 0.000 description 8
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 8
- -1 amino acid adenylate Chemical class 0.000 description 8
- 239000012634 fragment Substances 0.000 description 8
- 241000588724 Escherichia coli Species 0.000 description 7
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 7
- JXCOEPXCBVCTRD-JYJNAYRXSA-N Val-Tyr-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JXCOEPXCBVCTRD-JYJNAYRXSA-N 0.000 description 7
- 239000002253 acid Substances 0.000 description 7
- 210000004899 c-terminal region Anatomy 0.000 description 7
- 108020001507 fusion proteins Proteins 0.000 description 7
- 102000037865 fusion proteins Human genes 0.000 description 7
- 230000014509 gene expression Effects 0.000 description 7
- 108010028295 histidylhistidine Proteins 0.000 description 7
- 108010018625 phenylalanylarginine Proteins 0.000 description 7
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 6
- IOFDDSNZJDIGPB-GVXVVHGQSA-N Gln-Leu-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IOFDDSNZJDIGPB-GVXVVHGQSA-N 0.000 description 6
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 6
- 108010047562 NGR peptide Proteins 0.000 description 6
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 6
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 6
- 238000003556 assay Methods 0.000 description 6
- 230000001580 bacterial effect Effects 0.000 description 6
- 238000007796 conventional method Methods 0.000 description 6
- 230000004927 fusion Effects 0.000 description 6
- 239000003446 ligand Substances 0.000 description 6
- 239000003550 marker Substances 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 238000012216 screening Methods 0.000 description 6
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 5
- 241000196324 Embryophyta Species 0.000 description 5
- 102000004190 Enzymes Human genes 0.000 description 5
- 108090000790 Enzymes Proteins 0.000 description 5
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 5
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 5
- AWHJQEYGWRKPHE-LSJOCFKGSA-N His-Ala-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AWHJQEYGWRKPHE-LSJOCFKGSA-N 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- 150000007513 acids Chemical class 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 239000000463 material Substances 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 229920001184 polypeptide Polymers 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 108010048818 seryl-histidine Proteins 0.000 description 5
- YVOOPGWEIRIUOX-UHFFFAOYSA-N 2-azanyl-3-sulfanyl-propanoic acid Chemical compound SCC(N)C(O)=O.SCC(N)C(O)=O YVOOPGWEIRIUOX-UHFFFAOYSA-N 0.000 description 4
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 4
- MUXONAMCEUBVGA-DCAQKATOSA-N Arg-Arg-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O MUXONAMCEUBVGA-DCAQKATOSA-N 0.000 description 4
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 4
- 108010069514 Cyclic Peptides Proteins 0.000 description 4
- 102000001189 Cyclic Peptides Human genes 0.000 description 4
- 241000206602 Eukaryota Species 0.000 description 4
- 108010070675 Glutathione transferase Proteins 0.000 description 4
- 102100029100 Hematopoietic prostaglandin D synthase Human genes 0.000 description 4
- MPXGJGBXCRQQJE-MXAVVETBSA-N His-Ile-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O MPXGJGBXCRQQJE-MXAVVETBSA-N 0.000 description 4
- 241000282414 Homo sapiens Species 0.000 description 4
- MKWSZEHGHSLNPF-NAKRPEOUSA-N Ile-Ala-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O)N MKWSZEHGHSLNPF-NAKRPEOUSA-N 0.000 description 4
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 4
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 4
- QSXSHZIRKTUXNG-STECZYCISA-N Ile-Val-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QSXSHZIRKTUXNG-STECZYCISA-N 0.000 description 4
- 108060003951 Immunoglobulin Proteins 0.000 description 4
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 4
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 4
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 4
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 4
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 4
- 102000003960 Ligases Human genes 0.000 description 4
- 108090000364 Ligases Proteins 0.000 description 4
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 4
- LQMHZERGCQJKAH-STQMWFEESA-N Met-Gly-Phe Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LQMHZERGCQJKAH-STQMWFEESA-N 0.000 description 4
- 241000192584 Synechocystis Species 0.000 description 4
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 4
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 4
- 230000001086 cytosolic effect Effects 0.000 description 4
- 238000012217 deletion Methods 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 4
- 102000018358 immunoglobulin Human genes 0.000 description 4
- 238000010348 incorporation Methods 0.000 description 4
- 108010060857 isoleucyl-valyl-tyrosine Proteins 0.000 description 4
- JVTAAEKCZFNVCJ-UHFFFAOYSA-N lactic acid Chemical compound CC(O)C(O)=O JVTAAEKCZFNVCJ-UHFFFAOYSA-N 0.000 description 4
- 229920002521 macromolecule Polymers 0.000 description 4
- 230000000144 pharmacologic effect Effects 0.000 description 4
- 239000013612 plasmid Substances 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 238000003259 recombinant expression Methods 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000007423 screening assay Methods 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 241001515965 unidentified phage Species 0.000 description 4
- 239000003981 vehicle Substances 0.000 description 4
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 3
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 3
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 3
- UHFUZWSZQKMDSX-DCAQKATOSA-N Arg-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UHFUZWSZQKMDSX-DCAQKATOSA-N 0.000 description 3
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 3
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 3
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 3
- 235000014469 Bacillus subtilis Nutrition 0.000 description 3
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 3
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 3
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 3
- FHQRLHFYVZAQHU-IUCAKERBSA-N Gly-Lys-Gln Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O FHQRLHFYVZAQHU-IUCAKERBSA-N 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- JJHWJUYYTWYXPL-PYJNHQTQSA-N His-Ile-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CN=CN1 JJHWJUYYTWYXPL-PYJNHQTQSA-N 0.000 description 3
- IOVUXUSIGXCREV-DKIMLUQUSA-N Ile-Leu-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IOVUXUSIGXCREV-DKIMLUQUSA-N 0.000 description 3
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 3
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 3
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 3
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- 239000002202 Polyethylene glycol Substances 0.000 description 3
- 125000000539 amino acid group Chemical group 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 125000004057 biotinyl group Chemical group [H]N1C(=O)N([H])[C@]2([H])[C@@]([H])(SC([H])([H])[C@]12[H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C(*)=O 0.000 description 3
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 3
- 210000003763 chloroplast Anatomy 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 108010049041 glutamylalanine Proteins 0.000 description 3
- 108010050848 glycylleucine Proteins 0.000 description 3
- 230000002209 hydrophobic effect Effects 0.000 description 3
- 230000028993 immune response Effects 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 229920001223 polyethylene glycol Polymers 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 108010029020 prolylglycine Proteins 0.000 description 3
- 238000007363 ring formation reaction Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- OIXLLKLZKCBCPS-RZVRUWJTSA-N (2s)-2-azanyl-5-[bis(azanyl)methylideneamino]pentanoic acid Chemical compound OC(=O)[C@@H](N)CCCNC(N)=N.OC(=O)[C@@H](N)CCCNC(N)=N OIXLLKLZKCBCPS-RZVRUWJTSA-N 0.000 description 2
- MYRTYDVEIRVNKP-UHFFFAOYSA-N 1,2-Divinylbenzene Chemical compound C=CC1=CC=CC=C1C=C MYRTYDVEIRVNKP-UHFFFAOYSA-N 0.000 description 2
- 108020004465 16S ribosomal RNA Proteins 0.000 description 2
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 2
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 2
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- 101000640990 Arabidopsis thaliana Tryptophan-tRNA ligase, chloroplastic/mitochondrial Proteins 0.000 description 2
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- JCAISGGAOQXEHJ-ZPFDUUQYSA-N Arg-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N JCAISGGAOQXEHJ-ZPFDUUQYSA-N 0.000 description 2
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 2
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 2
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 2
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 2
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 2
- VXLBDJWTONZHJN-YUMQZZPRSA-N Asn-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N VXLBDJWTONZHJN-YUMQZZPRSA-N 0.000 description 2
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 2
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 2
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 2
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 2
- 150000008574 D-amino acids Chemical class 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 229920002307 Dextran Polymers 0.000 description 2
- 101100480329 Enterococcus faecalis (strain TX4000 / JH2-2) tyrS1 gene Proteins 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 2
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- AEMRFAOFKBGASW-UHFFFAOYSA-N Glycolic acid Chemical compound OCC(O)=O AEMRFAOFKBGASW-UHFFFAOYSA-N 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- PGRPSOUCWRBWKZ-DLOVCJGASA-N His-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 PGRPSOUCWRBWKZ-DLOVCJGASA-N 0.000 description 2
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 2
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 2
- 241000880493 Leptailurus serval Species 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 2
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 2
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 2
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 2
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- CGHXMODRYJISSK-NHCYSSNCSA-N Leu-Val-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O CGHXMODRYJISSK-NHCYSSNCSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- NTEVEUCLFMWSND-SRVKXCTJSA-N Lys-Arg-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O NTEVEUCLFMWSND-SRVKXCTJSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- 244000173297 Medicago polymorpha Species 0.000 description 2
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 2
- 241000204051 Mycoplasma genitalium Species 0.000 description 2
- 239000000020 Nitrocellulose Substances 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 229920002352 Peptidyl-tRNA Polymers 0.000 description 2
- QMMRHASQEVCJGR-UBHSHLNASA-N Phe-Ala-Pro Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 QMMRHASQEVCJGR-UBHSHLNASA-N 0.000 description 2
- KOUUGTKGEQZRHV-KKUMJFAQSA-N Phe-Gln-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KOUUGTKGEQZRHV-KKUMJFAQSA-N 0.000 description 2
- WEMYTDDMDBLPMI-DKIMLUQUSA-N Phe-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N WEMYTDDMDBLPMI-DKIMLUQUSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- 244000308495 Potentilla anserina Species 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- LCTONWCANYUPML-UHFFFAOYSA-N Pyruvic acid Chemical compound CC(=O)C(O)=O LCTONWCANYUPML-UHFFFAOYSA-N 0.000 description 2
- 241000007425 Ramalina americana Species 0.000 description 2
- 101100363853 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) RPS9B gene Proteins 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 2
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 2
- IHAPJUHCZXBPHR-WZLNRYEVSA-N Thr-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N IHAPJUHCZXBPHR-WZLNRYEVSA-N 0.000 description 2
- 102000002501 Tryptophan-tRNA Ligase Human genes 0.000 description 2
- BYOHPUZJVXWHAE-BYULHYEWSA-N Val-Asn-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BYOHPUZJVXWHAE-BYULHYEWSA-N 0.000 description 2
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 2
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 2
- DLRZGNXCXUGIDG-KKHAAJSZSA-N Val-Thr-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O DLRZGNXCXUGIDG-KKHAAJSZSA-N 0.000 description 2
- RLVTVHSDKHBFQP-ULQDDVLXSA-N Val-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 RLVTVHSDKHBFQP-ULQDDVLXSA-N 0.000 description 2
- 238000010521 absorption reaction Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 238000007792 addition Methods 0.000 description 2
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 2
- 125000000266 alpha-aminoacyl group Chemical group 0.000 description 2
- 125000003368 amide group Chemical group 0.000 description 2
- 125000003277 amino group Chemical group 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 239000001110 calcium chloride Substances 0.000 description 2
- 229960002713 calcium chloride Drugs 0.000 description 2
- 235000011148 calcium chloride Nutrition 0.000 description 2
- 229910001628 calcium chloride Inorganic materials 0.000 description 2
- 235000014633 carbohydrates Nutrition 0.000 description 2
- 150000001720 carbohydrates Chemical class 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000000295 complement effect Effects 0.000 description 2
- 239000013078 crystal Substances 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- PMMYEEVYMWASQN-UHFFFAOYSA-N dl-hydroxyproline Natural products OC1C[NH2+]C(C([O-])=O)C1 PMMYEEVYMWASQN-UHFFFAOYSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 239000000194 fatty acid Substances 0.000 description 2
- 238000009472 formulation Methods 0.000 description 2
- 230000002538 fungal effect Effects 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- 125000001841 imino group Chemical group [H]N=* 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 239000004310 lactic acid Substances 0.000 description 2
- 235000014655 lactic acid Nutrition 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 210000004962 mammalian cell Anatomy 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 238000000302 molecular modelling Methods 0.000 description 2
- 229920001220 nitrocellulos Polymers 0.000 description 2
- 239000002773 nucleotide Substances 0.000 description 2
- 125000003729 nucleotide group Chemical group 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 210000001236 prokaryotic cell Anatomy 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 238000011321 prophylaxis Methods 0.000 description 2
- 230000006916 protein interaction Effects 0.000 description 2
- 239000012857 radioactive material Substances 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 108010033786 ribosomal protein S4 Proteins 0.000 description 2
- YGSDEFSMJLZEOE-UHFFFAOYSA-N salicylic acid Chemical compound OC(=O)C1=CC=CC=C1O YGSDEFSMJLZEOE-UHFFFAOYSA-N 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 125000006850 spacer group Chemical group 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- 101150101943 tyrS gene Proteins 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- BJBUEDPLEOHJGE-UHFFFAOYSA-N (2R,3S)-3-Hydroxy-2-pyrolidinecarboxylic acid Natural products OC1CCNC1C(O)=O BJBUEDPLEOHJGE-UHFFFAOYSA-N 0.000 description 1
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 description 1
- WURBVZBTWMNKQT-UHFFFAOYSA-N 1-(4-chlorophenoxy)-3,3-dimethyl-1-(1,2,4-triazol-1-yl)butan-2-one Chemical compound C1=NC=NN1C(C(=O)C(C)(C)C)OC1=CC=C(Cl)C=C1 WURBVZBTWMNKQT-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- BMYNFMYTOJXKLE-UHFFFAOYSA-N 3-azaniumyl-2-hydroxypropanoate Chemical compound NCC(O)C(O)=O BMYNFMYTOJXKLE-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- 229940117976 5-hydroxylysine Drugs 0.000 description 1
- 241000224423 Acanthamoeba castellanii Species 0.000 description 1
- 241000605222 Acidithiobacillus ferrooxidans Species 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- GORKKVHIBWAQHM-GCJQMDKQSA-N Ala-Asn-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GORKKVHIBWAQHM-GCJQMDKQSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- PIXQDIGKDNNOOV-GUBZILKMSA-N Ala-Lys-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O PIXQDIGKDNNOOV-GUBZILKMSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- 102000006268 Alanine-tRNA ligase Human genes 0.000 description 1
- 108010058060 Alanine-tRNA ligase Proteins 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 1
- 239000004382 Amylase Substances 0.000 description 1
- 101000787278 Arabidopsis thaliana Valine-tRNA ligase, chloroplastic/mitochondrial 2 Proteins 0.000 description 1
- 101000787296 Arabidopsis thaliana Valine-tRNA ligase, mitochondrial 1 Proteins 0.000 description 1
- HJWQFFYRVFEWRM-SRVKXCTJSA-N Arg-Arg-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O HJWQFFYRVFEWRM-SRVKXCTJSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- YHQGEARSFILVHL-HJGDQZAQSA-N Arg-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)O YHQGEARSFILVHL-HJGDQZAQSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- XRNXPIGJPQHCPC-RCWTZXSCSA-N Arg-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCNC(N)=N)[C@@H](C)O)C(O)=O XRNXPIGJPQHCPC-RCWTZXSCSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 1
- 102000002249 Arginine-tRNA Ligase Human genes 0.000 description 1
- 108010014885 Arginine-tRNA ligase Proteins 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- NNMUHYLAYUSTTN-FXQIFTODSA-N Asn-Gln-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O NNMUHYLAYUSTTN-FXQIFTODSA-N 0.000 description 1
- GWNMUVANAWDZTI-YUMQZZPRSA-N Asn-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N GWNMUVANAWDZTI-YUMQZZPRSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- QNNBHTFDFFFHGC-KKUMJFAQSA-N Asn-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QNNBHTFDFFFHGC-KKUMJFAQSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- YODBPLSWNJMZOJ-BPUTZDHNSA-N Asp-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N YODBPLSWNJMZOJ-BPUTZDHNSA-N 0.000 description 1
- 102000003924 Asparagine-tRNA ligases Human genes 0.000 description 1
- 108090000314 Asparagine-tRNA ligases Proteins 0.000 description 1
- 102000012951 Aspartate-tRNA Ligase Human genes 0.000 description 1
- 108010065272 Aspartate-tRNA ligase Proteins 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 108090001008 Avidin Proteins 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 108010077805 Bacterial Proteins Proteins 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 241000894010 Buchnera aphidicola Species 0.000 description 1
- 125000001433 C-terminal amino-acid group Chemical group 0.000 description 1
- 229920002134 Carboxymethyl cellulose Polymers 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000195597 Chlamydomonas reinhardtii Species 0.000 description 1
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 1
- 240000000885 Citrullus colocynthis Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- 102000004403 Cysteine-tRNA ligases Human genes 0.000 description 1
- 108090000918 Cysteine-tRNA ligases Proteins 0.000 description 1
- 230000007023 DNA restriction-modification system Effects 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- 101000787280 Dictyostelium discoideum Probable valine-tRNA ligase, mitochondrial Proteins 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 108090000331 Firefly luciferases Proteins 0.000 description 1
- 241000700662 Fowlpox virus Species 0.000 description 1
- 108010058643 Fungal Proteins Proteins 0.000 description 1
- 102000002464 Galactosidases Human genes 0.000 description 1
- 108010093031 Galactosidases Proteins 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 1
- GFLNKSQHOBOMNM-AVGNSLFASA-N Gln-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GFLNKSQHOBOMNM-AVGNSLFASA-N 0.000 description 1
- FFVXLVGUJBCKRX-UKJIMTQDSA-N Gln-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N FFVXLVGUJBCKRX-UKJIMTQDSA-N 0.000 description 1
- SXGMGNZEHFORAV-IUCAKERBSA-N Gln-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N SXGMGNZEHFORAV-IUCAKERBSA-N 0.000 description 1
- BJPPYOMRAVLXBY-YUMQZZPRSA-N Gln-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N BJPPYOMRAVLXBY-YUMQZZPRSA-N 0.000 description 1
- RWCBJYUPAUTWJD-NHCYSSNCSA-N Gln-Met-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O RWCBJYUPAUTWJD-NHCYSSNCSA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- UEILCTONAMOGBR-RWRJDSDZSA-N Gln-Thr-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UEILCTONAMOGBR-RWRJDSDZSA-N 0.000 description 1
- SDSMVVSHLAAOJL-UKJIMTQDSA-N Gln-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N SDSMVVSHLAAOJL-UKJIMTQDSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 1
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 1
- QLPYYTDOUQNJGQ-AVGNSLFASA-N Glu-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N QLPYYTDOUQNJGQ-AVGNSLFASA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 1
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- 102000001888 Glutamate-tRNA ligase Human genes 0.000 description 1
- 108010015514 Glutamate-tRNA ligase Proteins 0.000 description 1
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 1
- JPXNYFOHTHSREU-UWVGGRQHSA-N Gly-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN JPXNYFOHTHSREU-UWVGGRQHSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 1
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 1
- HHSOPSCKAZKQHQ-PEXQALLHSA-N Gly-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN HHSOPSCKAZKQHQ-PEXQALLHSA-N 0.000 description 1
- ICUTTWWCDIIIEE-BQBZGAKWSA-N Gly-Met-Asn Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN ICUTTWWCDIIIEE-BQBZGAKWSA-N 0.000 description 1
- 108091027874 Group I catalytic intron Proteins 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241000205063 Haloarcula marismortui Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- FSOXZQBMPBQKGJ-QSFUFRPTSA-N His-Ile-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]([NH3+])CC1=CN=CN1 FSOXZQBMPBQKGJ-QSFUFRPTSA-N 0.000 description 1
- ZRSJXIKQXUGKRB-TUBUOCAGSA-N His-Ile-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZRSJXIKQXUGKRB-TUBUOCAGSA-N 0.000 description 1
- FHGVHXCQMJWQPK-SRVKXCTJSA-N His-Lys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O FHGVHXCQMJWQPK-SRVKXCTJSA-N 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- CKRJBQJIGOEKMC-SRVKXCTJSA-N His-Lys-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CKRJBQJIGOEKMC-SRVKXCTJSA-N 0.000 description 1
- AYUOWUNWZGTNKB-ULQDDVLXSA-N His-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AYUOWUNWZGTNKB-ULQDDVLXSA-N 0.000 description 1
- GYXDQXPCPASCNR-NHCYSSNCSA-N His-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N GYXDQXPCPASCNR-NHCYSSNCSA-N 0.000 description 1
- QLBXWYXMLHAREM-PYJNHQTQSA-N His-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CN=CN1)N QLBXWYXMLHAREM-PYJNHQTQSA-N 0.000 description 1
- 102000029746 Histidine-tRNA Ligase Human genes 0.000 description 1
- 101710177011 Histidine-tRNA ligase, cytoplasmic Proteins 0.000 description 1
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 1
- PMMYEEVYMWASQN-DMTCNVIQSA-N Hydroxyproline Chemical compound O[C@H]1CN[C@H](C(O)=O)C1 PMMYEEVYMWASQN-DMTCNVIQSA-N 0.000 description 1
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 1
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 1
- QYZYJFXHXYUZMZ-UGYAYLCHSA-N Ile-Asn-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N QYZYJFXHXYUZMZ-UGYAYLCHSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- JHCVYQKVKOLAIU-NAKRPEOUSA-N Ile-Cys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N JHCVYQKVKOLAIU-NAKRPEOUSA-N 0.000 description 1
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 1
- DMSVBUWGDLYNLC-IAVJCBSLSA-N Ile-Ile-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 DMSVBUWGDLYNLC-IAVJCBSLSA-N 0.000 description 1
- KBAPKNDWAGVGTH-IGISWZIWSA-N Ile-Ile-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KBAPKNDWAGVGTH-IGISWZIWSA-N 0.000 description 1
- XDUVMJCBYUKNFJ-MXAVVETBSA-N Ile-Lys-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N XDUVMJCBYUKNFJ-MXAVVETBSA-N 0.000 description 1
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 1
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- WRDTXMBPHMBGIB-STECZYCISA-N Ile-Tyr-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 WRDTXMBPHMBGIB-STECZYCISA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- WIYDLTIBHZSPKY-HJWJTTGWSA-N Ile-Val-Phe Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WIYDLTIBHZSPKY-HJWJTTGWSA-N 0.000 description 1
- APQYGMBHIVXFML-OSUNSFLBSA-N Ile-Val-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N APQYGMBHIVXFML-OSUNSFLBSA-N 0.000 description 1
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 1
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 1
- 238000012404 In vitro experiment Methods 0.000 description 1
- IMQLKJBTEOYOSI-GPIVLXJGSA-N Inositol-hexakisphosphate Chemical compound OP(O)(=O)O[C@H]1[C@H](OP(O)(O)=O)[C@@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@H](OP(O)(O)=O)[C@@H]1OP(O)(O)=O IMQLKJBTEOYOSI-GPIVLXJGSA-N 0.000 description 1
- 108700001097 Insect Genes Proteins 0.000 description 1
- 102000029793 Isoleucine-tRNA ligase Human genes 0.000 description 1
- 101710176147 Isoleucine-tRNA ligase, cytoplasmic Proteins 0.000 description 1
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 125000002842 L-seryl group Chemical group O=C([*])[C@](N([H])[H])([H])C([H])([H])O[H] 0.000 description 1
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 1
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 1
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- ZGUMORRUBUCXEH-AVGNSLFASA-N Leu-Lys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZGUMORRUBUCXEH-AVGNSLFASA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 1
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 1
- 108010071170 Leucine-tRNA ligase Proteins 0.000 description 1
- 102100023342 Leucine-tRNA ligase, mitochondrial Human genes 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- JYXBNQOKPRQNQS-YTFOTSKYSA-N Lys-Ile-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JYXBNQOKPRQNQS-YTFOTSKYSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- LMGNWHDWJDIOPK-DKIMLUQUSA-N Lys-Phe-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LMGNWHDWJDIOPK-DKIMLUQUSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 1
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241001599018 Melanogaster Species 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- JHDNAOVJJQSMMM-GMOBBJLQSA-N Met-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCSC)N JHDNAOVJJQSMMM-GMOBBJLQSA-N 0.000 description 1
- ODFBIJXEWPWSAN-CYDGBPFRSA-N Met-Ile-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O ODFBIJXEWPWSAN-CYDGBPFRSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- 241000203407 Methanocaldococcus jannaschii Species 0.000 description 1
- 102000004587 Methionine-tRNA ligase Human genes 0.000 description 1
- 108010003060 Methionine-tRNA ligase Proteins 0.000 description 1
- 241000186359 Mycobacterium Species 0.000 description 1
- 241001138504 Mycoplasma bovis Species 0.000 description 1
- DTERQYGMUDWYAZ-ZETCQYMHSA-N N(6)-acetyl-L-lysine Chemical compound CC(=O)NCCCC[C@H]([NH3+])C([O-])=O DTERQYGMUDWYAZ-ZETCQYMHSA-N 0.000 description 1
- CYZKJBZEIFWZSR-LURJTMIESA-N N(alpha)-methyl-L-histidine Chemical compound CN[C@H](C(O)=O)CC1=CNC=N1 CYZKJBZEIFWZSR-LURJTMIESA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- JJIHLJJYMXLCOY-BYPYZUCNSA-N N-acetyl-L-serine Chemical compound CC(=O)N[C@@H](CO)C(O)=O JJIHLJJYMXLCOY-BYPYZUCNSA-N 0.000 description 1
- PYUSHNKNPOHWEZ-YFKPBYRVSA-N N-formyl-L-methionine Chemical compound CSCC[C@@H](C(O)=O)NC=O PYUSHNKNPOHWEZ-YFKPBYRVSA-N 0.000 description 1
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 1
- 241000224438 Naegleria fowleri Species 0.000 description 1
- 241000221961 Neurospora crassa Species 0.000 description 1
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 1
- 244000061176 Nicotiana tabacum Species 0.000 description 1
- 240000007594 Oryza sativa Species 0.000 description 1
- 235000007164 Oryza sativa Nutrition 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- FXPZZKBHNOMLGA-HJWJTTGWSA-N Phe-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FXPZZKBHNOMLGA-HJWJTTGWSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- KPEIBEPEUAZWNS-ULQDDVLXSA-N Phe-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KPEIBEPEUAZWNS-ULQDDVLXSA-N 0.000 description 1
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- 102000002798 Phenylalanine-tRNA Ligase Human genes 0.000 description 1
- 108010004478 Phenylalanine-tRNA Ligase Proteins 0.000 description 1
- IMQLKJBTEOYOSI-UHFFFAOYSA-N Phytic acid Natural products OP(O)(=O)OC1C(OP(O)(O)=O)C(OP(O)(O)=O)C(OP(O)(O)=O)C(OP(O)(O)=O)C1OP(O)(O)=O IMQLKJBTEOYOSI-UHFFFAOYSA-N 0.000 description 1
- 108010020346 Polyglutamic Acid Proteins 0.000 description 1
- 239000004793 Polystyrene Substances 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- PUQRDHNIOONJJN-AVGNSLFASA-N Pro-Lys-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PUQRDHNIOONJJN-AVGNSLFASA-N 0.000 description 1
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 1
- 101710096715 Probable histidine-tRNA ligase, cytoplasmic Proteins 0.000 description 1
- 101710149031 Probable isoleucine-tRNA ligase, cytoplasmic Proteins 0.000 description 1
- 102000007327 Protamines Human genes 0.000 description 1
- 108010007568 Protamines Proteins 0.000 description 1
- 101100408135 Pseudomonas aeruginosa (strain ATCC 15692 / DSM 22644 / CIP 104116 / JCM 14847 / LMG 12228 / 1C / PRS 101 / PAO1) phnA gene Proteins 0.000 description 1
- 241000700157 Rattus norvegicus Species 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 101150080963 S4 gene Proteins 0.000 description 1
- 229940124639 Selective inhibitor Drugs 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- LDEBVRIURYMKQS-WISUUJSJSA-N Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CO LDEBVRIURYMKQS-WISUUJSJSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 1
- 108010030161 Serine-tRNA ligase Proteins 0.000 description 1
- 102100040516 Serine-tRNA ligase, cytoplasmic Human genes 0.000 description 1
- 244000139010 Spilanthes oleracea Species 0.000 description 1
- 241000295644 Staphylococcaceae Species 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 241000194017 Streptococcus Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- 241000701093 Suid alphaherpesvirus 1 Species 0.000 description 1
- 241000205098 Sulfolobus acidocaldarius Species 0.000 description 1
- 241000205091 Sulfolobus solfataricus Species 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 1
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 1
- CGCMNOIQVAXYMA-UNQGMJICSA-N Thr-Met-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CGCMNOIQVAXYMA-UNQGMJICSA-N 0.000 description 1
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- 102000001618 Threonine-tRNA Ligase Human genes 0.000 description 1
- 108010029287 Threonine-tRNA ligase Proteins 0.000 description 1
- 241000096130 Toxopus brucei Species 0.000 description 1
- GSCPHMSPGQSZJT-JYBASQMISA-N Trp-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O GSCPHMSPGQSZJT-JYBASQMISA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- QYSBJAUCUKHSLU-JYJNAYRXSA-N Tyr-Arg-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O QYSBJAUCUKHSLU-JYJNAYRXSA-N 0.000 description 1
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- JIODCDXKCJRMEH-NHCYSSNCSA-N Val-Arg-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N JIODCDXKCJRMEH-NHCYSSNCSA-N 0.000 description 1
- UBTBGUDNDFZLGP-SRVKXCTJSA-N Val-Arg-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UBTBGUDNDFZLGP-SRVKXCTJSA-N 0.000 description 1
- DCOOGDCRFXXQNW-ZKWXMUAHSA-N Val-Asn-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N DCOOGDCRFXXQNW-ZKWXMUAHSA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- DHINLYMWMXQGMQ-IHRRRGAJSA-N Val-His-His Chemical compound C([C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 DHINLYMWMXQGMQ-IHRRRGAJSA-N 0.000 description 1
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 1
- WNZSAUMKZQXHNC-UKJIMTQDSA-N Val-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N WNZSAUMKZQXHNC-UKJIMTQDSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 1
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- GTACFKZDQFTVAI-STECZYCISA-N Val-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 GTACFKZDQFTVAI-STECZYCISA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 102000013625 Valine-tRNA Ligase Human genes 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 235000011054 acetic acid Nutrition 0.000 description 1
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 1
- 239000004480 active ingredient Substances 0.000 description 1
- 238000003450 affinity purification method Methods 0.000 description 1
- 230000004520 agglutination Effects 0.000 description 1
- 101150115889 al gene Proteins 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 108010087924 alanylproline Proteins 0.000 description 1
- 239000000783 alginic acid Substances 0.000 description 1
- 235000010443 alginic acid Nutrition 0.000 description 1
- 229920000615 alginic acid Polymers 0.000 description 1
- 229960001126 alginic acid Drugs 0.000 description 1
- 150000004781 alginic acids Chemical class 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 230000006229 amino acid addition Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 230000000844 anti-bacterial effect Effects 0.000 description 1
- 230000000843 anti-fungal effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 230000000890 antigenic effect Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010059459 arginyl-threonyl-phenylalanine Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000001042 autoregulative effect Effects 0.000 description 1
- DMLAVOWQYNRWNQ-UHFFFAOYSA-N azobenzene Chemical compound C1=CC=CC=C1N=NC1=CC=CC=C1 DMLAVOWQYNRWNQ-UHFFFAOYSA-N 0.000 description 1
- 208000022362 bacterial infectious disease Diseases 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000010256 biochemical assay Methods 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 239000012620 biological material Substances 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 239000011449 brick Substances 0.000 description 1
- 239000008366 buffered solution Substances 0.000 description 1
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 235000010948 carboxy methyl cellulose Nutrition 0.000 description 1
- 239000008112 carboxymethyl-cellulose Substances 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 238000001142 circular dichroism spectrum Methods 0.000 description 1
- 235000015165 citric acid Nutrition 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000000749 co-immunoprecipitation Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000012875 competitive assay Methods 0.000 description 1
- 230000006957 competitive inhibition Effects 0.000 description 1
- 239000002131 composite material Substances 0.000 description 1
- 239000000356 contaminant Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 238000012866 crystallographic experiment Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 125000001295 dansyl group Chemical group [H]C1=C([H])C(N(C([H])([H])[H])C([H])([H])[H])=C2C([H])=C([H])C([H])=C(C2=C1[H])S(*)(=O)=O 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- YSMODUONRAFBET-UHFFFAOYSA-N delta-DL-hydroxylysine Natural products NCC(O)CCC(N)C(O)=O YSMODUONRAFBET-UHFFFAOYSA-N 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000009795 derivation Methods 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- XPPKVPWEQAFLFU-UHFFFAOYSA-J diphosphate(4-) Chemical compound [O-]P([O-])(=O)OP([O-])([O-])=O XPPKVPWEQAFLFU-UHFFFAOYSA-J 0.000 description 1
- 235000011180 diphosphates Nutrition 0.000 description 1
- 230000005059 dormancy Effects 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- YSMODUONRAFBET-UHNVWZDZSA-N erythro-5-hydroxy-L-lysine Chemical compound NC[C@H](O)CC[C@H](N)C(O)=O YSMODUONRAFBET-UHNVWZDZSA-N 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000008713 feedback mechanism Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000002523 gelfiltration Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 150000002334 glycols Chemical class 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 1
- 108010090037 glycyl-alanyl-isoleucine Proteins 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010020688 glycylhistidine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 229910052739 hydrogen Inorganic materials 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- TUJKJAMUKRIRHC-UHFFFAOYSA-N hydroxyl Chemical compound [OH] TUJKJAMUKRIRHC-UHFFFAOYSA-N 0.000 description 1
- 229960002591 hydroxyproline Drugs 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 230000008073 immune recognition Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000001990 intravenous administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 229910052747 lanthanoid Inorganic materials 0.000 description 1
- 150000002602 lanthanoids Chemical class 0.000 description 1
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 108010054155 lysyllysine Proteins 0.000 description 1
- 239000001630 malic acid Substances 0.000 description 1
- 235000011090 malic acid Nutrition 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229960004452 methionine Drugs 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 150000007522 mineralic acids Chemical class 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000000329 molecular dynamics simulation Methods 0.000 description 1
- 238000012900 molecular simulation Methods 0.000 description 1
- 238000001216 nucleic acid method Methods 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 150000007524 organic acids Chemical class 0.000 description 1
- 235000005985 organic acids Nutrition 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- FJKROLUGYXJWQN-UHFFFAOYSA-N papa-hydroxy-benzoic acid Natural products OC(=O)C1=CC=C(O)C=C1 FJKROLUGYXJWQN-UHFFFAOYSA-N 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- 125000001151 peptidyl group Chemical group 0.000 description 1
- 239000003090 pesticide formulation Substances 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- BZQFBWGGLXLEPQ-REOHCLBHSA-N phosphoserine Chemical compound OC(=O)[C@@H](N)COP(O)(O)=O BZQFBWGGLXLEPQ-REOHCLBHSA-N 0.000 description 1
- DCWXELXMIBXGTH-UHFFFAOYSA-N phosphotyrosine Chemical compound OC(=O)C(N)CC1=CC=C(OP(O)(O)=O)C=C1 DCWXELXMIBXGTH-UHFFFAOYSA-N 0.000 description 1
- 230000000243 photosynthetic effect Effects 0.000 description 1
- 235000002949 phytic acid Nutrition 0.000 description 1
- 239000000467 phytic acid Substances 0.000 description 1
- 229940068041 phytic acid Drugs 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 229920002643 polyglutamic acid Polymers 0.000 description 1
- 229920002223 polystyrene Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 125000001436 propyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 229940048914 protamine Drugs 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 229940107700 pyruvic acid Drugs 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 210000004708 ribosome subunit Anatomy 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 229960004889 salicylic acid Drugs 0.000 description 1
- 238000005185 salting out Methods 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010026333 seryl-proline Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 238000007920 subcutaneous administration Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- BDHFUVZGWQCTTF-UHFFFAOYSA-M sulfonate Chemical compound [O-]S(=O)=O BDHFUVZGWQCTTF-UHFFFAOYSA-M 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 230000004797 therapeutic response Effects 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 101150044170 trpE gene Proteins 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- 238000007039 two-step reaction Methods 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229960005486 vaccine Drugs 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 238000002424 x-ray crystallography Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Classifications
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
- G01N33/48—Biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/53—Immunoassay; Biospecific binding assay; Materials therefor
- G01N33/5308—Immunoassay; Biospecific binding assay; Materials therefor for analytes not provided for elsewhere, e.g. nucleic acids, uric acid, worms, mites
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/04—Antibacterial agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P39/00—General protective or antinoxious agents
- A61P39/06—Free radical scavengers or antioxidants
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/32—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Bacillus (G)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/93—Ligases (6)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y601/00—Ligases forming carbon-oxygen bonds (6.1)
- C12Y601/01—Ligases forming aminoacyl-tRNA and related compounds (6.1.1)
- C12Y601/01001—Tyrosine-tRNA ligase (6.1.1.1)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N2500/00—Screening for compounds of potential therapeutic value
- G01N2500/04—Screening involving studying the effect of compounds C directly on molecule A (e.g. C are potential ligands for a receptor A, or potential substrates for an enzyme A)
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A50/00—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE in human health protection, e.g. against extreme weather
- Y02A50/30—Against vector-borne diseases, e.g. mosquito-borne, fly-borne, tick-borne or waterborne diseases whose impact is exacerbated by climate change
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- General Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Medicinal Chemistry (AREA)
- Biochemistry (AREA)
- Molecular Biology (AREA)
- Genetics & Genomics (AREA)
- Veterinary Medicine (AREA)
- Animal Behavior & Ethology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biomedical Technology (AREA)
- Zoology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Immunology (AREA)
- Public Health (AREA)
- Wood Science & Technology (AREA)
- Microbiology (AREA)
- General Engineering & Computer Science (AREA)
- Oncology (AREA)
- Communicable Diseases (AREA)
- Biotechnology (AREA)
- Urology & Nephrology (AREA)
- Hematology (AREA)
- Tropical Medicine & Parasitology (AREA)
- Cell Biology (AREA)
- Food Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- General Physics & Mathematics (AREA)
- Pathology (AREA)
- Virology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Gastroenterology & Hepatology (AREA)
Abstract
There is disclosed a substantially pure peptide comprising the following sequence: [ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC], wherein X represents any amino acid. Also disclosed is a method for determining whether a nucleic acid comprises a tRNA anticodon stem-loop. Another aspect comprises a method for identifying a substance which binds to a wherein the binding is detected by assaying for conjugates, for free substance, or for non-complexed peptide. Also described is a method of determining whether a test compound is an agonist or antagonist of a tRNA synthetase-tRNA anticodon stem-loop interaction or ribosomal S4 protein-tRNA anticodon stem-loop interaction. There is also described a method for obtaining a substantially pure nucleic acid comprising a tRNA anticodon stem-loop from a mixture of different nucleic acids. Finally, there is described a pharmaceutical composition for inhibiting the interaction of a tRNA with a tRNA synthetase or ribosomal protein, along with other related aspects.
Description
TITLE: NOVEL tRNA BINDING DOMAIN
FIELD OF THE INVENTION
The invention relates to a novel domain that binds to tRNA; peptides derived from the domain which modulate the interaction of tRNA with proteins including tRNA synthetases and ribosomal proteins; and uses of the peptides. The invention also relates to complexes comprising a domain or peptide of the invention and a tRNA or portion thereof.
BACKGROUND OF THE INVENTION
A number of important cellular activities are carried out by protein-tRNA complexes. The recognition of tRNAs by their cognate aminoacyl-tRNA synthetases is the critical step in the translation of the genetic code. The tRNA synthetases catalyze the aminoacylation of tRNA in a two-step reaction. The amino acid is first activated with ATP to form an amino acid adenylate and pyrophosphate, then the adenylate is attacked by the 3'-terminal ribose of the tRNA to form amino acid-tRNA and AMP.
Ribosomes interact with aminoacyl tRNA, peptidyl tRNA, and exiting tRNA, and these interactions account for the codon-anticodon interaction between mRNA and tRNA, the correct positioning of tRNA acceptor and donor arms during peptide bond formation, and the movement of mRNA relative to the ribosome. Ribosomal protein S4 is a multifunctional protein associated with the 30S subunit, comprising 206 amino acids in E. coli, which has been implicated in the binding of aminoacyl tRNA.
Inhibition of tRNA-protein interactions leads to a reduction in protein translation, triggering a cascade of responses. For example, in prokaryotes, inhibition of tRNA synthetases may lead to a state of dormancy in the organism. Therefore, selective inhibitors of bacterial or fungal protein biosynthesis have potential as antibacterial agents. Inhibitors are also potentially useful as anti-viral agents, immunotoxins, and plant toxins.
SUMMARY OF THE INVENTION
The present inventor has identified a novel domain within the C-terminus of a tyrosyl-tRNA synthetase and the ribosomal S4 protein that mediates the binding of the proteins to a tRNA anticodon stem-loop.
Broadly stated the present invention provides a novel domain and peptides derived therefrom having the following sequence:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid.
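As an illustration only, the motif can be read as a PROSITE-style pattern and translated into a regular expression for scanning protein sequences. The sketch below is not part of the specification: the function names `prosite_to_regex` and `find_motif` are hypothetical, and it assumes that X stands for any one of the twenty conventional amino acids in one-letter code.

```python
import re

# The motif as stated in the text, in PROSITE-style notation.
MOTIF = (
    "[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-"
    "[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-"
    "[LIVSQ]-X(2)-[PILVTAC]"
)

# Assumption: X means any of the twenty conventional amino acids.
ANY_RESIDUE = "[ACDEFGHIKLMNPQRSTVWY]"


def prosite_to_regex(pattern):
    """Translate the subset of PROSITE syntax used above into a regex string."""
    parts = []
    for element in pattern.split("-"):
        wildcard = re.fullmatch(r"X(?:\((\d+)\))?", element)
        if wildcard:
            count = wildcard.group(1) or "1"
            parts.append(ANY_RESIDUE + "{" + count + "}")
        elif re.fullmatch(r"\[[A-Z]+\]", element):
            parts.append(element)  # a residue class is already valid regex
        else:
            raise ValueError("unsupported pattern element: " + element)
    return "".join(parts)


def find_motif(sequence):
    """Return (start index, matched segment) for each non-overlapping match."""
    regex = re.compile(prosite_to_regex(MOTIF))
    return [(m.start(), m.group()) for m in regex.finditer(sequence.upper())]
```

Every match spans 36 residues, since the pattern contains 14 constrained positions and 22 unconstrained ones. A regular-expression scan of this kind only reports sequence-level matches; it says nothing about whether a matching segment actually binds a tRNA anticodon stem-loop.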
WO 00/11141 PCT/CA99/00779
In accordance with an embodiment of the invention, peptides are provided comprising the sequence motif Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14, where Y1 is methionine or leucine, preferably leucine; Y2 is leucine, isoleucine, valine, methionine, or cysteine, preferably isoleucine, leucine, or valine; Y3 is glycine, methionine, asparagine, glutamic acid, histidine, or lysine, preferably glycine; Y4 is methionine, leucine, phenylalanine, tyrosine, or isoleucine, preferably phenylalanine, leucine, methionine, or tyrosine; Y5 is serine or threonine; Y6 is serine, alanine, isoleucine, or glycine, preferably alanine; Y7 is arginine, methionine, isoleucine, or lysine, preferably arginine; Y8 is methionine, isoleucine, valine, or alanine, preferably valine or isoleucine; Y9 is glycine, asparagine, arginine, histidine, or lysine, preferably lysine, glycine, or arginine; Y10 is isoleucine, valine, leucine, or phenylalanine, preferably valine or isoleucine; Y11 is leucine, isoleucine, or valine, preferably valine or isoleucine; Y12 is asparagine, aspartic acid, serine, arginine, glycine, alanine, or leucine, preferably asparagine, aspartic acid, or glycine; Y13 is leucine, isoleucine, valine, serine, or glutamine, preferably glutamine or valine; Y14 is proline, isoleucine, leucine, valine, threonine, alanine, cysteine, or serine, preferably proline or valine; and X is any amino acid.
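The preferred residues listed for Y1 through Y14 define a narrower pattern than the general motif. The following sketch, offered as an illustration rather than as part of the disclosure, assembles that narrower pattern and checks whether a 36-residue peptide satisfies it. The names `PREFERRED`, `GAPS`, and `matches_preferred` are hypothetical, the full class is kept for Y5 because no preference is stated for it, and X is assumed to be any of the twenty conventional amino acids.

```python
import re

# Preferred residues for Y1..Y14 (one-letter codes), transcribed from the text.
# Y5 has no stated preference, so serine and threonine are both kept.
PREFERRED = ["L", "ILV", "G", "FLMY", "ST", "A", "R",
             "VI", "KGR", "VI", "VI", "NDG", "QV", "PV"]

# Number of unconstrained X positions between Y(i) and Y(i+1).
GAPS = [3, 3, 0, 2, 3, 0, 2, 2, 1, 1, 0, 3, 2]

# Assumption: X means any of the twenty conventional amino acids.
ANY_RESIDUE = "[ACDEFGHIKLMNPQRSTVWY]"


def build_preferred_regex():
    """Interleave the preferred residue classes with the X spacers."""
    parts = []
    for i, residues in enumerate(PREFERRED):
        parts.append("[" + residues + "]")
        if i < len(GAPS) and GAPS[i] > 0:
            parts.append(ANY_RESIDUE + "{" + str(GAPS[i]) + "}")
    return re.compile("".join(parts))


def matches_preferred(peptide):
    """True if the whole peptide fits the preferred-residue pattern."""
    return build_preferred_regex().fullmatch(peptide.upper()) is not None
```

A peptide that begins with methionine, for example, fits the general motif at Y1 but is rejected here, because only leucine is listed as preferred at that position.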
The invention also provides biologically, diagnostically, prophylactically, clinically, or therapeutically useful variants thereof, and compositions comprising the peptides and variants. In particular, the invention contemplates truncations and analogs of the peptides of the invention.
The present invention also relates to a complex comprising a peptide having the following sequence motif:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid, with a tRNA anticodon stem-loop.
The invention also contemplates antibodies specific for the complexes and peptides of the invention.
The invention also relates to the use of a peptide or complex of the invention to interfere with the interaction of a tRNA anticodon stem-loop (e.g. tRNA) with proteins comprising a domain of the invention, including tRNA synthetases or ribosomal proteins, and to pharmaceutical compositions for inhibiting the interaction of a tRNA anticodon stem-loop (e.g. tRNA) with proteins including tRNA synthetases or ribosomal proteins. The peptides, compositions and antibodies may be used to interfere with protein synthesis, and they may be used as antibacterial agents, anti-viral agents, immunotoxins, or plant toxins.
Further, the invention relates to a method of modulating protein synthesis and in particular the interaction of a tRNA anticodon stem-loop (e.g. tRNA) with a tRNA synthetase or ribosomal protein, comprising changing the following sequence motif in a tRNA synthetase or ribosomal protein:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC], wherein X is any amino acid.
The present invention also provides a method for determining whether a nucleic acid comprises a tRNA anticodon stem-loop. The method comprises the steps of contacting a nucleic acid with a peptide of the invention and determining whether the peptide binds to the nucleic acid. The binding of the peptide to the nucleic acid is indicative that the nucleic acid comprises a tRNA anticodon stem-loop.
In another embodiment, the present invention provides a method of determining whether a test compound is an agonist or antagonist of the interaction between a protein comprising a domain of the invention (i.e. a tRNA anticodon stem-loop recognition motif) and a tRNA anticodon stem-loop, including a tRNA synthetase-tRNA anticodon stem-loop interaction or ribosomal S4 protein-tRNA anticodon stem-loop interaction. The method comprises the steps of incubating the test compound with a nucleic acid comprising a tRNA anticodon stem-loop and a peptide of the invention, determining the amount of nucleic acid bound to the peptide during the incubating step, and comparing that amount to the amount of nucleic acid bound to peptide in the absence of the test compound. An increase in the amount of nucleic acid bound to peptide in the presence of the test compound indicates that the test compound is an agonist of the interaction, while a decrease indicates that the test compound is an antagonist of the interaction.
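The comparison step of this method reduces to a simple decision rule, sketched below for illustration. The function name `classify_test_compound`, the units of the measurements, and the `tolerance` parameter are assumptions introduced here; the text itself states only that an increase in bound nucleic acid indicates an agonist and a decrease indicates an antagonist.

```python
def classify_test_compound(bound_with_compound, bound_without_compound,
                           tolerance=0.0):
    """Compare nucleic acid bound to peptide with and without a test compound.

    Both amounts must be in the same units. `tolerance` is a hypothetical
    allowance for measurement noise and is not taken from the text.
    """
    if bound_with_compound < 0 or bound_without_compound < 0:
        raise ValueError("bound amounts cannot be negative")
    difference = bound_with_compound - bound_without_compound
    if difference > tolerance:
        return "agonist"      # more nucleic acid bound in the presence of the compound
    if difference < -tolerance:
        return "antagonist"   # less nucleic acid bound in the presence of the compound
    return "no effect detected"
```

In practice the tolerance would need to come from replicate measurements of the control; a value of zero treats any difference, however small, as meaningful.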
In an additional embodiment, a method is provided for obtaining a substantially pure nucleic acid comprising a tRNA anticodon stem-loop from a mixture of different nucleic acids. The method comprises the step of providing a peptide of the invention bound to a solid support. The mixture of different nucleic acids is contacted with the peptide bound to the solid support, whereby a nucleic acid comprising a tRNA anticodon stem-loop is bound to the peptide. The solid support is washed to remove unbound nucleic acids, and substantially pure nucleic acids comprising a tRNA anticodon stem-loop are then eluted from the solid support.
Other objects, features and advantages of the present invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
DESCRIPTION OF THE DRAWINGS
The invention will be better understood with reference to the drawings in which:
Figure 1. The hydrophobic motif shared between ribosomal S4 protein and tyrosyl-tRNA synthetase. Yellow residues comprise the hydrophobic core. An invariant Ser/Thr residue shown in green delimits the N-terminal end of a common central helix. Arginine residues shown in the B. stearothermophilus TyrRS (red blocks) were shown to bind to tRNA(Tyr) through site-directed mutagenesis.
Figure 2. Threading alignment of Synechocystis TyrRS residues 342-404 and the fragment 94-159 from the B. stearothermophilus S4 3-D structure. Boxed regions are aligned by the procedure and identical residues are shown in bold. A suboptimal alignment in the threading ensemble has an altered alignment of the third block that closes both gaps, but it has a slightly lower Z-score. Starting with a 3-D core comprising 38% of this structure, shown with the wavy lines, the final threading alignment recruited a total of 80% of residues in the S4 structure fragment with a Z-score of 4.55. The self thread recruited 91% of residues with a Z-score of 6.3. Randomized sequences with the same composition made poor alignments with very many alternative suboptimal alignments.
Figure 3. Cartoon image of the B. stearothermophilus ribosomal S4 structure fragment showing the TyrRS similarities. The blue filled backbone cartoon corresponds to the motif shown in Figure 1. The entire blue and pink filled backbone cartoon represents the substructure that was used in the threading analysis. The backbone drawn with pink lines corresponds to the N-terminus. The grey lines correspond to the C-terminus of S4 that is not in common with TyrRS. This fragment is completely missing in the archaeal S4 proteins, and may not be structurally required to form the motif fold shown in blue/pink. Yellow residues represent the hydrophobic core and correspond to yellow residues in the motif shown in Figure 1. Blue residues represent basic side chains that align with those of TyrRS previously shown to interact with tRNA(Tyr). The green residue shows the position of the conserved threonine/serine in the motif as a helical N-cap. The helix at the top left of the structure (pink lines) corresponds to the site of ram mutations, while the helix at the bottom right (solid pink) corresponds to the Ets-domain DNA binding helix similarity.
Figure 4. Neighbor joining clustering using ClustalX of the motif in Figure 1. Information in this motif is sufficient to reconstruct, from bottom to top, clusters corresponding to archaea, eukaryotes, chloroplasts and their photosynthetic relatives, and eubacteria. Mitochondrial S4 sequences and TyrRS sequences diverge from the cluster in an atypical fashion, suggesting a change in evolutionary rates. This divergence may correspond to a systematic decrease in the population of cognate tRNA from >20 to one or two in the case of TyrRS, together with a mixing of cytoplasmic and mitochondrial tRNAs for use in the mitochondrion.
Glossary
The following standard abbreviations for the amino acid residues are used throughout the specification: A, Ala - alanine; C, Cys - cysteine; D, Asp - aspartic acid; E, Glu - glutamic acid; F, Phe - phenylalanine; G, Gly - glycine; H, His - histidine; I, Ile - isoleucine; K, Lys - lysine; L, Leu - leucine; M, Met - methionine; N, Asn - asparagine; P, Pro - proline; Q, Gln - glutamine; R, Arg - arginine; S, Ser - serine; T, Thr - threonine; V, Val - valine; W, Trp - tryptophan; Y, Tyr - tyrosine; and p.Y., P.Tyr - phosphotyrosine.
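For readers working with the bracketed residue classes used in the motif, the abbreviations above can be held in a lookup table. The sketch below is illustrative only; the names `AMINO_ACIDS` and `expand_class` are hypothetical, and phosphotyrosine is omitted because it has no single-letter code in the list.

```python
# One-letter code -> (three-letter code, full name), as listed in the glossary.
AMINO_ACIDS = {
    "A": ("Ala", "alanine"),       "C": ("Cys", "cysteine"),
    "D": ("Asp", "aspartic acid"), "E": ("Glu", "glutamic acid"),
    "F": ("Phe", "phenylalanine"), "G": ("Gly", "glycine"),
    "H": ("His", "histidine"),     "I": ("Ile", "isoleucine"),
    "K": ("Lys", "lysine"),        "L": ("Leu", "leucine"),
    "M": ("Met", "methionine"),    "N": ("Asn", "asparagine"),
    "P": ("Pro", "proline"),       "Q": ("Gln", "glutamine"),
    "R": ("Arg", "arginine"),      "S": ("Ser", "serine"),
    "T": ("Thr", "threonine"),     "V": ("Val", "valine"),
    "W": ("Trp", "tryptophan"),    "Y": ("Tyr", "tyrosine"),
}


def expand_class(residue_class):
    """Spell out a bracketed residue class such as "[ST]" as full names."""
    letters = residue_class.strip("[]")
    return [AMINO_ACIDS[letter][1] for letter in letters]
```

Calling `expand_class` on a motif position such as `"[GNRHK]"` returns the residue names in the order written, which makes it easier to compare the compact pattern with the spelled-out definitions of Y1 through Y14.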
The amino acids used in the peptides of the invention are preferably in the "L" isomeric form. However, stereoisomers (e.g. D-amino acids) of the twenty conventional amino acids, unnatural amino acids such as α,α-disubstituted amino acids, N-alkyl amino acids, lactic acid, and other unconventional amino acids may also be suitable components for peptides of the present invention. Examples of unconventional or unnatural amino acids include amino acids well known in the art, but which are not included in the twenty conventional amino acids, such as 3- or 4-hydroxyproline, γ-carboxyglutamate, ε-N,N,N-trimethyllysine, ε-N-acetyllysine, O-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine, 5-hydroxylysine, ω-N-methylarginine, and other similar amino acids and imino acids.
In the peptide notation used herein, the IeRhand direction is the amino terminal direction and the righthand direction is the carboxy-terminal direction, in accordance with standard usage and convention.
T'he term "protein comprising a tRNA anticodon stem-loop recognition motif' refers to a 40 protein comprising the following sequence motif [ML]-X(3)-[LIVMC]-X(3}-[GMNEHK]-[MLFYIJ-X(2)-[ST)-X(3r[SAIG]-[RMIKJ-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIVJ-[NDSRGAL]-X(3~[LIVSQ]-X(2)~[PILVTAC], wherein X
is any amino acid. Examples of the proteins are tRNA synthetases, and ribosomal proteins.
The tenor "tRNA synthetase" refers to a protein or peptide which comprises or consists of a sequence which is capable of binding to a tRNA anticodon stem-loop. Examples of tRNA synthetases are tyrosyl tRNA synthetase [see for example Guez-Ivanier and Bedouelle (25), Brick et al. (2) and Brick and Blow (I ) re the structure of tyrosyl-tRNA synthetase from B.
stearothermophilus;
W09739015, and W09726351 j; isoleucyl tRNA synthetase (Chalker, A. F. et al Gene 141:103, 1994);
10 valyl tRNA synthetase (LJ.S. Nos. 5,789,218, and 5,747,314, W09726355, EP0785267); asparaginyl tRNA synthetase (U.S. 5,789,21, W09726348); alanyl tRNA synthetase (U.S.
5,776,750, W09739013, W09726353); cysteinyl tRNA synthetase (LJ.S. 5,775,749, U.S. 5,753,480, W09726341); arginyl tRNA synthetase (U.S. 5,763,246, W09726347, EP0785266); glycyi tRNA synthetase (U.S.
5,756,330, W09726340); phenylalanyl tRNA synthetase (U.S. 5,756,329, U.S.
5,753,479, 15 W09726356); tryptophanyl tRNA synthetase (EP0843014); leucyl tRNA
synthetase (LJS. 5,750,387, W09726349); histidyl tRNA synthetase (iJ.S. 5,747,313, W09739017, W09726354, EP0785269);
seryl tRNA synthetase (U.S. 5,744,338, W09726352, EP0785270); threonyl tRNA
synthetase (EP0815237, W0972634, EP0785271); aspartyl tRNA synthetase (U.S. 5,747,315, W09739014, W09726344); methionyl tRNA synthetase (W09726350, W09739012, EP0785268), isoleucyl tRNA
20 synthetase (W09739011); tryptophanyl tRNA synthetase (W09726346); glutamyl tRNA synthetase (WO9726345); and propyl tRNA synthetase (W09726343, EP0785272). tRNA
synthetases may also be identified by using antibodies or probes specific for the enzymes.
The term "ribosomal protein" refers to a ribosomal protein or peptide that comprises or consists of a sequence that is capable of binding tRNA and in particular binding a tRNA anticodon 25 stem-loop. Preferably the ribosomal protein is ribosomal S4 protein which is a multifunctional ribosomal protein associated with the 30S subunit, comprising 206 amino acids in E. toll. S4 is an ancient, if not one of the most ancient of ribosomal proteins (5). In the ribosome, S4 is required and it is the fu~st protein involved in the folding of 16S rRNA. (7) The term includes mutations of S4 proteins, specifically the ram (ribosomal ambiguity) D14 and D12 mutants in E. toll (8,9,10), the omnipotent 30 suppressor mutant SUP46 in yeast ( 11 ), and the NAM-9 mutants of yeast ( 12).
The term "tRNA anticodon stem-loop " refers to a nucleic acid comprising or consisting of an anticodon sequence of a tRNA or a chemically, enzymatically, or metabolically modified form thereof, which has high affinity to a domain of the invention and proteins comprising such a domain including tRNA-synthetase or ribosomal S4 protein. tRNA anticodon stem-loops may be identified using data 35 base search methods (see for example, www.molgen.uc.edu/analyze/Stem.htm~
http://mell.angis.org.au/Stadenn. tRNA anticodon stem-loops may also be identified by screening libraries with a protein containing a sequence with high affinity to a tRNA
anticodon stem-loop, i.e. a peptide of the invention which may be labeled.
FIELD OF THE INVENTION
The invention relates to a novel domain that binds to tRNA; peptides derived from the domain which modulate the interaction of tRNA with proteins including tRNA synthetases and ribosomal proteins; and uses of the peptides. The invention also relates to complexes comprising a domain or peptide of the invention and a tRNA or portion thereof.
BACKGROUND OF THE INVENTION
A number of important cellular activities are carried out by protein-tRNA complexes. The recognition of tRNAs by their cognate aminoacyl-tRNA synthetases is the critical step in the translation of the genetic code. The tRNA synthetases catalyze the aminoacylation of tRNA in a two-step reaction. The amino acid is first activated with ATP to form an amino acid adenylate and pyrophosphate, then the adenylate is attacked by the 3'-terminal ribose of the tRNA to form amino acid-tRNA and AMP.
Ribosomes interact with aminoacyl tRNA, peptidyl tRNA, and exiting tRNA, and these interactions account for the codon-anticodon interaction between mRNA and tRNA, the correct positioning of tRNA acceptor and donor arms during peptide bond formation, and the movement of mRNA relative to the ribosome. Ribosomal protein S4 is a multifunctional protein associated with the 30S subunit, comprising 206 amino acids in E. coli, which has been implicated in the binding of aminoacyl tRNA.
Inhibition of tRNA-protein interactions leads to a reduction in protein translation, triggering a cascade of responses. For example, in prokaryotes, inhibition of tRNA synthetases may lead to a state of dormancy in the organism. Therefore, selective inhibitors of bacterial or fungal protein biosynthesis have potential as antibacterial agents. Inhibitors are also potentially useful as anti-viral agents, immunotoxins, and plant toxins.
SUMMARY OF THE INVENTION
The present inventor has identified a novel domain within the C-terminus of a tyrosyl-tRNA synthetase and the ribosomal S4 protein that mediates the binding of the proteins to a tRNA anticodon stem-loop.
Broadly stated the present invention provides a novel domain and peptides derived therefrom having the following sequence:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid.
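The bracket notation above appears to follow the PROSITE pattern convention, so it can be translated mechanically into a regular expression. The following Python sketch is illustrative only and is not part of the original disclosure: the names `prosite_to_regex` and `find_motif` are hypothetical, only the subset of the syntax used in the motif is handled, and treating X as "any single character" is a simplifying assumption.

```python
import re

# Motif transcribed from the specification; X(n) means n residues of any kind.
MOTIF = (
    "[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-"
    "X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-"
    "[PILVTAC]"
)


def prosite_to_regex(pattern):
    """Translate the subset of PROSITE syntax used above into a regex string."""
    parts = []
    for element in pattern.split("-"):
        wildcard = re.fullmatch(r"X(?:\((\d+)\))?", element)
        if wildcard:
            # Bare X counts as one position; X(n) as n positions.
            count = int(wildcard.group(1) or 1)
            parts.append(".{%d}" % count)
        elif re.fullmatch(r"\[[A-Z]+\]", element):
            # A bracketed residue class is already valid regex syntax.
            parts.append(element)
        else:
            raise ValueError("unsupported pattern element: " + element)
    return "".join(parts)


def find_motif(sequence):
    """Return (start, segment) for every occurrence, overlapping ones included."""
    # The lookahead lets the scan restart at every position of the sequence.
    regex = re.compile("(?=(" + prosite_to_regex(MOTIF) + "))")
    return [(m.start(), m.group(1)) for m in regex.finditer(sequence.upper())]
```

Calling `find_motif` on a one-letter protein sequence returns the zero-based start and the 36-residue segment of each hit; 36 is the span of the motif when the fixed positions and X positions are added up.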
In accordance with an embodiment of the invention peptides are provided comprising the sequence motif Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14 where Y1 is methionine or leucine, preferably leucine; Y2 is leucine, isoleucine, valine, methionine, or cysteine, preferably isoleucine, leucine, or valine; Y3 is glycine, methionine, asparagine, glutamic acid, histidine, or lysine, preferably glycine; Y4 is methionine, leucine, phenylalanine, tyrosine, or isoleucine, preferably phenylalanine, leucine, methionine, or tyrosine; Y5 is serine or threonine; Y6 is serine, alanine, isoleucine, or glycine, preferably alanine; Y7 is arginine, methionine, isoleucine, or lysine, preferably arginine; Y8 is methionine, isoleucine, valine, or alanine, preferably valine or isoleucine; Y9 is glycine, asparagine, arginine, histidine, or lysine, preferably lysine, glycine, or arginine; Y10 is isoleucine, valine, leucine, or phenylalanine, preferably valine or isoleucine; Y11 is leucine, isoleucine, or valine, preferably valine or isoleucine; Y12 is asparagine, aspartic acid, serine, arginine, glycine, alanine, or leucine, preferably asparagine, aspartic acid, or glycine; Y13 is leucine, isoleucine, valine, serine, or glutamine, preferably glutamine or valine; Y14 is proline, isoleucine, leucine, valine, threonine, alanine, cysteine, or serine, preferably proline or valine; and X is any amino acid.
The invention also provides biologically, diagnostically, prophylactically, clinically, or therapeutically useful variants thereof, and compositions comprising the peptides and variants. In particular, the invention contemplates truncations and analogs of the peptides of the invention.
The present invention also relates to a complex comprising a peptide having the following sequence motif:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid, with a tRNA anticodon stem-loop.
The invention also contemplates antibodies specific for the complexes and peptides of the invention.
The invention also relates to the use of a peptide or complex of the invention to interfere with the interaction of a tRNA anticodon stem-loop (e.g. tRNA) with proteins comprising a domain of the invention, including tRNA synthetases or ribosomal proteins, and to pharmaceutical compositions for inhibiting the interaction of a tRNA anticodon stem-loop (e.g. tRNA) with proteins including tRNA synthetases or ribosomal proteins. The peptides, compositions and antibodies may be used to interfere with protein synthesis and they may be used as antibacterial agents, anti-viral agents, immunotoxins, or plant toxins.
Further, the invention relates to a method of modulating protein synthesis and in particular the interaction of a tRNA anticodon stem-loop (e.g. tRNA) with a tRNA synthetase or ribosomal protein, comprising changing the following sequence motif in a tRNA synthetase or ribosomal protein:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X is any amino acid.
The present invention also provides a method for determining whether a nucleic acid comprises a tRNA anticodon stem-loop. The method comprises the steps of contacting a nucleic acid with a peptide of the invention and determining whether the peptide binds to the nucleic acid. The binding of the peptide to the nucleic acid is indicative that the nucleic acid comprises a tRNA anticodon stem-loop.
In another embodiment, the present invention provides a method of determining whether a test compound is an agonist or antagonist of the interaction between a protein comprising a domain of the invention (i.e. a tRNA anticodon stem-loop recognition motif) and a tRNA anticodon stem-loop, including a tRNA synthetase-tRNA anticodon stem-loop interaction or a ribosomal S4 protein-tRNA anticodon stem-loop interaction. The method comprises the steps of incubating the test compound with a nucleic acid comprising a tRNA anticodon stem-loop and a peptide of the invention, determining the amount of nucleic acid bound to the peptide during the incubating step, and comparing the amount of nucleic acid bound to the peptide during the incubating step to an amount of nucleic acid bound to peptide in the absence of the test compound. An increase in the amount of nucleic acid bound to peptide in the presence of the test compound will be indicative that the test compound is an agonist of an interaction, while a decrease indicates that the test compound is an antagonist of an interaction.
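The comparison at the end of this assay can be written as a small decision rule. The sketch below is illustrative only: the function name `classify_test_compound` and the `tolerance` parameter (a band within which the two measurements are treated as equal) are assumptions that do not appear in the specification, and both amounts are assumed to be expressed in the same units.

```python
def classify_test_compound(bound_with_compound, bound_without_compound, tolerance=0.0):
    """Compare nucleic acid bound to peptide with and without the test compound.

    More bound nucleic acid in the presence of the compound is read as an
    agonist effect, less as an antagonist effect. `tolerance` is a hypothetical
    allowance for measurement noise and is not taken from the source.
    """
    if bound_with_compound < 0 or bound_without_compound < 0 or tolerance < 0:
        raise ValueError("amounts and tolerance must be non-negative")
    difference = bound_with_compound - bound_without_compound
    if difference > tolerance:
        return "agonist"
    if difference < -tolerance:
        return "antagonist"
    return "no effect detected"
```

In practice the tolerance would be set from replicate measurements of the control; with the default of zero, any difference at all is classified.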
In an additional embodiment, a method is provided for obtaining a substantially pure nucleic acid comprising a tRNA anticodon stem-loop from a mixture of different nucleic acids. The method comprises the step of providing a peptide of the invention bound to a solid support. The mixture of different nucleic acids is contacted with the peptide bound to the solid support whereby a nucleic acid comprising a tRNA anticodon stem-loop is bound to the peptide. The solid support is washed to remove unbound nucleic acids, and substantially pure nucleic acids comprising a tRNA anticodon stem-loop are then eluted from the solid support.
Other objects, features and advantages of the present invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.
DESCRIPTION OF THE DRAWINGS
The invention will be better understood with reference to the drawings in which:
Figure 1. The hydrophobic motif shared between ribosomal S4 protein and tyrosyl-tRNA synthetase. Yellow residues comprise the hydrophobic core. An invariant Ser/Thr residue shown in green delimits the N-terminal end of a common central helix. Arginine residues shown in the B. stearothermophilus TyrRS (red blocks) were shown to bind to tRNATyr through site-directed mutagenesis.
Figure 2. Threading alignment of Synechocystis TyrRS residues 342-404 and the fragment 94-159 from the B. stearothermophilus S4 3-D structure. Boxed regions are aligned by the procedure and identical residues are shown in bold. A suboptimal alignment in the threading ensemble has an altered alignment of the third block that closes both gaps, but it has a slightly lower Z-score. Starting with a 3-D core comprising 38% of this structure, shown with the wavy lines, the final threading alignment recruited a total of 80% of residues in the S4 structure fragment with a Z-score of 4.55. The self thread recruited 91% of residues with a Z-score of 6.3. Randomized sequences with the same composition made poor alignments with very many alternative suboptimal alignments.
Figure 3. Cartoon image of the B. stearothermophilus ribosomal S4 structure fragment showing the TyrRS similarities. The blue filled backbone cartoon corresponds to the motif shown in Figure 1. The entire blue and pink filled backbone cartoon represents the substructure that was used in the threading analysis. The backbone drawn with pink lines corresponds to the N-terminus. The grey lines correspond to the C-terminus of S4 that is not in common with TyrRS. This fragment is completely missing in the archaeal S4 proteins, and may not be structurally required to form the motif fold shown in blue/pink. Yellow residues represent the hydrophobic core and correspond to yellow residues in the motif shown in Figure 1. Blue residues represent basic side chains that align with those of TyrRS previously shown to interact with tRNATyr. The green residue shows the position of the conserved threonine/serine in the motif as a helical N-cap. The helix at the top left of the structure (pink lines) corresponds to the site of ram mutations, while the helix at the bottom right (solid pink) corresponds to the Ets-domain DNA binding helix similarity.
Figure 4. Neighbor joining clustering using ClustalX of the motif in Figure 1. Information in this motif is sufficient to reconstruct, from bottom to top, clusters corresponding to archaea, eukaryotes, chloroplasts and their photosynthetic relatives, and eubacteria. Mitochondrial S4 sequences and TyrRS sequences diverge from the cluster in an atypical fashion, suggesting a change in evolutionary rates. This divergence may correspond to a systematic decrease in the population of cognate tRNA from >20 to one or two in the case of TyrRS, together with a mixing of cytoplasmic and mitochondrial tRNAs for use in the mitochondrion.
Glossary
The following standard abbreviations for the amino acid residues are used throughout the specification: A, Ala - alanine; C, Cys - cysteine; D, Asp - aspartic acid; E, Glu - glutamic acid; F, Phe - phenylalanine; G, Gly - glycine; H, His - histidine; I, Ile - isoleucine; K, Lys - lysine; L, Leu - leucine; M, Met - methionine; N, Asn - asparagine; P, Pro - proline; Q, Gln - glutamine; R, Arg - arginine; S, Ser - serine; T, Thr - threonine; V, Val - valine; W, Trp - tryptophan; Y, Tyr - tyrosine; and p.Y., P.Tyr - phosphotyrosine.
The amino acids used in the peptides of the invention are preferably in the "L" isomeric form. However, stereoisomers (e.g. D-amino acids) of the twenty conventional amino acids, unnatural amino acids such as α,α-disubstituted amino acids, N-alkyl amino acids, lactic acid, and other unconventional amino acids may also be suitable components for peptides of the present invention. Examples of unconventional or unnatural amino acids include amino acids well known in the art, but which are not included in the twenty conventional amino acids, such as 3- or 4-hydroxyproline, γ-carboxyglutamate, ε-N,N,N-trimethyllysine, ε-N-acetyllysine, O-phosphoserine, N-acetylserine, N-formylmethionine, 3-methylhistidine, 5-hydroxylysine, ω-N-methylarginine, and other similar amino acids and imino acids.
In the peptide notation used herein, the lefthand direction is the amino-terminal direction and the righthand direction is the carboxy-terminal direction, in accordance with standard usage and convention.
The term "protein comprising a tRNA anticodon stem-loop recognition motif" refers to a protein comprising the following sequence motif: [ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC], wherein X is any amino acid. Examples of the proteins are tRNA synthetases and ribosomal proteins.
The term "tRNA synthetase" refers to a protein or peptide which comprises or consists of a sequence which is capable of binding to a tRNA anticodon stem-loop. Examples of tRNA synthetases are tyrosyl tRNA synthetase [see for example Guez-Ivanier and Bedouelle (25), Brick et al. (2) and Brick and Blow (1) re the structure of tyrosyl-tRNA synthetase from B. stearothermophilus; WO9739015, and WO9726351]; isoleucyl tRNA synthetase (Chalker, A. F. et al., Gene 141:103, 1994); valyl tRNA synthetase (U.S. Nos. 5,789,218, and 5,747,314, WO9726355, EP0785267); asparaginyl tRNA synthetase (U.S. 5,789,21, WO9726348); alanyl tRNA synthetase (U.S. 5,776,750, WO9739013, WO9726353); cysteinyl tRNA synthetase (U.S. 5,775,749, U.S. 5,753,480, WO9726341); arginyl tRNA synthetase (U.S. 5,763,246, WO9726347, EP0785266); glycyl tRNA synthetase (U.S. 5,756,330, WO9726340); phenylalanyl tRNA synthetase (U.S. 5,756,329, U.S. 5,753,479, WO9726356); tryptophanyl tRNA synthetase (EP0843014); leucyl tRNA synthetase (U.S. 5,750,387, WO9726349); histidyl tRNA synthetase (U.S. 5,747,313, WO9739017, WO9726354, EP0785269); seryl tRNA synthetase (U.S. 5,744,338, WO9726352, EP0785270); threonyl tRNA synthetase (EP0815237, WO972634, EP0785271); aspartyl tRNA synthetase (U.S. 5,747,315, WO9739014, WO9726344); methionyl tRNA synthetase (WO9726350, WO9739012, EP0785268); isoleucyl tRNA synthetase (WO9739011); tryptophanyl tRNA synthetase (WO9726346); glutamyl tRNA synthetase (WO9726345); and prolyl tRNA synthetase (WO9726343, EP0785272). tRNA synthetases may also be identified by using antibodies or probes specific for the enzymes.
The term "ribosomal protein" refers to a ribosomal protein or peptide that comprises or consists of a sequence that is capable of binding tRNA and in particular binding a tRNA anticodon stem-loop. Preferably the ribosomal protein is ribosomal S4 protein, which is a multifunctional ribosomal protein associated with the 30S subunit, comprising 206 amino acids in E. coli. S4 is an ancient ribosomal protein, if not one of the most ancient (5). In the ribosome, S4 is required and it is the first protein involved in the folding of 16S rRNA (7). The term includes mutations of S4 proteins, specifically the ram (ribosomal ambiguity) D14 and D12 mutants in E. coli (8,9,10), the omnipotent suppressor mutant SUP46 in yeast (11), and the NAM-9 mutants of yeast (12).
The term "tRNA anticodon stem-loop" refers to a nucleic acid comprising or consisting of an anticodon sequence of a tRNA or a chemically, enzymatically, or metabolically modified form thereof, which has high affinity to a domain of the invention and proteins comprising such a domain including tRNA synthetase or ribosomal S4 protein. tRNA anticodon stem-loops may be identified using database search methods (see for example, www.molgen.uc.edu/analyze/Stem.htm; http://mell.angis.org.au/Staden/). tRNA anticodon stem-loops may also be identified by screening libraries with a protein containing a sequence with high affinity to a tRNA anticodon stem-loop, i.e. a peptide of the invention which may be labeled.
The phrase "interfere with the interaction of" refers to the ability of the peptides or complexes of the invention to inhibit the interaction of a protein such as a tRNA synthetase or ribosomal protein and a tRNA anticodon stem-loop, thereby affecting protein synthesis.
The term "peptide" refers to macromolecules which comprise a multiplicity of amino or imino acids (or their equivalents) in peptide linkage, wherein the peptides may comprise or lack post-translational modifications (e.g. glycosylation, cleavage, phosphorylation, side-chain derivation and the like).
The terms "label" or "labeled" refer to incorporation of a detectable substance, e.g. by incorporation of a radiolabeled amino acid or attachment of biotinyl moieties to a protein or peptide wherein the attached biotinyl moieties can be detected by marked avidin. Various methods of labeling proteins and peptides are known in the art and may be used. Examples of labels include, but are not limited to, the following: radioisotopes (e.g. 3H, 14C, 35S, 125I, 131I), fluorescent labels (e.g. FITC, rhodamine, lanthanide phosphors), enzymatic labels (e.g. horseradish peroxidase, β-galactosidase, luciferase, alkaline phosphatase), biotinyl groups, and predetermined polypeptide epitopes recognized by a secondary reporter (e.g. leucine zipper pair sequences, binding sites for secondary antibodies, metal binding domains, epitope tags). In some embodiments, labels are attached by spacer arms of various lengths to reduce potential steric hindrance.
The term "substantially pure" means that the particular peptide is the predominant species present (i.e. on a weight/volume percentage, it is the most abundant single species within the composition), and preferably a substantially purified fraction is a composition wherein the peptide comprises at least about 50 percent (w/v) of all macromolecular species present. Generally, a substantially pure composition will comprise more than 80 to 90 percent of all protein present in the composition. Most preferably, the peptide is purified to essential homogeneity (contaminant proteins cannot be detected in the composition by conventional detection methods) wherein the composition consists essentially of a single protein species.
Peptides and Complexes
The peptides of the invention generally comprise a core sequence which corresponds to a tRNA anticodon stem-loop recognition sequence motif. This general motif can be identified by the data described herein. Typically, the peptides will comprise the sequence motif Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14 where Y1 is methionine or leucine, preferably leucine; Y2 is leucine, isoleucine, valine, methionine, or cysteine, preferably isoleucine, leucine, or valine; Y3 is glycine, methionine, asparagine, glutamic acid, histidine, or lysine, preferably glycine; Y4 is methionine, leucine, phenylalanine, tyrosine, or isoleucine, preferably phenylalanine, leucine, methionine, or tyrosine; Y5 is serine or threonine; Y6 is serine, alanine, isoleucine, or glycine, preferably alanine; Y7 is arginine, methionine, isoleucine, or lysine, preferably arginine; Y8 is methionine, isoleucine, valine, or alanine, preferably valine or isoleucine; Y9 is glycine, asparagine, arginine, histidine, or lysine, preferably lysine, glycine, or arginine; Y10 is isoleucine, valine, leucine, or phenylalanine, preferably valine or isoleucine; Y11 is leucine, isoleucine, or valine, preferably valine or isoleucine; Y12 is asparagine, aspartic acid, serine, arginine, glycine, alanine, or leucine, preferably asparagine, aspartic acid, or glycine; Y13 is leucine, isoleucine, valine, serine, or glutamine, preferably glutamine or valine; Y14 is proline, isoleucine, leucine, valine, threonine, alanine, cysteine, or serine, preferably proline or valine; and X is any amino acid.
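The "preferably" clauses above define a narrower version of the motif. As a hedged illustration, the Python sketch below builds a regular expression from those preferred residues only. The names `PREFERRED`, `GAPS_AFTER`, `build_regex`, and `uses_preferred_residues` are hypothetical, and because no preference is stated for Y5 the sketch assumes both serine and threonine remain acceptable there.

```python
import re

# Preferred residues for Y1..Y14, in order, taken from the "preferably" clauses.
# Y5 (index 4) has no stated preference, so both allowed residues are kept.
PREFERRED = ["L", "ILV", "G", "FLMY", "ST", "A", "R",
             "VI", "KGR", "VI", "VI", "NDG", "QV", "PV"]

# Number of X (any-residue) positions that follow each Y in the motif
# Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14.
GAPS_AFTER = [3, 3, 0, 2, 3, 0, 2, 2, 1, 1, 0, 3, 2, 0]


def build_regex(classes, gaps):
    """Interleave residue classes with fixed-length wildcard gaps."""
    parts = []
    for residues, gap in zip(classes, gaps):
        parts.append("[" + residues + "]")
        if gap:
            parts.append(".{%d}" % gap)
    return "".join(parts)


PREFERRED_RE = re.compile(build_regex(PREFERRED, GAPS_AFTER))


def uses_preferred_residues(peptide):
    """True if the peptide contains the motif with a preferred residue at every Y."""
    return PREFERRED_RE.search(peptide.upper()) is not None
```

A peptide can satisfy the general motif and still fail this stricter check, for example when Y1 is methionine rather than the preferred leucine.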
Generally the sequence recognition motif may be present as its own peptide, or may be a core of a longer sequence. Generally, the peptides of the invention will comprise the motif as a portion, or a whole, of a peptide of from 10 to about 200 amino acids in length. Typically the peptides will be from about 20 to 100 amino acids in length, preferably from about 30 to about 75 amino acids in length, more preferably from about 36 to about 65 amino acids in length. A peptide of the invention is also represented herein as comprising the following sequence:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid.
The invention also provides complexes comprising a peptide of the invention and a tRNA anticodon stem-loop.
Preferred peptides of the invention include the peptides shown in Figure 1 (SEQ.ID.NOs. 1-45) and Figure 2 (SEQ.ID.NOs. 46-47). Additional peptides within the scope of the invention may be identified using the sequence set out above, for example with the ScanProsite service at http://expasy.hcuge.ch/sprot/scnpsit2.html.
In addition to full-length peptides of the invention, truncations of the peptides which inhibit interaction of a tRNA synthetase or ribosomal protein and a tRNA anticodon stem-loop are contemplated in the present invention. Truncated peptides may comprise peptides of about 7 to 10 amino acid residues.
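As an illustration of the size range just given, the candidate truncations of a parent peptide can be enumerated as sliding windows. The Python sketch below makes two assumptions that the specification does not state: that truncations are contiguous segments of the parent sequence, and that every window of 7 to 10 residues is worth listing. The function name `contiguous_truncations` is hypothetical, and the enumeration says nothing about which fragments retain inhibitory activity.

```python
def contiguous_truncations(peptide, min_len=7, max_len=10):
    """List (start, fragment) for every contiguous window of min_len..max_len residues."""
    if min_len < 1 or max_len < min_len:
        raise ValueError("require 1 <= min_len <= max_len")
    fragments = []
    for length in range(min_len, max_len + 1):
        # A peptide shorter than `length` yields no windows of that length.
        for start in range(len(peptide) - length + 1):
            fragments.append((start, peptide[start:start + length]))
    return fragments
```

For a 36-residue parent peptide this yields 30 + 29 + 28 + 27 = 114 windows, each of which would still have to be tested experimentally.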
The truncated peptides may have an amino group (-NH2), a hydrophobic group (for example, carbobenzoxyl, dansyl, or t-butyloxycarbonyl), an acetyl group, a 9-fluorenylmethoxy-carbonyl (FMOC) group, or a macromolecule including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or carbohydrates at the amino-terminal end. The truncated peptides may have a carboxyl group, an amido group, a t-butyloxycarbonyl group, or a macromolecule including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or carbohydrates at the carboxy-terminal end.
The peptides of the invention may also include analogs, and/or truncations thereof, which may include, but are not limited to, the peptide of the invention containing one or more amino acid insertions, additions, or deletions, or both. Analogs of the peptide of the invention exhibit the activity characteristic of a peptide of the invention (e.g. interference with the interaction of a tyrosyl-tRNA synthetase or ribosomal S4 protein and a tRNA anticodon stem-loop), and may further possess additional advantageous features such as increased bioavailability, stability, or reduced host immune recognition.
One or more amino acid insertions may be introduced into a peptide of the invention. Amino acid insertions may consist of a single amino acid residue or sequential amino acids. One or more amino acids, preferably one to five amino acids, may be added to the right or left termini of a peptide of the invention.
Deletions may consist of the removal of one or more amino acids, or discrete portions, from the peptide sequence. The deleted amino acids may or may not be contiguous. The lower limit length of the resulting analog with a deletion mutation is about 7 amino acids.
Cyclic derivatives of the peptides of the invention are also part of the present invention. Cyclization may allow the peptide to assume a more favorable conformation for association with a tRNA anticodon stem-loop. Cyclization may be achieved using techniques known in the art. For example, disulfide bonds may be formed between two appropriately spaced components having free sulfhydryl groups, or an amide bond may be formed between an amino group of one component and a carboxyl group of another component. Cyclization may also be achieved using an azobenzene-containing amino acid as described by Ulysse, L., et al., J. Am. Chem. Soc. 1995, 117, 8466-8467. The side chains of P.Tyr and Asn may be linked to form cyclic peptides. The components that form the bonds may be side chains of amino acids, non-amino acid components or a combination of the two.
It may be desirable to produce a cyclic peptide which is more flexible than the cyclic peptides containing peptide bond linkages as described above. A more flexible peptide may be prepared by introducing cysteines at the right and left position of the peptide and forming a disulfide bridge between the two cysteines. The two cysteines are arranged so as not to deform the beta-sheet and turn. The peptide is more flexible as a result of the length of the disulfide linkage and the smaller number of hydrogen bonds in the beta-sheet portion. The relative flexibility of a cyclic peptide can be determined by molecular dynamics simulations.
In addition to the above peptides, peptide analogs are also provided. Peptide analogs are commonly used in the pharmaceutical industry as non-peptide drugs with properties analogous to those of the template peptide. These non-peptide compounds are referred to as "peptide mimetics" or "peptidomimetics" (Fauchere, J. 1986, Adv. Drug Res. 15:29; Veber and Freidinger 1985, TINS, 392; and Evans et al. 1987, J. Med. Chem. 30:1229). Peptide mimetics are generally developed using computerized molecular modeling. Peptide mimetics that are structurally similar to therapeutically useful peptides may be used to produce an equivalent therapeutic or prophylactic effect. Generally, peptide mimetics are structurally similar to a paradigm peptide (i.e. a peptide that has a biological or pharmacological activity) such as a naturally occurring polypeptide (e.g. a tRNA anticodon stem-loop recognition motif of tyrosyl-tRNA synthetase), but have one or more peptide linkages optionally replaced by a linkage from, for example, a group comprising: -CH2NH-, -CH2S-, -CH2CH2-, -CH=CH- (cis and trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-, by methods known in the art (see for example Spatola, A.F. in "Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins," B. Weinstein, eds., Marcel Dekker, New York, p. 267 (1983); Spatola, A.F., Vega Data (March 1983), Vol. 1, Issue 3, "Peptide Backbone Modifications" (general review); Morley, J.S., Trends Pharm. Sci. (1980), pp. 463-468 (general review); Hudson, D. et al., Int. J. Pept. Prot. Res. (1979), 14:177-185 (-CH2NH-, -CH2CH2-); Spatola, A.F. et al., Life Sci. (1986), 38:1243-1249 (-CH2-S); Hann, M.M., J. Chem. Soc. Perkin Trans. I (1982), 307-314 (-CH=CH-, cis and trans); Almquist, R.G. et al., J. Med. Chem. (1980), 23:1392-1398 (-COCH2-); Jennings-White, C. et al., Tetrahedron Lett. (1982), 23:2533 (-COCH2-); Szelke, M. et al., European Appln. EP 45665 (1982), CA: 97:39405 (1982) (-CH(OH)CH2-); Holladay, M.W. et al., Tetrahedron Lett. (1983), 24:4401-4404 (-C(OH)CH2-); and Hruby, V.J., Life Sci. (1982), 31:189-199 (-CH2S-)).
Peptide mimetics can have advantages including, for example, more economical production, greater chemical stability, enhanced pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity (e.g. broad spectrum biological activities), and reduced antigenicity.
Labels can be directly or indirectly attached (e.g. through a spacer such as an amide group) to non-interfering positions on a peptide mimetic that are predicted by quantitative structure-activity data and/or molecular modeling. Non-interfering positions typically are positions that do not form direct contacts with the macromolecules to which the mimetic binds to produce the effect. Labeling of mimetics should not substantially interfere with the desired biological or pharmacological activity of the mimetic. Generally, mimetics of the peptides of the invention can bind to a tRNA anticodon stem-loop with high affinity and possess detectable biological activity, i.e. are agonists or antagonists to one or more tRNA anticodon stem-loop mediated phenotypes.
More stable peptides can be generated by systematic substitution of one or more amino acids of a consensus sequence with a D-amino acid of the same type.
The invention also includes a peptide conjugated with a selected protein, or a detectable substance, or selectable marker (see below) to produce fusion proteins. A "fusion protein" generally refers to a composite protein made up of two or more separate proteins which are normally not fused together as a single protein. Fusion proteins can be made by either recombinant nucleic acid methods or by chemical synthesis methods well known in the art. Fusion partners can include a substrate, cofactor, inhibitor, affinity ligand, antibody binding epitope tag, or an enzyme capable of being assayed. A fusion partner can include for example bacterial β-galactosidase, trpE, protein A, β-lactamase, α-amylase, alcohol dehydrogenase, and yeast α-mating factor. Because of their ability to recognize and bind specific proteins e.g. a protein comprising a tRNA anticodon stem-loop, the peptides of the invention may act as an affinity ligand to direct the activity of a fused protein to the specific proteins.
The peptides of the invention may be free in solution or covalently attached to a solid support.
Peptides attached to a solid support can be particularly useful in screening and purification applications. Examples of solid supports include those well known in the art such as cellulose, agarose, polystyrene, divinylbenzene, and the like. Commercially available supports that come prepared for immediate coupling of affinity ligands can be used (e.g. from Sigma Chemical, St. Louis, Missouri, or Pharmacia, Uppsala, Sweden).
The peptides of the invention may be converted into pharmaceutical salts by reacting with inorganic acids such as hydrochloric acid, sulfuric acid, hydrobromic acid, phosphoric acid, etc., or organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, succinic acid, malic acid, tartaric acid, citric acid, benzoic acid, salicylic acid, benzenesulfonic acid, and toluenesulfonic acids.
The peptides of the invention may be prepared using recombinant DNA methods.
Accordingly, nucleic acid molecules which encode a peptide of the invention may be incorporated in a known manner into an appropriate expression vector which ensures good expression of the peptide.
Possible expression vectors include but are not limited to chromosomal, episomal, and virus-derived vectors such as vectors derived from bacterial plasmids, from bacteriophages, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids, or modified viruses so long as the vector is compatible with the host cell used. The expression vectors contain a nucleic acid molecule encoding a peptide of the invention and the necessary regulatory sequences for the transcription and translation of the inserted protein-sequence. Suitable regulatory sequences may be obtained from a variety of sources, including bacterial, fungal, viral, mammalian, or insect genes (for example, see the regulatory sequences described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990)). Selection of appropriate regulatory sequences is dependent on the host cell chosen, and may be readily accomplished by one of ordinary skill in the art. Other sequences, such as an origin of replication, additional DNA restriction sites, enhancers, and sequences conferring inducibility of transcription may also be incorporated into the expression vector.
The recombinant expression vectors may also contain a selectable marker gene which facilitates the selection of transformed or transfected host cells. Suitable selectable marker genes are genes encoding proteins such as G418 and hygromycin which confer resistance to certain drugs, β-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, or an immunoglobulin or portion thereof such as the Fc portion of an immunoglobulin, preferably IgG. The selectable markers may be introduced on a separate vector from the nucleic acid of interest.
The recombinant expression vectors may also contain genes which encode a fusion portion which provides increased expression of the recombinant peptide; increased solubility of the recombinant peptide; and/or aid in the purification of the recombinant peptide by acting as a ligand in affinity purification. For example, a proteolytic cleavage site may be inserted in the recombinant peptide to allow separation of the recombinant peptide from the fusion portion after purification of the fusion protein. Examples of fusion expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the recombinant protein.
Recombinant expression vectors may be introduced into host cells to produce a transformant host cell. Transformant host cells include prokaryotic and eukaryotic cells which have been transformed or transfected with a recombinant expression vector of the invention. The terms "transformed with", "transfected with", "transformation" and "transfection" are intended to include the introduction of nucleic acid (e.g. a vector) into a cell by one of many techniques known in the art. For example, prokaryotic cells can be transformed with nucleic acid by electroporation or calcium-chloride mediated transformation. Nucleic acid can be introduced into mammalian cells using conventional techniques such as calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofectin, electroporation or microinjection. Suitable methods for transforming and transfecting host cells may be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), and other laboratory textbooks.
Suitable host cells include a wide variety of prokaryotic and eukaryotic host cells. For example, the peptides of the invention may be expressed in bacterial cells such as Streptococci, Staphylococci, Streptomyces, B. subtilis, E. coli, fungal cells such as yeast cells, insect cells such as Drosophila (using baculovirus), or mammalian cells such as CHO, COS, HeLa, C127, BHK, 293, and plant cells. Other suitable host cells can be found in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1991).
In an embodiment of the invention the host cells are plant cells and the vectors which are used to transform the plant tissue include Agrobacterium vectors and ballistic vectors.
The peptides of the invention may be tyrosine phosphorylated using conventional methods including the method described in Reedijk et al. (The EMBO Journal 11(4):1365, 1992). For example, tyrosine phosphorylation may be induced by infecting bacteria harbouring a plasmid containing a nucleotide sequence encoding a peptide of the invention, with a λgt11 bacteriophage encoding the cytoplasmic domain of the Elk tyrosine kinase as a LacZ-Elk fusion. Bacteria containing the plasmid and bacteriophage as a lysogen are isolated. Following induction of the lysogen, the expressed peptide becomes phosphorylated by the Elk tyrosine kinase.
The peptides of the invention may also be prepared by chemical synthesis using techniques well known in the chemistry of proteins such as solid phase synthesis (Merrifield, 1964, J. Am. Chem. Assoc. 85:2149-2154) or synthesis in homogenous solution (Houbenweyl, 1987, Methods of Organic Chemistry, ed. E. Wansch, Vol. 15 I and II, Thieme, Stuttgart). By way of example, the peptides may be synthesized using 9-fluorenyl methoxycarbonyl (Fmoc) solid phase chemistry with direct incorporation of phosphotyrosine as the N-fluorenylmethoxy-carbonyl-O-dimethyl phosphono-L-tyrosine derivative.
N-terminal or C-terminal fusion proteins comprising a peptide of the invention conjugated with other molecules may be prepared by fusing, through recombinant techniques, the N-terminal or C-terminal of the peptide, and the sequence of a selected protein or selectable marker with a desired biological function. The resultant fusion proteins contain the peptide fused to the selected protein or marker protein as described herein. Examples of proteins which may be used to prepare fusion proteins include immunoglobulins, glutathione-S-transferase (GST), hemagglutinin (HA), and truncated myc.
Antibodies
The peptides and complexes of the invention may be used to prepare antibodies immunospecific for such peptides or complexes. Antibodies include monoclonal and polyclonal antibodies, chimeric, single chain, simianized antibodies and humanized antibodies, and Fab fragments including the products of an Fab immunoglobulin expression library.
Conventional methods can be used to prepare the antibodies. As discussed below, the antibodies may be used to identify proteins containing tRNA anticodon stem-loop recognition motifs.
Screening for tRNA anticodon stem-loop recognition motifs and tRNA anticodon stem-loops.
The peptides, antibodies specific for the peptides, and complexes of the invention may be labeled using conventional methods with various enzymes, fluorescent materials, luminescent materials and radioactive materials. Suitable enzymes, fluorescent materials, luminescent materials, and radioactive materials are well known to the skilled artisan. Labeled antibodies specific for the peptides of the invention may be used to screen for tRNA anticodon stem-loop recognition motifs or binding sites in proteins such as tRNA synthetases or ribosomal proteins, and labeled peptides of the invention may be used to screen for tRNA anticodon stem-loops.
In an embodiment of the invention, the peptides of the present invention can be used as probes to identify nucleic acids comprising tRNA anticodon stem-loops. The methods allow for the identification of nucleic acids that are specifically involved in protein translation.
Therefore, in one aspect, the peptides of the invention can be used to determine whether a particular nucleic acid comprises a tRNA anticodon stem-loop.
Determination of whether a nucleic acid comprises a tRNA anticodon stem-loop may be carried out using a variety of means. For example, the nucleic acid to be tested can be immobilized on a solid support e.g. a microtiter well, or nitrocellulose membrane. After blocking the remaining groups on the support, the nucleic acid to be tested can be exposed to an appropriate amount of a labeled peptide of the invention. Detection of label bound to the test nucleic acid indicates that the nucleic acid contains a tRNA anticodon stem-loop.
In a preferred embodiment, the nucleic acid is attached to a solid support prior to contacting the nucleic acid with a peptide of the invention and the peptide used in the contacting step further comprises a detectable substance. The determining step comprises assaying for the presence of the detectable substance. Alternatively, the peptide of the invention can be attached to a solid support prior to contacting the nucleic acid with the peptide of the invention.
As an affinity ligand the peptides of the invention can be used to purify nucleic acids which comprise a tRNA anticodon stem-loop from a mixture of nucleic acids. Affinity purification of such nucleic acids can be carried out using conventional affinity purification methods well known in the art.
For example, a peptide of the invention can be attached to a solid support as described herein. A mixture of nucleic acids can then be contacted with the peptide bound to the solid support, such that the peptide selectively binds tRNA anticodon stem-loop containing nucleic acids present in the mixture. The bound nucleic acids can be washed to eliminate unbound nucleic acids. Substantially pure tRNA anticodon stem-loop containing nucleic acids can be eluted from the solid support by conventional elution protocols.
The invention broadly provides methods for identifying substances that bind to a peptide or complex of the invention. The invention also contemplates methods for identifying compounds that bind to substances that interact with a complex or peptide of the invention.
Conventional methods such as co-immunoprecipitation, crosslinking and co-purification through gradients or chromatographic columns may be used to identify such substances and compounds. Substances and compounds identified using the methods of the invention may be isolated and characterized (e.g. sequenced) using conventional techniques.
Substances which can bind with a peptide or complex of the invention can be identified by reacting a peptide or complex of the invention with a test substance which potentially binds to the peptide or complex, under conditions which permit the formation of substance-peptide or substance-complex conjugates, and removing and/or detecting the conjugates. The conjugates can be detected by assaying for substance-peptide or substance-complex conjugates, for free substance, or for non-complexed peptide or complexes. Conditions which permit the formation of conjugates may be selected having regard to factors such as the nature and amounts of the substance. The conjugates, free substance or non-complexed peptides or complexes may be isolated by conventional isolation techniques, for example, salting out, chromatography, electrophoresis, gel filtration, fractionation, absorption, polyacrylamide gel electrophoresis, agglutination, or combinations thereof. To facilitate the assay of the components, antibody against a peptide or complex may be utilized. The antibodies, peptides, complexes, or substances may be labeled with a detectable substance, or they may be bound to a solid support as described herein.
X-ray crystallographic studies may be used as a means of evaluating interactions. For example, purified molecules in a conjugate when crystallized in a suitable form are amenable to detection of intra-molecular interactions by x-ray crystallography.
Spectroscopy may also be used to detect interactions and in particular, Q-TOF instrumentation may be used. In addition, two-hybrid systems may be used to detect protein interactions in vivo.
Screening Methods
The invention also enables screening of compounds that enhance (agonist) or diminish (antagonist) the level of interaction of a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase or ribosomal protein) and a tRNA anticodon stem-loop. The terms "agonist" and "antagonist" as used herein do not imply a particular mechanism of function.
In one aspect of the invention to screen for agonists or antagonists, a synthetic reaction mixture, a cellular compartment (e.g. membrane, cell envelope, cell wall), or a preparation thereof, comprising a peptide of the invention and a tRNA anticodon stem-loop is incubated in the presence or the absence of a test compound which may be an agonist or antagonist. The ability of the test compound to enhance or interfere can be reflected in increased or decreased binding of the peptide and the tRNA anticodon stem-loop. The efficiency of the reaction may be enhanced by labeling the peptide or tRNA anticodon stem-loop, using a reporter system, or immobilizing the peptide or tRNA anticodon stem-loop upon a solid support.
As a specific example, a nucleic acid comprising a tRNA anticodon stem-loop can be coupled to the wells of a microtiter plate or nitrocellulose membrane. The test compound can be added to the well or membrane to preincubate with the nucleic acid. The peptide of the invention, to which a detectable substance is attached, is added to the well or membrane. Following sufficient incubation, the wells or membranes are rinsed, and binding of the peptide to the nucleic acid can be assessed by for example assaying for the presence of residual detectable substance. Those of skill in the art will recognize that the screening assay format can be set up in either direction, i.e. either the peptide or nucleic acid can be bound to the support, while the other is labeled. The level of binding can be compared to suitable positive and negative controls. Alternatively, by providing the nucleic acid and/or the peptide in known concentrations, one can assay for free, or unbound nucleic acids and/or peptide, and by negative implication, determine the level of complexes that are formed.
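The comparison to controls and the "negative implication" step described above can be sketched numerically. The following is only an illustrative sketch, not a procedure taken from the specification: the function names, the linear scaling between control means, and all readings are hypothetical assumptions.

```python
def percent_of_control(signal, negative, positive):
    """Scale a raw well signal to 0-100 % between the two control readings."""
    if positive == negative:
        raise ValueError("positive and negative controls must differ")
    return 100.0 * (signal - negative) / (positive - negative)


def complexed_amount(total, free):
    """Infer bound material by mass balance: bound = total - free."""
    if free > total:
        raise ValueError("free amount cannot exceed total amount")
    return total - free


# Hypothetical readings (arbitrary units) from a labeled-peptide binding assay.
negative_control = 50.0    # background: no nucleic acid on the support
positive_control = 1050.0  # labeled peptide bound with no test compound
with_compound = 300.0      # labeled peptide bound after preincubation

binding = percent_of_control(with_compound, negative_control, positive_control)
print(f"binding: {binding:.1f} % of control")
print(f"inhibition: {100.0 - binding:.1f} %")

# Hypothetical amounts: 10 units of peptide supplied, 4 units recovered unbound.
print(f"complexed: {complexed_amount(total=10.0, free=4.0):.1f} units")
```

With these assumed readings the test compound leaves 25 % of control binding. In practice the controls would be replicate means, and the appropriate scaling depends on the detectable substance used.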
The amount or concentration of the test compound that is added, when known, will vary depending on the compound. Typically a range of concentrations will be used. In the case of uncharacterized test compounds it may not be possible, and it is not necessary, to determine the concentration of the compound.
It is desirable to include various controls e.g. positive and negative controls, in the assays. In the testing of agonist activity, negative controls can include incubating with inert compounds (e.g. compounds known not to have agonist activity) or in the absence of added compounds. Positive controls can include incubating with compounds known to have agonist activity such as the natural ligand. As will be apparent to one of ordinary skill in the art, similar (though complementary) controls can be included in assays for antagonist activity, as well as various additional controls.
A competitive assay may also be used to screen for antagonists. The assay combines a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase, ribosomal protein, or peptide of the invention), tRNA anticodon stem-loop, and test compound under appropriate conditions for a competitive inhibition assay.
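Results of a competitive assay run over a range of test-compound concentrations are commonly summarized as the concentration that halves binding (an IC50). The specification does not prescribe any particular analysis; the sketch below is an assumption-laden illustration that uses simple log-linear interpolation between the two measured points that bracket 50 % binding. The function name and all data values are hypothetical.

```python
import math


def interpolate_ic50(concentrations, percent_binding):
    """Estimate the concentration giving 50 % binding.

    concentrations: ascending, positive test-compound concentrations
    percent_binding: binding at each concentration, as % of the
        no-compound control
    Returns None if binding never falls through the 50 % level.
    """
    points = list(zip(concentrations, percent_binding))
    for (c_lo, b_lo), (c_hi, b_hi) in zip(points, points[1:]):
        if b_lo >= 50.0 >= b_hi and b_lo != b_hi:
            # Interpolate linearly in log10(concentration).
            frac = (b_lo - 50.0) / (b_lo - b_hi)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical dose-response data (concentration in micromolar).
concs = [0.1, 1.0, 10.0, 100.0]
binding = [95.0, 80.0, 20.0, 5.0]
print(interpolate_ic50(concs, binding))
```

A fitted sigmoidal model would normally be preferred when enough data points are available; interpolation is shown here only because it needs no external libraries.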
Potential agonists and antagonists include small inorganic or organic molecules, peptides, polypeptides, antibodies, a mixture of molecules, peptides, polypeptides etc., or an extract made from biological materials such as bacteria, plants, fungi, or animal cells or tissues. The compound may be an endogenous physiological compound or it may be a natural or synthetic compound.
The screening assays may be used in the discovery and development of therapeutics such as antibacterial or antifungal compounds. In addition, antisense sequences to the sequence encoding the novel domain, i.e. the tRNA anticodon stem-loop recognition motif or binding site identified herein, may be used to control expression of the coding sequence and thus may be used as potential therapeutics.
The peptide of the invention can be used to model small molecules which interfere with the binding of a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase or ribosomal protein), with a tRNA anticodon stem-loop in vivo. In particular, the structure of the tRNA anticodon stem-loop sequence recognition motif, as described herein, can be applied in generating synthetic analogs and mimics of the recognition sequence. Synthetic elements can be pieced together based upon their analogy to the structural and chemical aspects of the recognition sequence motif. Such mimics and analogs may be used in blocking or inhibiting specific aspects of protein translation and may be useful as therapeutic treatments in accordance with the methods described herein.
Compositions
While it is possible to administer an active ingredient alone it is preferable to present it as part of a pharmaceutical composition or formulation. Therefore, the peptides, complexes, antibodies, substances, and compounds of the invention may be formulated into pharmaceutical compositions for administration to subjects in a therapeutically active amount and in a biologically compatible form suitable for administration in vivo, i.e. a form of the peptides etc. to be administered in which any toxic effects are outweighed by the therapeutic effects. A therapeutically active amount of a pharmaceutical composition of the invention is defined as an amount effective, at dosages and for periods of time necessary to achieve the desired result. For example, a therapeutically active amount of a peptide may vary according to factors such as the disease state, age, sex, and weight of the individual. Dosage regime may be adjusted to provide the optimum therapeutic response.
The peptides etc. may be administered in a convenient manner such as by injection (subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal administration. Depending on the route of administration, the peptides etc. may be coated in a material to protect them from the action of enzymes. The peptides etc. may also be used in combination with organic substances for prolongation of their pharmacologic actions. Examples of such organic substances are non-antigenic gelatin, carboxymethylcellulose, sulfonate or phosphate ester of alginic acid, dextran, polyethylene glycol and other glycols, phytic acid, polyglutamic acid, and protamine.
The compositions described herein can be prepared by per se known methods for the preparation of pharmaceutically acceptable compositions which can be administered to subjects, such that an effective quantity of a peptide etc. is combined in a mixture with a pharmaceutically acceptable vehicle. Suitable vehicles are described, for example, in Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the compositions include, albeit not exclusively, solutions of the peptides in association with one or more pharmaceutically acceptable vehicles or diluents, and contained in buffered solutions with a suitable pH and iso-osmotic with the physiological fluids. The peptides etc. may also be incorporated in liposomes or similar delivery vehicles.
Applications
The peptides and complexes of the invention interfere with the interaction of a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase or ribosomal protein), and a tRNA anticodon stem-loop. The activity of a peptide or complex of the invention may be confirmed by assaying for the ability of the peptide or complex to interfere with the interaction of, for example, a tyrosyl tRNA synthetase or ribosomal S4 protein and a tRNA anticodon stem-loop.
Computer modelling techniques known in the art may also be used to observe the interaction of a peptide of the invention, and truncations and analogs thereof, with a tRNA anticodon stem-loop (for example, Homology Insight II and Discovery available from BioSym/Molecular Simulations, San Diego, California, U.S.A.). If computer modelling indicates a strong interaction, the peptide can be synthesized and tested for its ability to interfere with the interaction of a tRNA anticodon stem-loop, and a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tyrosyl tRNA synthetase or ribosomal S4 protein).
The peptides, compositions, complexes, and antibodies of the invention, and compounds and substances identified using the screening assays of the invention may be used in therapeutic applications for the treatment of living organisms including human or non-human mammalian subjects.
Alternatively, the peptides etc. may be useful as a prophylactic treatment, or in screening for compounds effective in prophylactic treatments.
The peptides, complexes, compositions, antibodies, and compounds and substances identified using the screening assays of the invention may be used to treat or prevent infections caused by bacteria such as species of Bacillus, E. coli, Mycobacterium, Helicobacter, Hemophilus, Streptococcus, and Staphylococcus, and infections caused by fungi such as yeast including S. cerevisiae, and Aspergillus. The peptides, complexes, compositions, antibodies, and compounds may also be used as immunotoxins, and anti-viral agents. They may also be used as plant toxins either as an applied pesticide formulation or by incorporation into the genome of the plant. They may also be useful in vitro to arrest protein translation in biochemical assays where a precise termination of the reaction is desired.
The peptides and complexes of the invention may also be used to induce an immunological response in an individual, particularly a mammal which comprises inoculating the individual with a peptide, complex, or composition of the invention adequate to produce antibody to protect the individual from disease (e.g. bacterial infections). An immunological response may also be induced by delivering through gene therapy a gene encoding a peptide of the invention in vivo in order to induce an immune response to protect the individual from disease. Thus, the invention contemplates a vaccine formulation which comprises a peptide or complex of the invention together with a suitable carrier.
The following non-limiting example is illustrative of the present invention:
EXAMPLE
The three-dimensional structure of B. stearothermophilus tyrosyl-tRNA synthetase (TyrRS) has been known for more than 10 years (1); however, the 100 C-terminal amino acids were found disordered in the crystal structure. Deletion mutants have demonstrated that this C-terminal domain is required for tRNA binding and recognition (2,3). Starting with prokaryotic TyrRS from Escherichia coli, a PSI-BLAST (4) search iteratively finds weak similarities between the C-terminal region and archaeal, chloroplast and prokaryotic ribosomal S4 proteins, as well as NAM-9, the yeast mitochondrial ortholog of S4. The corresponding alignment and the motif inferred from this similarity are shown in Figure 1.
Ribosomal S4 protein is a multifunctional ribosomal protein associated with the 30S subunit, comprising 206 amino acids in E. coli. S4 is an ancient, if not one of the most ancient, of ribosomal proteins (5). It has been demonstrated that in E. coli S4 has an autoregulatory function, binding to a pseudoknot of its own operon mRNA which limits its expression through a feedback mechanism (6). In the ribosome, S4 is required and it is the first protein involved in the folding of 16S rRNA (7).
Mutations of S4 proteins, specifically the ram (ribosomal ambiguity) D14 and D12 mutants in E. coli (8,9,10), the omnipotent suppressor mutant SUP46 in yeast (11), and the NAM-9 mutants of yeast (12) affect translational accuracy. Visualization of S4 in the ribosome through 3-D electron microscopy shows that it is superimposable with the region of the A and P sites, the entry and peptidyl-tRNA binding sites of the ribosome (13,14).
There are several experiments elucidating the interaction between ribosome and tRNA that indicate a tRNA anticodon stem-loop interaction when tRNA is in the peptidyl (P) site. In the yeast tRNA~" , the top base pairs of the anticodon helix were shown to have an effect on ribosomal binding and activity independently of anticodon in-vivo (15). Hydroxyl radical probing experiments revealed that the anticodon stem-loop is protected in the P site of the 30S ribosomal subunit (16). Ribosomal protein S4 is also implicated in the binding of the tRNA anticodon. Ribosomal proteins S4 and S18 were crosslinked to chemically modified anticodon nucleotides in early in-vitro experiments (17, 18). Characterization of nonsense and missense suppressor tRNAs in the context of the ram S4 mutant suggested that the level of suppression might depend upon recognition of tRNA anticodon stem-loop structure by S4 protein (19). S4 ram mutations also affect the kinetic off rates of the P site tRNA binding (20).
The mRNA and rRNA binding regions of S4 have been previously characterized. It has been shown that both functions reside within residues 47-104 in E. coli S4 protein (21,22). Yet the most conserved region of S4 lies to the C-terminal side of this region and has not been shown to possess a specific function to date. This region (residues 97-132 in E. coli) also comprises the motif similarity between S4 and the TyrRS C-terminal region shown in Figure 1. Interestingly, no suppressor or ribosomal ambiguity mutants have ever been characterized from this most conserved portion of S4. Mutations here have been only rarely observed probably owing to their functional importance. A more fundamental and specific binding and recognition event is the reason for the conservation of this motif.
The lack of structure of the C-terminus in the crystals of B. stearothermophilus TyrRS has led to the question as to whether the C-terminus of TyrRS has any structure at all. Similarly, the structure of the S4 protein rRNA and mRNA binding domain has been studied, and most recently the 3-D structure of S4 has been determined (23, 24). Circular Dichroism spectra of the E. coli S4 fragment 48-177 (21) and the B. stearothermophilus TyrRS fragment 323-419 (25) corresponding to the motif presented here are qualitatively similar in shape, supporting a shared folded structure. Sequence-structure threading (26) was performed to determine a structure-based alignment to further test this hypothesis. Despite difficulties in threading very small domains, the resulting threading alignment shown in Figure 2 recreated the entire motif alignment of Figure 1, and also aligned a conserved region of positively charged residues downstream of the motif.
The experimental analysis of both the S4 protein and TyrRS indicates that the individual domains comprising these proteins are modular and functionally divisible. The sequence similarity between TyrRS and S4 may be an evolutionary event in which ribosomal protein S4 was grafted onto the C-terminus of a bacterial precursor TyrRS originally more closely related to the shorter archaeal tyrosyl-tRNA synthetases, which lack this C-terminus. Perhaps it is not a coincidence that the B. subtilis genomic sequence, which encodes two tyrosyl-tRNA synthetase genes (tyrS), has one TyrRS adjacent to the single ribosomal S4 gene (27); however, this close spatial relationship is not found in other complete genomes. The observation of two variants of TyrRS in prokaryotes is best explained by an ancient duplication event followed by differential loss of one or the other form (28). This duplication event probably came some time after the TyrRS-S4 chimera was formed in the prokaryotic ancestor.
It is clear that this motif persists in the S. cerevisiae mitochondrial form of TyrRS (mtYRS). This indicates that any such S4 insertion event would precede a symbiotic event forming mitochondria in eukaryotes, thus perhaps it is a useful evolutionary marker. In P. anserina and N. crassa mtTyrRS, the C-terminus has digressed from this sequence, accounted for by the obvious alteration of the C-terminus to accommodate the rather large group I intron splicing polypeptide, possibly a replacement of the S4-like C-terminus in some mitochondria (28). Interestingly no convincing human ESTs have yet been found with similarity to a mitochondrial TyrRS. It is possible that some eukaryotes may have replaced the mtTyrRS with the cytoplasmic TyrRS.
The earlier work of Bedouelle et al. (30) demonstrates the interaction of this fragment of B. stearothermophilus TyrRS with its cognate tRNATyr. The residues from the S4-motif (namely R368 and R371) were among the 6 basic residues which, when mutated, could not complement the temperature sensitive tyrS strain of E. coli, demonstrating their requirement in the interaction with tRNATyr. These include the conserved residues shown in blue in Figure 3. The model-building studies of the same group showed that the C-terminus of TyrRS will co-localize with the anticodon helix of tRNATyr. The C-terminus of the eukaryotic cytoplasmic and archaeal TyrRS is shorter than that of the prokaryotic variants, lacking this motif. In a corresponding fashion, the tRNA binding specificity of eukaryotic tyrosyl-tRNA synthetase seems independent of the anticodon helix as the TyrRS binding affinity resides mainly in the aminoacyl-stem of tRNATyr (31, 32). Specificity for binding the anticodon of tRNATyr in prokaryotic TyrRS may have indeed arisen by the apparent ancient addition of S4 sequence to the TyrRS C-terminus forming a chimera.
The comparison of ribosomal S4 protein against structures in the PDB using VAST (33) shows similarities to the Ets-fold DNA binding proteins, as previously reported by the authors of the S4 structure (24). The compact substructure comprising the S4 signature motif shown in Figure 3 is not on the same side of the domain as the helix corresponding to the DNA binding helix of Ets-domain proteins. The similarity with the Ets-domain comprises this helix and the beta-sheet, but the compact substructure formed by the S4 motif is not found in any other structure in the current structure database.
All 6 of the residues identified as crucial for tRNA binding in the TyrRS C-terminus lie in this compact substructure, extended along the C-terminal side of the motif, as shown as solid cartoons in Figure 3. This similarity indicates that this portion of S4 may form part of the tRNA P site binding domains. A neighbor joining clustering analysis was performed using ClustalX on the motif in Figure 1 itself. Despite this being a very short sequence in the alignment, there is sufficient information to create a remarkable tree, shown in Figure 4, with a nested canonical archaea/eukaryote/prokaryote/chloroplast classification. Each of the mitochondrial S4 sequences which correspond to an alternative mitochondrial genetic code are outgroups to this classification, with the earliest branching R. americana S4 motif corresponding to the most primitive mitochondrion yet discovered (33). Since the molecular carriers of the genetic code are the cognate pool of all tRNAs, this unique clustering supports the conclusion that this is a tRNA binding motif. This nested, compact fragment within S4, indeed the most conserved region of all S4 proteins, forms a portion of the ribosomal tRNA binding P site that binds all tRNAs in the organism or organelle. This new information, taken together with the mapping of the region conferring translational accuracy onto the S4 structure (24) on the opposite side of the S4 "waist" from this motif, indicates that ribosomal S4 protein may comprise a significant portion of the anticodon-codon decoding active site of all ribosomes.
Having illustrated and described the principles of the invention in a preferred embodiment, it should be appreciated by those skilled in the art that the invention can be modified in arrangement and detail without departure from such principles. We claim all modifications coming within the scope of the following claims.
All publications, patents and patent applications referred to herein are incorporated by reference in their entirety to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety.
Full citations for the references referred to in the specification are set out below, and a listing and detailed legends for the figures are provided.
The application contains sequence listings which form part of the application.
FULL CITATIONS FOR REFERENCES REFERRED TO IN THE SPECIFICATION
1 Brick, P. and Blow, D.M. (1987) J. Mol. Biol. 194, 287-297
2 Brick, P., Bhat, T.N., and Blow, D.M. (1989) J. Mol. Biol. 208, 83-98
3 Waye, M.M. et al. (1983) EMBO J. 2, 1827-1829
4 Altschul, S.F. et al. (1997) Nucleic Acids Res. 25, 3389-3402
5 Nowotny, V. and Nierhaus, K.H. (1988) Biochemistry 27, 7051-7055
6 Tang, C.K. and Draper, D.E. (1989) Cell 57, 531-536
synthetase or ribosomal protein and a tRNA anticodon stem-loop thereby affecting protein synthesis.
The term "peptide" refers to macromolecules which comprise a multiplicity of amino or imino acids (or their equivalents) in peptide linkage, wherein the peptides may comprise or lack post-translational modifications (e.g. glycosylation, cleavage, phosphorylation, side-chain derivation and the like).
The terms "label" or "labeled" refer to incorporation of a detectable substance, e.g. by incorporation of a radiolabeled amino acid or attachment of biotinyl moieties to a protein or peptide wherein the attached biotinyl moieties can be detected by marked avidin.
Various methods of labeling proteins and peptides are known in the art and may be used. Examples of labels include, but are not limited to, the following: radioisotopes (e.g. 3H, 14C, 35S, 125I, 131I), fluorescent labels (e.g. FITC, rhodamine, lanthanide phosphors), enzymatic labels (e.g. horseradish peroxidase, β-galactosidase, luciferase, alkaline phosphatase), biotinyl groups, and predetermined polypeptide epitopes recognized by a secondary reporter (e.g. leucine zipper pair sequences, binding sites for secondary antibodies, metal binding domains, epitope tags). In some embodiments, labels are attached by spacer arms of various lengths to reduce potential steric hindrance.
The term "substantially pure" means that the particular peptide is the predominant species present (i.e. on a weight/volume percentage, it is the most abundant single species within the composition), and preferably a substantially purified fraction is a composition wherein the peptide comprises at least about 50 percent (w/v) of all macromolecular species present. Generally, a substantially pure composition will comprise more than 80 to 90 percent of all protein present in the composition. Most preferably, the peptide is purified to essential homogeneity (contaminant proteins cannot be detected in the composition by conventional detection methods) wherein the composition consists essentially of a single protein species.
Peptides and Complexes
The peptides of the invention generally comprise a core sequence which corresponds to a tRNA anticodon stem-loop recognition sequence motif. This general motif can be identified by the data described herein. Typically, the peptides will comprise the sequence motif Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14, where Y1 is methionine or leucine, preferably leucine; Y2 is leucine, isoleucine, valine, methionine, or cysteine, preferably isoleucine, leucine, or valine; Y3 is glycine, methionine, asparagine, glutamic acid, histidine, or lysine, preferably glycine; Y4 is methionine, leucine, phenylalanine, tyrosine, or isoleucine, preferably phenylalanine, leucine, methionine, or tyrosine; Y5 is serine or threonine; Y6 is serine, alanine, isoleucine, or glycine, preferably alanine; Y7 is arginine, methionine, isoleucine, or lysine, preferably arginine; Y8 is methionine, isoleucine, valine, or alanine, preferably valine or isoleucine; Y9 is glycine, asparagine, arginine, histidine, or lysine, preferably lysine, glycine, or arginine; Y10 is isoleucine, valine, leucine, or phenylalanine, preferably valine or isoleucine; Y11 is leucine, isoleucine, or valine, preferably valine or isoleucine; Y12 is asparagine, aspartic acid, serine, arginine, glycine, alanine, or leucine, preferably asparagine, aspartic acid, or glycine; Y13 is leucine, isoleucine, valine, serine, or glutamine, preferably glutamine or valine; Y14 is proline, isoleucine, leucine, valine, threonine, alanine, cysteine, or serine, preferably proline or valine; and X is any amino acid.
Generally, the sequence recognition motif may be present as its own peptide, or may be a core of a longer sequence. Generally, the peptides of the invention will comprise the motif as a portion, or a whole, of a peptide of from 10 to about 200 amino acids in length. Typically the peptides will be from about 20 to 100 amino acids in length, preferably from about 30 to about 75 amino acids in length, and more preferably from about 36 to about 65 amino acids in length. A peptide of the invention is also represented herein as comprising the following sequence:
[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid.
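As an illustration only, the bracketed pattern above can be turned into a regular expression and used to scan a protein sequence. The sketch below assumes the pattern follows PROSITE-style syntax, with the third bracketed class read as [GMNEHK] to match the residues listed for Y3. The names `MOTIF`, `prosite_to_regex` and `find_motif` are hypothetical helpers and are not part of the specification. Note that the prose definition of Y14 also lists serine, which the printed pattern does not include; the sketch follows the printed pattern.

```python
import re

# Pattern transcribed from the specification (PROSITE-style syntax assumed).
MOTIF = (
    "[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-"
    "X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]"
)


def prosite_to_regex(pattern):
    """Convert a simple PROSITE-style pattern into a regular expression string."""
    parts = []
    for element in pattern.split("-"):
        wildcard = re.fullmatch(r"X(?:\((\d+)\))?", element)
        if wildcard:
            # X means any residue; X(n) means n consecutive residues of any kind.
            count = int(wildcard.group(1) or 1)
            parts.append(".{%d}" % count)
        elif re.fullmatch(r"\[[A-Z]+\]", element):
            # A bracketed class already has the same meaning in regex syntax.
            parts.append(element)
        else:
            raise ValueError("unsupported pattern element: %r" % element)
    return "".join(parts)


# A lookahead wrapper lets overlapping occurrences be reported as well.
MOTIF_REGEX = re.compile("(?=(%s))" % prosite_to_regex(MOTIF))


def find_motif(sequence):
    """Return (zero-based start, matched 36-residue segment) for each occurrence."""
    return [(m.start(), m.group(1)) for m in MOTIF_REGEX.finditer(sequence.upper())]
```

A made-up sequence such as `"GG" + "LAAALAAAGFAASAAAARAAVAAKAVAVNAAAQAAP" + "GG"` (not a real protein) contains one occurrence starting at offset 2. A dedicated service such as ScanProsite would normally be preferred for database-wide searches.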
The invention also provides complexes comprising a peptide of the invention and a tRNA anticodon stem-loop.
Preferred peptides of the invention include the peptides shown in Figure 1 (SEQ.ID.NOs. 1-45) and Figure 2 (SEQ.ID.NOs. 46-47). Additional peptides within the scope of the invention may be identified using the sequence set out above, for example with the ScanProsite service at http://expasy.hcuge.ch/sprot/scnpsit2.html.
In addition to full-length peptides of the invention, truncations of the peptides which inhibit interaction of a tRNA synthetase or ribosomal protein and a tRNA anticodon stem-loop are contemplated in the present invention. Truncated peptides may comprise peptides of about 7 to 10 amino acid residues.
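For illustration, candidate truncations in the stated size range can be enumerated as contiguous windows of a parent peptide. This is a minimal sketch; the function name `truncations` and its default bounds of 7 and 10 residues are assumptions taken from the range given above, and whether any given fragment actually inhibits the interaction would still have to be determined experimentally.

```python
def truncations(peptide, min_len=7, max_len=10):
    """Yield (zero-based start, fragment) for every contiguous fragment
    of min_len to max_len residues in the parent peptide."""
    for length in range(min_len, max_len + 1):
        for start in range(len(peptide) - length + 1):
            yield start, peptide[start:start + length]
```

For a 12-residue parent this yields 6 + 5 + 4 + 3 = 18 fragments.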
The truncated peptides may have an amino group (-NH2), a hydrophobic group (for example, carbobenzoxyl, dansyl, or T-butyloxycarbonyl), an acetyl group, a 9-fluorenylmethoxy-carbonyl (FMOC) group, or a macromolecule including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or carbohydrates at the amino terminal end. The truncated peptides may have a carboxyl group, an amido group, a T-butyloxycarbonyl group, or a macromolecule including but not limited to lipid-fatty acid conjugates, polyethylene glycol, or carbohydrates at the carboxy terminal end.
The peptides of the invention may also include analogs, and/or truncations thereof, which may include, but are not limited to, the peptide of the invention containing one or more amino acid insertions, additions, or deletions, or both. Analogs of the peptide of the invention exhibit the activity characteristic of a peptide of the invention (e.g. interference with the interaction of a tyrosyl-tRNA synthetase or ribosomal S4 protein and a tRNA anticodon stem-loop), and may further possess additional advantageous features such as increased bioavailability, stability, or reduced host immune recognition.
One or more amino acid insertions may be introduced into a peptide of the invention. Amino acid insertions may consist of a single amino acid residue or sequential amino acids. One or more amino acids, preferably one to five amino acids, may be added to the right or left termini of a peptide of the invention.
Deletions may consist of the removal of one or more amino acids, or discrete portions from the peptide sequence. The deleted amino acids may or may not be contiguous.
The lower limit length of the resulting analog with a deletion mutation is about 7 amino acids.
Cyclic derivatives of the peptides of the invention are also part of the present invention.
Cyclization may allow the peptide to assume a more favorable conformation for association with a tRNA anticodon stem-loop. Cyclization may be achieved using techniques known in the art. For example, disulfide bonds may be formed between two appropriately spaced components having free sulfhydryl groups, or an amide bond may be formed between an amino group of one component and a carboxyl group of another component. Cyclization may also be achieved using an azobenzene-containing amino acid as described by Ulysse, L., et al., J. Am. Chem. Soc. 1995, 117, 8466-8467. The side chains of P.Tyr and Asn may be linked to form cyclic peptides. The components that form the bonds may be side chains of amino acids, non-amino acid components or a combination of the two.
It may be desirable to produce a cyclic peptide which is more flexible than the cyclic peptides containing peptide bond linkages as described above. A more flexible peptide may be prepared by introducing cysteines at the right and left position of the peptide and forming a disulphide bridge between the two cysteines. The two cysteines are arranged so as not to deform the beta-sheet and turn.
The peptide is more flexible as a result of the length of the disulfide linkage and the smaller number of hydrogen bonds in the beta-sheet portion. The relative flexibility of a cyclic peptide can be determined by molecular dynamics simulations.
In addition to the above peptides, peptide analogs are also provided. Peptide analogs are commonly used in the pharmaceutical industry as non-peptide drugs with properties analogous to those of the template peptide. These non-peptide compounds are referred to as "peptide mimetics" or "peptidomimetics" (Fauchere, J. 1986, Adv. Drug. Res. 15:29; Veber and Freidinger 1985, TINS, 392; and Evans et al 1987, J. Med. Chem. 30:1229). Peptide mimetics are generally developed using computerized molecular modeling. Peptide mimetics that are structurally similar to therapeutically useful peptides may be used to produce an equivalent therapeutic or prophylactic effect. Generally, peptide mimetics are structurally similar to a paradigm peptide (i.e. a peptide that has a biological or pharmacological activity) such as a naturally occurring polypeptide (e.g. a tRNA anticodon stem-loop recognition motif of tyrosyl-tRNA synthetase), but have one or more peptide linkages optionally replaced by a linkage from, for example, a group comprising: -CH2NH-, -CH2S-, -CH2CH2-, -CH=CH- (cis and trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-, by methods known in the art (see for example Spatola, A.F. in "Chemistry and Biochemistry of Amino Acids, Peptides, and Proteins," B. Weinstein, eds., Marcel Dekker, New York, p. 267 (1983); Spatola, A.F., Vega Data (March 1983), Vol. 1, Issue 3, "Peptide Backbone Modifications" (general review); Morley, J.S., Trends Pharm. Sci. (1980), pp. 463-468 (general review); Hudson, D. et al, Int. J. Pept Prot Res. (1979), 14:177-185 (-CH2NH-, CH2CH2-); Spatola, A.F. et al., Life Sci. (1986), 38:1243-1249 (-CH2-S); Hann, M.M., J. Chem Soc. Perkin Trans I (1982), 307-314 (-CH=CH-, cis and trans); Almquist, R.G. et al., J. Med Chem (1980), 23:1392-1398 (-COCH2-); Jennings-White, C. et al., Tetrahedron Lett. 1982, 23:2533 (-COCH2-); Szelke, M. et al., European Appln. EP 45665 (1982), CA: 97:39405, (1982) (-CH(OH)CH2-); Holladay, M.W. et al., Tetrahedron Lett. (1983), 24:4401-4404 (-C(OH)CH2-); and Hruby, V.J., Life Sci. (1982) 31:189-199 (-CH2S-)).
Peptide mimetics can have advantages including, for example, more economical production, greater chemical stability, enhanced pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity (e.g. broad spectrum biological activities), and reduced antigenicity.
Labels can be directly or indirectly attached (e.g. through a spacer such as an amide group) to non-interfering positions on a peptide mimetic that are predicted by quantitative structure-activity data and/or molecular modeling. Non-interfering positions typically are positions that do not form direct contacts with the macromolecules to which the mimetic binds to produce the effect. Labeling of mimetics should not substantially interfere with the desired biological or pharmacological activity of the mimetic. Generally, mimetics of the peptides of the invention can bind to a tRNA anticodon stem-loop with high affinity and possess detectable biological activity, i.e. are agonists or antagonists to one or more tRNA anticodon stem-loop mediated phenotypes.
More stable peptides can be generated by systematic substitution of one or more amino acids of a consensus sequence with a D-amino acid of the same type.
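One way to picture a systematic substitution is to enumerate every single-position variant in which one L-residue is swapped for the D-form of the same residue. The sketch below is an assumption about how such a scan might be organised, not a procedure taken from the specification. It uses the common convention of writing D-residues in lower case, and it skips glycine because glycine is achiral; the name `d_scan` is hypothetical.

```python
def d_scan(peptide):
    """Return all single-site variants in which one residue is replaced by
    the D-form of the same amino acid (written in lower case)."""
    variants = []
    for i, residue in enumerate(peptide):
        if residue == "G":
            continue  # glycine has no D/L forms
        variants.append(peptide[:i] + residue.lower() + peptide[i + 1:])
    return variants
```

Variants with several D-residues could be built by applying the same substitution at more than one position.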
The invention also includes a peptide conjugated with a selected protein, or a detectable substance, or selectable marker (see below) to produce fusion proteins. A "fusion protein" generally refers to a composite protein made up of two or more separate proteins which are normally not fused together as a single protein. Fusion proteins can be made by either recombinant nucleic acid methods or by chemical synthesis methods well known in the art. Fusion partners can include a substrate, cofactor, inhibitor, affinity ligand, antibody binding epitope tag, or an enzyme capable of being assayed. A fusion partner can include, for example, bacterial β-galactosidase, trpE, protein A, β-lactamase, α-amylase, alcohol dehydrogenase, and yeast α-mating factor. Because of their ability to recognize and bind specific proteins, e.g. a protein comprising a tRNA anticodon stem-loop, the peptides of the invention may act as an affinity ligand to direct the activity of a fused protein to the specific proteins.
The peptides of the invention may be free in solution or covalently attached to a solid support.
Peptides attached to a solid support can be particularly useful in screening and purification applications. Examples of solid supports include those well known in the art such as cellulose, agarose, polystyrene, divinylbenzene, and the like. Commercially available supports that come prepared for immediate coupling of affinity ligands can be used (e.g. from Sigma Chemical, St. Louis, Missouri, or Pharmacia, Uppsala, Sweden).
The peptides of the invention may be converted into pharmaceutical salts by reacting with inorganic acids such as hydrochloric acid, sulfuric acid, hydrobromic acid, phosphoric acid, etc., or organic acids such as formic acid, acetic acid, propionic acid, glycolic acid, lactic acid, pyruvic acid, oxalic acid, succinic acid, malic acid, tartaric acid, citric acid, benzoic acid, salicylic acid, benzenesulfonic acid, and toluenesulfonic acids.
The peptides of the invention may be prepared using recombinant DNA methods. Accordingly, nucleic acid molecules which encode a peptide of the invention may be incorporated in a known manner into an appropriate expression vector which ensures good expression of the peptide.
Possible expression vectors include but are not limited to chromosomal, episomal, and virus-derived vectors such as vectors derived from bacterial plasmids, from bacteriophages, from transposons, from yeast episomes, from insertion elements, from yeast chromosomal elements, from viruses such as baculoviruses, papova viruses, such as SV40, vaccinia viruses, adenoviruses, fowl pox viruses, pseudorabies viruses and retroviruses, and vectors derived from combinations thereof, such as those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids, or modified viruses, so long as the vector is compatible with the host cell used. The expression vectors contain a nucleic acid molecule encoding a peptide of the invention and the necessary regulatory sequences for the transcription and translation of the inserted protein-sequence. Suitable regulatory sequences may be obtained from a variety of sources, including bacterial, fungal, viral, mammalian, or insect genes (for example, see the regulatory sequences described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990)). Selection of appropriate regulatory sequences is dependent on the host cell chosen, and may be readily accomplished by one of ordinary skill in the art. Other sequences, such as an origin of replication, additional DNA restriction sites, enhancers, and sequences conferring inducibility of transcription may also be incorporated into the expression vector.
The recombinant expression vectors may also contain a selectable marker gene which facilitates the selection of transformed or transfected host cells. Suitable selectable marker genes are genes encoding proteins such as G418 and hygromycin which confer resistance to certain drugs, β-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, or an immunoglobulin or portion thereof such as the Fc portion of an immunoglobulin, preferably IgG. The selectable markers may be introduced on a separate vector from the nucleic acid of interest.
The recombinant expression vectors may also contain genes which encode a fusion portion which provides increased expression of the recombinant peptide; increased solubility of the recombinant peptide; and/or aid in the purification of the recombinant peptide by acting as a ligand in affinity purification. For example, a proteolytic cleavage site may be inserted in the recombinant peptide to allow separation of the recombinant peptide from the fusion portion after purification of the fusion protein. Examples of fusion expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the recombinant protein.
Recombinant expression vectors may be introduced into host cells to produce a transformant host cell. Transformant host cells include prokaryotic and eukaryotic cells which have been transformed or transfected with a recombinant expression vector of the invention. The terms "transformed with", "transfected with", "transformation" and "transfection" are intended to include the introduction of nucleic acid (e.g. a vector) into a cell by one of many techniques known in the art. For example, prokaryotic cells can be transformed with nucleic acid by electroporation or calcium-chloride mediated transformation. Nucleic acid can be introduced into mammalian cells using conventional techniques such as calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, lipofectin, electroporation or microinjection. Suitable methods for transforming and transfecting host cells may be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), and other laboratory textbooks.
Suitable host cells include a wide variety of prokaryotic and eukaryotic host cells. For example, the peptides of the invention may be expressed in bacterial cells such as Streptococci, Staphylococci, Streptomyces, B. subtilis, E. coli, fungal cells such as yeast cells, insect cells such as Drosophila (using baculovirus), or mammalian cells such as CHO, COS, HeLa, C127, BHK, 293, and plant cells. Other suitable host cells can be found in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, CA (1990).
In an embodiment of the invention the host cells are plant cells and the vectors which are used to transform the plant tissue include Agrobacterium vectors and ballistic vectors.
The peptides of the invention may be tyrosine phosphorylated using conventional methods including the method described in Reedijk et al. (The EMBO Journal 11(4):1365, 1992). For example, tyrosine phosphorylation may be induced by infecting bacteria harbouring a plasmid containing a nucleotide sequence encoding a peptide of the invention with a λgt11 bacteriophage encoding the cytoplasmic domain of the Elk tyrosine kinase as a LacZ-Elk fusion.
Bacteria containing the plasmid and bacteriophage as a lysogen are isolated. Following induction of the lysogen, the expressed peptide becomes phosphorylated by the Elk tyrosine kinase.
The peptides of the invention may also be prepared by chemical synthesis using techniques well known in the chemistry of proteins such as solid phase synthesis (Merrifield, 1964, J. Am. Chem. Assoc. 85:2149-2154) or synthesis in homogenous solution (Houbenweyl, 1987, Methods of Organic Chemistry, ed. E. Wansch, Vol. 15 I and II, Thieme, Stuttgart). By way of example, the peptides may be synthesized using 9-fluorenyl methoxycarbonyl (Fmoc) solid phase chemistry with direct incorporation of phosphotyrosine as the N-fluorenylmethoxy-carbonyl-O-dimethyl phosphono-L-tyrosine derivative.
N-terminal or C-terminal fusion proteins comprising a peptide of the invention conjugated with other molecules may be prepared by fusing, through recombinant techniques, the N-terminal or C-terminal of the peptide, and the sequence of a selected protein or selectable marker with a desired biological function. The resultant fusion proteins contain the peptide fused to the selected protein or marker protein as described herein. Examples of proteins which may be used to prepare fusion proteins include immunoglobulins, glutathione-S-transferase (GST), hemagglutinin (HA), and truncated myc.
Antibodies
The peptides and complexes of the invention may be used to prepare antibodies immunospecific for such peptides or complexes. Antibodies include monoclonal and polyclonal antibodies, chimeric, single chain, simianized antibodies and humanized antibodies, and Fab fragments including the products of an Fab immunoglobulin expression library.
Conventional methods can be used to prepare the antibodies. As discussed below, the antibodies may be used to identify proteins containing tRNA anticodon stem-loop recognition motifs.
Screening for tRNA anticodon stem-loop recognition motifs and tRNA anticodon stem-loops
The peptides, antibodies specific for the peptides, and complexes of the invention may be labeled using conventional methods with various enzymes, fluorescent materials, luminescent materials and radioactive materials. Suitable enzymes, fluorescent materials, luminescent materials, and radioactive material are well known to the skilled artisan. Labeled antibodies specific for the peptides of the invention may be used to screen for tRNA anticodon stem-loop recognition motifs or binding sites in proteins such as tRNA synthetases or ribosomal proteins, and labeled peptides of the invention may be used to screen for tRNA anticodon stem-loops.
In an embodiment of the invention, the peptides of the present invention can be used as probes to identify nucleic acids comprising tRNA anticodon stem-loops. The methods allow for the identification of nucleic acids that are specifically involved in protein translation.
Therefore, in one aspect, the peptides of the invention can be used to determine whether a particular nucleic acid comprises a tRNA anticodon stem-loop.
Determination of whether a nucleic acid comprises a tRNA anticodon stem-loop may be carried out using a variety of means. For example, the nucleic acid to be tested can be immobilized on a solid support, e.g. a microtiter well or nitrocellulose membrane. After blocking the remaining groups on the support, the nucleic acid to be tested can be exposed to an appropriate amount of a labeled peptide of the invention. Detection of label bound to the test nucleic acid indicates that the nucleic acid contains a tRNA anticodon stem-loop.
In a preferred embodiment, the nucleic acid is attached to a solid support prior to contacting the nucleic acid with a peptide of the invention and the peptide used in the contacting step further comprises a detectable substance. The determining step comprises assaying for the presence of the detectable substance. Alternatively, the peptide of the invention can be attached to a solid support prior to contacting the nucleic acid with the peptide of the invention.
As an affinity ligand, the peptides of the invention can be used to purify nucleic acids which comprise a tRNA anticodon stem-loop from a mixture of nucleic acids. Affinity purification of such nucleic acids can be carried out using conventional affinity purification methods well known in the art.
For example, a peptide of the invention can be attached to a solid support as described herein. A mixture of nucleic acids can then be contacted with the peptide bound to the solid support, such that the peptide selectively binds tRNA anticodon stem-loop containing nucleic acids present in the mixture. The bound nucleic acids can be washed to eliminate unbound nucleic acids. Substantially pure tRNA anticodon stem-loop containing nucleic acids can be eluted from the solid support by conventional elution protocols.
The invention broadly provides methods for identifying substances that bind to a peptide or complex of the invention. The invention also contemplates methods for identifying compounds that bind to substances that interact with a complex or peptide of the invention.
Conventional methods such as co-immunoprecipitation, crosslinking and co-purification through gradients or chromatographic columns may be used to identify such substances and compounds. Substances and compounds identified using the methods of the invention may be isolated and characterized (e.g. sequenced) using conventional techniques.
Substances which can bind with a peptide or complex of the invention can be identified by reacting a peptide or complex of the invention with a test substance which potentially binds to the peptide or complex, under conditions which permit the formation of substance-peptide or substance-complex conjugates, and removing and/or detecting the conjugates. The conjugates can be detected by assaying for substance-peptide or substance-complex conjugates, for free substance, or for non-complexed peptide or complexes. Conditions which permit the formation of conjugates may be selected having regard to factors such as the nature and amounts of the substance. The conjugates, free substance or non-complexed peptides or complexes may be isolated by conventional isolation techniques, for example, salting out, chromatography, electrophoresis, gel filtration, fractionation, absorption, polyacrylamide gel electrophoresis, agglutination, or combinations thereof. To facilitate the assay of the components, antibody against a peptide or complex may be utilized. The antibodies, peptides, complexes, or substances may be labeled with a detectable substance, or they may be bound to a solid support as described herein.
X-ray crystallographic studies may be used as a means of evaluating interactions. For example, purified molecules in a conjugate, when crystallized in a suitable form, are amenable to detection of intra-molecular interactions by x-ray crystallography.
Spectroscopy may also be used to detect interactions and, in particular, Q-TOF instrumentation may be used. In addition, two-hybrid systems may be used to detect protein interactions in vivo.
Screening Methods
The invention also enables screening of compounds that enhance (agonist) or diminish (antagonist) the level of interaction of a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase or ribosomal protein) and a tRNA anticodon stem-loop. The terms "agonist" and "antagonist" as used herein do not imply a particular mechanism of function.
In one aspect of the invention, to screen for agonists or antagonists, a synthetic reaction mixture, a cellular compartment (e.g. membrane, cell envelope, cell wall), or a preparation thereof, comprising a peptide of the invention and a tRNA anticodon stem-loop is incubated in the presence or the absence of a test compound which may be an agonist or antagonist. The ability of the test compound to enhance or interfere can be reflected in increased or decreased binding of the peptide and the tRNA anticodon stem-loop. The efficiency of the reaction may be enhanced by labeling the peptide or tRNA anticodon stem-loop, using a reporter system, or immobilizing the peptide or tRNA anticodon stem-loop upon a solid support.
As a specific example, a nucleic acid comprising a tRNA anticodon stem-loop can be coupled to the wells of a microtiter plate or nitrocellulose membrane. The test compound can be added to the well or membrane to preincubate with the nucleic acid. The peptide of the invention, to which a detectable substance is attached, is added to the well or membrane. Following sufficient incubation, the wells or membranes are rinsed, and binding of the peptide to the nucleic acid can be assessed by, for example, assaying for the presence of residual detectable substance. Those of skill in the art will recognize that the screening assay format can be set up in either direction, i.e. either the peptide or nucleic acid can be bound to the support, while the other is labeled. The level of binding can be compared to suitable positive and negative controls. Alternatively, by providing the nucleic acid and/or the peptide in known concentrations, one can assay for free, or unbound, nucleic acids and/or peptide, and by negative implication, determine the level of complexes that are formed.
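The determination "by negative implication" amounts to a mass balance: whatever labeled component was added and is no longer free is taken to be in complex. The following is a minimal sketch of that arithmetic, assuming a simple two-state picture (free or bound) with no losses to the vessel or support; the function names and the example concentrations are hypothetical.

```python
def complex_concentration(total, free):
    """Concentration in complex, inferred as total added minus measured free.
    Both arguments must be in the same units (e.g. nM)."""
    if total <= 0 or free < 0 or free > total:
        raise ValueError("expected 0 <= free <= total and total > 0")
    return total - free


def fraction_bound(total, free):
    """Fraction of the added component that is found in complex."""
    return complex_concentration(total, free) / total
```

For instance, if 100 nM peptide is added and 35 nM is recovered free, 65 nM is inferred to be in complex, a bound fraction of 0.65.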
The amount or concentration of the test compound that is added, when known, will vary depending on the compound. Typically a range of concentrations will be used. In the case of uncharacterized test compounds it may not be possible, and it is not necessary, to determine the concentration of the compound.
It is desirable to include various controls, e.g. positive and negative controls, in the assays. In the testing of agonist activity, negative controls can include incubating with inert compounds (e.g. compounds known not to have agonist activity) or in the absence of added compounds. Positive controls can include incubating with compounds known to have agonist activity such as the natural ligand. As will be apparent to one of ordinary skill in the art, similar (though complementary) controls can be included in assays for antagonist activity, as well as various additional controls.
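The comparison against controls described above can be sketched numerically. The following is a minimal illustration, not a procedure taken from the specification: it assumes a plate-style readout in arbitrary signal units, and the function name `percent_inhibition`, the choice of "no test compound" wells as the full-binding control, the choice of "no labeled peptide" wells as the background control, and all readings are hypothetical.

```python
from statistics import mean


def percent_inhibition(signal, uninhibited_controls, background_controls):
    """Express one well's binding signal as percent inhibition of binding.

    uninhibited_controls: signals from wells with no test compound (full binding).
    background_controls: signals from wells with no labeled peptide (no binding).
    Both control definitions are assumptions made for this sketch.
    """
    top = mean(uninhibited_controls)
    bottom = mean(background_controls)
    if top <= bottom:
        raise ValueError("controls do not define a usable assay window")
    return 100.0 * (top - signal) / (top - bottom)


# Hypothetical readings, arbitrary units.
uninhibited = [1000.0, 980.0, 1020.0]
background = [100.0, 90.0, 110.0]
for label, reading in [("compound A", 550.0), ("compound B", 990.0)]:
    value = percent_inhibition(reading, uninhibited, background)
    print(label, round(value, 1))
```

Under this convention a value near 0 indicates no effect on binding, a positive value indicates interference (antagonist-like behavior), and a negative value indicates enhanced binding (agonist-like behavior).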
A competitive assay may also be used to screen for antagonists. The assay combines a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase, ribosomal protein, or peptide of the invention), a tRNA anticodon stem-loop, and a test compound under appropriate conditions for a competitive inhibition assay.
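As an illustration of what a competitive inhibition assay measures, the sketch below uses the textbook one-site equilibrium model, in which a competitor raises the apparent dissociation constant of the labeled partner by the factor (1 + [I]/Ki). This model is an assumption introduced here, not one stated in the specification; it further assumes that free concentrations are close to total concentrations. The names `fraction_bound`, `kd` and `ki`, and all numerical values, are hypothetical.

```python
def fraction_bound(labeled, kd, competitor=0.0, ki=1.0):
    """Equilibrium fraction of binding sites occupied by the labeled partner.

    One-site competitive model: the competitor scales the apparent Kd of the
    labeled partner by (1 + competitor / ki). All concentrations share one unit.
    """
    if labeled < 0 or competitor < 0:
        raise ValueError("concentrations must be non-negative")
    if kd <= 0 or ki <= 0:
        raise ValueError("dissociation constants must be positive")
    apparent_kd = kd * (1.0 + competitor / ki)
    return labeled / (labeled + apparent_kd)


# Hypothetical titration of a test compound against a fixed labeled peptide.
for competitor_conc in (0.0, 1.0, 10.0, 100.0):
    occupancy = fraction_bound(labeled=1.0, kd=1.0, competitor=competitor_conc, ki=1.0)
    print(competitor_conc, round(occupancy, 3))
```

In this model occupancy falls monotonically as the competitor concentration rises, which is the behavior a range of test-compound concentrations is intended to reveal.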
Potential agonists and antagonists include small inorganic or organic molecules, peptides, polypeptides, antibodies, a mixture of molecules, peptides, polypeptides etc., or an extract made from biological materials such as bacteria, plants, fungi, or animal cells or tissues. The compound may be an endogenous physiological compound or it may be a natural or synthetic compound.
The screening assays may be used in the discovery and development of therapeutics such as antibacterial or antifungal compounds. In addition, antisense sequences to the sequence encoding the novel domain, i.e. the tRNA anticodon stem-loop recognition motif or binding site identified herein, may be used to control expression of the coding sequence and thus may be used as potential therapeutics.
The peptide of the invention can be used to model small molecules which interfere with the binding of a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase or ribosomal protein) with a tRNA anticodon stem-loop in vivo. In particular, the structure of the tRNA anticodon stem-loop sequence recognition motif, as described herein, can be applied in generating synthetic analogs and mimics of the recognition sequence.
Synthetic elements can be pieced together based upon their analogy to the structural and chemical aspects of the recognition sequence motif. Such mimics and analogs may be used in blocking or inhibiting specific aspects of protein translation and may be useful as therapeutic treatments in accordance with the methods described herein.
Compositions
While it is possible to administer an active ingredient alone, it is preferable to present it as part of a pharmaceutical composition or formulation. Therefore, the peptides, complexes, antibodies, substances, and compounds of the invention may be formulated into pharmaceutical compositions for administration to subjects in a therapeutically active amount and in a biologically compatible form suitable for administration in vivo, i.e. a form of the peptides etc. to be administered in which any toxic effects are outweighed by the therapeutic effects. A therapeutically active amount of a pharmaceutical composition of the invention is defined as an amount effective, at dosages and for periods of time necessary, to achieve the desired result. For example, a therapeutically active amount of a peptide may vary according to factors such as the disease state, age, sex, and weight of the individual. The dosage regimen may be adjusted to provide the optimum therapeutic response.
The peptides etc. may be administered in a convenient manner such as by injection (subcutaneous, intravenous, etc.), oral administration, inhalation, transdermal application, or rectal administration. Depending on the route of administration, the peptides etc. may be coated in a material to protect them from the action of enzymes. The peptides etc. may also be used in combination with organic substances for prolongation of their pharmacologic actions. Examples of such organic substances are non-antigenic gelatin, carboxymethylcellulose, sulfonate or phosphate ester of alginic acid, dextran, polyethylene glycol and other glycols, phytic acid, polyglutamic acid, and protamine.
The compositions described herein can be prepared by per se known methods for the preparation of pharmaceutically acceptable compositions which can be administered to subjects, such that an effective quantity of a peptide etc. is combined in a mixture with a pharmaceutically acceptable vehicle. Suitable vehicles are described, for example, in Remington's Pharmaceutical Sciences (Mack Publishing Company, Easton, Pa., USA 1985). On this basis, the compositions include, albeit not exclusively, solutions of the peptides in association with one or more pharmaceutically acceptable vehicles or diluents, contained in buffered solutions with a suitable pH and iso-osmotic with the physiological fluids. The peptides etc. may also be incorporated in liposomes or similar delivery vehicles.
Applications
The peptides and complexes of the invention interfere with the interaction of a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tRNA synthetase or ribosomal protein) and a tRNA anticodon stem-loop. The activity of a peptide or complex of the invention may be confirmed by assaying for the ability of the peptide or complex to interfere with the interaction of, for example, a tyrosyl tRNA synthetase or ribosomal S4 protein and a tRNA anticodon stem-loop.
Computer modelling techniques known in the art may also be used to observe the interaction of a peptide of the invention, and truncations and analogs thereof, with a tRNA anticodon stem-loop (for example, Homology Insight II and Discovery available from BioSym/Molecular Simulations, San Diego, California, U.S.A.). If computer modelling indicates a strong interaction, the peptide can be synthesized and tested for its ability to interfere with the interaction of a tRNA anticodon stem-loop and a protein comprising a tRNA anticodon stem-loop recognition motif (e.g. tyrosyl tRNA synthetase or ribosomal S4 protein).
The peptides, compositions, complexes, and antibodies of the invention, and compounds and substances identified using the screening assays of the invention may be used in therapeutic applications for the treatment of living organisms including human or non-human mammalian subjects.
Alternatively, the peptides etc. may be useful as a prophylactic treatment, or in screening for compounds effective in prophylactic treatments.
The peptides, complexes, compositions, antibodies, and compounds and substances identified using the screening assays of the invention may be used to treat or prevent infections caused by bacteria such as species of Bacillus, E. coli, Mycobacterium, Helicobacter, Haemophilus, Streptococcus, and Staphylococcus, and infections caused by fungi such as yeast including S. cerevisiae, and Aspergillus. The peptides, complexes, compositions, antibodies, and compounds may also be used as immunotoxins and anti-viral agents. They may also be used as plant toxins, either as an applied pesticide formulation or by incorporation into the genome of the plant. They may also be useful in vitro to arrest protein translation in biochemical assays where a precise termination of the reaction is desired.
The peptides and complexes of the invention may also be used to induce an immunological response in an individual, particularly a mammal, by inoculating the individual with a peptide, complex, or composition of the invention in an amount adequate to produce antibody to protect the individual from disease (e.g. bacterial infections). An immunological response may also be induced by delivering, through gene therapy, a gene encoding a peptide of the invention in vivo in order to induce an immune response to protect the individual from disease. Thus, the invention contemplates a vaccine formulation which comprises a peptide or complex of the invention together with a suitable carrier.
The following non-limiting example is illustrative of the present invention:
EXAMPLE
The three-dimensional structure of B. stearothermophilus tyrosyl-tRNA synthetase (TyrRS) has been known for more than 10 years (1); however, the 100 C-terminal amino acids were found to be disordered in the crystal structure. Deletion mutants have demonstrated that this C-terminal domain is required for tRNA binding and recognition (2,3). Starting with prokaryotic TyrRS from Escherichia coli, a PSI-BLAST (4) search iteratively finds weak similarities between the C-terminal region and archaeal, chloroplast and prokaryotic ribosomal S4 proteins, as well as NAM-9, the yeast mitochondrial ortholog of S4. The corresponding alignment and the motif inferred from this similarity are shown in Figure 1.
Ribosomal S4 protein is a multifunctional ribosomal protein associated with the 30S subunit, comprising 206 amino acids in E. coli. S4 is an ancient ribosomal protein, if not one of the most ancient (5). It has been demonstrated that in E. coli S4 has an autoregulatory function, binding to a pseudoknot of its own operon mRNA, which limits its expression through a feedback mechanism (6). S4 is required in the ribosome and is the first protein involved in the folding of 16S rRNA (7).
Mutations of S4 proteins, specifically the ram (ribosomal ambiguity) D14 and D12 mutants in E. coli (8,9,10), the omnipotent suppressor mutant SUP46 in yeast (11), and the NAM-9 mutants of yeast (12), affect translational accuracy. Visualization of S4 in the ribosome through 3-D electron microscopy shows that it is superimposable with the region of the A and P sites, the entry and peptidyl-tRNA binding sites of the ribosome (13,14).
There are several experiments elucidating the interaction between ribosome and tRNA that indicate a tRNA anticodon stem-loop interaction when tRNA is in the peptidyl (P) site. In the yeast tRNA~", the top base pairs of the anticodon helix were shown to have an effect on ribosomal binding and activity independently of the anticodon in vivo (15). Hydroxyl radical probing experiments revealed that the anticodon stem-loop is protected in the P site of the 30S ribosomal subunit (16). Ribosomal protein S4 is also implicated in the binding of the tRNA anticodon. Ribosomal proteins S4 and S18 were crosslinked to chemically modified anticodon nucleotides in early in vitro experiments (17,18).
Characterization of nonsense and missense suppressor tRNAs in the context of the ram S4 mutant suggested that the level of suppression might depend upon recognition of tRNA anticodon stem-loop structure by S4 protein (19). S4 ram mutations also affect the kinetic off rates of the P site tRNA binding (20).
The mRNA and rRNA binding regions of S4 have been previously characterized. It has been shown that both functions reside within residues 47-104 in E. coli S4 protein (21,22). Yet the most conserved region of S4 lies to the C-terminal side of this region and has not been shown to possess a specific function to date. This region (residues 97-132 in E. coli) also comprises the motif similarity between S4 and the TyrRS C-terminal region shown in Figure 1. Interestingly, no suppressor or ribosomal ambiguity mutants have ever been characterized from this most conserved portion of S4. Mutations here have been observed only rarely, probably owing to the functional importance of this region. A more fundamental and specific binding and recognition event is the reason for the conservation of this motif.
The lack of structure of the C-terminus in the crystals of B. stearothermophilus TyrRS has led to the question as to whether the C-terminus of TyrRS has any structure at all. Similarly, the structure of the S4 protein rRNA and mRNA binding domain has been studied, and most recently the 3-D structure of S4 has been determined (23,24). Circular dichroism spectra of the E. coli S4 fragment 48-177 (21) and the B. stearothermophilus TyrRS fragment 323-419 (25) corresponding to the motif presented here are qualitatively similar in shape, supporting a shared folded structure. Sequence-structure threading (26) was performed to determine a structure-based alignment to further test this hypothesis. Despite difficulties in threading very small domains, the resulting threading alignment shown in Figure 2 recreated the entire motif alignment of Figure 1, and also aligned a conserved region of positively charged residues downstream of the motif.
The experimental analysis of both the S4 protein and TyrRS indicates that the individual domains comprising these proteins are modular and functionally divisible. The sequence similarity between TyrRS and S4 may reflect an evolutionary event in which ribosomal protein S4 was grafted onto the C-terminus of a bacterial precursor TyrRS originally more closely related to the shorter archaeal tyrosyl-tRNA synthetases, which lack this C-terminus. Perhaps it is not a coincidence that the B. subtilis genomic sequence, which encodes two tyrosyl-tRNA synthetase genes (tyrS), has one TyrRS adjacent to the single ribosomal S4 gene (27); however, this close spatial relationship is not found in other complete genomes. The observation of two variants of TyrRS in prokaryotes is best explained by an ancient duplication event followed by differential loss of one or the other form (28). This duplication event probably came some time after the TyrRS-S4 chimera was formed in the prokaryotic ancestor.
It is clear that this motif persists in the S. cerevisiae mitochondrial form of TyrRS (mtTyrRS). This indicates that any such S4 insertion event would precede a symbiotic event forming mitochondria in eukaryotes, and thus it is perhaps a useful evolutionary marker. In P. anserina and N. crassa mtTyrRS, the C-terminus has diverged from this sequence, which is accounted for by the alteration of the C-terminus to accommodate the rather large group I intron splicing polypeptide, possibly a replacement of the S4-like C-terminus in some mitochondria (29). Interestingly, no convincing human ESTs have yet been found with similarity to a mitochondrial TyrRS. It is possible that some eukaryotes may have replaced the mtTyrRS with the cytoplasmic TyrRS.
The earlier work of Bedouelle et al. (30) demonstrates the interaction of this fragment of B. stearothermophilus TyrRS with its cognate tRNATyr. The residues from the S4-motif (namely R368 and R371) were among the 6 basic residues which, when mutated, could not complement the temperature sensitive tyrS strain of E. coli, demonstrating their requirement in the interaction with tRNATyr. These include the conserved residues shown in blue in Figure 3. The model-building studies of the same group showed that the C-terminus of TyrRS will co-localize with the anticodon helix of tRNATyr. The C-terminus of the eukaryotic cytoplasmic and archaeal TyrRS is shorter than that of the prokaryotic variants, lacking this motif. In a corresponding fashion, the tRNA binding specificity of eukaryotic tyrosyl-tRNA synthetase seems independent of the anticodon helix, as the TyrRS binding affinity resides mainly in the aminoacyl-stem of tRNATyr (31,32). Specificity for binding the anticodon of tRNATyr in prokaryotic TyrRS may indeed have arisen by the apparent ancient addition of S4 sequence to the TyrRS C-terminus, forming a chimera.
The comparison of ribosomal S4 protein against structures in the PDB using VAST (33) shows similarities to the Ets-fold DNA binding proteins, as previously reported by the authors of the S4 structure (24). The compact substructure comprising the S4 signature motif shown in Figure 3 is not on the same side of the domain as the helix corresponding to the DNA binding helix of Ets-domain proteins. The similarity with the Ets-domain comprises this helix and the beta-sheet, but the compact substructure formed by the S4 motif is not found in any other structure in the current structure database.
All 6 of the residues identified as crucial for tRNA binding in the TyrRS C-terminus lie in this compact substructure, extended along the C-terminal side of the motif, as shown as solid cartoons in Figure 3. This similarity indicates that this portion of S4 may form part of the tRNA P site binding domains. A neighbor joining clustering analysis was performed using ClustalX on the motif in Figure 1 itself. Despite this being a very short sequence in the alignment, there is sufficient information to create a remarkable tree, shown in Figure 4, with a nested canonical archaea/eukaryote/prokaryote/chloroplast classification. Each of the mitochondrial S4 sequences which correspond to an alternative mitochondrial genetic code is an outgroup to this classification, with the earliest branching R. americana S4 motif corresponding to the most primitive mitochondrion yet discovered (34). Since the molecular carriers of the genetic code are the cognate pool of all tRNAs, this unique clustering supports the conclusion that this is a tRNA binding motif. This nested, compact fragment within S4, indeed the most conserved region of all S4 proteins, forms a portion of the ribosomal tRNA binding P site that binds all tRNAs in the organism or organelle. This new information, taken together with the mapping of the region conferring translational accuracy onto the S4 structure (24) on the opposite side of the S4 "waist" from this motif, indicates that ribosomal S4 protein may comprise a significant portion of the anticodon-codon decoding active site of all ribosomes.
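A neighbor joining analysis of this kind starts from pairwise distances between the aligned motifs. The sketch below shows one common choice, the p-distance (fraction of differing positions), for two 36-residue motifs copied from the sequence listing: SEQ ID NO: 13 and SEQ ID NO: 23, both from B. stearothermophilus. Treating these as the TyrRS and S4 motifs respectively is an assumption, since Figure 1 is not reproduced here, and whether ClustalX applied any distance correction in the original analysis is not stated. The function name `identity` is hypothetical.

```python
# SEQ ID NO: 13 (assumed here to be the TyrRS motif), three-letter codes.
SEQ_13 = (
    "Leu Val Glu Leu Leu Val Ser Ala Gly Ile Ser Pro Ser Lys Arg Gln "
    "Ala Arg Glu Asp Ile Gln Asn Gly Ala Ile Tyr Val Asn Gly Glu Arg "
    "Leu Gln Asp Val"
).split()

# SEQ ID NO: 23 (assumed here to be the S4 motif), three-letter codes.
SEQ_23 = (
    "Leu Asp Asn Leu Val Tyr Arg Leu Gly Leu Ala Arg Thr Arg Arg Gln "
    "Ala Arg Gln Leu Val Thr His Gly His Ile Leu Val Asp Gly Ser Arg "
    "Val Asn Ile Pro"
).split()


def identity(a, b):
    """Return (matching positions, fraction identical) for two ungapped, equal-length motifs."""
    if len(a) != len(b):
        raise ValueError("motifs must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches, matches / len(a)


matches, fraction = identity(SEQ_13, SEQ_23)
p_distance = 1.0 - fraction  # one entry of the distance matrix used for clustering
print(matches, len(SEQ_13), round(p_distance, 3))
```

Repeating this over every pair of motifs gives the distance matrix from which a neighbor joining tree can be built.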
Having illustrated and described the principles of the invention in a preferred embodiment, it should be appreciated by those skilled in the art that the invention can be modified in arrangement and detail without departure from such principles. We claim all modifications coming within the scope of the following claims.
All publications, patents and patent applications referred to herein are incorporated by reference in their entirety to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety.
Full citations for the references referred to in the specification are set out below, and a listing and detailed legends for the figures are provided.
The application contains sequence listings which form part of the application.
FULL CITATIONS FOR REFERENCES REFERRED TO IN THE SPECIFICATION
1 Brick, P. and Blow, D.M. (1987) J.Mol.Biol. 194, 287-297.
2 Brick, P., Bhat, T.N., and Blow, D.M. (1989) J.Mol.Biol. 208, 83-98.
3 Waye, M.M. et al. (1983) EMBO J. 2, 1827-1829.
4 Altschul, S.F. et al. (1997) Nucleic.Acids.Res. 25, 3389-3402.
5 Nowotny, V. and Nierhaus, K.H. (1988) Biochemistry 27, 7051-7055.
6 Tang, C.K. and Draper, D.E. (1989) Cell 57, 531-536.
7 Sapag, A. and Draper, D.E. (1997) Bioorg.Med.Chem. 5, 1097-1105.
8 Andersson, D.I. and Kurland, C.G. (1983) Mol.Gen.Genet. 191, 378-381.
9 Gorini, L. (1971) Nat. New Biol. 234, 261-264.
10 Rosset, R. and Gorini, L. (1969) J.Mol.Biol. 39, 95-112.
11 Vincent, A. and Liebman, S.W. (1992) Genetics 132, 375-386.
12 Boguta, M. et al. (1992) Mol.Cell.Biol. 12, 402-412.
13 Agrawal, R.K. et al. (1996) Science 271, 1000-1002.
14 Beniac, D.R. et al. (1997) J.Microsc. 188, 24-35.
15 Schultz, D.W. and Yarus, M. (1994) J.Mol.Biol. 235, 1395-1405.
16 Huttenhofer, A. and Noller, H.F. (1992) Proc.Natl.Acad.Sci. U.S.A. 89, 785
17 Pongs, O. and Lanka, E. (1975) Proc.Natl.Acad.Sci. U.S.A. 72, 1505-1509.
18 Pongs, O. and Rossner, E. (1975) Hoppe Seylers.Z.Physiol.Chem. 356, 1297-
19 Kirsebom, L.A. and Isaksson, L.A. (1986) Mol.Gen.Genet. 205, 240-247.
20 Karimi, R. and Ehrenberg, M. (1996) EMBO J. 15, 1149-1154.
21 Baker, A.M. and Draper, D.E. (1995) J.Biol.Chem. 270, 22939-22945.
22 Conrad, R.C. and Craven, G.R. (1987) Nucleic.Acids.Res. 15, 10331-10343.
23 Ramakrishnan, V. and White, S.W. (1998) Trends.Biochem.Sci. 23, 208-212.
24 Davies, C. et al. (1998) EMBO J. In press.
25 Guez-Ivanier, V. and Bedouelle, H. (1996) J.Mol.Biol. 255, 110-120.
26 Bryant, S.H. (1996) Proteins 26, 172-185.
27 Grundy, F.J. and Henkin, T.M. (1990) J.Bacteriol. 172, 6372-6379.
28 Doolittle, R.F. and Handy, J. (1998) Current Opinion in Genetics and Development. In press.
29 Cherniack, A.D. et al. (1990) Cell 62, 745-755.
30 Bedouelle, H. and Winter, G. (1986) Nature 320, 371-373.
31 Motoki, L, Yosinari, S., Watanabe, K., and Nishikawa, K. (1991) Nucleic.Acids.Symp.Ser. 25, 173-1?4.
32 Yoshinari, S. and Nishikawa, K. (1990) Nucleic.Acids.Symp.Ser. 22, 115-116.
33 Gibrat, J-F., Madej, T., and Bryant, S.H. (1996) Current Opinion in Structural Biology 6, 377-385.
34 Lang, B.F. et al. (1997) Nature 387, 493-497.
SEQUENCE LISTING
<110> Mount Sinai Hospital et al.
<120> Novel tRNA Binding Domain
<130> P170PCT4
<140> PCT/CA99/00779
<141> 1999-08-24
<150> US 60/097,670
<151> 1998-08-24
<160> 48
<170> PatentIn Ver. 2.0

<210> 1
<211> 36
<212> PRT
<213> R. americana
<400> 1
Leu Asp Ile Ile Ile Tyr Arg Ala Gly Phe Val Asn Ser Ile Tyr Gln
Ala Arg Leu Leu Val Asn His Lys His Val Leu Val Asn Asn Lys Ile
Gln Asn Ile Ser

<210> 2
<211> 36
<212> PRT
<213> M. polymorpha
<400> 2
Leu Asp Val Ile Leu Val Arg Leu Asn Phe Cys Ser Thr Met Phe Gln
Ala Arg Gln Leu Ile Ser His Lys Asn Ile Cys Val Asn Tyr Lys Lys
Val Asn Ile Pro

<210> 3
<211> 36
<212> PRT
<213> S. cerevisiae
<400> 3
Leu Asp Phe Ala Leu Phe Arg Ala Met Phe Ala Ser Ser Val Arg Gln
Ala Arg Gln Phe Ile Leu His Gly Asn Val Arg Val Asn Gly Val Lys
Ile Lys His Pro

<210> 4
<211> 36
<212> PRT
<213> A. castellanii
<400> 4
Leu Glu Asn Phe Leu Met Arg Leu Asn Leu Phe Pro Ser Ile Tyr Phe
Ile Lys Lys Phe Ile Glu Tyr Gly Asn Val Phe Val Asn Asn Lys Ile
Ile Asn Tyr Thr

<210> 5
<211> 36
<212> PRT
<213> S. cerevisiae
<400> 5
Asp Leu Ile Lys Leu Ile Cys Lys Leu Val Asn Cys Ser Val Ser Glu
Ala Arg Arg Lys Leu Ser Gln Gly Ser Val Tyr Leu His His Ser Lys
Ser Lys Val Asn

<210> 6
<211> 36
<212> PRT
<213> M. genitalium
<400> 6
Leu Ile Asp Tyr Leu Val Glu Thr Lys Phe Ile Lys Ser Lys Ser Glu
Ala Arg Arg Leu Ile Ser Gln Lys Gly Leu Thr Ile Asn Asn Lys His
Val Leu Asp Leu

<210> 7
<211> 36
<212> PRT
<213> H. influenzae
<400> 7
Leu Ala Thr Leu Leu Lys Glu Ala Gly Leu Val Pro Ser Thr Ser Glu
Ala Ile Arg Ser Ala Gln Gln Gly Gly Val Lys Ile Asn Gly Glu Lys
Val Asp Asn Val

<210> 8
<211> 36
<212> PRT
<213> M. pneumoniae
<400> 8
Leu Val Asp Val Ile Val Asp Leu Gly Leu Val Val Ser Arg Ser Glu
Ala Arg Arg Val Ile Gln Gln Gly Gly Leu Thr Ile Asn Gln Glu Lys
Val Thr Asp Val

<210> 9
<211> 36
<212> PRT
<213> B. subtilis
<400> 9
Met Ile Asp Leu Leu Val Lys Leu Lys Leu Leu Ser Ser Lys Ser Glu
Ala Arg Arg Met Ile Gln Asn Gly Gly Val Arg Ile Asp Gly Glu Lys
Val Thr Asp Val

<210> 10
<211> 36
<212> PRT
<213> Synechocystis
<400> 10
Leu Ala Tyr Leu Leu Ser Ala Ser Gly Leu Cys Pro Ser Ser Ser Glu
Gly Arg Arg Gln Ile Lys Gly Gly Ala Val Arg Leu Asp Gly Asp Arg
Leu Glu Asp Val

<210> 11
<211> 36
<212> PRT
<213> T. ferrooxidans
<400> 11
Leu Ser Gln Leu Leu Val Gln Val His Leu Ala Ala Ser Thr Ser Glu
Ala Met Arg Lys Met Lys Glu Gly Ala Val Arg Val Asp Trp Arg Arg
Val Val Asp Pro

<210> 12
<211> 36
<212> PRT
<213> B. subtilis
<400> 12
Leu Val Asp Val Leu Val Gln Ser Lys Leu Ser Pro Ser Lys Arg Gln
Ala Arg Glu Asp Ile Gln Asn Gly Ala Val Tyr Ile Asn Gly Glu Arg
Gln Thr Glu Ile

<210> 13
<211> 36
<212> PRT
<213> B. stearothermophilus
<400> 13
Leu Val Glu Leu Leu Val Ser Ala Gly Ile Ser Pro Ser Lys Arg Gln
Ala Arg Glu Asp Ile Gln Asn Gly Ala Ile Tyr Val Asn Gly Glu Arg
Leu Gln Asp Val

<210> 14
<211> 36
<212> PRT
<213> E. coli
<400> 14
Leu Met Gln Ala Leu Val Asp Ser Glu Leu Gln Pro Ser Arg Gly Gln
Ala Arg Lys Thr Ile Ala Ser Asn Ala Ile Thr Ile Asn Gly Glu Lys
Gln Ser Asp Pro

<210> 15
<211> 36
<212> PRT
<213> E. coli
<400> 15
Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Ala Thr Arg Ala Glu
Ala Arg Gln Leu Val Ser His Lys Ala Ile Met Val Asn Gly Arg Val
Val Asn Ile Ala

<210> 16
<211> 36
<212> PRT
<213> H. influenzae
<400> 16
Leu Asp Asn Val Val Tyr Arg Met Gly Phe Ala Thr Thr Arg Ala Glu
Ala Arg Gln Leu Val Ser His Lys Ala Ile Val Val Asn Gly Arg Val
Val Asn Ile Pro

<210> 17
<211> 36
<212> PRT
<213> B. aphidicola
<400> 17
Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Cys Thr Arg Ser Glu
Ser Arg Gln Leu Ile Ser His Lys Ser Ile Lys Val Asn Asn Asn Ile
Val Asn Ile Ala

<210> 18
<211> 36
<212> PRT
<213> M. bovis
<400> 18
Leu Asp Asn Val Ile Tyr Arg Ala Gly Leu Ala Arg Thr Arg Arg Met
Ala Arg Gln Leu Val Ser His Gly His Phe Asn Val Asn Gly Val His
Val Asn Val Pro

<210> 19
<211> 36
<212> PRT
<213> M. genitalium
<400> 19
Leu Asp Asn Ile Val Tyr Arg Met Gly Phe Ala Pro Thr Arg Lys Ser
Ala Arg Gln Met Val Asn His Gly His Val Ile Leu Asn Asp Gln Thr
Val Asp Thr Pro

<210> 20
<211> 36
<212> PRT
<213> M. pneumoniae
<400> 20
Leu Asp Asn Ile Val Tyr Arg Met Gly Phe Ala Pro Thr Arg Arg Ser
Ala Arg Gln Leu Val Asn His Gly His Val Leu Leu Asn Asp Arg Thr
Val Asp Thr Pro

<210> 21
<211> 36
<212> PRT
<213> H. pylorii
<400> 21
Leu Asp Asn Val Val Tyr Arg Met Gly Phe Ala Thr Thr Arg Ser Ser
Ala Arg Gln Leu Val Thr His Gly His Val Leu Val Asp Gly Lys Arg
Leu Asp Ile Pro

<210> 22
<211> 36
<212> PRT
<213> B. subtilis
<400> 22
Leu Asp Asn Val Val Tyr Lys Leu Gly Leu Ala Arg Thr Arg Arg Gln
Ala Arg Gln Leu Val Asn His Gly His Ile Leu Val Asp Gly Ser Arg
Val Asp Ile Pro

<210> 23
<211> 36
<212> PRT
<213> B. stearothermophilus
<400> 23
Leu Asp Asn Leu Val Tyr Arg Leu Gly Leu Ala Arg Thr Arg Arg Gln
Ala Arg Gln Leu Val Thr His Gly His Ile Leu Val Asp Gly Ser Arg
Val Asn Ile Pro

<210> 24
<211> 36
<212> PRT
<213> S. oleracea
<400> 24
Leu Asp Asn Ile Leu Phe Arg Leu Gly Met Ala Pro Thr Ile Pro Gly
Ala Arg Gln Leu Val Asn His Arg His Ile Leu Val Asn Gly Arg Ile
Val Asp Ile Pro

<210> 25
<211> 36
<212> PRT
<213> O. sativa
<400> 25
Leu Asp Asn Ile Leu Phe Arg Leu Gly Met Ala Ser Thr Ile Pro Glu
Ala Arg Gln Leu Val Asn His Arg His Ile Leu Val Asn Gly Arg Ile
Val Asp Ile Pro

<210> 26
<211> 36
<212> PRT
<213> N. tabacum
<400> 26
Leu Asp Asn Ile Leu Phe Arg Leu Gly Met Ala Ser Thr Ile Pro Ala
Ala Arg Gln Leu Val Asn His Arg His Ile Leu Val Asn Gly Arg Ile
Val Asp Ile Pro

<210> 27
<211> 36
<212> PRT
<213> M. polymorpha
<400> 27
Leu Asp Asn Ile Ile Phe Arg Leu Gly Met Ala Pro Thr Ile Pro Gly
Ala Arg Gln Leu Val Asn His Arg His Ile Leu Ile Asn Asn Asn Thr
Val Asp Ile Pro

<210> 28
<211> 36
<212> PRT
<213> Synechocystis
<400> 28
Leu Asp Asn Thr Val Phe Arg Leu Gly Met Ala Gly Thr Ile Pro Gly
Ala Arg Gln Leu Val Cys His Gly His Ile Thr Val Asn Gly Gln Val
Val Asp Ile Pro

<210> 29
<211> 36
<212> PRT
<213> C. reinhardtii
<400> 29
Leu Asp Asn Ile Val Phe Arg Leu Asn Met Ala Pro Thr Ile Pro Ala
Ala Arg Gln Leu Ile Ser His Gly His Ile Arg Val Asn Asn Lys Lys
Val Asn Ile Pro

<210> 30
<211> 37
<212> PRT
<213> Crypt. Phi
<400> 30
Leu Asp Asn Val Ile Phe Arg Leu Gly Met Ala Pro Thr Thr Ile Pro
Ala Ala Arg Gln Leu Val Asn His Gly His Ile Lys Val Asn Asn Thr
Arg Val Ser Ile Pro

<210> 31
<211> 35
<212> PRT
<213> C. vulgaris
<400> 31
Leu Asp Thr His Phe Arg Leu Gly Phe Ala Pro Thr Ile Ala Ala Ala
Arg Gln Leu Ile Asn His Gly His Ile Val Val Asn Gly Arg Arg Val
Asp Ile Pro

<210> 32
<211> 36
<212> PRT
<213> D. melanogaster
<400> 32
Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His
Ala Arg Val Leu Ile Arg Gln Arg Thr Phe Val Leu Ala Ser Arg Trp
Ser Thr Ile Pro

<210> 33
<211> 36
<212> PRT
<213> T. brucei
<400> 33
Leu Gln Thr Val Val Phe Lys His Gly Leu Ala Lys Ser Val His His
Ser Arg Val Leu Ile Gln Gln Arg His Ile Ala Val Ala Lys Gln Ile
Val Thr Ile Pro

<210> 34
<211> 36
<212> PRT
<213> S. cerevisiae
<400> 34
Leu Gln Thr Gln Val Tyr Lys Leu Gly Leu Ala Lys Ser Val His His
Ala Arg Val Leu Ile Thr Gln Arg His Ile Ala Val Gly Lys Gln Ile
Val Asn Ile Pro

<210> 35
<211> 36
<212> PRT
<213> R. norvegicus
<400> 35
Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His
Ala Arg Val Leu Ile Arg Gln Arg His Ile Arg Val Leu Lys Gln Val
Val Asn Ile Pro

<210> 36
<211> 36
<212> PRT
<213> H. sapiens
<400> 36
Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His
Ala Arg Val Leu Ile Arg Gln Arg His Ile Arg Val Arg Lys Gln Val
Val Asn Ile Pro

<210> 37
<211> 36
<212> PRT
<213> S. pombe
<400> 37
Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His
Ala Arg Val Leu Ile Phe Gln Arg His Ile Arg Val Gly Lys Gln Ile
Val Asn Val Pro

<210> 38
<211> 36
<212> PRT
<213> P. anserina
<400> 38
Leu Gln Thr Leu Val Tyr Lys Leu Gly Leu Ala Lys Ser Ile His His
Ala Arg Val Leu Ile Arg Gln Arg His Ile Arg Val Gly Lys Gln Ile
Val Asn Val Pro

<210> 39
<211> 36
<212> PRT
<213> D. discoidium
<400> 39
Leu Gln Thr Leu Val Phe Lys Asn Gly Leu Ala Lys Ser Ile His His
Ala Arg Val Leu Ile Lys Gly Arg His Ile Arg Val Gly Lys Gln Leu
Val Asn Val Pro

<210> 40
<211> 36
<212> PRT
<213> N. fowleri
<400> 40
Leu Gln Thr Val Val Gln Lys Leu Gly Leu Ser Lys Ser Ile His His
Ala Arg Gln Leu Ile Phe Gln Arg His Ile Arg Val Gly Lys Gln Thr
Val Asn Val Pro

<210> 41
<211> 36
<212> PRT
<213> C. elegans
<400> 41
Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His
Ala Arg Ile Leu Ile Lys Gln His His Ile Arg Val Arg Arg Gln Val
Val Asp Val Pro

<210> 42
<211> 36
<212> PRT
<213> H. marismortui
<400> 42
Leu Gln Thr Val Val Tyr Arg Lys Gly Tyr Ala Asn Thr Pro Glu Gln
Ala Arg Gln Phe Ile Val His Gly His Ile Val Leu Asp Asp Ala Arg
Val Thr Arg Pro

<210> 43
<211> 36
<212> PRT
<213> S. acidocaldarius
<400> 43
Leu Gln Thr Ile Val Tyr Lys Lys Gly Leu Ala Arg Thr Ile Tyr Gln
Ala Arg Gln Leu Ile Thr His Gly His Ile Ala Ile Ser Gly Arg Lys
Val Thr Ser Pro

<210> 44
<211> 36
<212> PRT
<213> S. solfataricus
<400> 44
Leu Gln Thr Ile Val Tyr Lys Lys Gly Leu Ser Asn Thr Ile Tyr Gln
Ala Arg Gln Leu Ile Thr His Gly His Ile Ala Val Asn Gly Lys Arg
Val Thr Ser Pro

<210> 45
<211> 36
<212> PRT
<213> M. jannaschii
<400> 45
Leu Gln Thr Leu Val Phe Arg Lys Gly Leu Ala Arg Thr Pro Arg Gln
Ala Arg Gln Leu Ile Val His Gly His Ile Ala Val Asn Gly Arg Val
Val Thr Ala Pro

<210> 46
<211> 63
<212> PRT
<213> Synechocystis
<400> 46
Leu Ala Tyr Leu Leu Ser Ala Ser Gly Leu Cys Pro Ser Ser Ser Glu
Gly Arg Arg Gln Ile Lys Gly Gly Ala Val Arg Leu Asp Gly Asp Arg
Leu Glu Asp Val Asn Gln Glu Tyr Ala Asp Pro Lys Met Leu Ile Asn
Lys Val Leu Gln Met Gly Lys Lys Lys Phe Ile Arg Leu Ile Ser

<210> 47
<211> 66
<212> PRT
<213> B. stearothermophilus
<400> 47
Leu Asp Asn Leu Val Tyr Arg Leu Gly Leu Ala Arg Thr Arg Arg Gln
Ala Arg Gln Leu Val Thr Asn Gly His Ile Leu Val Asp Gly Ser Arg
Val Asn Ile Pro Ser Tyr Arg Val Lys Pro Gly Gln Thr Ile Ala Val
Arg Glu Lys Ser Arg Asn Leu Gln Val Ile Lys Glu Ala Leu Glu Ala
Asn Asn

<210> 48
<211> 85
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: tRNA binding domain
<400> 48
Met Leu Xaa Xaa Xaa Leu Ile Val Met Cys Xaa Xaa Xaa Gly Met Asn
Glu His Lys Met Leu Phe Tyr Ile Xaa Xaa Ser Thr Xaa Xaa Xaa Ser
Ala Ile Gly Arg Met Ile Lys Xaa Xaa Met Ile Val Ala Xaa Xaa Gly
Asn Arg His Lys Xaa Ile Val Leu Phe Xaa Leu Ile Val Asn Asp Ser
Arg Gly Ala Leu Xaa Xaa Xaa Leu Ile Val Ser Gln Xaa Xaa Pro Ile
Leu Val Thr Ala Cys
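For readers who want to work with the listing computationally, the following is a minimal sketch of a parser for its numeric field tags (`<210>` sequence identifier, `<211>` length, `<212>` type, `<213>` organism, `<400>` sequence). It is an illustration only: the function name `parse_listing`, the regular expression, and the record layout are hypothetical, and the sample text is limited to SEQ ID NOs: 1 and 2 as given above. The parser ignores whitespace layout and drops any token that is not a three-letter residue code, so comparing the parsed length with the declared `<211>` value is a simple way to detect a damaged entry.

```python
import re

THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}

# One entry: <210> id, <211> length, <212> type, <213> source, <400> id, residues.
ENTRY = re.compile(
    r"<210>\s*(\d+)\s*<211>\s*(\d+)\s*<212>\s*(\S+)\s*<213>\s*(.*?)\s*"
    r"<400>\s*\d+(.*?)(?=<210>|\Z)",
    re.S,
)


def parse_listing(text):
    """Return one dict per sequence entry found in the listing text."""
    records = []
    for seq_id, length, mol_type, source, body in ENTRY.findall(text):
        # Keep only the organism; later tags such as <220>/<223> are cut off.
        organism = re.split(r"<\d{3}>", source)[0].strip()
        residues = "".join(THREE_TO_ONE[t] for t in body.split() if t in THREE_TO_ONE)
        records.append({
            "id": int(seq_id),
            "declared_length": int(length),
            "type": mol_type,
            "organism": organism,
            "sequence": residues,
        })
    return records


SAMPLE = """
<210> 1
<211> 36
<212> PRT
<213> R. americana
<400> 1
Leu Asp Ile Ile Ile Tyr Arg Ala Gly Phe Val Asn Ser Ile Tyr Gln
Ala Arg Leu Leu Val Asn His Lys His Val Leu Val Asn Asn Lys Ile
Gln Asn Ile Ser
<210> 2 <211> 36 <212> PRT <213> M. polymorpha <400> 2 Leu Asp Val Ile
Leu Val Arg Leu Asn Phe Cys Ser Thr Met Phe Gln Ala Arg Gln Leu Ile Ser
His Lys Asn Ile Cys Val Asn Tyr Lys Lys Val Asn Ile Pro
"""

records = parse_listing(SAMPLE)
for record in records:
    ok = len(record["sequence"]) == record["declared_length"]
    print(record["id"], record["organism"], record["sequence"], "length ok" if ok else "length mismatch")
```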
SEQUENCE LISTING
<110> Mount Sinai Hospital et al.
<120> Novel tRNA Binding Domain
<130> P170PCT4
<140> PCT/CA99/00779
<141> 1999-08-24
<150> US 60/097,670
<151> 1998-08-24
<160> 48
<170> PatentIn Ver. 2.0
<210> 1 <211> 36 <212> PRT <213> R. americana
<400> 1 Leu Asp Ile Ile Ile Tyr Arg Ala Gly Phe Val Asn Ser Ile Tyr Gln Ala Arg Leu Leu Val Asn His Lys His Val Leu Val Asn Asn Lys Ile Gln Asn Ile Ser
<210> 2 <211> 36 <212> PRT <213> M. polymorpha
<400> 2 Leu Asp Val Ile Leu Val Arg Leu Asn Phe Cys Ser Thr Met Phe Gln Ala Arg Gln Leu Ile Ser His Lys Asn Ile Cys Val Asn Tyr Lys Lys Val Asn Ile Pro
<210> 3 <211> 36 <212> PRT <213> S. cerevisiae
<400> 3 Leu Asp Phe Ala Leu Phe Arg Ala Met Phe Ala Ser Ser Val Arg Gln Ala Arg Gln Phe Ile Leu His Gly Asn Val Arg Val Asn Gly Val Lys Ile Lys His Pro
<210> 4 <211> 36 <212> PRT <213> A. castellanii
<400> 4 Leu Glu Asn Phe Leu Met Arg Leu Asn Leu Phe Pro Ser Ile Tyr Phe Ile Lys Lys Phe Ile Glu Tyr Gly Asn Val Phe Val Asn Asn Lys Ile Ile Asn Tyr Thr
<210> 5 <211> 36 <212> PRT <213> S. cerevisiae
<400> 5 Asp Leu Ile Lys Leu Ile Cys Lys Leu Val Asn Cys Ser Val Ser Glu Ala Arg Arg Lys Leu Ser Gln Gly Ser Val Tyr Leu His His Ser Lys Ser Lys Val Asn
<210> 6 <211> 36 <212> PRT <213> M. genitalium
<400> 6 Leu Ile Asp Tyr Leu Val Glu Thr Lys Phe Ile Lys Ser Lys Ser Glu Ala Arg Arg Leu Ile Ser Gln Lys Gly Leu Thr Ile Asn Asn Lys His Val Leu Asp Leu
<210> 7 <211> 36 <212> PRT <213> H. influenzae
<400> 7 Leu Ala Thr Leu Leu Lys Glu Ala Gly Leu Val Pro Ser Thr Ser Glu Ala Ile Arg Ser Ala Gln Gln Gly Gly Val Lys Ile Asn Gly Glu Lys Val Asp Asn Val
<210> 8 <211> 36 <212> PRT <213> M. pneumoniae
<400> 8 Leu Val Asp Val Ile Val Asp Leu Gly Leu Val Val Ser Arg Ser Glu Ala Arg Arg Val Ile Gln Gln Gly Gly Leu Thr Ile Asn Gln Glu Lys Val Thr Asp Val
<210> 9 <211> 36 <212> PRT <213> B. subtilis
<400> 9 Met Ile Asp Leu Leu Val Lys Leu Lys Leu Leu Ser Ser Lys Ser Glu Ala Arg Arg Met Ile Gln Asn Gly Gly Val Arg Ile Asp Gly Glu Lys Val Thr Asp Val
<210> 10 <211> 36 <212> PRT <213> Synechocystis
<400> 10 Leu Ala Tyr Leu Leu Ser Ala Ser Gly Leu Cys Pro Ser Ser Ser Glu Gly Arg Arg Gln Ile Lys Gly Gly Ala Val Arg Leu Asp Gly Asp Arg Leu Glu Asp Val
<210> 11 <211> 36 <212> PRT <213> T. ferrooxidans
<400> 11 Leu Ser Gln Leu Leu Val Gln Val His Leu Ala Ala Ser Thr Ser Glu Ala Met Arg Lys Met Lys Glu Gly Ala Val Arg Val Asp Trp Arg Arg Val Val Asp Pro
<210> 12 <211> 36 <212> PRT <213> B. subtilis
<400> 12 Leu Val Asp Val Leu Val Gln Ser Lys Leu Ser Pro Ser Lys Arg Gln Ala Arg Glu Asp Ile Gln Asn Gly Ala Val Tyr Ile Asn Gly Glu Arg Gln Thr Glu Ile
<210> 13 <211> 36 <212> PRT <213> B. stearothermophilus
<400> 13 Leu Val Glu Leu Leu Val Ser Ala Gly Ile Ser Pro Ser Lys Arg Gln Ala Arg Glu Asp Ile Gln Asn Gly Ala Ile Tyr Val Asn Gly Glu Arg Leu Gln Asp Val
<210> 14 <211> 36 <212> PRT <213> E. coli
<400> 14 Leu Met Gln Ala Leu Val Asp Ser Glu Leu Gln Pro Ser Arg Gly Gln Ala Arg Lys Thr Ile Ala Ser Asn Ala Ile Thr Ile Asn Gly Glu Lys Gln Ser Asp Pro
<210> 15 <211> 36 <212> PRT <213> E. coli
<400> 15 Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Ala Thr Arg Ala Glu Ala Arg Gln Leu Val Ser His Lys Ala Ile Met Val Asn Gly Arg Val Val Asn Ile Ala
<210> 16 <211> 36 <212> PRT <213> H. influenzae
<400> 16 Leu Asp Asn Val Val Tyr Arg Met Gly Phe Ala Thr Thr Arg Ala Glu Ala Arg Gln Leu Val Ser His Lys Ala Ile Val Val Asn Gly Arg Val Val Asn Ile Pro
<210> 17 <211> 36 <212> PRT <213> B. aphidicola
<400> 17 Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Cys Thr Arg Ser Glu Ser Arg Gln Leu Ile Ser His Lys Ser Ile Lys Val Asn Asn Asn Ile Val Asn Ile Ala
<210> 18 <211> 36 <212> PRT <213> M. bovis
<400> 18 Leu Asp Asn Val Ile Tyr Arg Ala Gly Leu Ala Arg Thr Arg Arg Met Ala Arg Gln Leu Val Ser His Gly His Phe Asn Val Asn Gly Val His Val Asn Val Pro
<210> 19 <211> 36 <212> PRT <213> M. genitalium
<400> 19 Leu Asp Asn Ile Val Tyr Arg Met Gly Phe Ala Pro Thr Arg Lys Ser Ala Arg Gln Met Val Asn His Gly His Val Ile Leu Asn Asp Gln Thr Val Asp Thr Pro
<210> 20 <211> 36 <212> PRT <213> M. pneumoniae
<400> 20 Leu Asp Asn Ile Val Tyr Arg Met Gly Phe Ala Pro Thr Arg Arg Ser Ala Arg Gln Leu Val Asn His Gly His Val Leu Leu Asn Asp Arg Thr Val Asp Thr Pro
<210> 21 <211> 36 <212> PRT <213> H. pylori
<400> 21 Leu Asp Asn Val Val Tyr Arg Met Gly Phe Ala Thr Thr Arg Ser Ser Ala Arg Gln Leu Val Thr His Gly His Val Leu Val Asp Gly Lys Arg Leu Asp Ile Pro
<210> 22 <211> 36 <212> PRT <213> B. subtilis
<400> 22 Leu Asp Asn Val Val Tyr Lys Leu Gly Leu Ala Arg Thr Arg Arg Gln Ala Arg Gln Leu Val Asn His Gly His Ile Leu Val Asp Gly Ser Arg Val Asp Ile Pro
<210> 23 <211> 36 <212> PRT <213> B. stearothermophilus
<400> 23 Leu Asp Asn Leu Val Tyr Arg Leu Gly Leu Ala Arg Thr Arg Arg Gln Ala Arg Gln Leu Val Thr His Gly His Ile Leu Val Asp Gly Ser Arg Val Asn Ile Pro
<210> 24 <211> 36 <212> PRT <213> S. oleracea
<400> 24 Leu Asp Asn Ile Leu Phe Arg Leu Gly Met Ala Pro Thr Ile Pro Gly Ala Arg Gln Leu Val Asn His Arg His Ile Leu Val Asn Gly Arg Ile Val Asp Ile Pro
<210> 25 <211> 36 <212> PRT <213> O. sativa
<400> 25 Leu Asp Asn Ile Leu Phe Arg Leu Gly Met Ala Ser Thr Ile Pro Glu Ala Arg Gln Leu Val Asn His Arg His Ile Leu Val Asn Gly Arg Ile Val Asp Ile Pro
<210> 26 <211> 36 <212> PRT <213> N. tabacum
<400> 26 Leu Asp Asn Ile Leu Phe Arg Leu Gly Met Ala Ser Thr Ile Pro Ala Ala Arg Gln Leu Val Asn His Arg His Ile Leu Val Asn Gly Arg Ile Val Asp Ile Pro
<210> 27 <211> 36 <212> PRT <213> M. polymorpha
<400> 27 Leu Asp Asn Ile Ile Phe Arg Leu Gly Met Ala Pro Thr Ile Pro Gly Ala Arg Gln Leu Val Asn His Arg His Ile Leu Ile Asn Asn Asn Thr Val Asp Ile Pro
<210> 28 <211> 36 <212> PRT <213> Synechocystis
<400> 28 Leu Asp Asn Thr Val Phe Arg Leu Gly Met Ala Gly Thr Ile Pro Gly Ala Arg Gln Leu Val Cys His Gly His Ile Thr Val Asn Gly Gln Val Val Asp Ile Pro
<210> 29 <211> 36 <212> PRT <213> C. reinhardtii
<400> 29 Leu Asp Asn Ile Val Phe Arg Leu Asn Met Ala Pro Thr Ile Pro Ala Ala Arg Gln Leu Ile Ser His Gly His Ile Arg Val Asn Asn Lys Lys Val Asn Ile Pro
<210> 30 <211> 37 <212> PRT <213> Crypt. Phi
<400> 30 Leu Asp Asn Val Ile Phe Arg Leu Gly Met Ala Pro Thr Thr Ile Pro Ala Ala Arg Gln Leu Val Asn His Gly His Ile Lys Val Asn Asn Thr Arg Val Ser Ile Pro
<210> 31 <211> 35 <212> PRT <213> C. vulgaris
<400> 31 Leu Asp Thr His Phe Arg Leu Gly Phe Ala Pro Thr Ile Ala Ala Ala Arg Gln Leu Ile Asn His Gly His Ile Val Val Asn Gly Arg Arg Val Asp Ile Pro
<210> 32 <211> 36 <212> PRT <213> D. melanogaster
<400> 32 Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His Ala Arg Val Leu Ile Arg Gln Arg Thr Phe Val Leu Ala Ser Arg Trp Ser Thr Ile Pro
<210> 33 <211> 36 <212> PRT <213> T. brucei
<400> 33 Leu Gln Thr Val Val Phe Lys His Gly Leu Ala Lys Ser Val His His Ser Arg Val Leu Ile Gln Gln Arg His Ile Ala Val Ala Lys Gln Ile Val Thr Ile Pro
<210> 34 <211> 36 <212> PRT <213> S. cerevisiae
<400> 34 Leu Gln Thr Gln Val Tyr Lys Leu Gly Leu Ala Lys Ser Val His His Ala Arg Val Leu Ile Thr Gln Arg His Ile Ala Val Gly Lys Gln Ile Val Asn Ile Pro
<210> 35 <211> 36 <212> PRT <213> R. norvegicus
<400> 35 Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His Ala Arg Val Leu Ile Arg Gln Arg His Ile Arg Val Leu Lys Gln Val Val Asn Ile Pro
<210> 36 <211> 36 <212> PRT <213> H. sapiens
<400> 36 Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His Ala Arg Val Leu Ile Arg Gln Arg His Ile Arg Val Arg Lys Gln Val Val Asn Ile Pro
<210> 37 <211> 36 <212> PRT <213> S. pombe
<400> 37 Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His Ala Arg Val Leu Ile Phe Gln Arg His Ile Arg Val Gly Lys Gln Ile Val Asn Val Pro
<210> 38 <211> 36 <212> PRT <213> P. anserina
<400> 38 Leu Gln Thr Leu Val Tyr Lys Leu Gly Leu Ala Lys Ser Ile His His Ala Arg Val Leu Ile Arg Gln Arg His Ile Arg Val Gly Lys Gln Ile Val Asn Val Pro
<210> 39 <211> 36 <212> PRT <213> D. discoideum
<400> 39 Leu Gln Thr Leu Val Phe Lys Asn Gly Leu Ala Lys Ser Ile His His Ala Arg Val Leu Ile Lys Gly Arg His Ile Arg Val Gly Lys Gln Leu Val Asn Val Pro
<210> 40 <211> 36 <212> PRT <213> N. fowleri
<400> 40 Leu Gln Thr Val Val Gln Lys Leu Gly Leu Ser Lys Ser Ile His His Ala Arg Gln Leu Ile Phe Gln Arg His Ile Arg Val Gly Lys Gln Thr Val Asn Val Pro
<210> 41 <211> 36 <212> PRT <213> C. elegans
<400> 41 Leu Gln Thr Gln Val Phe Lys Leu Gly Leu Ala Lys Ser Ile His His Ala Arg Ile Leu Ile Lys Gln His His Ile Arg Val Arg Arg Gln Val Val Asp Val Pro
<210> 42 <211> 36 <212> PRT <213> H. marismortui
<400> 42 Leu Gln Thr Val Val Tyr Arg Lys Gly Tyr Ala Asn Thr Pro Glu Gln Ala Arg Gln Phe Ile Val His Gly His Ile Val Leu Asp Asp Ala Arg Val Thr Arg Pro
<210> 43 <211> 36 <212> PRT <213> S. acidocaldarius
<400> 43 Leu Gln Thr Ile Val Tyr Lys Lys Gly Leu Ala Arg Thr Ile Tyr Gln Ala Arg Gln Leu Ile Thr His Gly His Ile Ala Ile Ser Gly Arg Lys Val Thr Ser Pro
<210> 44 <211> 36 <212> PRT <213> S. solfataricus
<400> 44 Leu Gln Thr Ile Val Tyr Lys Lys Gly Leu Ser Asn Thr Ile Tyr Gln Ala Arg Gln Leu Ile Thr His Gly His Ile Ala Val Asn Gly Lys Arg Val Thr Ser Pro
<210> 45 <211> 36 <212> PRT <213> M. jannaschii
<400> 45 Leu Gln Thr Leu Val Phe Arg Lys Gly Leu Ala Arg Thr Pro Arg Gln Ala Arg Gln Leu Ile Val His Gly His Ile Ala Val Asn Gly Arg Val Val Thr Ala Pro
<210> 46 <211> 63 <212> PRT <213> Synechocystis
<400> 46 Leu Ala Tyr Leu Leu Ser Ala Ser Gly Leu Cys Pro Ser Ser Ser Glu Gly Arg Arg Gln Ile Lys Gly Gly Ala Val Arg Leu Asp Gly Asp Arg Leu Glu Asp Val Asn Gln Glu Tyr Ala Asp Pro Lys Met Leu Ile Asn Lys Val Leu Gln Met Gly Lys Lys Lys Phe Ile Arg Leu Ile Ser
<210> 47 <211> 66 <212> PRT <213> B. stearothermophilus
<400> 47 Leu Asp Asn Leu Val Tyr Arg Leu Gly Leu Ala Arg Thr Arg Arg Gln Ala Arg Gln Leu Val Thr Asn Gly His Ile Leu Val Asp Gly Ser Arg Val Asn Ile Pro Ser Tyr Arg Val Lys Pro Gly Gln Thr Ile Ala Val Arg Glu Lys Ser Arg Asn Leu Gln Val Ile Lys Glu Ala Leu Glu Ala Asn Asn
<210> 48 <211> 85 <212> PRT <213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: tRNA binding domain
<400> 48 Met Leu Xaa Xaa Xaa Leu Ile Val Met Cys Xaa Xaa Xaa Gly Met Asn Glu His Lys Met Leu Phe Tyr Ile Xaa Xaa Ser Thr Xaa Xaa Xaa Ser Ala Ile Gly Arg Met Ile Lys Xaa Xaa Met Ile Val Ala Xaa Xaa Gly Asn Arg His Lys Xaa Ile Val Leu Phe Xaa Leu Ile Val Asn Asp Ser Arg Gly Ala Leu Xaa Xaa Xaa Leu Ile Val Ser Gln Xaa Xaa Pro Ile Leu Val Thr Ala Cys
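The listing gives every residue in three-letter code, which is awkward to compare by eye or to feed into pattern-matching tools. The following is a minimal sketch, not part of the original filing, of converting one entry to one-letter code and checking its length against the declared `<211>` value. The helper name `to_one_letter` is hypothetical; the residue string is SEQ ID NO: 2 (M. polymorpha) as it appears in the listing.

```python
# Standard three-letter to one-letter amino acid codes.
# "Xaa" (unspecified residue, used in SEQ ID NO: 48) maps to "X".
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(three_letter_seq: str) -> str:
    """Convert a whitespace-separated three-letter residue string to one-letter code.

    Raises KeyError on an unknown token, which helps surface OCR damage.
    """
    return "".join(THREE_TO_ONE[residue] for residue in three_letter_seq.split())


# SEQ ID NO: 2 (M. polymorpha), declared length <211> 36.
seq2 = (
    "Leu Asp Val Ile Leu Val Arg Leu Asn Phe Cys Ser Thr Met Phe Gln "
    "Ala Arg Gln Leu Ile Ser His Lys Asn Ile Cys Val Asn Tyr Lys Lys "
    "Val Asn Ile Pro"
)

one_letter = to_one_letter(seq2)
print(one_letter)
print(len(one_letter) == 36)
```

Because an unrecognised token raises an error, the same helper can serve as a quick sanity check on transcribed entries before any further analysis.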
Claims (19)
1. A substantially pure peptide comprising the following sequence [ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC]
wherein X represents any amino acid.
2. A substantially pure peptide comprising the sequence motif Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14 where Y1 is methionine or leucine, preferably leucine, Y2 is leucine, isoleucine, valine, methionine, or cysteine, preferably isoleucine, leucine, or valine, Y3 is glycine, methionine, asparagine, glutamic acid, histidine, or lysine, preferably glycine, Y4 is methionine, leucine, phenylalanine, tyrosine, or isoleucine, preferably phenylalanine, leucine, methionine, or tyrosine, Y5 is serine or threonine, Y6 is serine, alanine, isoleucine, or glycine, preferably alanine, Y7 is arginine, methionine, isoleucine, or lysine, preferably arginine, Y8 is methionine, isoleucine, valine, or alanine, preferably valine or isoleucine, Y9 is glycine, asparagine, arginine, histidine, or lysine, preferably lysine, glycine, or arginine, Y10 is isoleucine, valine, leucine, or phenylalanine, preferably valine or isoleucine, Y11 is leucine, isoleucine, or valine, preferably valine or isoleucine, Y12 is asparagine, aspartic acid, serine, arginine, glycine, alanine, or leucine, preferably asparagine, aspartic acid, or glycine, Y13 is leucine, isoleucine, valine, serine, or glutamine, preferably glutamine or valine, Y14 is proline, isoleucine, leucine, valine, threonine, alanine, cysteine, or serine, preferably proline or valine, and X is any amino acid.
3. A peptide as claimed in claim 2 wherein the peptide is from 10 to 200 amino acids in length and comprises the core sequence Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14.
4. A peptide as claimed in claim 2 wherein the peptide is from 30 to 75 amino acids in length and comprises the core sequence Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14.
5. A peptide as claimed in claim 2 wherein the peptide is from 35 to 65 amino acids in length and comprises the core sequence Y1-X(3)-Y2-X(3)-Y3-Y4-X(2)-Y5-X(3)-Y6-Y7-X(2)-Y8-X(2)-Y9-X-Y10-X-Y11-Y12-X(3)-Y13-X(2)-Y14.
6. A peptide as claimed in claim 2 that is a sequence of SEQ.ID.NOs. 1 to 47.
7. A peptide as claimed in claim 2 that binds to a tRNA anticodon stem-loop.
8. A complex comprising a peptide as claimed in claim 2 with a tRNA anticodon stem-loop.
9. An antibody specific for a peptide as claimed in claim 2 or a complex as claimed in claim 8.
10. A method for determining whether a nucleic acid comprises a tRNA anticodon stem-loop comprising the steps of contacting a nucleic acid with a peptide as claimed in claim 2 and determining whether the peptide binds to the nucleic acid, wherein the binding of the peptide to the nucleic acid indicates that the nucleic acid comprises a tRNA anticodon stem-loop.
11. A method for identifying a substance which binds to a peptide as claimed in claim 2 comprising reacting the peptide with at least one substance which potentially can bind with the peptide, under conditions which permit the formation of conjugates between the substance and peptide, and detecting binding.
12. A method as claimed in claim 11 wherein binding is detected by assaying for conjugates, for free substance, or for non-complexed peptide.
13. A method of determining whether a test compound is an agonist or antagonist of a tRNA synthetase-tRNA anticodon stem-loop interaction or ribosomal S4 protein-tRNA anticodon stem-loop interaction which comprises the steps of (a) incubating the test compound with a nucleic acid comprising a tRNA anticodon stem-loop, and a peptide as claimed in claim 2; (b) determining the amount of nucleic acid bound to the peptide during the incubating step; and (c) comparing the amount of nucleic acid bound to the peptide during the incubating step to an amount of nucleic acid bound to peptide in the absence of the test compound, wherein an increase in the amount of nucleic acid bound to peptide in the presence of the test compound indicates that the test compound is an agonist of a tRNA synthetase-tRNA anticodon stem-loop interaction or a ribosomal S4 protein-tRNA anticodon stem-loop interaction, while a decrease indicates that the test compound is an antagonist of an interaction.
14. A method for obtaining a substantially pure nucleic acid comprising a tRNA anticodon stem-loop from a mixture of different nucleic acids comprising the steps of (a) providing a peptide as claimed in claim 2 bound to a solid support; (b) contacting the mixture of different nucleic acids with the peptide bound to the solid support whereby a nucleic acid comprising a tRNA anticodon stem-loop is bound to the peptide; and (c) washing the solid support to remove unbound nucleic acids and eluting substantially pure nucleic acids comprising a tRNA anticodon stem-loop from the solid support.
15. A method of interfering with the interaction of a peptide as claimed in claim 2 with a tRNA anticodon stem-loop comprising contacting the tRNA anticodon stem-loop with the peptide.
16. A method of modulating protein synthesis comprising changing the following sequence motif in a tRNA synthetase or ribosomal protein: [ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-[PILVTAC], wherein X is any amino acid.
17. A pharmaceutical composition for inhibiting the interaction of a tRNA with a tRNA synthetase or ribosomal protein comprising a peptide as claimed in claim 2 and a pharmaceutically acceptable carrier.
18. An antibacterial agent, anti-viral agent, immunotoxin, or plant toxin comprising a peptide as claimed in claim 2 or a complex as claimed in claim 3.
19. Use of a peptide as claimed in claim 2 or a complex as claimed in claim 3 in the preparation of an antibacterial agent, anti-viral agent, immunotoxin, or plant toxin.
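The motif recited in claims 1 and 16 is written in a PROSITE-like notation: bracketed sets list the residues allowed at one position, `X` is any single residue, and `X(n)` is a run of n arbitrary residues. As an illustrative sketch only (not part of the claims, and not the applicants' method), the notation can be translated into a regular expression and used to scan a one-letter protein sequence. The function names `motif_to_regex` and `find_motif` are hypothetical, and the test sequence is SEQ ID NO: 2 rendered in one-letter code by hand.

```python
import re

# Motif text as recited in claims 1 and 16.
MOTIF = (
    "[ML]-X(3)-[LIVMC]-X(3)-[GMNEHK]-[MLFYI]-X(2)-[ST]-X(3)-[SAIG]-[RMIK]-"
    "X(2)-[MIVA]-X(2)-[GNRHK]-X-[IVLF]-X-[LIV]-[NDSRGAL]-X(3)-[LIVSQ]-X(2)-"
    "[PILVTAC]"
)


def motif_to_regex(motif: str) -> str:
    """Translate the hyphen-separated motif notation into a regex string."""
    parts = []
    for element in motif.split("-"):
        gap = re.fullmatch(r"X(?:\((\d+)\))?", element)
        if gap:
            # "X" is one arbitrary residue; "X(n)" is n arbitrary residues.
            count = gap.group(1)
            parts.append("." if count is None else ".{" + count + "}")
        elif re.fullmatch(r"\[[A-Z]+\]", element):
            # A bracketed residue set is already a valid regex character class.
            parts.append(element)
        else:
            raise ValueError("unrecognised motif element: " + element)
    return "".join(parts)


def find_motif(sequence: str):
    """Return the first regex match of the motif in a one-letter sequence, or None."""
    return re.search(motif_to_regex(MOTIF), sequence)


# SEQ ID NO: 2 (M. polymorpha) in one-letter code.
seq2 = "LDVILVRLNFCSTMFQARQLISHKNICVNYKKVNIP"
match = find_motif(seq2)
print(match is not None)
```

The motif spans 36 positions in total, so a match against a 36-residue entry such as this one necessarily covers the whole sequence. In a longer protein, `match.start()` and `match.end()` would locate the candidate domain.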
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9767098P | 1998-08-24 | 1998-08-24 | |
US60/097,670 | 1998-08-24 | ||
PCT/CA1999/000779 WO2000011141A2 (en) | 1998-08-24 | 1999-08-24 | tRNA BINDING DOMAIN |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2341765A1 (en) | 2000-03-02 |
Family
ID=22264560
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA002341765A (Abandoned; published as CA2341765A1 (en)) | 1998-08-24 | 1999-08-24 | tRNA binding domain |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU5367199A (en) |
CA (1) | CA2341765A1 (en) |
WO (1) | WO2000011141A2 (en) |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU5552396A (en) * | 1995-04-21 | 1996-11-07 | Human Genome Sciences, Inc. | Nucleotide sequence of the haemophilus influenzae rd genome, fragments thereof, and uses thereof |
GB9601067D0 (en) * | 1996-01-19 | 1996-03-20 | Smithkline Beecham Plc | Novel compounds |
GB9608001D0 (en) * | 1996-04-18 | 1996-06-19 | Smithkline Beecham Plc | Novel compounds |
US6107071A (en) * | 1996-09-24 | 2000-08-22 | Smithkline Beecham Corporation | Histidinol dehydrogenase |
-
1999
- 1999-08-24 WO PCT/CA1999/000779 patent/WO2000011141A2/en active Application Filing
- 1999-08-24 AU AU53671/99A patent/AU5367199A/en not_active Abandoned
- 1999-08-24 CA CA002341765A patent/CA2341765A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
AU5367199A (en) | 2000-03-14 |
WO2000011141A2 (en) | 2000-03-02 |
WO2000011141A3 (en) | 2000-06-02 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Dead |