CN108707621B - 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 - Google Patents
一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 Download PDFInfo
- Publication number
- CN108707621B CN108707621B CN201810385845.5A CN201810385845A CN108707621B CN 108707621 B CN108707621 B CN 108707621B CN 201810385845 A CN201810385845 A CN 201810385845A CN 108707621 B CN108707621 B CN 108707621B
- Authority
- CN
- China
- Prior art keywords
- sequence
- lys
- target
- glu
- nos
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000006801 homologous recombination Effects 0.000 title claims abstract description 29
- 238000002744 homologous recombination Methods 0.000 title claims abstract description 29
- 238000000034 method Methods 0.000 title claims abstract description 17
- 230000008439 repair process Effects 0.000 title abstract description 28
- 108091032973 (ribonucleotides)n+m Proteins 0.000 title abstract description 21
- 230000001404 mediated effect Effects 0.000 title abstract description 13
- 238000010443 CRISPR/Cpf1 gene editing Methods 0.000 title abstract description 9
- 239000013598 vector Substances 0.000 claims abstract description 50
- 239000012634 fragment Substances 0.000 claims abstract description 46
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 29
- 241000209094 Oryza Species 0.000 claims abstract description 27
- 235000007164 Oryza sativa Nutrition 0.000 claims abstract description 25
- 235000009566 rice Nutrition 0.000 claims abstract description 25
- 241000196324 Embryophyta Species 0.000 claims description 40
- 101710163270 Nuclease Proteins 0.000 claims description 28
- 108091026890 Coding region Proteins 0.000 claims description 21
- 108020004414 DNA Proteins 0.000 claims description 18
- 241000251131 Sphyrna Species 0.000 claims description 18
- 239000002773 nucleotide Substances 0.000 claims description 18
- 125000003729 nucleotide group Chemical group 0.000 claims description 18
- 241000724709 Hepatitis delta virus Species 0.000 claims description 17
- 239000013612 plasmid Substances 0.000 claims description 17
- 208000037262 Hepatitis delta Diseases 0.000 claims description 12
- 238000011144 upstream manufacturing Methods 0.000 claims description 9
- 101800005109 Triakontatetraneuropeptide Proteins 0.000 claims description 8
- NMEHNETUFHBYEG-IHKSMFQHSA-N tttn Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 NMEHNETUFHBYEG-IHKSMFQHSA-N 0.000 claims description 8
- 230000035772 mutation Effects 0.000 claims description 6
- 230000000694 effects Effects 0.000 claims description 3
- 102000053602 DNA Human genes 0.000 claims description 2
- 102100035102 E3 ubiquitin-protein ligase MYCBP2 Human genes 0.000 claims 2
- 230000001737 promoting effect Effects 0.000 claims 1
- 101150001232 ALS gene Proteins 0.000 abstract description 13
- 230000034431 double-strand break repair via homologous recombination Effects 0.000 abstract description 5
- 238000009395 breeding Methods 0.000 abstract description 4
- 230000001488 breeding effect Effects 0.000 abstract description 4
- 238000000338 in vitro Methods 0.000 abstract description 4
- 238000011160 research Methods 0.000 abstract description 4
- 230000035876 healing Effects 0.000 abstract description 2
- 238000012408 PCR amplification Methods 0.000 description 39
- 108091028043 Nucleic acid sequence Proteins 0.000 description 14
- 206010020649 Hyperkeratosis Diseases 0.000 description 11
- 230000000295 complement effect Effects 0.000 description 10
- 238000012163 sequencing technique Methods 0.000 description 9
- 108010054155 lysyllysine Proteins 0.000 description 8
- 230000009261 transgenic effect Effects 0.000 description 8
- 238000013518 transcription Methods 0.000 description 7
- 230000035897 transcription Effects 0.000 description 7
- 210000004027 cell Anatomy 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 239000007787 solid Substances 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 5
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 230000033616 DNA repair Effects 0.000 description 4
- 241001575908 Doros Species 0.000 description 4
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 4
- 108010038633 aspartylglutamate Proteins 0.000 description 4
- 108010092854 aspartyllysine Proteins 0.000 description 4
- 239000000203 mixture Substances 0.000 description 4
- 108091008146 restriction endonucleases Proteins 0.000 description 4
- 108091033409 CRISPR Proteins 0.000 description 3
- FBPFZTCFMRRESA-FSIIMWSLSA-N D-Glucitol Natural products OC[C@H](O)[C@H](O)[C@@H](O)[C@H](O)CO FBPFZTCFMRRESA-FSIIMWSLSA-N 0.000 description 3
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 3
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 3
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 3
- 229930195725 Mannitol Natural products 0.000 description 3
- 102000018120 Recombinases Human genes 0.000 description 3
- 108010091086 Recombinases Proteins 0.000 description 3
- QARCDOCCDOLJSF-HJPIBITLSA-N Tyr-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QARCDOCCDOLJSF-HJPIBITLSA-N 0.000 description 3
- 108010077245 asparaginyl-proline Proteins 0.000 description 3
- 239000003153 chemical reaction reagent Substances 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 3
- 230000005782 double-strand break Effects 0.000 description 3
- 238000002474 experimental method Methods 0.000 description 3
- 239000010931 gold Substances 0.000 description 3
- 229910052737 gold Inorganic materials 0.000 description 3
- 239000001963 growth medium Substances 0.000 description 3
- 210000005260 human cell Anatomy 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 108010009298 lysylglutamic acid Proteins 0.000 description 3
- 239000000594 mannitol Substances 0.000 description 3
- 235000010355 mannitol Nutrition 0.000 description 3
- 239000006870 ms-medium Substances 0.000 description 3
- 102000004169 proteins and genes Human genes 0.000 description 3
- 239000000600 sorbitol Substances 0.000 description 3
- 238000011426 transformation method Methods 0.000 description 3
- 210000005253 yeast cell Anatomy 0.000 description 3
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 2
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 2
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 2
- 238000010453 CRISPR/Cas method Methods 0.000 description 2
- 102000053642 Catalytic RNA Human genes 0.000 description 2
- 108090000994 Catalytic RNA Proteins 0.000 description 2
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 2
- XMPAXPSENRSOSV-RYUDHWBXSA-N Glu-Gly-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XMPAXPSENRSOSV-RYUDHWBXSA-N 0.000 description 2
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 2
- AQNYKMCFCCZEEL-JYJNAYRXSA-N Glu-Lys-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AQNYKMCFCCZEEL-JYJNAYRXSA-N 0.000 description 2
- FAIXYKHYOGVFKA-UHFFFAOYSA-N Kinetin Natural products N=1C=NC=2N=CNC=2C=1N(C)C1=CC=CO1 FAIXYKHYOGVFKA-UHFFFAOYSA-N 0.000 description 2
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 2
- 241000209510 Liliopsida Species 0.000 description 2
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 2
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 2
- VUTWYNQUSJWBHO-BZSNNMDCSA-N Lys-Leu-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VUTWYNQUSJWBHO-BZSNNMDCSA-N 0.000 description 2
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 2
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- JWQWPTLEOFNCGX-AVGNSLFASA-N Phe-Glu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 JWQWPTLEOFNCGX-AVGNSLFASA-N 0.000 description 2
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 2
- 244000184734 Pyrus japonica Species 0.000 description 2
- 101100029566 Rattus norvegicus Rabggta gene Proteins 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010070783 alanyltyrosine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 239000012154 double-distilled water Substances 0.000 description 2
- 238000001976 enzyme digestion Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 238000010362 genome editing Methods 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 238000005286 illumination Methods 0.000 description 2
- QANMHLXAZMSUEX-UHFFFAOYSA-N kinetin Chemical compound N=1C=NC=2N=CNC=2C=1NCC1=CC=CO1 QANMHLXAZMSUEX-UHFFFAOYSA-N 0.000 description 2
- 229960001669 kinetin Drugs 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010038320 lysylphenylalanine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000006780 non-homologous end joining Effects 0.000 description 2
- 108091092562 ribozyme Proteins 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 238000010008 shearing Methods 0.000 description 2
- 230000001954 sterilising effect Effects 0.000 description 2
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- FATXTKJILXPNJL-UHFFFAOYSA-N 2-[[2-[2-[(2-amino-3-methylpentanoyl)amino]propanoylamino]acetyl]amino]-3-phenylpropanoic acid Chemical compound CCC(C)C(N)C(=O)NC(C)C(=O)NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 FATXTKJILXPNJL-UHFFFAOYSA-N 0.000 description 1
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 1
- CVGNCMIULZNYES-WHFBIAKZSA-N Ala-Asn-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CVGNCMIULZNYES-WHFBIAKZSA-N 0.000 description 1
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- ZODMADSIQZZBSQ-FXQIFTODSA-N Ala-Gln-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZODMADSIQZZBSQ-FXQIFTODSA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 1
- ZTKHZAXGTFXUDD-VEVYYDQMSA-N Arg-Asn-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZTKHZAXGTFXUDD-VEVYYDQMSA-N 0.000 description 1
- OTUQSEPIIVBYEM-IHRRRGAJSA-N Arg-Asn-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OTUQSEPIIVBYEM-IHRRRGAJSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- SNBHMYQRNCJSOJ-CIUDSAMLSA-N Arg-Gln-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SNBHMYQRNCJSOJ-CIUDSAMLSA-N 0.000 description 1
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 1
- KZXPVYVSHUJCEO-ULQDDVLXSA-N Arg-Phe-Lys Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=CC=C1 KZXPVYVSHUJCEO-ULQDDVLXSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- HOIFSHOLNKQCSA-FXQIFTODSA-N Asn-Arg-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O HOIFSHOLNKQCSA-FXQIFTODSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 1
- QNJIRRVTOXNGMH-GUBZILKMSA-N Asn-Gln-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(N)=O QNJIRRVTOXNGMH-GUBZILKMSA-N 0.000 description 1
- XVAPVJNJGLWGCS-ACZMJKKPSA-N Asn-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVAPVJNJGLWGCS-ACZMJKKPSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- MDDXKBHIMYYJLW-FXQIFTODSA-N Asn-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N MDDXKBHIMYYJLW-FXQIFTODSA-N 0.000 description 1
- LSJQOMAZIKQMTJ-SRVKXCTJSA-N Asn-Phe-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O LSJQOMAZIKQMTJ-SRVKXCTJSA-N 0.000 description 1
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 1
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 1
- SUIJFTJDTJKSRK-IHRRRGAJSA-N Asn-Pro-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUIJFTJDTJKSRK-IHRRRGAJSA-N 0.000 description 1
- VWADICJNCPFKJS-ZLUOBGJFSA-N Asn-Ser-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O VWADICJNCPFKJS-ZLUOBGJFSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- KSZHWTRZPOTIGY-AVGNSLFASA-N Asn-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O KSZHWTRZPOTIGY-AVGNSLFASA-N 0.000 description 1
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- JNCRAQVYJZGIOW-QSFUFRPTSA-N Asn-Val-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNCRAQVYJZGIOW-QSFUFRPTSA-N 0.000 description 1
- KBQOUDLMWYWXNP-YDHLFZDLSA-N Asn-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KBQOUDLMWYWXNP-YDHLFZDLSA-N 0.000 description 1
- ILJQISGMGXRZQQ-IHRRRGAJSA-N Asp-Arg-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ILJQISGMGXRZQQ-IHRRRGAJSA-N 0.000 description 1
- IAMNNSSEBXDJMN-CIUDSAMLSA-N Asp-Cys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N IAMNNSSEBXDJMN-CIUDSAMLSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- NHSDEZURHWEZPN-SXTJYALSSA-N Asp-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CC(=O)O)N NHSDEZURHWEZPN-SXTJYALSSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- HKEZZWQWXWGASX-KKUMJFAQSA-N Asp-Leu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 HKEZZWQWXWGASX-KKUMJFAQSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- YWLDTBBUHZJQHW-KKUMJFAQSA-N Asp-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N YWLDTBBUHZJQHW-KKUMJFAQSA-N 0.000 description 1
- DPNWSMBUYCLEDG-CIUDSAMLSA-N Asp-Lys-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O DPNWSMBUYCLEDG-CIUDSAMLSA-N 0.000 description 1
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 1
- SJLDOGLMVPHPLZ-IHRRRGAJSA-N Asp-Met-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SJLDOGLMVPHPLZ-IHRRRGAJSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 1
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 1
- MRYDJCIIVRXVGG-QEJZJMRPSA-N Asp-Trp-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O MRYDJCIIVRXVGG-QEJZJMRPSA-N 0.000 description 1
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- BJDHEININLSZOT-KKUMJFAQSA-N Asp-Tyr-Lys Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(O)=O BJDHEININLSZOT-KKUMJFAQSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- 238000010354 CRISPR gene editing Methods 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- 238000012270 DNA recombination Methods 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010042407 Endonucleases Proteins 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- KCJJFESQRXGTGC-BQBZGAKWSA-N Gln-Glu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O KCJJFESQRXGTGC-BQBZGAKWSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- YXQCLIVLWCKCRS-RYUDHWBXSA-N Gln-Gly-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N)O YXQCLIVLWCKCRS-RYUDHWBXSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- JUUNNOLZGVYCJT-JYJNAYRXSA-N Gln-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JUUNNOLZGVYCJT-JYJNAYRXSA-N 0.000 description 1
- DBNLXHGDGBUCDV-KKUMJFAQSA-N Gln-Phe-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O DBNLXHGDGBUCDV-KKUMJFAQSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 1
- CSMHMEATMDCQNY-DZKIICNBSA-N Gln-Val-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CSMHMEATMDCQNY-DZKIICNBSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- VAIWPXWHWAPYDF-FXQIFTODSA-N Glu-Asp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O VAIWPXWHWAPYDF-FXQIFTODSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 1
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 1
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- ZCOJVESMNGBGLF-GRLWGSQLSA-N Glu-Ile-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZCOJVESMNGBGLF-GRLWGSQLSA-N 0.000 description 1
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 1
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 1
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- PMSMKNYRZCKVMC-DRZSPHRISA-N Glu-Phe-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)O)N PMSMKNYRZCKVMC-DRZSPHRISA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- RGJKYNUINKGPJN-RWRJDSDZSA-N Glu-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(=O)O)N RGJKYNUINKGPJN-RWRJDSDZSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- HVKAAUOFFTUSAA-XDTLVQLUSA-N Glu-Tyr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O HVKAAUOFFTUSAA-XDTLVQLUSA-N 0.000 description 1
- HHSKZJZWQFPSKN-AVGNSLFASA-N Glu-Tyr-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O HHSKZJZWQFPSKN-AVGNSLFASA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- VXEFAWJTFAUDJK-AVGNSLFASA-N Glu-Tyr-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O VXEFAWJTFAUDJK-AVGNSLFASA-N 0.000 description 1
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- DWUKOTKSTDWGAE-BQBZGAKWSA-N Gly-Asn-Arg Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DWUKOTKSTDWGAE-BQBZGAKWSA-N 0.000 description 1
- JNGJGFMFXREJNF-KBPBESRZSA-N Gly-Glu-Trp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JNGJGFMFXREJNF-KBPBESRZSA-N 0.000 description 1
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 1
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- NWOSHVVPKDQKKT-RYUDHWBXSA-N Gly-Tyr-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O NWOSHVVPKDQKKT-RYUDHWBXSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- CHZRWFUGWRTUOD-IUCAKERBSA-N His-Gly-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N CHZRWFUGWRTUOD-IUCAKERBSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- QSPLUJGYOPZINY-ZPFDUUQYSA-N Ile-Asp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QSPLUJGYOPZINY-ZPFDUUQYSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- BEWFWZRGBDVXRP-PEFMBERDSA-N Ile-Glu-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O BEWFWZRGBDVXRP-PEFMBERDSA-N 0.000 description 1
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 1
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 1
- UOPBQSJRBONRON-STECZYCISA-N Ile-Met-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOPBQSJRBONRON-STECZYCISA-N 0.000 description 1
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 1
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- POJPZSMTTMLSTG-SRVKXCTJSA-N Leu-Asn-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N POJPZSMTTMLSTG-SRVKXCTJSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 1
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 1
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- DDVHDMSBLRAKNV-IHRRRGAJSA-N Leu-Met-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O DDVHDMSBLRAKNV-IHRRRGAJSA-N 0.000 description 1
- HDHQQEDVWQGBEE-DCAQKATOSA-N Leu-Met-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O HDHQQEDVWQGBEE-DCAQKATOSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- FUKDBQGFSJUXGX-RWMBFGLXSA-N Lys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)C(=O)O FUKDBQGFSJUXGX-RWMBFGLXSA-N 0.000 description 1
- DEFGUIIUYAUEDU-ZPFDUUQYSA-N Lys-Asn-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DEFGUIIUYAUEDU-ZPFDUUQYSA-N 0.000 description 1
- YVSHZSUKQHNDHD-KKUMJFAQSA-N Lys-Asn-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N YVSHZSUKQHNDHD-KKUMJFAQSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- SSYOBDBNBQBSQE-SRVKXCTJSA-N Lys-Cys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O SSYOBDBNBQBSQE-SRVKXCTJSA-N 0.000 description 1
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 1
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 1
- KKFVKBWCXXLKIK-AVGNSLFASA-N Lys-His-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCCN)N KKFVKBWCXXLKIK-AVGNSLFASA-N 0.000 description 1
- SLQJJFAVWSZLBL-BJDJZHNGSA-N Lys-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN SLQJJFAVWSZLBL-BJDJZHNGSA-N 0.000 description 1
- KYNNSEJZFVCDIV-ZPFDUUQYSA-N Lys-Ile-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O KYNNSEJZFVCDIV-ZPFDUUQYSA-N 0.000 description 1
- IVFUVMSKSFSFBT-NHCYSSNCSA-N Lys-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN IVFUVMSKSFSFBT-NHCYSSNCSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- ZJSZPXISKMDJKQ-JYJNAYRXSA-N Lys-Phe-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=CC=C1 ZJSZPXISKMDJKQ-JYJNAYRXSA-N 0.000 description 1
- GHKXHCMRAUYLBS-CIUDSAMLSA-N Lys-Ser-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O GHKXHCMRAUYLBS-CIUDSAMLSA-N 0.000 description 1
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- TVHCDSBMFQYPNA-RHYQMDGZSA-N Lys-Thr-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TVHCDSBMFQYPNA-RHYQMDGZSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- SUZVLFWOCKHWET-CQDKDKBSSA-N Lys-Tyr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O SUZVLFWOCKHWET-CQDKDKBSSA-N 0.000 description 1
- RQILLQOQXLZTCK-KBPBESRZSA-N Lys-Tyr-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O RQILLQOQXLZTCK-KBPBESRZSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 1
- JQEBITVYKUCBMC-SRVKXCTJSA-N Met-Arg-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JQEBITVYKUCBMC-SRVKXCTJSA-N 0.000 description 1
- BLIPQDLSCFGUFA-GUBZILKMSA-N Met-Arg-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O BLIPQDLSCFGUFA-GUBZILKMSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- KBTQZYASLSUFJR-KKUMJFAQSA-N Met-Phe-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KBTQZYASLSUFJR-KKUMJFAQSA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 1
- OVTOTTGZBWXLFU-QXEWZRGKSA-N Met-Val-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(O)=O OVTOTTGZBWXLFU-QXEWZRGKSA-N 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 1
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- LDSOBEJVGGVWGD-DLOVCJGASA-N Phe-Asp-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 LDSOBEJVGGVWGD-DLOVCJGASA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 1
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 1
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 1
- WKTSCAXSYITIJJ-PCBIJLKTSA-N Phe-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O WKTSCAXSYITIJJ-PCBIJLKTSA-N 0.000 description 1
- KZRQONDKKJCAOL-DKIMLUQUSA-N Phe-Leu-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZRQONDKKJCAOL-DKIMLUQUSA-N 0.000 description 1
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 1
- ZUQACJLOHYRVPJ-DKIMLUQUSA-N Phe-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 ZUQACJLOHYRVPJ-DKIMLUQUSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- SZYBZVANEAOIPE-UBHSHLNASA-N Phe-Met-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SZYBZVANEAOIPE-UBHSHLNASA-N 0.000 description 1
- KAJLHCWRWDSROH-BZSNNMDCSA-N Phe-Phe-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=CC=C1 KAJLHCWRWDSROH-BZSNNMDCSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- JHSRGEODDALISP-XVSYOHENSA-N Phe-Thr-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O JHSRGEODDALISP-XVSYOHENSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- RGMLUHANLDVMPB-ULQDDVLXSA-N Phe-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N RGMLUHANLDVMPB-ULQDDVLXSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- DBALDZKOTNSBFM-FXQIFTODSA-N Pro-Ala-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DBALDZKOTNSBFM-FXQIFTODSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- IQAGKQWXVHTPOT-FHWLQOOXSA-N Pro-Lys-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O IQAGKQWXVHTPOT-FHWLQOOXSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- 230000006819 RNA synthesis Effects 0.000 description 1
- 101100355955 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) RCR2 gene Proteins 0.000 description 1
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- WXUBSIDKNMFAGS-IHRRRGAJSA-N Ser-Arg-Tyr Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXUBSIDKNMFAGS-IHRRRGAJSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- SFTZTYBXIXLRGQ-JBDRJPRFSA-N Ser-Ile-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O SFTZTYBXIXLRGQ-JBDRJPRFSA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- ADPHPKGWVDHWML-PPCPHDFISA-N Thr-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N ADPHPKGWVDHWML-PPCPHDFISA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 1
- CJXURNZYNHCYFD-WDCWCFNPSA-N Thr-Lys-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O CJXURNZYNHCYFD-WDCWCFNPSA-N 0.000 description 1
- FDQXPJCLVPFKJW-KJEVXHAQSA-N Thr-Met-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O FDQXPJCLVPFKJW-KJEVXHAQSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 1
- 241000209140 Triticum Species 0.000 description 1
- 235000021307 Triticum Nutrition 0.000 description 1
- RNFZZCMCRDFNAE-WFBYXXMGSA-N Trp-Asn-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O RNFZZCMCRDFNAE-WFBYXXMGSA-N 0.000 description 1
- UKWSFUSPGPBJGU-VFAJRCTISA-N Trp-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O UKWSFUSPGPBJGU-VFAJRCTISA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- OFTGYORHQMSPAI-PJODQICGSA-N Trp-Met-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O OFTGYORHQMSPAI-PJODQICGSA-N 0.000 description 1
- VMXLNDRJXVAJFT-JYBASQMISA-N Trp-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O VMXLNDRJXVAJFT-JYBASQMISA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 1
- MTEQZJFSEMXXRK-CFMVVWHZSA-N Tyr-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N MTEQZJFSEMXXRK-CFMVVWHZSA-N 0.000 description 1
- AYPAIRCDLARHLM-KKUMJFAQSA-N Tyr-Asn-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O AYPAIRCDLARHLM-KKUMJFAQSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 1
- UABYBEBXFFNCIR-YDHLFZDLSA-N Tyr-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UABYBEBXFFNCIR-YDHLFZDLSA-N 0.000 description 1
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 1
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 1
- AZGZDDNKFFUDEH-QWRGUYRKSA-N Tyr-Gly-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AZGZDDNKFFUDEH-QWRGUYRKSA-N 0.000 description 1
- NSGZILIDHCIZAM-KKUMJFAQSA-N Tyr-Leu-Ser Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N NSGZILIDHCIZAM-KKUMJFAQSA-N 0.000 description 1
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 1
- JLKVWTICWVWGSK-JYJNAYRXSA-N Tyr-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JLKVWTICWVWGSK-JYJNAYRXSA-N 0.000 description 1
- FMXFHNSFABRVFZ-BZSNNMDCSA-N Tyr-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FMXFHNSFABRVFZ-BZSNNMDCSA-N 0.000 description 1
- PMHLLBKTDHQMCY-ULQDDVLXSA-N Tyr-Lys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMHLLBKTDHQMCY-ULQDDVLXSA-N 0.000 description 1
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- ISERLACIZUGCDX-ZKWXMUAHSA-N Val-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N ISERLACIZUGCDX-ZKWXMUAHSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- ZXAGTABZUOMUDO-GVXVVHGQSA-N Val-Glu-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZXAGTABZUOMUDO-GVXVVHGQSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- MLADEWAIYAPAAU-IHRRRGAJSA-N Val-Lys-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N MLADEWAIYAPAAU-IHRRRGAJSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 241000700605 Viruses Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 108010044940 alanylglutamine Proteins 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- FUHMZYWBSHTEDZ-UHFFFAOYSA-M bispyribac-sodium Chemical compound [Na+].COC1=CC(OC)=NC(OC=2C(=C(OC=3N=C(OC)C=C(OC)N=3)C=CC=2)C([O-])=O)=N1 FUHMZYWBSHTEDZ-UHFFFAOYSA-M 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 235000005822 corn Nutrition 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 108010054812 diprotin A Proteins 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 101150110946 gatC gene Proteins 0.000 description 1
- 238000003209 gene knockout Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 1
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 1
- 108010073628 glutamyl-valyl-phenylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010077515 glycylproline Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010057952 lysyl-phenylalanyl-lysine Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 108010005942 methionylglycine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000012113 quantitative test Methods 0.000 description 1
- 230000013120 recombinational repair Effects 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 239000002689 soil Substances 0.000 description 1
- 108010005652 splenotritin Proteins 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 108010061238 threonyl-glycine Proteins 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010073969 valyllysine Proteins 0.000 description 1
- 230000029663 wound healing Effects 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8201—Methods for introducing genetic material into plant cells, e.g. DNA, RNA, stable or transient incorporation, tissue culture methods adapted for transformation
- C12N15/8213—Targeted insertion of genes into the plant genome by homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8274—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for herbicide resistance
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8261—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield
- C12N15/8271—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance
- C12N15/8274—Phenotypically and genetically modified plants via recombinant DNA technology with agronomic (input) traits, e.g. crop yield for stress resistance, e.g. heavy metal resistance for herbicide resistance
- C12N15/8278—Sulfonylurea
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
- C12N15/902—Stable introduction of foreign DNA into chromosome using homologous recombination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Organic Chemistry (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Microbiology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Cell Biology (AREA)
- Crystallography & Structural Chemistry (AREA)
- Mycology (AREA)
- Medicinal Chemistry (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
本发明公开了一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法。本发明以水稻ALS基因为研究对象,构建了同源重组载体。将RCR1‑RCR2‑RDR片段进行体外转录,通过RNP方法,以RNA转录本作为修复模板,在水稻愈伤中实现了目的基因的同源重组修复。同时,利用基因枪方法将载体导入水稻愈伤中,获得了ALS基因定点修饰的水稻植株。结果表明,以RNA作为修复模板可成功介导目的基因的同源重组,为农作物育种提供了新思路,因此在农业育种方面具有强大的应用潜力。
Description
技术领域
本发明涉及一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法。
背景技术
CRISPR/Cpf1极大拓展了基因编辑范围,已开始应用于农作物遗传改良研究中。利用CRISPR/Cas9介导的基因组编辑技术进行基因敲除,已经在水稻等农作物中得到应用。但是,由于植物中同源重组频率低,利用CRISPR/Cas9介导的同源重组,在农作物中实现基因定点替换或定点整合却少有报道。目前,利用CRISPR/Cpf1系统介导的目的基因片段替换尚未有报道。
有假设提出RNA转录本可作为修复模板参与到DNA双链断裂(DSBs)导致的DNA同源重组修复(HDR)中去,而在酵母和人类细胞中,此假设已被证实。2014年,在一项酵母的研究中,RNA为修复模板介导基因组DNA的同源重组修复的有效性进一步被证实。然而,在酵母和人类细胞中,这一技术并未被广泛应用,主要由于在酵母和人类细胞中DNA修复模板可通过电转化、显微注射或转染等转化方法高效进入细胞,从而介导DNA的重组修复。但是在植物细胞中,由于细胞壁的存在,这些转化方法均不适用,尤其对于一些作物品种如:玉米、小麦、水稻等单子叶植物而言。因此在农作物中通过CRISPR/Cas系统实现目的基因的同源重组修复有很大难度,主要因为:1)在植物细胞中,DSBs主要通过非同源末端连接(non-homologous end joining,NHEJ)的方式进行修复,同源重组介导的修复(homology-directed repair,HDR)发生几率极其小;2)将修复模板转入植物细胞中的量十分有限,目前有两种方法可以提高修复模板的量,但效果仍不理想,一种方式为通过基因枪转化法将修复模板片段导入细胞内;另外一种方法是将修复模板连入病毒来源的replicon载体中,将载体转化细胞,从而增加修复模板的量。
发明内容
本发明的目的是提供一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法。
本发明提供了一种用于取代植物基因组中的目标片段的表达盒甲,包括启动子甲和终止子,其特征在于:在启动子甲和终止子之间包括如下三个区段:区段Ⅰ、区段Ⅱ和区段Ⅲ;区段Ⅲ为区段Ⅲ-1或区段Ⅲ-2;
区段Ⅰ中具有两个核酸酶的编码序列和一个位于它们之间的crRNA1的编码序列;
区段Ⅱ中具有两个核酸酶的编码序列和一个位于它们之间的crRNA2的编码序列;
区段Ⅲ-1中具有两个核酸酶的编码序列和位于它们之间的模板区段;
区段Ⅲ-2中具有两个靶标序列和位于它们之间的模板区段;
所述模板区段包括上游同源臂、供体片段序列和下游同源臂;
所述目标片段的一个末端为区段Ⅰ中crRNA1的靶标序列,另一个末端为区段Ⅱ中crRNA2的靶标序列;
供体片段与目标片段具有如下差异:①预期在目标片段中引入的差异核苷酸;②将crRNA1的靶标中的TTTN突变为非TTTN;③将crRNA2的靶标中的TTTN突变为非TTTN。
区段Ⅰ自5’至3’端依次具有Hammerhead型核酸酶的编码序列、crRNA1的编码序列和丁型肝炎病毒核酸酶的编码序列。
区段Ⅱ自5’至3’端依次具有Hammerhead型核酸酶的编码序列、crRNA2的编码序列和丁型肝炎病毒核酸酶的编码序列。
区段Ⅲ-1中自5’至3’端依次具有Hammerhead型核酸酶的编码序列、上游同源臂、供体片段序列、下游同源臂和丁型肝炎病毒核酸酶的编码序列。
区段Ⅲ-2中自5’至3’端依次具有crRNA1的靶标序列、上游同源臂、供体片段序列、下游同源臂和crRNA2的靶标序列。
所述目标片段中,crRNA1的靶标和crRNA2的靶标之间具有限制性内切酶的识别序列;所述供体片段与目标片段的区别还包括如下④:将所述限制性内切酶的识别序列突变为非识别序列。
所述Hammerhead型核酸酶的编码序列如序列表中序列1自5’端第394至436位所示或序列表的序列1自5’端第724至766位所示。
所述丁型肝炎病毒核酸酶的编码序列如序列表中序列1自5’端第481至548位所示。
所述crRNA1的编码序列如序列表的序列1自5’端第437至480位所示。
所述crRNA2的编码序列如序列表的序列1自5’端第602至645位所示。
所述上游同源臂如序列表的序列1自5’端第767至863位所示。
所述下游同源臂如序列表的序列1自5’端第1245至1365位所示。
所述供体片段序列如序列表的序列1自5’端第864至1244位所示。
所述区段Ⅰ如序列表的序列1自5’端第394至548位所示。
所述区段Ⅱ如序列表的序列1自5’端第559至713位所示。
crRNA1的靶标序列如序列表的序列2自5’端第709至735位所示。
crRNA2的靶标序列如序列表的序列2自5’端第1335至1361位所示。
所述区段Ⅲ-1如序列表的序列1自5’端第724-1433位所示。
所述区段Ⅲ-2如序列表的序列2自5’端第709-1361位所示。
所述启动子甲为OsU3启动子。所述OsU3启动子如序列表的序列1自5’端第13至393位所示。
所述终止子为Nos终止子。所述Nos终止子的序列如序列表的序列1自5’端第1434至1686位所示。
所述表达盒甲如序列表的序列1自5’端第13-1686位所示。
所述表达盒甲如序列表的序列2所示。
所述目标片段具体可为植物基因组中ALS基因中序列表的序列6所示的片段。
本发明还保护含有以上任一所述表达盒甲的重组载体。
所述重组载体还包括表达盒乙;所述表达盒乙中由启动子乙启动LbCpf1核酸酶的编码基因表达。
所述启动子乙为Ubi启动子。所述Ubi启动子的反向互补序列如序列表的序列1自5’端第5912至7897位所示。
所述LbCpf1核酸酶的编码基因的反向互补序列如序列表的序列1自5’端第2061至5909位所示。
所述表达盒乙还包括终止子。所述所述终止子为Nos终止子。所述Nos终止子的反向互补序列如序列表的序列1自5’端第1789至2041位所示。
所述表达盒乙的反向互补序列如序列表的1自5’端1789至7897位所示。
所述重组载体为序列表的序列1所示的环形质粒。
所述重组载体为采用序列2所示的双链DNA分子替代序列1自5’端第13-1686位得到的环形质粒。
本发明还保护以上任一所述表达盒甲,或,以上任一所述的重组载体在实现植物中以RNA转录本为模板进行靶基因同源重组中的应用。
本发明一种植物中以RNA转录本为模板进行靶基因同源重组的方法,包括如下步骤:将以上任一所述的重组载体导入出发植物,实现植物中靶基因同源重组。
以上任一所述靶基因为ALS基因。
以上任一所述植物可为1)或2)或3)或4)或5):1)单子叶植物;2)双子叶植物;3)禾本科植物;4)水稻;5)水稻品种中花11(Japonica cv.)。
本发明以水稻ALS基因为研究对象,构建了同源重组载体:pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos和pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos。将RCR1-RCR2-RDR片段进行体外转录,通过RNP方法,以RNA转录本作为修复模板,在水稻愈伤中实现了目的基因的同源重组修复。同时,利用基因枪方法将载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos、pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos和pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armeddonor(with targets)分别导入水稻愈伤中,获得了ALS基因定点修饰的水稻植株,其中pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)为DNA修复模板的对照载体。研究结果表明,以RNA作为修复模板可成功介导目的基因的同源重组,为农作物育种提供了新思路,因此在农业育种方面具有强大的应用潜力。
附图说明
图1为三个载体框架图。
图2为水稻愈伤组织中目的基因的测序鉴定结果。
图3为转基因植株中目的基因的测序鉴定结果。
具体实施方式
以下的实施例便于更好地理解本发明,但并不限定本发明。下述实施例中的实验方法,如无特殊说明,均为常规方法。下述实施例中所用的试验材料,如无特殊说明,均为自常规生化试剂商店购买得到的。以下实施例中的定量试验,均设置三次重复实验,结果取平均值。
下述实施例中的用于水稻转化的水稻材料为中花11(Japonica cv.),由中国农业科学院作物科学研究所提供。
质粒pCXUN-Cas9记载于如下文献中:He et al.,2017和Sun et al.,2016;公众可以从中国农业科学院作物科学研究所获得。
质粒pRS316-RCR-GFP记载于如下文献中:Zhang et al.,2017;公众可以从中国农业科学院作物科学研究所获得。
LbCpf1-OsU6载体记载于如下文献中:Wang et al.,2017;公众可以从中国农业科学院作物科学研究所获得。
pCXUN-Cas9-OsU3记载于如下文献中:Sun et al.,2016;公众可以从中国农业科学院作物科学研究所获得。
下述实施例中所用的内切酶、试剂盒和PCR酶均购自试剂公司。其它试剂均为国产分析纯。
下述实施例中的引物、DNA合成及测序均由华大公司完成。
下述实施例中所用的引物如表1。
表1引物序列
实施例1、利用CRISPR/Cpf1系统实现以RNA转录本作为修复模板介导的ALS基因的精确修饰
一、表达载体的构建
1、质粒pCXUN-LbCpf1-Nos的构建
(1)用限制性内切酶BamHI和HindIII双酶切质粒pCXUN-Cas9,得到约9282bp的载体骨架1。
(2)用限制性内切酶BamHI和HindIII双酶切LbCpf1-OsU6载体,得到约5846bp的Ubi-LbCpf1表达盒。
(3)将载体骨架1和Ubi-LbCpf1表达盒用T4连接酶连接,得到质粒pCXUN-LbCpf1-Nos。
2、OsU3-RCR1-RCR2表达盒的构建
(1)以质粒pRS316-RCR-GFP为模板,采用引物RCR1F2和引物RCR-common-R组成的引物对进行第一轮PCR扩增,得到第一轮PCR扩增产物。
(2)以步骤(1)得到的第一轮PCR扩增产物为模板,采用引物RCRF1和引物RCR-common-R组成的引物对进行第二轮PCR扩增,得到第二轮PCR扩增产物(RCR1)。
(3)以质粒pRS316-RCR-GFP为模板,采用引物RCR2-F2和引物RCR-common-R组成的引物对进行第一轮PCR扩增,得到第一轮PCR扩增产物。
(4)以步骤(3)得到的第一轮PCR扩增产物为模板,采用引物RCR-F1和引物RCR-common-R组成的引物对进行第二轮PCR扩增,得到第二轮PCR扩增产物(RCR2)。
(5)以pCXUN-Cas9-OsU3为模板,采用引物OsU3F和引物OsU3-RCR1R组成的引物对进行PCR扩增,得到第一轮PCR扩增产物(OsU3启动子序列)。
(6)以步骤(2)得到的第二轮PCR扩增产物(RCR1)为模板,采用引物RCR-Common-F和引物RCR1-10random-R组成的引物对进行第二轮PCR扩增,得到第二轮PCR扩增产物。
(7)将步骤(5)得到的第一轮PCR扩增产物(OsU3启动子序列)和步骤(6)得到的第二轮PCR扩增产物按照摩尔比1:1混合后作为模板,采用引物OsU3F和引物RCR1-10 random-R组成的引物对进行第三轮PCR扩增,得到第三轮PCR产物(OsU3-RCR1表达盒)。
(8)以步骤(4)得到的第二轮PCR扩增产物(RCR2)为模板,采用引物RCR2-10random-F和引物SacI-RCR2-R组成的引物对进行第四轮PCR扩增,得到第四轮PCR扩增产物。
(9)将步骤(7)得到的第三轮PCR产物(OsU3-RCR1表达盒)和步骤(8)得到的第四轮PCR扩增产物按照摩尔比1:1混合后作为模板,采用引物SacI-OsU3-F和引物SacI-RCR2-R进行第五轮PCR扩增,得到第五轮PCR扩增产物(OsU3-RCR1-RCR2表达盒)。
3、RDR片段的合成
(1)将引物HHF和引物HHR退火形成HH片段(第一轮产物)。
(2)以化学合成定点修饰的ALS基因片段(序列表的序列4)为模板,采用引物donor-HH-F和引物donor-HH-F组成的引物对进行PCR扩增,得到第二轮产物。
(3)以质粒pRS316-RGR-GFP为模板,采用引物HDVF和引物HDVR组成的引物对进行PCR扩增,得到第三轮产物。
(4)以质粒pCXUN-Cas9为模板,采用引物Nos-HDVF和引物KPN-NosR组成的引物对进行PCR扩增,得到第四轮产物
(5)将第一轮产物、第二轮产物、第三轮产物和第四轮产物按照摩尔比1:1:1:1进行混合后,采用引物Kpn-HHF和引物Kpn-NosR组成的引物对进行PCR扩增,得到RDR片段。
4、armed donor(with targets)-Nos片段的合成
(1)以化学合成定点修饰的ALS基因片段(序列表的序列4)为模板,采用引物Kpn-donorF和引物donor-R组成的引物对进行PCR扩增,得到第一轮产物。
(2)以pCXUN-Ubi-LbCpf1-Nos质粒为模板,采用引物Nos-donorF和引物Kpn-NosR组成的引物对进行PCR扩增,得到第二轮产物。
(3)将第一轮产物和第二轮产物按照摩尔比1:1混合后作为模板采用引物Kpn-donorF和引物Kpn-NosR组成的引物对进行PCR扩增,得到armed donor(with targets)-Nos片段。
5、载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos的合成
将步骤2制备的OsU3-RCR1-RCR2表达盒和步骤1制备的质粒pCXUN-LbCpf1-Nos利用同源重组酶(全式金,北京,中国)进行连接获得重组载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos,将步骤3得到的RDR片段插入重组载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos的KpnI位点中,得到载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos。
载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos经测序如序列表的序列1所示。序列表中序列1自5’末端起,第13至713位为OsU3-RCR1-RCR2表达盒的核苷酸序列,其中,第13至393位为OsU3启动子的核苷酸序列,第394至436位和第559至601位均为Hammerhead(HH)型核酸酶的核苷酸序列,第481至548位和第646至713位均为丁型肝炎病毒(HDV)核酸酶的核苷酸序列,第437至480位为crRNA1的核苷酸序列,第602至645位为crRNA2的核苷酸序列。序列表中序列1自5’末端起,第724至1433位为RDR片段,其中,第724至766位为Hammerhead(HH)型核酸酶的核苷酸序列,第1366至1433位为丁型肝炎病毒(HDV)核酸酶的核苷酸序列,第767至1365位为DRT序列。序列表中序列1自5’末端起,第1434至1686位为Nos终止子的核苷酸序列,第1789至2041位为Nos终止子的核苷酸序列的反向互补序列;第2061至5909位为编码LbCpf1的核苷酸序列的反向互补序列,第5912至7897位为Ubi启动子的核苷酸序列的反向互补序列。
RDR片段中,第767至863位为上游同源臂,第864至1244位为突变区段,第1245至1365位为下游同源臂。
6、载体pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos的合成
将步骤2制备的OsU3-RCR1-RCR2表达盒和步骤1制备的质粒pCXUN-LbCpf1-Nos利用同源重组酶(全式金,北京,中国)进行连接获得重组载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos,将步骤4得到的armed donor(with targets)-Nos片段插入重组载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos的KpnI位点中,得到载体pCXUN-OsU3-RCR1-RCR2-armeddonor(with targets)-Nos-Ubi-LbCpf1-Nos。
经测序,载体pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos与载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos的区别在于:采用序列表的序列2所示的片段替代了序列表的序列1自5’端第13-1686位。
序列2所示的片段,自5’端第1至701位为OsU3-RCR1-RCR2表达盒的核苷酸序列,其中,第1至381位为OsU3启动子的核苷酸序列,第382至424位和第547至589位均为Hammerhead(HH)型核酸酶的核苷酸序列,第469至536位和第634至701位为丁型肝炎病毒(HDV)核酸酶的核苷酸序列,第425至468位为crRNA1的核苷酸序列,第590至453位为crRNA2的核苷酸序列。序列表中序列2自5’末端起,第709至1361位为armeddonor(with targets)片段,其中,第709至735位为靶点1的核苷酸序列,第1335至1361位为靶点2的核苷酸序列,第736至1334位为DRT序列。序列表中序列2自5’末端起,第1362至1614位为Nos终止子的核苷酸序列的核苷酸序列。
DRT序列中,第736至832为上游同源臂,第833至1213位为突变区段,第1214至1334位为下游同源臂。
7、载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)的合成
以化学合成定点修饰的ALS基因片段(序列表的序列4)为模板,采用引物Pme-donorF和引物Pme-donorR组成的引物对进行PCR扩增,得到PCR扩增产物(armed-DRT)。
将步骤2制备的OsU3-RCR1-RCR2表达盒和步骤1制备的质粒pCXUN-LbCpf1利用同源重组酶(全式金,北京,中国)进行连接获得重组载体pCXUN-LbCpf1-OsU3-RCR1-RCR2,将armed-DRT插入重组载体pCXUN-LbCpf1-OsU3-RCR1-RCR2的PmeI位点中,得到载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)。
载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)经测序如序列表的序列3所示。序列表中序列3自5’末端起,第13至713位为OsU3-RCR1-RCR2表达盒的核苷酸序列,第13至393位为OsU3启动子的核苷酸序列,第394至436位和第559至601位均为Hammerhead(HH)型核酸酶的核苷酸序列,第481至548位和第646至713位为丁型肝炎病毒(HDV)核酸酶的核苷酸序列,第437至480位为crRNA1的核苷酸序列,第602至645位为crRNA2的核苷酸序列,第817至1069位为Nos终止子的核苷酸序列的反向互补序列;第1089至4937位为编码LbCpf1的核苷酸序列的反向互补序列,第4940至6925位为Ubi启动子的核苷酸序列的反向互补序列,第7217至7886位为DNA修复模板armed-DRT。
DNA修复模板armed-DRT中,第7225至7251位为crRNA1的靶标序列,第7252至7348位为上游同源臂,第7349至7729位为突变区段,第7730至7850位为下游同源臂,第7851-7877位为crRNA2的靶标序列。
载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos、载体pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos和载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)(对照载体)部分元件结构示意图见图1。
载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos通过OsU3启动的基因转录,可获得转录本RCR1-RCR2-RDR片段,其中的HH和HDV核酶对转录本进行自剪切,crRNAs和RNA修复模板被精确释放。
载体pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos通过OsU3启动的基因转录,可获得转录本RCR1-RCR2-armed donor(with targets)片段,其中的HH和HDV核酶对转录本进行自剪切,crRNAs与armed donor(with targets)分开,crRNAs被精确释放,LbCpf1蛋白可在RNA水平armed donor(with targets)片段进行切割,从而获得精确的修复模板。
二、水稻愈伤中RNA作为修复模板介导的DNA重组修复活性检测
1、选取饱满的中花11水稻种子,剥去种皮,灭菌洗涤后,均匀的点入在含有2毫克/升2,4-D的灭菌NB固体培养基中,28℃黑暗培养40-50天以诱导愈伤组织的产生。
2、将步骤1得到的愈伤组织在含有0.3M甘露醇和0.3M山梨醇的MS培养基中高渗处理4-6小时。
3、以pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos载体为模板,利用引物T7-F和引物T7-Nos-R组成的引物对进行PCR扩增,获得体外转录模板RCR1-RCR2-RDR片段,根据HiScribe T7Quick High Yield RNA Synthesis Kit(NEB)说明书要求,配制如下体系,37℃孵育6h,进行体外转录,获得转录产物(crRNAs与RNA修复模板)。
反应体系:
模板 | 2μL(400ng) |
NTP Buffer Mix | 10μL |
T7RNA polymerase Mix | 2μL |
RNase-Free ddH2O | 6μL |
总体系 | 20μL |
4、将步骤3得到的转录产物加入2μLDNase I和30μL RNase-Free ddH2O进行处理,去除DNA,并经试剂盒纯化后与LbCpf1蛋白(序列表的序列7所示)进行组装,室温放置15min,形成RNP,组装体系如下:
组装体系:
LbCpf1蛋白 | 10μg |
转录产物 | 10μg |
10×Buffer 3 | 2μL |
RNaseinhibitior | 1μL |
RNase-Free ddH2O | xμL |
总体系 | 20μL |
5、将步骤4得到的RNP通过基因枪转化水稻愈伤,采用0.6μm金粉,轰击压力为900psi进行轰击。
6、完成步骤5后,将水稻愈伤28℃暗培养36h后提取基因组DNA,以基因组DNA为模板,采用引物ALSTestF和引物T2MR组成的引物对进行PCR扩增,将扩增产物测序检测是否发生ALS基因同源重组。
结果如图2所示。其中,WT ALS为野生型ALS基因(序列表的序列6);Donor为修复模板序列(序列表的序列5);下划线序列分别为靶点1和靶点2序列;斜体的碱基为定点突变的PAM位点及EcoRV酶切位点,斜体加粗的碱基为目标替换成的碱基。
结果显示,得到的愈伤组织中,愈伤RDR35中检测到有完整同源重组,RDR41愈伤有部分同源重组。结果表明,以RNA作为修复模板,可成功介导基因组DNA的同源重组修复。
三、转基因水稻的获得
1、选取饱满的中花11水稻种子,剥去种皮,灭菌洗涤后,均匀的点入在含有2毫克/升2,4-D的灭菌NB固体培养基中,28℃黑暗培养40-50天以诱导愈伤组织的产生。
2、将步骤1得到的愈伤组织在含有0.3M甘露醇和0.3M山梨醇的MS培养基中高渗处理4-6小时后,将pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos通过基因枪轰击水稻愈伤,采用0.6μm金粉,轰击压力为900psi进行轰击,轰击后在含有0.3M甘露醇和0.3M山梨醇的MS培养基上28℃暗培养16小时后转移至NB筛选培养基(含有2毫克/升的2,4-D和50毫克/升的潮霉素的NB固体培养基)中,28℃持续暗培养2周。
3、完成步骤2后,选取生长良好呈嫩黄色的阳性愈伤组织,用无菌镊子移至NB预分化培养基(含有1毫克/升NAA、5毫克/升ABA、2毫克/升kinetin和50毫克/升的潮霉素的NB固体培养基)上,28℃持续暗培养2周。
4、完成步骤3后,挑选生长旺盛的愈伤组织转入MS分化培养基(含有0.02毫克/升NAA、2毫克/升kinetin和0.4μM双草醚钠盐的MS固体培养基)中,28℃持续光照培养。
5、完成步骤4后,待分化出来的幼苗长至2至5毫米,转入MS固体培养基中28℃光照培养2到3周,之后移入土中置于温室生长(温度28-30℃,16小时光照/8小时黑暗),得到T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos)。
6、采用pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos替代pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos,按照步骤1-5进行操作,得到T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos)。
7、采用pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)替代pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos,按照步骤1-5进行操作,得到T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets))。
四、转基因水稻的基因型鉴定
待测植株:野生型中花11水稻(WT)、T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos)、T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos)和T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets))。
提取待测植株的基因组DNA,以基因组DNA为模板,采用引物ALStestF和引物ALStestR组成的引物对进行PCR扩增,将PCR扩增产物采用EcoRV酶切,野生对照可以被EcoRV切开并产生481bp和322bp两种类型片段,不能被EcoRV完全酶切的植株鉴定为同源重组成功植株。将完全没有或者部分切开PCR产物进行克隆测序。统计结果见表2和图3。
表2转基因水稻的基因型鉴定统计结果
图3中,图3A为T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos)的检测结果,图3B为T0代转基因植株(转pCXUN-OsU3-RCR1-RCR2-armed donor(withtargets)-Nos-Ubi-LbCpf1-Nos)的检测结果。其中,WT ALS为野生型ALS基因(序列表的序列6);Donor为修复模板序列(序列表的序列5);下划线序列分别为靶点1和靶点2序列;斜体的碱基为定点突变的PAM位点及EcoRV酶切位点,斜体加粗的碱基为目标替换成的碱基。
对于载体pCXUN-OsU3-RCR1-RCR2-RDR-Nos-Ubi-LbCpf1-Nos而言,共获得58棵植株。对58棵植株PCR产物用EcoRV酶切鉴定后结果表明,288-6一条链为完整同源重组,另一条链为野生型。289-4和293-1一条链为部分同源重组,另一条链为野生型。。
对于载体pCXUN-OsU3-RCR1-RCR2-armed donor(with targets)-Nos-Ubi-LbCpf1-Nos而言,共获得87棵植株183-2,185-5和278-4一条链为完整同源重组,另一条链为野生型。198-1一条链为完整同源重组,另一条链为部分同源重组。193一条链为部分同源重组并伴随28bp缺失,另一条链为野生型。
载体pCXUN-OsU3-RCR1-RCR2-Ubi-LbCpf1-Nos-armed donor(with targets)未得到重组植株。
五、脱靶分析
本实验对8颗植株进行PCR靶点1和靶点2的脱靶进行鉴定,PCR产物克隆并测序结果表明,本实验所设计的crRNA1和crRNA2并不存在脱靶情况。
对8颗植株进行靶标1和靶标2的脱靶情况的鉴定,具体步骤为:提取植株的基因组DNA,采用特异引物对进行PCR扩增,然后将PCR扩增产物进行测序。
靶标1存在三个可能脱靶的位点,ALS1-OFF1、ALS1-OFF2和ALS1-OFF3。
靶标2存在两个可能脱靶的位点,ALS2-OFF4和ALS2-OFF5。
用于各个脱靶位点的引物对见表1。
表3脱靶分析统计结果
注:PAM位点用下划线表示,错配碱基用斜体表示。
序列表
<110> 中国农业科学院作物科学研究所
<120> 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法
<160> 7
<170> SIPOSequenceListing 1.0
<210> 1
<211> 16802
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 1
gaattcgagc tcaaggaatc tttaaacata cgaacagatc acttaaagtt cttctgaagc 60
aacttaaagt tatcaggcat gcatggatct tggaggaatc agatgtgcag tcagggacca 120
tagcacaaga caggcgtctt ctactggtgc taccagcaaa tgctggaagc cgggaacact 180
gggtacgttg gaaaccacgt gatgtgaaga agtaagataa actgtaggag aaaagcattt 240
cgtagtgggc catgaagcct ttcaggacat gtattgcagt atgggccggc ccattacgca 300
attggacgac aacaaagact agtattagta ccacctcggc tatccacata gatcaaagct 360
gatttaaaag agttgtgcag atgatccgtg gcaaaattac tgatgagtcc gtgaggacga 420
aacgagtaag ctcgtctaat ttctactaag tgtagatggt atggtggtgc aatgggagga 480
ggccggcatg gtcccagcct cctcgctggc gccggctggg caacatgctt cggcatggcg 540
aatgggacga atacgaccaa attactgatg agtccgtgag gacgaaacga gtaagctcgt 600
ctaatttcta ctaagtgtag atacctgaat gacccataaa gagtgggccg gcatggtccc 660
agcctcctcg ctggcgccgg ctgggcaaca tgcttcggca tggcgaatgg gaccggtacc 720
acacatcaac tgatgagtcc gtgaggacga aacgagtaag ctcgtcttga tggggatggt 780
agcttcctca tgaacattca ggagctggca ttgatccgca ttgagaacct ccctgtgaag 840
gtgatggtgt tgaacaacca acacctaggc atggtcgtcc agttggagga taggttttac 900
aaggcgaata gggcgcatac atacttgggc aacccggaat gtgagagcga gatatatcca 960
gattttgtga ctattgctaa ggggttcaat attcctgcag tccgtgtaac aaagaagagt 1020
gaagtccgtg ccgccatcaa gaagatgctc gagactccag ggccatactt gttggacatc 1080
atcgtcccgc accaggagca tgtgctgcct atgatcccaa ttgggggcgc attcaaggac 1140
atgatcctgg atggtgatgg caggactgtg tattaatcta taatctgtat gttggcaaag 1200
caccagcccg gcctatgtct gacgtgaatg actcataaag agtggtatgc ctatgatgtt 1260
tgtatgtgct ctatcaataa ctaaggtgtc aactatgaac catatgctct tctgttttac 1320
ttgtttgatg tgcttggcat ggtaatccta attagcttcc tgctgggccg gcatggtccc 1380
agcctcctcg ctggcgccgg ctgggcaaca tgcttcggca tggcgaatgg gacgatcgtt 1440
caaacatttg gcaataaagt ttcttaagat tgaatcctgt tgccggtctt gcgatgatta 1500
tcatataatt tctgttgaat tacgttaagc atgtaataat taacatgtaa tgcatgacgt 1560
tatttatgag atgggttttt atgattagag tcccgcaatt atacatttaa tacgcgatag 1620
aaaacaaaat atagcgcgca aactaggata aattatcgcg cgcggtgtca tctatgttac 1680
tagatcggta cccctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc 1740
agggttttcc cagtcacgac gttgtaaaac gacggccagt gaattcccga tctagtaaca 1800
tagatgacac cgcgcgcgat aatttatcct agtttgcgcg ctatattttg ttttctatcg 1860
cgtattaaat gtataattgc gggactctaa tcataaaaac ccatctcata aataacgtca 1920
tgcattacat gttaattatt acatgcttaa cgtaattcaa cagaaattat atgataatca 1980
tcgcaagacc ggcaacagga ttcaatctta agaaacttta ttgccaaatg tttgaacgat 2040
cggggaaatt cggatcctta ctttttcttt tttgcctggc cggccttttt cgtggccgcc 2100
ggccttttgt gcttcacgct ggtctgggcg tactccagcc actccttgtt agagatggcg 2160
atcttcacct tatccagctt ctcgtcctcg gccttcttga actggccgat ggcccacagc 2220
acctttctgg cgatgttata ggcgccattg gcgtcggcgt tctttggcag gatggcattc 2280
tcctgggcct catagttccg gctatcgtag aagatgccgt cggagttctt cacagggctg 2340
atcagaaaat ccacgtcggt gcggcctgtg atgctgttcc gcatctgcag catcaggctc 2400
atcagggcca taaagctaga gtagaaggcc ttgtcggact gctcgcacag cagggctctg 2460
atatcgccct gctgataatt gatgccgtac ttgttgaaca gctccttata ggcgctggtc 2520
aggcacacct cctcccagtc gaacacgttg ttcttcttag gattccggaa gattctgatc 2580
cggttgccgt aggagtacag cttccacttc ttgatgtaat cggcgtctgt gcgagagaag 2640
ttcttatagt ccagggcaaa ctcgaacaga tcctcctcgg gcacgtacat gatcctgtca 2700
aaggagctga tgaacttctt ggaatcggcg atgctggtat acttggtttt cagcaggttc 2760
acaaagccgg tagatggatc gatcttggat gtcagccagg cagggatgta aaagatgaag 2820
ccgttctggg tagacatgga cttaaagctc tcgaacttat tggtgatctg atagcccttc 2880
agggcgccgc ctgttgcaca aggattagac ttcttgtcca ccatgtagtt cagcttatcg 2940
atcagcatct tctcgaactt ctgatacacc tgcttctcca ccttcacgcg gctattctta 3000
aagccagagt tcaggtcctc cagggcgatc acggcatcgt acttctccac cagctcgcag 3060
atcttgtgca ccacctgaga gatatagccg gccttcagct ccttgatatt ctcgatggag 3120
gtccagttct ggcgggcctc gaacctctcc ttctccttct tgtccagcag agagtggtaa 3180
tctgtcttga tcctgatgcc gttgaagttg ttgatgatct cgttcaggga atactgctcc 3240
acgatgttgc ccttgccgtc caccaccacg atatacagca gattgcgctc gcccctatcg 3300
atgccgatca cataggggtt atcgtcgtgc ttcagcagca cgcgcacctc tgtattgatc 3360
ttgaagatgt tcttggggca cttattgatg gcgattggga tgtgcagctc gtactggtcc 3420
tcagaaaacc tcttatcctt atacacgtcg taggacaggg ttgtggtttt cttgggatta 3480
tctggattct tgttggcgat aggggagttg gctgggtgca ccaccagctc ctccttcttc 3540
agggaggcgc gcctcatgaa cagctctgct cctccgctca gcctgatctg tccgtgattg 3600
ttctcgtcaa acagcagctt gaagtacatg gtgtgcagat tgggtgtgcc gtgagactta 3660
tcggaaaagt ccttgttata gatctggaac atatacagct tgccctcctc caccagctta 3720
tccacctcct tcttgctggc agactcgaag ctcaccttat agccctgctc ctccacctct 3780
ctgtaaaagc cggcgatgtc cttatacttc tctgtctcag aaaagttgaa atcgtaggca 3840
ttggaccact ttggataccg ggagatgcta tccttaaaga agtcgatcag cttgtgacag 3900
tcattcaggt taaacatatc gcccttcttg aatgtgccat tcttgtagat cttctggatg 3960
tcctcgctgg ggttatagta ggccatccac ttcttagaaa agaacacctt tggcagcatc 4020
ttattagggc cgggcagcag cttatagttg atcttctcgt aattgccgtt cacatcgtcc 4080
ttgtcgatct tctgcaggca cttggcgtac ttcttatcca tgatggccag atagtacttg 4140
gagccgtatc tcaggatggt ggcccgatag tctgtctcct tatccttgtc ccagccgccc 4200
atgaactgag ggttctgaaa atacagcttg aacttatcct tagagtaggg cttctgggtc 4260
acataattgc ggatggcatc gtagatgtgg tccaccttca gcaggatgtc gtaggccagc 4320
acaaaatcgc catagaagga ctcgtccctg tttgtctcct tgccctcgcc aaagaaggcc 4380
ttgatgtaat tctcgaagct cttcacagaa tccagcaggt ccttcatgat ggccaccacg 4440
gcgtcgttct tcttcaggct cttctccagc acaaaatcgg cgtcgaacag cttctcagag 4500
gagccataca ccttgtagat ctcatccacc ttctggatga tgatctcctt cagcttctcc 4560
accacagaca gatcggcgtc ggcgtactcc tgcagctgct ccagagaaaa ggagccgatc 4620
ttcttgaagg actttctccg atcgtcctcg tacttctcgg tcaccacggc cttcttcttc 4680
aggtggatat cgtcatactc ggcattccac ttgtcccgga tcacgttcca ctcgccgaag 4740
atatccttgg agattgtgct gatggcgggg ccgttcttca caaagatgcc ggcgctagag 4800
tactcgtcaa aattcttgaa cagcttctcc agcttcttga tggagctgaa gatctcgctg 4860
ttcttgttca gggtgtttct aaacacctcc agcacctcct catcggatgt atagccctcg 4920
ccgtagaagc tcagagactc ccgatcgctc agcacctgct tatacagtgg cttaaactta 4980
ggcagcttct gcttggtttt ctgattatac aggttgatgt actcgttcag gcccttgatc 5040
ttctcgccgc tctcggtcac gaagccgccg atgatggcgt tatacacgtc gatgccctcc 5100
tgtgtcagca caaagttaaa gaactcgccc tcaaagaaat cctccacatc atagtcgctg 5160
ttcaggatct tctccttgat ctcctgcacc tcgtgcttat caaagatggc gtccaccttc 5220
tcgaagatgt ccatattaga gatgtagcgg gtcagattct cgttgataca cctgaaggcg 5280
atggatgtgc tcttggcctc ctcggaaaac atattctctc tgttatcaaa gaagccggtg 5340
aaggctgtgg taaagccatt gaagctgttc accagggcga tctcgtcctt atcgtccagg 5400
aactctggca ggattgtctc gatgatatcc ttcttaaaca gggacttgta gccctcgttg 5460
cccttgaagg ccttggcgat ctccttccgc agattgatct ccaggttctc cagctcctta 5520
ttctccttct cggttctggt tttcttccgg aacaggctga tgtaattgtt cagattcttc 5580
agcttgatgc tgtgcagcac gtcgttgata aaagacagat agtagcgatc cagcagcttc 5640
ttcacgccct tataatcctc ggctctcttc tcgtcctcca ccagcagccg cttattgtcg 5700
atgttctcct gggtcttgcc cacagggatg gccttgaacc tcagggtctt agacagggag 5760
tagcagtttg taaacttctc cagcttgctg gctgctggga ctccgtggat accgaccttc 5820
cgcttcttct ttggggccat cttatcgtca tcgtctttgt aatcaatatc atgatccttg 5880
tagtctccgt cgtggtcctt atagtccatg gctgcagaag taacaccaaa caacagggtg 5940
agcatcgaca aaagaaacag taccaagcaa ataaatagcg tatgaaggca gggctaaaaa 6000
aatccacata tagctgctgc atatgccatc atccaagtat atcaagatca aaataattat 6060
aaaacatact tgtttattat aatagatagg tactcaaggt tagagcatat gaatagatgc 6120
tgcatatgcc atcatgtata tgcatcagta aaacccacat caacatgtat acctatccta 6180
gatcgatatt tccatccatc ttaaactcgt aactatgaag atgtatgaca cacacataca 6240
gttccaaaat taataaatac accaggtagt ttgaaacagt attctactcc gatctagaac 6300
gaatgaacga ccgcccaacc acaccacatc atcacaacca agcgaacaaa aagcatctct 6360
gtatatgcat cagtaaaacc cgcatcaaca tgtataccta tcctagatcg atatttccat 6420
ccatcatctt caattcgtaa ctatgaatat gtatggcaca cacatacaga tccaaaatta 6480
ataaatccac caggtagttt gaaacagaat tctactccga tctagaacga ccgcccaacc 6540
agaccacatc atcacaacca agacaaaaaa aagcatgaaa agatgacccg acaaacaagt 6600
gcacggcata tattgaaata aaggaaaagg gcaaaccaaa ccctatgcaa cgaaacaaaa 6660
aaaatcatga aatcgatccc gtctgcggaa cggctagagc catcccagga ttccccaaag 6720
agaaacactg gcaagttagc aatcagaacg tgtctgacgt acaggtcgca tccgtgtacg 6780
aacgctagca gcacggatct aacacaaaca cggatctaac acaaacatga acagaagtag 6840
aactaccggg ccctaaccat ggaccggaac gccgatctag agaaggtaga gagggggggg 6900
gggggaggac gagcggcgta ccttgaagcg gaggtgccga cgggtggatt tgggggagat 6960
ctggttgtgt gtgtgtgcgc tccgaacaac acgaggttgg ggaaagaggg tgtggagggg 7020
gtgtctattt attacggcgg gcgaggaagg gaaagcgaag gagcggtggg aaaggaatcc 7080
cccgtagctg ccgtgccgtg agaggaggag gaggccgcct gccgtgccgg ctcacgtctg 7140
ccgctccgcc acgcaatttc tggatgccga cagcggagca agtccaacgg tggagcggaa 7200
ctctcgagag gggtccagag gcagcgacag agatgccgtg ccgtctgctt cgcttggccc 7260
gacgcgacgc tgctggttcg ctggttggtg tccgttagac tcgtcgacgg cgtttaacag 7320
gctggcatta tctactcgaa acaagaaaaa tgtttcctta gtttttttaa tttcttaaag 7380
ggtatttgtt taatttttag tcactttatt ttattctatt ttatatctaa attattaaat 7440
aaaaaaacta aaatagagtt ttagttttct taatttagag gctaaaatag aataaaatag 7500
atgtactaaa aaaattagtc tataaaaacc attaacccta aaccctaaat ggatgtacta 7560
ataaaatgga tgaagtatta tataggtgaa gctatttgca aaaaaaaagg agaacacatg 7620
cacactaaaa agataaaact gtagagtcct gttgtcaaaa tactcaattg tcctttagac 7680
catgtctaac tgttcattta tatgattctc taaaacactg atattattgt agtactatag 7740
attatattat tcgtagagta aagtttaaat atatgtataa agatagataa actgcacttc 7800
aaacaagtgt gacaaaaaaa atatgtggta attttttata acttagacat gcaatgctca 7860
ttatctctag agaggggcac gaccgggtca cgctgcaaag cttggcactg gccgtcgttt 7920
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 7980
cccctttcgc cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt 8040
tgcgcagcct gaatggcgaa tgctagagca gcttgagctt ggatcagatt gtcgtttccc 8100
gccttcagtt taaactatca gtgtttgaca ggatatattg gcgggtaaac ctaagagaaa 8160
agagcgttta ttagaataac ggatatttaa aagggcgtga aaaggtttat ccgttcgtcc 8220
atttgtatgt gcatgccaac cacagggttc ccctcgggat caaagtactt tgatccaacc 8280
cctccgctgc tatagtgcag tcggcttctg acgttcagtg cagccgtctt ctgaaaacga 8340
catgtcgcac aagtcctaag ttacgcgaca ggctgccgcc ctgccctttt cctggcgttt 8400
tcttgtcgcg tgttttagtc gcataaagta gaatacttgc gactagaacc ggagacatta 8460
cgccatgaac aagagcgccg ccgctggcct gctgggctat gcccgcgtca gcaccgacga 8520
ccaggacttg accaaccaac gggccgaact gcacgcggcc ggctgcacca agctgttttc 8580
cgagaagatc accggcacca ggcgcgaccg cccggagctg gccaggatgc ttgaccacct 8640
agccctggcg acgttgtgac agtgaccagg ctagaccgcc tggcccgcag cacccgcgac 8700
ctactggaca ttgccgagcg catccaggag gccggcgcgg gcctgcgtag cctggcagag 8760
ccgtgggccg acaccaccac gccggccggc cgcatggtgt tgaccgtgtt cgccggcatt 8820
gccgagttcg agcgttccct aatcatcgac cgcacccgga gcgggcgcga ggccgccaag 8880
gcccgaggcg tgaagtttgg cccccgccct accctcaccc cggcacagat cgcgcacgcc 8940
cgcgagctga tcgaccagga aggccgcacc gtgaaagagg cggctgcact gcttggcgtg 9000
catcgctcga ccctgtaccg cgcacttgag cgcagcgagg aagtgacgcc caccgaggcc 9060
aggcggcgcg gtgccttccg tgaggacgca ttgaccgagg ccgacgccct ggcggccgcc 9120
gagaatgaac gccaagagga acaagcatga aaccgcacca ggacggccag gacgaaccgt 9180
ttttcattac cgaagagatc gaggcggaga tgatcgcggc cgggtacgtg ttcgagccgc 9240
ccgcgcacgt ctcaaccgtg cggctgcatg aaatcctggc cggtttgtct gatgccaagc 9300
tggcggcctg gccggccagc ttggccgctg aagaaaccga gcgccgccgt ctaaaaaggt 9360
gatgtgtatt tgagtaaaac agcttgcgtc atgcggtcgc tgcgtatatg atgcgatgag 9420
taaataaaca aatacgcaag gggaacgcat gaaggttatc gctgtactta accagaaagg 9480
cgggtcaggc aagacgacca tcgcaaccca tctagcccgc gccctgcaac tcgccggggc 9540
cgatgttctg ttagtcgatt ccgatcccca gggcagtgcc cgcgattggg cggccgtgcg 9600
ggaagatcaa ccgctaaccg ttgtcggcat cgaccgcccg acgattgacc gcgacgtgaa 9660
ggccatcggc cggcgcgact tcgtagtgat cgacggagcg ccccaggcgg cggacttggc 9720
tgtgtccgcg atcaaggcag ccgacttcgt gctgattccg gtgcagccaa gcccttacga 9780
catatgggca accgccgacc tggtggagct ggttaagcag cgcattgagg tcacggatgg 9840
aaggctacaa gcggcctttg tcgtgtcgcg ggcgatcaaa ggcacgcgca tcggcggtga 9900
ggttgccgag gcgctggccg ggtacgagct gcccattctt gagtcccgta tcacgcagcg 9960
cgtgagctac ccaggcactg ccgccgccgg cacaaccgtt cttgaatcag aacccgaggg 10020
cgacgctgcc cgcgaggtcc aggcgctggc cgctgaaatt aaatcaaaac tcatttgagt 10080
taatgaggta aagagaaaat gagcaaaagc acaaacacgc taagtgccgg ccgtccgagc 10140
gcacgcagca gcaaggctgc aacgttggcc agcctggcag acacgccagc catgaagcgg 10200
gtcaactttc agttgccggc ggaggatcac accaagctga agatgtacgc ggtacgccaa 10260
ggcaagacca ttaccgagct gctatctgaa tacatcgcgc agctaccaga gtaaatgagc 10320
aaatgaataa atgagtagat gaattttagc ggctaaagga ggcggcatgg aaaatcaaga 10380
acaaccaggc accgacgccg tggaatgccc catgtgtgga ggaacgggcg gttggccagg 10440
cgtaagcggc tgggttgtct gccggccctg caatggcact ggaaccccca agcccgagga 10500
atcggcgtga cggtcgcaaa ccatccggcc cggtacaaat cggcgcggcg ctgggtgatg 10560
acctggtgga gaagttgaag gccgcgcagg ccgcccagcg gcaacgcatc gaggcagaag 10620
cacgccccgg tgaatcgtgg caagcggccg ctgatcgaat ccgcaaagaa tcccggcaac 10680
cgccggcagc cggtgcgccg tcgattagga agccgcccaa gggcgacgag caaccagatt 10740
ttttcgttcc gatgctctat gacgtgggca cccgcgatag tcgcagcatc atggacgtgg 10800
ccgttttccg tctgtcgaag cgtgaccgac gagctggcga ggtgatccgc tacgagcttc 10860
cagacgggca cgtagaggtt tccgcagggc cggccggcat ggccagtgtg tgggattacg 10920
acctggtact gatggcggtt tcccatctaa ccgaatccat gaaccgatac cgggaaggga 10980
agggagacaa gcccggccgc gtgttccgtc cacacgttgc ggacgtactc aagttctgcc 11040
ggcgagccga tggcggaaag cagaaagacg acctggtaga aacctgcatt cggttaaaca 11100
ccacgcacgt tgccatgcag cgtacgaaga aggccaagaa cggccgcctg gtgacggtat 11160
ccgagggtga agccttgatt agccgctaca agatcgtaaa gagcgaaacc gggcggccgg 11220
agtacatcga gatcgagcta gctgattgga tgtaccgcga gatcacagaa ggcaagaacc 11280
cggacgtgct gacggttcac cccgattact ttttgatcga tcccggcatc ggccgttttc 11340
tctaccgcct ggcacgccgc gccgcaggca aggcagaagc cagatggttg ttcaagacga 11400
tctacgaacg cagtggcagc gccggagagt tcaagaagtt ctgtttcacc gtgcgcaagc 11460
tgatcgggtc aaatgacctg ccggagtacg atttgaagga ggaggcgggg caggctggcc 11520
cgatcctagt catgcgctac cgcaacctga tcgagggcga agcatccgcc ggttcctaat 11580
gtacggagca gatgctaggg caaattgccc tagcagggga aaaaggtcga aaaggtctct 11640
ttcctgtgga tagcacgtac attgggaacc caaagccgta cattgggaac cggaacccgt 11700
acattgggaa cccaaagccg tacattggga accggtcaca catgtaagtg actgatataa 11760
aagagaaaaa aggcgatttt tccgcctaaa actctttaaa acttattaaa actcttaaaa 11820
cccgcctggc ctgtgcataa ctgtctggcc agcgcacagc cgaagagctg caaaaagcgc 11880
ctacccttcg gtcgctgcgc tccctacgcc ccgccgcttc gcgtcggcct atcgcggccg 11940
ctggccgctc aaaaatggct ggcctacggc caggcaatct accagggcgc ggacaagccg 12000
cgccgtcgcc actcgaccgc cggcgcccac atcaaggcac cctgcctcgc gcgtttcggt 12060
gatgacggtg aaaacctctg acacatgcag ctcccggaga cggtcacagc ttgtctgtaa 12120
gcggatgccg ggagcagaca agcccgtcag ggcgcgtcag cgggtgttgg cgggtgtcgg 12180
ggcgcagcca tgacccagtc acgtagcgat agcggagtgt atactggctt aactatgcgg 12240
catcagagca gattgtactg agagtgcacc atatgcggtg tgaaataccg cacagatgcg 12300
taaggagaaa ataccgcatc aggcgctctt ccgcttcctc gctcactgac tcgctgcgct 12360
cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa ggcggtaata cggttatcca 12420
cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga 12480
accgtaaaaa ggccgcgttg ctggcgtttt tccataggct ccgcccccct gacgagcatc 12540
acaaaaatcg acgctcaagt cagaggtggc gaaacccgac aggactataa agataccagg 12600
cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat 12660
acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca cgctgtaggt 12720
atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa ccccccgttc 12780
agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg gtaagacacg 12840
acttatcgcc actggcagca gccactggta acaggattag cagagcgagg tatgtaggcg 12900
gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg acagtatttg 12960
gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc tcttgatccg 13020
gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag attacgcgca 13080
gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac gctcagtgga 13140
acgaaaactc acgttaaggg attttggtca tgcattctag gtactaaaac aattcatcca 13200
gtaaaatata atattttatt ttctcccaat caggcttgat ccccagtaag tcaaaaaata 13260
gctcgacata ctgttcttcc ccgatatcct ccctgatcga ccggacgcag aaggcaatgt 13320
cataccactt gtccgccctg ccgcttctcc caagatcaat aaagccactt actttgccat 13380
ctttcacaaa gatgttgctg tctcccaggt cgccgtggga aaagacaagt tcctcttcgg 13440
gcttttccgt ctttaaaaaa tcatacagct cgcgcggatc tttaaatgga gtgtcttctt 13500
cccagttttc gcaatccaca tcggccagat cgttattcag taagtaatcc aattcggcta 13560
agcggctgtc taagctattc gtatagggac aatccgatat gtcgatggag tgaaagagcc 13620
tgatgcactc cgcatacagc tcgataatct tttcagggct ttgttcatct tcatactctt 13680
ccgagcaaag gacgccatcg gcctcactca tgagcagatt gctccagcca tcatgccgtt 13740
caaagtgcag gacctttgga acaggcagct ttccttccag ccatagcatc atgtcctttt 13800
cccgttcaac atcataggtg gtccctttat accggctgtc cgtcattttt aaatataggt 13860
tttcattttc tcccaccagc ttatatacct tagcaggaga cattccttcc gtatctttta 13920
cgcagcggta tttttcgatc agttttttca attccggtga tattctcatt ttagccattt 13980
attatttcct tcctcttttc tacagtattt aaagataccc caagaagcta attataacaa 14040
gacgaactcc aattcactgt tccttgcatt ctaaaacctt aaataccaga aaacagcttt 14100
ttcaaagttg ttttcaaagt tggcgtataa catagtatcg acggagccga ttttgaaacc 14160
gcggtgatca caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga 14220
gatcatccgt gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac 14280
atgagcaaag tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg 14340
ctgcctgtat cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct 14400
ggtggcagga tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg 14460
cggacgtttt taatgtactg aattaacgcc gaattaattc gggggatctg gattttagta 14520
ctggattttg gttttaggaa ttagaaattt tattgataga agtattttac aaatacaaat 14580
acatactaag ggtttcttat atgctcaaca catgagcgaa accctatagg aaccctaatt 14640
cccttatctg ggaactactc acacattatt atggagaaac tcgagcttgt cgatcgacag 14700
atccggtcgg catctactct atttctttgc cctcggacga gtgctggggc gtcggtttcc 14760
actatcggcg agtacttcta cacagccatc ggtccagacg gccgcgcttc tgcgggcgat 14820
ttgtgtacgc ccgacagtcc cggctccgga tcggacgatt gcgtcgcatc gaccctgcgc 14880
ccaagctgca tcatcgaaat tgccgtcaac caagctctga tagagttggt caagaccaat 14940
gcggagcata tacgcccgga gtcgtggcga tcctgcaagc tccggatgcc tccgctcgaa 15000
gtagcgcgtc tgctgctcca tacaagccaa ccacggcctc cagaagaaga tgttggcgac 15060
ctcgtattgg gaatccccga acatcgcctc gctccagtca atgaccgctg ttatgcggcc 15120
attgtccgtc aggacattgt tggagccgaa atccgcgtgc acgaggtgcc ggacttcggg 15180
gcagtcctcg gcccaaagca tcagctcatc gagagcctgc gcgacggacg cactgacggt 15240
gtcgtccatc acagtttgcc agtgatacac atggggatca gcaatcgcgc atatgaaatc 15300
acgccatgta gtgtattgac cgattccttg cggtccgaat gggccgaacc cgctcgtctg 15360
gctaagatcg gccgcagcga tcgcatccat agcctccgcg accggttgta gaacagcggg 15420
cagttcggtt tcaggcaggt cttgcaacgt gacaccctgt gcacggcggg agatgcaata 15480
ggtcaggctc tcgctaaact ccccaatgtc aagcacttcc ggaatcggga gcgcggccga 15540
tgcaaagtgc cgataaacat aacgatcttt gtagaaacca tcggcgcagc tatttacccg 15600
caggacatat ccacgccctc ctacatcgaa gctgaaagca cgagattctt cgccctccga 15660
gagctgcatc aggtcggaga cgctgtcgaa cttttcgatc agaaacttct cgacagacgt 15720
cgcggtgagt tcaggctttt tcatatctca ttgccccccg gatctgcgaa agctcgagag 15780
agatagattt gtagagagag actggtgatt tcagcgtgtc ctctccaaat gaaatgaact 15840
tccttatata gaggaaggtc ttgcgaagga tagtgggatt gtgcgtcatc ccttacgtca 15900
gtggagatat cacatcaatc cacttgcttt gaagacgtgg ttggaacgtc ttctttttcc 15960
acgatgctcc tcgtgggtgg gggtccatct ttgggaccac tgtcggcaga ggcatcttga 16020
acgatagcct ttcctttatc gcaatgatgg catttgtagg tgccaccttc cttttctact 16080
gtccttttga tgaagtgaca gatagctggg caatggaatc cgaggaggtt tcccgatatt 16140
accctttgtt gaaaagtctc aatagccctt tggtcttctg agactgtatc tttgatattc 16200
ttggagtaga cgagagtgtc gtgctccacc atgttatcac atcaatccac ttgctttgaa 16260
gacgtggttg gaacgtcttc tttttccacg atgctcctcg tgggtggggg tccatctttg 16320
ggaccactgt cggcagaggc atcttgaacg atagcctttc ctttatcgca atgatggcat 16380
ttgtaggtgc caccttcctt ttctactgtc cttttgatga agtgacagat agctgggcaa 16440
tggaatccga ggaggtttcc cgatattacc ctttgttgaa aagtctcaat agccctttgg 16500
tcttctgaga ctgtatcttt gatattcttg gagtagacga gagtgtcgtg ctccaccatg 16560
ttggcaagct gctctagcca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt 16620
aatgcagctg gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta 16680
atgtgagtta gctcactcat taggcacccc aggctttaca ctttatgctt ccggctcgta 16740
tgttgtgtgg aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt 16800
ac 16802
<210> 2
<211> 1614
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 2
aaggaatctt taaacatacg aacagatcac ttaaagttct tctgaagcaa cttaaagtta 60
tcaggcatgc atggatcttg gaggaatcag atgtgcagtc agggaccata gcacaagaca 120
ggcgtcttct actggtgcta ccagcaaatg ctggaagccg ggaacactgg gtacgttgga 180
aaccacgtga tgtgaagaag taagataaac tgtaggagaa aagcatttcg tagtgggcca 240
tgaagccttt caggacatgt attgcagtat gggccggccc attacgcaat tggacgacaa 300
caaagactag tattagtacc acctcggcta tccacataga tcaaagctga tttaaaagag 360
ttgtgcagat gatccgtggc aaaattactg atgagtccgt gaggacgaaa cgagtaagct 420
cgtctaattt ctactaagtg tagatggtat ggtggtgcaa tgggaggagg ccggcatggt 480
cccagcctcc tcgctggcgc cggctgggca acatgcttcg gcatggcgaa tgggacgaat 540
acgaccaaat tactgatgag tccgtgagga cgaaacgagt aagctcgtct aatttctact 600
aagtgtagat acctgaatga cccataaaga gtgggccggc atggtcccag cctcctcgct 660
ggcgccggct gggcaacatg cttcggcatg gcgaatggga ccggtacctt tgggtatggt 720
ggtgcaatgg gaggattgat ggggatggta gcttcctcat gaacattcag gagctggcat 780
tgatccgcat tgagaacctc cctgtgaagg tgatggtgtt gaacaaccaa cacctaggca 840
tggtcgtcca gttggaggat aggttttaca aggcgaatag ggcgcataca tacttgggca 900
acccggaatg tgagagcgag atatatccag attttgtgac tattgctaag gggttcaata 960
ttcctgcagt ccgtgtaaca aagaagagtg aagtccgtgc cgccatcaag aagatgctcg 1020
agactccagg gccatacttg ttggacatca tcgtcccgca ccaggagcat gtgctgccta 1080
tgatcccaat tgggggcgca ttcaaggaca tgatcctgga tggtgatggc aggactgtgt 1140
attaatctat aatctgtatg ttggcaaagc accagcccgg cctatgtctg acgtgaatga 1200
ctcataaaga gtggtatgcc tatgatgttt gtatgtgctc tatcaataac taaggtgtca 1260
actatgaacc atatgctctt ctgttttact tgtttgatgt gcttggcatg gtaatcctaa 1320
ttagcttcct gctgtttgac ctgaatgacc cataaagagt ggatcgttca aacatttggc 1380
aataaagttt cttaagattg aatcctgttg ccggtcttgc gatgattatc atataatttc 1440
tgttgaatta cgttaagcat gtaataatta acatgtaatg catgacgtta tttatgagat 1500
gggtttttat gattagagtc ccgcaattat acatttaata cgcgatagaa aacaaaatat 1560
agcgcgcaaa ctaggataaa ttatcgcgcg cggtgtcatc tatgttacta gatc 1614
<210> 3
<211> 16675
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 3
gaattcgagc tcaaggaatc tttaaacata cgaacagatc acttaaagtt cttctgaagc 60
aacttaaagt tatcaggcat gcatggatct tggaggaatc agatgtgcag tcagggacca 120
tagcacaaga caggcgtctt ctactggtgc taccagcaaa tgctggaagc cgggaacact 180
gggtacgttg gaaaccacgt gatgtgaaga agtaagataa actgtaggag aaaagcattt 240
cgtagtgggc catgaagcct ttcaggacat gtattgcagt atgggccggc ccattacgca 300
attggacgac aacaaagact agtattagta ccacctcggc tatccacata gatcaaagct 360
gatttaaaag agttgtgcag atgatccgtg gcaaaattac tgatgagtcc gtgaggacga 420
aacgagtaag ctcgtctaat ttctactaag tgtagatggt atggtggtgc aatgggagga 480
ggccggcatg gtcccagcct cctcgctggc gccggctggg caacatgctt cggcatggcg 540
aatgggacga atacgaccaa attactgatg agtccgtgag gacgaaacga gtaagctcgt 600
ctaatttcta ctaagtgtag atacctgaat gacccataaa gagtgggccg gcatggtccc 660
agcctcctcg ctggcgccgg ctgggcaaca tgcttcggca tggcgaatgg gaccggtacc 720
cctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag ggttttccca 780
gtcacgacgt tgtaaaacga cggccagtga attcccgatc tagtaacata gatgacaccg 840
cgcgcgataa tttatcctag tttgcgcgct atattttgtt ttctatcgcg tattaaatgt 900
ataattgcgg gactctaatc ataaaaaccc atctcataaa taacgtcatg cattacatgt 960
taattattac atgcttaacg taattcaaca gaaattatat gataatcatc gcaagaccgg 1020
caacaggatt caatcttaag aaactttatt gccaaatgtt tgaacgatcg gggaaattcg 1080
gatccttact ttttcttttt tgcctggccg gcctttttcg tggccgccgg ccttttgtgc 1140
ttcacgctgg tctgggcgta ctccagccac tccttgttag agatggcgat cttcacctta 1200
tccagcttct cgtcctcggc cttcttgaac tggccgatgg cccacagcac ctttctggcg 1260
atgttatagg cgccattggc gtcggcgttc tttggcagga tggcattctc ctgggcctca 1320
tagttccggc tatcgtagaa gatgccgtcg gagttcttca cagggctgat cagaaaatcc 1380
acgtcggtgc ggcctgtgat gctgttccgc atctgcagca tcaggctcat cagggccata 1440
aagctagagt agaaggcctt gtcggactgc tcgcacagca gggctctgat atcgccctgc 1500
tgataattga tgccgtactt gttgaacagc tccttatagg cgctggtcag gcacacctcc 1560
tcccagtcga acacgttgtt cttcttagga ttccggaaga ttctgatccg gttgccgtag 1620
gagtacagct tccacttctt gatgtaatcg gcgtctgtgc gagagaagtt cttatagtcc 1680
agggcaaact cgaacagatc ctcctcgggc acgtacatga tcctgtcaaa ggagctgatg 1740
aacttcttgg aatcggcgat gctggtatac ttggttttca gcaggttcac aaagccggta 1800
gatggatcga tcttggatgt cagccaggca gggatgtaaa agatgaagcc gttctgggta 1860
gacatggact taaagctctc gaacttattg gtgatctgat agcccttcag ggcgccgcct 1920
gttgcacaag gattagactt cttgtccacc atgtagttca gcttatcgat cagcatcttc 1980
tcgaacttct gatacacctg cttctccacc ttcacgcggc tattcttaaa gccagagttc 2040
aggtcctcca gggcgatcac ggcatcgtac ttctccacca gctcgcagat cttgtgcacc 2100
acctgagaga tatagccggc cttcagctcc ttgatattct cgatggaggt ccagttctgg 2160
cgggcctcga acctctcctt ctccttcttg tccagcagag agtggtaatc tgtcttgatc 2220
ctgatgccgt tgaagttgtt gatgatctcg ttcagggaat actgctccac gatgttgccc 2280
ttgccgtcca ccaccacgat atacagcaga ttgcgctcgc ccctatcgat gccgatcaca 2340
taggggttat cgtcgtgctt cagcagcacg cgcacctctg tattgatctt gaagatgttc 2400
ttggggcact tattgatggc gattgggatg tgcagctcgt actggtcctc agaaaacctc 2460
ttatccttat acacgtcgta ggacagggtt gtggttttct tgggattatc tggattcttg 2520
ttggcgatag gggagttggc tgggtgcacc accagctcct ccttcttcag ggaggcgcgc 2580
ctcatgaaca gctctgctcc tccgctcagc ctgatctgtc cgtgattgtt ctcgtcaaac 2640
agcagcttga agtacatggt gtgcagattg ggtgtgccgt gagacttatc ggaaaagtcc 2700
ttgttataga tctggaacat atacagcttg ccctcctcca ccagcttatc cacctccttc 2760
ttgctggcag actcgaagct caccttatag ccctgctcct ccacctctct gtaaaagccg 2820
gcgatgtcct tatacttctc tgtctcagaa aagttgaaat cgtaggcatt ggaccacttt 2880
ggataccggg agatgctatc cttaaagaag tcgatcagct tgtgacagtc attcaggtta 2940
aacatatcgc ccttcttgaa tgtgccattc ttgtagatct tctggatgtc ctcgctgggg 3000
ttatagtagg ccatccactt cttagaaaag aacacctttg gcagcatctt attagggccg 3060
ggcagcagct tatagttgat cttctcgtaa ttgccgttca catcgtcctt gtcgatcttc 3120
tgcaggcact tggcgtactt cttatccatg atggccagat agtacttgga gccgtatctc 3180
aggatggtgg cccgatagtc tgtctcctta tccttgtccc agccgcccat gaactgaggg 3240
ttctgaaaat acagcttgaa cttatcctta gagtagggct tctgggtcac ataattgcgg 3300
atggcatcgt agatgtggtc caccttcagc aggatgtcgt aggccagcac aaaatcgcca 3360
tagaaggact cgtccctgtt tgtctccttg ccctcgccaa agaaggcctt gatgtaattc 3420
tcgaagctct tcacagaatc cagcaggtcc ttcatgatgg ccaccacggc gtcgttcttc 3480
ttcaggctct tctccagcac aaaatcggcg tcgaacagct tctcagagga gccatacacc 3540
ttgtagatct catccacctt ctggatgatg atctccttca gcttctccac cacagacaga 3600
tcggcgtcgg cgtactcctg cagctgctcc agagaaaagg agccgatctt cttgaaggac 3660
tttctccgat cgtcctcgta cttctcggtc accacggcct tcttcttcag gtggatatcg 3720
tcatactcgg cattccactt gtcccggatc acgttccact cgccgaagat atccttggag 3780
attgtgctga tggcggggcc gttcttcaca aagatgccgg cgctagagta ctcgtcaaaa 3840
ttcttgaaca gcttctccag cttcttgatg gagctgaaga tctcgctgtt cttgttcagg 3900
gtgtttctaa acacctccag cacctcctca tcggatgtat agccctcgcc gtagaagctc 3960
agagactccc gatcgctcag cacctgctta tacagtggct taaacttagg cagcttctgc 4020
ttggttttct gattatacag gttgatgtac tcgttcaggc ccttgatctt ctcgccgctc 4080
tcggtcacga agccgccgat gatggcgtta tacacgtcga tgccctcctg tgtcagcaca 4140
aagttaaaga actcgccctc aaagaaatcc tccacatcat agtcgctgtt caggatcttc 4200
tccttgatct cctgcacctc gtgcttatca aagatggcgt ccaccttctc gaagatgtcc 4260
atattagaga tgtagcgggt cagattctcg ttgatacacc tgaaggcgat ggatgtgctc 4320
ttggcctcct cggaaaacat attctctctg ttatcaaaga agccggtgaa ggctgtggta 4380
aagccattga agctgttcac cagggcgatc tcgtccttat cgtccaggaa ctctggcagg 4440
attgtctcga tgatatcctt cttaaacagg gacttgtagc cctcgttgcc cttgaaggcc 4500
ttggcgatct ccttccgcag attgatctcc aggttctcca gctccttatt ctccttctcg 4560
gttctggttt tcttccggaa caggctgatg taattgttca gattcttcag cttgatgctg 4620
tgcagcacgt cgttgataaa agacagatag tagcgatcca gcagcttctt cacgccctta 4680
taatcctcgg ctctcttctc gtcctccacc agcagccgct tattgtcgat gttctcctgg 4740
gtcttgccca cagggatggc cttgaacctc agggtcttag acagggagta gcagtttgta 4800
aacttctcca gcttgctggc tgctgggact ccgtggatac cgaccttccg cttcttcttt 4860
ggggccatct tatcgtcatc gtctttgtaa tcaatatcat gatccttgta gtctccgtcg 4920
tggtccttat agtccatggc tgcagaagta acaccaaaca acagggtgag catcgacaaa 4980
agaaacagta ccaagcaaat aaatagcgta tgaaggcagg gctaaaaaaa tccacatata 5040
gctgctgcat atgccatcat ccaagtatat caagatcaaa ataattataa aacatacttg 5100
tttattataa tagataggta ctcaaggtta gagcatatga atagatgctg catatgccat 5160
catgtatatg catcagtaaa acccacatca acatgtatac ctatcctaga tcgatatttc 5220
catccatctt aaactcgtaa ctatgaagat gtatgacaca cacatacagt tccaaaatta 5280
ataaatacac caggtagttt gaaacagtat tctactccga tctagaacga atgaacgacc 5340
gcccaaccac accacatcat cacaaccaag cgaacaaaaa gcatctctgt atatgcatca 5400
gtaaaacccg catcaacatg tatacctatc ctagatcgat atttccatcc atcatcttca 5460
attcgtaact atgaatatgt atggcacaca catacagatc caaaattaat aaatccacca 5520
ggtagtttga aacagaattc tactccgatc tagaacgacc gcccaaccag accacatcat 5580
cacaaccaag acaaaaaaaa gcatgaaaag atgacccgac aaacaagtgc acggcatata 5640
ttgaaataaa ggaaaagggc aaaccaaacc ctatgcaacg aaacaaaaaa aatcatgaaa 5700
tcgatcccgt ctgcggaacg gctagagcca tcccaggatt ccccaaagag aaacactggc 5760
aagttagcaa tcagaacgtg tctgacgtac aggtcgcatc cgtgtacgaa cgctagcagc 5820
acggatctaa cacaaacacg gatctaacac aaacatgaac agaagtagaa ctaccgggcc 5880
ctaaccatgg accggaacgc cgatctagag aaggtagaga gggggggggg gggaggacga 5940
gcggcgtacc ttgaagcgga ggtgccgacg ggtggatttg ggggagatct ggttgtgtgt 6000
gtgtgcgctc cgaacaacac gaggttgggg aaagagggtg tggagggggt gtctatttat 6060
tacggcgggc gaggaaggga aagcgaagga gcggtgggaa aggaatcccc cgtagctgcc 6120
gtgccgtgag aggaggagga ggccgcctgc cgtgccggct cacgtctgcc gctccgccac 6180
gcaatttctg gatgccgaca gcggagcaag tccaacggtg gagcggaact ctcgagaggg 6240
gtccagaggc agcgacagag atgccgtgcc gtctgcttcg cttggcccga cgcgacgctg 6300
ctggttcgct ggttggtgtc cgttagactc gtcgacggcg tttaacaggc tggcattatc 6360
tactcgaaac aagaaaaatg tttccttagt ttttttaatt tcttaaaggg tatttgttta 6420
atttttagtc actttatttt attctatttt atatctaaat tattaaataa aaaaactaaa 6480
atagagtttt agttttctta atttagaggc taaaatagaa taaaatagat gtactaaaaa 6540
aattagtcta taaaaaccat taaccctaaa ccctaaatgg atgtactaat aaaatggatg 6600
aagtattata taggtgaagc tatttgcaaa aaaaaaggag aacacatgca cactaaaaag 6660
ataaaactgt agagtcctgt tgtcaaaata ctcaattgtc ctttagacca tgtctaactg 6720
ttcatttata tgattctcta aaacactgat attattgtag tactatagat tatattattc 6780
gtagagtaaa gtttaaatat atgtataaag atagataaac tgcacttcaa acaagtgtga 6840
caaaaaaaat atgtggtaat tttttataac ttagacatgc aatgctcatt atctctagag 6900
aggggcacga ccgggtcacg ctgcaaagct tggcactggc cgtcgtttta caacgtcgtg 6960
actgggaaaa ccctggcgtt acccaactta atcgccttgc agcacatccc cctttcgcca 7020
gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc ccaacagttg cgcagcctga 7080
atggcgaatg ctagagcagc ttgagcttgg atcagattgt cgtttcccgc cttcagtttg 7140
tttaaacgta aaacgacggc cagtgaattg gagatcggta cttcgcgaat gcgtcgagat 7200
gacccaatgc tctagaaacc aacatttggg tatggtggtg caatgggagg attgatgggg 7260
atggtagctt cctcatgaac attcaggagc tggcattgat ccgcattgag aacctccctg 7320
tgaaggtgat ggtgttgaac aaccaacacc taggcatggt cgtccagttg gaggataggt 7380
tttacaaggc gaatagggcg catacatact tgggcaaccc ggaatgtgag agcgagatat 7440
atccagattt tgtgactatt gctaaggggt tcaatattcc tgcagtccgt gtaacaaaga 7500
agagtgaagt ccgtgccgcc atcaagaaga tgctcgagac tccagggcca tacttgttgg 7560
acatcatcgt cccgcaccag gagcatgtgc tgcctatgat cccaattggg ggcgcattca 7620
aggacatgat cctggatggt gatggcagga ctgtgtatta atctataatc tgtatgttgg 7680
caaagcacca gcccggccta tgtctgacgt gaatgactca taaagagtgg tatgcctatg 7740
atgtttgtat gtgctctatc aataactaag gtgtcaacta tgaaccatat gctcttctgt 7800
tttacttgtt tgatgtgctt ggcatggtaa tcctaattag cttcctgctg tttgacctga 7860
atgacccata aagagtggta tgcctaacta gtccattggg tcatcggatg ccgggaccga 7920
cgagtgcaga ggcgtgcaag cgagcttggc gtaatcatgg tcatagctgt ttcctggttt 7980
aaacaaacta tcagtgtttg acaggatata ttggcgggta aacctaagag aaaagagcgt 8040
ttattagaat aacggatatt taaaagggcg tgaaaaggtt tatccgttcg tccatttgta 8100
tgtgcatgcc aaccacaggg ttcccctcgg gatcaaagta ctttgatcca acccctccgc 8160
tgctatagtg cagtcggctt ctgacgttca gtgcagccgt cttctgaaaa cgacatgtcg 8220
cacaagtcct aagttacgcg acaggctgcc gccctgccct tttcctggcg ttttcttgtc 8280
gcgtgtttta gtcgcataaa gtagaatact tgcgactaga accggagaca ttacgccatg 8340
aacaagagcg ccgccgctgg cctgctgggc tatgcccgcg tcagcaccga cgaccaggac 8400
ttgaccaacc aacgggccga actgcacgcg gccggctgca ccaagctgtt ttccgagaag 8460
atcaccggca ccaggcgcga ccgcccggag ctggccagga tgcttgacca cctagccctg 8520
gcgacgttgt gacagtgacc aggctagacc gcctggcccg cagcacccgc gacctactgg 8580
acattgccga gcgcatccag gaggccggcg cgggcctgcg tagcctggca gagccgtggg 8640
ccgacaccac cacgccggcc ggccgcatgg tgttgaccgt gttcgccggc attgccgagt 8700
tcgagcgttc cctaatcatc gaccgcaccc ggagcgggcg cgaggccgcc aaggcccgag 8760
gcgtgaagtt tggcccccgc cctaccctca ccccggcaca gatcgcgcac gcccgcgagc 8820
tgatcgacca ggaaggccgc accgtgaaag aggcggctgc actgcttggc gtgcatcgct 8880
cgaccctgta ccgcgcactt gagcgcagcg aggaagtgac gcccaccgag gccaggcggc 8940
gcggtgcctt ccgtgaggac gcattgaccg aggccgacgc cctggcggcc gccgagaatg 9000
aacgccaaga ggaacaagca tgaaaccgca ccaggacggc caggacgaac cgtttttcat 9060
taccgaagag atcgaggcgg agatgatcgc ggccgggtac gtgttcgagc cgcccgcgca 9120
cgtctcaacc gtgcggctgc atgaaatcct ggccggtttg tctgatgcca agctggcggc 9180
ctggccggcc agcttggccg ctgaagaaac cgagcgccgc cgtctaaaaa ggtgatgtgt 9240
atttgagtaa aacagcttgc gtcatgcggt cgctgcgtat atgatgcgat gagtaaataa 9300
acaaatacgc aaggggaacg catgaaggtt atcgctgtac ttaaccagaa aggcgggtca 9360
ggcaagacga ccatcgcaac ccatctagcc cgcgccctgc aactcgccgg ggccgatgtt 9420
ctgttagtcg attccgatcc ccagggcagt gcccgcgatt gggcggccgt gcgggaagat 9480
caaccgctaa ccgttgtcgg catcgaccgc ccgacgattg accgcgacgt gaaggccatc 9540
ggccggcgcg acttcgtagt gatcgacgga gcgccccagg cggcggactt ggctgtgtcc 9600
gcgatcaagg cagccgactt cgtgctgatt ccggtgcagc caagccctta cgacatatgg 9660
gcaaccgccg acctggtgga gctggttaag cagcgcattg aggtcacgga tggaaggcta 9720
caagcggcct ttgtcgtgtc gcgggcgatc aaaggcacgc gcatcggcgg tgaggttgcc 9780
gaggcgctgg ccgggtacga gctgcccatt cttgagtccc gtatcacgca gcgcgtgagc 9840
tacccaggca ctgccgccgc cggcacaacc gttcttgaat cagaacccga gggcgacgct 9900
gcccgcgagg tccaggcgct ggccgctgaa attaaatcaa aactcatttg agttaatgag 9960
gtaaagagaa aatgagcaaa agcacaaaca cgctaagtgc cggccgtccg agcgcacgca 10020
gcagcaaggc tgcaacgttg gccagcctgg cagacacgcc agccatgaag cgggtcaact 10080
ttcagttgcc ggcggaggat cacaccaagc tgaagatgta cgcggtacgc caaggcaaga 10140
ccattaccga gctgctatct gaatacatcg cgcagctacc agagtaaatg agcaaatgaa 10200
taaatgagta gatgaatttt agcggctaaa ggaggcggca tggaaaatca agaacaacca 10260
ggcaccgacg ccgtggaatg ccccatgtgt ggaggaacgg gcggttggcc aggcgtaagc 10320
ggctgggttg tctgccggcc ctgcaatggc actggaaccc ccaagcccga ggaatcggcg 10380
tgacggtcgc aaaccatccg gcccggtaca aatcggcgcg gcgctgggtg atgacctggt 10440
ggagaagttg aaggccgcgc aggccgccca gcggcaacgc atcgaggcag aagcacgccc 10500
cggtgaatcg tggcaagcgg ccgctgatcg aatccgcaaa gaatcccggc aaccgccggc 10560
agccggtgcg ccgtcgatta ggaagccgcc caagggcgac gagcaaccag attttttcgt 10620
tccgatgctc tatgacgtgg gcacccgcga tagtcgcagc atcatggacg tggccgtttt 10680
ccgtctgtcg aagcgtgacc gacgagctgg cgaggtgatc cgctacgagc ttccagacgg 10740
gcacgtagag gtttccgcag ggccggccgg catggccagt gtgtgggatt acgacctggt 10800
actgatggcg gtttcccatc taaccgaatc catgaaccga taccgggaag ggaagggaga 10860
caagcccggc cgcgtgttcc gtccacacgt tgcggacgta ctcaagttct gccggcgagc 10920
cgatggcgga aagcagaaag acgacctggt agaaacctgc attcggttaa acaccacgca 10980
cgttgccatg cagcgtacga agaaggccaa gaacggccgc ctggtgacgg tatccgaggg 11040
tgaagccttg attagccgct acaagatcgt aaagagcgaa accgggcggc cggagtacat 11100
cgagatcgag ctagctgatt ggatgtaccg cgagatcaca gaaggcaaga acccggacgt 11160
gctgacggtt caccccgatt actttttgat cgatcccggc atcggccgtt ttctctaccg 11220
cctggcacgc cgcgccgcag gcaaggcaga agccagatgg ttgttcaaga cgatctacga 11280
acgcagtggc agcgccggag agttcaagaa gttctgtttc accgtgcgca agctgatcgg 11340
gtcaaatgac ctgccggagt acgatttgaa ggaggaggcg gggcaggctg gcccgatcct 11400
agtcatgcgc taccgcaacc tgatcgaggg cgaagcatcc gccggttcct aatgtacgga 11460
gcagatgcta gggcaaattg ccctagcagg ggaaaaaggt cgaaaaggtc tctttcctgt 11520
ggatagcacg tacattggga acccaaagcc gtacattggg aaccggaacc cgtacattgg 11580
gaacccaaag ccgtacattg ggaaccggtc acacatgtaa gtgactgata taaaagagaa 11640
aaaaggcgat ttttccgcct aaaactcttt aaaacttatt aaaactctta aaacccgcct 11700
ggcctgtgca taactgtctg gccagcgcac agccgaagag ctgcaaaaag cgcctaccct 11760
tcggtcgctg cgctccctac gccccgccgc ttcgcgtcgg cctatcgcgg ccgctggccg 11820
ctcaaaaatg gctggcctac ggccaggcaa tctaccaggg cgcggacaag ccgcgccgtc 11880
gccactcgac cgccggcgcc cacatcaagg caccctgcct cgcgcgtttc ggtgatgacg 11940
gtgaaaacct ctgacacatg cagctcccgg agacggtcac agcttgtctg taagcggatg 12000
ccgggagcag acaagcccgt cagggcgcgt cagcgggtgt tggcgggtgt cggggcgcag 12060
ccatgaccca gtcacgtagc gatagcggag tgtatactgg cttaactatg cggcatcaga 12120
gcagattgta ctgagagtgc accatatgcg gtgtgaaata ccgcacagat gcgtaaggag 12180
aaaataccgc atcaggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 12240
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc 12300
aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 12360
aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa 12420
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 12480
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 12540
cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 12600
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 12660
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 12720
gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 12780
agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg 12840
cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 12900
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 12960
aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 13020
ctcacgttaa gggattttgg tcatgcattc taggtactaa aacaattcat ccagtaaaat 13080
ataatatttt attttctccc aatcaggctt gatccccagt aagtcaaaaa atagctcgac 13140
atactgttct tccccgatat cctccctgat cgaccggacg cagaaggcaa tgtcatacca 13200
cttgtccgcc ctgccgcttc tcccaagatc aataaagcca cttactttgc catctttcac 13260
aaagatgttg ctgtctccca ggtcgccgtg ggaaaagaca agttcctctt cgggcttttc 13320
cgtctttaaa aaatcataca gctcgcgcgg atctttaaat ggagtgtctt cttcccagtt 13380
ttcgcaatcc acatcggcca gatcgttatt cagtaagtaa tccaattcgg ctaagcggct 13440
gtctaagcta ttcgtatagg gacaatccga tatgtcgatg gagtgaaaga gcctgatgca 13500
ctccgcatac agctcgataa tcttttcagg gctttgttca tcttcatact cttccgagca 13560
aaggacgcca tcggcctcac tcatgagcag attgctccag ccatcatgcc gttcaaagtg 13620
caggaccttt ggaacaggca gctttccttc cagccatagc atcatgtcct tttcccgttc 13680
aacatcatag gtggtccctt tataccggct gtccgtcatt tttaaatata ggttttcatt 13740
ttctcccacc agcttatata ccttagcagg agacattcct tccgtatctt ttacgcagcg 13800
gtatttttcg atcagttttt tcaattccgg tgatattctc attttagcca tttattattt 13860
ccttcctctt ttctacagta tttaaagata ccccaagaag ctaattataa caagacgaac 13920
tccaattcac tgttccttgc attctaaaac cttaaatacc agaaaacagc tttttcaaag 13980
ttgttttcaa agttggcgta taacatagta tcgacggagc cgattttgaa accgcggtga 14040
tcacaggcag caacgctctg tcatcgttac aatcaacatg ctaccctccg cgagatcatc 14100
cgtgtttcaa acccggcagc ttagttgccg ttcttccgaa tagcatcggt aacatgagca 14160
aagtctgccg ccttacaacg gctctcccgc tgacgccgtc ccggactgat gggctgcctg 14220
tatcgagtgg tgattttgtg ccgagctgcc ggtcggggag ctgttggctg gctggtggca 14280
ggatatattg tggtgtaaac aaattgacgc ttagacaact taataacaca ttgcggacgt 14340
ttttaatgta ctgaattaac gccgaattaa ttcgggggat ctggatttta gtactggatt 14400
ttggttttag gaattagaaa ttttattgat agaagtattt tacaaataca aatacatact 14460
aagggtttct tatatgctca acacatgagc gaaaccctat aggaacccta attcccttat 14520
ctgggaacta ctcacacatt attatggaga aactcgagct tgtcgatcga cagatccggt 14580
cggcatctac tctatttctt tgccctcgga cgagtgctgg ggcgtcggtt tccactatcg 14640
gcgagtactt ctacacagcc atcggtccag acggccgcgc ttctgcgggc gatttgtgta 14700
cgcccgacag tcccggctcc ggatcggacg attgcgtcgc atcgaccctg cgcccaagct 14760
gcatcatcga aattgccgtc aaccaagctc tgatagagtt ggtcaagacc aatgcggagc 14820
atatacgccc ggagtcgtgg cgatcctgca agctccggat gcctccgctc gaagtagcgc 14880
gtctgctgct ccatacaagc caaccacggc ctccagaaga agatgttggc gacctcgtat 14940
tgggaatccc cgaacatcgc ctcgctccag tcaatgaccg ctgttatgcg gccattgtcc 15000
gtcaggacat tgttggagcc gaaatccgcg tgcacgaggt gccggacttc ggggcagtcc 15060
tcggcccaaa gcatcagctc atcgagagcc tgcgcgacgg acgcactgac ggtgtcgtcc 15120
atcacagttt gccagtgata cacatgggga tcagcaatcg cgcatatgaa atcacgccat 15180
gtagtgtatt gaccgattcc ttgcggtccg aatgggccga acccgctcgt ctggctaaga 15240
tcggccgcag cgatcgcatc catagcctcc gcgaccggtt gtagaacagc gggcagttcg 15300
gtttcaggca ggtcttgcaa cgtgacaccc tgtgcacggc gggagatgca ataggtcagg 15360
ctctcgctaa actccccaat gtcaagcact tccggaatcg ggagcgcggc cgatgcaaag 15420
tgccgataaa cataacgatc tttgtagaaa ccatcggcgc agctatttac ccgcaggaca 15480
tatccacgcc ctcctacatc gaagctgaaa gcacgagatt cttcgccctc cgagagctgc 15540
atcaggtcgg agacgctgtc gaacttttcg atcagaaact tctcgacaga cgtcgcggtg 15600
agttcaggct ttttcatatc tcattgcccc ccggatctgc gaaagctcga gagagataga 15660
tttgtagaga gagactggtg atttcagcgt gtcctctcca aatgaaatga acttccttat 15720
atagaggaag gtcttgcgaa ggatagtggg attgtgcgtc atcccttacg tcagtggaga 15780
tatcacatca atccacttgc tttgaagacg tggttggaac gtcttctttt tccacgatgc 15840
tcctcgtggg tgggggtcca tctttgggac cactgtcggc agaggcatct tgaacgatag 15900
cctttccttt atcgcaatga tggcatttgt aggtgccacc ttccttttct actgtccttt 15960
tgatgaagtg acagatagct gggcaatgga atccgaggag gtttcccgat attacccttt 16020
gttgaaaagt ctcaatagcc ctttggtctt ctgagactgt atctttgata ttcttggagt 16080
agacgagagt gtcgtgctcc accatgttat cacatcaatc cacttgcttt gaagacgtgg 16140
ttggaacgtc ttctttttcc acgatgctcc tcgtgggtgg gggtccatct ttgggaccac 16200
tgtcggcaga ggcatcttga acgatagcct ttcctttatc gcaatgatgg catttgtagg 16260
tgccaccttc cttttctact gtccttttga tgaagtgaca gatagctggg caatggaatc 16320
cgaggaggtt tcccgatatt accctttgtt gaaaagtctc aatagccctt tggtcttctg 16380
agactgtatc tttgatattc ttggagtaga cgagagtgtc gtgctccacc atgttggcaa 16440
gctgctctag ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag 16500
ctggcacgac aggtttcccg actggaaagc gggcagtgag cgcaacgcaa ttaatgtgag 16560
ttagctcact cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg 16620
tggaattgtg agcggataac aatttcacac aggaaacagc tatgaccatg attac 16675
<210> 4
<211> 670
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 4
aaccaacatt tgggtatggt ggtgcaatgg gaggattgat ggggatggta gcttcctcat 60
gaacattcag gagctggcat tgatccgcat tgagaacctc cctgtgaagg tgatggtgtt 120
gaacaaccaa cacctaggca tggtcgtcca gttggaggat aggttttaca aggcgaatag 180
ggcgcataca tacttgggca acccggaatg tgagagcgag atatatccag attttgtgac 240
tattgctaag gggttcaata ttcctgcagt ccgtgtaaca aagaagagtg aagtccgtgc 300
cgccatcaag aagatgctcg agactccagg gccatacttg ttggacatca tcgtcccgca 360
ccaggagcat gtgctgccta tgatcccaat tgggggcgca ttcaaggaca tgatcctgga 420
tggtgatggc aggactgtgt attaatctat aatctgtatg ttggcaaagc accagcccgg 480
cctatgtctg acgtgaatga ctcataaaga gtggtatgcc tatgatgttt gtatgtgctc 540
tatcaataac taaggtgtca actatgaacc atatgctctt ctgttttact tgtttgatgt 600
gcttggcatg gtaatcctaa ttagcttcct gctgtttgac ctgaatgacc cataaagagt 660
ggtatgccta 670
<210> 5
<211> 384
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 5
cctaggcatg gtcgtccagt tggaggatag gttttacaag gcgaataggg cgcatacata 60
cttgggcaac ccggaatgtg agagcgagat atatccagat tttgtgacta ttgctaaggg 120
gttcaatatt cctgcagtcc gtgtaacaaa gaagagtgaa gtccgtgccg ccatcaagaa 180
gatgctcgag actccagggc catacttgtt ggacatcatc gtcccgcacc aggagcatgt 240
gctgcctatg atcccaattg ggggcgcatt caaggacatg atcctggatg gtgatggcag 300
gactgtgtat taatctataa tctgtatgtt ggcaaagcac cagcccggcc tatgtctgac 360
gtgaatgact cataaagagt ggta 384
<210> 6
<211> 384
<212> DNA
<213> 人工序列(Artificial Sequence)
<400> 6
tttgggtatg gtggtgcaat gggaggatag gttttacaag gcgaataggg cgcatacata 60
cttgggcaac ccggaatgtg agagcgagat atatccagat tttgtgacta ttgctaaggg 120
gttcaatatt cctgcagtcc gtgtaacaaa gaagagtgaa gtccgtgccg ccatcaagaa 180
gatgctcgag actccagggc catacttgtt ggatatcatc gtcccgcacc aggagcatgt 240
gctgcctatg atcccaagtg ggggcgcatt caaggacatg atcctggatg gtgatggcag 300
gactgtgtat taatctataa tctgtatgtt ggcaaagcac cagcccggcc tatgtttgac 360
ctgaatgacc cataaagagt ggta 384
<210> 7
<211> 1260
<212> PRT
<213> 人工序列(Artificial Sequence)
<400> 7
Met Ala Pro Lys Lys Lys Arg Lys Val Gly Ile His Gly Val Pro Ala
1 5 10 15
Ala Ser Lys Leu Glu Lys Phe Thr Asn Cys Tyr Ser Leu Ser Lys Thr
20 25 30
Leu Arg Phe Lys Ala Ile Pro Val Gly Lys Thr Gln Glu Asn Ile Asp
35 40 45
Asn Lys Arg Leu Leu Val Glu Asp Glu Lys Arg Ala Glu Asp Tyr Lys
50 55 60
Gly Val Lys Lys Leu Leu Asp Arg Tyr Tyr Leu Ser Phe Ile Asn Asp
65 70 75 80
Val Leu His Ser Ile Lys Leu Lys Asn Leu Asn Asn Tyr Ile Ser Leu
85 90 95
Phe Arg Lys Lys Thr Arg Thr Glu Lys Glu Asn Lys Glu Leu Glu Asn
100 105 110
Leu Glu Ile Asn Leu Arg Lys Glu Ile Ala Lys Ala Phe Lys Gly Asn
115 120 125
Glu Gly Tyr Lys Ser Leu Phe Lys Lys Asp Ile Ile Glu Thr Ile Leu
130 135 140
Pro Glu Phe Leu Asp Asp Lys Asp Glu Ile Ala Leu Val Asn Ser Phe
145 150 155 160
Asn Gly Phe Thr Thr Ala Phe Thr Gly Phe Phe Asp Asn Arg Glu Asn
165 170 175
Met Phe Ser Glu Glu Ala Lys Ser Thr Ser Ile Ala Phe Arg Cys Ile
180 185 190
Asn Glu Asn Leu Thr Arg Tyr Ile Ser Asn Met Asp Ile Phe Glu Lys
195 200 205
Val Asp Ala Ile Phe Asp Lys His Glu Val Gln Glu Ile Lys Glu Lys
210 215 220
Ile Leu Asn Ser Asp Tyr Asp Val Glu Asp Phe Phe Glu Gly Glu Phe
225 230 235 240
Phe Asn Phe Val Leu Thr Gln Glu Gly Ile Asp Val Tyr Asn Ala Ile
245 250 255
Ile Gly Gly Phe Val Thr Glu Ser Gly Glu Lys Ile Lys Gly Leu Asn
260 265 270
Glu Tyr Ile Asn Leu Tyr Asn Gln Lys Thr Lys Gln Lys Leu Pro Lys
275 280 285
Phe Lys Pro Leu Tyr Lys Gln Val Leu Ser Asp Arg Glu Ser Leu Ser
290 295 300
Phe Tyr Gly Glu Gly Tyr Thr Ser Asp Glu Glu Val Leu Glu Val Phe
305 310 315 320
Arg Asn Thr Leu Asn Lys Asn Ser Glu Ile Phe Ser Ser Ile Lys Lys
325 330 335
Leu Glu Lys Leu Phe Lys Asn Phe Asp Glu Tyr Ser Ser Ala Gly Ile
340 345 350
Phe Val Lys Asn Gly Pro Ala Ile Ser Thr Ile Ser Lys Asp Ile Phe
355 360 365
Gly Glu Trp Asn Val Ile Arg Asp Lys Trp Asn Ala Glu Tyr Asp Asp
370 375 380
Ile His Leu Lys Lys Lys Ala Val Val Thr Glu Lys Tyr Glu Asp Asp
385 390 395 400
Arg Arg Lys Ser Phe Lys Lys Ile Gly Ser Phe Ser Leu Glu Gln Leu
405 410 415
Gln Glu Tyr Ala Asp Ala Asp Leu Ser Val Val Glu Lys Leu Lys Glu
420 425 430
Ile Ile Ile Gln Lys Val Asp Glu Ile Tyr Lys Val Tyr Gly Ser Ser
435 440 445
Glu Lys Leu Phe Asp Ala Asp Phe Val Leu Glu Lys Ser Leu Lys Lys
450 455 460
Asn Asp Ala Val Val Ala Ile Met Lys Asp Leu Leu Asp Ser Val Lys
465 470 475 480
Ser Phe Glu Asn Tyr Ile Lys Ala Phe Phe Gly Glu Gly Lys Glu Thr
485 490 495
Asn Arg Asp Glu Ser Phe Tyr Gly Asp Phe Val Leu Ala Tyr Asp Ile
500 505 510
Leu Leu Lys Val Asp His Ile Tyr Asp Ala Ile Arg Asn Tyr Val Thr
515 520 525
Gln Lys Pro Tyr Ser Lys Asp Lys Phe Lys Leu Tyr Phe Gln Asn Pro
530 535 540
Gln Phe Met Gly Gly Trp Asp Lys Asp Lys Glu Thr Asp Tyr Arg Ala
545 550 555 560
Thr Ile Leu Arg Tyr Gly Ser Lys Tyr Tyr Leu Ala Ile Met Asp Lys
565 570 575
Lys Tyr Ala Lys Cys Leu Gln Lys Ile Asp Lys Asp Asp Val Asn Gly
580 585 590
Asn Tyr Glu Lys Ile Asn Tyr Lys Leu Leu Pro Gly Pro Asn Lys Met
595 600 605
Leu Pro Lys Val Phe Phe Ser Lys Lys Trp Met Ala Tyr Tyr Asn Pro
610 615 620
Ser Glu Asp Ile Gln Lys Ile Tyr Lys Asn Gly Thr Phe Lys Lys Gly
625 630 635 640
Asp Met Phe Asn Leu Asn Asp Cys His Lys Leu Ile Asp Phe Phe Lys
645 650 655
Asp Ser Ile Ser Arg Tyr Pro Lys Trp Ser Asn Ala Tyr Asp Phe Asn
660 665 670
Phe Ser Glu Thr Glu Lys Tyr Lys Asp Ile Ala Gly Phe Tyr Arg Glu
675 680 685
Val Glu Glu Gln Gly Tyr Lys Val Ser Phe Glu Ser Ala Ser Lys Lys
690 695 700
Glu Val Asp Lys Leu Val Glu Glu Gly Lys Leu Tyr Met Phe Gln Ile
705 710 715 720
Tyr Asn Lys Asp Phe Ser Asp Lys Ser His Gly Thr Pro Asn Leu His
725 730 735
Thr Met Tyr Phe Lys Leu Leu Phe Asp Glu Asn Asn His Gly Gln Ile
740 745 750
Arg Leu Ser Gly Gly Ala Glu Leu Phe Met Arg Arg Ala Ser Leu Lys
755 760 765
Lys Glu Glu Leu Val Val His Pro Ala Asn Ser Pro Ile Ala Asn Lys
770 775 780
Asn Pro Asp Asn Pro Lys Lys Thr Thr Thr Leu Ser Tyr Asp Val Tyr
785 790 795 800
Lys Asp Lys Arg Phe Ser Glu Asp Gln Tyr Glu Leu His Ile Pro Ile
805 810 815
Ala Ile Asn Lys Cys Pro Lys Asn Ile Phe Lys Ile Asn Thr Glu Val
820 825 830
Arg Val Leu Leu Lys His Asp Asp Asn Pro Tyr Val Ile Gly Ile Asp
835 840 845
Arg Gly Glu Arg Asn Leu Leu Tyr Ile Val Val Val Asp Gly Lys Gly
850 855 860
Asn Ile Val Glu Gln Tyr Ser Leu Asn Glu Ile Ile Asn Asn Phe Asn
865 870 875 880
Gly Ile Arg Ile Lys Thr Asp Tyr His Ser Leu Leu Asp Lys Lys Glu
885 890 895
Lys Glu Arg Phe Glu Ala Arg Gln Asn Trp Thr Ser Ile Glu Asn Ile
900 905 910
Lys Glu Leu Lys Ala Gly Tyr Ile Ser Gln Val Val His Lys Ile Cys
915 920 925
Glu Leu Val Glu Lys Tyr Asp Ala Val Ile Ala Leu Glu Asp Leu Asn
930 935 940
Ser Gly Phe Lys Asn Ser Arg Val Lys Val Glu Lys Gln Val Tyr Gln
945 950 955 960
Lys Phe Glu Lys Met Leu Ile Asp Lys Leu Asn Tyr Met Val Asp Lys
965 970 975
Lys Ser Asn Pro Cys Ala Thr Gly Gly Ala Leu Lys Gly Tyr Gln Ile
980 985 990
Thr Asn Lys Phe Glu Ser Phe Lys Ser Met Ser Thr Gln Asn Gly Phe
995 1000 1005
Ile Phe Tyr Ile Pro Ala Trp Leu Thr Ser Lys Ile Asp Pro Ser Thr
1010 1015 1020
Gly Phe Val Asn Leu Leu Lys Thr Lys Tyr Thr Ser Ile Ala Asp Ser
1025 1030 1035 1040
Lys Lys Phe Ile Ser Ser Phe Asp Arg Ile Met Tyr Val Pro Glu Glu
1045 1050 1055
Asp Leu Phe Glu Phe Ala Leu Asp Tyr Lys Asn Phe Ser Arg Thr Asp
1060 1065 1070
Ala Asp Tyr Ile Lys Lys Trp Lys Leu Tyr Ser Tyr Gly Asn Arg Ile
1075 1080 1085
Arg Ile Phe Arg Asn Pro Lys Lys Asn Asn Val Phe Asp Trp Glu Glu
1090 1095 1100
Val Cys Leu Thr Ser Ala Tyr Lys Glu Leu Phe Asn Lys Tyr Gly Ile
1105 1110 1115 1120
Asn Tyr Gln Gln Gly Asp Ile Arg Ala Leu Leu Cys Glu Gln Ser Asp
1125 1130 1135
Lys Ala Phe Tyr Ser Ser Phe Met Ala Leu Met Ser Leu Met Leu Gln
1140 1145 1150
Met Arg Asn Ser Ile Thr Gly Arg Thr Asp Val Asp Phe Leu Ile Ser
1155 1160 1165
Pro Val Lys Asn Ser Asp Gly Ile Phe Tyr Asp Ser Arg Asn Tyr Glu
1170 1175 1180
Ala Gln Glu Asn Ala Ile Leu Pro Lys Asn Ala Asp Ala Asn Gly Ala
1185 1190 1195 1200
Tyr Asn Ile Ala Arg Lys Val Leu Trp Ala Ile Gly Gln Phe Lys Lys
1205 1210 1215
Ala Glu Asp Glu Lys Leu Asp Lys Val Lys Ile Ala Ile Ser Asn Lys
1220 1225 1230
Glu Trp Leu Glu Tyr Ala Gln Thr Ser Val Lys His Lys Arg Pro Ala
1235 1240 1245
Ala Thr Lys Lys Ala Gly Gln Ala Lys Lys Lys Lys
1250 1255 1260
Claims (6)
1.一种用于取代植物基因组中的目标片段的表达盒甲,包括启动子甲和终止子,其特征在于:在启动子甲和终止子之间包括如下三个区段:区段Ⅰ、区段Ⅱ和区段Ⅲ;区段Ⅲ为区段Ⅲ-1或区段Ⅲ-2;
所述启动子甲为OsU3启动子;
区段Ⅰ自5’至3’端依次具有Hammerhead型核酸酶的编码序列、crRNA1的编码序列和丁型肝炎病毒核酸酶的编码序列;
区段Ⅱ自5’至3’端依次具有Hammerhead型核酸酶的编码序列、crRNA2的编码序列和丁型肝炎病毒核酸酶的编码序列;
区段Ⅲ-1中自5’至3’端依次具有Hammerhead型核酸酶的编码序列、上游同源臂、供体片段序列、下游同源臂和丁型肝炎病毒核酸酶的编码序列;
区段Ⅲ-2中自5’至3’端依次具有crRNA1的靶标序列、上游同源臂、供体片段序列、下游同源臂和crRNA2的靶标序列;
所述目标片段的一个末端为区段Ⅰ中crRNA1的靶点序列,另一个末端为区段Ⅱ中crRNA2的靶点序列;
供体片段与目标片段具有如下差异:①预期在目标片段中引入的差异核苷酸;②将crRNA1中的PAM序列TTTN突变为非TTTN;③将crRNA2的PAM序列TTTN突变为非TTTN;将crRNA1和crRNA2中的靶点序列进行同义突变;
所述表达盒甲如序列表的序列1自5’端第13-1686位所示,或,所述表达盒甲如序列表的序列2所示。
2.含有权利要求1所述表达盒甲的重组载体。
3.如权利要求2所述的重组载体,其特征在于:所述重组载体还包括表达盒乙;所述表达盒乙中由启动子乙启动LbCpf1核酸酶的编码基因表达,所述启动子乙为Ubi启动子。
4.如权利要求2或3所述的重组载体,其特征在于:所述重组载体为序列表的序列1所示的环形质粒,或,所述重组载体为采用序列2所示的双链DNA分子替代序列1自5’端第13-1686位得到的环形质粒。
5.权利要求1所述表达盒甲或权利要求2至4任一所述的重组载体在实现植物中以RNA转录本为模板进行靶基因同源重组中的应用;所述植物为水稻。
6.一种植物中以RNA转录本为模板进行靶基因同源重组的方法,包括如下步骤:将权利要求2至4任一所述的重组载体导入出发植物,实现植物中靶基因同源重组;所述植物为水稻。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810385845.5A CN108707621B (zh) | 2018-04-26 | 2018-04-26 | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 |
PCT/GB2019/050140 WO2019207274A1 (en) | 2018-04-26 | 2019-01-18 | Gene replacement in plants |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810385845.5A CN108707621B (zh) | 2018-04-26 | 2018-04-26 | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108707621A CN108707621A (zh) | 2018-10-26 |
CN108707621B true CN108707621B (zh) | 2021-02-12 |
Family
ID=63867413
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810385845.5A Active CN108707621B (zh) | 2018-04-26 | 2018-04-26 | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN108707621B (zh) |
WO (1) | WO2019207274A1 (zh) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013066438A2 (en) | 2011-07-22 | 2013-05-10 | President And Fellows Of Harvard College | Evaluation and improvement of nuclease cleavage specificity |
US20150044192A1 (en) | 2013-08-09 | 2015-02-12 | President And Fellows Of Harvard College | Methods for identifying a target site of a cas9 nuclease |
US9359599B2 (en) | 2013-08-22 | 2016-06-07 | President And Fellows Of Harvard College | Engineered transcription activator-like effector (TALE) domains and uses thereof |
US9228207B2 (en) | 2013-09-06 | 2016-01-05 | President And Fellows Of Harvard College | Switchable gRNAs comprising aptamers |
US9388430B2 (en) | 2013-09-06 | 2016-07-12 | President And Fellows Of Harvard College | Cas9-recombinase fusion proteins and uses thereof |
US9737604B2 (en) | 2013-09-06 | 2017-08-22 | President And Fellows Of Harvard College | Use of cationic lipids to deliver CAS9 |
US11053481B2 (en) | 2013-12-12 | 2021-07-06 | President And Fellows Of Harvard College | Fusions of Cas9 domains and nucleic acid-editing domains |
EP3177718B1 (en) | 2014-07-30 | 2022-03-16 | President and Fellows of Harvard College | Cas9 proteins including ligand-dependent inteins |
EP3365356B1 (en) | 2015-10-23 | 2023-06-28 | President and Fellows of Harvard College | Nucleobase editors and uses thereof |
CN110214183A (zh) | 2016-08-03 | 2019-09-06 | 哈佛大学的校长及成员们 | 腺苷核碱基编辑器及其用途 |
US11661590B2 (en) | 2016-08-09 | 2023-05-30 | President And Fellows Of Harvard College | Programmable CAS9-recombinase fusion proteins and uses thereof |
WO2018039438A1 (en) | 2016-08-24 | 2018-03-01 | President And Fellows Of Harvard College | Incorporation of unnatural amino acids into proteins using base editing |
CA3039928A1 (en) | 2016-10-14 | 2018-04-19 | President And Fellows Of Harvard College | Aav delivery of nucleobase editors |
WO2018119359A1 (en) | 2016-12-23 | 2018-06-28 | President And Fellows Of Harvard College | Editing of ccr5 receptor gene to protect against hiv infection |
US11898179B2 (en) | 2017-03-09 | 2024-02-13 | President And Fellows Of Harvard College | Suppression of pain by gene editing |
CN110914310A (zh) | 2017-03-10 | 2020-03-24 | 哈佛大学的校长及成员们 | 胞嘧啶至鸟嘌呤碱基编辑器 |
IL269458B2 (en) | 2017-03-23 | 2024-02-01 | Harvard College | Nucleic base editors that include nucleic acid programmable DNA binding proteins |
US11560566B2 (en) | 2017-05-12 | 2023-01-24 | President And Fellows Of Harvard College | Aptazyme-embedded guide RNAs for use with CRISPR-Cas9 in genome editing and transcriptional activation |
US11732274B2 (en) | 2017-07-28 | 2023-08-22 | President And Fellows Of Harvard College | Methods and compositions for evolving base editors using phage-assisted continuous evolution (PACE) |
EP3676376A2 (en) | 2017-08-30 | 2020-07-08 | President and Fellows of Harvard College | High efficiency base editors comprising gam |
WO2019079347A1 (en) | 2017-10-16 | 2019-04-25 | The Broad Institute, Inc. | USES OF BASIC EDITORS ADENOSINE |
CN108707621B (zh) * | 2018-04-26 | 2021-02-12 | 中国农业科学院作物科学研究所 | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 |
WO2020191248A1 (en) | 2019-03-19 | 2020-09-24 | The Broad Institute, Inc. | Method and compositions for editing nucleotide sequences |
WO2020259210A1 (zh) * | 2019-06-23 | 2020-12-30 | 苏州克睿基因生物科技有限公司 | 一种检测非洲猪瘟病毒的方法和试剂盒 |
CN111019968B (zh) * | 2019-12-31 | 2023-06-23 | 北京市农林科学院 | NTS/dNTS组合在制备植物突变体中的应用 |
WO2021226558A1 (en) | 2020-05-08 | 2021-11-11 | The Broad Institute, Inc. | Methods and compositions for simultaneous editing of both strands of a target double-stranded nucleotide sequence |
LU102162B1 (en) * | 2020-10-26 | 2022-04-27 | Univ Hamburg | Transcriptional synchronization of two or more functional transcription products |
WO2022090153A1 (en) * | 2020-10-26 | 2022-05-05 | Universität Hamburg | Transcriptional synchronization of two or more functional transcription products |
CN112680474A (zh) * | 2021-01-19 | 2021-04-20 | 中国农业科学院作物科学研究所 | 一种荧光标记CRISPR/SpCas9系统介导的基因替换体系及其在植物中的应用 |
WO2023148291A1 (en) * | 2022-02-02 | 2023-08-10 | Biotalys NV | Methods for genome editing |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105177038A (zh) * | 2015-09-29 | 2015-12-23 | 中国科学院遗传与发育生物学研究所 | 一种高效定点编辑植物基因组的CRISPR/Cas9系统 |
CN107012164A (zh) * | 2017-01-11 | 2017-08-04 | 电子科技大学 | CRISPR/Cpf1植物基因组定向修饰功能单元、包含该功能单元的载体及其应用 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8697359B1 (en) | 2012-12-12 | 2014-04-15 | The Broad Institute, Inc. | CRISPR-Cas systems and methods for altering expression of gene products |
KR101885901B1 (ko) * | 2015-11-13 | 2018-08-07 | 기초과학연구원 | 5' 말단의 인산기가 제거된 rna를 포함하는 리보핵산단백질 전달용 조성물 |
CN106811479B (zh) * | 2015-11-30 | 2019-10-25 | 中国农业科学院作物科学研究所 | 利用CRISPR/Cas9系统定点修饰ALS基因获得抗除草剂水稻的系统及其应用 |
CN108707621B (zh) * | 2018-04-26 | 2021-02-12 | 中国农业科学院作物科学研究所 | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 |
-
2018
- 2018-04-26 CN CN201810385845.5A patent/CN108707621B/zh active Active
-
2019
- 2019-01-18 WO PCT/GB2019/050140 patent/WO2019207274A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105177038A (zh) * | 2015-09-29 | 2015-12-23 | 中国科学院遗传与发育生物学研究所 | 一种高效定点编辑植物基因组的CRISPR/Cas9系统 |
CN107012164A (zh) * | 2017-01-11 | 2017-08-04 | 电子科技大学 | CRISPR/Cpf1植物基因组定向修饰功能单元、包含该功能单元的载体及其应用 |
Non-Patent Citations (2)
Title |
---|
Engineering Herbicide-Resistant Rice Plants through CRISPR/Cas9-Mediated Homologous Recombination of Acetolactate Synthase;Yongwei Sun等;《Molecular Plant》;20160105;第9卷;628-631 * |
新一代基因组编辑系统CRISPR/Cpf1;杨帆等;《生物工程学报》;20170325;第33卷(第3期);361-371 * |
Also Published As
Publication number | Publication date |
---|---|
CN108707621A (zh) | 2018-10-26 |
WO2019207274A1 (en) | 2019-10-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108707621B (zh) | 一种CRISPR/Cpf1系统介导的以RNA转录本为修复模板的同源重组方法 | |
CN108203714B (zh) | 一种棉花基因的编辑方法 | |
CN110551752B (zh) | xCas9n-epBE碱基编辑系统及其在基因组碱基替换中的应用 | |
CN108546712A (zh) | 一种利用CRISPR/LbCpf1系统实现目的基因在植物中同源重组的方法 | |
KR20100098652A (ko) | 바실루스 내에서의 향상된 단백질 생산 | |
US20110321190A1 (en) | Method of positive plant selection using sorbitol dehydrogenase | |
CN110885868B (zh) | 一种利用细胞色素P450酶合成2α-羟基化类固醇化合物的方法 | |
CN107418954B (zh) | 毛白杨基因PtomiR390a及其应用 | |
CN110760538B (zh) | 一种创制枯萎病抗性西瓜种质材料的方法 | |
CN109206496B (zh) | 蛋白质GhFLS1在调控植物耐热性中的应用 | |
CN112778405B (zh) | 一种与植物开花期相关的蛋白及其编码基因与应用 | |
CN113121662B (zh) | 棉花GhBZR3蛋白及其编码基因在调节植物生长发育中的应用 | |
CN109232726B (zh) | 蛋白质OsVPE2在调控植物液泡无机磷输出能力中的应用 | |
CN112662672B (zh) | 一种启动子及其制备方法 | |
CN110408646A (zh) | 一种植物遗传转化筛选载体及其应用 | |
CN110835631B (zh) | 一种改造的sgRNA及其在提高碱基编辑效率中的应用 | |
CN110923263B (zh) | 水稻β-淀粉酶BA1及其编码基因与应用 | |
CN109485707B (zh) | 蛋白质OsVPE1在调控植物液泡无机磷输出能力中的应用 | |
CN111423990B (zh) | 一种乙氧氟草醚敏感型酵母菌及其制备方法 | |
KR100592490B1 (ko) | 선발표지유전자가 제거된 형질전환식물체의 제조를 위한벡터 및 이를 이용한 형질전환체의 제조방법 | |
CN115232757B (zh) | 酿酒酵母菌株、发酵菌株及其构建方法和生物乙醇生产方法 | |
CN106459161A (zh) | 涉及谷氨酸受体多肽编码基因的构建体和方法 | |
CN114591996B (zh) | 一种凝结芽孢杆菌h-1的表达载体及其构建方法与应用 | |
CN110835630A (zh) | 一种高效的sgRNA及其在基因编辑中的应用 | |
CN111269298B (zh) | 蛋白质GhCCoAOMT7在调控植物耐热性中的应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |