WO2025017033A1 - Prime editing of the -115 region in the hbg1 and/or hbg2 promoter for increasing fetal hemoglobin content in a eukaryotic cell - Google Patents
Prime editing of the -115 region in the hbg1 and/or hbg2 promoter for increasing fetal hemoglobin content in a eukaryotic cell Download PDFInfo
- Publication number
- WO2025017033A1 WO2025017033A1 PCT/EP2024/070167 EP2024070167W WO2025017033A1 WO 2025017033 A1 WO2025017033 A1 WO 2025017033A1 EP 2024070167 W EP2024070167 W EP 2024070167W WO 2025017033 A1 WO2025017033 A1 WO 2025017033A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- sequence
- prime editing
- rna
- prime
- Prior art date
Links
- 210000003527 eukaryotic cell Anatomy 0.000 title claims description 33
- 108010044495 Fetal Hemoglobin Proteins 0.000 title claims description 19
- 230000001965 increasing effect Effects 0.000 title claims description 18
- 230000035772 mutation Effects 0.000 claims abstract description 84
- 102100038614 Hemoglobin subunit gamma-1 Human genes 0.000 claims abstract description 61
- 101001031977 Homo sapiens Hemoglobin subunit gamma-1 Proteins 0.000 claims abstract description 61
- 102100022976 B-cell lymphoma/leukemia 11A Human genes 0.000 claims abstract description 59
- 101000903703 Homo sapiens B-cell lymphoma/leukemia 11A Proteins 0.000 claims abstract description 59
- 102100031690 Erythroid transcription factor Human genes 0.000 claims abstract description 46
- 101001066268 Homo sapiens Erythroid transcription factor Proteins 0.000 claims abstract description 45
- 102100022248 Krueppel-like factor 1 Human genes 0.000 claims abstract description 40
- 101001046587 Homo sapiens Krueppel-like factor 1 Proteins 0.000 claims abstract description 37
- 102100038617 Hemoglobin subunit gamma-2 Human genes 0.000 claims abstract description 32
- 230000014509 gene expression Effects 0.000 claims abstract description 30
- 238000011282 treatment Methods 0.000 claims abstract description 24
- 239000012190 activator Substances 0.000 claims abstract description 15
- 108091005886 Hemoglobin subunit gamma Proteins 0.000 claims abstract description 13
- 208000034737 hemoglobinopathy Diseases 0.000 claims abstract description 13
- 210000004027 cell Anatomy 0.000 claims description 92
- 108091033409 CRISPR Proteins 0.000 claims description 82
- 125000003729 nucleotide group Chemical group 0.000 claims description 73
- 108020004414 DNA Proteins 0.000 claims description 70
- 239000002773 nucleotide Substances 0.000 claims description 70
- 101710163270 Nuclease Proteins 0.000 claims description 68
- 108090000623 proteins and genes Proteins 0.000 claims description 64
- 238000000034 method Methods 0.000 claims description 61
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims description 59
- 102000004190 Enzymes Human genes 0.000 claims description 46
- 108090000790 Enzymes Proteins 0.000 claims description 46
- 108020005004 Guide RNA Proteins 0.000 claims description 44
- 230000027455 binding Effects 0.000 claims description 44
- 102000004169 proteins and genes Human genes 0.000 claims description 40
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 39
- 150000007523 nucleic acids Chemical group 0.000 claims description 39
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 33
- 238000012217 deletion Methods 0.000 claims description 33
- 230000037430 deletion Effects 0.000 claims description 33
- 125000006850 spacer group Chemical group 0.000 claims description 31
- 238000010453 CRISPR/Cas method Methods 0.000 claims description 30
- 230000036961 partial effect Effects 0.000 claims description 21
- 101001031961 Homo sapiens Hemoglobin subunit gamma-2 Proteins 0.000 claims description 20
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 20
- 102000053602 DNA Human genes 0.000 claims description 18
- 210000003958 hematopoietic stem cell Anatomy 0.000 claims description 18
- 108010008532 Deoxyribonuclease I Proteins 0.000 claims description 16
- 102000007260 Deoxyribonuclease I Human genes 0.000 claims description 16
- 238000011144 upstream manufacturing Methods 0.000 claims description 16
- 241000193996 Streptococcus pyogenes Species 0.000 claims description 15
- 108020001507 fusion proteins Proteins 0.000 claims description 12
- 102000037865 fusion proteins Human genes 0.000 claims description 12
- 230000008569 process Effects 0.000 claims description 11
- 230000033607 mismatch repair Effects 0.000 claims description 10
- 238000010839 reverse transcription Methods 0.000 claims description 9
- 108091006106 transcriptional activators Proteins 0.000 claims description 8
- 230000033616 DNA repair Effects 0.000 claims description 6
- 102220605874 Cytosolic arginine sensor for mTORC1 subunit 2_D10A_mutation Human genes 0.000 claims description 5
- 108010026664 MutL Protein Homolog 1 Proteins 0.000 claims description 5
- 102000013609 MutL Protein Homolog 1 Human genes 0.000 claims description 5
- 108091023037 Aptamer Proteins 0.000 claims description 4
- 230000008859 change Effects 0.000 claims description 4
- 210000001671 embryonic stem cell Anatomy 0.000 claims description 4
- 230000002708 enhancing effect Effects 0.000 claims description 4
- 210000004263 induced pluripotent stem cell Anatomy 0.000 claims description 4
- 230000010076 replication Effects 0.000 claims description 4
- 108020004422 Riboswitch Proteins 0.000 claims description 3
- 230000002401 inhibitory effect Effects 0.000 claims description 3
- 102100021147 DNA mismatch repair protein Msh6 Human genes 0.000 claims description 2
- 108010010789 G-T mismatch-binding protein Proteins 0.000 claims description 2
- 101000577853 Homo sapiens DNA mismatch repair protein Mlh1 Proteins 0.000 claims description 2
- 101001134036 Homo sapiens DNA mismatch repair protein Msh2 Proteins 0.000 claims description 2
- 101000738911 Homo sapiens Mismatch repair endonuclease PMS2 Proteins 0.000 claims description 2
- 102000008071 Mismatch Repair Endonuclease PMS2 Human genes 0.000 claims description 2
- 102000057079 human MSH2 Human genes 0.000 claims description 2
- 230000001771 impaired effect Effects 0.000 claims description 2
- 102200070544 rs202198133 Human genes 0.000 claims description 2
- 102200111286 rs2234704 Human genes 0.000 claims description 2
- 230000010474 transient expression Effects 0.000 claims description 2
- 102100034343 Integrase Human genes 0.000 claims 2
- 208000007056 sickle cell anemia Diseases 0.000 abstract description 25
- 208000020451 hereditary persistence of fetal hemoglobin Diseases 0.000 abstract description 23
- 102100021519 Hemoglobin subunit beta Human genes 0.000 abstract description 18
- 108091005904 Hemoglobin subunit beta Proteins 0.000 abstract description 17
- 208000005980 beta thalassemia Diseases 0.000 abstract description 11
- 108010054147 Hemoglobins Proteins 0.000 abstract description 9
- 102000001554 Hemoglobins Human genes 0.000 abstract description 9
- 230000007420 reactivation Effects 0.000 abstract description 9
- 208000018337 inherited hemoglobinopathy Diseases 0.000 abstract description 7
- 238000003786 synthesis reaction Methods 0.000 abstract description 7
- 230000015572 biosynthetic process Effects 0.000 abstract description 6
- 230000003389 potentiating effect Effects 0.000 abstract description 3
- 208000024556 Mendelian disease Diseases 0.000 abstract description 2
- 208000018020 Sickle cell-beta-thalassemia disease syndrome Diseases 0.000 abstract description 2
- 206010043391 Thalassaemia beta Diseases 0.000 abstract description 2
- 102100031780 Endonuclease Human genes 0.000 description 58
- 235000018102 proteins Nutrition 0.000 description 37
- 230000000295 complement effect Effects 0.000 description 27
- 125000005647 linker group Chemical group 0.000 description 27
- 102000039446 nucleic acids Human genes 0.000 description 24
- 108020004707 nucleic acids Proteins 0.000 description 24
- 210000000130 stem cell Anatomy 0.000 description 24
- 239000013612 plasmid Substances 0.000 description 23
- VLEIUWBSEKKKFX-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O VLEIUWBSEKKKFX-UHFFFAOYSA-N 0.000 description 22
- 101001105486 Homo sapiens Proteasome subunit alpha type-7 Proteins 0.000 description 18
- 102100021201 Proteasome subunit alpha type-7 Human genes 0.000 description 18
- 108090000765 processed proteins & peptides Proteins 0.000 description 18
- 235000001014 amino acid Nutrition 0.000 description 17
- 238000005516 engineering process Methods 0.000 description 17
- 238000007481 next generation sequencing Methods 0.000 description 17
- 239000000872 buffer Substances 0.000 description 15
- 102000004196 processed proteins & peptides Human genes 0.000 description 14
- 108091079001 CRISPR RNA Proteins 0.000 description 13
- 150000001413 amino acids Chemical class 0.000 description 13
- 108020004999 messenger RNA Proteins 0.000 description 13
- 238000006243 chemical reaction Methods 0.000 description 12
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 12
- 229920001184 polypeptide Polymers 0.000 description 12
- 230000008685 targeting Effects 0.000 description 12
- 238000001890 transfection Methods 0.000 description 12
- 102100027685 Hemoglobin subunit alpha Human genes 0.000 description 11
- 101150063416 add gene Proteins 0.000 description 11
- 230000002950 deficient Effects 0.000 description 11
- 230000005782 double-strand break Effects 0.000 description 11
- 238000010362 genome editing Methods 0.000 description 11
- 239000000203 mixture Substances 0.000 description 11
- 230000004048 modification Effects 0.000 description 11
- 238000012986 modification Methods 0.000 description 11
- 230000008439 repair process Effects 0.000 description 11
- 108010016797 Sickle Hemoglobin Proteins 0.000 description 10
- 102100031573 Hematopoietic progenitor cell antigen CD34 Human genes 0.000 description 9
- 101000777663 Homo sapiens Hematopoietic progenitor cell antigen CD34 Proteins 0.000 description 9
- 241000700605 Viruses Species 0.000 description 9
- 239000003814 drug Substances 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- 239000013598 vector Substances 0.000 description 9
- 102000004389 Ribonucleoproteins Human genes 0.000 description 8
- 108010081734 Ribonucleoproteins Proteins 0.000 description 8
- 239000003795 chemical substances by application Substances 0.000 description 8
- 201000010099 disease Diseases 0.000 description 8
- 229940079593 drug Drugs 0.000 description 8
- 210000003743 erythrocyte Anatomy 0.000 description 8
- 108060003196 globin Proteins 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 108091093088 Amplicon Proteins 0.000 description 7
- 101001009007 Homo sapiens Hemoglobin subunit alpha Proteins 0.000 description 7
- 210000001185 bone marrow Anatomy 0.000 description 7
- 230000000925 erythroid effect Effects 0.000 description 7
- 102000018146 globin Human genes 0.000 description 7
- 239000003112 inhibitor Substances 0.000 description 7
- 238000012423 maintenance Methods 0.000 description 7
- 238000013518 transcription Methods 0.000 description 7
- 238000010356 CRISPR-Cas9 genome editing Methods 0.000 description 6
- 101150086355 HBG gene Proteins 0.000 description 6
- 108091005902 Hemoglobin subunit alpha Proteins 0.000 description 6
- 238000004458 analytical method Methods 0.000 description 6
- 210000004369 blood Anatomy 0.000 description 6
- 239000008280 blood Substances 0.000 description 6
- 210000000601 blood cell Anatomy 0.000 description 6
- 238000013461 design Methods 0.000 description 6
- 230000001605 fetal effect Effects 0.000 description 6
- 239000012634 fragment Substances 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 238000009396 hybridization Methods 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 102000040430 polynucleotide Human genes 0.000 description 6
- 108091033319 polynucleotide Proteins 0.000 description 6
- 239000002157 polynucleotide Substances 0.000 description 6
- 230000001105 regulatory effect Effects 0.000 description 6
- 241000894007 species Species 0.000 description 6
- 238000011285 therapeutic regimen Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- 108091006107 transcriptional repressors Proteins 0.000 description 6
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 5
- 102100025169 Max-binding protein MNT Human genes 0.000 description 5
- 241000713869 Moloney murine leukemia virus Species 0.000 description 5
- 108020004682 Single-Stranded DNA Proteins 0.000 description 5
- 210000002960 bfu-e Anatomy 0.000 description 5
- 230000001413 cellular effect Effects 0.000 description 5
- 239000003623 enhancer Substances 0.000 description 5
- 238000002474 experimental method Methods 0.000 description 5
- 229910052739 hydrogen Inorganic materials 0.000 description 5
- 239000001257 hydrogen Substances 0.000 description 5
- 230000006698 induction Effects 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 239000000543 intermediate Substances 0.000 description 5
- 238000004519 manufacturing process Methods 0.000 description 5
- 230000001404 mediated effect Effects 0.000 description 5
- 230000030648 nucleus localization Effects 0.000 description 5
- 230000037361 pathway Effects 0.000 description 5
- 238000007480 sanger sequencing Methods 0.000 description 5
- 230000004570 RNA-binding Effects 0.000 description 4
- 239000000556 agonist Substances 0.000 description 4
- 238000000137 annealing Methods 0.000 description 4
- 230000015556 catabolic process Effects 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 238000006731 degradation reaction Methods 0.000 description 4
- 208000035475 disorder Diseases 0.000 description 4
- -1 e.g. Proteins 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 108010038853 gamma-Globins Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 238000004128 high performance liquid chromatography Methods 0.000 description 4
- 210000005259 peripheral blood Anatomy 0.000 description 4
- 239000011886 peripheral blood Substances 0.000 description 4
- 238000006116 polymerization reaction Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 230000007115 recruitment Effects 0.000 description 4
- 230000002441 reversible effect Effects 0.000 description 4
- UCSJYZPVAKXKNQ-HZYVHMACSA-N streptomycin Chemical compound CN[C@H]1[C@H](O)[C@@H](O)[C@H](CO)O[C@H]1O[C@@H]1[C@](C=O)(O)[C@H](C)O[C@H]1O[C@@H]1[C@@H](NC(N)=N)[C@H](O)[C@@H](NC(N)=N)[C@H](O)[C@H]1O UCSJYZPVAKXKNQ-HZYVHMACSA-N 0.000 description 4
- 210000001519 tissue Anatomy 0.000 description 4
- PZNPLUBHRSSFHT-RRHRGVEJSA-N 1-hexadecanoyl-2-octadecanoyl-sn-glycero-3-phosphocholine Chemical compound CCCCCCCCCCCCCCCCCC(=O)O[C@@H](COP([O-])(=O)OCC[N+](C)(C)C)COC(=O)CCCCCCCCCCCCCCC PZNPLUBHRSSFHT-RRHRGVEJSA-N 0.000 description 3
- 108091032955 Bacterial small RNA Proteins 0.000 description 3
- 102100039398 C-X-C motif chemokine 2 Human genes 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- 230000007018 DNA scission Effects 0.000 description 3
- 101150013707 HBB gene Proteins 0.000 description 3
- 102100026236 Interleukin-8 Human genes 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 3
- 101000910035 Streptococcus pyogenes serotype M1 CRISPR-associated endonuclease Cas9/Csn1 Proteins 0.000 description 3
- 241000194020 Streptococcus thermophilus Species 0.000 description 3
- 108091023040 Transcription factor Proteins 0.000 description 3
- 102000040945 Transcription factor Human genes 0.000 description 3
- 238000003556 assay Methods 0.000 description 3
- 238000003776 cleavage reaction Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 108010089558 erythroid Kruppel-like factor Proteins 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 230000003394 haemopoietic effect Effects 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 238000001802 infusion Methods 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 238000009126 molecular therapy Methods 0.000 description 3
- 239000013642 negative control Substances 0.000 description 3
- 210000000056 organ Anatomy 0.000 description 3
- 229920000642 polymer Polymers 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 230000007017 scission Effects 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- ZXIATBNUWJBBGT-JXOAFFINSA-N 5-methoxyuridine Chemical group O=C1NC(=O)C(OC)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](CO)O1 ZXIATBNUWJBBGT-JXOAFFINSA-N 0.000 description 2
- 102100031585 ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Human genes 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- 241000713840 Avian erythroblastosis virus Species 0.000 description 2
- 241000713838 Avian myeloblastosis virus Species 0.000 description 2
- 241000714197 Avian myeloblastosis-associated virus Species 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 102100031172 C-C chemokine receptor type 1 Human genes 0.000 description 2
- 102100036166 C-X-C chemokine receptor type 1 Human genes 0.000 description 2
- 102100031650 C-X-C chemokine receptor type 4 Human genes 0.000 description 2
- 102000019063 CCAAT-Binding Factor Human genes 0.000 description 2
- 108010026988 CCAAT-Binding Factor Proteins 0.000 description 2
- 102000000013 Chemokine CCL3 Human genes 0.000 description 2
- 108010008951 Chemokine CXCL12 Proteins 0.000 description 2
- 108010014414 Chemokine CXCL2 Proteins 0.000 description 2
- 108010077544 Chromatin Proteins 0.000 description 2
- 108091026890 Coding region Proteins 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 2
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 2
- 230000004568 DNA-binding Effects 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004150 Flap endonucleases Human genes 0.000 description 2
- 108090000652 Flap endonucleases Proteins 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 2
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 2
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 2
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 2
- 108060003760 HNH nuclease Proteins 0.000 description 2
- 102000029812 HNH nuclease Human genes 0.000 description 2
- 101000777636 Homo sapiens ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Proteins 0.000 description 2
- 101000947174 Homo sapiens C-X-C chemokine receptor type 1 Proteins 0.000 description 2
- 101000922348 Homo sapiens C-X-C chemokine receptor type 4 Proteins 0.000 description 2
- 101000899111 Homo sapiens Hemoglobin subunit beta Proteins 0.000 description 2
- 101000891113 Homo sapiens T-cell acute lymphocytic leukemia protein 1 Proteins 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 102000000646 Interleukin-3 Human genes 0.000 description 2
- 108010002386 Interleukin-3 Proteins 0.000 description 2
- 108090001007 Interleukin-8 Proteins 0.000 description 2
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 2
- 229930182555 Penicillin Natural products 0.000 description 2
- JGSARLDLIJGVTE-MBNYWOFBSA-N Penicillin G Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)CC1=CC=CC=C1 JGSARLDLIJGVTE-MBNYWOFBSA-N 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- 241000714474 Rous sarcoma virus Species 0.000 description 2
- 241000713824 Rous-associated virus Species 0.000 description 2
- 101000702553 Schistosoma mansoni Antigen Sm21.7 Proteins 0.000 description 2
- 101000714192 Schistosoma mansoni Tegument antigen Proteins 0.000 description 2
- 241000187191 Streptomyces viridochromogenes Species 0.000 description 2
- 241000203587 Streptosporangium roseum Species 0.000 description 2
- 102100021669 Stromal cell-derived factor 1 Human genes 0.000 description 2
- 108091027544 Subgenomic mRNA Proteins 0.000 description 2
- 102100040365 T-cell acute lymphocytic leukemia protein 1 Human genes 0.000 description 2
- 208000002903 Thalassemia Diseases 0.000 description 2
- 102000036693 Thrombopoietin Human genes 0.000 description 2
- 108010041111 Thrombopoietin Proteins 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 210000005006 adaptive immune system Anatomy 0.000 description 2
- OIRDTQYFTABQOQ-KQYNXXCUSA-N adenosine Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 2
- 208000007502 anemia Diseases 0.000 description 2
- 239000005557 antagonist Substances 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 239000011324 bead Substances 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- OWMVSZAMULFTJU-UHFFFAOYSA-N bis-tris Chemical compound OCCN(CCO)C(CO)(CO)CO OWMVSZAMULFTJU-UHFFFAOYSA-N 0.000 description 2
- 238000004113 cell culture Methods 0.000 description 2
- 210000003483 chromatin Anatomy 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000002860 competitive effect Effects 0.000 description 2
- 239000000562 conjugate Substances 0.000 description 2
- 230000021615 conjugation Effects 0.000 description 2
- 230000001351 cycling effect Effects 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000004069 differentiation Effects 0.000 description 2
- 230000012361 double-strand break repair Effects 0.000 description 2
- 230000011559 double-strand break repair via nonhomologous end joining Effects 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000001415 gene therapy Methods 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 239000003102 growth factor Substances 0.000 description 2
- 239000001963 growth medium Substances 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 238000009434 installation Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 229940076264 interleukin-3 Drugs 0.000 description 2
- 229940096397 interleukin-8 Drugs 0.000 description 2
- XKTZWUACRZHVAN-VADRZIEHSA-N interleukin-8 Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](NC(C)=O)CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N1[C@H](CCC1)C(=O)N1[C@H](CCC1)C(=O)N[C@@H](C)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CCC(O)=O)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC=1C=CC(O)=CC=1)C(=O)N[C@H](CO)C(=O)N1[C@H](CCC1)C(N)=O)C1=CC=CC=C1 XKTZWUACRZHVAN-VADRZIEHSA-N 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 150000002632 lipids Chemical class 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 230000000869 mutational effect Effects 0.000 description 2
- 239000002105 nanoparticle Substances 0.000 description 2
- 239000002853 nucleic acid probe Substances 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 239000005022 packaging material Substances 0.000 description 2
- 229940049954 penicillin Drugs 0.000 description 2
- 210000004976 peripheral blood cell Anatomy 0.000 description 2
- 239000013600 plasmid vector Substances 0.000 description 2
- YIQPUIGJQJDJOS-UHFFFAOYSA-N plerixafor Chemical compound C=1C=C(CN2CCNCCCNCCNCCC2)C=CC=1CN1CCCNCCNCCCNCC1 YIQPUIGJQJDJOS-UHFFFAOYSA-N 0.000 description 2
- 229960002169 plerixafor Drugs 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 239000002243 precursor Substances 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000012552 review Methods 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- DAEPDZWVDSPTHF-UHFFFAOYSA-M sodium pyruvate Chemical compound [Na+].CC(=O)C([O-])=O DAEPDZWVDSPTHF-UHFFFAOYSA-M 0.000 description 2
- 210000000952 spleen Anatomy 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 229960005322 streptomycin Drugs 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 230000002123 temporal effect Effects 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- RYYWUUFWQRZTIU-UHFFFAOYSA-K thiophosphate Chemical compound [O-]P([O-])([O-])=S RYYWUUFWQRZTIU-UHFFFAOYSA-K 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 239000004474 valine Substances 0.000 description 2
- 125000002987 valine group Chemical class [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 2
- 239000013603 viral vector Substances 0.000 description 2
- GUAHPAJOXVYFON-ZETCQYMHSA-N (8S)-8-amino-7-oxononanoic acid zwitterion Chemical compound C[C@H](N)C(=O)CCCCCC(O)=O GUAHPAJOXVYFON-ZETCQYMHSA-N 0.000 description 1
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 1
- MWBWWFOAEOYUST-UHFFFAOYSA-N 2-aminopurine Chemical compound NC1=NC=C2N=CNC2=N1 MWBWWFOAEOYUST-UHFFFAOYSA-N 0.000 description 1
- BGFHMYJZJZLMHW-UHFFFAOYSA-N 4-[2-[[2-(1-benzothiophen-3-yl)-9-propan-2-ylpurin-6-yl]amino]ethyl]phenol Chemical compound N1=C(C=2C3=CC=CC=C3SC=2)N=C2N(C(C)C)C=NC2=C1NCCC1=CC=C(O)C=C1 BGFHMYJZJZLMHW-UHFFFAOYSA-N 0.000 description 1
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 1
- AGFIRQJZCNVMCW-UAKXSSHOSA-N 5-bromouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(Br)=C1 AGFIRQJZCNVMCW-UAKXSSHOSA-N 0.000 description 1
- XISVSTPEXYIKJL-UHFFFAOYSA-N 7-methyl-2-[(7-methyl-[1,2,4]triazolo[1,5-a]pyridin-6-yl)amino]-9-(oxan-4-yl)purin-8-one Chemical compound CN1C(=O)N(C2CCOCC2)C2=NC(NC3=CN4N=CN=C4C=C3C)=NC=C12 XISVSTPEXYIKJL-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 229940126071 ART558 Drugs 0.000 description 1
- 229940126288 AZD7648 Drugs 0.000 description 1
- 241000007910 Acaryochloris marina Species 0.000 description 1
- 241001135192 Acetohalobium arabaticum Species 0.000 description 1
- 241001464929 Acidithiobacillus caldus Species 0.000 description 1
- 241000605222 Acidithiobacillus ferrooxidans Species 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 241000640374 Alicyclobacillus acidocaldarius Species 0.000 description 1
- 241000190857 Allochromatium vinosum Species 0.000 description 1
- 241000147155 Ammonifex degensii Species 0.000 description 1
- 241000180579 Arca Species 0.000 description 1
- 241000620196 Arthrospira maxima Species 0.000 description 1
- 240000002900 Arthrospira platensis Species 0.000 description 1
- 235000016425 Arthrospira platensis Nutrition 0.000 description 1
- 241001495183 Arthrospira sp. Species 0.000 description 1
- 241000713834 Avian myelocytomatosis virus 29 Species 0.000 description 1
- 241000906059 Bacillus pseudomycoides Species 0.000 description 1
- 241000616876 Belliella baltica Species 0.000 description 1
- 208000019838 Blood disease Diseases 0.000 description 1
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 1
- 241000823281 Burkholderiales bacterium Species 0.000 description 1
- YHMDHAMZFMNMTF-MSOLQXFVSA-N C(#N)C=1C(=NC(=CC=1C(F)(F)F)C)N1[C@@H]([C@@H](CC1)O)C(=O)N(C=1C=C(C=CC=1)C)C Chemical compound C(#N)C=1C(=NC(=CC=1C(F)(F)F)C)N1[C@@H]([C@@H](CC1)O)C(=O)N(C=1C=C(C=CC=1)C)C YHMDHAMZFMNMTF-MSOLQXFVSA-N 0.000 description 1
- 101710149814 C-C chemokine receptor type 1 Proteins 0.000 description 1
- 101710155856 C-C motif chemokine 3 Proteins 0.000 description 1
- 102100028989 C-X-C chemokine receptor type 2 Human genes 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 108700012434 CCL3 Proteins 0.000 description 1
- 101150018129 CSF2 gene Proteins 0.000 description 1
- 101150069031 CSN2 gene Proteins 0.000 description 1
- 241000589875 Campylobacter jejuni Species 0.000 description 1
- 241001496650 Candidatus Desulforudis Species 0.000 description 1
- 102000019034 Chemokines Human genes 0.000 description 1
- 108010012236 Chemokines Proteins 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 241000193163 Clostridioides difficile Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000907165 Coleofasciculus chthonoplastes Species 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000918600 Corynebacterium ulcerans Species 0.000 description 1
- 241000065716 Crocosphaera watsonii Species 0.000 description 1
- 101150074775 Csf1 gene Proteins 0.000 description 1
- 241000159506 Cyanothece Species 0.000 description 1
- CMSMOCZEIVJLDB-UHFFFAOYSA-N Cyclophosphamide Chemical compound ClCCN(CCCl)P1(=O)NCCCO1 CMSMOCZEIVJLDB-UHFFFAOYSA-N 0.000 description 1
- HMFHBZSHGGEWLO-SOOFDHNKSA-N D-ribofuranose Chemical group OC[C@H]1OC(O)[C@H](O)[C@@H]1O HMFHBZSHGGEWLO-SOOFDHNKSA-N 0.000 description 1
- 229940126289 DNA-PK inhibitor Drugs 0.000 description 1
- 208000021321 Dias-Logan syndrome Diseases 0.000 description 1
- 101710191360 Eosinophil cationic protein Proteins 0.000 description 1
- 208000031637 Erythroblastic Acute Leukemia Diseases 0.000 description 1
- 101710100588 Erythroid transcription factor Proteins 0.000 description 1
- 208000036566 Erythroleukaemia Diseases 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- 241000326311 Exiguobacterium sibiricum Species 0.000 description 1
- 241000192016 Finegoldia magna Species 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 101150083167 HBG1 gene Proteins 0.000 description 1
- 101150034267 HBG2 gene Proteins 0.000 description 1
- 208000034502 Haemoglobin C disease Diseases 0.000 description 1
- 206010018910 Haemolysis Diseases 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 229920000209 Hexadimethrine bromide Polymers 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000777564 Homo sapiens C-C chemokine receptor type 1 Proteins 0.000 description 1
- 101000889128 Homo sapiens C-X-C motif chemokine 2 Proteins 0.000 description 1
- 101001055222 Homo sapiens Interleukin-8 Proteins 0.000 description 1
- 101000716729 Homo sapiens Kit ligand Proteins 0.000 description 1
- 101000738771 Homo sapiens Receptor-type tyrosine-protein phosphatase C Proteins 0.000 description 1
- 101000868152 Homo sapiens Son of sevenless homolog 1 Proteins 0.000 description 1
- 108091006905 Human Serum Albumin Proteins 0.000 description 1
- 102000008100 Human Serum Albumin Human genes 0.000 description 1
- 206010021143 Hypoxia Diseases 0.000 description 1
- 208000026350 Inborn Genetic disease Diseases 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 229930010555 Inosine Natural products 0.000 description 1
- UGQMRVRMYYASKQ-KQYNXXCUSA-N Inosine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C2=NC=NC(O)=C2N=C1 UGQMRVRMYYASKQ-KQYNXXCUSA-N 0.000 description 1
- 108010015268 Integration Host Factors Proteins 0.000 description 1
- 108010018951 Interleukin-8B Receptors Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 241001430080 Ktedonobacter racemifer Species 0.000 description 1
- 241000186673 Lactobacillus delbrueckii Species 0.000 description 1
- 241000186869 Lactobacillus salivarius Species 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- 241000186805 Listeria innocua Species 0.000 description 1
- 241001134698 Lyngbya Species 0.000 description 1
- 241000501784 Marinobacter sp. Species 0.000 description 1
- 241000204637 Methanohalobium evestigatum Species 0.000 description 1
- 241000192710 Microcystis aeruginosa Species 0.000 description 1
- 241000190928 Microscilla marina Species 0.000 description 1
- 101100335081 Mus musculus Flt3 gene Proteins 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 108091061960 Naked DNA Proteins 0.000 description 1
- 241000167285 Natranaerobius thermophilus Species 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 229930193140 Neomycin Natural products 0.000 description 1
- 102000003729 Neprilysin Human genes 0.000 description 1
- 108090000028 Neprilysin Proteins 0.000 description 1
- 101100385413 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) csm-3 gene Proteins 0.000 description 1
- 241000919925 Nitrosococcus halophilus Species 0.000 description 1
- 241001515112 Nitrosococcus watsonii Species 0.000 description 1
- 241000203619 Nocardiopsis dassonvillei Species 0.000 description 1
- 241001223105 Nodularia spumigena Species 0.000 description 1
- 241000192673 Nostoc sp. Species 0.000 description 1
- 108010077850 Nuclear Localization Signals Proteins 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 241000192520 Oscillatoria sp. Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241000142651 Pelotomaculum thermopropionicum Species 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 241000983938 Petrotoga mobilis Species 0.000 description 1
- 241001599925 Polaromonas naphthalenivorans Species 0.000 description 1
- 241001472610 Polaromonas sp. Species 0.000 description 1
- 241001135221 Prevotella intermedia Species 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 101710150593 Protein beta Proteins 0.000 description 1
- 241000590028 Pseudoalteromonas haloplanktis Species 0.000 description 1
- 229930185560 Pseudouridine Natural products 0.000 description 1
- PTJWIQPHWPFNBW-UHFFFAOYSA-N Pseudouridine C Natural products OC1C(O)C(CO)OC1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-UHFFFAOYSA-N 0.000 description 1
- 241001647888 Psychroflexus Species 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 238000011529 RT qPCR Methods 0.000 description 1
- 101100047461 Rattus norvegicus Trpm8 gene Proteins 0.000 description 1
- 102100020718 Receptor-type tyrosine-protein kinase FLT3 Human genes 0.000 description 1
- 101710151245 Receptor-type tyrosine-protein kinase FLT3 Proteins 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 241000712909 Reticuloendotheliosis virus Species 0.000 description 1
- 102100036007 Ribonuclease 3 Human genes 0.000 description 1
- 101710192197 Ribonuclease 3 Proteins 0.000 description 1
- 102000003661 Ribonuclease III Human genes 0.000 description 1
- 108010057163 Ribonuclease III Proteins 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 239000008156 Ringer's lactate solution Substances 0.000 description 1
- 241001606419 Spiroplasma syrphidicola Species 0.000 description 1
- 241000203029 Spiroplasma taiwanense Species 0.000 description 1
- 241000191967 Staphylococcus aureus Species 0.000 description 1
- 241000194056 Streptococcus iniae Species 0.000 description 1
- 241000194022 Streptococcus sp. Species 0.000 description 1
- 241001518258 Streptomyces pristinaespiralis Species 0.000 description 1
- 241000192560 Synechococcus sp. Species 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 241000206213 Thermosipho africanus Species 0.000 description 1
- RYYWUUFWQRZTIU-UHFFFAOYSA-N Thiophosphoric acid Chemical group OP(O)(S)=O RYYWUUFWQRZTIU-UHFFFAOYSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108010022394 Threonine synthase Proteins 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 206010060872 Transplant failure Diseases 0.000 description 1
- 241000078013 Trichormus variabilis Species 0.000 description 1
- 239000007984 Tris EDTA buffer Substances 0.000 description 1
- 241001069823 UR2 sarcoma virus Species 0.000 description 1
- 208000027418 Wounds and injury Diseases 0.000 description 1
- 241000714476 Y73 sarcoma virus Species 0.000 description 1
- 101710185494 Zinc finger protein Proteins 0.000 description 1
- 102100023597 Zinc finger protein 816 Human genes 0.000 description 1
- 241001673106 [Bacillus] selenitireducens Species 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 108010023082 activin A Proteins 0.000 description 1
- 208000021841 acute erythroid leukemia Diseases 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000000735 allogeneic effect Effects 0.000 description 1
- 238000011316 allogeneic transplantation Methods 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 125000003368 amide group Chemical group 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 238000000540 analysis of variance Methods 0.000 description 1
- 230000002869 anti-sickling effect Effects 0.000 description 1
- 239000000427 antigen Substances 0.000 description 1
- 108091007433 antigens Proteins 0.000 description 1
- 102000036639 antigens Human genes 0.000 description 1
- 239000002246 antineoplastic agent Substances 0.000 description 1
- 239000007864 aqueous solution Substances 0.000 description 1
- 229940011019 arthrospira platensis Drugs 0.000 description 1
- 208000005266 avian sarcoma Diseases 0.000 description 1
- 230000008970 bacterial immunity Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- WGDUUQDYDIIBKT-UHFFFAOYSA-N beta-Pseudouridine Natural products OC1OC(CN2C=CC(=O)NC2=O)C(O)C1O WGDUUQDYDIIBKT-UHFFFAOYSA-N 0.000 description 1
- 210000001772 blood platelet Anatomy 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 238000006664 bond formation reaction Methods 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- ZEWYCNBZMPELPF-UHFFFAOYSA-J calcium;potassium;sodium;2-hydroxypropanoic acid;sodium;tetrachloride Chemical compound [Na].[Na+].[Cl-].[Cl-].[Cl-].[Cl-].[K+].[Ca+2].CC(O)C(O)=O ZEWYCNBZMPELPF-UHFFFAOYSA-J 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 150000004649 carbonic acid derivatives Chemical class 0.000 description 1
- 238000005341 cation exchange Methods 0.000 description 1
- 239000006143 cell culture medium Substances 0.000 description 1
- 230000022131 cell cycle Effects 0.000 description 1
- 230000030833 cell death Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000003855 cell nucleus Anatomy 0.000 description 1
- 230000007541 cellular toxicity Effects 0.000 description 1
- 230000000739 chaotic effect Effects 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 229940044683 chemotherapy drug Drugs 0.000 description 1
- DQLATGHUWYMOKM-UHFFFAOYSA-L cisplatin Chemical compound N[Pt](N)(Cl)Cl DQLATGHUWYMOKM-UHFFFAOYSA-L 0.000 description 1
- 229960004316 cisplatin Drugs 0.000 description 1
- 238000012761 co-transfection Methods 0.000 description 1
- 239000002299 complementary DNA Substances 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000010924 continuous production Methods 0.000 description 1
- 238000011340 continuous therapy Methods 0.000 description 1
- 101150055601 cops2 gene Proteins 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 229960004397 cyclophosphamide Drugs 0.000 description 1
- 230000007711 cytoplasmic localization Effects 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000006392 deoxygenation reaction Methods 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000037213 diet Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 102000004419 dihydrofolate reductase Human genes 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 238000002224 dissection Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 230000005684 electric field Effects 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 210000000267 erythroid cell Anatomy 0.000 description 1
- 230000010437 erythropoiesis Effects 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- VYSYZMNJHYOXGN-UHFFFAOYSA-N ethyl n-aminocarbamate Chemical compound CCOC(=O)NN VYSYZMNJHYOXGN-UHFFFAOYSA-N 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 239000011888 foil Substances 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 208000016361 genetic disease Diseases 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 239000010931 gold Substances 0.000 description 1
- 229910052737 gold Inorganic materials 0.000 description 1
- 230000002709 granulomonocytic effect Effects 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 208000014951 hematologic disease Diseases 0.000 description 1
- 238000011134 hematopoietic stem cell transplantation Methods 0.000 description 1
- 208000018706 hematopoietic system disease Diseases 0.000 description 1
- 230000008588 hemolysis Effects 0.000 description 1
- 208000007475 hemolytic anemia Diseases 0.000 description 1
- 102000055151 human KITLG Human genes 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 210000003917 human chromosome Anatomy 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 230000001146 hypoxic effect Effects 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000001727 in vivo Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 238000011221 initial treatment Methods 0.000 description 1
- 208000014674 injury Diseases 0.000 description 1
- 150000002485 inorganic esters Chemical class 0.000 description 1
- 229960003786 inosine Drugs 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 239000008176 lyophilized powder Substances 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- GRPSNTXTTSBKGW-BVGHQBMWSA-J magnesium;potassium;sodium;(3r,4s,5s,6r)-6-(hydroxymethyl)oxane-2,3,4,5-tetrol;triacetate;chloride Chemical compound [Na+].[Mg+2].[Cl-].[K+].CC([O-])=O.CC([O-])=O.CC([O-])=O.OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O GRPSNTXTTSBKGW-BVGHQBMWSA-J 0.000 description 1
- FVVLHONNBARESJ-NTOWJWGLSA-H magnesium;potassium;trisodium;(2r,3s,4r,5r)-2,3,4,5,6-pentahydroxyhexanoate;acetate;tetrachloride;nonahydrate Chemical compound O.O.O.O.O.O.O.O.O.[Na+].[Na+].[Na+].[Mg+2].[Cl-].[Cl-].[Cl-].[Cl-].[K+].CC([O-])=O.OC[C@@H](O)[C@@H](O)[C@H](O)[C@@H](O)C([O-])=O FVVLHONNBARESJ-NTOWJWGLSA-H 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 229920000609 methyl cellulose Polymers 0.000 description 1
- 239000001923 methylcellulose Substances 0.000 description 1
- 238000002493 microarray Methods 0.000 description 1
- 239000011325 microbead Substances 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011859 microparticle Substances 0.000 description 1
- 238000002156 mixing Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- CWJJHESJXJQCJA-UHFFFAOYSA-N n-(pyridin-2-ylmethyl)-1-[4-(1,4,8,11-tetrazacyclotetradec-1-ylmethyl)phenyl]methanamine Chemical compound C=1C=C(CN2CCNCCCNCCNCCC2)C=CC=1CNCC1=CC=CC=N1 CWJJHESJXJQCJA-UHFFFAOYSA-N 0.000 description 1
- 229960004927 neomycin Drugs 0.000 description 1
- 230000030147 nuclear export Effects 0.000 description 1
- 230000025308 nuclear transport Effects 0.000 description 1
- 230000037360 nucleotide metabolism Effects 0.000 description 1
- 230000009437 off-target effect Effects 0.000 description 1
- 230000008816 organ damage Effects 0.000 description 1
- 150000002895 organic esters Chemical class 0.000 description 1
- 230000000065 osmolyte Effects 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 239000000123 paper Substances 0.000 description 1
- 230000001575 pathological effect Effects 0.000 description 1
- 239000008188 pellet Substances 0.000 description 1
- 239000000863 peptide conjugate Substances 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
- 238000001050 pharmacotherapy Methods 0.000 description 1
- 239000010452 phosphate Chemical group 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical group [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 229920003023 plastic Polymers 0.000 description 1
- 239000004033 plastic Substances 0.000 description 1
- BASFCYQUMIYNBI-UHFFFAOYSA-N platinum Chemical group [Pt] BASFCYQUMIYNBI-UHFFFAOYSA-N 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000000379 polymerizing effect Effects 0.000 description 1
- 230000003449 preventive effect Effects 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 210000001938 protoplast Anatomy 0.000 description 1
- PTJWIQPHWPFNBW-GBNDHIKLSA-N pseudouridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1C1=CNC(=O)NC1=O PTJWIQPHWPFNBW-GBNDHIKLSA-N 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000003762 quantitative reverse transcription PCR Methods 0.000 description 1
- 239000002096 quantum dot Substances 0.000 description 1
- 238000006722 reduction reaction Methods 0.000 description 1
- 230000008844 regulatory mechanism Effects 0.000 description 1
- 238000007634 remodeling Methods 0.000 description 1
- 230000008263 repair mechanism Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000003938 response to stress Effects 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 230000003007 single stranded DNA break Effects 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229940054269 sodium pyruvate Drugs 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 230000008093 supporting effect Effects 0.000 description 1
- 230000003319 supportive effect Effects 0.000 description 1
- 230000001629 suppression Effects 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 208000024891 symptom Diseases 0.000 description 1
- 238000002636 symptomatic treatment Methods 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 238000007492 two-way ANOVA Methods 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
- 230000003442 weekly effect Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/113—Non-coding nucleic acids modulating the expression of genes, e.g. antisense oligonucleotides; Antisense DNA or RNA; Triplex- forming oligonucleotides; Catalytic nucleic acids, e.g. ribozymes; Nucleic acids used in co-suppression or gene silencing
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P7/00—Drugs for disorders of the blood or the extracellular fluid
- A61P7/06—Antianaemics
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/795—Porphyrin- or corrin-ring-containing peptides
- C07K14/805—Haemoglobins; Myoglobins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/102—Mutagenizing nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/87—Introduction of foreign genetic material using processes not otherwise provided for, e.g. co-transformation
- C12N15/90—Stable introduction of foreign DNA into chromosome
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/10—Transferases (2.)
- C12N9/12—Transferases (2.) transferring phosphorus containing groups, e.g. kinases (2.7)
- C12N9/1241—Nucleotidyltransferases (2.7.7)
- C12N9/1276—RNA-directed DNA polymerase (2.7.7.49), i.e. reverse transcriptase or telomerase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/16—Hydrolases (3) acting on ester bonds (3.1)
- C12N9/22—Ribonucleases RNAses, DNAses
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2310/00—Structure or type of the nucleic acid
- C12N2310/10—Type of nucleic acid
- C12N2310/20—Type of nucleic acid involving clustered regularly interspaced short palindromic repeats [CRISPRs]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/22—Vectors comprising a coding region that has been codon optimised for expression in a respective host
Definitions
- the present invention is in the field of medicine, in particular haematology.
- BACKGROUND OF THE INVENTION ⁇ -hemoglobinopathies, ⁇ -thalassemia and sickle cell disease (SCD), are monogenic diseases caused by mutations in the ⁇ -globin locus, affecting the synthesis or the structure of the adult hemoglobin (Hb).
- ⁇ -thalassemia is caused by mutations in the ⁇ -globin gene (HBB) locus that reduce ( ⁇ +) or abolish ( ⁇ 0) the production of ⁇ -globin chains included in the adult hemoglobin (HbA) tetramer, leading to the precipitation of uncoupled ⁇ -globin chains, erythroid cell death and severe anemia (Taher, Ali T., David J. Weatherall, and Maria Domenica Cappellini. "Thalassaemia.” The Lancet 391.10116 (2016): 155-167).
- HBB ⁇ -globin gene
- HPFH mutations either generate de novo DNA motifs recognized by transcriptional activators (e.g., KLF1) (Wienert, Beeke, et al. "Editing the genome to introduce a beneficial naturally occurring mutation associated with increased fetal globin.” Nature communications 6.1 (2015): 1-8; Wienert, Beeke, et al.
- KLF1 transcriptional activators
- KLF1 drives the expression of fetal hemoglobin in British HPFH. Blood, The Journal of the American Society of Hematology 130.6 (2017): 803- 807; Martyn, Gabriella E., et al. "A natural regulatory mutation in the proximal promoter elevates fetal globin expression by creating a de novo GATA1 site.” Blood, The Journal of the American Society of Hematology 133.8 (2019): 852-856.) or disrupt binding sites (BS) for transcriptional repressors (e.g., LRF and BCL11A) (Martyn, Gabriella E., Kate GR Quinlan, and Merlin Crossley.
- BS disrupt binding sites
- ⁇ -hemoglobinopathy has its general meaning in the art and refers to any defect in the structure or function of any hemoglobin of an individual, and includes defects in the primary, secondary, tertiary or quaternary structure of hemoglobin caused by any mutation, such as deletion mutations or substitution mutations in the coding regions of the HBB gene, or mutations in, or deletions of, the promoters or enhancers of such gene that cause a reduction in the amount of hemoglobin produced as compared to a normal or standard condition.
- the term "sickle cell disease” has its general meaning in the art and refers to a group of autosomal recessive genetic blood disorders, which results from mutations in a globin gene and which is characterized by red blood cells that assume an abnormal, rigid, sickle shape. They are defined by the presence of ⁇ S-globin gene coding for a ⁇ -globin chain variant in which glutamic acid is substituted by valine at amino acid position 6 of the peptide: incorporation of the ⁇ S-globin in the Hb tetramers (HbS, sickle Hb) leads to Hb polymerization and to a clinical phenotype.
- HbSS sickle cell anemia
- HbSC sickle-hemoglobin C disease
- HbS/ ⁇ + sickle ⁇ -plus- thalassaemia
- HbS/ ⁇ 0 sickle ⁇ -zerothalassaemia
- ⁇ -thalassemia refers to a hemoglobinopathy that results from an altered ratio of ⁇ -globin to ⁇ -like globin polypeptide chains resulting in the underproduction of normal hemoglobin tetrameric proteins and the precipitation of free, unpaired ⁇ -globin chains.
- alpha globin or “ ⁇ -globin” has its general meaning in the art and refers to protein that is encoded in human by the HBA1 and HBA2 genes.
- the human alpha globin gene cluster located on chromosome 16 spans about 30 kb and includes seven loci: 5'- zeta - pseudozeta - mu - pseudoalpha-1 - alpha-2 - alpha-1 - theta - 3'.
- the alpha-2 (HBA2) and alpha-1 (HBA1) coding sequences are identical. These genes differ slightly over the 5' untranslated regions and the introns, but they differ significantly over the 3' untranslated regions.
- the ENSEMBL IDs i.e.
- HBA1 and HBA2 are ENSG00000206172 and ENSG00000188536 respectively.
- ⁇ -globin has its general meaning in the art and refers to a globin protein, which along with alpha globin (HBA), makes up the most common form of haemoglobin (Hb) in adult humans.
- HBA alpha globin
- Hb haemoglobin
- Normal adult human Hb is a heterotetramer consisting of two alpha chains and two beta chains.
- the ⁇ -globin is encoded by the HBB gene on human chromosome 11. It is 146 amino acids long and has a molecular weight of 15,867 Da.
- gamma globin or “ ⁇ -globin” has its general meaning in the art and refers to protein that is encoded in human by the HBG1 and HBG2 genes.
- the HBG1 and HBG2 genes are normally expressed in the fetal liver, spleen and bone marrow.
- Two ⁇ -globin chains together with two ⁇ -globin chains constitute fetal hemoglobin (HbF) which is normally replaced by adult hemoglobin (HbA) in the year following birth.
- the ENSEMBL IDs i.e. the gene identifier number from the Ensembl Genome Browser database
- HBG1 and HBG2 are ENSG00000213934 and ENSG00000196565 respectively.
- the term “expression” refers to the process by which a polynucleotide is transcribed from a DNA template (such as into and mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene product”. If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell. Any method known in the art can be used to measure the expression of the gene (e. g.
- the expression "increasing the fetal hemoglobin content” indicates that fetal hemoglobin is at least 5% higher in the eukaryotic cell treated with the prime editing platform, than in a comparable, eukaryotic cell, wherein a prime editing platform targeting an unrelated locus is present or where no prime editing platform is present.
- the percentage of fetal hemoglobin expression in the eukaryotic cell is at least 10% higher, at least 20% higher, at least 30% higher, at least 40% higher, at least 50% higher, at least 60% higher, at least 70% higher, at least 80% higher, at least 90% higher, at least 1-fold higher, at least 2- fold higher, at least 5-fold higher, at least 10-fold higher, at least 100 fold higher, at least 1000- fold higher, or more than an eukaryotic cell.
- promoter has its general meaning in the art and refers to a nucleic acid sequence which is required for expression of a gene operably linked to the promoter sequence.
- HBG1 promoter refers to the promoter of the HBG1 gene.
- HBG2 promoter refers to the promoter of the HBG2 gene.
- HBG1 and HBG2 promoters are identical up to –221 bp and comprise the nucleic acid sequence as set forth in SEQ ID NO:1 and depicted in Figure 1.
- the first nucleotide in SEQ ID NO:1 denotes the nucleotide located at position -147 upstream of the HBG transcription starting site and the last nucleotide in SEQ ID NO:1 denotes the nucleotide located at position -91 upstream of the HBG transcription starting site.
- the “-115 region” in the HBG1 or HBG2 promoter refers to the region which encompasses the nucleotides at position -123, -124 and -113 and thus relates to the region encompassing the region starting from the nucleotide at position 24 to the nucleotide at position 35 in SEQ ID NO:1.
- the term “activator” refers to a transcriptional activator that is a protein (transcription factor) that increases gene transcription of a gene or set of genes.
- activators are DNA-binding proteins that bind to enhancers or promoter-proximal elements.
- the activator is KLF1 or GATA1.
- transcriptional activator binding site refers to a site present on DNA whereby the transcriptional activator according to the present disclosure binds.
- the prime-editing platform of the present invention edits the genome sequence of the eukaryotic cell so that the activator is able to bind to its transcriptional activator binding sites.
- KLF1 has its general meaning in the art and refers to the Kruppel like factor 1 protein. The term is also known as EKLF; EKLF/KLF1.
- KLF1 is a hematopoietic-specific transcription factor that induces high-level expression of adult beta-globin and other erythroid genes.
- the zinc-finger protein binds to a DNA sequence found in the beta globin promoter.
- GATA1 has its general meaning in the art and refers to the erythroid differentiation factor. The term is also known as ERYF1 or GF1.
- repressor refers to a transcriptional repressor that is a protein (transcription factor) that decreases gene transcription of a gene or set of genes. Most repressors are DNA-binding proteins that bind to enhancers or promoter-proximal elements.
- the repressor is BCL11A.
- transcriptional repressor binding site refers to a site present on DNA whereby the transcription repressor binds.
- the prime-editing platform of the present invention edits the genome sequence of the eukaryotic cell so that the transcriptional repressor is not able to bind to its transcriptional repressor binding sites.
- the prime-editing platform of the present invention of the present invention will inhibit the binding of BCL11A to its binding site.
- BCL11A has its general meaning in the art and refers to the gene encoding for BAF chromatin remodeling complex subunit BCL11A (Gene ID: 53335).
- the term is also known as EVI9; CTIP1; DILOS; ZNF856; HBFQTL5; BCL11A-L; BCL11A-S; BCL11a-M; or BCL11A-XL.
- the protein associates with the SWI/SNF complex that regulates gene expression via chromatin remodeling.
- BCL11A is highly expressed in several hematopoietic lineages, and plays a role in the switch from ⁇ - to ⁇ -globin expression during the fetal to adult erythropoiesis transition (Sankaran VJ et al.
- polypeptide “peptide”, and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, phosphorylation, or conjugation with a labeling component.
- Polypeptides when discussed in the context of gene therapy refer to the respective intact polypeptide, or any fragment or genetically engineered derivative thereof, which retains the desired biochemical function of the intact protein.
- polynucleotide refers to a polymeric form of nucleotides of any length, including deoxyribonucleotides or ribonucleotides, or analogs thereof.
- a polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs, and may be interrupted by non-nucleotide components. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer.
- the term polynucleotide, as used herein, refers interchangeably to double- and single-stranded molecules.
- any embodiment of the invention described herein that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form.
- the expression “derived from” refers to a process whereby a first component (e.g., a first polypeptide), or information from that first component, is used to isolate, derive or make a different second component (e.g., a second polypeptide that is different from the first one).
- fusion protein means a protein created by joining two or more polypeptide sequences together.
- the fusion polypeptides encompassed in this invention include translation products of a chimeric gene construct that joins the nucleic acid sequences encoding a first polypeptide, e.g., an RNA-binding domain, with the nucleic acid sequence encoding a second polypeptide, e.g., an effector domain, to form a single open-reading frame.
- a “fusion protein” is a recombinant protein of two or more proteins which are joined by a peptide bond or via several peptides.
- the fusion protein may also comprise a peptide linker between the two domains.
- the comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm, as described below.
- the percent identity between two amino acid sequences can be determined using the Needleman and Wunsch algorithm (Needleman, Saul B. & Wunsch, Christian D. (1970). "A general method applicable to the search for similarities in the amino acid sequence of two proteins". Journal of Molecular Biology.48 (3): 443–53.).
- the percent identity between two nucleotide or amino acid sequences may also be determined using for example algorithms such as EMBOSS Needle (pair wise alignment; available at www.ebi.ac.uk).
- EMBOSS Needle may be used with a BLOSUM62 matrix, a “gap open penalty” of 10, a “gap extend penalty” of 0.5, a false “end gap penalty”, an “end gap open penalty” of 10 and an “end gap extend penalty” of 0.5.
- the “percent identity” is a function of the number of matching positions divided by the number of positions compared and multiplied by 100. For instance, if 6 out of 10 sequence positions are identical between the two compared sequences after alignment, then the identity is 60%.
- % identity is typically determined over the whole length of the query sequence on which the analysis is performed.
- Two molecules having the same primary amino acid sequence or nucleic acid sequence are identical irrespective of any chemical and/or biological modification.
- a first amino acid sequence having at least 90% of identity with a second amino acid sequence means that the first sequence has 90; 91; 92; 93; 94; 95; 96; 97; 98; 99 or 100% of identity with the second amino acid sequence.
- linker refers to any means, entity or moiety used to join two or more entities.
- a linker can be a covalent linker or a non-covalent linker.
- covalent linkers include covalent bonds or a linker moiety covalently attached to one or more of the proteins or domains to be linked.
- the linker can also be a non-covalent bond, e.g., an organometallic bond through a metal center such as platinum atom.
- various functionalities can be used, such as amide groups, including carbonic acid derivatives, ethers, esters, including organic and inorganic esters, amino, urethane, urea and the like.
- the domains can be modified by oxidation, hydroxylation, substitution, reduction etc. to provide a site for coupling. Methods for conjugation are well known by persons skilled in the art and are encompassed for use in the present invention.
- Linker moieties include, but are not limited to, chemical linker moieties, or for example a peptide linker moiety (a linker sequence). It will be appreciated that modification which do not significantly decrease the function of the RNA- binding domain and effector domain are preferred.
- the “linked” as used herein refers to the attachment of two or more entities to form one entity.
- a conjugate encompasses both peptide-small molecule conjugates as well as peptide-protein/peptide conjugates.
- complementarity refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base- pairing or other non-traditional types.
- a percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary).
- Perfectly complementary means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence.
- “Substantially complementary” as used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions.
- stringent conditions for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors.
- hybridization or “hybridizing” refers to a process where completely or partially complementary nucleic acid strands come together under specified hybridization conditions to form a double-stranded structure or region in which the two constituent strands are joined by hydrogen bonds.
- the technology can mediate targeted insertions, deletions, and base-to-base conversions without the need for double strand breaks (DSBs) or donor DNA templates.
- the term “prime editing enzyme” refers to a fusion protein comprising a defective CRISPR/Cas nuclease linked to a reverse transcriptase. The term is also known as “prime editor”.
- the term “nuclease” includes an enzyme that induces a break in a nucleic acid sequence, e.g., a single or a double strand break in a double-stranded DNA sequence.
- CRISPR/Cas nuclease has its general meaning in the art and refers to segments of prokaryotic DNA containing clustered regularly interspaced short palindromic repeats (CRISPR) and associated nucleases encoded by Cas genes.
- CRISPR clustered regularly interspaced short palindromic repeats
- the CRISPR/Cas loci encode RNA-guided adaptive immune systems against mobile genetic elements (viruses, transposable elements and conjugative plasmids).
- CRISPR clusters contain spacers, the sequences complementary to antecedent mobile elements.
- CRISPR clusters are transcribed and processed into mature CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) RNA (crRNA).
- the CRISPR/Cas nucleases Cas9 and Cpf1 belong to the type II and type V CRISPR/Cas system and have strong endonuclease activity to cut target DNA.
- Cas9 is guided by a mature crRNA that contains about 20 nucleotides of unique target sequence (called spacer) and a trans-activating small RNA (tracrRNA) that also serves as a guide for ribonuclease III-aided processing of pre-crRNA.
- spacer a mature crRNA that contains about 20 nucleotides of unique target sequence
- tracrRNA trans-activating small RNA
- the crRNA:tracrRNA duplex directs Cas9 to target DNA via complementary base pairing between the spacer on the crRNA and the complementary sequence (called protospacer) on the target DNA.
- Cas9 recognizes a trinucleotide (NGG for S.
- Cas9 Pyogenes Cas9 protospacer adjacent motif (PAM) to specify the cut site (the 3rd or the 4th nucleotide upstream from PAM).
- Cas9 or “Cas9 nuclease” refers to an RNA-guided nuclease comprising a Cas9 protein, or a fragment thereof (e.g., a protein comprising an active or inactive DNA cleavage domain of Cas9, and/or the gRNA binding domain of Cas9).
- a Cas9 nuclease is also referred to sometimes as a casn1 nuclease or a CRISPR (clustered regularly interspaced short palindromic repeat)-associated nuclease.
- CRISPR is an adaptive immune system that provides protection against mobile genetic elements (viruses, transposable elements and conjugative plasmids).
- CRISPR clusters contain spacers, sequences complementary to antecedent mobile elements, and target invading nucleic acids.
- CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA).
- crRNA CRISPR RNA
- type II CRISPR systems correct processing of pre-crRNA requires a trans-encoded small RNA (tracrRNA), endogenous ribonuclease 3 (rnc) and a Cas9 protein.
- tracrRNA serves as a guide for ribonuclease 3-aided processing of pre- crRNA.
- Cas9/crRNA/tracrRNA endonucleolytically cleaves linear or circular dsDNA target complementary to the spacer.
- the target strand not complementary to crRNA is first cut endonucleolytically, then trimmed 3’-5’ exonucleolytically.
- DNA-binding and cleavage typically requires protein and both RNAs.
- single guide RNAs (“sgRNA”, or simply “gRNA”) can be engineered so as to incorporate aspects of both the crRNA and tracrRNA into a single RNA species. See, e.g., Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J. A., Charpentier E.
- Cas9 recognizes a short motif in the CRISPR repeat sequences (the PAM or protospacer adjacent motif) to help distinguish self versus non-self.
- Cas9 nuclease sequences and structures are well known to those of skill in the art (see, e.g., “Complete genome sequence of an M1 strain of Streptococcus pyogenes.” Ferretti et al., J. J., McShan W. M., Ajdic D. J., Savic D. J., Savic G., Lyon K., Primeaux C., Sezate S., Suvorov A. N., Kenton S., Lai H.
- Cas9 nucleases and sequences include Cas9 sequences from the organisms and loci disclosed in Chylinski, Rhun, and Charpentier, “The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems” (2013) RNA Biology 10:5, 726-737; the entire contents of which are incorporated herein by reference.
- Cas9 refers to Cas9 from: Corynebacterium ulcerans (NCBI Refs: NC_015683.1, NC_017317.1); Corynebacterium diphtheria (NCBI Refs: NC_016782.1, NC_016786.1); Spiroplasma syrphidicola (NCBI Ref: NC_021284.1); Prevotella intermedia (NCBI Ref: NC_017861.1); Spiroplasma taiwanense (NCBI Ref: NC_021846.1); Streptococcus iniae (NCBI Ref: NC_021314.1); Belliella baltica (NCBI Ref: NC_018010.1); Psychroflexus torquisI (NCBI Ref: NC_018721.1); Streptococcus thermophilus (NCBI Ref: YP_820832.1); Listeria innocua (NCBI Ref: NP_472073.1);
- Cas9 nuclease comprises the amino acid sequence as set forth in SEQ ID NO: 2.
- SEQ ID NO:2 Cas9 sequence MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTAR RRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHL RKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVD AKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLD NLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQQ
- nickase has its general meaning in the art and refers to an endonuclease which cleaves only a single strand of a DNA duplex (“nicking”). Accordingly, the term “Cas9 nickase” refers to a nickase derived from a Cas9 protein, typically by inactivating one nuclease domain of Cas9 protein.
- protospacer adjacent motif sequence or “PAM” refers to an approximately 2-6 base pair DNA sequence that is an important targeting component of a Cas9 nuclease. Typically, the PAM sequence is on either strand, and is downstream in the 5’ to 3’ direction of Cas9 cut site.
- the canonical PAM sequence i.e., the PAM sequence that is associated with the Cas9 nuclease of Streptococcus pyogenes is 5’-NGG-3’ wherein “N” is any nucleobase followed by two guanine (“G”) nucleobases.
- Different PAM sequences can be associated with different Cas9 nucleases or equivalent proteins from different organisms.
- any given Cas9 nuclease e.g., SpCas9, may be modified to alter the PAM specificity of the nuclease such that the nuclease recognizes alternative PAM sequence.
- the term “protospacer” refers to the sequence ( ⁇ 20 bp) in DNA adjacent to the PAM (protospacer adjacent motif) sequence.
- the protospacer shares the same sequence as the spacer sequence of the guide RNA.
- the guide RNA anneals to the complement of the protospacer sequence on the target DNA (specifically, one strand thereof, i.e., the “target strand” versus the “non-target strand” of the target sequence).
- PAM protospacer adjacent motif
- guide RNA or “gRNA” has its general meaning in the art and is a particular type of guide nucleic acid which associates with a CRISPR/Cas nuclease (e.g. Cas9), directing the nuclease to a specific sequence in a DNA molecule that includes complementarity to protospacer sequence of the guide RNA.
- CRISPR/Cas nuclease e.g. Cas9
- the terms “prime editing guide RNA” or “pegRNA” refers to a specialized form of a guide RNA that has been modified to include one or more additional sequences for implementing the prime editing as described herein.
- the pegRNAs of the present invention comprise in the 5’ to 3’ direction a spacer, a gRNA core, and an extension arm.
- the pegRNA of the present invention may also further comprise elements, such as, but not limited to aptamers, stem loops, hairpins, toe loops (e.g., a 3’ toeloop), or an RNA-protein recruitment domain (e.g., MS2 hairpin).
- the pegRNA may contain one or more structural elements for minimizing its degradation.
- the pegRNA of the present invention incorporates one or more stable pseudoknots at its 3’ end such as a modified prequeosine1-1 riboswitch aptamer (evopreQ1) or the frameshifting pseudoknot from Moloney murine leukemia virus (MMLV), hereafter referred to as “mpknot” as described in Nelson, James W., et al. "Engineered pegRNAs improve prime editing efficiency.” Nature biotechnology 40.3 (2022): 402-410 for which the teaching is incorporated by reference.
- the pegRNA may comprise a transcriptional termination sequence at the 3’ of the molecule.
- the term “spacer” in connection with a guide RNA or a pegRNA refers to the portion of the guide RNA or pegRNA of about 20 nucleotides which contains a nucleotide sequence that is complementary to the protospacer sequence in the target nucleic sequence.
- the spacer sequence anneals to the protospacer sequence to form a ssRNA/ssDNA hybrid structure at the target site and a corresponding R loop ssDNA structure of the endogenous DNA strand that is complementary to the protospacer sequence.
- the term “gRNA core” or “gRNA scaffold” refers to the sequence within the gRNA that is responsible for Cas9 binding, it does not include the 20 bp spacer/targeting sequence that is used to guide Cas9 to target DNA.
- the term “extension arm” refers to a nucleotide sequence component of a pegRNA which provides several functions. Typically, the extension arm is located at the 3’ end of the guide RNA, and comprises the following components in a 5’ to 3’ direction: a reverse transcriptase (RT) template and the primer binding site.
- RT reverse transcriptase
- the term “reverse transcriptase template” or “RT template” or “RTT” refers to the portion of the extension arm that spans from the 5’ end of the primer binding site (PBS) to 3’ end of the gRNA core that may operate as a template for the synthesis of a single-strand of DNA by the reverse transcriptase of the prime editing enzyme.
- the RT template encodes (by the reverse transcriptase of the prime editing enzyme) a single-stranded DNA which, in turn, has been designed to be (a) homologous with the endogenous target DNA to be edited, and (b) which comprises at least one desired nucleotide change (e.g.
- the term “primer binding site” or “PBS” refers to the nucleotide sequence located on a pegRNA as component of the extension arm (typically at the 3’ end of the extension arm) and serves to bind to the primer sequence that is formed after Cas9 nicking of the target sequence by the prime editing enzyme.
- PBS primer binding site
- the Cas9 nickase component of a prime editing enzyme nicks one strand of the target sequence, a 3’-ended ssDNA flap is formed, which serves a primer sequence that anneals to the primer binding site on the pegRNA to prime reverse transcription.
- RT reverse transcriptase
- target nucleic acid refers to a nucleic acid containing a target nucleic acid sequence.
- a target nucleic acid may be single-stranded or double-stranded, and often is double-stranded DNA.
- a “target nucleic acid sequence,” “target sequence” or “target region”, as used herein, means a specific sequence or the complement thereof that one wishes to bind to using the CRISPR system as disclosed herein.
- the term “mutation” has its general meaning in the art and refers to a substitution, deletion or insertion.
- substitution means that a specific nucleotide at a specific position is removed and another nucleotide is inserted into the same position.
- deletion means that a specific nucleotide is removed.
- insertion means that one or more nucleotides are inserted before or after a specific nucleotide.
- variant refers to a first composition (e.g., a first molecule), that is related to a second composition (e.g., a second molecule, also termed a “parent” molecule).
- the variant molecule can be derived from, isolated from, based on or homologous to the parent molecule.
- a variant molecule can have entire sequence identity with the original parent molecule, or alternatively, can have less than 100% sequence identity with the parent molecule.
- a variant of a sequence can be a second sequence that is at least 50; 51; 52; 53; 54; 55; 56; 57; 58; 59; 60; 61; 62; 63; 64; 65; 66; 67; 68; 69; 70; 71; 72; 73; 74; 75; 76; 77; 78; 79; 80; 81; 82; 83; 84; 85; 86; 87; 88; 89; 90; 91; 92; 93; 94; 95; 96; 97; 98; 99; 100% identical in sequence compare to the original sequence.
- treatment refers to both prophylactic or preventive treatment as well as curative or disease modifying treatment, including treatment of patient at risk of contracting the disease or suspected to have contracted the disease as well as patients who are ill or have been diagnosed as suffering from a disease or medical condition, and includes suppression of clinical relapse.
- the treatment may be administered to a subject having a medical disorder or who ultimately may acquire the disorder, in order to prevent, cure, delay the onset of, reduce the severity of, or ameliorate one or more symptoms of a disorder or recurring disorder, or in order to prolong the survival of a subject beyond that expected in the absence of such treatment.
- therapeutic regimen is meant the pattern of treatment of an illness, e.g., the pattern of dosing used during therapy.
- a therapeutic regimen may include an induction regimen and a maintenance regimen.
- the phrase “induction regimen” or “induction period” refers to a therapeutic regimen (or the portion of a therapeutic regimen) that is used for the initial treatment of a disease.
- the general goal of an induction regimen is to provide a high level of drug to a patient during the initial period of a treatment regimen.
- An induction regimen may employ (in part or in whole) a "loading regimen", which may include administering a greater dose of the drug than a physician would employ during a maintenance regimen, administering a drug more frequently than a physician would administer the drug during a maintenance regimen, or both.
- maintenance regimen refers to a therapeutic regimen (or the portion of a therapeutic regimen) that is used for the maintenance of a patient during treatment of an illness, e.g., to keep the patient in remission for long periods of time (months or years).
- a maintenance regimen may employ continuous therapy (e.g., administering a drug at regular intervals, e.g., weekly, monthly, yearly, etc.) or intermittent therapy (e.g., interrupted treatment, intermittent treatment, treatment at relapse, or treatment upon achievement of a particular predetermined criteria [e.g., pain, disease manifestation, etc.]).
- the term "therapeutically effective amount” is meant a sufficient amount of population of cells to treat the disease at a reasonable benefit/risk ratio applicable to any medical treatment. It will be understood that the total usage compositions of the present invention will be decided by the attending physician within the scope of sound medical judgment.
- the specific therapeutically effective dose level for any particular patient will depend upon a variety of factors including the age, body weight, general health, sex and diet of the patient, the time of administration, route of administration, the duration of the treatment, drugs used in combination or coincidental with the population of cells, and like factors well known in the medical arts.
- the first object of the present invention relates to a method of increasing fetal hemoglobin content in a eukaryotic cell comprising the step of contacting the eukaryotic cell with a prime editing platform that consists of (a) one prime editing enzyme and (b) one prime editing guide RNA (pegRNA) for guiding the prime editing enzyme to one target nucleic acid sequence in the -115 region of the HBG1 and/or HBG2 promoter, thereby prime editing said region and subsequently increasing the expression of gamma-globin in said eukaryotic cell.
- a prime editing platform that consists of (a) one prime editing enzyme and (b) one prime editing guide RNA (pegRNA) for guiding the prime editing enzyme to one target nucleic acid sequence in the -115 region of the HBG1 and/or HBG2 promoter, thereby prime editing said region and subsequently increasing the expression of gamma-globin in said eukaryotic cell.
- pegRNA prime editing guide RNA
- the prime editing platform is suitable for introducing a combination of mutations in the -115 region of HBG1 and/or HBG2 promoter so that i) new transcriptional activator binding sites for KLF1 and GATA1 are introduced in said promoter and ii) the binding site for the BCL11A repressor is disrupted.
- the prime editing platform herein disclosed introduces i) the -123T>C and -124T>C mutations so that the KFL1 activator can bind to the promoter ii) the - 113 A>G mutation so that the GATA1 activator can bind to the promoter and iii) with or without the complete or partial deletion of the BCL11A binding-motif (i.e TGACCA) so that the binding site for the BCL11A repressor is disrupted.
- the BCL11A binding-motif i.e TGACCA
- Hematopoietic stem progenitor cells display a number of phenotypes, such as Lin-CD34+CD38 ⁇ CD90+CD45RA ⁇ , Lin- CD34+CD38 ⁇ CD90 ⁇ CD45RA ⁇ , Lin-CD34+CD38+IL-3aloCD45RA ⁇ , and Lin- CD34+CD38+CD10+(Daley et al., Focus 18:62-67, 1996; Pimentel, E., Ed., Handbook of Growth Factors Vol. III: Hematopoietic Growth Factors and Cytokines, pp. 1-2, CRC Press, Boca Raton, Fla., 1994).
- the stem cells self-renew and maintain continuous production of hematopoietic stem cells that give rise to all mature blood cells throughout life.
- the hematopoietic progenitor cells or hematopoietic stem cells are isolated form peripheral blood cells.
- peripheral blood cells refer to the cellular components of blood, including red blood cells, white blood cells, and platelets, which are found within the circulating pool of blood.
- the eukaryotic cell is a bone marrow derived stem cell.
- bone marrow-derived stem cells refers to stem cells found in the bone marrow.
- Stem cells may reside in the bone marrow, either as an adherent stromal cell type that possess pluripotent capabilities, or as cells that express CD34 or CD45 cell-surface protein, which identifies hematopoietic stem cells able to differentiate into blood cells.
- the eukaryotic cell results from a stem cell mobilization.
- the term “mobilization” or “stem cell mobilization” refers to a process involving the recruitment of stem cells from their tissue or organ of residence to peripheral blood following treatment with a mobilization agent. This process mimics the enhancement of the physiological release of stem cells from tissues or organs in response to stress signals during injury and inflammation. The mechanism of the mobilization process depends on the type of mobilization agent administered.
- mobilization agents act as agonists or antagonists that prevent the attachment of stem cells to cells or tissues of their microenvironment. Other mobilization agents induce the release of proteases that cleave the adhesion molecules or support structures between stem cells and their sites of attachment.
- the term “mobilization agent” refers to a wide range of molecules that act to enhance the mobilization of stem cells from their tissue or organ of residence, e.g., bone marrow (e.g., CD34+ stem cells) and spleen (e.g., Hox11+ stem cells), into peripheral blood.
- Mobilization agents include chemotherapeutic drugs, e.g., cyclophosphamide and cisplatin; cytokines, and chemokines, e.g., granulocyte colony- stimulating factor (G-CSF), granulocyte-macrophage colony-stimulating factor (GM-CSF), stem cell factor (SCF), Fms-related tyrosine kinase 3 (flt-3) ligand, stromal cell-derived factor 1 (SDF-1); agonists of the chemokine (C—C motif) receptor 1 (CCR1), such as chemokine (C—C motif) ligand 3 (CCL3, also known as macrophage inflammatory protein-1 ⁇ (Mip-1 ⁇ )); agonists of the chemokine (C—X—C motif) receptor 1 (CXCR1) and 2 (CXCR2), such as chemokine (C—X—C motif) ligand 2 (CXCL2) (also known as
- the prime editing enzyme of the present invention comprises a defective CRISPR/Cas nuclease.
- the sequence recognition mechanism is the same as for the non- defective CRISPR/Cas nuclease.
- the defective CRISPR/Cas nuclease of the invention comprises at least one RNA binding domain.
- the RNA binding domain interacts with a guide RNA molecule as defined hereinafter.
- the defective CRISPR/Cas nuclease of the invention is a modified version with no nuclease activity.
- the CRISPR/Cas nuclease consists of a mutant CRISPR/Cas nuclease i.e. a protein having one or more point mutations, insertions, deletions, truncations, a fusion protein, or a combination thereof.
- the mutant has the RNA-guided DNA binding activity, but lacks one or both of its nuclease active sites.
- the mutant comprises an amino acid sequence having at least 50% of identity with the wild type amino acid sequence of the CRISPR/Cas nuclease.
- Various CRISPR/Cas nucleases can be used in this invention.
- Non-limiting examples of suitable CRISPR/CRISPR/Cas nucleases include Cas3, Cas4, Cas5, Cas5e (or CasD), Cas6, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas9, Cas10, Cas10d, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Cse2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Cs
- the CRISPR/Cas nuclease is derived from a type II CRISPR-Cas system. In some embodiments, the CRISPR/Cas nuclease is derived from a Cas9 protein.
- the Cas9 protein can be from Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus sp., Nocardiopsis rougevillei, Streptomyces pristinaespiralis, Streptomyces viridochromogenes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, Alicyclobacillus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Microscilla marina, Burkholderiales bacterium, Polaromonas naphthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece sp., Microcystis aeruginosa, Synechococcus s
- the CRISPR/Cas nuclease is a mutant of a wild type CRISPR/Cas nuclease (such as Cas9) or a fragment thereof.
- the CRISPR/Cas nuclease is a mutant Cas9 protein from S. pyogenes.
- Methods for generating a Cas9 protein (or a fragment thereof) having an inactive DNA cleavage domain are known (See, e.g., Jinek et al., Science.337:816-821(2012); Qi et al., “Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression” (2013) Cell.
- the DNA cleavage domain of Cas9 is known to include two subdomains, the HNH nuclease subdomain and the RuvC1 subdomain.
- the HNH subdomain cleaves the strand complementary to the gRNA, whereas the RuvC1 subdomain cleaves the non-complementary strand. Mutations within these subdomains can silence the nuclease activity of Cas9.
- the mutations D10A and H840A completely inactivate the nuclease activity of S.
- the CRISPR/Cas nuclease of the present invention is a nickase and more particularly a Cas9 nickase i.e. the Cas9 from S. pyogenes having one mutation selected from the group consisting of D10A and H840A.
- the nickase of the present invention comprises the amino acid sequence as set forth in SEQ ID NO: 3 or SEQ ID NO:4. SEQ ID NO: 3> S.
- variants of dCas9 are provided which are at least about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% to SEQ ID NO: 3 or 4.
- the nickase of the present invention comprises the amino acid sequence as set forth in SEQ ID NO: 3 or SEQ ID NO:4 and further comprises the R221K and N394K mutations that was previously shown to improve Cas9 nuclease activity (Spencer, Jeffrey M., and Xiaoliu Zhang. "Deep mutational scanning of S. pyogenes Cas9 reveals important functional domains.” Scientific reports 7.1 (2017): 1-14).
- variants of dCas9 are provided having amino acid sequences which are shorter, or longer than SEQ ID NO: 3 or 4, by about 5 amino acids, by about 10 amino acids, by about 15 amino acids, by about 20 amino acids, by about 25 amino acids, by about 30 amino acids, by about 40 amino acids, by about 50 amino acids, by about 75 amino acids, by about 100 amino acids or more.
- Some aspects of the disclosure provide Cas9 proteins that have different PAM specificities. Typically, Cas9 proteins, such as Cas9 from S. pyogenes (spCas9), require a canonical NGG PAM sequence to bind a particular nucleic acid region.
- any of the Cas proteins provided herein may be capable of binding a nucleotide sequence that does not contain a canonical (e.g., NGG) PAM sequence.
- a canonical PAM sequence e.g., NGG
- Cas9 proteins that bind non-canonical PAM sequences have been described in Kleinstiver, B. P., et al., “Engineered CRISPR-Cas9 nucleases with altered PAM specificities” Nature 523, 481- 485 (2015); and Kleinstiver, B.
- the Cas9 protein of the present invention comprises the following mutations D1135L, S1136W, G1218K, E1219Q, R1335Q and T1337R. This variant is capable of targeting an expanded set of NGN PAMs (i.e.
- the Ca9 variant SpG can further be optimized to develop a near-PAMless SpCas9 variant named “SpRY” that furthers includes the 5 additional mutations A61R, L1111R, N1317R, A1322R, and R1333P.
- the second component of the prime editing enzyme herein disclosed comprises a reverse transcriptase.
- the disclosure contemplates any wild type reverse transcriptase obtained from any naturally- occurring organism or virus, or obtained from a commercial or non-commercial source.
- the reverse transcriptases can include any naturally-occurring mutant RT, engineered mutant RT, or other variant RT, including truncated variants that retain function.
- the RTs may also be engineered to contain specific amino acid substitutions, such as those specifically disclosed herein.
- the reverse transcriptase that is usable in the prime editing enzymes of the present invention has the amino acid sequence as set forth in SEQ ID NO:5 but comprises four mutations: D200N, T306K, W313F, and T330P.
- the reverse transcriptase is fused to the N-terminus of the defective CRISPR/Cas nuclease.
- the reverse transcriptase is fused to the C- terminus of the defective CRISPR/Cas nuclease.
- the defective CRISPR/Cas nuclease and the reverse transcriptase are fused via a linker.
- the linker comprises a (GGGGS)n (SEQ ID NO:6), a (G)n, an (EAAAK)n (SEQ ID NO: 7), a (GGS)n, an SGSETPGTSESATPES (SEQ ID NO: 8) motif
- GGGGS GGGGSn
- EAAAK EAAAK
- SEQ ID NO: 7 a (GGS)n
- SGSETPGTSESATPES SEQ ID NO: 8 motif
- suitable linker motifs and configurations include those described in Chen et al., Fusion protein linkers: property, design and functionality. Adv Drug Deliv Rev.2013; 65(10):1357-69, the entire contents of which are incorporated herein by reference).
- the prime editing enzyme may comprise additional features.
- Other exemplary features that may be present are localization sequences, such as nuclear localization sequences (NLS), cytoplasmic localization sequences, export sequences, such as nuclear export sequences, or other localization sequences, as well as sequence tags that are useful for solubilization, purification, or detection of the fusion proteins.
- the prime editing enzyme incorporates one or more nuclear localization sequence.
- nuclear localization sequences such as nuclear localization sequences (NLS)
- cytoplasmic localization sequences such as nuclear export sequences, or other localization sequences
- sequence tags that are useful for solubilization, purification, or detection of the fusion proteins.
- the prime editing enzyme incorporates one or more nuclear localization sequence.
- nuclear localization sequence or “NLS” refers to an amino acid sequence that promotes import of a protein into the cell nucleus, for example, by nuclear transport. Nuclear localization sequences are known in the art and would be apparent to the skilled artisan.
- NLS sequences are described in Plank et al., international PCT application, PCT/EP2000/011690, filed Nov. 23, 2000, published as WO/2001/038547 on May 31, 2001, the contents of which are incorporated herein by reference for its disclosure of exemplary nuclear localization sequences.
- a NLS comprises the amino acid sequence PKKKRKV (SEQ ID NO: 9) or MDSLLMNRRKFLYQFKNVRWAKGRRETYLC (SEQ ID NO: 10).
- Various prime editing enzymes are known in the art and typically include the PE1, PE2, PEmax, PEmax-SpRY prime editors.
- PE1 protein refers to a prime editing enzyme that consists of a fusion protein having the following structure [NLS]-[Cas9(H840A)]-[linker]- [MMLV_RT(wt)], and having the amino acid sequence as set forth in SEQ ID NO:11.
- the pegRNA comprises (a) a spacer sequence that comprises a region of complementarity to a first strand of the double-stranded target nucleic sequence located in the HBG1/2 promoters; (b) an extension arm that comprises a RT template and a primer binding site in a 5’ to 3’ orientation, wherein the primer binding site comprises a region of complementarity to a region upstream of a nick site in the second strand of the double- stranded target sequence, and wherein the RT template encodes the desired nucleotide changes (e.g.
- the pegRNA molecule of the present invention thus comprises a spacer sequence for providing the targeting specificity.
- this spacer sequence can comprise from about 10 nucleotides to more than about 25 nucleotides.
- the region of base pairing between the spacer sequence and the corresponding target site sequence can be about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 25, or more than 25 nucleotides in length.
- the spacer sequence is about 17-20 nucleotides in length, such as 20 nucleotides.
- a software program is used to identify candidate CRISPR target sequences on both strands of the DNA nucleic acid molecule containing the HBG genes based on desired guide sequence length and a CRISPR motif sequence (PAM) for a specified CRISPR enzyme.
- PAM CRISPR motif sequence
- One requirement for selecting a suitable target nucleic acid is that it has a 3′ PAM site/sequence.
- Each target sequence and its corresponding PAM site/sequence are referred herein as a Cas-targeted site.
- Type II CRISPR system one of the most well characterized systems, needs only Cas 9 protein and a guide RNA complementary to a target sequence to affect target cleavage. For example, target sites for Cas9 from S.
- pyogenes with PAM sequences NGG, may be identified by searching for 5′-Nx-NGG- 3′ both on the input sequence and on the reverse-complement of the input. Since multiple occurrences in the genome of the DNA target site may lead to nonspecific genome editing, after identifying all potential sites, the program filters out sequences based on the number of times they appear in the relevant reference genome. For those CRISPR enzymes for which sequence specificity is determined by a “seed” sequence, such as the 11-12 bp 5′ from the PAM sequence, including the PAM sequence itself, the filtering step may be based on the seed sequence. Thus, to avoid editing at additional genomic loci, results are filtered based on the number of occurrences of the seed:PAM sequence in the relevant genome.
- the user may be allowed to choose the length of the seed sequence.
- the user may also be allowed to specify the number of occurrences of the seed:PAM sequence in a genome for purposes of passing the filter.
- the default is to screen for unique sequences. Filtration level is altered by changing both the length of the seed sequence and the number of occurrences of the sequence in the genome.
- the program may in addition or alternatively provide the sequence of a guide sequence complementary to the reported target sequence(s) by providing the reverse complement of the identified target sequence(s). Further details of methods and algorithms to optimize sequence selection can be found in U.S. application Ser. No.61/836,080; incorporated herein by reference.
- the spacer sequence is selected from the group consisting of : pegRNA spacer sequence SEQ ID NO : Spacer_10 GTTTGCCTTGTCAAGGCTAT 15 Spacer_11 TGCCTTGTCAAGGCTATTGG 16 Spacer_12 TTGCCTTGTCAAGGCTATTG 17 Spacer_13 TTTGCCTTGTCAAGGCTATT 18 Spacer_14 AGTTTGCCTTGTCAAGGCTA 19
- the spacer sequence is : Spacer_10 GTTTGCCTTGTCAAGGCTAT SEQ ID NO:15
- the gRNA core sequence of the pegRNA is selected from the group consisting of : Scaffold sequence SE Q ID NO : Scaffold GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTGAAAAAG 20 TGGCACCGAGTCGGTGC scaffold GTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGGCTAGTCCGTTATCAA 21 _opt CTTGAAAAAGTG
- the synthesized single-stranded DNA product of the RT template sequence is homologous to the non-target strand and contains desired nucleotide changes (e.g.
- the single-stranded DNA product of the RT template sequence hybridizes in equilibrium with the complementary target strand sequence, thereby displacing the homologous endogenous target strand sequence.
- the displaced endogenous strand may be referred to in some embodiments as a 5’ endogenous DNA flap species.
- This 5’ endogenous DNA flap species can be removed by a 5’ flap endonuclease (e.g., FEN1) and the single-stranded DNA product, now hybridized to the endogenous target strand, may be ligated, thereby creating a mismatch between the endogenous sequence and the newly synthesized strand.
- the mismatch may be resolved by the cell's innate DNA repair and/or replication processes.
- the cellular repair of the single-strand DNA flap results in installation of the desired nucleotide changes, thereby forming a desired product.
- the nucleotide sequence of the RT template sequence corresponds to the nucleotide sequence of the non-target strand which becomes displaced as the 5’ flap species and which overlaps with the site to be edited.
- the RT template sequence is selected from the group consisting of: RTT sequence SEQ ID NO : RTT_10 GCCccGCCTgATA 22 RTT_10.2 GCCccGCCTTGAgATA 23 RTT_10.3 GCCccGCCTTGACCgATA 24 RTT_11 GCCAGCCTTGCCTG 25 RTT_12 AGCCTTGCCTGA 26 RTT_13 GCCTTGCCTGAT 27 RTT_14 TTGCCTGATAG 28
- the primer binding site sequence is selected from the group consisting of: PBS sequence SEQ ID NO : PBS_10 GCCTTGACAAGGC 29 PBS_11 ATAGCCTTGACAA 30 PBS_12 TAGCCTTGACAAG 31 PBS_13 AGCCTTGACA
- Contacting the eukaryotic cell with the prime editing platform of the present invention thus results in i) nicking the second strand of the double-stranded target sequence to form a free 3’ end at the nick site; ii) annealing the primer binding site with the region of the second strand of the double-stranded target sequence upstream of the nick site; iii) synthesizing a single strand of DNA encoded by the RT template from the free 3’ end of the second strand of the double- stranded target sequence; and iv) replacing the region downstream of the nick site in the second strand of the double-stranded target sequence with the single strand of DNA, thereby modifying the sequence of the double-stranded target sequence.
- the primer binding site of pegRNA binds to the primer sequence that is formed from the endogenous DNA strand of the target sequence when it becomes nicked by the prime editing platform, thereby exposing a 3’ end on the endogenous nicked strand.
- the binding of the primer sequence to the primer binding site on the extension arm of the pegRNA creates a duplex region with an exposed 3’ end (i.e., the 3’ of the primer sequence), which then provides a substrate for the reverse transcriptase to begin polymerizing a single strand of DNA from the exposed 3’ end along the length of the RT template.
- the sequence of the single strand DNA product is the complement of the RT template.
- the RT template represents the portion of the extension arm that is encoded into a single strand DNA product (i.e., the 3’ single strand DNA flap containing the desired genetic edit information) by the reverse transcriptase of the prime editing enzyme complex and which ultimately replaces the corresponding endogenous DNA strand of the target sequence that sits immediate downstream of the PE-induced nick site.
- the pegRNA of the present invention is selected from the group consisting of : Nam full length 5'-3' S e E Q I D N O : peg GTTTGCCTTGTCAAGGCTATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACAAGGC 5 10 peg TGCCTTGTCAAGGCTATTGGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCCAGCCTTGCCTGATAGCCTTGACAA 6 11 peg TTGCCTTGTCAAGGCTATTGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCCAGCCTTGCCTGATAGCCTTGACAA 6 11 peg TTGCCTTGTCAAGGCTATTGGTTTTAGAGAA
- ACAAGGC 3 peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 1 10.
- AGCCTTGACAAGGC 4 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACA 2 A10 AGGCTCCTAATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .1 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG 3 A10 ACAAGGCTCCTTATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .2 epe 4 g RN GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 A10
- the ngRNA is designed for temporal control.
- temporary second-strand nicking refers to a variant of second strand nicking whereby the installation of the second nick in the unedited strand occurs only after the desired edit is installed in the edited strand. This avoids concurrent nicks on both strands that could lead to double-stranded DNA breaks. This is achieved by designing a ngRNA with a spacer sequence that matches only the edited strand, but not the original allele. Using this strategy, mismatches between the protospacer and the unedited allele should disfavor nicking by the ngRNA until after the editing event on the PAM strand takes place.
- the ngRNA is selected from the group consisting of: ngRNA sequence SEQ ID NO : ngRNA10.1 TAGTCTTAGAGTATCCAGTG 46 ngRNA10.2 TAGAGTATCCAGTGAGGCCA 47 ngRNA10.3 AGAGTATCCAGTGAGGCCAG 48 ngRNA10.4 TATCCAGTGAGGCCAGGGGC 49 ngRNA10.5 GGCTAGGGATGAAGAATAAA 50
- the ngRNA is : ngRNA10.5 GGCTAGGGATGAAGAATAAA SEQ ID NO :50
- the guide RNA molecules of the present invention i.e.
- the pegRNA and optionally the ngRNA can be made by various methods known in the art including cell-based expression, in vitro transcription, and chemical synthesis.
- cell-based expression in vitro transcription
- chemical synthesis The ability to chemically synthesize relatively long RNAs (as long as 200 mers or more) using TC-RNA chemistry (see, e.g., U.S. Pat. No. 8,202,983) allows one to produce RNAs with special features that outperform those enabled by the basic four ribonucleotides (A, C, G and U).
- the RNA molecule of the present invention can be made with recombinant technology using a host cell system or an in vitro translation-transcription system known in the art.
- the guide RNA molecules may include one or more modifications. Such modifications may include inclusion of at least one non-naturally occurring nucleotide, or a modified nucleotide, or analogs thereof. Modified nucleotides may be modified at the ribose, phosphate, and/or base moiety.
- Modified nucleotides may include 2’-O-methyl analogs, 2’-deoxy analogs, or 2’-fluoro analogs.
- the nucleic acid backbone may be modified, for example, a phosphorothioate backbone may be used.
- LNA locked nucleic acids
- BNA bridged nucleic acids
- Further examples of modified bases include, but are not limited to, 2-aminopurine, 5-bromo-uridine, pseudouridine, inosine, 7-methylguanosine.
- the prime editing platform further involves the transient expression of an engineered DNA mismatch repair (MMR) inhibiting protein (PE5 system) for enhancing the efficiency of the prime editing.
- MMR engineered DNA mismatch repair
- PE5 system engineered DNA mismatch repair
- the MMR inhibiting protein is selected among catalytically impaired mutants of human MSH2, MSH6, PMS2, and MLH1.
- prime editing platform involves the use of a dominant negative MMR protein (MLH1dn) as described in Chen, Peter J., et al. "Enhanced prime editing systems by manipulating cellular determinants of editing outcomes.” Cell 184.22 (2021): 5635-5652 and having the amino acid sequence as set forth in SEQ ID NO:51.
- the nucleic acids encoding the pegRNA or the prime editing enzyme can be cloned into one or more vectors for introducing them into the eukaryotic cell.
- the vectors are typically prokaryotic vectors, e.g., plasmids, or shuttle vectors, or insect vectors, for storage or manipulation of the nucleic acid encoding the pegRNA or the prime editing enzyme herein disclosed.
- the nucleic acids are isolated and/or purified.
- the present invention provides recombinant constructs or vectors having sequences encoding one or more of the pegRNA or prime editing enzymes described above.
- the vector can be capable of autonomous replication or integration into a host DNA.
- the vector may also include appropriate sequences for amplifying expression.
- the expression vector preferably contains one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell cultures, or such as tetracycline or ampicillin resistance in E. coli.
- any of the procedures known in the art for introducing foreign nucleotide sequences into host cells may be used. Examples include the use of calcium phosphate transfection, polybrene, protoplast fusion, electroporation, nucleofection, liposomes, microinjection, naked DNA, plasmid vectors, viral vectors, both episomal and integrative, and any of the other well-known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell.
- the different components of the prime editing platform of the present invention are provided to the population of cells through the use of an RNA-encoded system.
- the editing system may be provided to the population of cells through the use of a chemically modified mRNA-encoded prime editor together with the pegRNA.
- engineered RNA-encoded prime editing enzymes are prepared by introducing various chemical modifications to both mRNA that encoded the prime editing enzyme and guide RNA.
- said modifications consist in uridine depleted mRNAs modified with 5- methoxyuridine: synonymous codons may be introduced to deplete uridines as much as possible without altering the coding sequence and replaced all the remaining uridines with 5- methoxyuridine.
- Said optimized editing system exhibits higher editing efficiency at some genomic sites compared to DNA-encoded system.
- the different components of the prime editing platform of the present invention are provided to the population of cells through the use of ribonucleoprotein (RNP) complexes.
- RNP ribonucleoprotein
- the prime editing enzyme can be pre-complexed with one or more pegRNAs to form a ribonucleoprotein (RNP) complex.
- RNP ribonucleoprotein
- the RNP complex can thus be introduced into the eukaryotic cell. Introduction of the RNP complex can be timed.
- the cell can be synchronized with other cells at G1, S, and/or M phases of the cell cycle.
- RNP delivery avoids many of the pitfalls associated with mRNA, DNA, or viral delivery.
- the RNP complex is produced simply by mixing the proteins (i.e. the prime editing enzyme) and one or more pegRNAs in an appropriate buffer. This mixture is incubated for 5-10 min at room temperature before electroporation. Electroporation is a delivery technique in which an electrical field is applied to one or more cells in order to increase the permeability of the cell membrane. In some embodiments, genome editing efficiency can be improved by adding a transfection enhancer oligonucleotide.
- a further object of the present invention relates to a method for increasing fetal hemoglobin levels in a subject in need thereof, the method comprising transplanting a therapeutically effective amount of the population of eukaryotic cells obtained by the method as above described.
- the population of cell is autologous to the subject, meaning the population of cells is derived from the same subject.
- the subject has been diagnosed with a hemoglobinopathy.
- the method of the present invention is thus particularly suitable for the treatment of hemoglobinopathies.
- the ⁇ -hemoglobinopathy is a sickle cell disease.
- the hemoglobinopathy is a ⁇ -thalassemia.
- the cells are formulated by first harvesting them from their culture medium, and then washing and concentrating the cells in a medium and container system suitable for administration (a "pharmaceutically acceptable" carrier) in a treatment-effective amount.
- a medium and container system suitable for administration a "pharmaceutically acceptable” carrier
- Suitable infusion medium can be any isotonic medium formulation, typically normal saline, Normosol R (Abbott) or Plasma-Lyte A (Baxter), but also 5% dextrose in water or Ringer's lactate can be utilized.
- the infusion medium can be supplemented with human serum albumin.
- a treatment-effective amount of cells in the composition is dependent on the relative representation of the cells with the desired specificity, on the age and weight of the recipient, and on the severity of the targeted condition.
- the amount of cells can be as low as approximately 103/kg, preferably 5x103/kg; and as high as 107/kg, preferably 108/kg.
- the number of cells will depend upon the ultimate use for which the composition is intended, as will the type of cells included therein. Typically, the minimal dose is 2 million of cells per kg. Usually 2 to 20 million of cells are injected in the subject. The desired purity can be achieved by introducing a sorting step.
- the cells are generally in a volume of a liter or less, can be 500 ml or less, even 250 ml or 100 ml or less.
- the clinically relevant number of cells can be apportioned into multiple infusions that cumulatively equal or exceed the desired total amount of cells.
- Kits This invention further provides kits containing reagents for performing the above-described methods, including all component of the prime editing platform as disclosed herein for performing mutagenesis.
- one or more of the reaction components e.g., guide RNA molecules (i.e. the pegRNA and optionally the ngRNA), and nucleic acid molecules encoding for the prime editing enzymes for the methods disclosed herein can be supplied in the form of a kit for use.
- the kit comprises one or more prime editing enzymes and one or more guide RNA molecules (i.e. the pegRNA and optionally the ngRNA).
- the kit consists of one of the following combinations: Prime editor pegRNA ngRNA MLH1 PEmax (SEQ ID pegRNA10.4 (SEQ ngRNA10.5 ( SEQ ID No NO:13) ID NO:41) NO :50) PEmax (SEQ ID pegRNA10.4 (SEQ ngRNA10.5 ( SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:41) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 (SEQ ID No NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA
- an appropriate amount of one or more reaction components is provided in one or more containers or held on a substrate.
- additional components of the kits include, but are not limited to, one or more host cells, one or more reagents for introducing foreign nucleotide sequences into host cells, one or more reagents (e.g., probes or PCR primers) for detecting expression of the guide RNA or prime editing enzymes or verifying the target nucleic acid's status, and buffers or culture media for the reactions.
- the kit may also include one or more of the following components: supports, terminating, modifying or digestion reagents, osmolytes, and an apparatus for detection.
- the components used can be provided in a variety of forms.
- kits and systems include solid matrices (e.g., glass, plastic, paper, foil, micro-particles and the like) that hold the reaction components or detection probes in any of a variety of configurations (e.g., in a vial, microtiter plate well, microarray, and the like).
- the kits may further include instructions recorded in a tangible form for use of the components.
- FIGURES Figure 1: Screening of pegRNAs and ngRNAs targeting the HBG1/2 promoters in K562 cells. (A) Upper panel.
- pegRNAs contain from 5’ to 3’, the spacer sequence (black/dark grey arrows), the scaffold sequence (grey lines) and the 3’ extension composed of the reverse transcription template (RTT, grey light boxes) and the primer binding site (PBS, white boxes).
- pegRNA10 is compatible with a prime editor harboring the Cas9n that recognizes an NGG PAM (PEmax), while pegRNA11 to pegRNA14 work with a PAM-less (SpRY) prime editor (PEmax-SpRY).
- pegRNA10 to pegRNA14 were designed to delete the BCL11A-binding motif (TGACC from -114 to -118 upstream of the TSS) (grey light box), insert the -113 A>G (GATA1 BS) and the -123/-124 T>C (KLF1 BS) mutations.
- C Frequency (%) of InDels generated by the pegRNAs, described in panel B, using a Cas9 (pegRNA10) or a Cas9-SpRY (pegRNA11 to pegRNA14) nuclease in K562 cells. Bars represent the mean ⁇ SEM of 3 biologically independent replicates.
- C D
- MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls.
- PCR products were subjected to Sanger sequencing and InDels were measured using the TIDE software.
- Figure 2 Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.1 in K562 cells.
- the epegRNAs contain the tevopreQ1 sequence fused to the 3’ end of the pegRNA via a linker.
- the epegRNA10.1 deletes the entire BCL11A BS ( ⁇ 5_BCL11A) and inserts the GATA1 (-113 A>G) and the KLF1 (-123/-124 T>C) BSs.
- B Left panel. Percentage of NGS reads with: (i) desired edits alone, (ii) desired edits with alternative repair events with or without additional mutations, and (iii) Indels in GFPhigh K562 cells.
- the reads with InDels include InDels located at the nicking site of the epegRNA and of the ngRNA10.3 or ngRNA10.5.
- Figure 3 Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.2 in K562 cells.
- A Schematic representation of epegRNA10.2 and ngRNAs (ngRNA10.3 and ngRNA10.5).
- the epegRNAs contain the tevopreQ1 sequence fused to the 3’ end of the pegRNA via a linker.
- the epegRNA10.2 deletes two cytosines of the BCL11A BS ( ⁇ 2_BCL11A),and insert the GATA1 and the KLF1 BSs.
- B Left panel.
- the reads with InDels include InDels located at the nicking site of the epegRNA and of the ngRNA10.3 or ngRNA10.5.
- MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ⁇ SEM of 3 biologically independent replicates.
- FIG. 1 Schematic representation of the resolution of the 3’-5’ flap intermediates using epegRNA10.2.
- Figure 4 Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.3 in K562 cells.
- A Schematic representation of epegRNA10.3 and ngRNAs (ngRNA10.3 and ngRNA10.5). The epegRNAs contain the tevopreQ1 sequence fused to the 3’-end of the pegRNA via a linker.
- the epegRNA10.3 inserts the GATA1 and the KLF1 BSs.
- B Left panel. Percentage of NGS reads with: (i) desired edits alone, (ii) desired edits with alternative mutations (alt. mut.), (iii) partial edits and (iv) Indels in GFPhigh K562 cells.
- the reads with InDels include InDels located at the nicking site of the epegRNA and of the ngRNA10.3 or ngRNA10.5.
- MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ⁇ SEM of 3 biologically independent replicates.
- Right panel Example of NGS reads.
- the top line corresponds to the wild-type (WT) sequence of the HBG1/2 promoters aligned to the PBS/RTT of the epegRNA.
- the different editing profiles are displayed below the WT sequence.
- the KLF1 BS and the GATA1 BS are respectively highlighted in grey and bold.
- the desired base conversions are indicated in lower case.
- Statistical significance was assessed using one-way test ANOVA with multiple comparisons.* p ⁇ 0.05** p ⁇ 0,005, ***p ⁇ 0,0005, **** p ⁇ 0,0001
- Figure 5 Optimization of the prime editing efficiency in K562 cells.
- NGS reads containing the desired edits (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (Partial edit 1: 1 mutation generating the GATA1 BS and 2-bp deletion in the BCL11A BS; partial edit 2: only 1 mutation generating the GATA1 BS) or InDels (located at the nicking induced by the epegRNA or the ngRNA) generated using the epegRNA10.2 or the epegRNA10.4 with the ngRNA10.5 in the -115 region of the HBG1/2 promoters.
- MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls.
- Prime editing efficiency in HSPCs percentage of NGS reads containing the desired edit (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (1 mutation generating the GATA1 BS and 2-bp deletion in the BCL11A BS) or InDels (located at the nicking induced by the epegRNA and/or the ngRNA) generated using the epegRNA10.4 with or without the ngRNA10.5 (for PE and PEn strategies, respectively) in the -115 region of the HBG1/2 promoters.
- MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls.
- HBB ⁇ - globin
- HbS hemoglobin S
- Naturally occurring HPFH mutations identified in the promoters of the two ⁇ -globin genes are known to either generate de novo DNA motifs recognized by transcriptional activators (e.g., KLF1, TAL1 and GATA1) or disrupt transcriptional repressor (e.g., LRF and BCL11A) binding sites (BSs).
- transcriptional activators e.g., KLF1, TAL1 and GATA1
- disrupt transcriptional repressor e.g., LRF and BCL11A binding sites
- the disruption of the LRF or BCL11A BS with CRISPR/Cas9- mediated strategy leads to HbF reactivation and correction of the SCD phenotype3.
- the use of a nuclease can cause cell toxicity and generate several double strand break (DSB) associated with large deletions and high risk of genomic rearrangements4–6.
- DSB double strand break
- New DSB-free genome editing tools allowed the development of novel, efficient and safer therapeutic strategies for the development of ⁇ -hemoglobinopathies.
- the introduction of HPFH mutations generating the KLF1 (-123/-124 T>C)7, TAL1 (-175 T>C)8 or GATA1 (-113 A>G)9 activator BSs in the HBG1/2 promoters results in HbF reactivation using base editing8.
- the creation of the -113 A>G mutation partially disrupt the BCL11A repressor BS.
- the co-occurrence of multiple HPFH mutations is associated with higher HbF levels compared to individual mutations10.
- the prime editing system is a “search and replace” genome editing technology allowing all 12 possible base conversions, insertions, deletions, or the simultaneous combination of these changes11–13 in a specific target region.
- This system is composed of: (1) the prime editor (PE), a fusion of a Cas9 nickase (Cas9n) and an engineered reverse transcriptase (RT) and (2) a prime editing guide RNA (pegRNA).
- PE prime editor
- Cas9n Cas9 nickase
- RT engineered reverse transcriptase
- pegRNA prime editing guide RNA
- the pegRNA contains the spacer sequence that targets a specific region in the genome and the 3’ extension sequence composed of the primer binding site (PBS) that hybridizes to the 3’ end of the nicked DNA strand to start reverse transcription and the RT- template (RTT) that contains the desired edits (DE), which are eventually inserted into the genomic DNA after reverse transcription (PE2 system).
- PBS primer binding site
- RTT RT- template
- DE desired edits
- PE2 system genomic DNA after reverse transcription
- the prime editing tool was improved to increase editing efficiency by using a single guide RNA (ngRNA) that nicks the non-edited strand (PE3 system)11 and a dominant negative MLH1 (MLH1dn) that inhibits the mismatch DNA repair pathway (MMR; PE5 system), which reincorporates the original nucleotides in the edited strand14–17.
- ngRNA single guide RNA
- MLH1dn dominant negative MLH1
- MMR mismatch DNA repair pathway
- the PE by itself was also engineered to further improve its processivity and editing activity by the addition of nuclear localization signal (NLS) sequences, linkers, and two mutations in the Cas9n18. Moreover, its sequence was codon optimized, generating the PEmax15.
- NLS nuclear localization signal
- nickase-based PE a Cas9 nuclease to generate the prime editor nuclease (PEn - SEQ ID NO: 86) that facilitates precise insertion of DNA sequences and improves editing efficiency for some pegRNAs that are poorly efficient when used with the nickase-based PE33.
- This technology allows prime editing by using DSBs and DNA end joining repair pathways33.
- K562 cell culture Human erythroleukemia K562 cells were maintained at a concentration of 5x105cells/ml in RPMI 1640 containing glutamine (Gibco) and supplemented with 10% fetal bovine serum (Gibco), 2% Hepes (Life Technologies), 1% sodium pyruvate (Life Technologies), and 1% penicillin and streptomycin (Life Technologies) at 37°C and 5%CO2.
- HSPC purification and culture We obtained human non-mobilized peripheral blood CD34+ HSPCs from patients with SCD from the “Hôpital Necker-Enfantsments” Hospital (Paris, France). Healthy donors were either obtained from the “Hôpital Necker-Enfantsmats” Hospital (Paris, France). All experiments were performed in accordance with the Declaration of Helsinki. The study was approved by the regional investigational review board (reference: DC 2022-5364, CPP Ile-de- France II “Hôpital Necker-Enfants disclosures”). HSPCs were purified by immunomagnetic selection with AutoMACS (Miltenyi Biotec) after immunostaining with the CD34 MicroBead Kit (Miltenyi Biotec).
- CD34+ cells were thawed and cultured at a concentration of 5 ⁇ 105 cells/ml in the “HSPC medium” containing StemSpan (STEMCELL Technologies) supplemented with penicillin/streptomycin (Gibco), 250 nM StemRegenin1 (STEMCELL Technologies), and the following recombinant human cytokines (PeproTech): human stem-cell factor (SCF) (300 ng/ml), Flt-3L (300 ng/ml), thrombopoietin (TPO) (100 ng/ml), and interleukin-3 (IL-3) (60 ng/ml).
- SCF human stem-cell factor
- Flt-3L 300 ng/ml
- TPO thrombopoietin
- IL-3 interleukin-3
- Plasmids used in this study include: pMJ920 Cas9-GFP-expressing plasmid (Addgene #42234) pCMV-T7-SpRY-P2A-EGFP (Addgene #139989) pCMV-PE2 (Addgene #258720) pCMV-PEmax (Addgene #174820) pU6-pegRNA-GG-acceptor (Addgene #132777) pU6-tevopreq1-GG-acceptor (Addgene #174038) pEF1a-hMLH1dn (Addgene #174824) Generation of pegRNA and epegRNA constructs pegRNAs were designed using the web-based tools (PrimeDesign26 https://drugthatgene.pinellolab.partners.org, pegFinder27 http://pegfinder.sidichenlab.org, and EasyPrime28, http://easy-prime.cc).
- epegRNAs were generated from pegRNAs by adding the tevopreQ1 sequence at the 3’ end of the pegRNA via a linker designed by the web-based tool pegLIT14 (https://peglit.liugroup.us). pegRNAs and epegRNAs were cloned as previously described11,14.
- oligonucleotide duplexes containing the pegRNAs’ or the epegRNAs’ spacer , scaffold and 3’extension, with or without the tevopreQ1 sequence were cloned using the Golden Gate Assembly method into the pU6-pegRNA-GG-acceptor (Addgene #132777) for pegRNAs and the pU6-tevopreq1-GG-acceptor (Addgene#174038) for epegRNAs. Plasmids were transformed in TOP10 and XL10-gold bacteria for pegRNAs and epegRNAs, respectively, following manufacturer’s instructions. After an overnight culture, transformed bacterial colonies were selected and amplified.
- ngRNAs were designed using the web-based tool CRISPOR29 (http://crispor.tefor.net/ - version 4.99). We selected ngRNAs with the lowest off-target activity, which was predicted using web- based tools (COSMID, https://crispr.bme.gatech.edu/30; DeepSpCas9, http://deepcrispr.info/DeepSpCas9/31).
- Oligonucleotide duplexes containing the annealed ngRNA spacer were cloned into the MA128 plasmid (provided by M. Amendola, Genethon, France). Cloning strategies and plasmid selection were similar to those described for peg/epegRNAs. ngRNAs’ sequences are displayed in Table S1.
- K562 cells 106 cells/condition
- K562 cells 106 cells/condition
- Cas9-GFP Additional growth factor
- Cas9-SpRY-GFP-expressing plasmid Additional growth factor
- 1.2 ⁇ g of the ngRNA/pegRNAs-containing plasmids using AMAXA Cell Line Nucleofector Kit V (VCA- 1003, Lonza) and U-16 program (Nucleofector 2b, Lonza).
- TE Tris EDTA
- RNA transfection 1 x 105 to 2 x 105 HSPCs from SCD donors were transfected per condition 24 h after thawing, with 3.2 or 6.4 ⁇ g of the enzyme-encoding PEmax mRNA (generated using the MEGAscript and Poly-A tailing kits, Ambion and the ARCA, Trilink, as described in Antoniou et al.34) or the PEnmax (provided by AstraZeneca) enzyme-encoding mRNA, and 200 to 400 pmol of synthetic pegRNAs (IDT) and/or ngRNA (Synthego).
- IDT synthetic pegRNAs
- Synthego synthetic pegRNAs
- cells were treated 24h with deoxynucleosides (Sigma-Aldrich; dA, catalog no. D8668; dG, catalog no. D0901; dC, catalog no. D0776; dT, catalog no. T1895) at a final concentration of 100uM, or with small inhibitors at a final concentration of 1 ⁇ M for AZD7648 (DNA-PKi, provided by AstraZeneca), 3 ⁇ M for Pol ⁇ i (ART558, MedChem Express), or a combination of both drug treatments.
- AZD7648 DNA-PKi, provided by AstraZeneca
- Pol ⁇ i ART558, MedChem Express
- Synthetic epegRNAs and ngRNA generation Synthetic epegRNAs were ordered from Integrated DNA Technologies (IDT). Each construct contained 2’-O-methyl modifications at the first and last three nucleotides and phosphorothioate linkages between the three first and last nucleotides. Synthetic nicking gRNAs were obtained from Synthego with 2’-O-methyl modifications at the first and last three nucleotides as well and phosphorothioate linkages between the three first and last two nucleotides. Evaluation of editing efficiency Genomic DNA from K562 cells was extracted using the PureLink Genomic DNA Mini Kit (Invitrogen) following the manufacturer’s instructions.
- on-target sites (HBG1/2 promoters and ngRNA target sites) were PCR-amplified using the Recombinant Taq DNA Polymerase (Thermo Fisher) according to the manufacturer’s instructions, and subjected to Sanger sequencing. PCR primers are listed in Table S2. Indels were evaluated using the TIDE software32 (http://shinyapps.datacurators.nl/tide/).
- on-target sites (HBG1/2 promoters) were PCR amplified using the Phusion High-Fidelity polymerase (New England BioLabs), the HF buffer (New England BioLabs) and primers containing specific DNA stretches (MR3 for forward primers and MR4 for reverse primers) 5’ to the sequence recognizing the on-target site. Amplicons were purified using Ampure XP beads (Beckman Coulter).
- Illumina-compatible barcoded DNA amplicon libraries were prepared by a 2-step PCR using the Phusion High- Fidelity polymerase (New England BioLabs), the HF buffer (New England BioLabs) and primers containing Unique Dual Index (UDI) barcodes and annealing to MR3 and MR4 sequences. Primers used for PCR amplification are listed in Table S2. Libraries were pooled, purified by High Pure PCR Product Purification Kit (Sigma-Aldrich), and sequenced using Illumina NovaSeq 6000 system (paired-end sequencing; 2 ⁇ 100-bp).
- NGS data were analyzed using a custom python pipeline that allows to align reads to a reference amplicon sequence and to count, (i) reads with sequence modifications (both expected modifications and other ones) in a window including the editing site in the proximity of the PE nicking site, and (ii) reads with Indels occurring between the epegRNA nicking site and the second nicking site, either alone or in combination with sequence modification near the PE nicking site.
- sequence modifications both expected modifications and other ones
- Indels occurring between the epegRNA nicking site and the second nicking site, either alone or in combination with sequence modification near the PE nicking site.
- Amplicons were generated using Phusion Flash High-Fidelity PCR Mastermix (F548, Thermo Scientific) in a 15 ⁇ L reaction, containing 1.5 ⁇ L of genomic DNA extract and 0.5 ⁇ M of target- specific primers (Forward: 5’-AAACGGTCCCTGGCTAAACT-3’ (SEQ ID NO:83) and Reverse: 5’- CCAGAAGCGAGTGTGTGGAA-3’ (SEQ ID NO:84)) with barcodes and NGS adapters.
- Applied PCR cycling conditions 98 °C for 3 min, 30x (98 °C for 10 s, 60 °C for 20 s, 72 °C for 30 s).
- PCR products were purified using HighPrep PCR Clean-up System (MagBio Genomics). Size, purity, and concentration of amplicons were determined using a fragment analyzer (Agilent). Amplicons were subjected to the second round of PCR to add unique Illumina indexes. Indexing PCR was performed using KAPA HiFi HotStart Ready Mix (Roche), 1 ng of PCR template and 0.5 ⁇ M of indexed primers in the total reaction volume of 25 ⁇ L. PCR cycling conditions: 72 °C for 3 min, 98 °C for 30 s, 10x (98 °C for 10 s, 63 °C for 30 s, 72 °C for 3 min), 72 °C for 5 min.
- Indexed amplicons were purified using HighPrep PCR Clean- up System (MagBio Genomics) and analyzed using a fragment analyzer (Agilent). Samples were quantified using Qubit 4 Fluorometer (Life Technologies) and subjected to sequencing using Illumina NextSeq system according to manufacturer’s instructions. Demultiplexing of the NGS sequencing data was performed using bcl2fastq software. The fastq files were analyzed using CRISPResso2 in the prime editing mode with the quantification window of 48 and 18 for pegRNAs targeting the -115 region of HBG1/2 promoters. Prime edited override sequences were used for each site. To generate the representative alignments, the window was extended to 40 to visualize homology arm integrations of different lengths.
- CFC assay HSPCs were plated at a concentration of 5 ⁇ 102 cells/mL in a methylcellulose-containing medium (GFH4435, STEMCELL Technologies) under conditions supporting erythroid and granulo-monocytic differentiation. After 14 days, BFU-Es were randomly picked and collected as bulk populations (containing at least 25 colonies) to evaluate editing efficiency and globin expression. HPLC Hemoglobin tetramers from BFU-E bulks were separated by CE-HPLC using a 2-cation exchange column (PolyCAT A, PolyLC, Columbia).
- the pegRNA10 is compatible with the PEmax harboring the Cas9n recognizing an NGG protospacer adjacent motif (PAM) and contains a 13-nucleotide long RTT that generates a de novo BS for GATA1 (-113 A>G), deletes the entire BCL11A BS (TGACC motif from -114 to -118 upstream of the TSS), and creates the KLF1 BS (-123/-124 T>C).
- the protospacer recognized by pegRNA10 is located close to the first nucleotide to be edited (i.e. the -113 nucleotide located at position +4 from the PE nick) and the PBS has the recommended length of 13 nucleotides.
- pegRNAs compatible with the PE-SpRY variant which contains a nearly PAM-less Cas9n (pegRNA11 to pegRNA14)21,22 and induce the same mutations as pegRNA10.
- the -113 base is located in position +1 to +8 (from the PE-induced nick) ( Figure 1B).
- pegRNAs with a Cas9 nuclease was tested to select the best-performing pegRNAs’ spacers that efficiently bind the HBG1/2 promoters.
- ngRNA10.1 to ngRNA10.5 we designed five ngRNAs (ngRNA10.1 to ngRNA10.5) inducing a DNA nick in a region spanning from +42 to +81 nucleotides downstream of the nick induced by pegRNA10.
- ngRNA10.1 to ngRNA10.5 We tested these ngRNAs by plasmid transfection in K562 cells and we selected the most efficient ones based on the efficiency of cleavage induced by the Cas9 nuclease.
- the ngRNA10.3 and ngRNA10.5 were the most efficient ngRNAs (inducing 39.0% and 45.8% of InDels in K562 cells, respectively) and were selected for further analyses (Figure 1D).
- the co- transfection with a GFPmax-encoding plasmid enabled us to measure transfection efficiency and evaluate the editing efficiency in FACS-sorted GFPhigh cells.
- HBG promoters were amplified by PCR and subjected to NGS sequencing. Editing efficiency was calculated using a custom Python pipeline.
- the deletion of 5- nucleotides of the BCL11A BS and, as a consequence, the reduced complementarity of the 3’ flap to the opposite strand likely determines the annealing of the GCC trinucleotide of the 3’ flap to the closest complementary motif in the DNA target region (Figure 2C).
- epegRNA10.1 was further modified to generate epegRNA10.2 and epegRNA10.3.
- the epegRNA10.2 contains the same spacer as epegRNA10.1 and an RTT of 16 nucleotides inserting both GATA1 (-113 A>G) and KLF1 (-123/-124 T>C) BSs, and simultaneously deleting the 2-nucleotide-long core of the BCL11A BS (CC motif located -114 to -115 upstream of the TSS) in the -115 region of the HBG1/2 promoter ( Figure 3A).
- the epegRNA10.3 contains the same spacer as epegRNA10.1 and an 18-nucleotide-long RTT inserting only the GATA1 (-113 A>G) and the KLF1 (-123/-124 T>C) BS in the -115 region of the HBG1/2 promoter, without deleting the BCL11A BS ( Figure 4A).
- the introduction of the GATA1 BS partially disrupt the BCL11A BS.
- Edited promoters contain either both activator BSs or partial edits ( Figure 4B).
- Partials edits included mainly the introduction of only the GATA1 BS ( Figure 4B), suggesting the low processivity of the RT when the RTT is longer than 13 nucleotides or the degradation of the 3’ end newly synthesized 3’ flap.
- a low frequency of edited promoters harbored only the KLF1 BS ( Figure 4B), suggesting error-prone and competitive repair pathways mechanisms involved in the resolution of the 3’ flap.
- Indel frequency tended to be lower when using ngRNA10.5 compared to ngRNA10.3.
- these results show that our strategy can simultaneously insert multiple HPFH and HPFH-like mutations in the -115 region of the HBG1/2 promoters in K562 cells.
- epegRNA10.2 that successfully inserted both GATA1 and KLF1 BSs and simultaneously disrupted the BCL11A repressor BS, reaching up to 34% of total editing efficiency in K562 cells.
- NGS sequencing revealed editing profiles that are indicative of competitive and alternative repair pathways underneath the resolution of the 3’ intermediates.
- epegRNA10.4 a novel epegRNA (i.e., epegRNA10.4), which contains a 7-nucleotide longer RTT compared to the epegRNA10.2 (Table S1).
- the elongated RTT should prevent 3’ flap degradation and favor the annealing of the 3’ flap over the 5’ flap to the complementary strand and the desired DNA repair, thus increasing the insertion of the desired edits.
- the epegRNA10.4 outperformed the epegRNA10.2 in inserting the desired motif reaching up to 50% of precise edits with the PE5max ( Figure 5).
- InDels remained low and comparable to the frequency observed with the epegRNA10.2 ( Figure 5).
- G gamma A gamma (beta+) hereditary persistence of fetal hemoglobin the G gamma -158 C-->T mutation in cis to the -175 T-->C mutation of the A gamma-globin gene results in increased G gamma-globin synthesis.
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Organic Chemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Microbiology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Gastroenterology & Hepatology (AREA)
- General Chemical & Material Sciences (AREA)
- Mycology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Diabetes (AREA)
- Hematology (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Crystallography & Structural Chemistry (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Pharmacology & Pharmacy (AREA)
- Animal Behavior & Ethology (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Peptides Or Proteins (AREA)
Abstract
β-hemoglobinopathies, β-thalassemia and sickle cell disease (SCD), are monogenic diseases caused by mutations in the β-globin locus, affecting the synthesis or the structure of the adult hemoglobin (Hb). It has been shown that the severity of both SCD and β-thalassemia is lessened by the hereditary persistence of fetal hemoglobin (HbF) in adulthood (HPFH). Now the inventors exploited the capacity of prime editing to simultaneously generate multiple HPFH mutations and disrupt the BCL11A repressor BS in the -115 region of the HBG1/2 γ-globin promoters to further boost HbF expression and correct the β-thalassemia and SCD phenotypes. Specifically, the inventors selected two potent HPFH/HPFH-like mutations generating BSs for the GATA1 and KLF1 activators, that are associated with a strong γ-globin reactivation in patients and experimental models. This strategy would be very appealing for the treatment of β-hemoglobinopathies.
Description
PRIME EDITING OF THE -115 REGION IN THE HBG1 AND/OR HBG2 PROMOTER FOR INCREASING FETAL HEMOGLOBIN CONTENT IN A EUKARYOTIC CELL FIELD OF THE INVENTION: The present invention is in the field of medicine, in particular haematology. BACKGROUND OF THE INVENTION: β-hemoglobinopathies, β-thalassemia and sickle cell disease (SCD), are monogenic diseases caused by mutations in the β-globin locus, affecting the synthesis or the structure of the adult hemoglobin (Hb). β-thalassemia is caused by mutations in the β-globin gene (HBB) locus that reduce (β+) or abolish (β0) the production of β-globin chains included in the adult hemoglobin (HbA) tetramer, leading to the precipitation of uncoupled α-globin chains, erythroid cell death and severe anemia (Taher, Ali T., David J. Weatherall, and Maria Domenica Cappellini. "Thalassaemia." The Lancet 391.10116 (2018): 155-167). In SCD, an A>T mutation in the HBB gene causes the substitution of valine for glutamic acid at position 6 of the β-globin chain (βS), which is responsible for deoxygenation-induced polymerization of sickle hemoglobin (HbS). This primary event drives red blood cell (RBC) sickling, hemolysis, vaso-occlusive crises, multi-organ damage, often associated with severely reduced life expectancy (Kato, Gregory J., et al. "Sickle cell disease." Nature Reviews Disease Primers 4.1 (2018): 1-22). It has been shown that the severity of both SCD and β-thalassemia is lessened by the hereditary persistence of fetal hemoglobin (HbF) in adulthood (HPFH) (Forget, Bernard G. "Molecular basis of hereditary persistence of fetal hemoglobin." Annals of the New York Academy of Sciences 850.1 (1998): 38-44.). This persistence is due to mutations located 200 to 115 nucleotides upstream of the transcription start sites of the identical HBG1 and HBG2 γ-globin promoters. HPFH mutations either generate de novo DNA motifs recognized by transcriptional activators (e.g., KLF1) (Wienert, Beeke, et al. "Editing the genome to introduce a beneficial naturally occurring mutation associated with increased fetal globin." Nature communications 6.1 (2015): 1-8; Wienert, Beeke, et al. "KLF1 drives the expression of fetal hemoglobin in British HPFH." Blood, The Journal of the American Society of Hematology 130.6 (2017): 803- 807; Martyn, Gabriella E., et al. "A natural regulatory mutation in the proximal promoter elevates fetal globin expression by creating a de novo GATA1 site." Blood, The Journal of the American Society of Hematology 133.8 (2019): 852-856.) or disrupt binding sites (BS) for
transcriptional repressors (e.g., LRF and BCL11A) (Martyn, Gabriella E., Kate GR Quinlan, and Merlin Crossley. "The regulation of human globin promoters by CCAAT box elements and the recruitment of NF-Y." Biochimica et Biophysica Acta (BBA)-Gene Regulatory Mechanisms 1860.5 (2017): 525-536). Recently, base editing approaches were used to generate a variety of mutations in the −200 or -115 region of the HBG1/2 promoters, including potent γ-globin- inducing mutations generating a KLF1 activator binding site or disrupting al LRF repressor binding site (Antoniou, Panagiotis, et al. "Base-editing-mediated dissection of a γ-globin cis- regulatory element for the therapeutic reactivation of fetal hemoglobin expression." Nature Communications 13.1 (2022): 1-22 and WO 2021/228944). Said strategy showed that editing of patient hematopoietic stem/progenitor cells was safe, lead to fetal hemoglobin reactivation and rescued the pathological phenotype. SUMMARY OF THE INVENTION: The present invention is defined by the claims. In particular, the present invention relates to the prime editing of the -115 region in the HBG1 and/or HBG2 promoter for increasing fetal hemoglobin content in a eukaryotic cell. DETAILED DESCRIPTION OF THE INVENTION: Main definitions: As used herein, the term "β-hemoglobinopathy" has its general meaning in the art and refers to any defect in the structure or function of any hemoglobin of an individual, and includes defects in the primary, secondary, tertiary or quaternary structure of hemoglobin caused by any mutation, such as deletion mutations or substitution mutations in the coding regions of the HBB gene, or mutations in, or deletions of, the promoters or enhancers of such gene that cause a reduction in the amount of hemoglobin produced as compared to a normal or standard condition. As used herein, the term "sickle cell disease" has its general meaning in the art and refers to a group of autosomal recessive genetic blood disorders, which results from mutations in a globin gene and which is characterized by red blood cells that assume an abnormal, rigid, sickle shape. They are defined by the presence of βS-globin gene coding for a β-globin chain variant in which glutamic acid is substituted by valine at amino acid position 6 of the peptide: incorporation of the βS-globin in the Hb tetramers (HbS, sickle Hb) leads to Hb polymerization and to a clinical
phenotype. The term includes sickle cell anemia (HbSS), sickle-hemoglobin C disease (HbSC), sickle β-plus- thalassaemia (HbS/β+), or sickle β-zerothalassaemia (HbS/β0). As used herein, the term "β-thalassemia" refers to a hemoglobinopathy that results from an altered ratio of α-globin to β-like globin polypeptide chains resulting in the underproduction of normal hemoglobin tetrameric proteins and the precipitation of free, unpaired α-globin chains. As used herein, the term “alpha globin” or “ ^-globin” has its general meaning in the art and refers to protein that is encoded in human by the HBA1 and HBA2 genes. The human alpha globin gene cluster located on chromosome 16 spans about 30 kb and includes seven loci: 5'- zeta - pseudozeta - mu - pseudoalpha-1 - alpha-2 - alpha-1 - theta - 3'. The alpha-2 (HBA2) and alpha-1 (HBA1) coding sequences are identical. These genes differ slightly over the 5' untranslated regions and the introns, but they differ significantly over the 3' untranslated regions. The ENSEMBL IDs (i.e. the gene identifier number from the Ensembl Genome Browser database) for HBA1 and HBA2 are ENSG00000206172 and ENSG00000188536 respectively. As used herein, the term “beta globin” or “β-globin”” has its general meaning in the art and refers to a globin protein, which along with alpha globin (HBA), makes up the most common form of haemoglobin (Hb) in adult humans. Normal adult human Hb is a heterotetramer consisting of two alpha chains and two beta chains. The β-globin is encoded by the HBB gene on human chromosome 11. It is 146 amino acids long and has a molecular weight of 15,867 Da. As used herein, the term “gamma globin” or “γ-globin” has its general meaning in the art and refers to protein that is encoded in human by the HBG1 and HBG2 genes. The HBG1 and HBG2 genes are normally expressed in the fetal liver, spleen and bone marrow. Two γ-globin chains together with two α-globin chains constitute fetal hemoglobin (HbF) which is normally replaced by adult hemoglobin (HbA) in the year following birth. The ENSEMBL IDs (i.e. the gene identifier number from the Ensembl Genome Browser database) for HBG1 and HBG2 are ENSG00000213934 and ENSG00000196565 respectively. As used herein, the term “expression” refers to the process by which a polynucleotide is transcribed from a DNA template (such as into and mRNA or other RNA transcript) and/or the process by which a transcribed mRNA is subsequently translated into peptides, polypeptides, or proteins. Transcripts and encoded polypeptides may be collectively referred to as “gene
product”. If the polynucleotide is derived from genomic DNA, expression may include splicing of the mRNA in a eukaryotic cell. Any method known in the art can be used to measure the expression of the gene (e. g. HPLC analysis of protein and RT-qPCR analysis of mRNA.) As used herein, the expression "increasing the fetal hemoglobin content” indicates that fetal hemoglobin is at least 5% higher in the eukaryotic cell treated with the prime editing platform, than in a comparable, eukaryotic cell, wherein a prime editing platform targeting an unrelated locus is present or where no prime editing platform is present. In some embodiments, the percentage of fetal hemoglobin expression in the eukaryotic cell is at least 10% higher, at least 20% higher, at least 30% higher, at least 40% higher, at least 50% higher, at least 60% higher, at least 70% higher, at least 80% higher, at least 90% higher, at least 1-fold higher, at least 2- fold higher, at least 5-fold higher, at least 10-fold higher, at least 100 fold higher, at least 1000- fold higher, or more than an eukaryotic cell. As used herein, the term “promoter” has its general meaning in the art and refers to a nucleic acid sequence which is required for expression of a gene operably linked to the promoter sequence. As used herein, the term “HBG1 promoter” refers to the promoter of the HBG1 gene. As used herein, the term “HBG2 promoter” refers to the promoter of the HBG2 gene. HBG1 and HBG2 promoters are identical up to –221 bp and comprise the nucleic acid sequence as set forth in SEQ ID NO:1 and depicted in Figure 1. According to the present invention, the first nucleotide in SEQ ID NO:1 denotes the nucleotide located at position -147 upstream of the HBG transcription starting site and the last nucleotide in SEQ ID NO:1 denotes the nucleotide located at position -91 upstream of the HBG transcription starting site. For instance, the nucleotide at position -123 in the the HBG1 or HBG2 promoter denotes the nucleotide at position 24 in SEQ ID NO:1; the nucleotide at position -124 in the the HBG1 or HBG2 promoter denotes the nucleotide at position 25 in SEQ ID NO:1 and the nucleotide at position -113 in the HBG1 or HBG2 promoter denotes the nucleotide at position 35 in SEQ ID NO:1. SEQ ID NO 1:> Sequence of the HBG1 or HBG2 promoter CTCCACCCATGGGTTGGCCAGCCTTGCCTTGACCAATAGCCTTGACAAGGCAAACTT As used herein, the “-115 region” in the HBG1 or HBG2 promoter refers to the region which encompasses the nucleotides at position -123, -124 and -113 and thus relates to the region
encompassing the region starting from the nucleotide at position 24 to the nucleotide at position 35 in SEQ ID NO:1. As used herein, the term “activator” refers to a transcriptional activator that is a protein (transcription factor) that increases gene transcription of a gene or set of genes. Most activators are DNA-binding proteins that bind to enhancers or promoter-proximal elements. According to the present disclosure, the activator is KLF1 or GATA1. As used herein, the term “transcriptional activator binding site” refers to a site present on DNA whereby the transcriptional activator according to the present disclosure binds. According to the present invention, the prime-editing platform of the present invention edits the genome sequence of the eukaryotic cell so that the activator is able to bind to its transcriptional activator binding sites. As used herein, the term “KLF1” has its general meaning in the art and refers to the Kruppel like factor 1 protein. The term is also known as EKLF; EKLF/KLF1. KLF1 is a hematopoietic- specific transcription factor that induces high-level expression of adult beta-globin and other erythroid genes. The zinc-finger protein binds to a DNA sequence found in the beta globin promoter. As used herein, the term “GATA1” has its general meaning in the art and refers to the erythroid differentiation factor. The term is also known as ERYF1 or GF1. As used herein, the term “repressor” refers to a transcriptional repressor that is a protein (transcription factor) that decreases gene transcription of a gene or set of genes. Most repressors are DNA-binding proteins that bind to enhancers or promoter-proximal elements. In particular, the repressor is BCL11A. Accordingly, the term “transcriptional repressor binding site” refers to a site present on DNA whereby the transcription repressor binds. In some embodiments, the prime-editing platform of the present invention edits the genome sequence of the eukaryotic cell so that the transcriptional repressor is not able to bind to its transcriptional repressor binding sites. In some embodiments, the prime-editing platform of the present invention of the present invention will inhibit the binding of BCL11A to its binding site.
As used herein, the term “BCL11A” has its general meaning in the art and refers to the gene encoding for BAF chromatin remodeling complex subunit BCL11A (Gene ID: 53335). The term is also known as EVI9; CTIP1; DILOS; ZNF856; HBFQTL5; BCL11A-L; BCL11A-S; BCL11a-M; or BCL11A-XL. Five alternatively spliced transcript variants of this gene, which encode distinct isoforms, have been reported. The protein associates with the SWI/SNF complex that regulates gene expression via chromatin remodeling. BCL11A is highly expressed in several hematopoietic lineages, and plays a role in the switch from γ- to β-globin expression during the fetal to adult erythropoiesis transition (Sankaran VJ et al. "Human fetal hemoglobin expression is regulated by the developmental stage-specific repressor BCL11A”, Science Science.2008 Dec 19;322(5909):1839-42). As used herein, the terms “polypeptide”, “peptide”, and “protein” are used interchangeably herein to refer to polymers of amino acids of any length. The terms also encompass an amino acid polymer that has been modified; for example, disulfide bond formation, glycosylation, lipidation, phosphorylation, or conjugation with a labeling component. Polypeptides, when discussed in the context of gene therapy refer to the respective intact polypeptide, or any fragment or genetically engineered derivative thereof, which retains the desired biochemical function of the intact protein. As used herein, the term “polynucleotide” refers to a polymeric form of nucleotides of any length, including deoxyribonucleotides or ribonucleotides, or analogs thereof. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs, and may be interrupted by non-nucleotide components. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The term polynucleotide, as used herein, refers interchangeably to double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of the invention described herein that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form. As used herein, the expression “derived from” refers to a process whereby a first component (e.g., a first polypeptide), or information from that first component, is used to isolate, derive or make a different second component (e.g., a second polypeptide that is different from the first one).
As used herein, the term “fusion protein” means a protein created by joining two or more polypeptide sequences together. The fusion polypeptides encompassed in this invention include translation products of a chimeric gene construct that joins the nucleic acid sequences encoding a first polypeptide, e.g., an RNA-binding domain, with the nucleic acid sequence encoding a second polypeptide, e.g., an effector domain, to form a single open-reading frame. In other words, a “fusion protein” is a recombinant protein of two or more proteins which are joined by a peptide bond or via several peptides. The fusion protein may also comprise a peptide linker between the two domains. As used herein the term “wild type” is a term of the art understood by skilled persons and means the typical form of an organism, strain, gene or characteristic as it occurs in nature as distinguished from mutant or variant forms. As used herein, the term “derived from” refers to a process whereby a first component (e.g., a first molecule), or information from that first component, is used to isolate, derive or make a different second component (e.g., a second molecule that is different from the first). As used herein, the “percent identity” between the two sequences is a function of the number of identical positions shared by the sequences (i.e., % identity = number of identical positions/total number of positions x 100), taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm, as described below. The percent identity between two amino acid sequences can be determined using the Needleman and Wunsch algorithm (Needleman, Saul B. & Wunsch, Christian D. (1970). "A general method applicable to the search for similarities in the amino acid sequence of two proteins". Journal of Molecular Biology.48 (3): 443–53.). The percent identity between two nucleotide or amino acid sequences may also be determined using for example algorithms such as EMBOSS Needle (pair wise alignment; available at www.ebi.ac.uk). For example, EMBOSS Needle may be used with a BLOSUM62 matrix, a “gap open penalty” of 10, a “gap extend penalty” of 0.5, a false “end gap penalty”, an “end gap open penalty” of 10 and an “end gap extend penalty” of 0.5. In general, the “percent identity” is a function of the number of matching positions divided by the number of positions compared and multiplied by 100. For instance, if 6 out of 10 sequence
positions are identical between the two compared sequences after alignment, then the identity is 60%. The % identity is typically determined over the whole length of the query sequence on which the analysis is performed. Two molecules having the same primary amino acid sequence or nucleic acid sequence are identical irrespective of any chemical and/or biological modification. According to the invention a first amino acid sequence having at least 90% of identity with a second amino acid sequence means that the first sequence has 90; 91; 92; 93; 94; 95; 96; 97; 98; 99 or 100% of identity with the second amino acid sequence. As used herein, the term “linker” refers to any means, entity or moiety used to join two or more entities. A linker can be a covalent linker or a non-covalent linker. Examples of covalent linkers include covalent bonds or a linker moiety covalently attached to one or more of the proteins or domains to be linked. The linker can also be a non-covalent bond, e.g., an organometallic bond through a metal center such as platinum atom. For covalent linkages, various functionalities can be used, such as amide groups, including carbonic acid derivatives, ethers, esters, including organic and inorganic esters, amino, urethane, urea and the like. To provide for linking, the domains can be modified by oxidation, hydroxylation, substitution, reduction etc. to provide a site for coupling. Methods for conjugation are well known by persons skilled in the art and are encompassed for use in the present invention. Linker moieties include, but are not limited to, chemical linker moieties, or for example a peptide linker moiety (a linker sequence). It will be appreciated that modification which do not significantly decrease the function of the RNA- binding domain and effector domain are preferred. As used herein, the “linked” as used herein refers to the attachment of two or more entities to form one entity. A conjugate encompasses both peptide-small molecule conjugates as well as peptide-protein/peptide conjugates. As used herein, the term “complementarity” refers to the ability of a nucleic acid to form hydrogen bond(s) with another nucleic acid sequence by either traditional Watson-Crick base- pairing or other non-traditional types. A percent complementarity indicates the percentage of residues in a nucleic acid molecule which can form hydrogen bonds (e.g., Watson-Crick base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being 50%, 60%, 70%, 80%, 90%, and 100% complementary). “Perfectly complementary” means that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same number of contiguous residues in a second nucleic acid sequence. “Substantially complementary” as
used herein refers to a degree of complementarity that is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% over a region of 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 30, 35, 40, 45, 50, or more nucleotides, or refers to two nucleic acids that hybridize under stringent conditions. As used herein, the term “stringent conditions” for hybridization refer to conditions under which a nucleic acid having complementarity to a target sequence predominantly hybridizes with the target sequence, and substantially does not hybridize to non-target sequences. Stringent conditions are generally sequence-dependent, and vary depending on a number of factors. In general, the longer the sequence, the higher the temperature at which the sequence specifically hybridizes to its target sequence. Non-limiting examples of stringent conditions are described in detail in Tijssen (1993), Laboratory Techniques In Biochemistry And Molecular Biology- Hybridization With Nucleic Acid Probes Part I, Second Chapter “Overview of principles of hybridization and the strategy of nucleic acid probe assay”, Elsevier, N.Y. As used herein, the term “hybridization” or “hybridizing” refers to a process where completely or partially complementary nucleic acid strands come together under specified hybridization conditions to form a double-stranded structure or region in which the two constituent strands are joined by hydrogen bonds. Although hydrogen bonds typically form between adenine and thymine or uracil (A and T or U) or cytosine and guanine (C and G), other base pairs may form (e.g., Adams et al., The Biochemistry of the Nucleic Acids, 11th ed., 1992). As used herein, the term “prime editing” refers to the “search-and-replace” genome editing technology that was first disclosed in Anzalone, Andrew V., et al. "Search-and-replace genome editing without double-strand breaks or donor DNA." Nature 576.7785 (2019): 149-157. The technology directly writes new genetic information into a targeted DNA site. The technology can mediate targeted insertions, deletions, and base-to-base conversions without the need for double strand breaks (DSBs) or donor DNA templates. As used herein, the term “prime editing enzyme” refers to a fusion protein comprising a defective CRISPR/Cas nuclease linked to a reverse transcriptase. The term is also known as “prime editor”.
As used herein, the term “nuclease” includes an enzyme that induces a break in a nucleic acid sequence, e.g., a single or a double strand break in a double-stranded DNA sequence. As used herein, the term “CRISPR/Cas nuclease” has its general meaning in the art and refers to segments of prokaryotic DNA containing clustered regularly interspaced short palindromic repeats (CRISPR) and associated nucleases encoded by Cas genes. In bacteria the CRISPR/Cas loci encode RNA-guided adaptive immune systems against mobile genetic elements (viruses, transposable elements and conjugative plasmids). Three types of CRISPR systems have been identified. CRISPR clusters contain spacers, the sequences complementary to antecedent mobile elements. CRISPR clusters are transcribed and processed into mature CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) RNA (crRNA). The CRISPR/Cas nucleases Cas9 and Cpf1 belong to the type II and type V CRISPR/Cas system and have strong endonuclease activity to cut target DNA. Cas9 is guided by a mature crRNA that contains about 20 nucleotides of unique target sequence (called spacer) and a trans-activating small RNA (tracrRNA) that also serves as a guide for ribonuclease III-aided processing of pre-crRNA. The crRNA:tracrRNA duplex directs Cas9 to target DNA via complementary base pairing between the spacer on the crRNA and the complementary sequence (called protospacer) on the target DNA. Cas9 recognizes a trinucleotide (NGG for S. Pyogenes Cas9) protospacer adjacent motif (PAM) to specify the cut site (the 3rd or the 4th nucleotide upstream from PAM). As used herein, the term “Cas9” or “Cas9 nuclease” refers to an RNA-guided nuclease comprising a Cas9 protein, or a fragment thereof (e.g., a protein comprising an active or inactive DNA cleavage domain of Cas9, and/or the gRNA binding domain of Cas9). A Cas9 nuclease is also referred to sometimes as a casn1 nuclease or a CRISPR (clustered regularly interspaced short palindromic repeat)-associated nuclease. CRISPR is an adaptive immune system that provides protection against mobile genetic elements (viruses, transposable elements and conjugative plasmids). CRISPR clusters contain spacers, sequences complementary to antecedent mobile elements, and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA). In type II CRISPR systems correct processing of pre-crRNA requires a trans-encoded small RNA (tracrRNA), endogenous ribonuclease 3 (rnc) and a Cas9 protein. The tracrRNA serves as a guide for ribonuclease 3-aided processing of pre- crRNA. Subsequently, Cas9/crRNA/tracrRNA endonucleolytically cleaves linear or circular dsDNA target complementary to the spacer. The target strand not complementary to crRNA is first cut endonucleolytically, then trimmed 3’-5’ exonucleolytically. In nature, DNA-binding
and cleavage typically requires protein and both RNAs. However, single guide RNAs (“sgRNA”, or simply “gRNA”) can be engineered so as to incorporate aspects of both the crRNA and tracrRNA into a single RNA species. See, e.g., Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J. A., Charpentier E. Science 337:816-821(2012), the entire contents of which is hereby incorporated by reference. Cas9 recognizes a short motif in the CRISPR repeat sequences (the PAM or protospacer adjacent motif) to help distinguish self versus non-self. Cas9 nuclease sequences and structures are well known to those of skill in the art (see, e.g., “Complete genome sequence of an M1 strain of Streptococcus pyogenes.” Ferretti et al., J. J., McShan W. M., Ajdic D. J., Savic D. J., Savic G., Lyon K., Primeaux C., Sezate S., Suvorov A. N., Kenton S., Lai H. S., Lin S. P., Qian Y., Jia H. G., Najar F. Z., Ren Q., Zhu H., Song L., White J., Yuan X., Clifton S. W., Roe B. A., McLaughlin R. E., Proc. Natl. Acad. Sci. U.S.A. 98:4658-4663(2001); “CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III.” Deltcheva E., Chylinski K., Sharma C. M., Gonzales K., Chao Y., Pirzada Z. A., Eckert M. R., Vogel J., Charpentier E., Nature 471:602-607(2011); and “A programmable dual- RNA-guided DNA endonuclease in adaptive bacterial immunity.” Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J. A., Charpentier E. Science 337:816-821(2012), the entire contents of each of which are incorporated herein by reference). Cas9 orthologs have been described in various species, including, but not limited to, S. pyogenes and S. thermophilus. Additional suitable Cas9 nucleases and sequences will be apparent to those of skill in the art based on this disclosure, and such Cas9 nucleases and sequences include Cas9 sequences from the organisms and loci disclosed in Chylinski, Rhun, and Charpentier, “The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems” (2013) RNA Biology 10:5, 726-737; the entire contents of which are incorporated herein by reference. In some embodiments, the term “Cas9” refers to Cas9 from: Corynebacterium ulcerans (NCBI Refs: NC_015683.1, NC_017317.1); Corynebacterium diphtheria (NCBI Refs: NC_016782.1, NC_016786.1); Spiroplasma syrphidicola (NCBI Ref: NC_021284.1); Prevotella intermedia (NCBI Ref: NC_017861.1); Spiroplasma taiwanense (NCBI Ref: NC_021846.1); Streptococcus iniae (NCBI Ref: NC_021314.1); Belliella baltica (NCBI Ref: NC_018010.1); Psychroflexus torquisI (NCBI Ref: NC_018721.1); Streptococcus thermophilus (NCBI Ref: YP_820832.1); Listeria innocua (NCBI Ref: NP_472073.1); Campylobacter jejuni (NCBI Ref: YP_002344900.1); or Neisseria. meningitidis (NCBI Ref: YP_002342100.1). Typically the Cas9 nuclease comprises the amino acid sequence as set forth in SEQ ID NO: 2. SEQ ID NO:2: Cas9 sequence
MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTAR RRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHL RKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVD AKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLD NLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPE KYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQI HLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVD KGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLL FKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVL TLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFAN RNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENI VIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQEL DINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDF RKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAK YFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGG FSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSS FEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHY EKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIH LFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD As used herein, the term “defective CRISPR/Cas nuclease” refers to a CRISPR/Cas nuclease having lost at least one nuclease domain. As used herein, the term “nickase” has its general meaning in the art and refers to an endonuclease which cleaves only a single strand of a DNA duplex (“nicking”). Accordingly, the term “Cas9 nickase” refers to a nickase derived from a Cas9 protein, typically by inactivating one nuclease domain of Cas9 protein. As used herein, the term “protospacer adjacent motif sequence” or “PAM” refers to an approximately 2-6 base pair DNA sequence that is an important targeting component of a Cas9 nuclease. Typically, the PAM sequence is on either strand, and is downstream in the 5’ to 3’ direction of Cas9 cut site. The canonical PAM sequence (i.e., the PAM sequence that is associated with the Cas9 nuclease of Streptococcus pyogenes is 5’-NGG-3’ wherein “N” is any nucleobase followed by two guanine (“G”) nucleobases. Different PAM sequences can be associated with different Cas9 nucleases or equivalent proteins from different organisms. In addition, any given Cas9 nuclease, e.g., SpCas9, may be modified to alter the PAM specificity of the nuclease such that the nuclease recognizes alternative PAM sequence. As used herein, the term “protospacer” refers to the sequence (˜20 bp) in DNA adjacent to the PAM (protospacer adjacent motif) sequence. The protospacer shares the same sequence as the spacer sequence of the guide RNA. The guide RNA anneals to the complement of the protospacer sequence on the target DNA (specifically, one strand thereof, i.e., the “target
strand” versus the “non-target strand” of the target sequence). In order for Cas9 to function it also requires a specific protospacer adjacent motif (PAM) that varies depending on the bacterial species of the Cas9 gene. The most commonly used Cas9 nuclease, derived from S. pyogenes, recognizes a PAM sequence of NGG that is found directly downstream of the target sequence in the genomic DNA, on the non-target strand. As used herein, the term “guide RNA” or “gRNA” has its general meaning in the art and is a particular type of guide nucleic acid which associates with a CRISPR/Cas nuclease (e.g. Cas9), directing the nuclease to a specific sequence in a DNA molecule that includes complementarity to protospacer sequence of the guide RNA. As used herein, the terms “prime editing guide RNA” or “pegRNA” refers to a specialized form of a guide RNA that has been modified to include one or more additional sequences for implementing the prime editing as described herein. Typically, the pegRNAs of the present invention comprise in the 5’ to 3’ direction a spacer, a gRNA core, and an extension arm. In some embodiments, the pegRNA of the present invention may also further comprise elements, such as, but not limited to aptamers, stem loops, hairpins, toe loops (e.g., a 3’ toeloop), or an RNA-protein recruitment domain (e.g., MS2 hairpin). In particular, the pegRNA may contain one or more structural elements for minimizing its degradation. In particular, the pegRNA of the present invention incorporates one or more stable pseudoknots at its 3’ end such as a modified prequeosine1-1 riboswitch aptamer (evopreQ1) or the frameshifting pseudoknot from Moloney murine leukemia virus (MMLV), hereafter referred to as “mpknot” as described in Nelson, James W., et al. "Engineered pegRNAs improve prime editing efficiency." Nature biotechnology 40.3 (2022): 402-410 for which the teaching is incorporated by reference. In some embodiments, the pegRNA may comprise a transcriptional termination sequence at the 3’ of the molecule. As used herein, the term “spacer” in connection with a guide RNA or a pegRNA refers to the portion of the guide RNA or pegRNA of about 20 nucleotides which contains a nucleotide sequence that is complementary to the protospacer sequence in the target nucleic sequence. The spacer sequence anneals to the protospacer sequence to form a ssRNA/ssDNA hybrid structure at the target site and a corresponding R loop ssDNA structure of the endogenous DNA strand that is complementary to the protospacer sequence.
As used herein, the term “gRNA core” or “gRNA scaffold” refers to the sequence within the gRNA that is responsible for Cas9 binding, it does not include the 20 bp spacer/targeting sequence that is used to guide Cas9 to target DNA. As used herein, the term “extension arm” refers to a nucleotide sequence component of a pegRNA which provides several functions. Typically, the extension arm is located at the 3’ end of the guide RNA, and comprises the following components in a 5’ to 3’ direction: a reverse transcriptase (RT) template and the primer binding site. In the case of a 3’ extension arm, the term “reverse transcriptase template” or “RT template” or “RTT” refers to the portion of the extension arm that spans from the 5’ end of the primer binding site (PBS) to 3’ end of the gRNA core that may operate as a template for the synthesis of a single-strand of DNA by the reverse transcriptase of the prime editing enzyme. Thus the RT template encodes (by the reverse transcriptase of the prime editing enzyme) a single-stranded DNA which, in turn, has been designed to be (a) homologous with the endogenous target DNA to be edited, and (b) which comprises at least one desired nucleotide change (e.g. a particular mutation) to be introduced or integrated into the endogenous target DNA. As used herein, the term “primer binding site” or “PBS” refers to the nucleotide sequence located on a pegRNA as component of the extension arm (typically at the 3’ end of the extension arm) and serves to bind to the primer sequence that is formed after Cas9 nicking of the target sequence by the prime editing enzyme. As detailed elsewhere, when the Cas9 nickase component of a prime editing enzyme nicks one strand of the target sequence, a 3’-ended ssDNA flap is formed, which serves a primer sequence that anneals to the primer binding site on the pegRNA to prime reverse transcription. As used herein, the term “reverse transcriptase” or “RT” describes a class of polymerases characterized as RNA-dependent DNA polymerases. All known reverse transcriptases require a primer to synthesize a DNA transcript from an RNA template. As used herein, the term “target nucleic acid” or “target” refers to a nucleic acid containing a target nucleic acid sequence. A target nucleic acid may be single-stranded or double-stranded, and often is double-stranded DNA. A “target nucleic acid sequence,” “target sequence” or “target region”, as used herein, means a specific sequence or the complement thereof that one wishes to bind to using the CRISPR system as disclosed herein.
As used herein, the term “mutation” has its general meaning in the art and refers to a substitution, deletion or insertion. The term "substitution" means that a specific nucleotide at a specific position is removed and another nucleotide is inserted into the same position. The term "deletion" means that a specific nucleotide is removed. The term "insertion" means that one or more nucleotides are inserted before or after a specific nucleotide. As used herein, the term “variant” refers to a first composition (e.g., a first molecule), that is related to a second composition (e.g., a second molecule, also termed a “parent” molecule). The variant molecule can be derived from, isolated from, based on or homologous to the parent molecule. A variant molecule can have entire sequence identity with the original parent molecule, or alternatively, can have less than 100% sequence identity with the parent molecule. For example, a variant of a sequence can be a second sequence that is at least 50; 51; 52; 53; 54; 55; 56; 57; 58; 59; 60; 61; 62; 63; 64; 65; 66; 67; 68; 69; 70; 71; 72; 73; 74; 75; 76; 77; 78; 79; 80; 81; 82; 83; 84; 85; 86; 87; 88; 89; 90; 91; 92; 93; 94; 95; 96; 97; 98; 99; 100% identical in sequence compare to the original sequence. As used herein, the term "treatment" or "treat" refer to both prophylactic or preventive treatment as well as curative or disease modifying treatment, including treatment of patient at risk of contracting the disease or suspected to have contracted the disease as well as patients who are ill or have been diagnosed as suffering from a disease or medical condition, and includes suppression of clinical relapse. The treatment may be administered to a subject having a medical disorder or who ultimately may acquire the disorder, in order to prevent, cure, delay the onset of, reduce the severity of, or ameliorate one or more symptoms of a disorder or recurring disorder, or in order to prolong the survival of a subject beyond that expected in the absence of such treatment. By "therapeutic regimen" is meant the pattern of treatment of an illness, e.g., the pattern of dosing used during therapy. A therapeutic regimen may include an induction regimen and a maintenance regimen. The phrase "induction regimen" or "induction period" refers to a therapeutic regimen (or the portion of a therapeutic regimen) that is used for the initial treatment of a disease. The general goal of an induction regimen is to provide a high level of drug to a patient during the initial period of a treatment regimen. An induction regimen may employ (in part or in whole) a "loading regimen", which may include administering a greater dose of the drug than a physician would employ during a maintenance regimen, administering a drug more frequently than a physician would administer the drug during a
maintenance regimen, or both. The phrase "maintenance regimen" or "maintenance period" refers to a therapeutic regimen (or the portion of a therapeutic regimen) that is used for the maintenance of a patient during treatment of an illness, e.g., to keep the patient in remission for long periods of time (months or years). A maintenance regimen may employ continuous therapy (e.g., administering a drug at regular intervals, e.g., weekly, monthly, yearly, etc.) or intermittent therapy (e.g., interrupted treatment, intermittent treatment, treatment at relapse, or treatment upon achievement of a particular predetermined criteria [e.g., pain, disease manifestation, etc.]). As used herein, the term "therapeutically effective amount" is meant a sufficient amount of population of cells to treat the disease at a reasonable benefit/risk ratio applicable to any medical treatment. It will be understood that the total usage compositions of the present invention will be decided by the attending physician within the scope of sound medical judgment. The specific therapeutically effective dose level for any particular patient will depend upon a variety of factors including the age, body weight, general health, sex and diet of the patient, the time of administration, route of administration, the duration of the treatment, drugs used in combination or coincidental with the population of cells, and like factors well known in the medical arts. Methods of the present invention: The first object of the present invention relates to a method of increasing fetal hemoglobin content in a eukaryotic cell comprising the step of contacting the eukaryotic cell with a prime editing platform that consists of (a) one prime editing enzyme and (b) one prime editing guide RNA (pegRNA) for guiding the prime editing enzyme to one target nucleic acid sequence in the -115 region of the HBG1 and/or HBG2 promoter, thereby prime editing said region and subsequently increasing the expression of gamma-globin in said eukaryotic cell. In some embodiments, the prime editing platform is suitable for introducing a combination of mutations in the -115 region of HBG1 and/or HBG2 promoter so that i) new transcriptional activator binding sites for KLF1 and GATA1 are introduced in said promoter and ii) the binding site for the BCL11A repressor is disrupted. In some embodiments, the prime editing platform herein disclosed introduces i) the -123T>C and -124T>C mutations so that the KFL1 activator can bind to the promoter ii) the - 113 A>G
mutation so that the GATA1 activator can bind to the promoter and iii) with or without the complete or partial deletion of the BCL11A binding-motif (i.e TGACCA) so that the binding site for the BCL11A repressor is disrupted. In some embodiments, the method of the present invention comprises the steps of: (i) contacting the eukaryotic cell with a) the prime editing enzyme and b) a guide RNA comprising an RT template comprising the desired nucleotide changes; (ii) conducting target-primed reverse transcription of the RT template to generate a single strand DNA comprising the desired nucleotide changes; and (iii) incorporating the desired nucleotide change into the -115 region of the HBG1 and/or HBG2 promoter at the target sequence through a DNA repair and/or replication process. Eukaryotic cell: In some embodiments, the eukaryotic cell is selected from the group consisting of hematopoietic progenitor cells, hematopoietic stem cells (HSCs), pluripotent cells (i.e. embryonic stem cells (ES) and induced pluripotent stem cells (iPS)). More preferably the eukaryotic cell is a hematopoietic stem cell. As used herein, the term “hematopoietic stem cell” or “HSC” refers to blood cells that have the capacity to self-renew and to differentiate into precursors of blood cells. These precursor cells are immature blood cells that cannot self-renew and must differentiate into mature blood cells. Hematopoietic stem progenitor cells display a number of phenotypes, such as Lin-CD34+CD38−CD90+CD45RA−, Lin- CD34+CD38−CD90−CD45RA−, Lin-CD34+CD38+IL-3aloCD45RA−, and Lin- CD34+CD38+CD10+(Daley et al., Focus 18:62-67, 1996; Pimentel, E., Ed., Handbook of Growth Factors Vol. III: Hematopoietic Growth Factors and Cytokines, pp. 1-2, CRC Press, Boca Raton, Fla., 1994). Within the bone marrow microenvironment, the stem cells self-renew and maintain continuous production of hematopoietic stem cells that give rise to all mature blood cells throughout life. In some embodiments, the hematopoietic progenitor cells or hematopoietic stem cells are isolated form peripheral blood cells. As used herein, the term “peripheral blood cells” refer to the cellular components of blood, including red blood cells, white blood cells, and platelets, which are found within the circulating pool of blood.
In some embodiments, the eukaryotic cell is a bone marrow derived stem cell. As used herein the term “bone marrow-derived stem cells” refers to stem cells found in the bone marrow. Stem cells may reside in the bone marrow, either as an adherent stromal cell type that possess pluripotent capabilities, or as cells that express CD34 or CD45 cell-surface protein, which identifies hematopoietic stem cells able to differentiate into blood cells. Typically, the eukaryotic cell results from a stem cell mobilization. As used herein, the term “mobilization” or “stem cell mobilization” refers to a process involving the recruitment of stem cells from their tissue or organ of residence to peripheral blood following treatment with a mobilization agent. This process mimics the enhancement of the physiological release of stem cells from tissues or organs in response to stress signals during injury and inflammation. The mechanism of the mobilization process depends on the type of mobilization agent administered. Some mobilization agents act as agonists or antagonists that prevent the attachment of stem cells to cells or tissues of their microenvironment. Other mobilization agents induce the release of proteases that cleave the adhesion molecules or support structures between stem cells and their sites of attachment. As used herein, the term “mobilization agent” refers to a wide range of molecules that act to enhance the mobilization of stem cells from their tissue or organ of residence, e.g., bone marrow (e.g., CD34+ stem cells) and spleen (e.g., Hox11+ stem cells), into peripheral blood. Mobilization agents include chemotherapeutic drugs, e.g., cyclophosphamide and cisplatin; cytokines, and chemokines, e.g., granulocyte colony- stimulating factor (G-CSF), granulocyte-macrophage colony-stimulating factor (GM-CSF), stem cell factor (SCF), Fms-related tyrosine kinase 3 (flt-3) ligand, stromal cell-derived factor 1 (SDF-1); agonists of the chemokine (C—C motif) receptor 1 (CCR1), such as chemokine (C—C motif) ligand 3 (CCL3, also known as macrophage inflammatory protein-1α (Mip-1α)); agonists of the chemokine (C—X—C motif) receptor 1 (CXCR1) and 2 (CXCR2), such as chemokine (C—X—C motif) ligand 2 (CXCL2) (also known as growth-related oncogene protein-β (Gro-β)), and CXCL8 (also known as interleukin-8 (IL-8)); agonists of CXCR4, such as CTCE-02142, and Met-SDF-1,; Very Late Antigen (VLA)-4 inhibitors; antagonists of CXCR4, such as TG-0054, plerixafor (also known as AMD3100), and AMD3465, or any combination of the previous agents. A mobilization agent increases the number of stem cells in peripheral blood, thus allowing for a more accessible source of stem cells. Prime editing enzyme:
In some embodiments, the prime editing enzyme of the present invention comprises a defective CRISPR/Cas nuclease. The sequence recognition mechanism is the same as for the non- defective CRISPR/Cas nuclease. Typically, the defective CRISPR/Cas nuclease of the invention comprises at least one RNA binding domain. The RNA binding domain interacts with a guide RNA molecule as defined hereinafter. However, the defective CRISPR/Cas nuclease of the invention is a modified version with no nuclease activity. Accordingly, the defective CRISPR/Cas nuclease specifically recognizes the guide RNA molecule and thus guides the prime editing enzyme to its target sequence. In some embodiments, the defective CRISPR/Cas nuclease can be modified to increase nucleic acid binding affinity and/or specificity, alter an enzymatic activity, and/or change another property of the protein. In some embodiments, the nuclease domains of the protein can be modified, deleted, or inactivated. In some embodiments, the protein can be truncated to remove domains that are not essential for the function of the protein. In some embodiments, the protein is truncated or modified to optimize the activity of the RNA binding domain. In some embodiments, the CRISPR/Cas nuclease consists of a mutant CRISPR/Cas nuclease i.e. a protein having one or more point mutations, insertions, deletions, truncations, a fusion protein, or a combination thereof. In some embodiments, the mutant has the RNA-guided DNA binding activity, but lacks one or both of its nuclease active sites. In some embodiments, the mutant comprises an amino acid sequence having at least 50% of identity with the wild type amino acid sequence of the CRISPR/Cas nuclease. Various CRISPR/Cas nucleases can be used in this invention. Non-limiting examples of suitable CRISPR/CRISPR/Cas nucleases include Cas3, Cas4, Cas5, Cas5e (or CasD), Cas6, Cas6e, Cas6f, Cas7, Cas8a1, Cas8a2, Cas8b, Cas8c, Cas9, Cas10, Cas10d, CasF, CasG, CasH, Csy1, Csy2, Csy3, Cse1 (or CasA), Cse2 (or CasB), Cse3 (or CasE), Cse4 (or CasC), Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csz1, Csx15, Csf1, Csf2, Csf3, Csf4, and Cu1966. See e.g., WO2014144761 WO2014144592, WO2013176772, US20140273226, and US20140273233, the contents of which are incorporated herein by reference in their entireties. In some embodiments, the CRISPR/Cas nuclease is derived from a type II CRISPR-Cas system. In some embodiments, the CRISPR/Cas nuclease is derived from a Cas9 protein. The Cas9 protein can be from Streptococcus pyogenes, Streptococcus thermophilus, Streptococcus sp.,
Nocardiopsis dassonvillei, Streptomyces pristinaespiralis, Streptomyces viridochromogenes, Streptomyces viridochromogenes, Streptosporangium roseum, Streptosporangium roseum, Alicyclobacillus acidocaldarius, Bacillus pseudomycoides, Bacillus selenitireducens, Exiguobacterium sibiricum, Lactobacillus delbrueckii, Lactobacillus salivarius, Microscilla marina, Burkholderiales bacterium, Polaromonas naphthalenivorans, Polaromonas sp., Crocosphaera watsonii, Cyanothece sp., Microcystis aeruginosa, Synechococcus sp., Acetohalobium arabaticum, Ammonifex degensii, Caldicelulosiruptor becscii, Candidatus Desulforudis, Clostridium botulinum, Clostridium difficile, Finegoldia magna, Natranaerobius thermophilus, Pelotomaculum thermopropionicum, Acidithiobacillus caldus, Acidithiobacillus ferrooxidans, Allochromatium vinosum, Marinobacter sp., Nitrosococcus halophilus, Nitrosococcus watsoni, Pseudoalteromonas haloplanktis, Ktedonobacter racemifer, Methanohalobium evestigatum, Anabaena variabilis, Nodularia spumigena, Nostoc sp., Arthrospira maxima, Arthrospira platensis, Arthrospira sp., Lyngbya sp., Microcoleus chthonoplastes, Oscillatoria sp., Petrotoga mobilis, Thermosipho africanus, or Acaryochloris marina, inter alia. In some embodiments, the CRISPR/Cas nuclease is a mutant of a wild type CRISPR/Cas nuclease (such as Cas9) or a fragment thereof. In some embodiments, the CRISPR/Cas nuclease is a mutant Cas9 protein from S. pyogenes. Methods for generating a Cas9 protein (or a fragment thereof) having an inactive DNA cleavage domain are known (See, e.g., Jinek et al., Science.337:816-821(2012); Qi et al., “Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression” (2013) Cell. 28; 152(5):1173-83, the entire contents of each of which are incorporated herein by reference). For example, the DNA cleavage domain of Cas9 is known to include two subdomains, the HNH nuclease subdomain and the RuvC1 subdomain. The HNH subdomain cleaves the strand complementary to the gRNA, whereas the RuvC1 subdomain cleaves the non-complementary strand. Mutations within these subdomains can silence the nuclease activity of Cas9. For example, the mutations D10A and H840A completely inactivate the nuclease activity of S. pyogenes Cas9 (Jinek et al., Science.337:816-821(2012); Qi et al., Cell. 28; 152(5):1173-83 (2013). In some embodiments, the CRISPR/Cas nuclease of the present invention is a nickase and more particularly a Cas9 nickase i.e. the Cas9 from S. pyogenes having one mutation selected from
the group consisting of D10A and H840A. In some embodiments, the nickase of the present invention comprises the amino acid sequence as set forth in SEQ ID NO: 3 or SEQ ID NO:4. SEQ ID NO: 3> S. pyogenes nCas9 Protein Sequence having the D10A mutation MDKKYSIGLAIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTAR RRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHL RKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVD AKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLD NLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPE KYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQI HLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVD KGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLL FKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVL TLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFAN RNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENI VIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQEL DINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDF RKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAK YFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGG FSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSS FEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHY EKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIH LFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD SEQ ID NO: 4> S. pyogenes nCas9 Protein Sequence having the H840A mutation MDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGETAEATRLKRTAR RRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKYPTIYHL RKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVD AKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLD NLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPE KYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNREDLLRKQRTFDNGSIPHQI HLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVD KGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLL FKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVL TLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDKQSGKTILDFLKSDGFAN RNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVKVVDELVKVMGRHKPENI VIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKLYLYYLQNGRDMYVDQEL DINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNYWRQLLNAKLITQRKF DNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVITLKSKLVSDF RKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKATAK YFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGG FSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSS FEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHY EKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIH LFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGD In some embodiments, the Cas9 variants having mutations other than D10A or H840A are used, which e.g., result in nuclease inactivated Cas9 (dCas9). Such mutations, by way of example, include other amino acid substitutions at D10 and H840, or other substitutions within the nuclease domains of Cas9 (e.g., substitutions in the HNH nuclease subdomain and/or the RuvC1 subdomain). In some embodiments, variants of dCas9 are provided which are at least
about 70% identical, at least about 80% identical, at least about 90% identical, at least about 95% identical, at least about 98% identical, at least about 99% identical, at least about 99.5% identical, or at least about 99.9% to SEQ ID NO: 3 or 4. In some embodiments, the nickase of the present invention comprises the amino acid sequence as set forth in SEQ ID NO: 3 or SEQ ID NO:4 and further comprises the R221K and N394K mutations that was previously shown to improve Cas9 nuclease activity (Spencer, Jeffrey M., and Xiaoliu Zhang. "Deep mutational scanning of S. pyogenes Cas9 reveals important functional domains." Scientific reports 7.1 (2017): 1-14). In some embodiments, variants of dCas9 are provided having amino acid sequences which are shorter, or longer than SEQ ID NO: 3 or 4, by about 5 amino acids, by about 10 amino acids, by about 15 amino acids, by about 20 amino acids, by about 25 amino acids, by about 30 amino acids, by about 40 amino acids, by about 50 amino acids, by about 75 amino acids, by about 100 amino acids or more. Some aspects of the disclosure provide Cas9 proteins that have different PAM specificities. Typically, Cas9 proteins, such as Cas9 from S. pyogenes (spCas9), require a canonical NGG PAM sequence to bind a particular nucleic acid region. This may limit the ability to of the Cas9 protein to bind to a particular nucleotide sequence within a genome. Accordingly, in some embodiments, any of the Cas proteins provided herein may be capable of binding a nucleotide sequence that does not contain a canonical (e.g., NGG) PAM sequence. For example, Cas9 proteins that bind non-canonical PAM sequences have been described in Kleinstiver, B. P., et al., “Engineered CRISPR-Cas9 nucleases with altered PAM specificities” Nature 523, 481- 485 (2015); and Kleinstiver, B. P., et al., “Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition” Nature Biotechnology 33, 1293-1298 (2015); Walton, Russell T., et al. "Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants." Science 368.6488 (2020): 290-296 the entire contents of each are hereby incorporated by reference. In some embodiments, the Cas9 protein of the present invention comprises the following mutations D1135L, S1136W, G1218K, E1219Q, R1335Q and T1337R. This variant is capable of targeting an expanded set of NGN PAMs (i.e. the “SpG” variant as described in Walton, Russell T., et al. "Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants." Science 368.6488 (2020): 290-296). In some embodiments, the Ca9 variant SpG can further be optimized to develop a near-PAMless
SpCas9 variant named “SpRY” that furthers includes the 5 additional mutations A61R, L1111R, N1317R, A1322R, and R1333P. According to the present invention, the second component of the prime editing enzyme herein disclosed comprises a reverse transcriptase. The disclosure contemplates any wild type reverse transcriptase obtained from any naturally- occurring organism or virus, or obtained from a commercial or non-commercial source. In addition, the reverse transcriptases can include any naturally-occurring mutant RT, engineered mutant RT, or other variant RT, including truncated variants that retain function. The RTs may also be engineered to contain specific amino acid substitutions, such as those specifically disclosed herein. A person of ordinary skill in the art will recognize that wild type reverse transcriptases, including but not limited to, Moloney Murine Leukemia Virus (M-MLV); Human Immunodeficiency Virus (HIV) reverse transcriptase and avian Sarcoma-Leukosis Virus (ASLV) reverse transcriptase, which includes but is not limited to Rous Sarcoma Virus (RSV) reverse transcriptase, Avian Myeloblastosis Virus (AMV) reverse transcriptase, Avian Erythroblastosis Virus (AEV) Helper Virus MCAV reverse transcriptase, Avian Myelocytomatosis Virus MC29 Helper Virus MCAV reverse transcriptase, Avian Reticuloendotheliosis Virus (REV-T) Helper Virus REV-A reverse transcriptase, Avian Sarcoma Virus UR2 Helper Virus UR2AV reverse transcriptase, Avian Sarcoma Virus Y73 Helper Virus YAV reverse transcriptase, Rous Associated Virus (RAV) reverse transcriptase, and Myeloblastosis Associated Virus (MAV) reverse transcriptase may be suitably used in the subject methods and composition described herein. In some embodiments, the reverse transcriptase that is usable in the prime editing enzymes of the present invention comprises an amino acid sequence having at least 90% of identity with the amino acid sequence as set forth in SEQ ID NO:5. SEQ ID NO:5 > M-MLV reverse transcriptase TLNIEDEYRLHETSKEPDVSLGSTWLSDFPQAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQE ARLGIKPHIQRLLDQGILVPCQSPWNTPLLPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSG LPPSHQWYTVLDLKDAFFCLRLHPTSQPLFAFEWRDPEMGISGQLTWTRLPQGFKNSPTLFDEALHRDL ADFRIQHPDLILLQYVDDLLLAATSELDCQQGTRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEG QRWLTEARKETVMGQPTPKTPRQLREFLGTAGFCRLWIPGFAEMAAPLYPLTKTGTLFNWGPDQQKAYQ
EIKQALLTAPALGLPDLTKPFELFVDEKQGYAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVA AIAVLTKDAGKLTMGQPLVILAPHAVEALVKQPPDRWLSNARMTHYQALLLDTDRVQFGPVVALNPATL LPLPEEGLQHNCLDILAEAHGTRPDLTDQPLPDADHTWYTDGSSLLQEGQRKAGAAVTTETEVIWAKAL PAGTSAQRAELIALTQALKMAEGKKLNVYTDSRYAFATAHIHGEIYRRRGLLTSEGKEIKNKDEILALL KALFLPKRLSIIHCPGHQKGHSAEARGNRMADQAARKAAITETPDTSTLLIENSSP In some embodiments, the reverse transcriptase that is usable in the prime editing enzymes of the present invention has the amino acid sequence as set forth in SEQ ID NO:5 but comprises one or more of the following mutations: P51L, S67K, E69K, L139P, T197A, D200N, H204R, F209N, E302K, E302R, T306K, F309N, W313F, T330P, L345G, L435G, N454K, D524G, E562Q, D583N, H594Q, L603W, E607K, or D653N. In some embodiments, the reverse transcriptase that is usable in the prime editing enzymes of the present invention has the amino acid sequence as set forth in SEQ ID NO:5 but comprises four mutations: D200N, T306K, W313F, and T330P. In some embodiments, the reverse transcriptase is fused to the N-terminus of the defective CRISPR/Cas nuclease. In some embodiments, the reverse transcriptase is fused to the C- terminus of the defective CRISPR/Cas nuclease. In some embodiments, the defective CRISPR/Cas nuclease and the reverse transcriptase are fused via a linker. In some embodiments, the linker comprises a (GGGGS)n (SEQ ID NO:6), a (G)n, an (EAAAK)n (SEQ ID NO: 7), a (GGS)n, an SGSETPGTSESATPES (SEQ ID NO: 8) motif (see, e.g., Guilinger J P, Thompson D B, Liu D R. Additional suitable linker motifs and linker configurations will be apparent to those of skill in the art. In some embodiments, suitable linker motifs and configurations include those described in Chen et al., Fusion protein linkers: property, design and functionality. Adv Drug Deliv Rev.2013; 65(10):1357-69, the entire contents of which are incorporated herein by reference). In some embodiments, the prime editing enzyme may comprise additional features. Other exemplary features that may be present are localization sequences, such as nuclear localization sequences (NLS), cytoplasmic localization sequences, export sequences, such as nuclear export sequences, or other localization sequences, as well as sequence tags that are useful for solubilization, purification, or detection of the fusion proteins. In particular, the prime editing enzyme incorporates one or more nuclear localization sequence. As used herein, the term “nuclear localization sequence” or “NLS” refers to an amino acid sequence that promotes import of a protein into the cell nucleus, for example, by nuclear transport. Nuclear localization sequences are known in the art and would be apparent to the skilled artisan. For example, NLS
sequences are described in Plank et al., international PCT application, PCT/EP2000/011690, filed Nov. 23, 2000, published as WO/2001/038547 on May 31, 2001, the contents of which are incorporated herein by reference for its disclosure of exemplary nuclear localization sequences. In some embodiments, a NLS comprises the amino acid sequence PKKKRKV (SEQ ID NO: 9) or MDSLLMNRRKFLYQFKNVRWAKGRRETYLC (SEQ ID NO: 10). Various prime editing enzymes are known in the art and typically include the PE1, PE2, PEmax, PEmax-SpRY prime editors. As used herein, the term “PE1 protein” refers to a prime editing enzyme that consists of a fusion protein having the following structure [NLS]-[Cas9(H840A)]-[linker]- [MMLV_RT(wt)], and having the amino acid sequence as set forth in SEQ ID NO:11. SEQ ID NO:11 > PE1 prime editor MKRTADGSEFESPKKKRKVDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGAL LFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFG NIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLV QTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDL AEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDE HHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNRE DLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWM TRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGM RKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKD KDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRD KQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTV KVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEK LYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKM KNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDK LIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYD VRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKV LSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSK KLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGN ELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSA YNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDL SQLGGDSGGSSGGSSGSETPGTSESATPESSGGSSGGSSTLNIEDEYRLHETSKEPDVSLGSTWLSDFP QAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPWNTPLL PVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLF AFEWRDPEMGISGQLTWTRLPQGFKNSPTLFDEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQ QGTRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPRQLREFLGT AGFCRLWIPGFAEMAAPLYPLTKTGTLFNWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFVDEKQG YAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAVLTKDAGKLTMGQPLVILAPHAVEALV KQPPDRWLSNARMTHYQALLLDTDRVQFGPVVALNPATLLPLPEEGLQHNCLDILAEAHGTRPDLTDQP LPDADHTWYTDGSSLLQEGQRKAGAAVTTETEVIWAKALPAGTSAQRAELIALTQALKMAEGKKLNVYT DSRYAFATAHIHGEIYRRRGLLTSEGKEIKNKDEILALLKALFLPKRLSIIHCPGHQKGHSAEARGNRM ADQAARKAAITETPDTSTLLLIENSSPSGGSKRTADGSEFEPKKKRKV As used herein, the term “PE2 protein” refers to a prime editing enzyme that consists of a fusion protein having the following structure [NLS]-[Cas9(H840A)]-[linker]-
[MMLV_RT(D200N)(T330P)(L603W)(T306K)(W313F)] and having the amino acid sequence as set forth in SEQ ID NO:12. SEQ ID NO:12 > PE2 prime editor MKRTADGSEFESPKKKRKVDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGAL LFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFG NIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLV QTYNQLFEENPINASGVDAKAILSARLSKSRRLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDL AEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDE HHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLNRE DLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWM TRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGM RKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKD KDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRD KQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTV KVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEK LYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKM KNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDK LIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYD VRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKV LSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSK KLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGN ELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSA YNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDL SQLGGDSGGSSGGSSGSETPGTSESATPESSGGSSGGSSTLNIEDEYRLHETSKEPDVSLGSTWLSDFP QAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPWNTPLL PVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLF AFEWRDPEMGISGQLTWTRLPQGFKNSPTLFNEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQ QGTRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPRQLREFLGK AGFCRLFIPGFAEMAAPLYPLTKPGTLFNWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFVDEKQG YAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAVLTKDAGKLTMGQPLVILAPHAVEALV KQPPDRWLSNARMTHYQALLLDTDRVQFGPVVALNPATLLPLPEEGLQHNCLDILAEAHGTRPDLTDQP LPDADHTWYTDGSSLLQEGQRKAGAAVTTETEVIWAKALPAGTSAQRAELIALTQALKMAEGKKLNVYT DSRYAFATAHIHGEIYRRRGWLTSEGKEIKNKDEILALLKALFLPKRLSIIHCPGHQKGHSAEARGNRM ADQAARKAAITETPDTSTLLIENSSPSGGSKRTADGSEFEPKKKRKV As used herein, the term “PEmax protein” refers to a prime editing enzyme that consists of a fusion protein having the following structure [NLS]-[Cas9(R221K, N394K H840A)]-[linker]- [MMLV_RT(D200N)(T330P)(L603W)(T306K)(W313F)] and having the amino acid sequence as set forth in SEQ ID NO:13 as described in Chen, Peter J., et al. "Enhanced prime editing systems by manipulating cellular determinants of editing outcomes." Cell 184.22 (2021): 5635- 5652. SEQ ID NO:13 > PEmax protein MKRTADGSEFESPKKKRKVDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGAL LFDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFG NIVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLV QTYNQLFEENPINASGVDAKAILSARLSKSRKLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDL AEDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDE HHQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLKRE DLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWM TRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGM
RKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKD KDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRD KQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTV KVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEK LYLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKM KNYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDK LIREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYD VRKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKV LSMPQVNIVKKTEVQTGGFSKESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSK KLKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGN ELALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSA YNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTIDRKRYTSTKEVLDATLIHQSITGLYETRIDL SQLGGDSGGSSGGSKRTADGSEFESPKKKRKVSGGSSGGSTLNIEDEYRLHETSKEPDVSLGSTWLSDF PQAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPWNTPL LPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPL FAFEWRDPEMGISGQLTWTRLPQGFKNSPTLFNEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDC QQGTRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPRQLREFLG KAGFCRLFIPGFAEMAAPLYPLTKPGTLFNWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFVDEKQ GYAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAVLTKDAGKLTMGQPLVILAPHAVEAL VKQPPDRWLSNARMTHYQALLLDTDRVQFGPVVALNPATLLPLPEEGLQHNCLDILAEAHGTRPDLTDQ PLPDADHTWYTDGSSLLQEGQRKAGAAVTTETEVIWAKALPAGTSAQRAELIALTQALKMAEGKKLNVY TDSRYAFATAHIHGEIYRRRGWLTSEGKEIKNKDEILALLKALFLPKRLSIIHCPGHQKGHSAEARGNR MADQAARKAAITETPDTSTLLIENSSPSGGSKRTADGSEFESPKKKRKVGSGPAAKRVKLD* As used herein, the term “PEmax-SpRY protein” refers to a prime editing enzyme that consists of a fusion protein having the following structure [NLS]-[Cas9(R221K, N394K H840A)+ D1135L, S1136W, G1218K, E1219Q, R1335Q T1337R + A61R, L1111R, N1317R, A1322R, R1333P)]-[linker]-[MMLV_RT(D200N)(T330P)(L603W)(T306K)(W313F)] to a prime editing enzyme having the amino acid sequence as set forth in SEQ ID NO:14. SEQ ID NO:14 > PEmax-SpRY protein KRTADGSEFESPKKKRKVDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALL FDSGETAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGN IVDEVAYHEKYPTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQ TYNQLFEENPINASGVDAKAILSARLSKSRKLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLA EDAKLQLSKDTYDDDLDNLLAQIGDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEH HQDLTLLKALVRQQLPEKYKEIFFDQSKNGYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLKRED LLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPFLKDNREKIEKILTFRIPYYVGPLARGNSRFAWMT RKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKVLPKHSLLYEYFTVYNELTKVKYVTEGMR KPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISGVEDRFNASLGTYHDLLKIIKDK DFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYTGWGRLSRKLINGIRDK QSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPAIKKGILQTVK VVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQNEKL YLYYLQNGRDMYVDQELDINRLSDYDVDAIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMK NYWRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKL IREVKVITLKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDV RKMIAKSEQEIGKATAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVL SMPQVNIVKKTEVQTGGFSKESIRPKRNSDKLIARKKDWDPKKYGGFLWPTVAYSVLVVAKVEKGKSKK LKSVKELLGITIMERSSFEKNPIDFLEAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAKQLQKGNE LALPSKYVNFLYLASHYEKLKGSPEDNEQKQLFVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAY NKHRDKPIREQAENIIHLFTLTRLGAPRAFKYFDTTIDPKQYRSTKEVLDATLIHQSITGLYETRIDLS QLGGDSGGSSGGSKRTADGSEFESPKKKRKVSGGSSGGSTLNIEDEYRLHETSKEPDVSLGSTWLSDFP QAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQRLLDQGILVPCQSPWNTPLL PVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAFFCLRLHPTSQPLF AFEWRDPEMGISGQLTWTRLPQGFKNSPTLFNEALHRDLADFRIQHPDLILLQYVDDLLLAATSELDCQ QGTRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPRQLREFLGK
AGFCRLFIPGFAEMAAPLYPLTKPGTLFNWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFVDEKQG YAKGVLTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAVLTKDAGKLTMGQPLVILAPHAVEALV KQPPDRWLSNARMTHYQALLLDTDRVQFGPVVALNPATLLPLPEEGLQHNCLDILAEAHGTRPDLTDQP LPDADHTWYTDGSSLLQEGQRKAGAAVTTETEVIWAKALPAGTSAQRAELIALTQALKMAEGKKLNVYT DSRYAFATAHIHGEIYRRRGWLTSEGKEIKNKDEILALLKALFLPKRLSIIHCPGHQKGHSAEARGNRM ADQAARKAAITETPDTSTLLIENSSPSGGSKRTADGSEFESPKKKRKVGSGPAAKRVKLD* The second component of the prime editing platform disclosed herein consists of a pegRNA suitable for guiding the prime editing enzyme to one target sequence located in the -115 region of HBG1 or HBG2 promoter. According to the present invention, the pegRNA comprises (a) a spacer sequence that comprises a region of complementarity to a first strand of the double-stranded target nucleic sequence located in the HBG1/2 promoters; (b) an extension arm that comprises a RT template and a primer binding site in a 5’ to 3’ orientation, wherein the primer binding site comprises a region of complementarity to a region upstream of a nick site in the second strand of the double- stranded target sequence, and wherein the RT template encodes the desired nucleotide changes (e.g. -124T>C, -123T>C, -113A>G, and a complete deletion of the entire TGACC BCL11A- binding motif localized from position -114 to -118 upstream of the TSS of HBG1/2 or a partial deletion of the BCL11A binding-motif , namely the deletion of the CC dinucleotide localized from position -114 to -115 upstream of the TSS of HBG1/2) compared to a region downstream of the nick site in the second strand of the double-stranded target sequence. The pegRNA molecule of the present invention thus comprises a spacer sequence for providing the targeting specificity. It includes a spacer that is complementary and capable of hybridization to a pre-selected target site of interest in the HBG1/2 promoters. In some embodiment, this spacer sequence can comprise from about 10 nucleotides to more than about 25 nucleotides. For example, the region of base pairing between the spacer sequence and the corresponding target site sequence can be about 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 25, or more than 25 nucleotides in length. In some embodiments, the spacer sequence is about 17-20 nucleotides in length, such as 20 nucleotides. Typically, a software program is used to identify candidate CRISPR target sequences on both strands of the DNA nucleic acid molecule containing the HBG genes based on desired guide sequence length and a CRISPR motif sequence (PAM) for a specified CRISPR enzyme. One requirement for selecting a suitable target nucleic acid is that it has a 3′ PAM site/sequence. Each target sequence and its corresponding PAM site/sequence are referred herein as a Cas-targeted site. Type II CRISPR
system, one of the most well characterized systems, needs only Cas 9 protein and a guide RNA complementary to a target sequence to affect target cleavage. For example, target sites for Cas9 from S. pyogenes, with PAM sequences NGG, may be identified by searching for 5′-Nx-NGG- 3′ both on the input sequence and on the reverse-complement of the input. Since multiple occurrences in the genome of the DNA target site may lead to nonspecific genome editing, after identifying all potential sites, the program filters out sequences based on the number of times they appear in the relevant reference genome. For those CRISPR enzymes for which sequence specificity is determined by a “seed” sequence, such as the 11-12 bp 5′ from the PAM sequence, including the PAM sequence itself, the filtering step may be based on the seed sequence. Thus, to avoid editing at additional genomic loci, results are filtered based on the number of occurrences of the seed:PAM sequence in the relevant genome. The user may be allowed to choose the length of the seed sequence. The user may also be allowed to specify the number of occurrences of the seed:PAM sequence in a genome for purposes of passing the filter. The default is to screen for unique sequences. Filtration level is altered by changing both the length of the seed sequence and the number of occurrences of the sequence in the genome. The program may in addition or alternatively provide the sequence of a guide sequence complementary to the reported target sequence(s) by providing the reverse complement of the identified target sequence(s). Further details of methods and algorithms to optimize sequence selection can be found in U.S. application Ser. No.61/836,080; incorporated herein by reference. In some embodiments, the spacer sequence is selected from the group consisting of : pegRNA spacer sequence SEQ ID NO : Spacer_10 GTTTGCCTTGTCAAGGCTAT 15 Spacer_11 TGCCTTGTCAAGGCTATTGG 16 Spacer_12 TTGCCTTGTCAAGGCTATTG 17 Spacer_13 TTTGCCTTGTCAAGGCTATT 18 Spacer_14 AGTTTGCCTTGTCAAGGCTA 19 Preferably, the spacer sequence is : Spacer_10 GTTTGCCTTGTCAAGGCTAT SEQ ID NO:15 In some embodiments, the gRNA core sequence of the pegRNA is selected from the group consisting of : Scaffold sequence SE Q
ID NO : Scaffold GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAACTTGAAAAAG 20 TGGCACCGAGTCGGTGC scaffold GTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGGCTAGTCCGTTATCAA 21 _opt CTTGAAAAAGTGGCACCGAGTCGGTGC According to the present invention, the RT template sequence encodes a single-stranded DNA molecule which is homologous to the non-target strand (and thus, complementary to the corresponding site of the target strand) but includes the desired nucleotide changes (e.g. the - 124T>C, -123T>C, -113A>G mutations and a complete deletion of the entire TGACC BCL11A-binding motif localized from position -114 to -118 upstream of the TSS of HBG1/2 or a partial deletion of the BCL11A binding-motif , namely the deletion of the CC dinucleotide localized from position -114 to -115 upstream of the TSS of HBG1/2). In particular, the synthesized single-stranded DNA product of the RT template sequence is homologous to the non-target strand and contains desired nucleotide changes (e.g. the -124T>C, -123T>C, - 113A>G mutations and a complete deletion of the entire TGACC BCL11A-binding motif localized from position -114 to -118 upstream of the TSS of HBG1/2 or a partial deletion of the BCL11A binding-motif , namely the deletion of the CC dinucleotide localized from position - 114 to -115 upstream of the TSS of HBG1/2). The single-stranded DNA product of the RT template sequence hybridizes in equilibrium with the complementary target strand sequence, thereby displacing the homologous endogenous target strand sequence. The displaced endogenous strand may be referred to in some embodiments as a 5’ endogenous DNA flap species. This 5’ endogenous DNA flap species can be removed by a 5’ flap endonuclease (e.g., FEN1) and the single-stranded DNA product, now hybridized to the endogenous target strand, may be ligated, thereby creating a mismatch between the endogenous sequence and the newly synthesized strand. The mismatch may be resolved by the cell's innate DNA repair and/or replication processes. The cellular repair of the single-strand DNA flap results in installation of the desired nucleotide changes, thereby forming a desired product. Thus, in some embodiments, the nucleotide sequence of the RT template sequence corresponds to the nucleotide sequence of the non-target strand which becomes displaced as the 5’ flap species and which overlaps with the site to be edited. In some embodiments, the RT template sequence is selected from the group consisting of: RTT sequence SEQ ID NO : RTT_10 GCCccGCCTgATA 22
RTT_10.2 GCCccGCCTTGAgATA 23 RTT_10.3 GCCccGCCTTGACCgATA 24 RTT_11 GCCAGCCTTGCCTG 25 RTT_12 AGCCTTGCCTGA 26 RTT_13 GCCTTGCCTGAT 27 RTT_14 TTGCCTGATAG 28 In some embodiments, the primer binding site sequence (PBS) is selected from the group consisting of: PBS sequence SEQ ID NO : PBS_10 GCCTTGACAAGGC 29 PBS_11 ATAGCCTTGACAA 30 PBS_12 TAGCCTTGACAAG 31 PBS_13 AGCCTTGACAAGG 32 PBS_14 CCTTGACAAGGCA 33 In some embodiments, the pegRNA of the present invention also comprises a modified prequeosine1-1 riboswitch aptamer (tevopreQ1) having the sequence of CGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA (SEQ ID NO:34). Contacting the eukaryotic cell with the prime editing platform of the present invention thus results in i) nicking the second strand of the double-stranded target sequence to form a free 3’ end at the nick site; ii) annealing the primer binding site with the region of the second strand of the double-stranded target sequence upstream of the nick site; iii) synthesizing a single strand of DNA encoded by the RT template from the free 3’ end of the second strand of the double- stranded target sequence; and iv) replacing the region downstream of the nick site in the second strand of the double-stranded target sequence with the single strand of DNA, thereby modifying the sequence of the double-stranded target sequence. In particular, the primer binding site of pegRNA binds to the primer sequence that is formed from the endogenous DNA strand of the target sequence when it becomes nicked by the prime editing platform, thereby exposing a 3’ end on the endogenous nicked strand. As explained herein, the binding of the primer sequence to the primer binding site on the extension arm of the pegRNA creates a duplex region with an exposed 3’ end (i.e., the 3’ of the primer sequence), which then provides a substrate for the reverse transcriptase to begin polymerizing a single strand of DNA from the exposed 3’ end along the length of the RT template. The sequence of the single strand DNA product is the
complement of the RT template. Reverse transcription continues towards the 5’ of the RT template until polymerization terminates. Thus, the RT template represents the portion of the extension arm that is encoded into a single strand DNA product (i.e., the 3’ single strand DNA flap containing the desired genetic edit information) by the reverse transcriptase of the prime editing enzyme complex and which ultimately replaces the corresponding endogenous DNA strand of the target sequence that sits immediate downstream of the PE-induced nick site. In some embodiments, the pegRNA of the present invention is selected from the group consisting of : Nam full length 5'-3' S e E Q I D N O : peg GTTTGCCTTGTCAAGGCTATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACAAGGC 5 10 peg TGCCTTGTCAAGGCTATTGGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCAGCCTTGCCTGATAGCCTTGACAA 6 11 peg TTGCCTTGTCAAGGCTATTGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCAGCCTTGCCTGATAGCCTTGACAAG 7 12 peg TTTGCCTTGTCAAGGCTATTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCTTGCCTGATAGCCTTGACAAGG 8 13 peg AGTTTGCCTTGTCAAGGCTAGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGCCTGATAGCCTTGACAAGGCA 9 14 peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG 0 10. ACAAGGC 3 peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 1 10. AGCCTTGACAAGGC 4 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACA 2 A10 AGGCTCCTAATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .1 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG 3 A10 ACAAGGCTCCTTATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .2 epe 4 gRN GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4
A10 CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGACCgATAGCCT .3 TGACAAGGCCTCTTCTACGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 5 A10 AGCCTTGACAAGGCTCCGGTAACGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .4 Preferably, the pegRNA sequence is selected from: Nam full length 5'-3' S e E Q I D N O : peg GTTTGCCTTGTCAAGGCTATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACAAGGC 5 10 peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG 0 10. ACAAGGC 3 peg 4 RNA GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 1 10. CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 4 AGCCTTGACAAGGC epe 4 gRN GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 3 A10 CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG .2 ACAAGGCTCCTTATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGACCgATAGCCT 4 A10 TGACAAGGCCTCTTCTACGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .3 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 5 A10 AGCCTTGACAAGGCTCCGGTAACGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .4 In some embodiments, the prime editing platform further involves the use of a second strand nicking guide RNA (“nicking guide RNA” or ngRNA”) that complexes with the prime editing platform and introduces a nick in the non-edited DNA strand in order to induce preferential replacement of the edited strand (PE3 system). In particular, the ngRNA is designed for temporal control. As used herein, the term “temporal second-strand nicking” refers to a variant of second strand nicking whereby the installation of the second nick in the unedited strand occurs only after the desired edit is installed in the edited strand. This avoids concurrent nicks on both strands that could lead to double-stranded DNA breaks. This is achieved by
designing a ngRNA with a spacer sequence that matches only the edited strand, but not the original allele. Using this strategy, mismatches between the protospacer and the unedited allele should disfavor nicking by the ngRNA until after the editing event on the PAM strand takes place. In some embodiments, the ngRNA is selected from the group consisting of: ngRNA sequence SEQ ID NO : ngRNA10.1 TAGTCTTAGAGTATCCAGTG 46 ngRNA10.2 TAGAGTATCCAGTGAGGCCA 47 ngRNA10.3 AGAGTATCCAGTGAGGCCAG 48 ngRNA10.4 TATCCAGTGAGGCCAGGGGC 49 ngRNA10.5 GGCTAGGGATGAAGAATAAA 50 Preferably, the ngRNA is : ngRNA10.5 GGCTAGGGATGAAGAATAAA SEQ ID NO :50 The guide RNA molecules of the present invention (i.e. the pegRNA and optionally the ngRNA) can be made by various methods known in the art including cell-based expression, in vitro transcription, and chemical synthesis. The ability to chemically synthesize relatively long RNAs (as long as 200 mers or more) using TC-RNA chemistry (see, e.g., U.S. Pat. No. 8,202,983) allows one to produce RNAs with special features that outperform those enabled by the basic four ribonucleotides (A, C, G and U). In particular, the RNA molecule of the present invention can be made with recombinant technology using a host cell system or an in vitro translation-transcription system known in the art. Details of such systems and technology can be found in e.g., WO2014144761 WO2014144592, WO2013176772, US20140273226, and US20140273233, the contents of which are incorporated herein by reference in their entireties. In some embodiments, the guide RNA molecules (i.e. the pegRNA and optionally the ngRNA) may include one or more modifications. Such modifications may include inclusion of at least one non-naturally occurring nucleotide, or a modified nucleotide, or analogs thereof. Modified nucleotides may be modified at the ribose, phosphate, and/or base moiety. Modified nucleotides may include 2’-O-methyl analogs, 2’-deoxy analogs, or 2’-fluoro analogs. The nucleic acid backbone may be modified, for example, a phosphorothioate backbone may be used. The use of locked nucleic acids (LNA) or bridged nucleic acids (BNA) may also be possible. Further
examples of modified bases include, but are not limited to, 2-aminopurine, 5-bromo-uridine, pseudouridine, inosine, 7-methylguanosine. In some embodiments, the prime editing platform further involves the transient expression of an engineered DNA mismatch repair (MMR) inhibiting protein (PE5 system) for enhancing the efficiency of the prime editing. In particular, the MMR inhibiting protein is selected among catalytically impaired mutants of human MSH2, MSH6, PMS2, and MLH1. More preferably, prime editing platform involves the use of a dominant negative MMR protein (MLH1dn) as described in Chen, Peter J., et al. "Enhanced prime editing systems by manipulating cellular determinants of editing outcomes." Cell 184.22 (2021): 5635-5652 and having the amino acid sequence as set forth in SEQ ID NO:51. SEQ ID NO:51 > MLH1dn protein MGGRRVRWEVYISRAGLVNRQISSSVPNTTHYKEIREKRRVRRNIRATMSFVAGVIRRLDETVVNRIAA GEVIQRPANAIKEMIENCLDAKSTSIQVIVKEGGLKLIQIQDNGTGIRKEDLDIVCERFTTSKLQSFED LASISTYGFRGEALASISHVAHVTITTKTADGKCAYRASYSDGKLKAPPKPCAGNQGTQITVEDLFYNI ATRRKALKNPSEEYGKILEVVGRYSVHNAGISFSVKKQGETVADVRTLPNASTVDNIRSIFGNAVSREL IEIGCEDKTLAFKMNGYISNANYSVKKCIFLLFINHRLVESTSLRKAIETVYAAYLPKNTHPFLYLSLE ISPQNVDVNVHPTKHEVHFLHEESILERVQQHIESKLLGSNSSRMYFTQTLLPGLAGPSGEMVKSTTSL TSSSTSGSSDKVYAHQMVRTDSREQKLDAFLQPLSKPLSSQPQAIVTEDKTDISSGRARQQDEEMLELP APAEVAAKNQSLEGDTTKGTSEMSEKRGPTSSNPRKRHREDSDVEMVEDDSRKEMTAACTPRRRIINLT SVLSLQEEINEQGHEVLREMLHNHSFVGCVNPQWALAQHQTKLYLLNTTKLSEELFYQILIYDFANFGV LRLSEPAPLFDLAMLALDSPESGWTEEDGPKEGLAEYIVEFLKKKAEMLADYFSLEIDEEGNLIGLPLL IDNYVPPLEGLPIFILRLATEVNWDEEKECFESLSKECAMFYSIRKQYISEESTLSGQQSEVPGSIPNS WKWTVEHIVYKALRSHILPPKHFTEDGNILQLANLPDLYKVF* In some embodiments, the prime editing platform preferably consists in one of the following combinations : Prime editor pegRNA ngRNA MLH1 PEmax (SEQ ID pegRNA10.4 (SEQ ngRNA10.5 (SEQ ID No NO:13) ID NO:41) NO :50) PEmax (SEQ ID pegRNA10.4 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:41) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 ( SEQ ID No NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 ( SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA10.4 (SEQ ngRNA10.5 (SEQ ID No NO:13) ID NO:45) NO :50)
PEmax (SEQ ID epegRNA10.4 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:45) NO :50) In some embodiments, the different components of the prime editing platform of the present invention are provided to the eukaryotic cell through expression from one or more expression vectors. For example, the nucleic acids encoding the pegRNA or the prime editing enzyme can be cloned into one or more vectors for introducing them into the eukaryotic cell. The vectors are typically prokaryotic vectors, e.g., plasmids, or shuttle vectors, or insect vectors, for storage or manipulation of the nucleic acid encoding the pegRNA or the prime editing enzyme herein disclosed. Preferably, the nucleic acids are isolated and/or purified. Thus, the present invention provides recombinant constructs or vectors having sequences encoding one or more of the pegRNA or prime editing enzymes described above. Examples of the constructs include a vector, such as a plasmid or viral vector, into which a nucleic acid sequence of the invention has been inserted, in a forward or reverse orientation. In some embodiments, the construct further includes regulatory sequences. A “regulatory sequence” includes promoters, enhancers, and other expression control elements (e.g., polyadenylation signals). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence, as well as inducible regulatory sequences. The design of the expression vector can depend on such factors as the choice of the eukaryotic cell to be transformed, transfected, or infected, the desired expression level, and the like. Large numbers of suitable vectors and promoters are known to those of skill in the art, and are commercially available. Appropriate cloning and expression vectors for use with eukaryotic hosts are also described in e.g., Sambrook et al. (2001, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press). The vector can be capable of autonomous replication or integration into a host DNA. The vector may also include appropriate sequences for amplifying expression. In addition, the expression vector preferably contains one or more selectable marker genes to provide a phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or neomycin resistance for eukaryotic cell cultures, or such as tetracycline or ampicillin resistance in E. coli. Any of the procedures known in the art for introducing foreign nucleotide sequences into host cells may be used. Examples include the use of calcium phosphate transfection, polybrene, protoplast fusion, electroporation, nucleofection, liposomes, microinjection, naked DNA, plasmid vectors, viral vectors, both episomal and integrative, and any of the other well-known methods for introducing cloned genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell.
In some embodiments, the different components of the prime editing platform of the present invention are provided to the population of cells through the use of an RNA-encoded system. For instance, the editing system may be provided to the population of cells through the use of a chemically modified mRNA-encoded prime editor together with the pegRNA. In particular, engineered RNA-encoded prime editing enzymes are prepared by introducing various chemical modifications to both mRNA that encoded the prime editing enzyme and guide RNA. In particular said modifications consist in uridine depleted mRNAs modified with 5- methoxyuridine: synonymous codons may be introduced to deplete uridines as much as possible without altering the coding sequence and replaced all the remaining uridines with 5- methoxyuridine. Said optimized editing system exhibits higher editing efficiency at some genomic sites compared to DNA-encoded system. It is also possible to encapsulate the modified mRNA and guide RNA into lipid nanoparticle (LNP) for allowing lipid nanoparticle (LNP)- mediated delivery. In some embodiments, the different components of the prime editing platform of the present invention are provided to the population of cells through the use of ribonucleoprotein (RNP) complexes. For instance the prime editing enzyme can be pre-complexed with one or more pegRNAs to form a ribonucleoprotein (RNP) complex. The RNP complex can thus be introduced into the eukaryotic cell. Introduction of the RNP complex can be timed. The cell can be synchronized with other cells at G1, S, and/or M phases of the cell cycle. RNP delivery avoids many of the pitfalls associated with mRNA, DNA, or viral delivery. Typically, the RNP complex is produced simply by mixing the proteins (i.e. the prime editing enzyme) and one or more pegRNAs in an appropriate buffer. This mixture is incubated for 5-10 min at room temperature before electroporation. Electroporation is a delivery technique in which an electrical field is applied to one or more cells in order to increase the permeability of the cell membrane. In some embodiments, genome editing efficiency can be improved by adding a transfection enhancer oligonucleotide. Methods of treatment: A further object of the present invention relates to a method for increasing fetal hemoglobin levels in a subject in need thereof, the method comprising transplanting a therapeutically effective amount of the population of eukaryotic cells obtained by the method as above described.
In some embodiments, the population of cell is autologous to the subject, meaning the population of cells is derived from the same subject. In some embodiments, the subject has been diagnosed with a hemoglobinopathy. The method of the present invention is thus particularly suitable for the treatment of hemoglobinopathies. In some embodiments, the β-hemoglobinopathy is a sickle cell disease. In some embodiments, the hemoglobinopathy is a β-thalassemia. In some embodiments, the cells are formulated by first harvesting them from their culture medium, and then washing and concentrating the cells in a medium and container system suitable for administration (a "pharmaceutically acceptable" carrier) in a treatment-effective amount. Suitable infusion medium can be any isotonic medium formulation, typically normal saline, Normosol R (Abbott) or Plasma-Lyte A (Baxter), but also 5% dextrose in water or Ringer's lactate can be utilized. The infusion medium can be supplemented with human serum albumin. A treatment-effective amount of cells in the composition is dependent on the relative representation of the cells with the desired specificity, on the age and weight of the recipient, and on the severity of the targeted condition. These amount of cells can be as low as approximately 103/kg, preferably 5x103/kg; and as high as 107/kg, preferably 108/kg. The number of cells will depend upon the ultimate use for which the composition is intended, as will the type of cells included therein. Typically, the minimal dose is 2 million of cells per kg. Usually 2 to 20 million of cells are injected in the subject. The desired purity can be achieved by introducing a sorting step. For uses provided herein, the cells are generally in a volume of a liter or less, can be 500 ml or less, even 250 ml or 100 ml or less. The clinically relevant number of cells can be apportioned into multiple infusions that cumulatively equal or exceed the desired total amount of cells. Kits This invention further provides kits containing reagents for performing the above-described methods, including all component of the prime editing platform as disclosed herein for performing mutagenesis. To that end, one or more of the reaction components, e.g., guide RNA molecules (i.e. the pegRNA and optionally the ngRNA), and nucleic acid molecules encoding for the prime editing enzymes for the methods disclosed herein can be supplied in the form of
a kit for use. In some embodiments, the kit comprises one or more prime editing enzymes and one or more guide RNA molecules (i.e. the pegRNA and optionally the ngRNA). In some embodiments, the kit consists of one of the following combinations: Prime editor pegRNA ngRNA MLH1 PEmax (SEQ ID pegRNA10.4 (SEQ ngRNA10.5 ( SEQ ID No NO:13) ID NO:41) NO :50) PEmax (SEQ ID pegRNA10.4 (SEQ ngRNA10.5 ( SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:41) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 (SEQ ID No NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA10.2 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:43) NO :50) PEmax (SEQ ID epegRNA10.4 (SEQ ngRNA10.5 (SEQ ID No NO:13) ID NO:45) NO :50) PEmax (SEQ ID epegRNA10.4 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID NO:51) NO:13) ID NO:45) NO :50) In some embodiments, the kit can include one or more other reaction components. In some embodiments, an appropriate amount of one or more reaction components is provided in one or more containers or held on a substrate. Examples of additional components of the kits include, but are not limited to, one or more host cells, one or more reagents for introducing foreign nucleotide sequences into host cells, one or more reagents (e.g., probes or PCR primers) for detecting expression of the guide RNA or prime editing enzymes or verifying the target nucleic acid's status, and buffers or culture media for the reactions. The kit may also include one or more of the following components: supports, terminating, modifying or digestion reagents, osmolytes, and an apparatus for detection. The components used can be provided in a variety of forms. For example, the components (e.g., enzymes, RNAs, probes and/or primers) can be suspended in an aqueous solution or as a freeze-dried or lyophilized powder, pellet, or bead. In the latter case, the components, when reconstituted, form a complete mixture of components for use in an assay. The kits of the invention can be provided at any suitable temperature. For
example, for storage of kits containing protein components or complexes thereof in a liquid, it is preferred that they are provided and maintained below 0° C., preferably at or below −20° C., or otherwise in a frozen state. The kits can also include packaging materials for holding the container or combination of containers. Typical packaging materials for such kits and systems include solid matrices (e.g., glass, plastic, paper, foil, micro-particles and the like) that hold the reaction components or detection probes in any of a variety of configurations (e.g., in a vial, microtiter plate well, microarray, and the like). The kits may further include instructions recorded in a tangible form for use of the components. The invention will be further illustrated by the following figures and examples. However, these examples and figures should not be interpreted in any way as limiting the scope of the present invention. FIGURES: Figure 1: Screening of pegRNAs and ngRNAs targeting the HBG1/2 promoters in K562 cells. (A) Upper panel. Schematic representation of the β-globin locus on chromosome (chr) 11 including the HBG1 and HBG2 genes and their respective promoters (prom). Lower panel. Hereditary persistence of fetal hemoglobin (HPFH) mutations (highlighted in bold), in HBG promoters generate activators (GATA1, KLF1) BSs (black boxes) close to the BCL11A repressor BS (highlighted in grey). (B) Schematic representation of pegRNAs targeting the -115 region of the HBG1/2 promoters. The pegRNAs contain from 5’ to 3’, the spacer sequence (black/dark grey arrows), the scaffold sequence (grey lines) and the 3’ extension composed of the reverse transcription template (RTT, grey light boxes) and the primer binding site (PBS, white boxes). pegRNA10 is compatible with a prime editor harboring the Cas9n that recognizes an NGG PAM (PEmax), while pegRNA11 to pegRNA14 work with a PAM-less (SpRY) prime editor (PEmax-SpRY). pegRNA10 to pegRNA14 were designed to delete the BCL11A-binding motif (TGACC from -114 to -118 upstream of the TSS) (grey light box), insert the -113 A>G (GATA1 BS) and the -123/-124 T>C (KLF1 BS) mutations. (C) Frequency (%) of InDels generated by the pegRNAs, described in panel B, using a Cas9 (pegRNA10) or a Cas9-SpRY (pegRNA11 to pegRNA14) nuclease in K562 cells. Bars represent the mean ± SEM of 3 biologically independent replicates.
(D) Frequency (%) of InDels generated by ngRNAs (ngRNA10.1 to ngRNA10.5) and Cas9 nuclease in K562 cells. Bars represent the mean ± SEM of 3 biologically independent experiments. (C, D) MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. PCR products were subjected to Sanger sequencing and InDels were measured using the TIDE software. Figure 2: Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.1 in K562 cells. (A) Schematic representation of epegRNA10.1 and ngRNAs (ngRNA10.3 and ngRNA10.5). The epegRNAs contain the tevopreQ1 sequence fused to the 3’ end of the pegRNA via a linker. The epegRNA10.1 deletes the entire BCL11A BS (∆5_BCL11A) and inserts the GATA1 (-113 A>G) and the KLF1 (-123/-124 T>C) BSs. (B) Left panel. Percentage of NGS reads with: (i) desired edits alone, (ii) desired edits with alternative repair events with or without additional mutations, and (iii) Indels in GFPhigh K562 cells. The reads with InDels include InDels located at the nicking site of the epegRNA and of the ngRNA10.3 or ngRNA10.5. MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ± SEM of 3 biologically independent replicates. Right panel. Example of NGS reads. The top line corresponds to the wild-type (WT) sequence of the HBG1/2 promoters aligned to the PBS/RTT of the epegRNA. The different editing profiles are displayed below the WT sequence. The KLF1 BS and the GATA1 BS are respectively highlighted in grey and bold.The desired base conversions are indicated in lower case. (C) Schematic representation of the resolution of the 3’-5’ flap intermediates using the epegRNA10.1. Figure 3: Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.2 in K562 cells. (A) Schematic representation of epegRNA10.2 and ngRNAs (ngRNA10.3 and ngRNA10.5). The epegRNAs contain the tevopreQ1 sequence fused to the 3’ end of the pegRNA via a linker. The epegRNA10.2 deletes two cytosines of the BCL11A BS (∆2_BCL11A),and insert the GATA1 and the KLF1 BSs. (B) Left panel. Percentage of NGS reads with: (i) desired edits alone, (ii) desired edits with alternative mutations (alt. mut.), (iii) partial edits and (iv) Indels in GFPhigh K562 cells. The reads with InDels include InDels located at the nicking site of the epegRNA and of the
ngRNA10.3 or ngRNA10.5. MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ± SEM of 3 biologically independent replicates. Right panel. Example of NGS reads. The top line corresponds to the wild-type (WT) sequence of the HBG1/2 promoters aligned to the PBS/RTT of the epegRNA. The different editing profiles are displayed below the WT sequence. The KLF1 BS and the GATA1 BS are respectively highlighted in grey and bold. The desired base conversions are indicated in lower case. (C) Schematic representation of the resolution of the 3’-5’ flap intermediates using epegRNA10.2. Figure 4: Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.3 in K562 cells. (A) Schematic representation of epegRNA10.3 and ngRNAs (ngRNA10.3 and ngRNA10.5). The epegRNAs contain the tevopreQ1 sequence fused to the 3’-end of the pegRNA via a linker. The epegRNA10.3 inserts the GATA1 and the KLF1 BSs. (B) Left panel. Percentage of NGS reads with: (i) desired edits alone, (ii) desired edits with alternative mutations (alt. mut.), (iii) partial edits and (iv) Indels in GFPhigh K562 cells. The reads with InDels include InDels located at the nicking site of the epegRNA and of the ngRNA10.3 or ngRNA10.5. MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ± SEM of 3 biologically independent replicates. Right panel. Example of NGS reads. The top line corresponds to the wild-type (WT) sequence of the HBG1/2 promoters aligned to the PBS/RTT of the epegRNA. The different editing profiles are displayed below the WT sequence. The KLF1 BS and the GATA1 BS are respectively highlighted in grey and bold. The desired base conversions are indicated in lower case. Statistical significance was assessed using one-way test ANOVA with multiple comparisons.* p<0.05** p<0,005, ***p<0,0005, **** p<0,0001 Figure 5: Optimization of the prime editing efficiency in K562 cells. Percentage of NGS reads containing the desired edits (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (Partial edit 1: 1 mutation generating the GATA1 BS and 2-bp deletion in the BCL11A BS; partial edit 2: only 1 mutation generating the GATA1 BS) or InDels (located at the nicking induced by the epegRNA or the ngRNA) generated using the epegRNA10.2 or the epegRNA10.4 with the ngRNA10.5 in the -115 region of the HBG1/2 promoters. MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean
± SEM of 3-4 biologically independent replicates. Statistical significance was assessed using two-way ANOVA with multiple comparisons. ****p<0.0001. MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Desired and partial edits did not contain InDels other than the 2-bp deletion in the BCL11A BS. Figure 6: Prime editing strategies to simultaneously disrupt the BCL11A BS and insert the GATA1 and KLF1 BS in the HBG1/2 promoters using epegRNA10.4 and ngRNA10.5 in primary SCD patients’ cells. (A) Prime editing efficiency in HSPCs: percentage of NGS reads containing the desired edit (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (1 mutation generating the GATA1 BS and 2-bp deletion in the BCL11A BS) or InDels (located at the nicking induced by the epegRNA and/or the ngRNA) generated using the epegRNA10.4 with or without the ngRNA10.5 (for PE and PEn strategies, respectively) in the -115 region of the HBG1/2 promoters. MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ± SEM of 1-3 technical replicates for each donor (n=3). (B) Prime editing efficiency in BFU-E bulk populations: percentage of NGS reads containing the desired edit (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (1 mutation generating the GATA1 BS and 2-bp deletion in the BCL11A BS) or InDels (located at the nicking induced by the epegRNA and/or the ngRNA) generated using the epegRNA10.4 with or without the ngRNA10.5 (for PE and PEn strategies, respectively) in the -115 region of the HBG1/2 promoters. Samples transfected only with the mRNA of the PEmax or the PEnmax were used as controls. Data represent the mean ± SEM of 1-3 technical replicates for each donor (n=2). (C) HbF and HbS level measured by CE-HPLC in bulk BFU-Es derived from treatedHSPCs. Samples transfected only with the mRNA of the PEmax or the PEnmax were used as controls. Data represent the mean ± SEM of 1-3 technical replicates for each donor (n=2). Figure 7: Optimization of prime editing efficiency in HSPCs. (A) Prime editing efficiency in HSPCs: percentage of NGS reads containing the desired edit (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (1 mutation generating the GATA1 BS and the 2-bp deletion in the BCL11A BS) or InDels (located at the nicking induced by the epegRNA and/or the ngRNA) generated using the epegRNA10.4 and the PEnmax in the -115 region of the HBG1/2 promoters,
with or without treatment with deoxynucleosides (dN). MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ± SEM of 2 biological replicates (n=1 donor). (B) Prime editing efficiency in HSPCs: percentage of NGS reads containing the desired edit (3 mutations: 1 generating the GATA1 BS, 1 generating the KLF1 BS and the 2-bp deletion in the BCL11A BS), the partial edits (1 mutation generating the GATA1 BS and the 2-bp deletion in the BCL11A BS) or InDels (located at the nicking induced by the epegRNA and/or the ngRNA) generated using the epegRNA10.4 and the PEnmax in the -115 region of the HBG1/2 promoters, with or without treatment with DNA-PK (DNA-PKi) POLQ (POLQi) inhibitor or both inhibitors (DNA-PKi + POLQi). MOCK samples transfected with TE (Tris-EDTA) buffer were used as controls. Data represent the mean ± SEM of 1 biological replicate (n=1 donor). EXAMPLE: Introduction: β-hemoglobinopathies are genetic disorders caused by mutations that reduce adult β-globin production (β-thalassemia) or generate an abnormal β-globin chain (Sickle Cell Disease, SCD). β-thalassemia and SCD are the most common inherited diseases affecting millions of people worldwide. In β-thalassemia, the reduced production of β-globin chains leads to insufficiently hemoglobinized red blood cells (RBCs) and anemia. In SCD, a single point mutation in the β- globin (HBB) gene leads to the production of a sickle βS-globin chain and the hemoglobin S (HbS) that polymerizes under hypoxic conditions. HbS polymerization leads to RBC sickling that in turn causes vaso-occlusive crises, hemolytic anemia and organ damage. Symptomatic treatments, such as RBC transfusions and supportive care, are associated with high costs, and still a poor quality of life. Nowadays, the only curative option is allogeneic transplantation of hematopoietic stem cells (HSCs), which is severely limited by immunological risks and availability of compatible donors1. The clinical severity of β-hemoglobinopathies is alleviated by the co-inheritance of genetic mutations termed hereditary persistence of fetal hemoglobin (HPFH), which cause the synthesis of the fetal γ-globin and the production of fetal hemoglobin (HbF) in adult life2. γ-globin compensates β-chain deficiency in β-thalassemia and exerts an anti-sickling effect in SCD. Naturally occurring HPFH mutations identified in the promoters of the two ^-globin genes
(HBG1/2), are known to either generate de novo DNA motifs recognized by transcriptional activators (e.g., KLF1, TAL1 and GATA1) or disrupt transcriptional repressor (e.g., LRF and BCL11A) binding sites (BSs). The disruption of the LRF or BCL11A BS with CRISPR/Cas9- mediated strategy leads to HbF reactivation and correction of the SCD phenotype3. However, the use of a nuclease can cause cell toxicity and generate several double strand break (DSB) associated with large deletions and high risk of genomic rearrangements4–6. New DSB-free genome editing tools allowed the development of novel, efficient and safer therapeutic strategies for the development of β-hemoglobinopathies. The introduction of HPFH mutations generating the KLF1 (-123/-124 T>C)7, TAL1 (-175 T>C)8 or GATA1 (-113 A>G)9 activator BSs in the HBG1/2 promoters results in HbF reactivation using base editing8. Of note, the creation of the -113 A>G mutation partially disrupt the BCL11A repressor BS. Interestingly, the co-occurrence of multiple HPFH mutations is associated with higher HbF levels compared to individual mutations10. However base editors cannot introduce simultaneous HPFH mutations in the HBG promoters, a strategy that is desirable in HSCs for clinical applications. The prime editing system is a “search and replace” genome editing technology allowing all 12 possible base conversions, insertions, deletions, or the simultaneous combination of these changes11–13 in a specific target region. This system is composed of: (1) the prime editor (PE), a fusion of a Cas9 nickase (Cas9n) and an engineered reverse transcriptase (RT) and (2) a prime editing guide RNA (pegRNA). The pegRNA contains the spacer sequence that targets a specific region in the genome and the 3’ extension sequence composed of the primer binding site (PBS) that hybridizes to the 3’ end of the nicked DNA strand to start reverse transcription and the RT- template (RTT) that contains the desired edits (DE), which are eventually inserted into the genomic DNA after reverse transcription (PE2 system). This results in an intermediate that contains two DNA flaps: a 3’ flap that contains the newly edited sequence, and a 5’ flap that contains the unedited DNA sequence, which is cleaved allowing 3’ flap ligation. The target region then contains mismatches that are repaired by copying the edited strand into the complementary strand to permanently inert the desired edit. Recently, the prime editing tool was improved to increase editing efficiency by using a single guide RNA (ngRNA) that nicks the non-edited strand (PE3 system)11 and a dominant negative MLH1 (MLH1dn) that inhibits the mismatch DNA repair pathway (MMR; PE5 system), which reincorporates the original nucleotides in the edited strand14–17. The PE by itself was also engineered to further improve its processivity and editing activity by the addition of nuclear localization signal (NLS) sequences, linkers, and two mutations in the Cas9n18. Moreover, its sequence was codon optimized,
generating the PEmax15. Despite the late advances to improve the editing efficiency, some target regions are still hard-to-edit with the original prime editing system using a nickase-based PE. To overcome this limitation, the nickase was replaced by a Cas9 nuclease to generate the prime editor nuclease (PEn - SEQ ID NO: 86) that facilitates precise insertion of DNA sequences and improves editing efficiency for some pegRNAs that are poorly efficient when used with the nickase-based PE33. This technology allows prime editing by using DSBs and DNA end joining repair pathways33. SEQ ID NO: 86 > PEnmax protein MKRTADGSEFESPKKKRKVDKKYSIGLDIGTNSVGWAVITDEYKVPSKKFKVLGNTDRHSIKKNLIGALLFDSGE TAEATRLKRTARRRYTRRKNRICYLQEIFSNEMAKVDDSFFHRLEESFLVEEDKKHERHPIFGNIVDEVAYHEKY PTIYHLRKKLVDSTDKADLRLIYLALAHMIKFRGHFLIEGDLNPDNSDVDKLFIQLVQTYNQLFEENPINASGVD AKAILSARLSKSRKLENLIAQLPGEKKNGLFGNLIALSLGLTPNFKSNFDLAEDAKLQLSKDTYDDDLDNLLAQI GDQYADLFLAAKNLSDAILLSDILRVNTEITKAPLSASMIKRYDEHHQDLTLLKALVRQQLPEKYKEIFFDQSKN GYAGYIDGGASQEEFYKFIKPILEKMDGTEELLVKLKREDLLRKQRTFDNGSIPHQIHLGELHAILRRQEDFYPF LKDNREKIEKILTFRIPYYVGPLARGNSRFAWMTRKSEETITPWNFEEVVDKGASAQSFIERMTNFDKNLPNEKV LPKHSLLYEYFTVYNELTKVKYVTEGMRKPAFLSGEQKKAIVDLLFKTNRKVTVKQLKEDYFKKIECFDSVEISG VEDRFNASLGTYHDLLKIIKDKDFLDNEENEDILEDIVLTLTLFEDREMIEERLKTYAHLFDDKVMKQLKRRRYT GWGRLSRKLINGIRDKQSGKTILDFLKSDGFANRNFMQLIHDDSLTFKEDIQKAQVSGQGDSLHEHIANLAGSPA IKKGILQTVKVVDELVKVMGRHKPENIVIEMARENQTTQKGQKNSRERMKRIEEGIKELGSQILKEHPVENTQLQ NEKLYLYYLQNGRDMYVDQELDINRLSDYDVDHIVPQSFLKDDSIDNKVLTRSDKNRGKSDNVPSEEVVKKMKNY WRQLLNAKLITQRKFDNLTKAERGGLSELDKAGFIKRQLVETRQITKHVAQILDSRMNTKYDENDKLIREVKVIT LKSKLVSDFRKDFQFYKVREINNYHHAHDAYLNAVVGTALIKKYPKLESEFVYGDYKVYDVRKMIAKSEQEIGKA TAKYFFYSNIMNFFKTEITLANGEIRKRPLIETNGETGEIVWDKGRDFATVRKVLSMPQVNIVKKTEVQTGGFSK ESILPKRNSDKLIARKKDWDPKKYGGFDSPTVAYSVLVVAKVEKGKSKKLKSVKELLGITIMERSSFEKNPIDFL EAKGYKEVKKDLIIKLPKYSLFELENGRKRMLASAGELQKGNELALPSKYVNFLYLASHYEKLKGSPEDNEQKQL FVEQHKHYLDEIIEQISEFSKRVILADANLDKVLSAYNKHRDKPIREQAENIIHLFTLTNLGAPAAFKYFDTTID RKRYTSTKEVLDATLIHQSITGLYETRIDLSQLGGDSGGSSGGSKRTADGSEFESPKKKRKVSGGSSGGSTLNIE DEYRLHETSKEPDVSLGSTWLSDFPQAWAETGGMGLAVRQAPLIIPLKATSTPVSIKQYPMSQEARLGIKPHIQR LLDQGILVPCQSPWNTPLLPVKKPGTNDYRPVQDLREVNKRVEDIHPTVPNPYNLLSGLPPSHQWYTVLDLKDAF FCLRLHPTSQPLFAFEWRDPEMGISGQLTWTRLPQGFKNSPTLFNEALHRDLADFRIQHPDLILLQYVDDLLLAA TSELDCQQGTRALLQTLGNLGYRASAKKAQICQKQVKYLGYLLKEGQRWLTEARKETVMGQPTPKTPRQLREFLG KAGFCRLFIPGFAEMAAPLYPLTKPGTLFNWGPDQQKAYQEIKQALLTAPALGLPDLTKPFELFVDEKQGYAKGV LTQKLGPWRRPVAYLSKKLDPVAAGWPPCLRMVAAIAVLTKDAGKLTMGQPLVILAPHAVEALVKQPPDRWLSNA RMTHYQALLLDTDRVQFGPVVALNPATLLPLPEEGLQHNCLDILAEAHGTRPDLTDQPLPDADHTWYTDGSSLLQ EGQRKAGAAVTTETEVIWAKALPAGTSAQRAELIALTQALKMAEGKKLNVYTDSRYAFATAHIHGEIYRRRGWLT SEGKEIKNKDEILALLKALFLPKRLSIIHCPGHQKGHSAEARGNRMADQAARKAAITETPDTSTLLIENSSPSGG SKRTADGSEFESPKKKRKVGSGPAAKRVKLD
Here we exploited the capacity of prime editing (i.e., PE and Pen) to simultaneously generate multiple HPFH mutations and disrupt the BCL11A repressor BS in the -115 region of the HBG1/2 γ-globin promoters to further boost HbF expression and correct the β-thalassemia and SCD phenotypes. Specifically, we selected two potent HPFH/HPFH-like mutations generating BSs for the GATA1 and KLF1 activators, that are associated with a strong γ-globin reactivation in patients and experimental models7,19 (Figure 1A). Materials and Methods: K562 cell culture Human erythroleukemia K562 cells were maintained at a concentration of 5x105cells/ml in RPMI 1640 containing glutamine (Gibco) and supplemented with 10% fetal bovine serum (Gibco), 2% Hepes (Life Technologies), 1% sodium pyruvate (Life Technologies), and 1% penicillin and streptomycin (Life Technologies) at 37°C and 5%CO2. HSPC purification and culture We obtained human non-mobilized peripheral blood CD34+ HSPCs from patients with SCD from the “Hôpital Necker-Enfants malades” Hospital (Paris, France). Healthy donors were either obtained from the “Hôpital Necker-Enfants malades” Hospital (Paris, France). All experiments were performed in accordance with the Declaration of Helsinki. The study was approved by the regional investigational review board (reference: DC 2022-5364, CPP Ile-de- France II “Hôpital Necker-Enfants malades”). HSPCs were purified by immunomagnetic selection with AutoMACS (Miltenyi Biotec) after immunostaining with the CD34 MicroBead Kit (Miltenyi Biotec). Twenty-four hours before transfection, CD34+ cells were thawed and cultured at a concentration of 5 × 105 cells/ml in the “HSPC medium” containing StemSpan (STEMCELL Technologies) supplemented with penicillin/streptomycin (Gibco), 250 nM StemRegenin1 (STEMCELL Technologies), and the following recombinant human cytokines (PeproTech): human stem-cell factor (SCF) (300 ng/ml), Flt-3L (300 ng/ml), thrombopoietin (TPO) (100 ng/ml), and interleukin-3 (IL-3) (60 ng/ml). Plasmids Plasmids used in this study include: pMJ920 Cas9-GFP-expressing plasmid (Addgene #42234) pCMV-T7-SpRY-P2A-EGFP (Addgene #139989)
pCMV-PE2 (Addgene #258720) pCMV-PEmax (Addgene #174820) pU6-pegRNA-GG-acceptor (Addgene #132777) pU6-tevopreq1-GG-acceptor (Addgene #174038) pEF1a-hMLH1dn (Addgene #174824) Generation of pegRNA and epegRNA constructs pegRNAs were designed using the web-based tools (PrimeDesign26 https://drugthatgene.pinellolab.partners.org, pegFinder27 http://pegfinder.sidichenlab.org, and EasyPrime28, http://easy-prime.cc). epegRNAs were generated from pegRNAs by adding the tevopreQ1 sequence at the 3’ end of the pegRNA via a linker designed by the web-based tool pegLIT14 (https://peglit.liugroup.us). pegRNAs and epegRNAs were cloned as previously described11,14. Briefly, oligonucleotide duplexes containing the pegRNAs’ or the epegRNAs’ spacer , scaffold and 3’extension, with or without the tevopreQ1 sequence, were cloned using the Golden Gate Assembly method into the pU6-pegRNA-GG-acceptor (Addgene #132777) for pegRNAs and the pU6-tevopreq1-GG-acceptor (Addgene#174038) for epegRNAs. Plasmids were transformed in TOP10 and XL10-gold bacteria for pegRNAs and epegRNAs, respectively, following manufacturer’s instructions. After an overnight culture, transformed bacterial colonies were selected and amplified. Plasmid sequences were verified by Sanger sequencing. PegRNAs’ and epegRNAs’ sequences are displayed in Table S1. Generation of ngRNA constructs ngRNAs were designed using the web-based tool CRISPOR29 (http://crispor.tefor.net/ - version 4.99). We selected ngRNAs with the lowest off-target activity, which was predicted using web- based tools (COSMID, https://crispr.bme.gatech.edu/30; DeepSpCas9, http://deepcrispr.info/DeepSpCas9/31). Oligonucleotide duplexes containing the annealed ngRNA spacer were cloned into the MA128 plasmid (provided by M. Amendola, Genethon, France). Cloning strategies and plasmid selection were similar to those described for peg/epegRNAs. ngRNAs’ sequences are displayed in Table S1. Plasmid transfection For pegRNAs’ and ngRNAs’ screening, K562 cells (106 cells/condition) were transfected with 3.6 μg of Cas9-GFP (Addgene #42234) or Cas9-SpRY-GFP-expressing plasmid (Addgene #139989) and 1.2 μg of the ngRNA/pegRNAs-containing plasmids using AMAXA Cell Line
Nucleofector Kit V (VCA- 1003, Lonza) and U-16 program (Nucleofector 2b, Lonza). Cells transfected with TE (Tris EDTA) buffer were used as negative controls. 18 hours after nucleofection, transfection efficiency was evaluated by measuring GFP expression using the Gallios analyzer (Beckman Coulter). Data were analyzed using the FlowJo software (version 10.8.1). For prime editing experiments, K562 cells (106 cells/condition) were transfected with 0.25 μg of GFPmax-expressing plasmid (Lonza), 3.6 μg of PEmax- or PEmax-SpRY-containing plasmids, 1.2 μg of epegRNA- or pegRNA-containing plasmids, 0.4 μg of ngRNA-containing plasmids and 1.8 μg of MLH1dn-containing plasmids (when specified), using the AMAXA Cell Line Nucleofector Kit V (VCA-1003, Lonza) and U-16 program (Nucleofector 2b, Lonza). Cells transfected with TE (Tris EDTA) buffer were used as negative controls. Eighteen hours after nucleofection, the top 30% of GFP+ K562 cells were FACS-sorted using SH800 or MA900 Cell Sorter (Sony Biotechnology). Data were analyzed using the FlowJo software (version 10.8.1). RNA transfection 1 x 105 to 2 x 105 HSPCs from SCD donors were transfected per condition 24 h after thawing, with 3.2 or 6.4 μg of the enzyme-encoding PEmax mRNA (generated using the MEGAscript and Poly-A tailing kits, Ambion and the ARCA, Trilink, as described in Antoniou et al.34) or the PEnmax (provided by AstraZeneca) enzyme-encoding mRNA, and 200 to 400 pmol of synthetic pegRNAs (IDT) and/or ngRNA (Synthego). We used the P3 Primary Cell 4D- Nucleofector X Kit S (Lonza) and the CA-137 program (Nucleofector 4D, Lonza). In some experiments, after nucleofection, cells were treated 24h with deoxynucleosides (Sigma-Aldrich; dA, catalog no. D8668; dG, catalog no. D0901; dC, catalog no. D0776; dT, catalog no. T1895) at a final concentration of 100uM, or with small inhibitors at a final concentration of 1 μM for AZD7648 (DNA-PKi, provided by AstraZeneca), 3 μM for Polθi (ART558, MedChem Express), or a combination of both drug treatments. Cells transfected with TE buffer or with the enzyme-encoding mRNA only served as negative controls. Synthetic epegRNAs and ngRNA generation Synthetic epegRNAs were ordered from Integrated DNA Technologies (IDT). Each construct contained 2’-O-methyl modifications at the first and last three nucleotides and phosphorothioate linkages between the three first and last nucleotides.
Synthetic nicking gRNAs were obtained from Synthego with 2’-O-methyl modifications at the first and last three nucleotides as well and phosphorothioate linkages between the three first and last two nucleotides. Evaluation of editing efficiency Genomic DNA from K562 cells was extracted using the PureLink Genomic DNA Mini Kit (Invitrogen) following the manufacturer’s instructions. To evaluate InDel efficiency, on-target sites (HBG1/2 promoters and ngRNA target sites) were PCR-amplified using the Recombinant Taq DNA Polymerase (Thermo Fisher) according to the manufacturer’s instructions, and subjected to Sanger sequencing. PCR primers are listed in Table S2. Indels were evaluated using the TIDE software32 (http://shinyapps.datacurators.nl/tide/). To evaluate prime editing efficiency in K562 cells, on-target sites (HBG1/2 promoters) were PCR amplified using the Phusion High-Fidelity polymerase (New England BioLabs), the HF buffer (New England BioLabs) and primers containing specific DNA stretches (MR3 for forward primers and MR4 for reverse primers) 5’ to the sequence recognizing the on-target site. Amplicons were purified using Ampure XP beads (Beckman Coulter). Illumina-compatible barcoded DNA amplicon libraries were prepared by a 2-step PCR using the Phusion High- Fidelity polymerase (New England BioLabs), the HF buffer (New England BioLabs) and primers containing Unique Dual Index (UDI) barcodes and annealing to MR3 and MR4 sequences. Primers used for PCR amplification are listed in Table S2. Libraries were pooled, purified by High Pure PCR Product Purification Kit (Sigma-Aldrich), and sequenced using Illumina NovaSeq 6000 system (paired-end sequencing; 2×100-bp). NGS data were analyzed using a custom python pipeline that allows to align reads to a reference amplicon sequence and to count, (i) reads with sequence modifications (both expected modifications and other ones) in a window including the editing site in the proximity of the PE nicking site, and (ii) reads with Indels occurring between the epegRNA nicking site and the second nicking site, either alone or in combination with sequence modification near the PE nicking site. To evaluate prime editing efficiency, patient primary cells or bulk BFU-E were harvested using Quick Extract Solution (Biosearch Technologies) according to manufacturer’s instructions. Amplicons were generated using Phusion Flash High-Fidelity PCR Mastermix (F548, Thermo Scientific) in a 15 μL reaction, containing 1.5 μL of genomic DNA extract and 0.5 μM of target- specific primers (Forward: 5’-AAACGGTCCCTGGCTAAACT-3’ (SEQ ID NO:83) and Reverse: 5’- CCAGAAGCGAGTGTGTGGAA-3’ (SEQ ID NO:84)) with barcodes and NGS adapters. Applied PCR cycling conditions: 98 °C for 3 min, 30x (98 °C for 10 s, 60 °C for 20
s, 72 °C for 30 s). PCR products were purified using HighPrep PCR Clean-up System (MagBio Genomics). Size, purity, and concentration of amplicons were determined using a fragment analyzer (Agilent). Amplicons were subjected to the second round of PCR to add unique Illumina indexes. Indexing PCR was performed using KAPA HiFi HotStart Ready Mix (Roche), 1 ng of PCR template and 0.5 μM of indexed primers in the total reaction volume of 25 μL. PCR cycling conditions: 72 °C for 3 min, 98 °C for 30 s, 10x (98 °C for 10 s, 63 °C for 30 s, 72 °C for 3 min), 72 °C for 5 min. Indexed amplicons were purified using HighPrep PCR Clean- up System (MagBio Genomics) and analyzed using a fragment analyzer (Agilent). Samples were quantified using Qubit 4 Fluorometer (Life Technologies) and subjected to sequencing using Illumina NextSeq system according to manufacturer’s instructions. Demultiplexing of the NGS sequencing data was performed using bcl2fastq software. The fastq files were analyzed using CRISPResso2 in the prime editing mode with the quantification window of 48 and 18 for pegRNAs targeting the -115 region of HBG1/2 promoters. Prime edited override sequences were used for each site. To generate the representative alignments, the window was extended to 40 to visualize homology arm integrations of different lengths. CFC assay HSPCs were plated at a concentration of 5 × 102 cells/mL in a methylcellulose-containing medium (GFH4435, STEMCELL Technologies) under conditions supporting erythroid and granulo-monocytic differentiation. After 14 days, BFU-Es were randomly picked and collected as bulk populations (containing at least 25 colonies) to evaluate editing efficiency and globin expression. HPLC Hemoglobin tetramers from BFU-E bulks were separated by CE-HPLC using a 2-cation exchange column (PolyCAT A, PolyLC, Columbia). Samples were eluted with a gradient mixture of solution A (20 mM Bis Tris, 2 mM KCN [pH 6.5]) and solution B (20 mM Bis Tris, 2 mM KCN, 250 mM NaCl [pH 6.8]). The absorbance was measured at 415 nm. Results: Design and screening of pegRNAs and ngRNAs targeting the -115 region of the HBG1/2 promoters in K562 cells
First, we designed several pegRNAs to simultaneously insert the HPFH/HPFH-like mutations into the -115 region of the HBG1/2 promoters generating the GATA1 (-113 A>G) and the KLF1 (-123/-124 T>C) BSs and delete completely the BCL11A BS (Figure 1A). Importantly, the design of the pegRNAs requires the protospacer sequence to be located as close as possible to the first nucleotide to be edited (in our strategy the nucleotide in position -113), to reduce the length of the RTT and favor the simultaneous incorporation of all the DEs20. The pegRNA10 is compatible with the PEmax harboring the Cas9n recognizing an NGG protospacer adjacent motif (PAM) and contains a 13-nucleotide long RTT that generates a de novo BS for GATA1 (-113 A>G), deletes the entire BCL11A BS (TGACC motif from -114 to -118 upstream of the TSS), and creates the KLF1 BS (-123/-124 T>C). Of note, the protospacer recognized by pegRNA10 is located close to the first nucleotide to be edited (i.e. the -113 nucleotide located at position +4 from the PE nick) and the PBS has the recommended length of 13 nucleotides. Remarkably, the introduction of the GATA1 BS naturally disrupts the PAM recognized by the enzyme, thus preventing re-targeting of the region by the pegRNA after prime editing occurred. We also designed pegRNAs compatible with the PE-SpRY variant, which contains a nearly PAM-less Cas9n (pegRNA11 to pegRNA14)21,22 and induce the same mutations as pegRNA10. The -113 base is located in position +1 to +8 (from the PE-induced nick) (Figure 1B). We then tested the pegRNAs with a Cas9 nuclease to select the best-performing pegRNAs’ spacers that efficiently bind the HBG1/2 promoters. A recent study showed a positive correlation between the efficiency of Cas9 nuclease-mediated InDels and the prime editing frequency induced by a pegRNA20. We evaluated the InDel frequency after transfection of K562 cells with plasmids expressing individual pegRNAs and Cas9 nuclease (for pegRNA10) or Cas9-SpRY nuclease (for pegRNA11 to pegRNA14) (Figure 1C). The pegRNA10 induced the highest percentage of InDels (up to 7.5%) and therefore was selected for further analyses. Finally, we designed ngRNAs for the pegRNA10 to take advantage of the PE3 system, which has been described to be more efficient compared to PE2 system11. We designed five ngRNAs (ngRNA10.1 to ngRNA10.5) inducing a DNA nick in a region spanning from +42 to +81 nucleotides downstream of the nick induced by pegRNA10. We tested these ngRNAs by plasmid transfection in K562 cells and we selected the most efficient ones based on the efficiency of cleavage induced by the Cas9 nuclease. The ngRNA10.3 and ngRNA10.5 were
the most efficient ngRNAs (inducing 39.0% and 45.8% of InDels in K562 cells, respectively) and were selected for further analyses (Figure 1D). Simultaneous introduction of HPFH and HPFH-like mutations in the -115 regions of the HBG1/2 promoters in K562 cells. To optimize prime editing efficiency, we engineered pegRNA10 by adding the tevopreQ1 motif that enhances pegRNA stability14, generating the epegRNA10.1 construct. We also replaced the scaffold with an optimized version that favors binding of the epegRNA to the PE and increases its stability23,24. We compared the efficiency of epegRNA10.1 in combination with ngRNA10.3 or ngRNA10.5, with or without the addition of the MLH1dn (PE5max or PE3max system, respectively) by plasmid transfection in K562 cells (Figure 2A)15,25. The co- transfection with a GFPmax-encoding plasmid enabled us to measure transfection efficiency and evaluate the editing efficiency in FACS-sorted GFPhigh cells. HBG promoters were amplified by PCR and subjected to NGS sequencing. Editing efficiency was calculated using a custom Python pipeline. During the resolution of the 3’-5’ flap intermediates, the deletion of 5- nucleotides of the BCL11A BS and, as a consequence, the reduced complementarity of the 3’ flap to the opposite strand, likely determines the annealing of the GCC trinucleotide of the 3’ flap to the closest complementary motif in the DNA target region (Figure 2C). This led to an alternative DNA repair with the concomitant presence of desired edits, with or without additional mutations (Figure 2B and C). Surprisingly, alternative repair events were the most frequent (Figure 2B and C), while we achieved a low frequency of the desired edits occurring without alternative repair events (0.7-1.8%). Overall, we achieved similar frequency of desired edits (with or without alternative repair events) across all conditions (11.2% to 16.8%) (Figure 2B). Importantly, the use of the ngRNA10.3 resulted in significantly higher InDel frequencies (18.4-22%) compared to the ngRNA10.5 (2.9-3.1%) (Figure 2B). To increase complementarity of the 3’ flap to the opposite strand and reduce the occurrence of alternative repair events, epegRNA10.1 was further modified to generate epegRNA10.2 and epegRNA10.3. The epegRNA10.2 contains the same spacer as epegRNA10.1 and an RTT of 16 nucleotides inserting both GATA1 (-113 A>G) and KLF1 (-123/-124 T>C) BSs, and simultaneously deleting the 2-nucleotide-long core of the BCL11A BS (CC motif located -114 to -115 upstream of the TSS) in the -115 region of the HBG1/2 promoter (Figure 3A). Importantly, the deletion
of only two nucleotides of the BCL11A BS favored the introduction of the desired edit profile without the occurrence of alternative repair events or only with some additional mutations (Figure 3B and 3C). However, these latter mutations do not alter the generation of the GATA1 and the KLF1 activator BS, and the disruption of the BCL11A BS in the target region (Figure 3B and C). Interestingly, amongst the total edited events in each condition, one third represent partial edits inducing only the generation of the GATA1 BS and the disruption of the BCL11A BS (Figure 3B), suggesting either the low processivity of the RT leading to a partial reverse transcription of the >13-nucleotide-long RTT (compared to epegRNA10.1) or the degradation of the 3’ end newly synthesized 3’ flap (which is longer than the 3’ flap generated by epegRNA10.1). Notably, the addition of the MLH1dn-expressing plasmid (PE5max) further improved the overall editing efficiency (Figure 3B). We detected relatively low InDel frequency in all conditions, particularly when using ngRNA10.5. The epegRNA10.3 contains the same spacer as epegRNA10.1 and an 18-nucleotide-long RTT inserting only the GATA1 (-113 A>G) and the KLF1 (-123/-124 T>C) BS in the -115 region of the HBG1/2 promoter, without deleting the BCL11A BS (Figure 4A). Of note, the introduction of the GATA1 BS partially disrupt the BCL11A BS. Edited promoters contain either both activator BSs or partial edits (Figure 4B). Partials edits included mainly the introduction of only the GATA1 BS (Figure 4B), suggesting the low processivity of the RT when the RTT is longer than 13 nucleotides or the degradation of the 3’ end newly synthesized 3’ flap. Surprisingly, a low frequency of edited promoters harbored only the KLF1 BS (Figure 4B), suggesting error-prone and competitive repair pathways mechanisms involved in the resolution of the 3’ flap. As previously observed, Indel frequency tended to be lower when using ngRNA10.5 compared to ngRNA10.3. In conclusion, these results show that our strategy can simultaneously insert multiple HPFH and HPFH-like mutations in the -115 region of the HBG1/2 promoters in K562 cells. We selected the epegRNA10.2 that successfully inserted both GATA1 and KLF1 BSs and simultaneously disrupted the BCL11A repressor BS, reaching up to 34% of total editing efficiency in K562 cells. Interestingly, NGS sequencing revealed editing profiles that are indicative of competitive and alternative repair pathways underneath the resolution of the 3’ intermediates.
In order to favor the perfect introduction of the desired edits, we designed a novel epegRNA (i.e., epegRNA10.4), which contains a 7-nucleotide longer RTT compared to the epegRNA10.2 (Table S1). The elongated RTT should prevent 3’ flap degradation and favor the annealing of the 3’ flap over the 5’ flap to the complementary strand and the desired DNA repair, thus increasing the insertion of the desired edits. We compared the efficiency of these two epegRNAs in generating the desired edit using the PE3max and PE5max systems in K562 cells. The epegRNA10.4 outperformed the epegRNA10.2 in inserting the desired motif reaching up to 50% of precise edits with the PE5max (Figure 5). Importantly, InDels remained low and comparable to the frequency observed with the epegRNA10.2 (Figure 5). Simultaneous introduction of HPFH and HPFH-like mutations in the -115 regions of the HBG1/2 promoters in patient primary cells We tested our best-performing prime editing strategy (i.e., epegRNA10.4 +/- ngRNA10.5) in CD34+ HSPCs from three SCD donors using the PE3max and the PEn systems. Of note, we used also the PEn technology knowing that this tool can outperform PE strategies in inserting complex edits, such as those generated using the epegRNA10.433. In addition, the PE5max system was not tested in these cells as it was recently shown that MMR was poorly active in these cells35 and that the addition of MLH1dn does not further improve prime editing efficiency compared to the PE3max. Despite the low frequency of precise edits that was also heterogenous amongst donors, the PEnmax strategy enabled the insertion of desired edits at a higher frequency compared to PE3max (Figure 6A). Of note, the InDels were also increased in PEnmax-treated cells as a consequence of the generation of DSBs by the nuclease; these DSBs are potentially repaired by NHEJ, or other alternative endogenous repair mechanisms such as MMEJ (Figure 6A). Then, we evaluated if the introduction of these mutations is able to reactivate HbF expression in erythroid colonies derived from treated SCD HSPCs (Figures 6B and 6C). We confirmed that the prime editing efficiency was similar between HSPCs and erythroid colonies, demonstrating that these mutations persist after differentiation toward the erythroid lineage (Figure 6B). The introduction of the desired edit enabled HbF reactivation in both PE3max- and PEnmax-treated cells; however, HbF levels were higher in cells treated with the PEnmax strategy which was associated to an increased frequency of precise edits (Figure 6C). Finally, we tested different molecules that could improve prime editing efficiency in HSPCs and/or reduce InDels33,36. First, we supplemented the cell culture medium with the four deoxynucleosides after transfection, and we showed that this addition increased the frequency
of precise and partial edits in HSPCs; however, it also increased the frequency of InDels (Figure 7A). To reduce InDels and possibly favor the introduction of the precise edits, we then treated the HSPCs post-transfection with inhibitors of NHEJ or MMEJ (the main repair pathways that generate InDels after DSBs) such as inhibitors of the DNA-PK or the POLQ, respectively, that have already proven their efficiency in reducing InDels. While only the DNA- PK inhibitor increased the frequency of precise edits, the combination of both inhibitors also decreased the proportion of InDels (Figure 7B). Overall, these data confirmed that prime editing strategy can generate simultaneously multiple HPFH/HPFH-like mutations in the HBG1/2 promoter in patient primary HSPCs that lead to HbF reactivation in their erythroid progeny. TABLES: Table S1: Sequences of pegRNAs, epegRNAs and ngRNAs used in K562 cells. All sequences are shown in 5’ to 3’ orientation. Nam full length 5'-3' S e E Q I D N O : peg GTTTGCCTTGTCAAGGCTATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACAAGGC 5 10 peg TGCCTTGTCAAGGCTATTGGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCAGCCTTGCCTGATAGCCTTGACAA 6 11 peg TTGCCTTGTCAAGGCTATTGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCAGCCTTGCCTGATAGCCTTGACAAG 7 12 peg TTTGCCTTGTCAAGGCTATTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCTTGCCTGATAGCCTTGACAAGG 8 13 peg AGTTTGCCTTGTCAAGGCTAGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTT 3 RNA ATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGCCTGATAGCCTTGACAAGGCA 9 14 peg 4 RNA GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 0 10. CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG 3 ACAAGGC peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 1 AGCCTTGACAAGGC
10. 4 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgATAGCCTTGACA 2 A10 AGGCTCCTAATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .1 epe 4 gRN GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 3 A10 CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGAgATAGCCTTG .2 ACAAGGCTCCTTATCCGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTTGACCgATAGCCT 4 A10 TGACAAGGCCTCTTCTACGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .3 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGG 4 gRN CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGGCCAGCCccGCCTTGAgAT 5 A10 AGCCTTGACAAGGCTCCGGTAACGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA .4 ngRNA sequence SEQ ID NO : ngRNA10.1 TAGTCTTAGAGTATCCAGTG 46 ngRNA10.2 TAGAGTATCCAGTGAGGCCA 47 ngRNA10.3 AGAGTATCCAGTGAGGCCAG 48 ngRNA10.4 TATCCAGTGAGGCCAGGGGC 49 ngRNA10.5 GGCTAGGGATGAAGAATAAA 50 Table S2: Primers used for Sanger sequencing analysis. All primers sequences are shown in 5’ to 3’ orientation. Sanger sequencing Amplified F/ Sequence (5' to 3') region R HBGex1 F TATCCTCTTGGGGGCCCCTT (SEQ ID NO :52) R TCAGCACCTTCTTGCCATGTGCC (SEQ ID NO :53) NGS analysis Amplified F/ Sequence (5' to 3') region R HBG-pegRNA10 F GCAGCGTCAGATGTGTATAAGAGACAGAAACGGTCCCTGGCTAAACT (SEQ ID NO :54) R TGGGCTCGGAGATGTGTATAAGAGACAGCCAGAAGCGAGTGTGTGGAA (SEQ ID NO :55) REFERENCES: Throughout this application, various references describe the state of the art to which this invention pertains. The disclosures of these references are hereby incorporated by reference into the present disclosure.
1. Locatelli, F., Lucarelli, B. & Merli, P. Current and future approaches to treat graft failure after allogeneic hematopoietic stem cell transplantation. Expert Opinion on Pharmacotherapy 15, 23–36 (2014). 2. Forget, B. G. Molecular Basis of Hereditary Persistence of Fetal Hemoglobin. Annals NY Acad Sci 850, 38–44 (1998). 3. Weber, L. et al. Editing a γ-globin repressor binding site restores fetal hemoglobin synthesis and corrects the sickle cell disease phenotype. Sci. Adv.6, eaay9392 (2020). 4. Cromer, M. K. et al. Global Transcriptional Response to CRISPR/Cas9-AAV6-Based Genome Editing in CD34+ Hematopoietic Stem and Progenitor Cells. Molecular Therapy 26, 2431–2442 (2018). 5. Schep, R. et al. Impact of chromatin context on Cas9-induced DNA double-strand break repair pathway balance. Molecular Cell 81, 2216-2230.e10 (2021). 6. Boutin, J. et al. ON-Target Adverse Events of CRISPR-Cas9 Nuclease: More Chaotic than Expected. The CRISPR Journal 5, 19–30 (2022). 7. Ravi, N. S. et al. Identification of novel HPFH-like mutations by CRISPR base editing that elevate the expression of fetal hemoglobin. eLife 11, e65421 (2022). 8. Mayuranathan, T. et al. Adenosine Base Editing of γ-Globin Promoters Induces Fetal Hemoglobin and Inhibit Erythroid Sickling. Blood 136, 21–22 (2020). 9. Li C, et al. In vivo HSPC gene therapy with base editors allows for efficient reactivation of fetal γ-globin in β-YAC mice. Blood advances, 5(4), 1122–1135 (2021) 10. Coleman, M. B. et al. G gamma A gamma (beta+) hereditary persistence of fetal hemoglobin: the G gamma -158 C-->T mutation in cis to the -175 T-->C mutation of the A gamma-globin gene results in increased G gamma-globin synthesis. Am J Hematol 42, 186– 190 (1993). 11. Anzalone, A. V. et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157 (2019). 12. Yan, J., Cirincione, A. & Adamson, B. Prime Editing: Precision Genome Editing by Reverse Transcription. Molecular Cell 77, 210–212 (2020). 13. Yang, L., Yang, B. & Chen, J. One Prime for All Editing. Cell 179, 1448–1450 (2019). 14. Nelson, J. W. et al. Engineered pegRNAs improve prime editing efficiency. Nat Biotechnol 40, 402–410 (2022). 15. Chen, P. J. et al. Enhanced prime editing systems by manipulating cellular determinants of editing outcomes. Cell 184, 5635-5652.e29 (2021).
16. Hussmann, J. A. et al. Mapping the genetic landscape of DNA double-strand break repair. Cell 184, 5653-5669.e25 (2021). 17. Ferreira da Silva, J. et al. Prime editing efficiency and fidelity are enhanced in the absence of mismatch repair. Nat Commun 13, 760 (2022). 18. Spencer, J. M. & Zhang, X. Deep mutational scanning of S. pyogenes Cas9 reveals important functional domains. Sci Rep 7, 16836 (2017). 19. Martyn, G. E., Quinlan, K. G. R. & Crossley, M. The regulation of human globin promoters by CCAAT box elements and the recruitment of NF-Y. Biochim Biophys Acta Gene Regul Mech 1860, 525–536 (2017). 20. Kim, H. K. et al. Predicting the efficiency of prime editing guide RNAs in human cells. Nat Biotechnol 39, 198–206 (2021). 21. Walton, R. T., Christie, K. A., Whittaker, M. N. & Kleinstiver, B. P. Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science 368, 290– 296 (2020). 22. Kweon, J. et al. Engineered prime editors with PAM flexibility. Molecular Therapy 29, 2001–2007 (2021). 23. Dang, Y. et al. Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency. Genome Biol 16, 280 (2015). 24. Liu, Y. et al. Enhancing prime editing by Csy4-mediated processing of pegRNA. Cell Res 31, 1134–1136 (2021). 25. Doman, J. L., Sousa, A. A., Randolph, P. B., Chen, P. J. & Liu, D. R. Designing and executing prime editing experiments in mammalian cells. Nat Protoc 17, 2431–2468 (2022). 26. Hsu, J. Y. et al. PrimeDesign software for rapid and simplified design of prime editing guide RNAs. Nat Commun 12, 1034 (2021). 27. Chow, R. D., Chen, J. S., Shen, J. & Chen, S. A web tool for the design of prime-editing guide RNAs. Nat Biomed Eng 5, 190–194 (2021). 28. Li, Y., Chen, J., Tsai, S. Q. & Cheng, Y. Easy-Prime: a machine learning–based prime editor design tool. Genome Biol 22, 235 (2021). 29. Concordet, J.-P. & Haeussler, M. CRISPOR: intuitive guide selection for CRISPR/Cas9 genome editing experiments and screens. Nucleic Acids Research 46, W242–W245 (2018). 30. Cradick, T. J., Qiu, P., Lee, C. M., Fine, E. J. & Bao, G. COSMID: A Web-based Tool for Identifying and Validating CRISPR/Cas Off-target Sites. Molecular Therapy - Nucleic Acids 3, e214 (2014).
31. Kim, H. K. et al. SpCas9 activity prediction by DeepSpCas9, a deep learning–based model with high generalization performance. Sci. Adv.5, eaax9249 (2019). 32. Brinkman, E. K., Chen, T., Amendola, M. & van Steensel, B. Easy quantitative assessment of genome editing by sequence trace decomposition. Nucleic Acids Research 42, e168–e168 (2014). 33. Peterka M, Akrap N, Li S, Wimberger S, Hsieh PP, Degtev D, Bestas B, Barr J, van de Plassche S, Mendoza-Garcia P, Šviković S, Sienski G, Firth M, Maresca M. Harnessing DSB repair to promote efficient homology-dependent and -independent prime editing. Nat Commun. 2022 Mar 24;13(1):1240. 34. Antoniou P, Hardouin G, Martinucci P, Frati G, Felix T, Chalumeau A, Fontana L, Martin J, Masson C, Brusson M, Maule G, Rosello M, Giovannangeli C, Abramowski V, de Villartay JP, Concordet JP, Del Bene F, El Nemer W, Amendola M, Cavazzana M, Cereseto A, Romano O, Miccio A. Base-editing-mediated dissection of a γ-globin cis-regulatory element for the therapeutic reactivation of fetal hemoglobin expression. Nat Commun.2022 Nov 4;13(1):6618. 35. Everette KA, Newby GA, Levine RM, Mayberry K, Jang Y, Mayuranathan T, Nimmagadda N, Dempsey E, Li Y, Bhoopalan SV, Liu X, Davis JR, Nelson AT, Chen PJ, Sousa AA, Cheng Y, Tisdale JF, Weiss MJ, Yen JS, Liu DR. Ex vivo prime editing of patient haematopoietic stem cells rescues sickle-cell disease phenotypes after engraftment in mice. Nat Biomed Eng.2023 May;7(5):616-628. 36. Levesque S, Cosentino A, Verma A, Genovese P, Bauer DE. Enhancing prime editing in hematopoietic stem and progenitor cells by modulating nucleotide metabolism. Nat Biotechnol. 2024 May 28.
Claims
CLAIMS: 1. A method of increasing fetal hemoglobin content in a eukaryotic cell comprising the step of contacting the eukaryotic cell with a prime editing platform that consists of (a) one prime editing enzyme and (b) one prime editing guide RNA (pegRNA) for guiding the prime editing enzyme to one target nucleic acid sequence in the -115 region of the HBG1 and/or HBG2 promoter, thereby prime editing said region and subsequently increasing the expression of gamma-globin in said eukaryotic cell. 2. The method of claim 1 wherein the prime editing platform is suitable for introducing a combination of mutations in the -115 region of HBG1 and/or HBG2 promoter so that i) new transcriptional activator binding sites for KLF1 and GATA1 are introduced in said promoter and ii) the binding site for the BCL11A repressor is disrupted. 3. The method of claim 2 wherein the prime editing platform herein disclosed introduces i) the -123>C and -124T>C mutations so that the KFL1 activator can bind to the promoter ii) the - 113 A>G mutation so that the GATA1 activator can bind to the promoter and iii) with or without the complete or partial deletion of the BCL11A- binding motif so that the binding site for the BCL11A repressor is disrupted. 4. The method according to any one of claims 1 to 3 that comprises the steps of: (i) contacting the eukaryotic cell with a) the prime editing enzyme and b) a guide RNA comprising an RT template comprising the desired nucleotide changes; (ii) conducting target-primed reverse transcription of the RT template to generate a single strand DNA comprising the desired nucleotide changes; and (iii) incorporating the desired nucleotide change into the -115 region of the HBG1 or HBG2 promoter at the target sequence through a DNA repair and/or replication process. 5. The method according to any one of claims 1 to 4 wherein the eukaryotic cell is selected from the group consisting of hematopoietic progenitor cells, hematopoietic stem cells (HSCs), pluripotent cells (i.e. embryonic stem cells (ES) and induced pluripotent stem cells (iPS)). 6. The method according to any one of claims 1 to 5 wherein the prime editing enzyme comprises a CRISPR/Cas nuclease that is a nickase and more particularly a Cas9 nickase
i.e. the Cas9 from S. pyogenes having one mutation selected from the group consisting of D10A and H840A. 7. The method of claim 6 wherein the nickase comprises the amino acid sequence as set forth in SEQ ID NO: 3 or SEQ ID NO:4. 8. The method according to any one of claims 1 to 7 wherein prime editing enzyme comprises the reverse transcriptase that comprises an amino acid sequence having at least 90% of identity with the amino acid sequence as set forth in SEQ ID NO:5. 9. The method of claim 8 wherein the reverse transcriptase has the amino acid sequence as set forth in SEQ ID NO:5 but comprises one or more of the following mutations: P51L, S67K, E69K, L139P, T197A, D200N, H204R, F209N, E302K, E302R, T306K, F309N, W313F, T330P, L345G, L435G, N454K, D524G, E562Q, D583N, H594Q, L603W, E607K, or D653N or has the amino acid sequence as set forth in SEQ ID NO:5 but comprises four mutations: D200N, T306K, W313F, and T330P. 10. The method according to any one of claims 1 to 9 wherein the prime editing enzyme is the PEmax that consists of a fusion protein having the following structure [NLS]- [Cas9(R221K, N394K H840A)]-[linker]- [MMLV_RT(D200N)(T330P)(L603W)(T306K)(W313F)] and having the amino acid sequence as set forth in SEQ ID NO:13. 11. The method according to any one of claims 1 to 10 wherein the pegRNA comprises (a) a spacer sequence that comprises a region of complementarity to a first strand of the double-stranded target nucleic sequence located in the HBG1/2 promoters; (b) an extension arm that comprises a RT template and a primer binding site in a 5’ to 3’ orientation, wherein the primer binding site comprises a region of complementarity to a region upstream of a nick site in the second strand of the double-stranded target sequence, and wherein the RT template encodes the desired nucleotide changes (e.g. - 124T>C, -123T>C , -113A>G and with or without a complete or partial deletion of the BCL11A binding-motif) compared to a region downstream of the nick site in the second strand of the double-stranded target sequence. 12. The method of claim 11 wherein the spacer sequence is selected from the group consisting of :
pegRNA spacer sequence SEQ ID NO : Spacer_10 GTTTGCCTTGTCAAGGCTAT 15 Spacer_11 TGCCTTGTCAAGGCTATTGG 16 Spacer_12 TTGCCTTGTCAAGGCTATTG 17 Spacer_13 TTTGCCTTGTCAAGGCTATT 18 Spacer_14 AGTTTGCCTTGTCAAGGCTA 19 13. The method of claim 12 wherein the spacer sequence is Spacer_10 GTTTGCCTTGTCAAGGCTAT SEQ ID NO:15 14. The method according to any one of claims 11 to 13 wherein the gRNA core sequence of the pegRNA is selected from the group consisting of : Scaffo sequence SE ld Q ID NO : Scaffo GTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGGCTAGTCCGTTATCAA 20 ld CTTGAAAAAGTGGCACCGAGTCGGTGC scaffo GTTTCAGAGCTATGCTGGAAACAGCATAGCAAGTTGAAATAAGGCTAGT 21 ld_opt CCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGC 15. The method according to any one of claims 11 to 14 wherein the RT template sequence is selected from the group consisting of: RTT sequence SEQ ID NO : RTT_10 GCCccGCCTgATA 22 RTT_10.2 GCCccGCCTTGAgATA 23 RTT_10.3 GCCccGCCTTGACCgATA 24 RTT_11 GCCAGCCTTGCCTG 25 RTT_12 AGCCTTGCCTGA 26 RTT_13 GCCTTGCCTGAT 27 RTT_14 TTGCCTGATAG 28 16. The method according to any one of claims 11 to 14 wherein the the primer binding site sequence (PBS) is selected from the group consisting of: PBS sequence SEQ ID NO : PBS_10 GCCTTGACAAGGC 29 PBS_11 ATAGCCTTGACAA 30 PBS_12 TAGCCTTGACAAG 31 PBS_13 AGCCTTGACAAGG 32 PBS_14 CCTTGACAAGGCA 33
17. The method according to any one of claims 11 to 16 wherein the pegRNA also comprises a modified prequeosine1-1 riboswitch aptamer (tevopreQ1) having the sequence of CGCGGTTCTATCTAGTTACGCGTTAAACCAACTAGAA (SEQ ID NO:45). 18. The method according to any one of claims 11 to 17 wherein the pegRNA is selected from the group consisting of : Nam full length 5'-3' S e E Q I D N O : peg GTTTGCCTTGTCAAGGCTATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG 3 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgAT 5 10 AGCCTTGACAAGGC peg TGCCTTGTCAAGGCTATTGGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG 3 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCAGCCTTGCC 6 11 TGATAGCCTTGACAA peg TTGCCTTGTCAAGGCTATTGGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG 3 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCAGCCTTGCCTGA 7 12 TAGCCTTGACAAG peg TTTGCCTTGTCAAGGCTATTGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG 3 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCTTGCCTGAT 8 13 AGCCTTGACAAGG peg AGTTTGCCTTGTCAAGGCTAGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG 3 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTTGCCTGATAGC 9 14 CTTGACAAGGCA peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 RNA TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC 0 10. CccGCCTTGAgATAGCCTTGACAAGGC 3 peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 RNA TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTT 1 10. GGCCAGCCccGCCTTGAgATAGCCTTGACAAGGC 4 epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 gRN TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC 2 A10 CccGCCTgATAGCCTTGACAAGGCTCCTAATCCGCGGTTCTATCTAGTTACGCG .1 TTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 gRN TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC 3 A10 CccGCCTTGAgATAGCCTTGACAAGGCTCCTTATCCGCGGTTCTATCTAGTTAC .2 GCGTTAAACCAACTAGAA epe 4 gRN GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 A10 TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC .3 CccGCCTTGACCgATAGCCTTGACAAGGCCTCTTCTACGCGGTTCTATCTAGTT ACGCGTTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 gRN TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTT 5 A10 GGCCAGCCccGCCTTGAgATAGCCTTGACAAGGCTCCGGTAACGCGGTTCTATC .4 TAGTTACGCGTTAAACCAACTAGAA
19. The method of claim 18 wherein the pegRNA sequence is selected from: Nam full length 5'-3' S e E Q I D N O : peg GTTTGCCTTGTCAAGGCTATGTTTTAGAGCTAGAAATAGCAAGTTAAAATAAGG 3 RNA CTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGCCccGCCTgAT 5 10 AGCCTTGACAAGGC peg GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 RNA TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC 0 10. CccGCCTTGAgATAGCCTTGACAAGGC 3 peg 4 RNA GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 1 10. TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTT 4 GGCCAGCCccGCCTTGAgATAGCCTTGACAAGGC epe 4 gRN GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 3 A10 TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC .2 CccGCCTTGAgATAGCCTTGACAAGGCTCCTTATCCGCGGTTCTATCTAGTTAC GCGTTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 gRN TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCGC 4 A10 CccGCCTTGACCgATAGCCTTGACAAGGCCTCTTCTACGCGGTTCTATCTAGTT .3 ACGCGTTAAACCAACTAGAA epe GTTTGCCTTGTCAAGGCTATGTTTCAGAGCTATGCTGGAAACAGCATAGCAAGT 4 gRN TGAAATAAGGCTAGTCCGTTATCAACTTGAAAAAGTGGCACCGAGTCGGTGCTT 5 A10 GGCCAGCCccGCCTTGAgATAGCCTTGACAAGGCTCCGGTAACGCGGTTCTATC .4 TAGTTACGCGTTAAACCAACTAGAA 20. The method according to any one of claims 1 to 19 wherein the prime editing platform further involves the use of a second strand nicking guide RNA (“nicking guide RNA” or ngRNA”) that complexes with the prime editing platform and introduces a nick in the non-edited DNA strand in order to induce preferential replacement of the edited strand. 21. The method of claim 20 wherein the ngRNA is selected from the group consisting of: ngRNA sequence SEQ ID NO : ngRNA10.1 TAGTCTTAGAGTATCCAGTG 46 ngRNA10.2 TAGAGTATCCAGTGAGGCCA 47 ngRNA10.3 AGAGTATCCAGTGAGGCCAG 48 ngRNA10.4 TATCCAGTGAGGCCAGGGGC 49
ngRNA10.5 GGCTAGGGATGAAGAATAAA 50 22. The method of claim 21 wherein the ngRNA is from ngRNA10.5 GGCTAGGGATGAAGAATAAA SEQ ID NO :50 23. The method of claims 1 to 22 wherein the prime editing platform further involves the transient expression of an engineered DNA mismatch repair (MMR) inhibiting protein for enhancing the efficiency of the prime editing that is preferably selected among catalytically impaired mutants of human MSH2, MSH6, PMS2, and MLH1. 24. The method of claim 23 wherein the prime editing platform involves the use of a dominant negative MMR protein (MLH1dn) that has the amino acid sequence as set forth in SEQ ID NO:51. 25. The method according to any one of claim 1 to 24 wherein the prime editing platform preferably consists in one of the following combinations: Prime editor pegRNA ngRNA MLH1 PEmax (SEQ pegRNA10.4 (SEQ ngRNA10.5 (SEQ ID No ID NO:13) ID NO:41) NO :50) PEmax (SEQ pegRNA10.4 (SEQ ngRNA10.5 ( SEQ ID Yes (SEQ ID ID NO:13) ID NO:41) NO :50) NO:51) PEmax (SEQ epegRNA10.2 (SEQ ngRNA10.5 ( SEQ ID No ID NO:13) ID NO:43) NO :50) PEmax (SEQ epegRNA10.2 (SEQ ngRNA10.5 ( SEQ ID Yes (SEQ ID ID NO:13) ID NO:43) NO :50) NO:51) PEmax (SEQ epegRNA10.4 (SEQ ngRNA10.5(SEQ ID No ID NO:13) ID NO:45) NO :50) PEmax (SEQ epegRNA10.4 (SEQ ngRNA10.5 (SEQ ID Yes (SEQ ID ID NO:13) ID NO:45) NO :50) NO:51) 26. A population of edited eukaryotic cells obtainable by the method according to any one of claims 1 to 25.
27. A method for increasing fetal hemoglobin levels in a subject in need thereof, the method comprising transplanting a therapeutically effective amount of the population of eukaryotic cells of claim 26. 28. The method of claim 27 for the treatment of a hemoglobinopathy. 29. A kit that consists of one of the following combinations: Prime editor pegRNA ngRNA MLH1 PEmax (SEQ pegRNA10.4 (SEQ ngRNA10.5 (SEQ No ID NO:13) ID NO:41) ID NO :50) PEmax (SEQ pegRNA10.4 (SEQ ngRNA10.5 (SEQ Yes (SEQ ID ID NO:13) ID NO:41) ID NO :50) NO:51) PEmax (SEQ epegRNA10.2 (SEQ ngRNA10.5 (SEQ No ID NO:13) ID NO:43) ID NO :50) PEmax (SEQ epegRNA10.2 (SEQ ngRNA10.5 (SEQ Yes (SEQ ID ID NO:13) ID NO:43) ID NO :50) NO:51) PEmax (SEQ epegRNA10.4 (SEQ ngRNA10.5 (SEQ No ID NO:13) ID NO:45) ID NO :50) PEmax (SEQ epegRNA10.4 (SEQ ngRNA10.5 (SEQ Yes (SEQ ID ID NO:13) ID NO:45) ID NO :50) NO:51)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP23306230.6 | 2023-07-17 | ||
EP23306230 | 2023-07-17 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2025017033A1 true WO2025017033A1 (en) | 2025-01-23 |
Family
ID=88016321
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2024/070167 WO2025017033A1 (en) | 2023-07-17 | 2024-07-16 | Prime editing of the -115 region in the hbg1 and/or hbg2 promoter for increasing fetal hemoglobin content in a eukaryotic cell |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2025017033A1 (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001038547A2 (en) | 1999-11-24 | 2001-05-31 | Mcs Micro Carrier Systems Gmbh | Polypeptides comprising multimers of nuclear localization signals or of protein transduction domains and their use for transferring molecules into cells |
US8202983B2 (en) | 2007-05-10 | 2012-06-19 | Agilent Technologies, Inc. | Thiocarbon-protecting groups for RNA synthesis |
WO2013176772A1 (en) | 2012-05-25 | 2013-11-28 | The Regents Of The University Of California | Methods and compositions for rna-directed target dna modification and for rna-directed modulation of transcription |
WO2014144761A2 (en) | 2013-03-15 | 2014-09-18 | The General Hospital Corporation | Increasing specificity for rna-guided genome editing |
US20140273233A1 (en) | 2013-03-15 | 2014-09-18 | Sigma-Aldrich Co., Llc | Crispr-based genome modification and regulation |
US20140273226A1 (en) | 2013-03-15 | 2014-09-18 | System Biosciences, Llc | Crispr/cas systems for genomic modification and gene modulation |
WO2017077394A2 (en) * | 2015-11-04 | 2017-05-11 | Crispr Therapeutics Ag | Materials and methods for treatment of hemoglobinopathies |
WO2019173654A2 (en) * | 2018-03-07 | 2019-09-12 | Editas Medicine, Inc. | Systems and methods for the treatment of hemoglobinopathies |
CN111876416A (en) * | 2020-07-01 | 2020-11-03 | 广州瑞风生物科技有限公司 | Methods and compositions for activating gamma-globin gene expression |
WO2020240523A1 (en) * | 2019-05-31 | 2020-12-03 | The Governing Council Of The University Of Toronto | Methods and compositions for multiplex gene editing |
WO2021228944A1 (en) | 2020-05-13 | 2021-11-18 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Base editing approaches for the treatment of betahemoglobinopathies |
WO2022150790A2 (en) * | 2021-01-11 | 2022-07-14 | The Broad Institute, Inc. | Prime editor variants, constructs, and methods for enhancing prime editing efficiency and precision |
-
2024
- 2024-07-16 WO PCT/EP2024/070167 patent/WO2025017033A1/en unknown
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001038547A2 (en) | 1999-11-24 | 2001-05-31 | Mcs Micro Carrier Systems Gmbh | Polypeptides comprising multimers of nuclear localization signals or of protein transduction domains and their use for transferring molecules into cells |
US8202983B2 (en) | 2007-05-10 | 2012-06-19 | Agilent Technologies, Inc. | Thiocarbon-protecting groups for RNA synthesis |
WO2013176772A1 (en) | 2012-05-25 | 2013-11-28 | The Regents Of The University Of California | Methods and compositions for rna-directed target dna modification and for rna-directed modulation of transcription |
US20140273226A1 (en) | 2013-03-15 | 2014-09-18 | System Biosciences, Llc | Crispr/cas systems for genomic modification and gene modulation |
WO2014144592A2 (en) | 2013-03-15 | 2014-09-18 | The General Hospital Corporation | Using truncated guide rnas (tru-grnas) to increase specificity for rna-guided genome editing |
US20140273233A1 (en) | 2013-03-15 | 2014-09-18 | Sigma-Aldrich Co., Llc | Crispr-based genome modification and regulation |
WO2014144761A2 (en) | 2013-03-15 | 2014-09-18 | The General Hospital Corporation | Increasing specificity for rna-guided genome editing |
WO2017077394A2 (en) * | 2015-11-04 | 2017-05-11 | Crispr Therapeutics Ag | Materials and methods for treatment of hemoglobinopathies |
WO2019173654A2 (en) * | 2018-03-07 | 2019-09-12 | Editas Medicine, Inc. | Systems and methods for the treatment of hemoglobinopathies |
WO2020240523A1 (en) * | 2019-05-31 | 2020-12-03 | The Governing Council Of The University Of Toronto | Methods and compositions for multiplex gene editing |
WO2021228944A1 (en) | 2020-05-13 | 2021-11-18 | INSERM (Institut National de la Santé et de la Recherche Médicale) | Base editing approaches for the treatment of betahemoglobinopathies |
CN111876416A (en) * | 2020-07-01 | 2020-11-03 | 广州瑞风生物科技有限公司 | Methods and compositions for activating gamma-globin gene expression |
WO2022150790A2 (en) * | 2021-01-11 | 2022-07-14 | The Broad Institute, Inc. | Prime editor variants, constructs, and methods for enhancing prime editing efficiency and precision |
Non-Patent Citations (67)
Title |
---|
"Handbook of Growth Factors", vol. III, CRC PRESS, article "Hematopoietic Growth Factors and Cytokines", pages: 1 - 2 |
ADAMS ET AL., THE BIOCHEMISTRY OF THE NUCLEIC ACIDS, 1992 |
ANTONIOU PHARDOUIN GMARTINUCCI PFRATI GFELIX TCHALUMEAU AFONTANA LMARTIN JMASSON CBRUSSON M: "Base-editing-mediated dissection of a γ-globin cis-regulatory element for the therapeutic reactivation of fetal hemoglobin expression", NAT COMMUN., vol. 13, no. 1, 4 November 2022 (2022-11-04), pages 6618 |
ANTONIOU, PANAGIOTIS ET AL.: "Base-editing-mediated dissection of a γ-globin cis-regulatory element for the therapeutic reactivation of fetal hemoglobin expression", NATURE COMMUNICATIONS, vol. 13, no. 1, 2022, pages 1 - 22 |
ANZALONE, ANDREW V. ET AL.: "Search-and-replace genome editing without double-strand breaks or donor DNA", NATURE, vol. 576, no. 7785, 2019, pages 149 - 157, XP055980447, DOI: 10.1038/s41586-019-1711-4 |
ARIF TAQDEES ET AL: "Prime editing: A potential treatment option for [beta]-thalassemia", vol. 47, no. 4, 8 December 2022 (2022-12-08), GB, pages 699 - 713, XP093114240, ISSN: 1065-6995, Retrieved from the Internet <URL:https://onlinelibrary.wiley.com/doi/full-xml/10.1002/cbin.11972> DOI: 10.1002/cbin.11972 * |
BOUTIN, J. ET AL.: "ON-Target Adverse Events of CRISPR-Cas9 Nuclease: More Chaotic than Expected", THE CRISPR JOURNAL, vol. 5, 2022, pages 19 - 30, XP093082198, DOI: 10.1089/crispr.2021.0120 |
BRINKMAN, E. K.CHEN, T.AMENDOLA, M.VAN STEENSEL, B.: "Easy quantitative assessment of genome editing by sequence trace decomposition", NUCLEIC ACIDS RESEARCH, vol. 42, 2014, XP055788071, DOI: 10.1093/nar/gku936 |
CHALUMEAU A. ET AL: "Presidential Symposium and Presentation of Top Abstracts 96 Development of prime editing strategies for treatment of beta-hemoglobinopathies", MOLECULAR THERAPY, VOLUME 31, ISSUE 4, SUPPLEMENT 1, 1 May 2023 (2023-05-01), XP093114202, Retrieved from the Internet <URL:https://www.sciencedirect.com/science/article/pii/S1525001623002484?via%3Dihub> [retrieved on 20231220], DOI: https://doi.org/10.1016/j.ymthe.2023.04.017 * |
CHEN ET AL.: "Fusion protein linkers: property, design and functionality", ADV DRUG DELIV REV., vol. 65, no. 10, 2013, pages 1357 - 69, XP028737352, DOI: 10.1016/j.addr.2012.09.039 |
CHEN, PETER J ET AL.: "Enhanced prime editing systems by manipulating cellular determinants of editing outcomes", CELL, vol. 184, no. 22, 2021, pages 5635 - 5652, XP055915530, DOI: 10.1016/j.cell.2021.09.018 |
CHOW, R. D.CHEN, J. S.SHEN, J.CHEN, S: "A web tool for the design of prime-editing guide RNAs", NAT BIOMED ENG, vol. 5, 2021, pages 190 - 194, XP037367896, DOI: 10.1038/s41551-020-00622-8 |
CHYLINSKI, RHUNCHARPENTIER: "The tracrRNA and Cas9 families of type II CRISPR-Cas immunity systems", RNA BIOLOGY, vol. 10, no. 5, 2013, pages 726 - 737, XP055116068, DOI: 10.4161/rna.24321 |
COLEMAN, M. B. ET AL.: "G gamma A gamma (betat) hereditary persistence of fetal hemoglobin: the G gamma -158 C-->T mutation in cis to the -175 T-->C mutation of the A gamma-globin gene results in increased G gamma-globin synthesis", AM J HEMATOL, vol. 42, 1993, pages 186 - 190 |
CONCORDET, J.-P.HAEUSSLER, M.: "CRISPOR: intuitive guide selection for CRISPR/Cas9 genome editing experiments and screens", NUCLEIC ACIDS RESEARCH, vol. 46, 2018 |
CRADICK, T. J.QIU, P.LEE, C. M.FINE, E. J.BAO, G: "COSMID: A Web-based Tool for Identifying and Validating CRISPR/Cas Off-target Sites.", MOLECULAR THERAPY - NUCLEIC ACIDS, vol. 3, 2014, pages 214 |
CROMER, M. K. ET AL.: "Global Transcriptional Response to CRISPR/Cas9-AAV6-Based Genome Editing in CD34+ Hematopoietic Stem and Progenitor Cells", MOLECULAR THERAPY, vol. 26, 2018, pages 2431 - 2442, XP093012831, DOI: 10.1016/j.ymthe.2018.06.002 |
DALEY ET AL., FOCUS, vol. 18, 1996, pages 62 - 67 |
DANG, Y. ET AL.: "Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency", GENOME BIOL, vol. 16, 2015, pages 280 |
DELTCHEVA E.CHYLINSKI K.SHARMA C. M.GONZALES K.CHAO Y.PIRZADA Z. A.ECKERT M. R.VOGEL J.CHARPENTIER E.: "CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III", NATURE, vol. 471, 2011, pages 602 - 607, XP055308803, DOI: 10.1038/nature09886 |
DOMAN, J. L.SOUSA, A. A.RANDOLPH, P. B.CHEN, P. J.LIU, D. R: "Designing and executing prime editing experiments in mammalian cells.", NAT PROTOC, vol. 17, 2022, pages 2431 - 2468, XP093149104, DOI: 10.1038/s41596-022-00724-4 |
EVERETTE KANEWBY GALEVINE RMMAYBERRY KJANG YMAYURANATHAN TNIMMAGADDA NDEMPSEY ELI YBHOOPALAN SV: "Ex vivo prime editing of patient haematopoietic stem cells rescues sickle-cell disease phenotypes after engraftment in mice", NAT BIOMED ENG., vol. 7, no. 5, May 2023 (2023-05-01), pages 616 - 628, XP093113740, DOI: 10.1038/s41551-023-01026-0 |
FERREIRA DA SILVA, J. ET AL.: "Prime editing efficiency and fidelity are enhanced in the absence of mismatch repair", NAT COMMUN, vol. 13, 2022, pages 760 |
FERRETTI, COMPLETE GENOME SEQUENCE OF AN M1 STRAIN OF STREPTOCOCCUS PYOGENES |
FORGET, B. G.: "Molecular Basis of Hereditary Persistence of Fetal Hemoglobin", ANNALS NY ACAD SCI, vol. 850, 1998, pages 38 - 44, XP071390909, DOI: 10.1111/j.1749-6632.1998.tb10460.x |
FORGET, BERNARD G.: "Molecular basis of hereditary persistence of fetal hemoglobin", ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, vol. 850, no. 1, 1998, pages 38 - 44, XP071390909, DOI: 10.1111/j.1749-6632.1998.tb10460.x |
HSU, J. Y. ET AL.: "PrimeDesign software for rapid and simplified design of prime editing guide RNAs", NAT COMMUN, vol. 12, 2021, pages 1034 |
HUSSMANN, J. A. ET AL.: "Mapping the genetic landscape of DNA double-strand break repair", CELL, vol. 184, 2021, pages 5653 - 5669 |
J. J., MCSHAN W. MAJDIC D. J.SAVIC D. J.SAVIC G.LYON K.PRIMEAUX C.SEZATE S.SUVOROV A. N.KENTON S.LAI H. S., PROC. NATL. ACAD. SCI. U.S.A., vol. 98, 2001, pages 4658 - 4663 |
JINEK ET AL., SCIENCE, vol. 337, 2012, pages 816 - 821 |
JINEK M.CHYLINSKI K.FONFARA I.HAUER M.DOUDNA J. A.CHARPENTIER E.: "A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity", SCIENCE, vol. 337, 2012, pages 816 - 821, XP055229606, DOI: 10.1126/science.1225829 |
KATO, GREGORY J ET AL.: "Sickle cell disease", NATURE REVIEWS DISEASE PRIMERS, vol. 4, no. 1, 2018, pages 1 - 22 |
KIM, H. K. ET AL.: "SpCas9 activity prediction by DeepSpCas9, a deep learning-based model with high generalization performance", SCI. ADV., vol. 5, 2019, pages 9249 |
KIM, H. K: "Predicting the efficiency of prime editing guide RNAs in human cells", NAT BIOTECHNOL, vol. 39, 2021, pages 198 - 206, XP037365130, DOI: 10.1038/s41587-020-0677-y |
KLEINSTIVER, B. P. ET AL.: "Broadening the targeting range of Staphylococcus aureus CRISPR-Cas9 by modifying PAM recognition", NATURE BIOTECHNOLOGY, vol. 33, 2015, pages 1293 - 1298, XP055309933, DOI: 10.1038/nbt.3404 |
KLEINSTIVER, B. P. ET AL.: "Engineered CRISPR-Cas9 nucleases with altered PAM specificities", NATURE, vol. 523, 2015, pages 481 - 485, XP055293257, DOI: 10.1038/nature14592 |
KWEON, J. ET AL.: "Engineered prime editors with PAM flexibility", MOLECULAR THERAPY, vol. 29, 2021, pages 2001 - 2007, XP055960547, DOI: 10.1016/j.ymthe.2021.02.022 |
LEVESQUE SCOSENTINO AVERMA AGENOVESE PBAUER DE.: "Enhancing prime editing in hematopoietic stem and progenitor cells by modulating nucleotide metabolism", NAT BIOTECHNOL. |
LI C ET AL.: "In vivo HSPC gene therapy with base editors allows for efficient reactivation of fetal -globin in β-YAC mice", BLOOD ADVANCES, vol. 5, no. 4, 2021, pages 1122 - 1135, XP093031763, DOI: 10.1182/bloodadvances.2020003702 |
LI, Y.CHEN, JTSAI, S. Q.CHENG, Y.: "Easy-Prime: a machine learning-based prime editor design tool", GENOME BIOL, vol. 22, 2021, pages 235 |
LIU, Y. ET AL.: "Enhancing prime editing by Csy4-mediated processing of pegRNA", CELL RES, vol. 31, 2021, pages 1134 - 1136, XP037578331, DOI: 10.1038/s41422-021-00520-x |
LOCATELLI, F.LUCARELLI, B.MERLI, P: "Current and future approaches to treat graft failure after allogeneic hematopoietic stem cell transplantation", EXPERT OPINION ON PHARMACOTHERAPY, vol. 15, 2014, pages 23 - 36 |
MARTYN, G. E.QUINLAN, K. G. R.CROSSLEY, M.: "The regulation of human globin promoters by CCAAT box elements and the recruitment of NF-Y", BIOCHIM BIOPHYS ACTA GENE REGUL MECH, vol. 1860, 2017, pages 525 - 536 |
MARTYN, GABRIELLA E. ET AL.: "A natural regulatory mutation in the proximal promoter elevates fetal globin expression by creating a de novo GATA1 site", BLOOD, THE JOURNAL OF THE AMERICAN SOCIETY OF HEMATOLOGY, vol. 133, no. 8, 2019, pages 852 - 856 |
MARTYN, GABRIELLA E.KATE GR QUINLANMERLIN CROSSLEY: "The regulation of human globin promoters by CCAAT box elements and the recruitment of NF-Y", BIOCHIMICA ET BIOPHYSICA ACTA (BBA)-GENE REGULATORY MECHANISMS, vol. 1860, no. 5, 2017, pages 525 - 536 |
MAYURANATHAN, T. ET AL.: "Adenosine Base Editing of γ-Globin Promoters Induces Fetal Hemoglobin and Inhibit Erythroid Sickling", BLOOD, vol. 136, 2020, pages 21 - 22 |
NEEDLEMAN, SAUL B.WUNSCH, CHRISTIAN D.: "A general method applicable to the search for similarities in the amino acid sequence of two proteins", JOURNAL OF MOLECULAR BIOLOGY, vol. 48, no. 3, 1970, pages 443 - 53, XP024011703, DOI: 10.1016/0022-2836(70)90057-4 |
NELSON, J. W. ET AL.: "Engineered pegRNAs improve prime editing efficiency", NAT BIOTECHNOL, vol. 40, 2022, pages 402 - 410, XP093043230, DOI: 10.1038/s41587-021-01039-7 |
NELSON, JAMES W. ET AL.: "Engineered pegRNAs improve prime editing efficiency", NATURE BIOTECHNOLOGY, vol. 40, no. .3, 2022, pages 402 - 410, XP093043230, DOI: 10.1038/s41587-021-01039-7 |
PETERKA MAKRAP NLI SWIMBERGER SHSIEH PPDEGTEV DBESTAS BBARR JVAN DE PLASSCHE SMENDOZA-GARCIA P: "Harnessing DSB repair to promote efficient homology-dependent and -independent prime editing", NAT COMMUN |
QI ET AL., CELL, vol. 152, no. 5, 2013, pages 1173 - 83 |
QI ET AL.: "Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression", CELL, vol. 152, no. 5, 2013, pages 1173 - 83, XP055346792, DOI: 10.1016/j.cell.2013.02.022 |
RAVI, N. S. ET AL.: "Identification of novel HPFH-like mutations by CRISPR base editing that elevate the expression of fetal hemoglobin", ELIFE, vol. 11, 2022, pages 65421 |
SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 2001, COLD SPRING HARBOR PRESS |
SANKARAN VJ ET AL.: "Human fetal hemoglobin expression is regulated by the developmental stage-specific repressor BCL11A", SCIENCE SCIENCE, vol. 322, no. 5909, 19 December 2008 (2008-12-19), pages 1839 - 42, XP055568754, DOI: 10.1126/science.1165409 |
SCHEP, R. ET AL.: "Impact of chromatin context on Cas9-induced DNA double-strand break repair pathway balance", MOLECULAR CELL, vol. 81, 2021, pages 2216 - 2230 |
SPENCER, J. M.ZHANG, X: "Deep mutational scanning of S. pyogenes Cas9 reveals important functional domains", SCI REP, vol. 7, 2017, pages 16836, XP055557376, DOI: 10.1038/s41598-017-17081-y |
SPENCER, JEFFREY M.,XIAOLIU ZHANG: "Deep mutational scanning of S. pyogenes Cas9 reveals important functional domains", SCIENTIFIC REPORTS, vol. 7, no. 1, 2017, pages 1 - 14, XP055557376, DOI: 10.1038/s41598-017-17081-y |
TAHER, ALI TDAVID J. WEATHERALLMARIA DOMENICA CAPPELLINI: "Thalassaemia", THE LANCET, vol. 391, no. 10116, 2018, pages 155 - 167 |
TIJSSEN: "Laboratory Techniques In Biochemistry And Molecular Biology-Hybridization With Nucleic Acid Probes", 1993, ELSEVIER, article "Overview of principles of hybridization and the strategy of nucleic acid probe assay" |
WALTON, R. T.CHRISTIE, K. A.WHITTAKER, M. N.KLEINSTIVER, B. P.: "Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants", SCIENCE, vol. 368, 2020, pages 290 - 296, XP055957984, DOI: 10.1126/science.aba8853 |
WALTON, RUSSELL T. ET AL.: "Unconstrained genome targeting with near-PAM less engineered CRISPR-Cas9 variants", SCIENCE, vol. 368, no. 6488, 2020, pages 290 - 296 |
WEBER, L. ET AL.: "Editing a γ-globin repressor binding site restores fetal hemoglobin synthesis and corrects the sickle cell disease phenotype", SCI. ADV., vol. 6, 2020, pages 9392 |
WIENERT, BEEKE ET AL.: "KLF1 drives the expression of fetal hemoglobin in British HPFH", BLOOD, THE JOURNAL OF THE AMERICAN SOCIETY OF HEMATOLOGY, vol. 130, no. 6, 2017, pages 803 - 807 |
WIENERT, BEEKE, ET AL.: "Editing the genome to introduce a beneficial naturally occurring mutation associated with increased fetalglobin", NATURE COMMUNICATIONS, vol. 6, no. 1, 2015, pages 1 - 8 |
YAN, JCIRINCIONE, A.ADAMSON, B: "Prime Editing: Precision Genome Editing by Reverse Transcription", MOLECULAR CELL, vol. 77, 2020, pages 210 - 212, XP086007503, DOI: 10.1016/j.molcel.2019.12.016 |
YANG, L.YANG, B.CHEN, J.: "One Prime for All Editing", CELL, vol. 179, 2019, pages 1448 - 1450, XP085946337, DOI: 10.1016/j.cell.2019.11.030 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
AU2016381313B2 (en) | Compositions and methods for the treatment of hemoglobinopathies | |
JP2024041081A (en) | Use of adenosine base editors | |
EP4019635A1 (en) | Crispr/cas-related methods, compositions and components | |
CN112020558A (en) | Systems and methods for treating hemoglobinopathies | |
Antoniou et al. | Base-editing-mediated dissection of a γ-globin cis-regulatory element for the therapeutic reactivation of fetal hemoglobin expression | |
WO2018209158A2 (en) | Crispr/rna-guided nuclease systems and methods | |
AU2016262521A1 (en) | CRISPR/CAS-related methods and compositions for treating HIV infection and AIDS | |
JP2019508051A (en) | CRISPR / CAS-related methods and compositions for treating beta-hemoglobinopathy | |
US20230279438A1 (en) | Base editing approaches for the treatment of betahemoglobinopathies | |
EP3765617A1 (en) | Systems and methods for the treatment of hemoglobinopathies | |
WO2022155458A1 (en) | Systems and methods for base editing of hbg1/2 gene promoter and fetal hemoglobin induction | |
WO2025017033A1 (en) | Prime editing of the -115 region in the hbg1 and/or hbg2 promoter for increasing fetal hemoglobin content in a eukaryotic cell | |
US20220228142A1 (en) | Compositions and methods for editing beta-globin for treatment of hemaglobinopathies | |
WO2025017030A1 (en) | Prime editing of the -200 region in the hbg1 and/or hbg2 promoter for increasing fetal hemoglobin content in a eukaryotic cell | |
WO2023052366A1 (en) | Base editing approaches for the treatment of beta-hemoglobinopathies | |
EP4441089A1 (en) | Methods for increasing fetal hemoglobin content by editing the +55-kb region of the erythroid-specific bcl11a enhancer | |
WO2023217888A1 (en) | Base editing approaches for correcting the cd39 (cag>tag) mutation in patients suffering from βeta-thalassemia | |
WO2023144104A1 (en) | Base editing approaches for the treatment of βeta-thalassemia | |
WO2024018056A1 (en) | Base editing approaches for correcting the ivs2-1 (g>a) mutation in patients suffering from βeta-thalassemia | |
WO2024165484A1 (en) | Enrichment of genetically modified hematopoietic stem cells through multiplex base editing | |
WO2024206125A1 (en) | Use of prime editing for treating sickle cell disease | |
Fontana et al. | Multiplex base editing of BCL11A regulatory elements to treat sickle cell disease | |
CN117940566A (en) | Systems and methods for treating hemoglobinopathies | |
CN119365599A (en) | Compositions and methods for genome editing | |
CN118048397A (en) | Single-base editing system for targeted knockout of ZNF410 gene and its application |