WO2023019470A1 - 孔蛋白单体的突变体、蛋白孔及其应用 - Google Patents
孔蛋白单体的突变体、蛋白孔及其应用 Download PDFInfo
- Publication number
- WO2023019470A1 WO2023019470A1 PCT/CN2021/113270 CN2021113270W WO2023019470A1 WO 2023019470 A1 WO2023019470 A1 WO 2023019470A1 CN 2021113270 W CN2021113270 W CN 2021113270W WO 2023019470 A1 WO2023019470 A1 WO 2023019470A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- seq
- mutant
- amino acid
- protein
- porin
- Prior art date
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 209
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 195
- 239000011148 porous material Substances 0.000 title claims abstract description 189
- 108010013381 Porins Proteins 0.000 title claims abstract description 164
- 239000000178 monomer Substances 0.000 title claims abstract description 159
- 102000017033 Porins Human genes 0.000 title abstract description 144
- 150000001413 amino acids Chemical class 0.000 claims abstract description 123
- 230000035772 mutation Effects 0.000 claims abstract description 78
- 239000012491 analyte Substances 0.000 claims abstract description 54
- 102000040430 polynucleotide Human genes 0.000 claims description 127
- 108091033319 polynucleotide Proteins 0.000 claims description 127
- 239000002157 polynucleotide Substances 0.000 claims description 127
- 102000039446 nucleic acids Human genes 0.000 claims description 75
- 108020004707 nucleic acids Proteins 0.000 claims description 75
- 150000007523 nucleic acids Chemical class 0.000 claims description 75
- 125000003729 nucleotide group Chemical group 0.000 claims description 67
- 239000002773 nucleotide Substances 0.000 claims description 66
- 238000000034 method Methods 0.000 claims description 56
- 108020004414 DNA Proteins 0.000 claims description 50
- 239000012528 membrane Substances 0.000 claims description 47
- 229910052770 Uranium Inorganic materials 0.000 claims description 43
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 30
- 238000005259 measurement Methods 0.000 claims description 25
- 238000003780 insertion Methods 0.000 claims description 24
- 230000037431 insertion Effects 0.000 claims description 24
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 22
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 20
- 238000006467 substitution reaction Methods 0.000 claims description 20
- 238000012217 deletion Methods 0.000 claims description 17
- 230000037430 deletion Effects 0.000 claims description 17
- 108091006146 Channels Proteins 0.000 claims description 16
- 229910052799 carbon Inorganic materials 0.000 claims description 13
- 239000013598 vector Substances 0.000 claims description 13
- 102000014914 Carrier Proteins Human genes 0.000 claims description 12
- 241000589516 Pseudomonas Species 0.000 claims description 12
- 108091008324 binding proteins Proteins 0.000 claims description 12
- 229920001184 polypeptide Polymers 0.000 claims description 12
- 108091034117 Oligonucleotide Proteins 0.000 claims description 8
- 229920000642 polymer Polymers 0.000 claims description 8
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 claims description 7
- 239000003814 drug Substances 0.000 claims description 7
- 102220469367 Apolipoprotein C-I_T71S_mutation Human genes 0.000 claims description 6
- 102220470964 Carcinoembryonic antigen-related cell adhesion molecule 5_Y68F_mutation Human genes 0.000 claims description 6
- 229940079593 drug Drugs 0.000 claims description 6
- 102200026990 rs137852764 Human genes 0.000 claims description 6
- 102220094204 rs876659508 Human genes 0.000 claims description 6
- 102200131327 rs879255262 Human genes 0.000 claims description 6
- 229910052739 hydrogen Inorganic materials 0.000 claims description 5
- 239000004172 quinoline yellow Substances 0.000 claims description 5
- 239000004408 titanium dioxide Substances 0.000 claims description 5
- 239000000975 dye Substances 0.000 claims description 4
- 241000894007 species Species 0.000 claims description 4
- 150000001875 compounds Chemical class 0.000 claims description 3
- 239000003344 environmental pollutant Substances 0.000 claims description 3
- 239000002360 explosive Substances 0.000 claims description 3
- 150000004676 glycans Chemical class 0.000 claims description 3
- 229910021645 metal ion Inorganic materials 0.000 claims description 3
- 229920001282 polysaccharide Polymers 0.000 claims description 3
- 239000005017 polysaccharide Substances 0.000 claims description 3
- 230000001939 inductive effect Effects 0.000 claims description 2
- 238000004519 manufacturing process Methods 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 102000007739 porin activity proteins Human genes 0.000 claims 20
- 102220474385 Solute carrier family 13 member 3_S75A_mutation Human genes 0.000 claims 1
- 229910017053 inorganic salt Inorganic materials 0.000 claims 1
- 238000012512 characterization method Methods 0.000 abstract description 5
- 235000018102 proteins Nutrition 0.000 description 170
- 235000001014 amino acid Nutrition 0.000 description 107
- 238000012163 sequencing technique Methods 0.000 description 50
- 102000053602 DNA Human genes 0.000 description 49
- 108060004795 Methyltransferase Proteins 0.000 description 40
- 238000005516 engineering process Methods 0.000 description 24
- 239000010410 layer Substances 0.000 description 17
- 229920002477 rna polymer Polymers 0.000 description 16
- 230000005945 translocation Effects 0.000 description 14
- 125000001176 L-lysyl group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C([H])([H])C([H])([H])C([H])([H])C(N([H])[H])([H])[H] 0.000 description 13
- 238000010586 diagram Methods 0.000 description 13
- 150000003839 salts Chemical class 0.000 description 13
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 12
- 239000000872 buffer Substances 0.000 description 11
- 238000007672 fourth generation sequencing Methods 0.000 description 11
- -1 hexitol nucleic acid Chemical class 0.000 description 11
- 102000004190 Enzymes Human genes 0.000 description 10
- 108090000790 Enzymes Proteins 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- WCUXLLCKKVVCTQ-UHFFFAOYSA-M Potassium chloride Chemical compound [Cl-].[K+] WCUXLLCKKVVCTQ-UHFFFAOYSA-M 0.000 description 9
- 230000003993 interaction Effects 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 239000002585 base Substances 0.000 description 8
- 239000007787 solid Substances 0.000 description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 7
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 6
- 230000008878 coupling Effects 0.000 description 6
- 238000010168 coupling process Methods 0.000 description 6
- 238000005859 coupling reaction Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 6
- 238000001914 filtration Methods 0.000 description 6
- 150000002500 ions Chemical class 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 108091093094 Glycol nucleic acid Proteins 0.000 description 5
- 102220539443 Nitric oxide synthase, brain_S75A_mutation Human genes 0.000 description 5
- 108091093037 Peptide nucleic acid Proteins 0.000 description 5
- 108091046915 Threose nucleic acid Proteins 0.000 description 5
- 239000003153 chemical reaction reagent Substances 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 5
- 238000011160 research Methods 0.000 description 5
- 108020004682 Single-Stranded DNA Proteins 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- 125000000539 amino acid group Chemical group 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 4
- 239000007864 aqueous solution Substances 0.000 description 4
- 239000002800 charge carrier Substances 0.000 description 4
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 4
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 150000003904 phospholipids Chemical class 0.000 description 4
- 239000001103 potassium chloride Substances 0.000 description 4
- 235000011164 potassium chloride Nutrition 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 235000000346 sugar Nutrition 0.000 description 4
- 102000035160 transmembrane proteins Human genes 0.000 description 4
- 108091005703 transmembrane proteins Proteins 0.000 description 4
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 3
- 239000007995 HEPES buffer Substances 0.000 description 3
- 239000000232 Lipid Bilayer Substances 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 3
- 229960000723 ampicillin Drugs 0.000 description 3
- 125000003118 aryl group Chemical group 0.000 description 3
- 230000004888 barrier function Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000001493 electron microscopy Methods 0.000 description 3
- 238000001962 electrophoresis Methods 0.000 description 3
- 150000002632 lipids Chemical class 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 238000003752 polymerase chain reaction Methods 0.000 description 3
- 150000003384 small molecules Chemical class 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 239000000243 solution Substances 0.000 description 3
- 238000010186 staining Methods 0.000 description 3
- 208000035657 Abasia Diseases 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- 102000034573 Channels Human genes 0.000 description 2
- 238000001712 DNA sequencing Methods 0.000 description 2
- 102220585783 Glutamate receptor ionotropic, delta-2_F76D_mutation Human genes 0.000 description 2
- 102220535112 Inhibin beta E chain_R62T_mutation Human genes 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 102000008300 Mutant Proteins Human genes 0.000 description 2
- 108010021466 Mutant Proteins Proteins 0.000 description 2
- 102220539439 Nitric oxide synthase, brain_S75G_mutation Human genes 0.000 description 2
- 102220475857 Phosphoglycerate kinase 1_D63A_mutation Human genes 0.000 description 2
- 102220559091 Potassium voltage-gated channel subfamily E member 1_K69H_mutation Human genes 0.000 description 2
- 108091028664 Ribonucleotide Proteins 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 230000027455 binding Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- AIYUHDOJVYHVIT-UHFFFAOYSA-M caesium chloride Chemical compound [Cl-].[Cs+] AIYUHDOJVYHVIT-UHFFFAOYSA-M 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 150000003841 chloride salts Chemical class 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- 229940104302 cytosine Drugs 0.000 description 2
- UTLVQENMPIUJSJ-QTARTGBFSA-N dT10 Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)COP(O)(=O)O[C@@H]2[C@H](O[C@H](C2)N2C(NC(=O)C(C)=C2)=O)CO)[C@@H](O)C1 UTLVQENMPIUJSJ-QTARTGBFSA-N 0.000 description 2
- 230000006378 damage Effects 0.000 description 2
- 239000005547 deoxyribonucleotide Substances 0.000 description 2
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 239000000032 diagnostic agent Substances 0.000 description 2
- 229940039227 diagnostic agent Drugs 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000005684 electric field Effects 0.000 description 2
- 239000003792 electrolyte Substances 0.000 description 2
- 238000000635 electron micrograph Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- KWIUHFFTVRNATP-UHFFFAOYSA-N glycine betaine Chemical compound C[N+](C)(C)CC([O-])=O KWIUHFFTVRNATP-UHFFFAOYSA-N 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000001976 improved effect Effects 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 239000002502 liposome Substances 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 102000035118 modified proteins Human genes 0.000 description 2
- 108091005573 modified proteins Proteins 0.000 description 2
- 230000004481 post-translational protein modification Effects 0.000 description 2
- 239000000276 potassium ferrocyanide Substances 0.000 description 2
- 239000002336 ribonucleotide Substances 0.000 description 2
- 125000002652 ribonucleotide group Chemical group 0.000 description 2
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 2
- 102200150035 rs121909265 Human genes 0.000 description 2
- 102220322567 rs147013097 Human genes 0.000 description 2
- 102220072405 rs191342808 Human genes 0.000 description 2
- 102200037714 rs2655655 Human genes 0.000 description 2
- 102200005931 rs375912738 Human genes 0.000 description 2
- 102220243337 rs876659508 Human genes 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 238000001542 size-exclusion chromatography Methods 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- XOGGUFAVLNCTRS-UHFFFAOYSA-N tetrapotassium;iron(2+);hexacyanide Chemical compound [K+].[K+].[K+].[K+].[Fe+2].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-].N#[C-] XOGGUFAVLNCTRS-UHFFFAOYSA-N 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 229920000428 triblock copolymer Polymers 0.000 description 2
- MQAYPFVXSPHGJM-UHFFFAOYSA-M trimethyl(phenyl)azanium;chloride Chemical compound [Cl-].C[N+](C)(C)C1=CC=CC=C1 MQAYPFVXSPHGJM-UHFFFAOYSA-M 0.000 description 2
- BMQZYMYBQZGEEY-UHFFFAOYSA-M 1-ethyl-3-methylimidazolium chloride Chemical compound [Cl-].CCN1C=C[N+](C)=C1 BMQZYMYBQZGEEY-UHFFFAOYSA-M 0.000 description 1
- HWPZZUQOWRWFDB-UHFFFAOYSA-N 1-methylcytosine Chemical compound CN1C=CC(N)=NC1=O HWPZZUQOWRWFDB-UHFFFAOYSA-N 0.000 description 1
- PIINGYXNCHTJTF-UHFFFAOYSA-N 2-(2-azaniumylethylamino)acetate Chemical group NCCNCC(O)=O PIINGYXNCHTJTF-UHFFFAOYSA-N 0.000 description 1
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 1
- OGHAROSJZRTIOK-KQYNXXCUSA-O 7-methylguanosine Chemical compound C1=2N=C(N)NC(=O)C=2[N+](C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OGHAROSJZRTIOK-KQYNXXCUSA-O 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 101710092462 Alpha-hemolysin Proteins 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 241000701844 Bacillus virus phi29 Species 0.000 description 1
- LSNNMFCWUKXFEE-UHFFFAOYSA-M Bisulfite Chemical compound OS([O-])=O LSNNMFCWUKXFEE-UHFFFAOYSA-M 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- 229920000858 Cyclodextrin Polymers 0.000 description 1
- 150000008574 D-amino acids Chemical class 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 241000588724 Escherichia coli Species 0.000 description 1
- LYCAIKOWRPUZTN-UHFFFAOYSA-N Ethylene glycol Chemical group OCCO LYCAIKOWRPUZTN-UHFFFAOYSA-N 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 102000004310 Ion Channels Human genes 0.000 description 1
- 108090000862 Ion Channels Proteins 0.000 description 1
- 150000007649 L alpha amino acids Chemical class 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- LRQKBLKVPFOOQJ-YFKPBYRVSA-N L-norleucine Chemical compound CCCC[C@H]([NH3+])C([O-])=O LRQKBLKVPFOOQJ-YFKPBYRVSA-N 0.000 description 1
- 108010052285 Membrane Proteins Proteins 0.000 description 1
- OKIZCWYLBDKLSU-UHFFFAOYSA-M N,N,N-Trimethylmethanaminium chloride Chemical compound [Cl-].C[N+](C)(C)C OKIZCWYLBDKLSU-UHFFFAOYSA-M 0.000 description 1
- 101100436871 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) atp-3 gene Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 108010076504 Protein Sorting Signals Proteins 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 241000640973 Pseudomonas taeanensis MS-3 Species 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 108091093078 Pyrimidine dimer Proteins 0.000 description 1
- 238000003559 RNA-seq method Methods 0.000 description 1
- 102220577380 Ras-related protein Rab-22A_Q64L_mutation Human genes 0.000 description 1
- PYMYPHUHKUWMLA-LMVFSUKVSA-N Ribose Natural products OC[C@@H](O)[C@@H](O)[C@@H](O)C=O PYMYPHUHKUWMLA-LMVFSUKVSA-N 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 101710183280 Topoisomerase Proteins 0.000 description 1
- 108091011223 Transmembrane protein 121 Proteins 0.000 description 1
- 238000002835 absorbance Methods 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 229910052783 alkali metal Inorganic materials 0.000 description 1
- 229910001514 alkali metal chloride Inorganic materials 0.000 description 1
- HMFHBZSHGGEWLO-UHFFFAOYSA-N alpha-D-Furanose-Ribose Natural products OCC1OC(O)C(O)C1O HMFHBZSHGGEWLO-UHFFFAOYSA-N 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 229920000469 amphiphilic block copolymer Polymers 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 230000008901 benefit Effects 0.000 description 1
- 150000001576 beta-amino acids Chemical class 0.000 description 1
- 229960003237 betaine Drugs 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000007385 chemical modification Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- HGCIXCUEYOPUTN-UHFFFAOYSA-N cis-cyclohexene Natural products C1CCC=CC1 HGCIXCUEYOPUTN-UHFFFAOYSA-N 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 238000012937 correction Methods 0.000 description 1
- 208000030381 cutaneous melanoma Diseases 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 229940097362 cyclodextrins Drugs 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 150000001945 cysteines Chemical class 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- OGQYPPBGSLZBEG-UHFFFAOYSA-N dimethyl(dioctadecyl)azanium Chemical compound CCCCCCCCCCCCCCCCCC[N+](C)(C)CCCCCCCCCCCCCCCCCC OGQYPPBGSLZBEG-UHFFFAOYSA-N 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 238000010828 elution Methods 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- 230000009144 enzymatic modification Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 230000004907 flux Effects 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 239000000446 fuel Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 125000000524 functional group Chemical group 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 1
- 230000003370 grooming effect Effects 0.000 description 1
- 229920001519 homopolymer Polymers 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 239000001257 hydrogen Substances 0.000 description 1
- 238000002847 impedance measurement Methods 0.000 description 1
- 239000002054 inoculum Substances 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 239000000138 intercalating agent Substances 0.000 description 1
- 239000002608 ionic liquid Substances 0.000 description 1
- 230000029226 lipidation Effects 0.000 description 1
- 235000018977 lysine Nutrition 0.000 description 1
- 150000002669 lysines Chemical class 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 238000002483 medication Methods 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 150000002894 organic compounds Chemical class 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- 230000003647 oxidation Effects 0.000 description 1
- 238000007254 oxidation reaction Methods 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 239000008363 phosphate buffer Substances 0.000 description 1
- 125000002467 phosphate group Chemical group [H]OP(=O)(O[H])O[*] 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 150000003222 pyridines Chemical class 0.000 description 1
- 239000013635 pyrimidine dimer Substances 0.000 description 1
- 238000005215 recombination Methods 0.000 description 1
- 230000006798 recombination Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 229920006395 saturated elastomer Polymers 0.000 description 1
- 238000005204 segregation Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 239000002356 single layer Substances 0.000 description 1
- 201000003708 skin melanoma Diseases 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 150000008163 sugars Chemical class 0.000 description 1
- 229920001059 synthetic polymer Polymers 0.000 description 1
- 125000000080 threosyl group Chemical group C1([C@@H](O)[C@H](O)CO1)* 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 230000005641 tunneling Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/21—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Pseudomonadaceae (F)
Definitions
- the invention belongs to the technical field of characterization of target analyte characteristics, and particularly relates to a mutant of a porin monomer, a protein pore containing the same and its application for detecting target analytes.
- nucleic acid sequencing technology continues to develop, becoming the core field of life science research, and playing a huge role in promoting the development of technology in the fields of biology, chemistry, electricity, life science, and medicine.
- Using nanopores to develop a new type of rapid, accurate, low-cost, high-precision and high-throughput nucleic acid sequencing technology is one of the hot spots of the post-human genome project.
- Nanopore sequencing technology also known as the fourth-generation sequencing technology, is a single-stranded nucleic acid molecule as a sequencing unit, using a nanopore that can provide an ion current channel, so that the single-stranded nucleic acid molecule is driven by electrophoresis. Through this nanopore, when the nucleic acid passes through the nanopore, the current of the nanopore is reduced, and the gene sequencing technology reads the sequence information in real time for the different signals generated.
- Nanopore sequencing can not only realize natural DNA and RNA sequencing, but also directly obtain the base modification information of DNA and RNA.
- the use of bisulfite treatment provides a great impetus for the direct study of epigenetic-related phenomena at the genomic level.
- nanopore detection technology has the advantages of low cost, high throughput, and label-free.
- Nanopore analysis technology originated from the invention of the Coulter counter and the recording technology of single-channel current.
- Physiology and Medicine Nobel Prize winners Neher and Sakamann used patch clamp technology to measure membrane potential in 1976 to study membrane proteins and ion channels, which promoted the practical application of nanopore sequencing technology.
- Kasianowicz et al. proposed a new idea of using ⁇ -hemolysin to sequence DNA, which was a milestone in the sequencing of biological nanopore single molecules.
- biological nanopores such as MspA porin and phage Phi29 connector enriched the research on nanopore analysis technology.
- Li et al. opened a new era of solid-state nanopore research. Limited by the development of the semiconductor and material industries, solid-state nanopore sequencing progresses slowly.
- nanopore sequencing technology lies in the design of a special biological nanopore.
- the read head structure formed by the constriction area in the hole can cause the channel current when single-stranded nucleic acid (such as ssDNA) molecules pass through the nanopore.
- the blockage of the nanopore briefly affects the current intensity flowing through the nanopore (the magnitude of the current change affected by each base is different), and finally the highly sensitive electronic device detects these changes to identify the base passing through.
- protein pores are used as nanopores for sequencing, and porins are mainly derived from Escherichia coli.
- nanopore sequencing technology there is a single nanopore protein, and it is necessary to develop alternative nanopore proteins to realize nanopore sequencing technology. Porins are also closely related to sequencing accuracy, and porins are also involved in changes in the interaction mode with rate-controlling proteins. Further optimizing the stability of the interaction interface between porins and rate-controlling proteins is crucial to improving the consistency and stability of sequencing data. have a positive impact. The accuracy of nanopore sequencing technology also needs to be improved. Therefore, it is necessary to develop improved nanopore proteins to further improve the resolution of nanopore sequencing.
- the purpose of the embodiments of the present invention is to provide a mutant of an alternative porin monomer, a protein pore comprising the same, and an application thereof.
- an embodiment of the present invention provides a mutant of a porin monomer, wherein the amino acid of the mutant of the porin monomer includes the sequence shown in SEQ ID NO: 1 or has at least 99%, 98 %, 97%, 96%, 95%, 90%, 80%, 70%, 60% or 50% identity sequence, or consists of it, and the amino acid of the mutant of the porin monomer is included in the corresponding A mutation at one or more of positions T71, S75, F76 of SEQ ID NO:1;
- T71, S75, F76 is specifically, T71, S75, F76, T71 and S75, S75 and F76, T71 and S76, or T71 and S75 and F76.
- the amino acid of the mutant of the porin monomer comprises one or Mutations at multiple positions.
- amino acids of the mutant of the porin monomer include:
- one or more positions corresponding to K69, P70, T71, P72, A73, S74, S75, and F76 of SEQ ID NO:1 have amino acid insertions, deletions and/or substitutions; (2) corresponding to SEQ ID There are amino acid insertions, deletions and (3) one or more positions corresponding to R62, D63, Y68, K69, T71, S75, F76, and E104 of SEQ ID NO:1 have amino acid insertions, deletions and/or substitutions; or ( 4) One or more positions corresponding to Y68, K69, P70, T71, P72, A73, S74, S75, F76, E171 and D175 of SEQ ID NO:1 have amino acid insertions, deletions and/or substitutions.
- amino acid mutation of the mutant of the porin monomer is selected from the following:
- the mutation at position 62 corresponding to SEQ ID NO:1 is 0 to 5 of S, C, U, T, M; the mutation at position 63 is 0 to 5 of V, G, A, L, and I;
- the mutation at position 68 is 0 to 2 among F and W; the mutation at position 69 is 0 to 2 among R and H; the mutation at position 71 is 0 to 4 among S, C, U, M; the mutation at position 75 is 0 to 5 of A, G, V, L, I; 76 mutations to 0 to 4 of Q, D, E, N; 104 mutations to 0 of V, G, A, L, I to 5 species; and
- amino acid mutation of the mutant of the porin monomer is selected from the following:
- R62 corresponding to SEQ ID NO: 1 is mutated to R62S, R62C, R62U, R62T or R62M; D63 is mutated to D63V, D63G, D63A, D63L or D63I; Y68 is mutated to Y68F or Y68W; K69 is mutated to K69R or K69H; T71 mutation to T71S, T71C, T71U, or T71M; S75 mutation to S75A, S75G, S75V, S75L, or S75I; F76 mutation to F76Q, F76D, F76E, or F76N; E104 mutation to E104V, E104G, E104A, E104L, or E104I; and
- amino acid mutation of the mutant of the porin monomer is selected from the following:
- an embodiment of the present invention provides a protein pore comprising at least one mutant of a porin monomer.
- the embodiment of the present invention provides a complex for characterizing a target analyte, characterized in that: the protein pore and the rate-controlling protein bound thereto.
- embodiments of the present invention provide nucleic acids encoding mutants of porin monomers, protein pores, or complexes.
- the embodiments of the present invention provide a vector or a genetically engineered host cell comprising the nucleic acid.
- the embodiments of the present invention provide mutants of porin monomers, protein pores, complexes, nucleic acids, vectors or host cells in detecting the presence, absence or one or more characteristics of target analytes or preparing detection target Application in a product of the presence, absence, or one or more characteristics of an analyte.
- the embodiment of the present invention provides a method for producing a protein pore or a polypeptide thereof, comprising transforming the host cell with the vector, and inducing the host cell to express the protein pore or a polypeptide thereof.
- embodiments of the present invention provide a method for determining the presence, absence or one or more characteristics of a target analyte, comprising:
- the method comprises: said target analyte interacting with said protein pore present in a membrane such that said target analyte moves relative to said protein pore.
- the target analyte is a nucleic acid molecule.
- a method for determining the presence, absence, or one or more characteristics of a target analyte comprises coupling said target analyte to a membrane; The protein pore interacts so that the target analyte moves relative to the protein pore.
- the embodiment of the present invention provides a kit for determining the presence, absence or one or more characteristics of a target analyte, including the mutant of the porin monomer, the protein pore, The complex, the nucleic acid, or the vector or host, and the membrane components.
- the embodiments of the present invention provide a device for determining the presence, absence or one or more characteristics of a target analyte, including the protein pore or the complex, and the membrane.
- the target analytes include polysaccharides, metal ions, inorganic salts, polymers, amino acids, peptides, proteins, nucleotides, oligonucleotides, polynucleotides, dyes, drugs, diagnostic agents, explosives substances or environmental pollutants;
- said target analyte comprises a polynucleotide
- said polynucleotide comprises DNA or RNA; and/or, said one or more characteristics are selected from (i) the length of said polynucleotide; (ii) the identity of said polynucleotide; (iii) the sequence of the polynucleotide; (iv) the secondary structure of the polynucleotide and (v) whether the polynucleotide is modified; and/or, the rate-controlling protein in the complex Including polynucleotide binding proteins.
- Figure 1 illustrates the basic working principle of a nanopore according to one embodiment.
- Fig. 2 shows a schematic diagram of DNA sequencing according to one embodiment.
- Figure 3 shows the corresponding pore blocking signal when nucleotides pass through the protein pore according to one embodiment.
- FIGS. 4A, 4B and 4C show a wild-type protein pore channel surface structure and ribbon diagram model according to one embodiment.
- 4A is a side view of the surface structure model
- FIG. 4B is a top view of the streamer structure model
- FIG. 4C is a streamer structure model.
- FIG. 5A and 5B show the amino acid model diagram of the mutation hole 1 according to an embodiment, FIG. 5A is a top view, and FIG. 5B is a side view.
- Fig. 6 shows the size information of various parts of the mutation hole 1 according to an embodiment.
- Fig. 7 shows a monomeric amino acid model diagram of porin mutant 1 according to an embodiment, (b) is the core amino acid composition of the constriction region (eye loop) enlarged in (a).
- Fig. 8 shows the results of negative staining electron microscopy of porin mutant 1 according to an embodiment.
- FIG. 9A shows a cryo-EM image of porin mutant 1 according to one embodiment.
- FIG. 9B shows 2D classification results.
- FIGS 10A and 10B show the locally refined Fourier shell correlation coefficient (FSC) results of porin mutant 1 according to one embodiment, a: rIn FSC unmasked graph; b: rIn FSC phase random masked graph; c: rIn FSC correction map; d: rIn FSC shielding map.
- FSC Fourier shell correlation coefficient
- Figure 11 shows the three-dimensional reconstruction of porin mutant 1 cryo-EM according to one embodiment High resolution electron density map.
- Figure 12 shows porin mutant 1 according to one embodiment in The electron density map below.
- Figure 13 shows the structure of the DNA construct BS7-4C3-SE1 according to one embodiment.
- Figure 14 shows the structure of the DNA construct BS7-4C3-PLT according to one embodiment.
- FIG. 15A shows the opening current and gating characteristics of mutation hole 1 at a voltage of ⁇ 180 mV according to one embodiment.
- FIG. 15B shows the condition of nucleic acid passing through the mutation hole 1 under the voltage of +180mV according to one embodiment.
- 16A and 16B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 1, according to one embodiment.
- Fig. 17 is an enlarged display diagram of a single signal area in the embodiment of Fig. 16B.
- FIG. 18A shows the opening current and gating characteristics of the mutation hole 2 at a voltage of ⁇ 180 mV according to one embodiment.
- FIG. 18B shows the condition of nucleic acid passing through the mutation hole 2 under the voltage of +180mV according to one embodiment.
- Figures 19A and 19B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 2, according to one embodiment.
- Fig. 20 is an enlarged display diagram of a single signal area in the embodiments of Fig. 19A and B.
- FIG. 21A shows the opening current and gating characteristics of the mutation hole 3 at a voltage of ⁇ 180 mV according to one embodiment.
- FIG. 21B shows the condition of nucleic acid passing through the mutation hole 3 under the voltage of +180mV according to one embodiment.
- 22A and 22B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 3, according to one embodiment.
- Fig. 23 is an enlarged display diagram of a single signal area in the embodiment of Fig. 22A.
- FIG. 24A shows the opening current and gating characteristics of the mutation hole 4 at a voltage of ⁇ 180 mV according to one embodiment.
- FIG. 24B shows the situation of nucleic acid passing through mutant hole 4 under the voltage of +180mV according to one embodiment.
- 25A and 25B show example current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 4, according to one embodiment.
- Fig. 26 is an enlarged display diagram of a single signal in the embodiment of Figs. 25A and 25B.
- FIG. 27 shows the opening current and gating characteristics of the mutation hole 5 at a voltage of ⁇ 180 mV according to one embodiment.
- Figure 28 shows an example current trace when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 5, according to one embodiment.
- Fig. 29 is an enlarged display diagram of a single signal area in the embodiment of Fig. 28 .
- FIG. 30A shows the opening current and gating characteristics of the mutation hole 6 at a voltage of ⁇ 180 mV according to one embodiment.
- FIG. 30B shows the condition of nucleic acid passing through the mutation hole 6 under the voltage of +180mV according to one embodiment.
- 31A and 31B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 6, according to one embodiment.
- Fig. 32 is an enlarged display diagram of a single signal area in the embodiment of Fig. 31A.
- Figure 33 shows the SDS-PAGE electrophoresis detection results of mutant 1 according to an embodiment.
- Figure 34 shows a size exclusion chromatogram of mutant 1 protein according to one embodiment.
- a nucleotide includes two or more nucleotides
- a helicase includes two or more helicases.
- nucleotide sequence refers to a polymeric form of nucleotides (ribonucleotides or deoxyribonucleotides) of any length. The term refers only to the primary structure of the molecule. Thus, the term includes double- and single-stranded DNA and RNA.
- nucleic acid refers to a single- or double-stranded covalently linked sequence of nucleotides wherein the 3' and 5' ends on each nucleotide are linked by a phosphodiester bond.
- Nucleotides can consist of deoxyribonucleotide bases or ribonucleotide bases.
- Nucleic acids can include DNA and RNA, and can be prepared synthetically in vitro or isolated from natural sources.
- the nucleic acid may further comprise modified DNA or RNA, such as methylated DNA or RNA, or RNA that has been post-translationally modified, such as 5'-capping with 7-methylguanosine, 3'-end processing, such as cleavage and polyadenylation, and splicing.
- Nucleic acids can also include synthetic nucleic acids (XNA), such as hexitol nucleic acid (HNA), cyclohexene nucleic acid (CeNA), threose nucleic acid (TNA), glycerol nucleic acid (GNA), locked nucleic acid (LNA) and peptide nucleic acid (PNA).
- HNA hexitol nucleic acid
- CeNA cyclohexene nucleic acid
- TAA threose nucleic acid
- GNA glycerol nucleic acid
- LNA locked nucleic acid
- PNA peptide nucleic acid
- nucleic acid or polynucleotide
- bp base pairs
- nt nucleotides
- kb kilobase pair
- Polynucleotides of less than about 40 nucleotides in length are commonly referred to as “oligonucleotides” and may contain primers used in DNA manipulations, such as by polymerase chain reaction (PCR).
- Polynucleotides such as nucleic acids, are macromolecules comprising two or more nucleotides.
- the polynucleotide or nucleic acid may comprise any combination of nucleotides.
- the nucleotides may be naturally occurring or synthetic.
- One or more nucleotides in the polynucleotide may be oxidized or methylated.
- One or more nucleotides in the polynucleotide may be damaged.
- the polynucleotide may comprise a pyrimidine dimer. This dimer is often associated with damage caused by UV light and is a major cause of skin melanoma.
- One or more nucleotides in the polynucleotide may be modified, for example with conventional labels or tags.
- the polynucleotide may comprise one or more abasic (ie, lacking a nucleobase), or lacking a nucleobase and sugar (ie, being C3) nucleotides.
- the nucleotides in the polynucleotide may be linked to each other in any manner.
- the nucleotides are usually linked by their sugar and phosphate groups, as in nucleic acids.
- the nucleotides may be linked via their nucleobases, as in pyridine dimers.
- a polynucleotide can be single-stranded or double-stranded. At least a portion of the polynucleotide is preferably double-stranded.
- a polynucleotide may be a nucleic acid, such as deoxyribonucleic acid (DNA) or ribonucleic acid (RNA).
- a polynucleotide may comprise an RNA strand hybridized to a DNA strand.
- the polynucleotide may be any synthetic nucleic acid known in the art, such as peptide nucleic acid (PNA), glycerol nucleic acid (GNA), threose nucleic acid (TNA), locked nucleic acid (LNA) or other synthetic nucleic acid having nucleotide side chains.
- PNA peptide nucleic acid
- GNA glycerol nucleic acid
- TNA locked nucleic acid
- LNA locked nucleic acid having nucleotide side chains.
- the PNA backbone is composed of repeating N-(2-aminoethyl)-glycine units linked by peptide bonds.
- the GNA backbone is composed of repeating ethylene glycol units linked by phosphodiester bonds.
- the TNA backbone is composed of repeating threosyl groups linked together by phosphodiester bonds.
- LNA is formed from ribonucleic acid as described above, with an additional bridge connecting the 2' oxygen and 4' carbon in the ribose moiety.
- Bridged nucleic acids are modified RNA nucleotides. They can also be called restricted or inaccessible RNA13BNA monomers can contain 5-membered, 6-membered or even 7-membered bridges with "fixed" C3'-endo sugar puckering .
- the bridging structure is synthetically introduced into the 2',4'-position of ribose to produce a 2',4'-BNA monomer.
- the polynucleotide is most preferably ribonucleic acid (RNA) or deoxyribonucleic acid (DNA).
- a polynucleotide can be of any length.
- a polynucleotide can be at least 10, at least 50, at least 100, at least 150, at least 200, at least 250, at least 300, at least 400 or at least 500 nucleotides or nucleotide pairs in length.
- the polynucleotide may be 1000 or more nucleotides or nucleotide pairs, 5000 or more nucleotides or nucleotide pairs or 100000 or more nucleotides or cores in length nucleotide pair.
- any number of polynucleotides can be studied.
- the methods of the embodiments may involve characterizing 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 50, 100 or more polynucleotides. If two or more polynucleotides are characterized, they may be different polynucleotides or in the case of the same polynucleotide.
- Polynucleotides can be naturally occurring or synthetic.
- the methods can be used to verify the sequence of prepared oligonucleotides.
- the methods are typically performed in vitro.
- amino acid is used in its broadest sense and is meant to include amino acids containing amine ( NH2 ) and carboxyl (COOH) functional groups as well as side chains (such as R groups) that are unique to each amino acid. of organic compounds.
- amino acid refers to a naturally occurring L ⁇ -amino acid or residue.
- amino acid also includes D-amino acids, retro-inverse amino acids, and chemically modified amino acids (such as amino acid analogs), naturally occurring amino acids (such as norleucine) that are not usually incorporated into proteins, and amino acids that are known in the art.
- Chemically synthesized compounds such as ⁇ -amino acids with properties known to be characteristic of amino acids.
- analogs or mimetics of phenylalanine or proline are included in the definition of amino acids, which allow the same conformational constraints on peptide compounds as native Phe or Pro. Such analogs and mimetics are referred to herein as "functional equivalents" of the corresponding amino acids.
- protein protein
- polypeptide and “peptide” are further used interchangeably herein to refer to polymers of amino acid residues as well as variants and synthetic analogs of amino acid residues. Accordingly, these terms apply to amino acid polymers in which one or more amino acid residues are a synthetic non-naturally occurring amino acid, such as a chemical analog of the corresponding naturally occurring amino acid, as well as to naturally occurring amino acid polymers.
- Polypeptides may also undergo maturation or post-translational modification processes, which may include, but are not limited to, glycosylation, proteolytic cleavage, lipidation, signal peptide cleavage, propeptide cleavage, phosphorylation, and the like.
- “Homologues” of a protein encompass peptides, oligonucleotides, oligonucleotides having amino acid substitutions, deletions and/or insertions relative to the unmodified or wild-type protein in question and having similar biological and functional activity to the unmodified protein from which they were derived. Peptides, polypeptides, proteins and enzymes.
- amino acid identity refers to the degree to which sequences are identical on an amino acid-by-amino acid basis over a comparison window.
- percent sequence identity is calculated by comparing two optimally aligned sequences over a comparison window and determining the occurrence of identical amino acid residues (e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gln, Cys, and Met) to get the number of matching positions, divide the number of matching positions by the positions in the comparison window total (i.e., window size), and multiply the result by 100 to obtain percent sequence identity.
- amino acid residues e.g., Ala, Pro, Ser, Thr, Gly, Val, Leu, Ile, Phe, Tyr, Trp, Lys, Arg, His, Asp, Glu, Asn, Gln, Cys, and Met
- Sequence identity may also be to fragments or portions of full-length polynucleotides or polypeptides. Thus, a sequence may have only 50% overall sequence identity to the full-length reference sequence, but the sequence of a particular region, domain or subunit may have 80%, 90%, or as much as 99% sequence identity to the reference sequence.
- wild-type refers to a gene or gene product isolated from a naturally occurring source.
- a wild-type gene is the most frequently observed gene in a population, and thus is arbitrarily designed to be the "normal” or "wild-type” form of that gene.
- modified refers to a gene or gene product that exhibits sequence modifications (for example, substitutions, truncations or insertions), post-translational modifications and/or A gene or gene product that has a functional property (eg, altered property). Note that naturally occurring mutants can be isolated; these mutants are identified by the fact that they have altered characteristics compared to the wild-type gene or gene product.
- arginine (R) can be substituted for methionine (M) by substituting a codon for arginine (CGT) for methionine (ATG) at the relevant position in the polynucleotide encoding the mutated monomer.
- CCT codon for arginine
- ATG methionine
- non-naturally occurring amino acids can be introduced by including a synthetic aminoacyl-tRNA in the IVTT system used to express the mutated monomer.
- non-naturally-occurring amino acids can be introduced by expressing mutated monomers in Pseudomonas taeanensis MS-3, which are involved in the synthesis of those specific amino acids (i.e., non-naturally-occurring ) analogs are auxotrophic for a particular amino acid.
- Mutant monomers can also be produced by naked ligation if they are produced using partial peptide synthesis. Conservative substitutions replace an amino acid with another amino acid of similar chemical structure, similar chemical properties, or similar side chain volume.
- the introduced amino acids may have a similar polarity, hydrophilicity, hydrophobicity, basicity, acidity, neutrality or charge to the amino acids they replace.
- a conservative substitution can introduce another aromatic or aliphatic amino acid in place of a pre-existing aromatic or aliphatic amino acid.
- Conservative amino acid changes are well known in the art and can be selected based on the properties of the 20 major amino acids defined in Table 1 below. In the case of amino acids with similar polarity, this can also be determined by reference to the hydrophilicity scale for amino acid side chains in Table 2.
- Mutated or modified proteins, monomers or peptides may also be chemically modified at any point in any manner.
- the mutated or modified monomer or peptide is preferably by attachment of the molecule to one or more cysteines (cysteine linkage), attachment of the molecule to one or more lysines, attachment of the molecule to one or more chemical modification by attachment of an unnatural amino acid, enzymatic modification of the epitope, or modification of the terminus. Suitable methods for making such modifications are well known in the art.
- Modified mutants of proteins, monomers or peptides can be chemically modified by attachment of any molecule.
- mutants of modified proteins, monomers, or peptides can be chemically modified by the attachment of dyes or fluorophores.
- a mutated or modified monomer or peptide is chemically modified with a molecular adapter that facilitates the interaction between a pore comprising the monomer or peptide and a target nucleotide or target polynucleotide sequence.
- Molecular adapters are preferably cyclic molecules, cyclodextrins, substances capable of hybridization, DNA binding or intercalating agents, peptides or peptide analogs, synthetic polymers, aromatic planar molecules, positively charged small molecules or capable of hydrogen bonding bonded small molecules.
- the presence of the adapter improves the host-guest chemistry of the pore and the nucleotide or polynucleotide sequence, thereby improving the sequencing capability of the pore formed from the mutated monomer.
- the principles of host-guest chemistry are well known in the art.
- Adapters have an effect on the physical or chemical properties of the pore that improves the interaction of the pore with the nucleotide or polynucleotide sequence.
- An adapter can alter the charge of the barrel or channel of the pore, or specifically interact or bind to a nucleotide or polynucleotide sequence, thereby facilitating its interaction with the pore.
- a “protein pore” is a transmembrane protein structure that defines a channel or pore that allows the translocation of molecules and ions from one side of the membrane to the other. The translocation of ionic species through the pore can be driven by a potential difference applied to either side of the pore.
- a “nanopore” is a protein pore in which the smallest diameter of the pathway through which molecules or ions pass is on the nanometer scale (10 -9 meters).
- the protein pore may be a transmembrane protein pore.
- the transmembrane protein structure of the protein pore can be monomeric or oligomeric in nature.
- a pore comprises multiple polypeptide subunits arranged around a central axis, forming a protein-lined channel extending substantially perpendicular to the membrane in which the nanopore resides.
- the number of polypeptide subunits is not limited. Typically, the number of subunits is from 5 to 30, suitably from 6 to 10. Alternatively, the number of subunits is not as defined as in the case of perfringolysin or related large membrane pores.
- the portion of the protein subunit that forms the protein-lined channel within the nanopore typically contains secondary structural motifs that may include one or more transmembrane ⁇ -barrel and/or ⁇ -helical portions.
- the protein pore comprises one or more porin monomers.
- Each porin monomer may be from Pseudomonas cestodes.
- the protein pore comprises a mutant of one or more porin monomers (ie, one or more porin mutated monomers).
- the porin is from a wild-type protein from the kingdom of Bacteria, a wild-type homologue, or a mutant thereof. Mutants can be modified porins or porin mutants. Modifications in a mutant include, but are not limited to, any one or more modifications or combinations of modifications disclosed herein.
- the kingdom wild-type protein is a protein from Pseudomonas cestodes.
- a porin homologue refers to a protein having at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91 %, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50% complete sequence identity of the polypeptides.
- a porin homologue refers to a polynucleotide having at least 99%, 98%, 97%, 96%, 95%, 94%, 93% of the polynucleotide encoding the protein shown in SEQ ID NO:2 , 92%, 91%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, 50% complete sequence identity.
- the polynucleotide sequence may comprise a sequence that differs from SEQ ID NO: 2 based on the degeneracy of the genetic code.
- Polynucleotide sequences can be derived and replicated using standard methods in the art. Chromosomal DNA encoding a wild-type porin can be extracted from a pore-producing organism such as Pseudomonas cestodes. The gene encoding the pore subunit can be amplified using PCR including specific primers. The amplified sequence can then be subjected to site-directed mutagenesis. Suitable methods of site-directed mutagenesis are known in the art and include, for example, combinatorial chain reactions.
- the constructed polynucleotides encoding the embodiments can be prepared by techniques known in the art, for example, in Sambrook, J. and Russell, D. (2001). Molecular Cloning A Laboratory Manual, 3rd Edition. Cold Spring Harbor Laboratory Press, Cold Spring Those described in Harbor, NY.
- polynucleotide sequences can then be incorporated into recombinant replicable vectors, such as cloning vectors.
- the vector can be used to replicate the polynucleotide in a compatible host cell.
- polynucleotide sequences can be prepared by introducing the polynucleotide into a replicable vector, introducing the vector into a compatible host cell, and growing the host cell under conditions that cause replication of the vector. The vector can be recovered from the host cell.
- an insulating film 102 with nanoscale pores divides the cavity into two small chambers, as shown in FIG. 1 , when a voltage acts on the electrolyte chamber, ions or other Small molecules pass through the pores under the action of an electric field, forming a stable and detectable ionic current.
- ions or other Small molecules pass through the pores under the action of an electric field, forming a stable and detectable ionic current.
- Different types of biomolecules can be detected by knowing the size and surface properties of the nanopores, the applied voltage, and the solution conditions.
- ssDNA single-stranded DNA
- FIG. 2 shows a schematic diagram 200 of DNA sequencing.
- the nanopore is the only channel for ions on both sides of the phospholipid membrane to pass through.
- Rate-controlling proteins such as polynucleotide binding proteins act as motor proteins for nucleic acid molecules such as DNA, pulling DNA strands sequentially through the nanopore/protein pore in steps of single nucleotides.
- the corresponding pore blocking signal is recorded ( Figure 3).
- porins are screened from different species (mainly bacteria and archaea) in nature by means of bioinformatics and evolution.
- the porin is from any organism, preferably from Pseudomonas cestodes.
- sequence analysis the porin has a complete functional domain.
- structural biology methods to predict and analyze the 3D structure model of the porin, select the channel protein with a suitable reading head architecture.
- genetic engineering, protein engineering, protein directed evolution, and computer-aided protein design are used to transform, test, and optimize candidate channel proteins (or porins).
- multiple homologous protein mutants are obtained, preferably Six (different homologous protein backbones), with different signal characteristics and signal distribution patterns.
- porins in the examples can be applied to the fourth generation sequencing technology.
- the porin is a nanoporin.
- porins can be applied to solid state pores for sequencing.
- a new protein backbone is used to form a new constriction region (reading head region) structure, thereby providing a new mode of action during the sequencing process.
- the porins of the examples have good edge-hopping distribution and efficiency of recombination with phospholipid membranes.
- the wild-type porin monomer is genetically mutated to form a mutant of the porin monomer.
- the amino acid of the mutant of the porin monomer comprises the sequence shown in SEQ ID NO: 1 or comprises at least 99%, 98%, 97%, 96%, 95%, 94%, 93% , 92%, 91%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, or 50% identical sequence, and the mutant of the porin monomer
- the amino acids correspond to one or more of the positions 62-79, 104, 170 and 175 of SEQ ID NO: 1 with mutations.
- mutations include insertions, deletions and/or substitutions of amino acids.
- one or more positions of 62-79, 104, 170 and 175 of SEQ ID NO:1 have a mutation, which is among 62-79, 104, 170 and 175 of SEQ ID NO:1 have amino acid insertions, deletions and/or substitutions at one or more positions.
- the amino acid of the mutant of the porin monomer corresponds to (1) 69-76, (2) 64-79, (3) 62, 63, 68, 69 of SEQ ID NO: 1 , 71, 75, 76, and 104, or (4) one or more of positions 68-76 and 171 and 175.
- the amino acid of the mutant of the porin monomer corresponds to (1) 69-76, (2) 64-79, (3) 62, 63, 68, 69 of SEQ ID NO: 1 , 71, 75, 76, and 104, or (4) one or more of positions 68-76 and 171 and 175 have amino acid insertions, deletions and/or substitutions.
- the amino acid of the mutant of the porin monomer has mutations only at positions 69-76 corresponding to SEQ ID NO: 1 (i.e. K69, P70, T71, P72, A73, S74, S75, F76) , or have amino acid insertions, deletions and/or substitutions at one or more positions.
- the amino acid of the mutant of the porin monomer is only at positions 64-79 corresponding to SEQ ID NO: 1 (i.e. Q64, T65, G66, Q67, Y68, K69, P70, T71, P72, A73, S74, S75, F76, S77, T78 and S79) have mutations, or have insertions, deletions and/or substitutions of amino acids at one or more positions.
- the amino acid of the mutant of the porin monomer has mutations only at positions R62, D63, Y68, K69, T71, S75, F76Q, and E104 corresponding to SEQ ID NO: 1, or at one or There are insertions, deletions and/or substitutions of amino acids at various positions.
- the amino acid of the mutant of the porin monomer is only at positions 68-76 corresponding to SEQ ID NO: 1 (i.e. Y68, K69, P70, T71, P72, A73, S74, S75 and F76) , E171 and D175 have mutations, or have insertions, deletions and/or substitutions of amino acids at one or more positions.
- the position corresponding to SEQ ID NO: 1 means that no matter whether the sequence number is changed by amino acid insertion or deletion or the use of an identity sequence, the relative position remains unchanged, and the sequence of SEQ ID NO: 1 can still be used number.
- Q64 corresponding to SEQ ID NO: 1 can be mutated to Q64L, even if the sequence number of SEQ ID NO: 1 changes or adopts a sequence with the identity defined herein with SEQ ID NO: 1, corresponding to the sequence of SEQ ID NO: 1 Amino acid Q at position 64 (even if it is not position 64 in another sequence) can also be mutated into L, which is still within the protection scope of the present invention.
- the amino acids of the mutant of the porin monomer consist of the sequence shown in SEQ ID NO: 1, or have at least 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 85%, 80%, 75%, or 70%, 65%, 60%, 55%, or 50% identity sequence composition, and the porin monomer
- the amino acid of the mutant corresponds to one or more positions of 62-79, 104, 170 and 175 of SEQ ID NO:1.
- the sequence of SEQ ID NO: 1 of the porin monomer is from Pseudomonas cestodes.
- the nucleotide sequence encoding the amino acid of SEQ ID NO:1 is SEQ ID NO:2.
- the amino acid KPTPASSF at position 69-76 is mutated into M 1 M 2 M 3 M 4 M 5 M 6 M 7 M 8 , wherein M 1 is selected from R, K or H; M 2 is selected from P; M 3 is selected from S, G, C, U, T, M, A, V, L or I; M 4 is selected from P; M 5 is selected from A, G, V, L or I; M 6 is selected from S, C , U, T or M; M 7 is selected from A, T, G, V, L, I, S, C, U or M; M 8 is selected from Q, D, E, N, K, H or R.
- the amino acid QTGQYKPTPASSFSTS corresponding to positions 64-79 of SEQ ID NO: 1 is mutated to M 9 M 10 M 11 M 12 M 13 M 14 M 15 M 16 M 17 M 18 M 19 M 20 M 21 M 22 M 23 M 24 , wherein M 9 is selected from L, G, A, V or I; M 10 is selected from T, S, C, U or M; M 11 is selected from G, A, V, L or I; M 12 is selected from Q, D, E or N; M 13 is selected from Y, F or W; M 14 is selected from R, H or K; M 15 is selected from P; M 16 is selected from S, C, U, T or M M M 17 is selected from P; M 18 is selected from A, G, V, L or I; M 19 is selected from S, C, U, T or M; M 20 is selected from A, G, V, L or I; 21 is selected from N, D, E, Q, L, G, A, V or I; M 22 is selected from S, C, C
- R62 corresponding to SEQ ID NO:1 is mutated to R62S, R62C, R62U, R62T or R62M; D63 is mutated to D63V, D63G, D63A, D63L or D63I; Y68 is mutated to Y68F or Y68W; K69 is mutated to K69R or K69H; T71 to T71S, T71C, T71U, or T71M; S75 to S75A, S75G, S75V, S75L, or S75I; F76 to F76Q, F76D, F76E, or F76N; E104 to E104V, E104G, E104A, E104L, or E104I .
- amino acid YKPTPASSF corresponding to positions 68-76 of SEQ ID NO: 1 is mutated to M 25 M 26 M 27 M 28 M 29 M 30 M 31 M 32 M 33
- E171 is mutated to E171N, E171D or E171Q
- D175 is mutated to D175N, D175E or D175Q
- M25 is selected from F, Y or W
- M26 is selected from R, H or K
- M27 is selected from P
- M28 is selected from S, C, U, T or M M 29 is selected from P
- M 30 is selected from A, G, V, L or I
- M 31 is selected from S, C, U, T or M
- M 32 is selected from A, G, V, L or I
- 33 is selected from Q, D, E, or N.
- a mutant of a porin monomer wherein the amino acid mutation is selected from the following:
- the amino acid sequence of the mutant of the porin monomer comprises SEQ ID NO: 24, SEQ ID NO: 26, SEQ ID NO: 27, SEQ ID NO: 28, SEQ ID NO: 29 or SEQ ID NO: 29 or SEQ ID NO: 24 ID NO:30, or consisting of it.
- the protein pore comprises at least one mutant of a porin monomer (or a mutated monomer of a porin). In one embodiment, the protein pore comprises at least two, three, four, five, six, seven, eight, nine, or ten or more mutants of the porin monomer. In one embodiment, the protein pore comprises mutants of at least two porin monomers, which may be the same or different. In one embodiment, the protein pore comprises mutants of two or more monomers of the porin, preferably the mutants of the two or more monomers are the same. In one embodiment, the diameter of the constricted region of the protein pore is 0.7nm-2.2nm, 0.9nm-1.6nm, 1.4-1.6nm or
- mutants of porin monomers or protein pores comprising the same for detecting the presence, absence or one or more characteristics of a target analyte.
- mutants of porin monomers or protein pores are used to detect the sequence of a nucleic acid molecule, or characterize a polynucleotide sequence, eg, sequence a polynucleotide sequence, because they can distinguish between different nucleotides with high sensitivity.
- Mutants of porin monomers or protein pores that include them can discriminate between four nucleotides in DNA and RNA, and even between methylated and unmethylated nucleotides, with unexpectedly high resolution .
- dCMP deoxycytosine monophosphate
- Mutants of porin monomers or protein pores can also discriminate between different nucleotides under a range of conditions.
- mutants of said porin monomers or protein pores discriminate between nucleotides under conditions favorable for nucleic acid characterization, such as sequencing.
- the degree to which mutants of porin monomers or protein pores discriminate between different nucleotides can be controlled by varying the applied potential, salt concentration, buffer, temperature and the presence of additives such as urea, betaine and DTT. This allows mutants of porin monomers or the function of the protein pore to be finely tuned, especially when sequenced.
- Mutants of porin monomers or protein pores can also be used to identify polynucleotide polymers by interacting with one or more monomers rather than at nucleotide-based nucleotides.
- a mutant of a porin monomer or a protein pore may be isolated, substantially isolated, purified or substantially purified. Mutants of the porin monomers or protein pores of the examples are isolated or purified if they are completely free of any other components, such as liposomes or other protein pores/porins. A mutant porin monomer or protein pore is substantially isolated if the mutant porin monomer or protein pore is mixed with a carrier or diluent that does not interfere with its intended use.
- the mutant of the porin monomer or the protein pore can contain less than 10%, less than 5%, less than 2% or less than 1% of other components such as triblock copolymers, liposomes or other protein pores/pores If the mutant porin monomer or protein pore exists in the form of a protein, the porin monomer is substantially isolated or substantially purified. Alternatively, mutants of porin monomers or protein pores can be present in the membrane.
- the membrane is preferably an amphiphilic layer.
- the amphiphilic layer is a layer formed of amphiphilic molecules, for example, phospholipids, which have both hydrophilic and lipophilic properties.
- Amphiphiles can be synthetic or naturally occurring.
- the amphiphilic layer can be monolayer or bilayer.
- the amphiphilic layer is usually planar.
- the amphiphile may be curved.
- the amphiphilic layer may be supported.
- the membrane can be a lipid bilayer.
- a lipid bilayer is formed by two opposing layers of lipids. The two layers of lipids are aligned such that their hydrophobic tail groups face each other to form a hydrophobic interior.
- the hydrophilic headgroups of the lipids face outward towards the aqueous environment on each side of the bilayer.
- the membrane includes a solid state layer. Solid layers can be formed from organic and inorganic materials. If the membrane includes a solid state layer, the pores are typically present in the amphiphilic membrane or in a layer comprised within the solid state layer, eg, in holes, wells, gaps, channels, trenches or slits within the solid state layer.
- Embodiments provide a method of determining the presence, absence, or one or more properties of a target analyte.
- the method involves contacting the target analyte with a mutant porin monomer or protein pore such that the target analyte moves relative to, e.g., through, the mutant porin monomer or protein pore, and One or more measurements are taken as the target analyte moves relative to the mutant porin monomer or protein pore, thereby determining the presence, absence, or one or more properties of the target analyte.
- the target analyte may also be referred to as a template analyte or an analyte of interest.
- Target analytes are preferably polysaccharides, metal ions, inorganic salts, polymers, amino acids, peptides, polypeptides, proteins, nucleotides, oligonucleotides, polynucleotides, dyes, drugs, diagnostic agents, explosives or environmental pollutants .
- the method may involve determining the presence, absence or one or more properties of two or more target analytes of the same class, eg, two or more proteins, two or more nucleotides, or two or more drugs.
- the method may involve determining the presence, absence or one or more properties of two or more different classes of target analytes, e.g., one or more proteins, one or more nucleotides and one or more medications.
- the method comprises contacting the target analyte with a mutant porin monomer or protein pore such that the target analyte moves through the mutant porin monomer or protein pore.
- the protein pore typically comprises at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9 or at least 10 porin mutant monomers, e.g. , 7, 8, 9 or 10 monomers.
- the protein pores comprise identical monomers or different porin monomers, preferably 8 or 9 identical monomers. One or more of said monomers, eg 2, 3, 4, 5, 6, 7, 8, 9 or 10, are preferably chemically modified as discussed above.
- the amino acid of each monomer comprises SEQ ID NO: 1 and the above-mentioned mutants thereof.
- the amino acid of each monomer consists of SEQ ID NO: 1 and its above-mentioned mutants.
- the methods of the embodiments may measure two, three, four or five or more characteristics of a polynucleotide.
- the one or more characteristics are preferably selected from (i) the length of the polynucleotide, (ii) the identity of the polynucleotide, (iii) the sequence of the polynucleotide, (iv) the secondary structure of the polynucleotide, and ( v) whether the polynucleotide has been modified. In one embodiment, any combination of (i) to (v) may be measured.
- (i) for example, by determining the number of interactions between the polynucleotide and the mutant/protein pore of the protein monomer or the duration of the interaction between the polynucleotide and the mutant/protein pore of the protein monomer vs. The length of the polynucleotide is measured.
- the identity of the polynucleotide can be measured in a variety of ways, either in conjunction with or without measurement of the polynucleotide sequence.
- the former is simpler; the polynucleotides are sequenced and then identified.
- the latter can be done in several different ways. For example, the presence of a particular motif in a polynucleotide can be measured (without measuring the rest of the sequence of the polynucleotide).
- measurement of specific electrical and/or optical signals in the method can identify the polynucleotide as coming from a specific source.
- the sequence of the polynucleotide can be determined as previously described. Suitable sequencing methods, in particular those using electrical measurements, are described in Stoddart D et al., ProC Natl Acad Sci, 12; 106(19)7702-7, Lieberman KR et al, J Am Chem SoC. 2010; 132(50)17961-72, and in International Application W02000/28312.
- secondary structure can be measured in a number of ways. For example, if the method involves an electrical measurement method, the secondary structure can be measured using a change in residence time or a change in the current flowing through the pore. This allows distinguishing between regions of single- and double-stranded polynucleotides.
- the presence or absence of any modification can be measured.
- the method comprises determining whether the polynucleotide is abasic or lacks nucleobases and sugars by methylation, oxidation, damage, with one or more proteins or with one or more labels, tagging or grooming. Certain modifications will result in specific interactions with the pore, which can be measured using the methods described below. For example, methylcytosine can be distinguished from cytosine based on the current flowing through the pore during its interaction with each nucleotide.
- the target polynucleotide is contacted with a mutant protein monomer/protein pore, such as a mutant protein monomer/protein pore as in the examples.
- Mutants/protein pores of said protein monomers are usually present in membranes. Suitable membranes are described above.
- the method can be performed using any device suitable for studying membranes/protein pores or mutant systems of porin monomers - where mutants of protein monomers/protein pores are present in membranes.
- the method can be performed using any device suitable for use with the transmembrane porosity side.
- the device comprises a chamber containing an aqueous solution and a barrier dividing the chamber into two parts.
- the barrier typically has pores in which a membrane comprising pores is formed.
- the barrier forms a membrane in which mutants/protein pores of protein monomers are present.
- the method can be performed using the apparatus described in International Application No. PCT/GB08/000562 (WO 2008/102120).
- Electrical measurements include voltage measurement, capacitance measurement, current measurement, impedance measurement, tunneling measurement (Ivanov AP et al., Nano Lett.2011Jan 12; 11(I):279-85) and FET measurement (international application TO 2005/124888).
- Optical measurements can be combined with electrical measurements (Soni GV et al., Rev Sci Instrum. 2010 Jan; 81 (1) 014301).
- the measurement may be a transmembrane current measurement, eg measurement of ionic current flowing through the pore.
- the electrical or optical measurements may employ conventional electrical or optical measurements.
- Electrical measurements can be used as described in Stoddart D et al., ProC Natl Acad Sci, 12; 106(19) 7702-7, Lieberman KR et al, J Am Chem SoC. 2010; 132(50) 17961-72 and the international application WO Standard single channel recording equipment in 2000/28312.
- electrical measurements can be performed using a multi-channel system, for example as described in International Application WO 2009/077734 and International Application WO 2011/067559.
- the method is preferably carried out using an electrical potential applied across the membrane.
- the applied potential may be a voltage potential.
- the applied potential may be a chemical potential.
- An example of this is using a salt gradient across a membrane, such as an amphiphilic layer. Salt gradients are disclosed in Holden et al., J Am Chem SoC. 2007 Jul 11;129(27):8650-5.
- the current flowing through the mutant/protein pore of the protein monomer as the polynucleotide moves relative to the mutant/protein pore of the protein monomer is used to estimate or determine the sequence of the polynucleotide. This is chain sequencing.
- the method may comprise measuring the current flowing through the pore as the polynucleotide moves relative to the pore.
- the apparatus used in the method may thus also include circuitry capable of applying an electrical potential and measuring electrical signals across the membrane and pores.
- the method can be performed using patch clamp or voltage clamp.
- the method may include measuring the current flowing through the pore as the polynucleotide moves relative to the pore. Suitable conditions for measuring ion flux through transmembrane protein pores are known in the art and disclosed in the Examples.
- the method is generally carried out by applying a voltage across the membrane and the pore.
- the voltage used is typically from +5V to -5V, eg from +4V to -4V, from +3V to -3V or from +2V to -2V.
- the voltage used is typically from -600mV to +600V or -400mV to +400mV.
- the voltage used preferably has a lower limit selected from -400mV, -300mV, -200mV, -150mV, -100mV, -50mV, -20mV and 0mV and independently selected from +10mV, +20mV, +50mV, +100mV, +150mV, +200mV, +300nA ⁇ P+400mV upper limit range.
- the voltage used is more preferably in the range of 100mV to 240mV and most preferably in the range of 120mV to 220mV.
- the process is generally carried out in the presence of any charge carrier, such as a metal salt such as an alkali metal salt, a halide salt such as a chloride salt, such as an alkali metal chloride salt.
- Charge carriers may include ionic liquids or organic salts such as tetramethylammonium chloride, trimethylphenylammonium chloride, phenyltrimethylammonium chloride or 1-ethyl-3-methylimidazolium chloride.
- the salt is present in the aqueous solution in the chamber.
- potassium chloride (KCl), sodium chloride (NaCl), cesium chloride (CsCl) or a mixture of potassium ferrocyanide and potassium ferricyanide is used.
- KCl, NaCl and mixtures of potassium ferrocyanide and potassium ferricyanide are preferred.
- Charge carriers may be asymmetric on the membrane. For example, the type and/or concentration of charge carriers may be different on each side of the membrane.
- the concentration of the salt may be saturated.
- the concentration of the salt may be 3M or less, and is typically 0.1 to 2.5M, 0.3 to 1.9M, 0.5 to 1.8M, 0.7 to 1.7M, 0.9 to 1.6M or 1M to 1.4M.
- the concentration of the salt is preferably from 150 mM to 1M.
- the method is preferably performed using a salt concentration of at least 0.3M, eg at least 0.4M, at least 0.5M, at least 0.6M, at least 0.8M, at least 1.0M, at least 1.5M, at least 2.0M, at least 2.5M or at least 3.0M.
- a high salt concentration provides a high signal-to-noise ratio and allows the passing of the current to indicate the presence of the nucleotide to be identified against the background of normal current fluctuations.
- the methods are generally performed in the presence of a buffer.
- the buffer is present in the aqueous solution in the chamber. Any buffer may be used in the methods of the invention.
- the buffer is a phosphate buffer.
- Other suitable buffers are HEPES or Tris-HCl buffer.
- the process is typically performed at a pH of 4.0 to 12.0, 4.5 to 10.0, 5.0 to 9.0, 5.5 to 8.8, 6.0 to 8.7, 7.0 to 8.8, or 7.5 to 8.5.
- the pH used is preferably about 7.5.
- the method can be carried out at a temperature of 0°C to 100°C, 15°C to 95°C, 16°C to 90°C, 17°C to 85°C, 18°C to 80°C, 19°C to 70°C or 20°C to 60°C.
- the method is generally performed at room temperature.
- the method is optionally performed at a temperature that supports enzyme function, eg, about 37°C.
- a method for determining the presence, absence, or one or more characteristics of a target analyte comprises coupling the target analyte to a membrane; and combining the target analyte with The protein pores present in the membrane interact (eg, contact) such that the target analyte moves relative to (eg, passes through) the protein pore.
- the current flow through the protein pore as the target analyte moves relative to the protein pore is measured to determine the presence, absence or one or more characteristics of the target analyte (e.g., multinucleated). nucleotide sequence).
- the characterization methods of the embodiments preferably comprise contacting the polynucleotide with a polynucleotide binding protein such that the protein controls movement of the polynucleotide relative to the mutant/protein pore of the protein monomer, e.g. Mutant/protein pores.
- the method comprises (a) contacting the polynucleotide with a mutant/protein pore of a protein monomer and a polynucleotide binding protein such that the protein controls the relative relation of the polynucleotide to the mutant/pore of the protein monomer. Movement of the protein pore, e.g., through the mutant/protein pore of the protein monomer, and (b) obtaining one or more measurements as the polynucleotide moves relative to the mutant/protein pore of the protein monomer, wherein, Such measurements are indicative of one or more characteristics of the polynucleotide, thereby characterizing the polynucleotide.
- the method comprises (a) contacting the polynucleotide with a mutant/protein pore of a protein monomer and a polynucleotide binding protein such that the protein controls the relative relation of the polynucleotide to the mutant/pore of the protein monomer. Movement of the protein pore, e.g., through the mutant/protein pore of the protein monomer, and (b) measuring through the mutant/protein pore of the protein monomer as the polynucleotide moves relative to the mutant/protein pore of the protein monomer wherein the current is indicative of one or more characteristics of the polynucleotide, thereby characterizing the polynucleotide.
- a polynucleotide binding protein can be any protein capable of binding a polynucleotide and controlling its movement through a pore.
- a polynucleotide binding protein generally interacts with and modifies at least one property of a polynucleotide. Proteins can modify polynucleotides by cleaving them to form individual nucleotides or short chains of nucleotides (eg, dinucleotides or trinucleotides).
- a protein can modify a polynucleotide by orienting it or moving it to a specific position, ie, controlling its movement.
- the polynucleotide binding protein is preferably derived from a polynucleotide handling enzyme.
- a polynucleotide-handling enzyme is a polypeptide capable of interacting with and modifying at least one property of a polynucleotide.
- the enzymes can modify polynucleotides by cleaving them to form individual nucleotides or short chains of nucleotides (eg, dinucleotides or trinucleotides).
- the enzyme can modify the polynucleotide by orienting it or moving it to a specific position.
- a polynucleotide-handling enzyme need not exhibit enzymatic activity as long as it is capable of binding polynucleotides and controlling their movement through the pore.
- the enzyme may be modified to remove its enzymatic activity, or may be used under conditions that prevent its use as an enzyme.
- Polynucleotide-handling enzymes are preferably polymerases, exonucleases, helicases and topoisomerases, eg, gyrase.
- the enzyme is preferably a helicase, such as Hel308Mbu, Hel308Csy, Hel308Tga, Hel308Mhu, Tral Eco, XPD Mbu, Dda or variants thereof. Any helicase can be used in the embodiments.
- any number of helicases can be used. For example, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more helicases may be used. In some embodiments, different numbers of helicases can be used.
- the methods of the embodiments preferably comprise contacting the polynucleotide with two or more helicases.
- the two or more helicases are typically the same helicase.
- the two or more helicases may be different helicases.
- the two or more helicases may be any combination of the aforementioned helicases.
- the two or more helicases may be two or more Dda helicases.
- the two or more helicases may be one or more Dda helicases and one or more TrwC helicases.
- the two or more helicases may be different variants of the same helicase.
- the two or more helicases are preferably linked to each other.
- the two or more helicases are more preferably covalently linked to each other.
- the helicases can be linked in any order and using any method.
- the invention also provides a kit for characterizing a target analyte (eg, a target polynucleotide).
- a target analyte eg, a target polynucleotide
- the kit contains the well and membrane components of the examples.
- the film is preferably formed from components. Pores are preferably present in the membrane.
- the kit may comprise the components of any of the membranes disclosed above, such as amphiphilic or triblock copolymer membranes.
- the kit may further comprise a polynucleotide binding protein. Any of the polynucleotide binding proteins discussed above can be used.
- the membrane is an amphiphilic layer, a solid state layer, or a lipid bilayer.
- the kit may further comprise one or more anchors for coupling the polynucleotide to the membrane.
- the kit is preferably for characterizing double-stranded polynucleotides, and preferably comprises Y adapters and hairpin loop adapters.
- Y adapters preferably have one or more helicases attached, and hairpin loop adapters preferably have one or more molecular brakes attached.
- Y adapters preferably comprise one or more first anchors for coupling the polynucleotide to the membrane
- hairpin loop adapters preferably comprise one or more second anchors for coupling the polynucleotide to the membrane Anchor, and the strength of coupling of the hairpin loop adapter to the membrane is preferably greater than that of the Y adapter to the membrane.
- the kit may additionally comprise one or more other reagents or instruments that enable performance of any of the above-mentioned embodiments.
- reagents or instruments include one or more of the following: a suitable buffer (aqueous solution), a device for obtaining a sample from an individual (such as a container or instrument containing a needle), a device for amplifying and/or expressing a polynucleotide device, or voltage or patch clamp equipment.
- the reagents may be present in the kit in a dry form such that the fluid sample resuspends the reagents.
- the kit may also optionally contain instructions enabling use of the kit with the methods of the invention or details as to which organisms may use the methods.
- the invention also provides an apparatus for characterizing a target analyte (eg, a target polynucleotide).
- the device includes single or multiple protein monomer mutants/protein pores, and single or multiple membranes.
- a mutant/protein pore of said protein monomer is preferably present in said membrane.
- the number of pores and membranes is preferably equal. Preferably there is a single pore in each membrane.
- the apparatus preferably also includes instructions for implementing the methods of the embodiments.
- the device can be any conventional device for analyte analysis, eg, an array or a chip. Any of the embodiments discussed in connection with the method of an embodiment applies equally to the device.
- the device may also include any of the features present in the kits described herein.
- the equipment used in the embodiment can specifically be the gene sequencer QNome-9604 of Qitan Technology.
- the wild-type porin is from Pseudomonas cestodes, and the amino acid sequence of the wild-type porin is SEQ ID NO: 1, and the nucleotide sequence encoding the amino acid sequence is shown in SEQ ID NO: 2.
- Mutant 1 of the porin monomer is that the wild-type porin has a mutation at positions 69-76 corresponding to SEQ ID NO: 1, specifically KPTPASSF at positions 69-76 is replaced by RPSPASAQ.
- the protein pore of mutant 1 that includes a porin monomer is mutant pore 1.
- the amino acid sequence of mutant 1 of the protein monomer is shown in SEQ ID NO:24, and the nucleic acid sequence is shown in SEQ ID NO:25.
- the wild-type porin is from Pseudomonas cestodes, and the amino acid sequence of the wild-type porin is SEQ ID NO: 1, and the nucleotide sequence of the sequence encoding this amino acid is shown in SEQ ID NO: 2 .
- Mutant 2 of the porin monomer is that the wild-type porin has a mutation at positions 69-76 corresponding to SEQ ID NO: 1, specifically KPTPASSF at positions 69-76 is replaced by KPGPASTK.
- the protein pore of mutant 2 that includes a porin monomer is mutant pore 2.
- the amino acid sequence of mutant 2 of the protein monomer is shown in SEQ ID NO:26.
- the wild-type porin is from Pseudomonas cestodes, and the amino acid sequence of the wild-type porin is SEQ ID NO: 1, and the nucleotide sequence of the sequence encoding this amino acid is shown in SEQ ID NO: 2 .
- Mutant 3 of the porin monomer is that the wild-type porin has a mutation at positions 64-79 corresponding to SEQ ID NO: 1, specifically QTGQYKPTPASSFSTS at positions 64-79 is replaced by LTGQYRPSPASANSTA.
- the protein pore of mutant 3 that includes a porin monomer is mutant pore 3.
- the amino acid sequence of mutant 3 of the protein monomer is shown in SEQ ID NO:27.
- the wild-type porin is from Pseudomonas cestodes, and the amino acid sequence of the wild-type porin is SEQ ID NO: 1, and the nucleotide sequence of the sequence encoding this amino acid is shown in SEQ ID NO: 2 .
- Mutant 4 of the porin monomer is that the wild-type porin has a mutation at positions 64-79 corresponding to SEQ ID NO: 1, specifically QTGQYKPTPASSFSTS at positions 64-79 is replaced by LTGQYRPSPASALSTA.
- the protein pore of mutant 4 that includes a porin monomer is mutant pore 4.
- the amino acid sequence of mutant 4 of the protein monomer is shown in SEQ ID NO:28.
- the wild-type porin is from Pseudomonas cestodes, and the amino acid sequence of the wild-type porin is SEQ ID NO: 1, and the nucleotide sequence of the sequence encoding this amino acid is shown in SEQ ID NO: 2 .
- Mutant 5 of the porin monomer is a wild-type porin mutated at the following positions corresponding to SEQ ID NO: 1: R62S, D63V, Y68F, K69R, T71S, S75A, F76Q, and E104V.
- the protein pore of mutant 5 that includes a porin monomer is mutant pore 5.
- the amino acid sequence of mutant 5 of the protein monomer is shown in SEQ ID NO:29.
- the wild-type porin is from Pseudomonas cestodes, and the amino acid sequence of the wild-type porin is SEQ ID NO: 1, and the nucleotide sequence of the sequence encoding this amino acid is shown in SEQ ID NO: 2 .
- Mutant 6 of the porin monomer is that the wild-type porin has mutations at positions 68-76, 171, and 175 corresponding to SEQ ID NO: 1, specifically YKPTPASSF at positions 68-76 is replaced by FRPSPASAQ, and E at position 171 Replace D with N for N and 175 bits.
- the protein pore of mutant 6 that includes a porin monomer is mutant pore6.
- the amino acid sequence of mutant 6 of the protein monomer is shown in SEQ ID NO:30.
- FIG. 4A is a side view 400 of a predicted protein structure model, where a protein monomer 402 is shown in a darker color.
- FIG. 4B is a top view 404 of the ribbon structure model, in which the darker part shows a protein monomer 406 .
- FIG. 4C is a streamer structure model diagram 408 , and the darker part is the protein monomer 410 .
- Figure 5A and Figure 5B show the amino acid model diagram of mutant hole 1, where the plus sign "+" represents a water molecule.
- Figure 6 shows the size of each part of the mutant 1 of the porin monomer, and the diameter of the constriction zone in the middle of the mutants of the two porin monomers (i.e. two mutant porin monomers) 602 and 604 is the largest. followed by The minimum diameter is The head-to-tail distances between the two mutant porin monomers are and Mutant porin monomer full length The height of the mutant porin monomer head from the constriction zone channel is The height of the channel in the constriction zone is shown in Figure 6 as and
- Fig. 7 shows the monomeric amino acid model of porin mutant 1
- the enlarged display shows the amino acid composition of the constriction region structure, namely Gln76, Ser74 and Ser71.
- FIGs 10A and 10B show the results of the local refined Fourier shell correlation coefficient (FSC). It can be seen that the results of different regional resolutions of porin mutant 1 and the final cryo-EM single particle reconstruction resolution are Reconstruction resolution was determined based on the gold standard FSC 0.143 criterion and high-resolution noise surrogate.
- Figure 11 shows the 3D reconstruction of porin mutant 1 cryo-EM High resolution electron density maps.
- Figure 12 shows porin mutant 1 in The electron density map below. The map shows the channel eye-loop region and is overlaid on the final refined model.
- BS7-4C3-SE1 Two DNA constructs were prepared, BS7-4C3-SE1 and BS7-4C3-PLT.
- the structure of BS7-4C3-SE1 is shown in Figure 13, and the sequence information is as follows:
- C3, C18, dSpacer and iSpC3 are marker sequences introduced to indicate the resolution characteristics of well sequencing.
- the c rate-controlling protein in Fig. 13 and Fig. 14 is helicase Mph-MP1-E105C/A362C (has mutation E105C/A362C), and amino acid sequence is SEQ ID NO:22, and nucleotide sequence is SEQ ID NO: 23.
- Mutation hole 1 was used as a protein hole, and was detected by single-hole sequencing technology. After insertion of a single porin with amino acid sequence Mutant 1 into the phospholipid bilayer, buffer (625 mM KCl, 10 mM HEPES pH 8.0, 50 mM MgCl 2 ) was passed through the system to remove any excess Mutant 1 nanopores .
- Mutation hole 1 opens at a voltage of ⁇ 180mV.
- Fig. 15A shows the opening current and gating characteristics of mutant pore 1 at a voltage of ⁇ 180 mV.
- FIG. 15B shows the condition of single-stranded nucleic acid passing through mutation hole 1 at +180mV voltage. Nucleic acids can pass through the pores. After adding ssnucleic acid, the downward line shows the signal of the nucleic acid passing through the hole.
- the DNA construct BS7-4C3-PLT was sequenced through the mutant hole 1 by single-hole sequencing technology, and the nucleic acid sequencing signal generated by the sequencing system was added after the hole insertion was completed.
- Figures 16A and 16B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 1. According to the signal characteristics, mutation hole 1 can be used for nucleic acid sequencing.
- FIG. 17 is an enlarged result showing the current trace in the part of FIG. 16B .
- the portion indicated by the dotted arrow shows the enlarged result of the current trace.
- the enlarged display of this single signal area further proves that mutant hole 1 can be used for nucleic acid sequencing.
- Example 11 uses mutation hole 2 for empty testing and via testing.
- FIG. 18A shows the opening current and gating characteristics of mutant pore 2 at a voltage of ⁇ 180 mV.
- FIG. 18B shows the condition of single-stranded nucleic acid passing through mutant hole 2 at +180mV voltage. Nucleic acids can pass through the pores. After adding ssnucleic acid, the downward line shows the signal of the nucleic acid passing through the hole.
- FIGS 19A and 19B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 2.
- signal characteristics related characteristics such as sequencing resolution, stability, and signal consistency of mutant hole 2 can be obtained.
- the well has clear steps, obvious transition distribution, and high-precision sequencing capability. From the point of view of signal characteristics, the sequence signal consistency is high.
- Figure 20 shows a zoom-in of part of the current trace.
- the portion indicated by the dotted arrow shows the enlarged result of the current trace.
- the enlarged display of the region of this single signal shows that the mutant pore has high resolution for nucleic acid sequencing.
- Example 12 uses mutation holes 3 to perform empty testing and via testing.
- FIG. 21A shows the opening current and gating characteristics of mutant pore 3 at a voltage of ⁇ 180 mV.
- FIG. 21B shows the situation of single-stranded nucleic acid passing through mutation hole 3 at +180mV voltage. Nucleic acids can pass through the well. After adding ssnucleic acid, the downward line shows the signal of the nucleic acid passing through the hole.
- FIGS. 22A and 22B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 3. According to the signal characteristics, mutation hole 3 can be used for nucleic acid sequencing.
- FIG. 23 is an enlarged result of the current trace shown in the part of FIG. 22A.
- the portion indicated by the dotted arrow shows the enlarged result of the current trace.
- the enlarged display of the region of this single signal further proves that the mutant hole 3 can be used for nucleic acid sequencing.
- Example 13 uses mutation holes 4 to perform empty testing and via testing.
- FIG. 24A shows the opening current and gating characteristics of mutant pore 4 at a voltage of ⁇ 180 mV.
- FIG. 24B shows the situation of single-stranded nucleic acid passing through mutant hole 4 at +180mV voltage. Nucleic acids can pass through the pores.
- Figures 25A and 25B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 4. According to the signal characteristics, mutation hole 4 can be used for nucleic acid sequencing.
- Fig. 26 is an enlarged result showing the current trace of the embodiment part of Figs. 25A and 25B.
- the portion indicated by the dotted arrow shows the enlarged result of the current trace.
- the enlarged display of the region of this single signal further proves that the mutant hole 4 can be used for nucleic acid sequencing.
- the embodiment 14 uses the mutation hole 5 to perform the empty detection and via detection.
- Fig. 27 shows the opening current and gating characteristics of mutation hole 5 under the voltage of ⁇ 180mV.
- the DNA construct BS7-4C3-PLT was sequenced through the mutant hole 5 by using the single-hole sequencing technology, and the nucleic acid sequencing signal from the sequencing system was added after the hole insertion was completed.
- Figure 28 shows an exemplary current trace when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 5. According to the signal characteristics, the mutation hole 5 can be used for nucleic acid sequencing.
- Fig. 29 is an enlarged result of the current trace shown in the embodiment part of Fig. 28 .
- the portion indicated by the dotted arrow shows the enlarged result of the current trace.
- the region of this single signal is enlarged and displayed, which further proves that the mutation hole 5 can be used for nucleic acid sequencing.
- Example 15 uses the mutation hole 6 to perform empty testing and via testing.
- FIG. 30A shows the opening current and gating characteristics of mutant pore 6 at a voltage of ⁇ 180 mV.
- FIG. 30B shows the condition of single-stranded nucleic acid passing through mutation hole 6 at +180mV voltage. Nucleic acids can pass through the pores. After adding ssnucleic acid, the downward line shows the signal of the nucleic acid passing through the hole.
- FIGS 31A and 31B show exemplary current traces when the helicase Mph-MP1-E105C/A362C controls the translocation of the DNA construct BS7-4C3-PLT through mutant pore 6.
- signal characteristics related characteristics such as sequencing resolution, stability, and signal consistency of mutant hole 6 can be obtained.
- the well has clear steps, obvious transition distribution, and high-precision sequencing capability. From the point of view of signal characteristics, the sequence signal consistency is high.
- FIG. 32 is an enlarged result of the current trace shown in the part of FIG. 31A.
- the portion indicated by the dotted arrow shows the enlarged result of the current trace.
- the enlarged display of the region of this single signal shows that the mutant pore has high resolution for nucleic acid sequencing.
- FIG. 33 shows the protein purification results of mutant 1, and lanes 1-4 show the SDS-PAGE electrophoresis detection results of different fractions separated.
- SEC mutant 1 protein size exclusion chromatography
Landscapes
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Biophysics (AREA)
- Biochemistry (AREA)
- Genetics & Genomics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Engineering & Computer Science (AREA)
- Wood Science & Technology (AREA)
- Zoology (AREA)
- Gastroenterology & Hepatology (AREA)
- Medicinal Chemistry (AREA)
- Physics & Mathematics (AREA)
- Analytical Chemistry (AREA)
- Biotechnology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Peptides Or Proteins (AREA)
Abstract
Description
侧链 | 亲水性 |
Ile,I | 4.5 |
Val,V | 4.2 |
Leu,L | 3.8 |
Phe,F | 2.8 |
Cys,C | 2.5 |
Met,M | 1.9 |
Ala,A | 1.8 |
Gly,G | -0.4 |
Thr,T | -0.7 |
Ser,S | -0.8 |
Trp,W | -0.9 |
Tyr,Y | -1.3 |
Pro,P | -1.6 |
His,H | -3.2 |
Glu,E | -3.5 |
Gln,Q | -3.5 |
Asp,D | -3.5 |
Asn,N | -3.5 |
Lys,K | -3.9 |
Arg,R | -4.5 |
Claims (21)
- 一种孔蛋白单体的突变体,其中所述孔蛋白单体的突变体的氨基酸包括SEQ ID NO:1所示的序列或与其具有至少99%、98%、97%、96%、95%、90%、80%、70%、60%或50%同一性的序列,并且所述孔蛋白单体的突变体的氨基酸包括在对应SEQ ID NO:1的T71、S75、F76的一个或多个位置处的突变。
- 根据权利要求1所述的孔蛋白单体的突变体,所述孔蛋白单体的突变体的氨基酸包括在对应SEQ ID NO:1的62-175、62-104、68-175、64-79、71-76、或69-76的一个或多个位置处的突变。
- 根据权利要求1或2所述的孔蛋白单体的突变体,所述孔蛋白单体的突变体的氨基酸包括:(1)对应SEQ ID NO:1的K69、P70、T71、P72、A73、S74、S75、和F76的一个或多个位置处具有氨基酸的插入、缺失和/或替换;(2)对应SEQ ID NO:1的Q64、T65、G66、Q67、Y68、K69、P70、T71、P72、A73、S74、S75、F76、S77、T78、和S79的一个或多个位置处具有氨基酸的插入、缺失和/或替换;(3)对应SEQ ID NO:1的R62、D63、Y68、K69、T71、S75、F76、和E104的一个或多个位置处具有氨基酸的插入、缺失和/或替换;或者(4)对应SEQ ID NO:1的Y68、K69、P70、T71、P72、A73、S74、S75、F76、E171和D175的一个或多个位置处具有氨基酸的插入、缺失和/或替换。
- 根据前述权利要求任一项所述的孔蛋白单体的突变体,其中所述SEQ ID NO:1所示的序列来源于绦虫假单胞菌。
- 根据前述权利要求任一项所述的孔蛋白单体的突变体,其中所述孔蛋白单体的突变体的氨基酸突变选自以下:(a)对应SEQ ID NO:1的69-76位的氨基酸KPTPASSF突变为 M 1M 2M 3M 4M 5M 6M 7M 8,其中M 1选自R、K、H中的0至3种;M 2选自P中的0至1种;M 3选自S、G、C、U、T、M、A、V、L、I中的0至10种;M 4选自P中的0至1种;M 5选自A、G、V、L、I中的0至5种;M 6选自S、C、U、T、M中的0至5种;M 7选自A、T、G、V、L、I、S、C、U、M中的0至10种;M 8选自Q、D、E、N、K、H、R中的0至7种;(b)对应SEQ ID NO:1的64-79位的氨基酸QTGQYKPTPASSFSTS突变为M 9M 10M 11M 12M 13M 14M 15M 16M 17M 18M 19M 20M 21M 22M 23M 24,其中M 9选自L、G、A、V、I中的0至5种;M 10选自T、S、C、U、M中的0至5种;M 11选自G、A、V、L、I中的0至5种;M 12选自Q、D、E、N中的0至4种;M 13选自Y、F、W中的0至3种;M 14选自R、H、K中的0至3种;M 15选自P中的0至1种;M 16选自S、C、U、T、M中的0至5种;M 17选自P中的0至1种;M 18选自A、G、V、L、I中的0至5种;M 19选自S、C、U、T、M中的0至5种;M 20选自A、G、V、L、I中的0至5种;M 21选自N、D、E、Q、L、G、A、V、I中的0至9种;M 22选自S、C、U、T、M中的0至5种;M 23选自T、S、C、U、M中的0至5种;M 24选自A、G、V、L、I中的0至5种;(c)对应SEQ ID NO:1的62位突变为S、C、U、T、M中的0至5种;63位突变为V、G、A、L、I中的0至5种;68位突变为F、W中的0至2种;69位突变为R、H中的0至2种;71位突变为S、C、U、M中的0至4种;75位突变为A、G、V、L、I中的0至5种;76位突变为Q、D、E、N中的0至4种;104位突变为V、G、A、L、I中的0至5种;和(d)对应SEQ ID NO:1的68-76位的氨基酸YKPTPASSF突变为M 25M 26M 27M 28M 29M 30M 31M 32M 33,并且171位突变为N、D、Q中的0至3种;175位突变为N、E、Q中的0至3种,其中M 25选自F、Y、W中的0至3种;M 26选自R、H、K中的0至3种;M 27选自P中的0至1种;M 28选自S、C、U、T、M中的0至5种;M 29选自P中的0至1种;M 30选自A、G、V、L、I中的0至5种;M 31选自S、C、U、 T、M中的0至5种;M 32选自A、G、V、L、I中的0至5种;M 33选自Q、D、E、N中的0至4种。
- 根据前述权利要求任一项所述的孔蛋白单体的突变体,其中所述孔蛋白单体的突变体的氨基酸突变选自以下:(a)对应SEQ ID NO:1的69-76位的氨基酸KPTPASSF突变为RPSPASAQ;(b)对应SEQ ID NO:1的69-76位的氨基酸KPTPASSF突变为KPGPASTK;(c)对应SEQ ID NO:1的64-79位的氨基酸QTGQYKPTPASSFSTS突变为LTGQYRPSPASANSTA;(d)对应SEQ ID NO:1的64-79位的氨基酸QTGQYKPTPASSFSTS突变为LTGQYRPSPASALSTA;(e)对应SEQ ID NO:1的R62S、D63V、Y68F、K69R、T71S、S75A、F76Q、和E104V;和(f)对应SEQ ID NO:1的68-76位的氨基酸YKPTPASSF突变为FRPSPASAQ,以及对应SEQ ID NO:1的E171N和D175N。
- 根据前述权利要求任一项所述的孔蛋白单体的突变体,其中所述孔蛋白单体的突变体的氨基酸序列包括SEQ ID NO:24、SEQ ID NO:26、SEQ ID NO:27、SEQ ID NO:28、SEQ ID NO:29或SEQ ID NO:30,或由其组成。
- 一种蛋白孔,包括至少一个前述权利要求任一项所述的孔蛋白单体的突变体。
- 根据权利要求8所述的蛋白孔,其中所述蛋白孔包括至少两个所述的孔蛋白单体的突变体。
- 一种用于表征靶分析物的复合物,其特征在于:包括权利要求8-10任一项所述的蛋白孔及与其结合的控速蛋白。
- 一种核酸,其编码权利要求1-7任一项所述的孔蛋白单体的突变体、权利要求8-10任一项所述的蛋白孔、或权利要求11所述的复合物。
- 根据权利要求12所述的核酸,其中孔蛋白单体的核苷酸序列为SEQ ID NO:2所示的序列。
- 包含权利要求12-13任一项所述的核酸的载体或遗传工程化的宿主细胞。
- 权利要求1-7任一项所述的孔蛋白单体的突变体、权利要求8-10任一项所述的蛋白孔、权利要求11所述复合物、权利要求12-13任一所述核酸、或权利要求14所述载体或宿主细胞、在检测靶分析物存在、不存在或一个或多个特征或制备检测靶分析物存在、不存在或一个或多个特征的产品中的应用。
- 一种产生蛋白孔或其多肽的方法,包括用包含权利要求14所述的载体转化权利要求14所述的宿主细胞,诱导所述宿主细胞表达权利要求8-10任一所述的蛋白孔或其多肽。
- 一种用于确定靶分析物存在、不存在或一个或多个特征的方法,包括:a.使靶分析物与权利要求8-10任一项所述的蛋白孔、权利要求11所述复合物、或权利要求11所述复合物中的所述蛋白孔接触,使得 所述靶分析物相对于所述蛋白孔移动;以及b.在所述靶分析物相对于所述蛋白孔移动时获取一个或多个测量值,从而确定所述靶分析物的存在、不存在或一个或多个特征。
- 根据权利要求17所述的方法,其中所述方法包括:所述靶分析物与存在于膜中的所述蛋白孔相互作用从而使得所述靶分析物相对所述蛋白孔移动。
- 一种用于确定靶分析物存在、不存在或一个或多个特征的试剂盒,包括权利要求1-7任一项所述的孔蛋白单体的突变体、权利要求8-10任一项所述的蛋白孔、权利要求11所述的复合物、权利要求12-13任一项所述的核酸、或权利要求14所述的载体或宿主,和权利要求18所述的膜的组分。
- 一种用于确定靶分析物存在、不存在或一个或多个特征的装置,包括权利要求8-10任一项所述的蛋白孔或权利要求11所述复合物,和权利要求18所述的膜。
- 根据权利要求15-20中任一所述的应用、方法、试剂盒或装置,其中所述靶分析物包括多糖、金属离子、无机盐、聚合物、氨基酸、肽、蛋白、核苷酸、寡核苷酸、多核苷酸、染料、药物、诊断剂、爆炸物或环境污染物;优选地,所述靶分析物包括多核苷酸,更优选地,所述多核苷酸包括DNA或RNA;和/或,所述一个或多个特征选自(i)所述多核苷酸的长度;(ii)所述多核苷酸的一致性;(iii)所述多核苷酸的序列;(iv)所述多核苷酸的二级结构和(v)所述多核苷酸是否经修饰;和/或,所述复合物中所述控速蛋白包括多核苷酸结合蛋白。
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18/682,843 US20240352516A1 (en) | 2021-08-18 | 2021-08-18 | Mutant of pore protein monomer, protein pore, and use thereof |
EP21953717.2A EP4371997A1 (en) | 2021-08-18 | 2021-08-18 | Mutant of pore protein monomer, protein pore, and use thereof |
PCT/CN2021/113270 WO2023019470A1 (zh) | 2021-08-18 | 2021-08-18 | 孔蛋白单体的突变体、蛋白孔及其应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/CN2021/113270 WO2023019470A1 (zh) | 2021-08-18 | 2021-08-18 | 孔蛋白单体的突变体、蛋白孔及其应用 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2023019470A1 true WO2023019470A1 (zh) | 2023-02-23 |
Family
ID=85239951
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/113270 WO2023019470A1 (zh) | 2021-08-18 | 2021-08-18 | 孔蛋白单体的突变体、蛋白孔及其应用 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20240352516A1 (zh) |
EP (1) | EP4371997A1 (zh) |
WO (1) | WO2023019470A1 (zh) |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000028312A1 (en) | 1998-11-06 | 2000-05-18 | The Regents Of The University Of California | A miniature support for thin films containing single channels or nanopores and methods for using same |
WO2008102120A1 (en) | 2007-02-20 | 2008-08-28 | Oxford Nanopore Technologies Limited | Lipid bilayer sensor system |
WO2009077734A2 (en) | 2007-12-19 | 2009-06-25 | Oxford Nanopore Technologies Limited | Formation of layers of amphiphilic molecules |
WO2011067559A1 (en) | 2009-12-01 | 2011-06-09 | Oxford Nanopore Technologies Limited | Biochemical analysis instrument |
CN107207570A (zh) * | 2014-09-01 | 2017-09-26 | 弗拉芒区生物技术研究所 | 突变csgg孔 |
CN108779170A (zh) * | 2016-03-02 | 2018-11-09 | 牛津纳米孔技术公司 | 突变孔 |
CN110621692A (zh) * | 2017-05-04 | 2019-12-27 | 牛津纳米孔技术公司 | 由两个CsgG孔组成的跨膜孔 |
CN110914290A (zh) * | 2017-06-30 | 2020-03-24 | 弗拉芒区生物技术研究所 | 新颖蛋白孔 |
-
2021
- 2021-08-18 EP EP21953717.2A patent/EP4371997A1/en active Pending
- 2021-08-18 WO PCT/CN2021/113270 patent/WO2023019470A1/zh active Application Filing
- 2021-08-18 US US18/682,843 patent/US20240352516A1/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2000028312A1 (en) | 1998-11-06 | 2000-05-18 | The Regents Of The University Of California | A miniature support for thin films containing single channels or nanopores and methods for using same |
WO2008102120A1 (en) | 2007-02-20 | 2008-08-28 | Oxford Nanopore Technologies Limited | Lipid bilayer sensor system |
WO2009077734A2 (en) | 2007-12-19 | 2009-06-25 | Oxford Nanopore Technologies Limited | Formation of layers of amphiphilic molecules |
WO2011067559A1 (en) | 2009-12-01 | 2011-06-09 | Oxford Nanopore Technologies Limited | Biochemical analysis instrument |
CN107207570A (zh) * | 2014-09-01 | 2017-09-26 | 弗拉芒区生物技术研究所 | 突变csgg孔 |
CN108779170A (zh) * | 2016-03-02 | 2018-11-09 | 牛津纳米孔技术公司 | 突变孔 |
CN110621692A (zh) * | 2017-05-04 | 2019-12-27 | 牛津纳米孔技术公司 | 由两个CsgG孔组成的跨膜孔 |
CN110914290A (zh) * | 2017-06-30 | 2020-03-24 | 弗拉芒区生物技术研究所 | 新颖蛋白孔 |
Non-Patent Citations (9)
Title |
---|
HOLDEN ET AL., J AM CHEM SOC., vol. 129, no. 27, 11 July 2007 (2007-07-11), pages 8650 - 5 |
IVANOV AP ET AL., NANO LETT., vol. 11, 12 January 2011 (2011-01-12), pages 279 - 85 |
LEHNINGER, A. L.: "Biochemistry", 1975, WORTH PUBLISHERS, pages: 71 - 92 |
LIEBERMAN KR ET AL., J AM CHEM SOC., vol. 132, no. 50, 2010, pages 17961 - 72 |
ROBERTSVELLACCIO: "The Peptides: Analysis, Synthesis, Biology", vol. 5, 1983, ACADEMIC PRESS, INC, pages: 341 |
SAMBROOK, J.RUSSELL, D.: "Molecular Cloning A Laboratory Manual", 2001, COLD SPRING HARBOR LABORATORY PRESS |
SONI GV ET AL., REV SCI INSTRUM., vol. 81, no. 1, January 2010 (2010-01-01), pages 014301 |
STODDART D ET AL., PROC NATL ACAD SCI, vol. 106, no. 19, pages 7702 - 7 |
WANG SHAOYING, ZHAO ZHENGYI, HAQUE FARZIN, GUO PEIXUAN: "Engineering of protein nanopores for sequencing, chemical or protein sensing and disease diagnosis", CURRENT OPINION IN BIOTECHNOLOGY, vol. 51, 1 June 2018 (2018-06-01), GB , pages 80 - 89, XP055960007, ISSN: 0958-1669, DOI: 10.1016/j.copbio.2017.11.006 * |
Also Published As
Publication number | Publication date |
---|---|
US20240352516A1 (en) | 2024-10-24 |
EP4371997A1 (en) | 2024-05-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113480620B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
CN113754743B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
CN113912683B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
CN113896776B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
CN113773373B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
CN113735948B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
US11739377B2 (en) | Method of improving the movement of a target polynucleotide with respect to a transmembrane pore | |
EP3619224B1 (en) | Transmembrane pore consisting of two csgg pores | |
JP6776366B2 (ja) | 変異体ポア | |
CN114957412B (zh) | 一种新型孔蛋白单体及其应用 | |
CN113651876B (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
CN104126018B (zh) | 酶方法 | |
CN107002151B (zh) | 向跨膜孔输送分析物的方法 | |
CN105705656B (zh) | 方法 | |
CN109072295A (zh) | 修饰的纳米孔,包含其的组合物及其应用 | |
CN106414773A (zh) | 利用分子孔表征目标分子的方法 | |
KR20140125874A (ko) | 폴리머의 측정의 분석 | |
WO2023019470A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023060418A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023060420A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023060422A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023050031A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023060419A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023060421A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 | |
WO2023019471A1 (zh) | 孔蛋白单体的突变体、蛋白孔及其应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21953717 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2021953717 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 18682843 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 2021953717 Country of ref document: EP Effective date: 20240207 |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112024003113 Country of ref document: BR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11202400554V Country of ref document: SG |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01E Ref document number: 112024003113 Country of ref document: BR |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01Y Ref document number: 112024003113 Country of ref document: BR Kind code of ref document: A2 Free format text: ANULADA A PUBLICACAO CODIGO 1.5 NA RPI NO 2786 DE 28/05/2024 POR TER SIDO INDEVIDA. |
|
ENP | Entry into the national phase |
Ref document number: 112024003113 Country of ref document: BR Kind code of ref document: A2 Effective date: 20240216 |