WO2003020968A2 - Procede d'analyse de sequences d'acides nucleiques et de l'expression de genes - Google Patents
Procede d'analyse de sequences d'acides nucleiques et de l'expression de genes Download PDFInfo
- Publication number
- WO2003020968A2 WO2003020968A2 PCT/EP2002/009614 EP0209614W WO03020968A2 WO 2003020968 A2 WO2003020968 A2 WO 2003020968A2 EP 0209614 W EP0209614 W EP 0209614W WO 03020968 A2 WO03020968 A2 WO 03020968A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- nts
- primer
- sequence
- nskfs
- primers
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 170
- 150000007523 nucleic acids Chemical group 0.000 title claims abstract description 82
- 230000014509 gene expression Effects 0.000 title claims abstract description 40
- 125000003729 nucleotide group Chemical group 0.000 claims abstract description 238
- 239000002773 nucleotide Substances 0.000 claims abstract description 216
- 238000006243 chemical reaction Methods 0.000 claims abstract description 172
- 238000001514 detection method Methods 0.000 claims abstract description 72
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 21
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 21
- 239000013615 primer Substances 0.000 claims description 351
- 108090000623 proteins and genes Proteins 0.000 claims description 199
- 230000027455 binding Effects 0.000 claims description 93
- 238000004458 analytical method Methods 0.000 claims description 74
- 239000007850 fluorescent dye Substances 0.000 claims description 63
- 230000000295 complement effect Effects 0.000 claims description 59
- 238000009396 hybridization Methods 0.000 claims description 59
- 239000012634 fragment Substances 0.000 claims description 54
- 239000000499 gel Substances 0.000 claims description 38
- 125000001424 substituent group Chemical group 0.000 claims description 36
- 238000005406 washing Methods 0.000 claims description 25
- 239000000975 dye Substances 0.000 claims description 23
- 125000004122 cyclic group Chemical group 0.000 claims description 19
- 230000005526 G1 to G0 transition Effects 0.000 claims description 15
- 238000012300 Sequence Analysis Methods 0.000 claims description 14
- 238000000137 annealing Methods 0.000 claims description 14
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 claims description 13
- 238000010348 incorporation Methods 0.000 claims description 13
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 claims description 12
- 230000000694 effects Effects 0.000 claims description 12
- 230000036961 partial effect Effects 0.000 claims description 12
- 102000004169 proteins and genes Human genes 0.000 claims description 11
- 239000007787 solid Substances 0.000 claims description 10
- 102100031780 Endonuclease Human genes 0.000 claims description 9
- 229920002401 polyacrylamide Polymers 0.000 claims description 9
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 claims description 8
- 239000011521 glass Substances 0.000 claims description 8
- 108060002716 Exonuclease Proteins 0.000 claims description 7
- 102000013165 exonuclease Human genes 0.000 claims description 7
- 239000004033 plastic Substances 0.000 claims description 7
- 229920003023 plastic Polymers 0.000 claims description 7
- 229920000936 Agarose Polymers 0.000 claims description 6
- 239000000919 ceramic Substances 0.000 claims description 6
- 229920001296 polysiloxane Polymers 0.000 claims description 6
- 239000004793 Polystyrene Substances 0.000 claims description 5
- 239000004417 polycarbonate Substances 0.000 claims description 5
- 229920000515 polycarbonate Polymers 0.000 claims description 5
- 229920002223 polystyrene Polymers 0.000 claims description 5
- 239000003153 chemical reaction reagent Substances 0.000 claims description 4
- 239000003446 ligand Substances 0.000 claims description 4
- 108010042407 Endonucleases Proteins 0.000 claims description 3
- 239000011543 agarose gel Substances 0.000 claims description 3
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims description 3
- 150000004032 porphyrins Chemical class 0.000 claims description 3
- ANRHNWWPFJCPAZ-UHFFFAOYSA-M thionine Chemical compound [Cl-].C1=CC(N)=CC2=[S+]C3=CC(N)=CC=C3N=C21 ANRHNWWPFJCPAZ-UHFFFAOYSA-M 0.000 claims description 3
- 238000000492 total internal reflection fluorescence microscopy Methods 0.000 claims description 3
- 239000003155 DNA primer Substances 0.000 claims description 2
- 241000206602 Eukaryota Species 0.000 claims description 2
- 150000003732 xanthenes Chemical class 0.000 claims 2
- 108020005202 Viral DNA Proteins 0.000 claims 1
- 230000001580 bacterial effect Effects 0.000 claims 1
- 238000001317 epifluorescence microscopy Methods 0.000 claims 1
- 238000000166 fluorescence laser scanning microscopy Methods 0.000 claims 1
- 238000012163 sequencing technique Methods 0.000 description 102
- 108020004414 DNA Proteins 0.000 description 54
- 108020004999 messenger RNA Proteins 0.000 description 39
- 239000000243 solution Substances 0.000 description 36
- 239000002299 complementary DNA Substances 0.000 description 31
- 239000000203 mixture Substances 0.000 description 29
- 239000000463 material Substances 0.000 description 25
- 210000004027 cell Anatomy 0.000 description 24
- 230000008569 process Effects 0.000 description 24
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical group N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 18
- 238000002360 preparation method Methods 0.000 description 17
- 238000003786 synthesis reaction Methods 0.000 description 17
- 230000015572 biosynthetic process Effects 0.000 description 16
- 108091034117 Oligonucleotide Proteins 0.000 description 15
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 13
- 125000000524 functional group Chemical group 0.000 description 11
- 238000010369 molecular cloning Methods 0.000 description 11
- 239000000523 sample Substances 0.000 description 11
- 238000003776 cleavage reaction Methods 0.000 description 10
- 230000035772 mutation Effects 0.000 description 10
- 239000002777 nucleoside Substances 0.000 description 10
- -1 nucleoside triphosphates Chemical class 0.000 description 10
- 238000000018 DNA microarray Methods 0.000 description 9
- 108091028043 Nucleic acid sequence Proteins 0.000 description 9
- 229960002685 biotin Drugs 0.000 description 9
- 235000020958 biotin Nutrition 0.000 description 9
- 239000011616 biotin Substances 0.000 description 9
- 238000004422 calculation algorithm Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 102000053602 DNA Human genes 0.000 description 8
- 238000001712 DNA sequencing Methods 0.000 description 8
- 241000282414 Homo sapiens Species 0.000 description 8
- 239000002585 base Substances 0.000 description 8
- 238000009434 installation Methods 0.000 description 8
- 239000003550 marker Substances 0.000 description 8
- 230000007017 scission Effects 0.000 description 8
- 108010017826 DNA Polymerase I Proteins 0.000 description 7
- 102000004594 DNA Polymerase I Human genes 0.000 description 7
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 150000002148 esters Chemical class 0.000 description 7
- 230000003252 repetitive effect Effects 0.000 description 7
- 230000000692 anti-sense effect Effects 0.000 description 6
- 238000000799 fluorescence microscopy Methods 0.000 description 6
- 238000002493 microarray Methods 0.000 description 6
- 238000012986 modification Methods 0.000 description 6
- 230000004048 modification Effects 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 6
- 238000011160 research Methods 0.000 description 6
- 239000007790 solid phase Substances 0.000 description 6
- 239000007858 starting material Substances 0.000 description 6
- 210000001519 tissue Anatomy 0.000 description 6
- 108020004635 Complementary DNA Proteins 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 238000013467 fragmentation Methods 0.000 description 5
- 238000006062 fragmentation reaction Methods 0.000 description 5
- 238000000386 microscopy Methods 0.000 description 5
- 230000009871 nonspecific binding Effects 0.000 description 5
- 102000054765 polymorphisms of proteins Human genes 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 108700039887 Essential Genes Proteins 0.000 description 4
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 4
- 229910052782 aluminium Inorganic materials 0.000 description 4
- XAGFODPZIPBFFR-UHFFFAOYSA-N aluminium Chemical compound [Al] XAGFODPZIPBFFR-UHFFFAOYSA-N 0.000 description 4
- 239000011324 bead Substances 0.000 description 4
- 230000000903 blocking effect Effects 0.000 description 4
- 150000001875 compounds Chemical class 0.000 description 4
- 238000001218 confocal laser scanning microscopy Methods 0.000 description 4
- 238000004624 confocal microscopy Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 238000006911 enzymatic reaction Methods 0.000 description 4
- 238000010195 expression analysis Methods 0.000 description 4
- PCHJSUWPFVWCPO-UHFFFAOYSA-N gold Chemical compound [Au] PCHJSUWPFVWCPO-UHFFFAOYSA-N 0.000 description 4
- 229910052737 gold Inorganic materials 0.000 description 4
- 239000010931 gold Substances 0.000 description 4
- 229910052751 metal Inorganic materials 0.000 description 4
- 239000002184 metal Substances 0.000 description 4
- 238000001634 microspectroscopy Methods 0.000 description 4
- 239000003068 molecular probe Substances 0.000 description 4
- 125000003835 nucleoside group Chemical group 0.000 description 4
- 238000000399 optical microscopy Methods 0.000 description 4
- 239000002987 primer (paints) Substances 0.000 description 4
- 125000006239 protecting group Chemical group 0.000 description 4
- 229910052709 silver Inorganic materials 0.000 description 4
- 239000004332 silver Substances 0.000 description 4
- 238000001228 spectrum Methods 0.000 description 4
- 108090001008 Avidin Proteins 0.000 description 3
- 108010090804 Streptavidin Proteins 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000010276 construction Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000001035 drying Methods 0.000 description 3
- 230000005284 excitation Effects 0.000 description 3
- 230000002068 genetic effect Effects 0.000 description 3
- 238000011534 incubation Methods 0.000 description 3
- 238000002515 oligonucleotide synthesis Methods 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 230000037452 priming Effects 0.000 description 3
- 238000004445 quantitative analysis Methods 0.000 description 3
- 241000894007 species Species 0.000 description 3
- 235000011178 triphosphate Nutrition 0.000 description 3
- 239000001226 triphosphate Substances 0.000 description 3
- 238000002604 ultrasonography Methods 0.000 description 3
- 229910052727 yttrium Inorganic materials 0.000 description 3
- PXFBZOLANLWPMH-UHFFFAOYSA-N 16-Epiaffinine Natural products C1C(C2=CC=CC=C2N2)=C2C(=O)CC2C(=CC)CN(C)C1C2CO PXFBZOLANLWPMH-UHFFFAOYSA-N 0.000 description 2
- ASJSAQIRZKANQN-CRCLSJGQSA-N 2-deoxy-D-ribose Chemical compound OC[C@@H](O)[C@@H](O)CC=O ASJSAQIRZKANQN-CRCLSJGQSA-N 0.000 description 2
- 102100027211 Albumin Human genes 0.000 description 2
- 108010088751 Albumins Proteins 0.000 description 2
- 108050001427 Avidin/streptavidin Proteins 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 108091060211 Expressed sequence tag Proteins 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 2
- 125000002777 acetyl group Chemical class [H]C([H])([H])C(*)=O 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 210000004102 animal cell Anatomy 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 2
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 2
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 2
- 150000002019 disulfides Chemical class 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000007654 immersion Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- QSHDDOUJBYECFT-UHFFFAOYSA-N mercury Chemical compound [Hg] QSHDDOUJBYECFT-UHFFFAOYSA-N 0.000 description 2
- 230000002503 metabolic effect Effects 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 238000003909 pattern recognition Methods 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- 238000010839 reverse transcription Methods 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 230000002441 reversible effect Effects 0.000 description 2
- 238000004621 scanning probe microscopy Methods 0.000 description 2
- 239000010703 silicon Substances 0.000 description 2
- 229910052710 silicon Inorganic materials 0.000 description 2
- 238000001179 sorption measurement Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- 150000007970 thio esters Chemical class 0.000 description 2
- MBXADFLSFSJJOZ-SNAWJCMRSA-N 2-[1-[(4-ethoxyphenyl)methyl]-4-[(e)-3-(4-methoxyphenyl)prop-2-enyl]piperazin-2-yl]ethanol Chemical compound C1=CC(OCC)=CC=C1CN1C(CCO)CN(C\C=C\C=2C=CC(OC)=CC=2)CC1 MBXADFLSFSJJOZ-SNAWJCMRSA-N 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- OIRDTQYFTABQOQ-KQYNXXCUSA-N Adenosine Natural products C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](CO)[C@@H](O)[C@H]1O OIRDTQYFTABQOQ-KQYNXXCUSA-N 0.000 description 1
- 108020004491 Antisense DNA Proteins 0.000 description 1
- 239000002126 C01EB10 - Adenosine Substances 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 102100033215 DNA nucleotidylexotransferase Human genes 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 102000004190 Enzymes Human genes 0.000 description 1
- 108090000790 Enzymes Proteins 0.000 description 1
- 108090000371 Esterases Proteins 0.000 description 1
- 108010078851 HIV Reverse Transcriptase Proteins 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 102000003960 Ligases Human genes 0.000 description 1
- 108090000364 Ligases Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 238000000636 Northern blotting Methods 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-M Thiocyanate anion Chemical compound [S-]C#N ZMZDMBWJUHKJPS-UHFFFAOYSA-M 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- 108010033576 Transferrin Receptors Proteins 0.000 description 1
- 229910052770 Uranium Inorganic materials 0.000 description 1
- 229960005305 adenosine Drugs 0.000 description 1
- 239000003513 alkali Substances 0.000 description 1
- 239000003816 antisense DNA Substances 0.000 description 1
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 238000005842 biochemical reaction Methods 0.000 description 1
- 230000008033 biological extinction Effects 0.000 description 1
- 239000013060 biological fluid Substances 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 150000001615 biotins Chemical class 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 238000010804 cDNA synthesis Methods 0.000 description 1
- 150000001735 carboxylic acids Chemical class 0.000 description 1
- 230000032823 cell division Effects 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 239000003638 chemical reducing agent Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical group C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000001493 electron microscopy Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000000295 emission spectrum Methods 0.000 description 1
- DNJIEGIFACGWOD-UHFFFAOYSA-N ethyl mercaptane Natural products CCS DNJIEGIFACGWOD-UHFFFAOYSA-N 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000004992 fission Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- RQFCJASXJCIDSX-UUOKFMHZSA-N guanosine 5'-monophosphate Chemical class C1=2NC(N)=NC(=O)C=2N=CN1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O RQFCJASXJCIDSX-UUOKFMHZSA-N 0.000 description 1
- 235000013928 guanylic acid Nutrition 0.000 description 1
- 238000003505 heat denaturation Methods 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-N hydrogen thiocyanate Natural products SC#N ZMZDMBWJUHKJPS-UHFFFAOYSA-N 0.000 description 1
- 238000003709 image segmentation Methods 0.000 description 1
- 230000003100 immobilizing effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 230000002427 irreversible effect Effects 0.000 description 1
- 239000007788 liquid Substances 0.000 description 1
- 238000010841 mRNA extraction Methods 0.000 description 1
- 238000012067 mathematical method Methods 0.000 description 1
- 102000006240 membrane receptors Human genes 0.000 description 1
- 108020004084 membrane receptors Proteins 0.000 description 1
- 239000002207 metabolite Substances 0.000 description 1
- 239000013612 plasmid Substances 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 239000011541 reaction mixture Substances 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 125000000548 ribosyl group Chemical group C1([C@H](O)[C@H](O)[C@H](O1)CO)* 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000004904 shortening Methods 0.000 description 1
- 238000004557 single molecule detection Methods 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- ABZLKHKQJHEPAX-UHFFFAOYSA-N tetramethylrhodamine Chemical compound C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=CC=C1C([O-])=O ABZLKHKQJHEPAX-UHFFFAOYSA-N 0.000 description 1
- JGVWCANSWKRBCS-UHFFFAOYSA-N tetramethylrhodamine thiocyanate Chemical compound [Cl-].C=12C=CC(N(C)C)=CC2=[O+]C2=CC(N(C)C)=CC=C2C=1C1=CC=C(SC#N)C=C1C(O)=O JGVWCANSWKRBCS-UHFFFAOYSA-N 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 210000004881 tumor cell Anatomy 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
- 125000001834 xanthenyl group Chemical class C1=CC=CC=2OC3=CC=CC=C3C(C12)* 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
Definitions
- the invention relates to a method for analyzing nucleic acid chains and gene expression.
- the basis of the method is the detection of fluorescence signals of individual nucleotide molecules labeled with dyes, which are incorporated into a growing nucleic acid chain by a polymerase.
- the reaction takes place on a flat surface. Many individual nucleic acid molecules are bound to this surface. All of these nucleic acid molecules are exposed to the same conditions, so that a build-up reaction can take place simultaneously on all nucleic acid molecules.
- the process essentially comprises the following steps:
- NSKFs nucleic acid chain fragments
- the cycle is repeated several times.
- DNA - deoxyribonucleic acid of different origins and different lengths (genomic DNA, cDNA, ssDNA, dsDNA)
- RNA - ribonucleic acid (mostly mRNA)
- Polymerases - enzymes that can incorporate complementary nucleotides into a growing DNA or RNA strand e.g. DNA polymerases, reverse transcriptases, RNA polymerases
- dNTP - 2 deoxy nucleoside triphosphates, substrates for DNA polymerases and reverse transcriptases
- NT - natural nucleotide usually dNTP, unless expressly stated otherwise.
- NT is also used when specifying the length of a nucleic acid sequence, e.g. NT 1,000.
- NT stands for nucleoside monophosphates.
- NT * - modified nucleotide, usually dNTP, unless expressly stated otherwise.
- NTs * means: modified nucleotides
- NSK - nucleic acid chain DNA or RNA in their original length
- NSKFs Nucleic acid chain fragments.
- the sum of the NSKFs is equivalent to the overall sequence.
- the NSKFs can, for example, be fragments of the entire DNA or RNA sequence that arise after a fragmentation step.
- Total sequence the sequence or sequences used in the sequencing reaction, mostly converted into NSKFs. It can originally consist of one or more NSKs. The total sequence can be parts or equivalents
- Represent 0 of another sequence or of sequence populations e.g. mRNA, cDNA, plasmid DNA with insert, BAC, YAC
- Represent 0 of another sequence or of sequence populations e.g. mRNA, cDNA, plasmid DNA with insert, BAC, YAC
- Primer binding site (PBS) - part of the sequence in the NSK or 5 NSKF to which the primer binds.
- Reference sequence - a sequence already known, to which the deviations in the sequence to be examined or in the sequences to be examined (overall sequence) are determined.
- D Sequences accessible in databases can be used as reference sequences, e.g. from the NCBI database.
- Plane surface - surface which preferably has the following features: 1) It allows several individual molecules, preferably more than 100, more preferably more than 1000, to be detected simultaneously with the given objective-surface distance at one objective position. 2) The
- the immobilized individual molecules are in the same focal plane, which can be set reproducibly.
- Wide-field optical detection system - detection system that can simultaneously detect fluorescence signals from individual molecules distributed over a surface, the surface being approximately 100 ⁇ m 2 and larger.
- An example of wide-field detection optics is provided by the Axiovert 200 or fluorescence microscope Axioplan 2e (Zeiss) with a planofluar objective lOOx NA 1.4 oil immersion (Zeiss), or a planapochromatic objective lOOx NA 1.4 oil immersion (Zeiss);
- the fluorescence can be excited using a lamp, for example a mercury vapor lamp, or a laser or diode. Both epifluorescence mode and total internal reflection fluorescence microscopy (TIRF microscopy) mode or laser scanning microscopy mode can be used.
- This wide field detection optics is used in this application
- termination is the reversible stop in the installation of the modified, uncleaved NTs * .
- the termination comes after the installation of a modified NT * .
- the substituent can be split off under mild conditions, so that the 3" - OH function again for the incorporation of an NT * is available.
- a fluorescent dye is coupled to these substituents.
- Gene products - The gene products are the primary gene products of the genes. Essentially, these are RNA transcripts of the genes mentioned, which are also referred to as target sequences (or target nucleic acid sequences). In addition to mRNA, these target sequences also include single-stranded and double-stranded cDNA derived therefrom, RNA derived from cDNA or DNA amplified from cDNA.
- Single nucleotide poly orphisms single nucleotide polymorphisms, SNPs
- SNPs single nucleotide polymorphisms
- Another sequencing method is based on the hybridization of nucleic acid chains with short oligonucleotides. Mathematical methods are used to calculate how many oligonucleotides of a certain length are required to determine a complete sequence (ZT Strezoska et al. PNAS 1991 v.88 p.10089, RSDrmanac et al. Science 1993 v.260 p.1649 ). This method also has problems: only one sequence can be determined in one approach, secondary structures disrupt it Hybridization and repeats prevent correct analysis.
- a large number of identical single-stranded DNA pieces are fixed at a defined location on a surface and the signal from all of these many identical DNA pieces is analyzed.
- a solution with polymerase and nucleotides is added to this fixed DNA so that a complementary strand can be synthesized.
- the polymerase should work step by step: only one nucleotide is incorporated in each step. This is detected, whereupon the polymerase incorporates the next nucleotide in a next cycle.
- the analysis of gene expression is a complex task: the number of genes active in a cell type can be several thousand. However, the analysis should consider all genes contained in the genome of the species in question (approximately 32,000 in humans). In addition, the genes active in the respective cell type are first of all mostly not yet completely known and secondly they are expressed to different extents.
- the major disadvantages of the hybridization method include: The production of the oligonucleotides bound to the surface is expensive. The analysis is limited to genes whose sequences are already known. Multiple mismatch controls increase the number of oligonucleotides that need to be immobilized.
- the object of the present invention is therefore to provide a method for the sequence analysis of nucleic acid chains and the analysis of gene expression which does not have the disadvantages of the above-mentioned methods and, above all, enables a cheaper, faster and more efficient analysis of nucleic acid sequences.
- the method should be able to determine many sequences in parallel. It can then be used, for example, for the analysis of very long nucleic acid chains (several Mb) or for variant analysis on many short chains (mutation analysis, SNP analysis) in one approach.
- the present invention relates to a method for parallel sequence analysis of nucleic acid sequences (nucleic acid chains, NSKs), in which
- NSKFs Single-stranded NSKs with a length of about 50 to 1000 nucleotides are generated, which represent overlapping partial sequences of the total sequences
- the NSKFs are bound on a reaction surface in a random arrangement using one or more different primers in the form of NSKF-primer complexes cyclically build the complementary strand of the NSKFs using one or more polymerases by:
- a solution is added to the NSKF primer complexes bound to the surface, which contains one or more polymerases and one to four modified nucleotides (NTs * ) which are labeled with fluorescent dyes, the simultaneous use of at least two NTs * Fluorescent dyes located on the NTs * are selected such that the NTs * used can be distinguished from one another by measuring different fluorescence signals, the NTs * being structurally modified such that the polymerase does not form after incorporating such an NT * into a growing complementary strand is able to incorporate another NT * in the same strand, the terminating substituent being cleavable with the fluorescent dye
- stage b) the stationary phase obtained in stage a) is incubated under conditions which are suitable for the extension of the complementary strands, the complementary strands being extended in each case by one NT * , man
- stage b) the stationary phase obtained in stage b) is washed under conditions which are not suitable for the removal of NTs * which are not incorporated into a complementary strand
- the individual NTs * built into complementary strands are detected by measuring the signal which is characteristic of the respective fluorescent dye, the relative position at the same time of the individual fluorescence signals on the reaction surface is determined
- step f) the stationary phase obtained in step e) is washed under conditions which are suitable for removing the fluorescent dyes and the ligands
- stages a) to f) are repeated several times, if necessary,
- the overall sequence of the NSKs can be determined, for example, from the partial sequences determined.
- a parallel sequence analysis is understood to mean the simultaneous sequence analysis of many NSKFs (for example 1,000,000 to 10,000,000), these NSKFs being derived from a uniform NSK population or from several different NSK populations.
- the resulting population of overlapping partial sequences can be combined, for example, in de novo sequencing with commercially available programs to form the overall sequence of the NSK (Huang et al. Genom Res. 1999 v.9 p.868, Huang Genomics 1996 v.33 p.21, Bonfield et al. NAR 1995 v.23 p.4992, Miller et al. J. Comput. Biol. 1994 vl p.257).
- mutations or single nucleotide polymorphisms can be identified by comparing the obtained overlapping partial sequences with the reference sequence.
- the process can be carried out by repeating steps a) to f) of the cyclic build-up reaction several times, with
- the method can also be carried out by repeating steps a) to f) of the cyclic build-up reaction several times, alternately using two differently marked NTs * and two unmarked NTs in the cycles and one Total sequences determined by comparison with the reference sequence.
- the present invention also relates to a method for highly parallel analysis of gene expression, in which
- the gene products are bound on a reaction surface in a random arrangement using one or more different primers in the form of gene product-primer complexes, one
- a solution is added to the gene product-primer complexes bound to the surface which contains one or more polymerases and one to four modified nucleotides (NTs * ) which are labeled with fluorescent dyes, the simultaneous use of at least two NTs * each * located at the NTs fluorescent dyes are selected so located that can differ from each other, the NTs used * by measuring different fluorescent signals, the NTs * are structurally modified 'so that the polymerase after incorporation of such NT * not in a growing complementary strand is able to incorporate another NT * in the same strand, the terminating substituent being cleavable with the fluorescent dye, man
- stage b) the stationary phase obtained in stage a) is incubated under conditions which are suitable for the extension of the complementary strands, the complementary strands being extended in each case by one NT * , man
- stage b) the stationary phase obtained in stage b) is washed under conditions which are not suitable for the removal of NTs * which are not incorporated into a complementary strand
- the individual NTs * built into complementary strands are detected by measuring the signal characteristic of the respective fluorescent dye, at the same time determining the relative position of the individual fluorescent signals on the reaction surface
- step f) the stationary phase obtained in step e) is washed under conditions which are suitable for removing the fluorescent dyes and the ligands
- stages a) to f) are repeated several times, if necessary,
- the gene products are the primary gene products of the genes whose expression is to be analyzed. Essentially, these are RNA transcripts of the genes mentioned, which are also referred to as target sequences (or target nucleic acid sequences). In addition to RNA, these target sequences also include single-stranded and double-stranded cDNA derived therefrom, RNA derived from cDNA or DNA amplified from cDNA.
- the gene products or target sequences can either be isolated as mRNAs directly from a biological sample (e.g. cell extract, tissue extract or extract from whole organisms) or obtained as cDNAs by reverse transcription of the mRNAs.
- a highly parallel analysis is understood to mean the simultaneous sequence analysis of many gene product molecules (for example 1,000,000 to 10,000,000), these gene product molecules being a complex one represent a heterogeneous population that corresponds, for example, to a complete expression profile or an expression spectrum of a tissue.
- the process can be carried out by repeating steps a) to f) of the cyclic build-up reaction several times, with
- the method can also be carried out by repeating steps a) to f) of the cyclic build-up reaction several times, alternately using two differently labeled NTs * and two unlabeled NTs in the cycles and comparing the identity of the gene products with the reference sequences determined.
- the invention further relates to a kit for carrying out the method which contains a reaction surface, reaction solutions required for carrying out the method, one or more polymerases, and nucleotides (NTs), one to four of which are labeled with fluorescent dyes, the NTs * on the 3 'position are structurally modified so that after installing such an NT * in a growing complementary strand, the polymerase is not able to incorporate another NT * in the same strand, the terminating substituent being cleavable with the fluorescent dye.
- NTs nucleotides
- the kit also contains for the production of Single strands of double strands required reagents, single stranded nucleic acid molecules, which are introduced as PBS in the NSKFs, oligonucleotide primers, for cleaving off the fluorescent dyes and the reagents and / or washing solutions leading to termination.
- the method according to the invention is used to determine the nucleic acid sequences and can be used in various areas of genetics. This includes in particular the determination of unknown, long sequences, analysis of sequence polymorphisms and point mutations as well as the parallel analysis of a large number of gene sequences and the analysis of gene expression.
- the preparation of the material to be analyzed depends on the task at hand and the aim is to create a population of relatively small, single-stranded nucleic acid chain fragments (NSKFs) from a long nucleic acid chain. to form, to provide these fragments with a primer suitable for starting the sequencing reaction (NSKF-primer complexes) and to fix them on a flat surface.
- NSKFs single-stranded nucleic acid chain fragments
- NSKFs are fixed on a flat surface in such a way that an enzymatic reaction can take place on these molecules.
- different types of immobilization are possible, which depend on the objective, the type of NSK and the polymerase used for the reaction.
- the NSKFs are randomly distributed on the surface during immobilization or binding, that is, it is not necessary to ensure that the individual chains are positioned exactly.
- NSKF primer complexes can be bound to the surface via the NSKFs or primers.
- the NSKF primer complexes must be fixed to the surface in such a density ensure that the later-detected signals are clearly assigned by the installed NT * s to individual NSKFs.
- the sequencing reaction is started with all NSKF primer complex molecules immobilized on the surface.
- the sequencing is based on the synthesis of the complementary strand to each individual bound NSKF.
- Labeled NTs * are installed in the newly synthesized strand.
- the polymerase incorporates only one labeled NT * into the growing chain in one cycle.
- a cycle comprises the following steps:
- NTs * labeled nucleotides
- reaction conditions of step (b) in a cycle are selected so that the polymerases can incorporate a labeled NT * on more than 50% of the NSKFs (NSKF primer complexes capable of expansion) which are involved in the sequencing reaction, preferably on more than 90%.
- NSKFs NSKF primer complexes capable of expansion
- the number of cycles to be carried out depends on the task at hand, and is not theoretically limited and is preferably between 20 and 5000.
- the original NSK sequence can be reconstructed from the overlapping NSKF sequences ("Automated DNA sequencing and analysis” p. 231 ff. 1994 M. Adams et al. Academic Press, Huang et al. Genom Res. 1999 v. 9 p.868, Huang Genomics 1996 v.33 p.21, Bonfield et al. NAR 1995 v.23 p.4992, Miller et al. J.Comput. Biol. 1994 vl p.257).
- the entire population of NSKF sequences is searched for matches / overlaps in the sequences of NSKFs. Through these matches / overlaps, the NSKF can be aligned, e.g .:
- the errors in the method can be recorded and corrected using various means. All steps of the process can be largely automated.
- the gene products can bind to the surface in any arrangement. A previous complex synthesis of different oligonucleotides at certain positions (such as in the hybridization method) is therefore not necessary.
- the material can be analyzed on a standardized surface.
- the method is based on several principles:
- Short nucleotide sequences (10-50 NTs) contain enough information to identify the corresponding gene if the gene sequence itself is already contained in a database.
- a sequence of, for example, 10 NTs can form more than 10 ⁇ different combinations. For example, this is sufficient for most genes in the human genome, which according to current estimates contains 32,000 genes. The sequence can be even shorter for organisms with fewer genes.
- the method is based on a new method for sequencing individual nucleic acid chain molecules.
- the sequencing reaction takes place simultaneously on many molecules, the sequence of each individual bound nucleic acid chain being analyzed.
- mRNAs or nucleic acid chains derived from the mRNA can be used to investigate gene expression. Regardless of the exact composition, they are referred to below as gene products. Partial sequences of these gene products are also referred to below as gene products.
- the synthesis is based on the synthesis of a strand that is complementary to the gene product.
- the aim of the preparation is to provide gene product-primer complexes which are bound in a random manner on a flat surface and on which the incorporation of NT * s by the polymerase can take place (gene product-primer complexes which can be extended).
- the sequencing reaction is carried out with these bound gene product-primer complexes.
- a cycle comprises the following steps: a) addition of a solution with labeled nucleotides (NTs * ) and polymerase to bound gene product-primer complexes, b) incubation of the bound gene product-primer complexes with this solution under conditions which extend the complementary strands are suitable for an NT, c) washing, d) detection of the signals from individual modified NTs * molecules incorporated into the newly synthesized strands, e) removal of the label from the built-in nucleotides, f) washing.
- NTs * labeled nucleotides
- This cycle can be repeated several times, so that preferably 10 to 50 NTs are determined for each gene product-primer complex participating in the sequencing reaction.
- the nucleic acid sequences are then reconstructed from the detected signals.
- the determined sequences of the bound gene products are compared with one another to determine the abundances and are assigned to specific genes by comparison with gene sequences in databases.
- the sequencing and reconstruction of nucleic acid sequences is based on the shotgun principle ("Automated DNA sequencing and analysis” p. 231 ff. 1994 M. Adams et al. Academic Press, Huang et al. Genom Res. 1999 v.9 p. 868 , Huang Genomics 1996 v.33 p.21, Bonfield et al. NAR 1995 v.23 p.4992, Miller et al. J.Comput. Biol. 1994 vl p.257).
- the sequence of a long piece of DNA is determined by sequencing small DNA fragments and subsequent reconstruction.
- the material to be analyzed (1) is prepared for the sequencing reaction by breaking it down into fragments of preferably 50 to 1000 bp in length (2). Each fragment is then provided with a primer binding site and a primer (3). This mixture of different DNA fragments is now fixed on a flat surface (4). The unbound DNA fragments are removed by a washing step. The sequencing reaction is then carried out on the entire reaction surface. This reaction is cyclical. In the first step of the cycle, an NT * labeled with a fluorescent dye is incorporated into the growing strand: the reaction is controlled so that only one labeled NT * from a polymerase can be incorporated into the growing strand in each cycle.
- NTs * which have a reversibly coupled substituent leading to the termination at the 3 "position of the deoxyribose.
- the incorporation of a further labeled NT * is thereby made impossible.
- the polymerase and the labeled NTs * become simultaneous (5).
- the reaction mixture is then removed and washed the surface in a suitable manner (6).
- a detection step (7) The surface is scanned with a device suitable for single molecule detection (consisting of light source, microscope, camera, scanning table, computer with control and image recognition or image processing software) and the signals of the individual, built-in marked NTs * identified. After the detection step, the marker and the substituent leading to the termination are removed from all built-in NTs * (8).
- a new cycle can begin.
- the DNA fragments should be several hundred NT long if the reconstruction is carried out according to the shotgun principle ("Automated DNA sequencing and analysis” p. 231 ff 1994 M. Adams et al. Academic Press, Huang et al. Genom Res. 1999 v.9 p.868, Huang Genomics 1996 v.33 p.21, Bonfield et al. NAR 1995 v.23 p.4992, Miller et al. J. Comput. Biol. 1994 vl p.257). Since only one marked NT * is installed per cycle, at least 300 cycles are required for sequencing.
- the aim of the material preparation is to obtain bound single-stranded NSKFs with a length of preferably 50-1000 NTs, a single primer binding site and a hybridized primer (bound NSKF-primer complexes).
- NSKF primer complexes have, for example, the structure shown in FIG. 2.
- very variable constructions can be derived from this general structure.
- here are some examples, whereby the methods listed can be used individually or in combination.
- NSKs are fragmented in such a way that fragments are obtained which represent overlapping partial sequences of the total sequences. This is achieved by methods in which fragments of different lengths are formed as fission products in a random distribution.
- the nucleic acid chain fragments can be produced by several methods, for example by fragmentation of the starting material with ultrasound or by endonucleases ("Molecular cloning”. 1989 j.Sambrook et al. Cold Spring Harbor Laborotary Press), such as, for example, by non-specific endonuclease mixtures .
- ultrasound fragmentation is preferred.
- the conditions can be set so that fragments with an average Length of 100 bp to 1 kb arise. These fragments are then filled in at their ends by the Klenow fragment (E. coli polymerase I) or by the T4 DNA polymerase ("Molecular cloning" 1989 J.Sambrook et al. Cold Spring Harbor Laborotary Press).
- complementary short NSKFs can be synthesized from a long NSK using randomized primers. This method is particularly preferred when analyzing the gene sequences.
- Single-stranded DNA fragments with randomized primers and a reverse transcriptase are formed on the mRNA (Zhang-J et al. Biochem. J. 1999 v.337 p.231, Ledbetter et al. J.Biol.Chem. 1994 v.269 P.31544, Rolls et al. Anal.Biochem. 1993 v.208 p.264, Decraene et al. Biotechniques 1999 v.27 p.962).
- the primer binding site is a sequence section which is intended to enable selective binding of the primer to the NSKF.
- the primer binding sites can be different, so that several different primers must be used.
- certain sequence segments of the overall sequence can serve as natural PBSs for specific primers. This embodiment is particularly suitable for the investigation of already known SNP sites, see. Example 4 "SNP analysis with sequence-specific primers".
- the primer binding sites are therefore introduced separately into the NSKFs. In this way, primers with a uniform structure can be used for the reaction. This embodiment will be described in detail below.
- the composition of the primer binding site is not restricted. Their length is preferably between 20 and 50 NTs.
- the primer binding site can carry a functional group for immobilizing the NSKF. This functional group can e.g. be a biotin group.
- FIG. 3a A double-stranded oligonucleotide complex with a primer binding site is used (FIG. 3a). This is ligated to the DNA fragments using commercially available ligases ("Molecular cloning" 1989 J.Sambrook et al. Cold Spring Harbor Laborotary Press). It is important that only a single primer binding site be ligated to the DNA fragment. This is achieved e.g. by modification of one side of the oligonucleotide complex on both strands (FIG. 3b). The results after the ligation or after subsequent denaturation are shown in FIGS. 3c and 3d. The modifying groups on the oligonucleotide complex can be used for immobilization.
- oligonucleotide complex can be carried out according to standardized regulations.
- synthesis e.g. the DNA synthesizer 380 A Applied Biosystems can be used.
- Oligonucleotides with a certain composition with or without modifications are also commercially available as custom synthesis, e.g. from MWG-Biotech GmbH, Germany.
- one terminal deoxynucleotidyl transferase can be used for several (e.g. between 10 and 20) attach nucleoside monophosphates to the 3 'end of an SS DNA fragment ("molecular cloning" 1989 J.Sambrook et al. Cold Spring Harbor Laborotary Press, "Method in Enzymology” 1999 v.303, p. 37-38) (FIG. 4), for example several guanosine monophosphates (called (G) n-tailing).
- the resulting fragment is used to bind the primer, in this example a (C) n primer.
- Single-stranded NSKFs are required for the sequencing reaction. If the starting material is in double-stranded form, there are several possibilities for producing a single-stranded form from double-stranded DNA (e.g. heat denaturation or alkali denaturation) ("Molecular cloning" 1989 J.Sambrook et al. Cold Spring Harbor Laborotary Press).
- the composition and the length of the primer are not restricted.
- the primer can also perform other functions, such as to create a connection to the reaction surface. Primers should be adapted to the length and composition of the primer binding site so that the primer enables the sequencing reaction to be started with the respective polymerase.
- sequence-specific primers for the respective primer binding site are used.
- a primer mixture is used for sequencing.
- a single primer is used at the primer binding site coupled to NSKFs.
- the length of the primer is preferably between 6 and 100 NTs, optimally between 15 and 30 NTs.
- the primer can carry a functional group which serves to immobilize the NSKF, for example such a functional group is a biotin group (see section Immobilization). It should not interfere with sequencing.
- the synthesis of such a primer can e.g. can be carried out with the DNA synthesizer 380 A Applied Biosystems or as a custom synthesis from a commercial provider, e.g. MWG-Biotech GmbH, Germany).
- the primer Prior to hybridization to the NSKFs to be analyzed, the primer can be fixed on the surface using various techniques or synthesized directly on the surface, for example according to (McGall et al. US Patent 5412087, Barrett et al. US Patent 5482867, Mirzabekov et al. US Patent 5981734, "Microarray biochip technology” 2000 M. Schena Eaton Publishing, “DNA Microarrays” 1999 M. Schena Oxford University Press, Fodor et al. Science 1991 v.285 p.767, Timofeev et al. Nucleic Acid Research (NAR) 1996, v.24 p.3142, Ghosh et al. NAR 1987 v.15 p.5353, Gingeras et al. NAR 1987 v.15 p.5373, Maskos et al. NAR 1992 v.20 p.1679).
- the primers are on the surface of microns, for example, in a density of between 10 to 100 microns per 100 2, 100 to 10,000 per 100 2, or 10,000 to 1,000,000 per lOO ⁇ m 2 bound.
- the primer or mixture of primers is incubated with NSKFs under hybridization conditions, which allow it to bind selectively to the primer binding site of the NSKF.
- This primer hybridization (annealing) can take place before (1), during (2) or after (3) the binding of the NSKFs to the surface.
- the optimization of the hybridization conditions depends on the exact structure of the primer binding site and the primer and can be according to Rychlik et al. Calculate NAR 1990 v.18 p.6409. In the following, these hybridization conditions are referred to as standardized hybridization conditions.
- primer binding site with a known structure that is common to all NSKFs is introduced, for example by ligation, primers with a uniform structure can be used.
- the primer binding site can carry a functional group at its 3 'end which e.g. is used for immobilization.
- this group is a biotin group.
- the primer has a structure that is complementary to the primer binding site.
- the aim of the fixation is to fix NSKF primer complexes on a suitable flat surface in such a way that a cyclic enzymatic sequencing reaction can take place. This can be done, for example, by binding the primer (see above) or the NSKF to the surface.
- the NSKF primer complexes can first be formed in a solution by hybridization (annealing) and then bound to the surface.
- NSKFs can then be hybridized to the bound primers, whereby NSKF-primer complexes are formed (NSKFs indirectly bound to the surface)
- the NSKFs can first be bound to the surface (NSKFs bound directly to the surface) and in the subsequent step the primers can be hybridized to the bound NSKFs, resulting in NSKF-primer complexes.
- the NSKFs can therefore be immobilized on the surface by direct or indirect binding.
- the surface and the reaction surface are to be understood as equivalent terms, unless explicitly referred to another meaning.
- the surface of a solid phase of any material serves as the reaction surface. This material is preferably inert towards enzymatic reactions and does not cause any interference with the detection. Silicon, glass, ceramics, plastics (e.g. polycarbonates or polystyrenes), metal (gold, silver, or aluminum) or any other material that meets these functional requirements can be used.
- the surface is preferably not deformable, since otherwise the signals are likely to be distorted upon repeated detection.
- this gel can e.g. be an agarose or polyacrylamide gel.
- the gel is preferably freely passable for molecules with a molecular mass below 5000 Da
- a gel surface has the advantage over other solid surfaces that there is a significantly lower non-specific binding of NT * s to the surface.
- the gel is preferably attached to a solid surface
- This firm base can be silicone, glass, ceramic, Plastic (e.g. polycarbonates or polystyrenes), metal (gold, silver, or aluminum) or any other material.
- the thickness of the gel is preferably not more than 0.1 mm.
- the gel thickness is preferably greater than the simple depth of field of the lens, so that NTs * bound non-specifically to the solid base do not reach the focal plane and are therefore detected. If the depth of focus is 0.3 ⁇ m, for example, the gel thickness is preferably between 1 ⁇ m and 100 ⁇ m.
- the surface can be produced as a continuous surface or as a discontinuous surface composed of individual small components (eg agarose beads) (FIG. 5b).
- the reaction surface must be large enough to be able to immobilize the necessary number of NSKFs with the appropriate density.
- the reaction surface should preferably not be larger than 20 cm 2 .
- the different cycle steps require an exchange of the different reaction solutions above the surface.
- the reaction surface is preferably part of a reaction vessel.
- the reaction vessel is in turn preferably part of a reaction apparatus with a flow device.
- the flow device enables the solutions in the reaction vessel to be exchanged.
- the exchange can take place with a pump device controlled by a computer or manually. It is important that the surface does not dry out.
- the volume of the reaction vessel is preferably less than 50 ⁇ l. Ideally, its volume is less than 1 ⁇ l. An example of such a flow system is given in Fig.6.
- NSKF primer complexes are fixed on the surface via the NSKFs, this can be done, for example, by binding the NSKFs to one of the two chain ends. This can be achieved by appropriate covalent, affine or other bonds.
- Immobilization of nucleic acids is known (McGall et al. US Patent 5412087, Nikiforov et al. US Patent 5610287, Barrett et al. US Patent 5482867, Mirzabekov et al. US Patent 5981734, "Microarray biochip technology” 2000 M. Schena Eaton Publishing, " DNA Microarrays "1999 M. Schena Oxford University Press, Rasmussen et al.
- the NACFs be on the surface of microns, for example, at a density between 10 and 100 microns NACFs per 100 2, 100 to 10,000 per 100 2, bound 10,000 to 1,000,000 per lOO ⁇ rn. 2
- the density of NSKF primer complexes which can be extended is necessary for the detection to be approximately 10 to 100 per 100 ⁇ m 2 . It can be achieved before, during or after hybridization of the primers to the gene products.
- the NSKFs are immobilized via biotin-avidin or biotin-streptavidin binding.
- Avidin or streptavidin is covalently bound on the surface, the 5 'end of the primer contains biotin.
- the labeled primers After the labeled primers have hybridized with the NSKFs (in solution), they are fixed on the surface coated with avidin / streptavidin.
- the concentration of the hybridization products labeled with biotin and the time of incubation of this solution with the surface is chosen such that a density suitable for sequencing is already in this step is achieved.
- the primers suitable for the sequencing reaction are fixed on the surface using suitable methods before the sequencing reaction (see above).
- the single-stranded NSKFs, each with one primer binding site per NSKF, are thus incubated under hybridization conditions (annealing). They bind to the fixed primers and are thereby bound (indirect binding), resulting in primer-NSKF complexes.
- the concentration of the single-stranded NSKFs and the hybridization conditions are chosen such that an immobilization density of 10 to 100 NSKF primer complexes capable of extension per 100 ⁇ m 2 suitable for sequencing is achieved. After hybridization, unbound NSKFs are removed by a washing step.
- a surface with a high primer density is preferred, for example approximately 1,000,000 primers per 100 ⁇ m 2 or even higher, since the desired density of NSKF-primer complexes is achieved more quickly, the NSKFs only binding to a part of the primers ,
- the NSKFs are bound directly to the surface (see above) and then incubated with primers under hybridization conditions.
- a density of approx. 10 to 100 NSKFs per 100 ⁇ m 2 attempts will be made to prime all available NSKFs and to make them available for the sequencing reaction. This can be achieved, for example, by high primer concentration, for example 1 to 100 mmol / 1.
- high primer concentration for example 1 to 100 mmol / 1.
- the density of the NSKF-primer complexes required for optical detection can be achieved during the primer hybridization.
- the hybridization conditions should be selected so that the primers only bind to a part of the immobilized NSKFs.
- a blocking solution is preferably applied to the surface before step (a) in each cycle, which serves to avoid non-specific adsorption of NTs * on the surface.
- an albumin solution (BSA) with a pH between 8 and 10 fulfills these conditions for a blocking solution.
- thermostable polymerases such as Taq polymerase (GibcoBRL), proHATM polymerase
- Polymerases with 3 '-5' exonuclease activity can be used (eg Klenow fragment of E. coli polymerase I), provided reaction conditions are selected, suppress the existing 3 '-5' exonuclease activity, such as a low pH Value (pH 6.5) for the Klenow fragment (Lehman and Richardson, J. Biol. Chem. 1964 v.239 p.233) or addition of NaF for the installation reaction.
- Another possibility is to use NTs * with a phosphorothioate compound (Kunkel et al. PNAS 1981, v.78 p.6734). Built-in NTs * are not attacked by the 3 '-5' exonuclease activity of the polymerase. All these types of polymerases are referred to below as "polymerase”. 4th 5 chemistry
- NT * s preferably 2'-deoxy nucleotide triphosphates
- This substituent alone or together with the fluorescent dye, can lead to the termination of the incorporation reaction and can be split off from the nucleotide under mild conditions.
- a fluorescent dye which is characteristic of the respective NT * is coupled to these substituents, so that the substituent also assumes the role of a linker between the nucleotide and the fluorescent dye.
- the fluorescent dye is preferably coupled to this linker by a bond which can be cleaved under mild conditions.
- Standard conditions are understood to mean cleavage conditions which neither lead to denaturation of the primer-nucleic acid complex, nor to the cleavage of its individual components.
- Formulas (1-3) are examples of the reversible cleavable terminators:
- NT-3'-0 - represents the 2 'deoxy nucleoside triphosphate residue.
- S (l) - represents a substituent (Formula 1) that can be split off from the NT * under mild conditions.
- a fluorescent dye (F) is coupled to these substituents.
- S (2) -N - represents another substituent (formula 2 and 3), which can be split off from the NT * under mild conditions.
- This substituent is linked to the fluorescent dye (F) by a group (N) which is cleavable under mild conditions.
- the fluorescent dye can be coupled directly to the cleavable group (formula 2) or by a further linker (L) (formula 3).
- Each nucleotide is marked with a characteristic marker (F).
- the marker is a fluorescent dye. The choice is not restricted if the dye meets the following requirements:
- the detection apparatus used must have this marker as the only molecule bound to DNA under mild conditions
- the dyes preferably have great photostability. Their fluorescence is preferably not quenched by the DNA or only to a minor extent.
- NTs * marked with the dye must be incorporated into the nucleic acid chain by the polymerase. d) When labeled with different dyes, these dyes should not have any significant overlaps in their emission spectra.
- fluorophores which can be used in the context of the present invention are compiled with structural formulas in "Handbook of Fluorescent Probes and Research Chemicals" 6th ed. 1996, R.Haugland, Molecular Probes.
- the following classes of dyes are preferably used as markers: cyanine dyes and their derivatives (for example Cy2, Cy3, Cy5, Cy7 Amersham Pharmacia Biotech, Waggoner US Pat. No. 5,268,486), rhodamines and their derivatives (for example TAMRA, TRITC, RG6, R110 , ROX, Molecular Probes, see manual), xanthene derivatives (e.g. Alexa 568, Alexa 594, Molecular Probes, Mao et al. US Pat. No. 6,130,101) and porphyrins (Porphyrin-Systems, Germany). These dyes are commercially available.
- dyes can be selected.
- the dyes are bound to the NT * via a cleavable linker.
- the dyes can be attached to the linker e.g. via thiocyanate or ester linkage ("Handbook of Fluorescent Probes and Research Chemicals" 6th ed. 1996, R.Haugland, Molecular Probes, Jameson et al. Methods in Enzymology 1997 v.278 p.363, Waggoner Methods in Enzymology 1995 v.246 p.362)
- the substituent leading to the termination is coupled to the NT by a bond which can be split under mild conditions.
- esters and acetals examples are esters and acetals.
- the esters are preferably cleaved in the basic pH Range (e.g. 9 to 11). Acetals are split in the acidic range (for example between 3 and 4).
- Esters can also be eliminated enzymatically by polymerases or esterases.
- the substituent is split off together with the fluorescent dye in one step.
- the fluorescent dye is coupled to the substituents by a group which is cleavable under mild conditions.
- the group mentioned preferably belongs to chemically or enzymatically cleavable or photolabile compounds. Ester, thioester, disulfide compounds and photolabile compounds are particularly suitable as a cleavable connection between the substituent and the fluorescent dye.
- ester, thioester and disulfide compounds are preferred ( ⁇ Chemistry of protein conjugation and crosslinking "" Shan S. Wong 1993 CRC Press Inc., Herman et al. Method in Enzymology 1990 v. 184 S. 584, Lomant et al. J. Mol. Biol. 1976 v.104 243, "Chemistry of carboxylic acid and esters” S.Patai 1969 Interscience Publ.).
- photolabile compounds can be found in the following references: "Protective groups in organic synthesis” 1991 John Willey & Sons, Inc., V. Pillai Synthesis 1980 Sl, V. Pillai Org. Photochem.
- the cleavage step is present in every cycle and must take place under mild conditions so that the nucleic acids are not damaged or modified.
- the cleavage preferably proceeds chemically (for example in a mild acidic or basic environment for an ester compound or by adding a reducing agent, for example dithiothreitol or mercaptoethanol (Sigma) when cleaving a disulfide compound), or physically (for example by illuminating the surface with light a certain wavelength for the cleavage of a photolabile group, thesis ⁇ New photolabile protecting groups for the light-controlled oligonucleotide synthesis "" " H. Giegrich, 1996, Constance).
- a reducing agent for example dithiothreitol or mercaptoethanol (Sigma)
- the fluorescent dye is first split off and only then is the substituent which is coupled to the 3 'position and leads to the termination.
- a cycle can preferably be carried out with:
- a label with two dyes can be selected. Two pairs of NTs * are formed, each marked differently, eg A and G are marked "X”, C and U are marked "Y”. Two differently labeled NTs * are used simultaneously in the reaction in one cycle (s), for example C * in combination with A * , and U * and G * are then added in the subsequent cycle (n + 1).
- NT * s are used per cycle and two unmarked NTs (so-called 2NT * s / 2NTs method).
- This embodiment can be used to determine variants (eg mutations, or alternatively spliced genes) of an already known sequence.
- the incorporation of NT * s into the NSKFs preferably takes place in such a way that a labeled NT * is incorporated in more than 50% of the NSKFs involved in the sequencing reaction, preferably in more than 90%. This is due to the fact that the reaction takes place very slowly on some nucleic acid chains.
- An installation of the NTs * at every complementary position in each cycle is aimed for, but is not necessary because only the successful installation reactions are detected and evaluated; a delayed reaction in the subsequent cycle does not lead to a sequencing error.
- the same polymerase is preferred for all NTs * used. However, different polymerases can also be used for different NTs * .
- AtomForce microscopy electron microscopy, near-field fluorescence microscopy, wide-field fluorescence
- fluorescence signals of individual NTs * built into the nucleic acid chain are preferably measured using a wide-field fluorescence microscope (epifluorescence) or a laser scanning microscope (epifluorescence) or a TIRF microscope (Total Internal Reflection Fluorescence Microscope).
- the device for the excitation light can function, for example, on the basis of a laser, a lamp or diodes.
- the detection device can serve both CCD cameras and PMT.
- the detection comprises the following phases:
- each detection step running as a scanning process and comprising the following operations: a) adjusting the position of the lens (X, Y axis), b) adjusting the focal plane (Z axis), c) detecting the Signals of individual molecules, assignment of the signal to NT * and assignment of the signal to the respective NSKF, d) shift to the next position on the surface.
- the signals from NTs * built into the NSKFs are registered by scanning the surface.
- the scanning process can be carried out in various ways ("Confocal Laser Scanning Microscopy” 1997 Ed. Sheppard, BIOS Scientific Publishers, “New Techniques of optical microscopy and microspectroscopy” 1991 Ed. R.Cherry CRC Press, Inc., "Fluorescence microscopy” 1998 2. ed. Herman BIOS Scientific Publishers, “Handbook of biological confocal microscopy” 1995 J. Pavley Plenum Press).
- a discontinuous scanning process is selected.
- the objective is moved step by step over the surface (FIG. 8a), so that a two-dimensional image (2D image) is created from each surface position (FIG. 8b, c).
- This 2D image can be created using various methods: for example by laser scanning a position of the microscope field (laser scanning microscopy) or by Camera recording at one position (see manuals of microscopy).
- laser scanning microscopy laser scanning microscopy
- Camera recording at one position see manuals of microscopy.
- the detection of individual molecules with a CCD camera is described as an example.
- the positions of the NSKFs must be determined for sequencing so that one has a basis for the assignment of the signals. Knowing these positions allows a statement to be made as to whether the signals of individual molecules come from built-in NTs * or from randomly bound NTs * . These positions can be identified using various methods. In a preferred embodiment, the positions of bound NSKF primer complexes are identified during sequencing. This makes use of the fact that the signals from the NTs * built into the nucleic acid chain always have the same coordinates. This is ensured by fixing the nucleic acid chains. The non-specifically bound NTs * bind randomly at different points on the surface.
- the signals are checked for agreement of their coordinates from several successive cycles. This can e.g. at the beginning of the sequencing.
- the matching coordinates are evaluated as coordinates of the DNA fragments and saved.
- the scan system must be able to scan the surface reproducibly over several cycles.
- X, Y and Z axis settings at each surface position can be controlled by a computer.
- the stability and reproducibility of the setting of lens positions in each scanning process determine the quality of the detection and thus the identification of the signals of individual molecules.
- Signals from individual molecules are placed in relation to the pattern, so that an X, Y deviation in the pattern position means the same X, Y deviation in the position of the signals of individual molecules.
- the control image of the pattern can be taken before, during or after the detection of individual molecules. Such a control picture must be made accordingly with each setting on a new surface position.
- the surface is not absolutely flat and has various unevenness. This changes the surface-lens distance when scanning neighboring locations. These differences in distance can lead to individual molecules leaving the focal plane and thus avoiding detection.
- the following method can be used: Since the excitation of individual molecules can lead to the extinction of their fluorescence, a marker is applied to the surface, which serves to adjust the focal plane. The signals of individual molecules are then detected.
- the marker can be of any nature (eg dye or pattern), but must not interfere with the detection and the reaction.
- the two-dimensional image of the reaction surface generated with the aid of the detection system contains the signal information from NT * s built into the NSKFs. Before further processing, these must be extracted from the total amount of image information using suitable methods.
- the algorithms required for scaling, transforming and filtering the image information are part of the standard repertoire of digital image processing and pattern recognition (Haberburger P. "Practice of digital image processing and pattern recognition”. Hanser-Verlag, Kunststoff, Vienna, 1995; Galbiati LJ “Machine vision and digital image processing fundamentals ". Prentice Hall, Englewood Cliffs, New Jersey, 1990).
- the signal extraction is preferably carried out via a gray value image, which depicts the brightness distribution of the reaction surface for the respective fluorescence channel.
- a separate gray value image can first be generated for each fluorescence-labeled nucleotide (A, T, C, G or U).
- A, T, C, G or U fluorescence-labeled nucleotide
- a gray value image is generated for each fluorescence channel by using suitable filters (Zeiss filter sets).
- the relevant color channels are extracted from a recorded multichannel color image with the aid of a suitable algorithm by an image processing program and are individually processed further as a gray-scale image.
- a channel-specific color threshold algorithm is used for channel extraction.
- the relevant image information is then extracted from this amount of data by a suitable program.
- a suitable program should implement the following steps:
- Preprocessing of the image for example, if necessary, reducing the image noise caused by the digitization of the image information, for example by gray value smoothing.
- a pixel (x, y) fulfills these requirements, then a comparison with the coordinates of NSKFs identified in previous sequencing cycles follows. If there is a match, the signal is assigned to the nucleotide resulting from the respective fluorescence channel NACF. Signals with mismatched coordinates are evaluated as background signals and rejected. The signals can be analyzed in parallel with the scanning process.
- an 8-bit gray value image with a resolution of 1317 x 1035 pixels was used.
- the overall image was first preprocessed: the average value of the brightness of its 8 neighbors was assigned to each pixel. With the selected resolution, this results in a typical pattern for a fluorescence dot of a central pixel with the greatest brightness value and neighboring pixels with brightness decreasing on all sides. If a pixel met these criteria and the centrifugal drop in brightness exceeded a certain threshold value (to exclude fluorescence spots that were too weak), this central pixel was evaluated as the coordinate of a fluorescence spot.
- a sequence of recordings can be made with the control of the X, Y position, the adjustment of the focal plane and with the detection of individual molecules at each new lens position. These steps can be controlled by a computer.
- the scanning process and the biochemical reaction take a certain amount of time. If you switch these processes one after the other, you can achieve an optimal performance of the equipment. In a preferred embodiment, the response to performed two separate surfaces.
- a surface with bound NSKF primer complexes can be separated into two spatially isolated parts, so that reactions on these two parts can take place independently of one another.
- NSKFs can also be immobilized on two separate surfaces from the outset.
- the reaction is then started.
- the principle is that while the reaction and washing steps take place on part of the surface, the second part is scanned. This enables the analysis to run continuously and the speed of sequencing to be increased.
- the number of surfaces on which the reaction takes place can also be greater than 2. This makes sense if the reaction occurs as a time-limiting step, i.e. the detection of the signals on the surface is faster than the reaction and washing steps. In order to adapt the total duration of the reaction to the detection duration, each individual step of the reaction can take place on a single surface with a time delay compared to the next surface.
- all four NTs * used in the reaction are labeled with fluorescent dyes.
- the sequencing of long nucleic acid chains will be shown schematically on the basis of the sequencing of a 1Mb piece of DNA (FIG. 1).
- the sequencing is based on the shotgun principle ("Automated DNA sequencing and analysis” p. 231 ff. 1994 M. Adams et al. Academic Press, Huang et al. Genom Res. 1999 v.9 p.868, Huang Genomics 1996 v .33 p.21, Bonfield et al. NAR 1995 v.23 p.4992, Miller et al. J.Comput. Biol. 1994 vl p.257).
- the material to be analyzed is prepared for the sequencing reaction by breaking it down into fragments of preferably 50 to 1000 bp in length.
- each fragment is then provided with a primer binding site and a primer.
- This mixture of different DNA fragments is now fixed on a flat surface.
- the unbound DNA fragments are removed by a washing step.
- the sequencing reaction is then carried out on the entire reaction Surface performed.
- the sequences of NSKFs should preferably be longer than 300 NTs, on average approx. 400 bp. Since only one marked NT * is installed per cycle, at least 400 cycles are required for sequencing.
- the NSKF sequences determined represent a population of overlapping partial sequences which can be combined with commercially available programs to form the overall sequence of the NSK ("Automated DNA sequencing and analysis” p. 231 ff. 1994 M. Adams et al. Academic Press, Huang et al. Genom Res. 1999 v.9 p.868, Huang Genomics 1996 v.33 p.21, Bonfield et al. NAR 1995 v.23 p.4992, Miller et al. J. Comput. Biol. 1994 vl S. 257).
- sequences can be analyzed in one approach instead of one sequence.
- the original sequences can be extracted from the raw data e.g. be reconstructed according to the shotgun principle.
- NSKFs are created. For example, one can convert mRNA into a double-stranded cDNA and fragment this cDNA with ultrasound. These NSKFs are then provided with a primer binding site, denatured, immobilized and hybridized with a primer. It should be noted with this variant of the sample preparation that the cDNA molecules can represent incomplete mRNA sequences (Method in Enzymology 1999, v.303, p.19 and other articles in this volume, "cDNA library protocols" 1997 Humana Press).
- NSKFs single-stranded NSKFs from mRNA
- Another possibility for the generation of single-stranded NSKFs from mRNA is the reverse transcription of the mRNA with randomized primers.
- Many relatively short antisense DNA fragments are formed (Zhang-J et al. Biochem. J. 1999 v.337 p.231, Ledbetter et al. J.Biol.Chem. 1994 v.269 p.31544, Rolls et al Anal.Biochem. 1993 v.208 p.264, Decraene et al. Biotechniques 1999 v.27 p.962).
- These fragments can then be provided with a primer binding site (see above). Further steps correspond to the processes described above. With this method, complete mRNA sequences (from the 5 'to the 3' end) can be analyzed, since the randomized primers bind over the entire length of the mRNA.
- Immobilized NSKFs are analyzed using one of the sequencing embodiments listed above. Since mRNA sequences have significantly fewer repetitive sequences than, for example, genomic DNA, the number of signals detected by the built-in NTs * from an NSKF can be less than 300 and is preferably between 20 and 1000. The number of NSKFs that need to be calculated is calculated following the same principles as a shot-shot reconstruction of a long sequence.
- the original gene sequences are reconstructed from NSKF sequences according to the principles of the shotgun method.
- This method allows the sequencing of many mRNAs without prior cloning.
- NSKFs are sequenced using the method according to the invention, wherein both a uniform primer and a uniform primer binding site as well as different, sequence-specific primers and natural ones which occur in the overall sequence to be investigated
- NSKFs can use primer binding sites.
- the determined sequences of NSKFs are then not assembled using the shotgun method, but are compared with the reference sequence and in this way assigned to their positions in the full sequence. This can be genomic or cDNA sequences.
- the length of the NSKF sequences determined should be sufficient for an unambiguous assignment to a specific position in the reference sequence, for example sequences with a length of 20 NTs (e.g. from non-repetitive sections in the human genome) can be clearly identified. Longer sequences are required for the comparative analysis of the repetitive sections. The exact length of the sequences depends on the task.
- the length of the NSKF sequences determined is preferably more than 20 NTs when analyzing non-repetitive sections. For the analysis of the repetitive sections, it is preferably over 500 NTs.
- the objectives when sequencing new variants of an already known full sequence can be very different.
- a comparison of the newly determined sequence with the known full sequence / reference sequence is usually sought.
- the two sequences can originate from species that are evolutionarily different from one another. Different parameters of the composition of these two sequences can be compared. Examples of such an analysis are: mutation or polymorphism analyzes and the analysis of alternative spliced gene products.
- a long sequence to be analyzed is shared in NSKFs using one of the above methods.
- These NSKFs are sequenced using uniform primers using the method according to the invention.
- the sequences determined from each individual NSKF are compared directly with the reference sequence.
- the reference sequence serves as the basis for the assignment of determined NSKF sequences, so that the time-consuming reconstruction using the shotgun method is not necessary.
- the length of the NSKF sequences determined is preferably more than 20 NTs when analyzing non-repetitive sections. For the analysis of the repetitive sections, it is preferably over 500 NTs.
- the number of NSKFs to be analyzed depends on the total length of the sequence to be examined, the average length of the NSKF sequences and the necessary accuracy of the sequencing.
- a total length of the sequence to be examined of 1 Mb and an accuracy that corresponds to the raw sequence determination (i.e. each location should be sequenced only once if possible), e.g. approx. 5 times the amount of raw sequences, i.e. 5 Mb because the NSKFs are randomly distributed over the entire sequence.
- a total of 50,000 NSKFs must be analyzed to cover more than 99% of the total route.
- NSKF sequences are then assigned to the full sequence using a commercially available program and any deviations are detected.
- a program can be based on, for example, BLAST or FASTA algorithm ("Introduction to computational Biolögy "1995 MS Waterman Chapman & Hall)
- 2 modified NTs * and 2 unmodified NTs are used for the analysis of the sequences.
- This method is particularly suitable for analyzing the sequence variants (e.g. SNP or mutation analysis) and requires knowledge of a reference sequence.
- the full sequence is not reconstructed, but the determined sequences are assigned to the reference sequence using a program and any deviations are registered.
- a program can e.g. based on the BLAST or FASTA algorithm ("Introduction to computational Biology” 1995 M.S. Waterman Chapman & Hall).
- This embodiment is based on the principle that a sequence of 2 signals (marked NT * s) can contain enough information to identify a sequence.
- the determined sequence is compared with the reference sequence and assigned to a specific position, for example:
- NSKFs The unknown variant of the reference sequence to be analyzed is prepared for sequencing as described above (NSK is converted into NSKFs, these are ligated with PBS, then hybridized with a primer and immobilized on the reaction surface). NSKFs prepared in this way are sequenced using the 2NTs * / 2NTs method. NSKF sequences are obtained, each NSKF sequence being a sequence of 2NTs * . In order to enable the determined sequence to be clearly assigned to a known reference sequence, this sequence must be long enough. The length of the NSKF sequences determined is preferably more than 40 NT * s.
- the total length of the complementary strand synthesized is approximately twice as long as the sequence of the detected NTs * (with 40 detected NTs * , the total length is z, e.g. 80 NTs on average).
- NTs * marked with a fluorescent dye appear as semiterminators in the present invention, ie termination occurs only when modified NTs * are available , unmodified NTs must be added to the reaction in an additional step in each cycle. The exact position of this step in the cycle can vary. It is important that the marked NTs * and the unmodified NTs are used separately.
- a cycle in this embodiment may look as follows, for example:
- This 2NT * s / 2NTs method is suitable, for example, for the SNP analysis of a genomic stretch of a gene or for double-stranded cDNA analysis. It is based on the following principles:
- NT * s permissible combinations of NT * s in this embodiment are: A * C * ; A * G * ; C * T * / C * U * ; G * T * / G * U * .
- the combination C * and U * is preferred.
- the NSKFs come from both strands of the NSK to be analyzed and • the NSKF sequences determined cover the entire length of the sequence to be analyzed.
- the sequence to be checked is sequenced using the 2NT * s / 2NTs method, so that a population of NSKF sequences (determined NSKF sequences (n)) is formed.
- the NSKF sequences determined contain information from each strand:
- a comparison sequence (reference sequence) is required for analysis:
- NSKF sequences are assigned to specific points in the comparison sequence and possible deviations are detected:
- a double-stranded nucleic acid can be examined for SNP or mutations.
- the NSKF sequences determined are compared with a reference sequence compared.
- the basic rules of comparing a partial sequence and a complete sequence in the analysis with only 2 marked NTs do not differ fundamentally from those that apply when comparing the sequences using all 4 marked NTs * .
- Sequence comparison in mutation analysis and SNP analysis with 4NTs * See Example 1B).
- the basic principles of the sequencing reaction in gene expression analysis correspond to those of the sequencing reaction of long NSKs (Fig. 7).
- the basic principles for carrying out a reaction cycle (the choice of the NT * structure, the polymerase, the reaction conditions for the NT * incorporation reaction and the cleavage reaction) and for the detection of the signals from built-in NT * correspond to those in the process for sequencing long NSKs.
- the main differences between the two methods lie in the selection and preparation of materials and the processing of the data obtained.
- Gene products can come from various biological objects, e.g. of individual cells, cell populations, a tissue or of entire organisms.
- Biological fluids such as blood, sputum or cerebrospinal fluid can also serve as a source of the gene products.
- the methods for obtaining the gene products from the various biological objects can be found, for example, in the following literature sources: "Molecular cloning” 1989, Ed. Maniatis, Cold Spring Harbor Laboratory, “Method in Enzymology” 1999, v303, "cDNA library protocols” 1997, Ed. I. G. Cowell, Humana Press Inc.
- Both the entirety of the gene products isolated and a part thereof selected by preselection can be used in the sequencing reaction.
- preselection one can determine the amount of the to be analyzed Reduce gene products ' .
- the preselection can be carried out, for example, by means of molecular biological methods such as, for example, PCR amplification, gel separation or hybridization with other nucleic acid chains ("Molecular cloning” 1989, Ed. Maniatis, Cold Spring Harbor Laboratory, “Method in Enzymology” 1999, v303, "cDNA library protocols "1997, Ed. IG Cowell, Humana Press Inc.)
- the entirety of the gene products is preferably selected as the starting material.
- the aim of the preparation of the material is to form extensible gene product-primer complexes from the starting material that are bound to the surface. Whereby only a maximum of one primer should bind per gene product.
- PBS Priming site
- a primer binding site is a sequence section which is intended to enable selective binding of the primer to the gene product.
- Sections in the nucleic acid sequence that naturally occur in the sequences to be analyzed can serve as primer binding sites (e.g. polyA stretches in mRNA).
- a primer binding site can also be introduced into the gene product (Molecular cloning "1989, Ed. Maniatis, Cold Spring Harbor Laboratory,” Method in Enzymology “1999, v303,” cDNA library protocols "1997, Ed. IG Cowell, Humana Press Inc. ).
- primer binding site that is as uniform as possible in all gene products. Then primers with a uniform structure can be used in the reaction.
- the composition of the primer binding site is not limits. Their length is preferably between 10 and 100 NTs.
- the primer binding site can carry a functional group, for example to bind the gene product to the surface. This functional group can be, for example, a biotin or digoxigenin group.
- nucleotide tailing of antisense cDNA fragments is described as an example of the introduction of a primer binding site into the gene products.
- cDNAs are synthesized from mRNAs.
- the result is a population of cDNA molecules that represent a copy of the mRNA population, so-called antisense cDNA.
- antisense cDNA Molecular cloning "1989, Ed. Maniatis, Cold Spring Harbor Laboratory,” Method in Enzymology “1999, v303,” cDNA library protocols "1997, Ed. IG Cowell, Humana Press Inc.).
- a terminal deoxynucleotide transferase one can do several ( for example between 10 and 20) attach nucleoside monophosphates to the 3 'end of this antisense cDNA, for example several adenosine monophosphates (called ((dA) n-tail).
- the resulting fragment is used to bind the primer, in this example one (dT ) n-primers used ("Molecular cloning” 1989 J. Sambrook et al. Cold Spring Harbor Laborotary Press, "Method in Enzymology” 1999 v.303, pp. 37-38).
- Primer for the secretion reaction This has the function of enabling starting at a single point in the gene product. It preferably binds to the primer binding site in the gene product.
- the composition and the length of the primer are not restricted. In addition to the start function, the primer can also perform other functions, such as creating a connection between the gene product-primer complexes and the reaction surface. Primers should be adapted to the length and composition of the primer binding site so that the primer enables the sequencing reaction to be started with the respective polymerase.
- the length of the primer is preferably between 6 and 100 NTs, optimally between 15 and 30 NTs.
- the primer can carry a functional group which is used, for example, to bind the primer to the surface, for example such a functional group is a biotin group (see section Immobilization). It should not interfere with sequencing.
- a functional group is a biotin group (see section Immobilization). It should not interfere with sequencing.
- the synthesis of such a primer can be carried out, for example, with the 380 A Applied Biosystems DNA synthesizer or can be created as a custom synthesis from a commercial provider, for example MWG-Biotech GmbH, Germany.
- primers can also be used, a defined primer set, or a primer mixture.
- the primer Before hybridization, the primer can be fixed to the fragments to be analyzed on the surface using various techniques or synthesized directly on the surface, for example according to (McGall et al. US Patent 5412087, Barrett et al. US Patent 5482867, Mirzabekov et al. US Patent 5981734, "Microarray biochip technology” 2000 M. Schena Eaton Publishing, “DNA Microarrays” 1999 M. Schena Oxford University Press, Fodor et al. Science 1991 v.285 p.767, Timofeev et al. Nucleic Acid Research (NAR) 1996 , v.24 p.3142, Ghosh et al. NAR 1987 v.15 p.5353, Gingeras et al. NAR 1987 v.15 p.5373, Maskos et al. NAR 1992 v.20 p.1679).
- the primers are on the surface at a density between 10 to 100 microns per 100 2, 100 to 10,000 per 100 micron 2, bound 10,000 to 1,000,000 per lOO ⁇ m 2 or greater than 1,000,000 per 100 microns. 2
- the primer or mixture of primers is incubated with gene products under hybridization conditions that selectively bind it to the primer binding site of each gene product.
- This primer hybridization (annealing) can take place before (1), during (2) or after (3) the binding of the gene products to the surface respectively.
- gene products are present as double-stranded nucleic acids, they are denatured by heat before hybridization ("Molecular cloning" 1989 J.Sambrook et al. Cold Spring Harbor Laborotary Press).
- the optimization of the hybridization conditions depends on the exact structure of the primer binding site and the primer and can be done according to Rychlik et al. (NAR 1990 v.18 p.6409). In the following, these hybridization conditions are referred to as standardized hybridization conditions.
- an oligo-dT primer can be used.
- a primer mixture consisting of 12 different primers with the following general structure 5 '(K) n MN3' can also be used. Where (n) is between 10 and 50, preferably between 20 and 30.
- K stands for dT or dU
- M and “N” each stand for dA, dT or dU, dC, dG (e.g.
- the aim of the fixation is to fix gene product-primer complexes on a suitable flat surface in such a way that a cyclic enzymatic sequencing reaction can take place. This can be done, for example, by binding the primer (see above) or the gene product to the surface.
- the order of the steps in binding gene product-primer complexes can be variable:
- the gene product-primer complexes can first be formed in a solution by hybridization (annealing) and then be bound to the surface.
- Primers can first be bound to a surface and gene products can then be hybridized to the bound primers, producing gene product-primer complexes (gene products indirectly bound to the surface)
- the gene products can first be bound to the surface (gene products bound directly to the surface) and in the subsequent step the primers can be hybridized to the bound gene products, producing gene product-primer complexes.
- the gene products can therefore be immobilized on the surface by direct or indirect binding.
- the surface and the reaction surface are to be understood as equivalent terms, unless explicitly referred to another meaning.
- the surface of a solid phase of any material serves as the reaction surface. This material is preferably inert towards enzymatic reactions and does not cause any interference with the detection. Silicon, glass, ceramics, plastics (e.g. polycarbonates or polystyrenes), metal (gold, silver, or aluminum) or any other material that meets these functional requirements can be used.
- the surface is preferably not deformable, since otherwise the signals are likely to be distorted upon repeated detection.
- this gel can be, for example, an agarose or polyacrylamide gel.
- the gel is preferably freely passable for molecules with a molecular mass below 5000 Da (for example, a 1 to 2% agarose gel or 5 to 15% polyacrylamide gel can be used).
- Such a gel surface has the advantage over other solid surfaces that there is a significantly lower non-specific binding of NT * s to the surface.
- Primer complexes on the surface make it possible to detect the fluorescence signals from built-in NTs * .
- the signals from free NTs * are not detected because they do not bind to the material of the gel and are therefore not immobilized.
- the gel is preferably attached to a solid surface.
- This solid base can be silicone, glass, ceramic, plastic (for example polycarbonate or polystyrene), metal (gold, silver or aluminum) or any other material.
- the thickness of the gel is preferably not more than 0.1 mm. However, the gel thickness is preferably greater than the simple depth of field of the lens, so that NTs * bound non-specifically to the solid base do not get into the focal plane and are therefore detected. If the depth of focus is 0.3 ⁇ m, for example, the gel thickness is preferably between 1 ⁇ m and 100 ⁇ m.
- the surface can be produced as a continuous surface or as a discontinuous surface composed of individual small components (eg agarose beads). The reaction surface must be large enough to be able to bind the necessary number of gene products with the appropriate density. The reaction surface should preferably not be larger than 20 cm 2 .
- the different cycle steps require an exchange of the different reaction solutions above the surface.
- the reaction surface is preferably part of a reaction vessel.
- the reaction vessel is in turn preferably part of a reaction apparatus with a flow device.
- the flow device enables the solutions in the reaction vessel to be exchanged.
- the exchange can take place with a pump device controlled by a computer or manually. It is important that the surface does not dry out.
- the volume of the reaction vessel is preferably less than 50 ⁇ l. Ideally, its volume is less than 1 ⁇ l.
- the gene product-primer complexes are fixed on the surface via the gene products, this can for example by binding the gene products at one of the two chain ends. This can be achieved by appropriate covalent, affine or other bonds.
- immobilization of nucleic acids are known (McGall et al. US Patent 5412087, Nikiforov et al. US Patent 5610287, Barrett et al. US Patent 5482867, Mirzabekov et al. US Patent 5981734, "Microarray biochip technology” 2000 M. Schena Eaton Publishing, “DNA Microarrays” 1999 M. Schena Oxford University Press, Rasmussen et al.
- the fixation can also be achieved by a non-specific binding, for example by drying out the sample containing gene products on the flat surface.
- the gene products are bound to the surface at a density between 10 and 100 microns per 100 2, 100 to 10,000 per 100 micron 2, 10,000 to 1000,000 per lOO ⁇ m. 2
- the density of extensible gene product-primer complexes required for the detection is approx. 10 to 100 per 100 ⁇ m 2 . It can be achieved before, during or after hybridization of the primers to the gene products.
- the gene product-primer complexes are bound via biotin-avidin or biotin-streptavidin binding.
- Avidin or streptavidin is covalently bound on the surface, the 5 'end of the primer is modified with biotin. After hybridization of the modified primers with the gene products (in solution), these are fixed on the surface coated with avidin / streptavidin.
- concentration of the gene product primer labeled with biotin Complexes as well as the time of incubation of this solution with the surface are chosen in such a way that a density suitable for sequencing is achieved.
- the primers suitable for the sequencing reaction are fixed on the surface using suitable methods before the sequencing reaction (see above).
- the single-stranded gene products, each with one primer binding site per gene product molecule, are thus incubated under hybridization conditions (annealing). They bind to the fixed primers and are thus bound to the surface (indirect binding).
- the concentration of the single-stranded gene products and the hybridization parameters eg temperature, time, buffer
- the concentration of the single-stranded gene products and the hybridization parameters are selected so that a density of approximately 10 to 100 gene product-primer complexes capable of extension per 100 ⁇ m 2 is achieved which is suitable for sequencing.
- unbound gene products are removed by a washing step.
- a surface with a high primer density is preferred, for example approximately 1,000,000 primers per 100 ⁇ m 2 or even higher, since the desired density of gene product-primer complexes is achieved more quickly, the gene products binding only to a part of the primers ,
- the gene products are bound directly to the surface (see above) and then incubated with primers under hybridization conditions.
- a density of approx. 10 to 100 gene products per 100 ⁇ m 2 attempts will be made to provide all available gene products with a primer and to make them available for the sequencing reaction. This can be achieved by high primer concentration, for example 1 to 100 mmol / 1.
- high primer concentration for example 1 to 100 mmol / 1.
- the density of the gene product-primer complexes required for optical detection can be achieved during the primer hybridization.
- the hybridization conditions e.g. temperature, time, buffer, Primer concentrate
- a blocking solution is preferably applied to the surface before step (a) in each cycle, which serves to avoid non-specific adsorption of NTs * on the surface.
- an albumin solution (BSA) with a pH between 8 and 10 fulfills these conditions for a blocking solution.
- RNA or DNA The type of immobilized nucleic acid (RNA or DNA) used plays a decisive role in the choice of polymerase:
- RNA-dependent DNA polymerases can be used, e.g. AMV reverse transcriptase (Sigma), M-MLV reverse transcriptase (Sigma), HIV reverse transcriptase without RNAse activity. All reverse transcriptases must be largely free of RNAse activity ("Molecular cloning" 1989, Ed. Maniatis, Cold Spring Harbor Laboratory).
- DNA is used as a gene product (eg cDNA)
- all DNA-dependent DNA polyases without 3 "-5" exonuclease activity are suitable in principle as polymerases.
- T7 polymerase of the "Sequenase Version 2" type (Amersham Pharmacia Biotech), Klenow fragment of DNA polymerase I without 3 '-5' exonuclease activity (Amersham Pharmacia Biotech), polymerase beta of various origins (animal cell DNA polymerases "1983, Fry M., CRC Press Inc., commercially available from Chimerx) thermostable polymerases such as Taq polymerase (GibcoBRL), proHA DNA polymerase (Eurogentec). detection:
- the detection comprises the following phases:
- each detection step running as a scanning process and comprising the following operations: a) adjusting the position of the objective (X, Y axis), b) adjusting the focal plane (Z axis), c) detecting the signals of individual people Molecules, assignment of the signal to NT * and assignment of the signal to the respective gene product, d) shift to the next position on the surface.
- the signals from NTs * built into the strands complementary to the gene products are registered by scanning the surface.
- the scanning process is carried out as with the sequencing of long NSKs.
- the lens is gradually moved over the surface, so that a two-dimensional image (2D image) is created from each surface position.
- the exact number depends e.g. on the relative presence of the gene products in the approach and on the desired accuracy of the analysis.
- the number of gene products analyzed is preferably between 1000 and 10,000,000. For highly expressed genes, the number of gene products analyzed can be low, e.g. 1000 to 10,000. When analyzing poorly expressed genes, it must be increased, e.g. to 100,000 or even further.
- the data obtained are compared using a program with known gene sequences.
- a program can e.g. based on a BLAST or FASTA algorithm ("Introduction to computational Biology” 1995 M.S. Waterman Chapman & Hall).
- the choice of the method for material preparation determines, among other things, in which sections of the gene products the sequences are determined and to which strand (sense or antisense) they belong. For example, are determined when using the polyA stretches as primer binding sites in mRNA sequences from NTRs (non-translating regions). When using the method with antisense cDNA as a template, the sequences determined come, inter alia, from the protein-coding regions of the gene products.
- the gene expression is only determined qualitatively. Only the fact that certain genes are expressed is important.
- a quantitative determination of the relationships between individual gene products in the batch is of interest. It is known that the activity of a gene in a cell is represented by a population of identical mRNA molecules. Many genes are active in a cell at the same time and are expressed to different extents, which leads to the presence of many different mRNA populations with different strengths. The quantitative analysis of gene expression is discussed in more detail below:
- the number of gene products analyzed is preferably between 1000 and 10,000,000.
- the exact number of gene products to be analyzed depends on the task. It can be low for highly expressed genes, e.g. 1000 to 10,000. When analyzing poorly expressed genes, it must be increased, e.g. to 100,000 or higher.
- RNA control samples are used in the sequence analysis of the mRNA samples, and corresponding DNA control samples are used in the analysis of the cDNA samples. These samples are preferably carried along in all steps. she can be added after mRNA isolation, for example. In general, the control samples are prepared for sequence analysis in the same way as the gene products to be analyzed.
- control sequences are added to the gene products to be analyzed in known, fixed concentrations. Concentrations of the control samples can be different, these concentrations are preferably between 0.01% and 10% of the total concentration of the sample to be analyzed (100%). For example, if the concentration of the mRNA is 10ng / ⁇ l, the concentrations of control samples are between 1pg / ⁇ l and 1ng / ⁇ l.
- the change in the level of expression of a particular gene can occur as a result of the change in the transcription rate of that gene or as a result of a global change in gene expression in the cell.
- the expression of the so-called "house-keeping genes" can be analyzed to observe the metabolic states in the cell. In the absence of important metabolites, for example, the general level of expression in the cell is low, so that constitutively expressed genes also have a low level of expression.
- all constitutively expressed genes can serve as "house-keeping genes". Examples include the transferrin receptor gene or the beta actin gene. The expression of these house-keeping genes thus serves as
- House-keeping genes are preferably part of the analysis program for gene expression.
- all four NTs * used in the reaction are labeled with fluorescent dyes.
- the number of NTs determined for each sequence from a gene product is between 5 and 100, ideally between 20 and 50. These sequences are compared with a sequence with known sequences in gene databases and assigned to corresponding genes. Such a program can e.g. based on the BLAST or FASTA algorithm ("Introduction to computational Biology" 1995 M.S. Waterman Chapman & Hall).
- a cycle has the following steps: a) adding a solution with labeled nucleotides (NTs * ) and polymerase to immobilized nucleic acid chains, b) incubating the immobilized nucleic acid chains with this solution under conditions which are suitable for extending the complementary strands by one NT, c) Washing d) Detection of signals from individual molecules e) Removal of the label from the built-in nucleotides and that leading to the termination Substituents, f) washing.
- NTs * labeled nucleotides
- 2 modified NTs * and 2 unmodified NTs are used for the analysis of the sequences.
- This embodiment is based on the principle that a sequence of 2 signals (marked NT * s) can contain enough information to identify a sequence.
- the determined sequence is compared with the reference sequence and assigned to a specific position, for example:
- the ascertained sequences are preferably assigned to the reference sequence with the aid of a program.
- a program can e.g. based on the BLAST or FASTA algorithm ("Introduction to computational Biology” 1995 M.S. Waterman Chapman & Hall).
- the gene products are prepared for sequencing as described above and sequenced using the 2NTs * / 2NTs method. Sequence sections are obtained from gene products, each sequence representing a sequence of 2NTs * .
- Known gene sequences serve as reference sequences. To be clear To enable the determined sequence to be assigned to a known reference sequence, this sequence must be long enough.
- the length of the sequences determined is preferably more than 20 NT * s. Since 2 marked NTs * represent only part of the sequence, the total length of the complementary strand synthesized is approximately twice as long as the sequence of the detected NTs * (with 20 detected NTs * , the total length is, for example, an average of 40 NTs).
- NTs * marked with a fluorescent dye appear as semiterminators in the present invention, ie termination occurs only when modified NTs * are available , unmodified NTs must be added to the reaction in an additional step in each cycle. The exact position of this step in the cycle can vary. It is important that the marked NTs * and the unmodified NTs are used separately.
- a cycle in this embodiment may look as follows, for example:
- the concentration of the NTs is preferably below 1 M, ideally below 10 ⁇ M.
- a special embodiment of the method is the analysis of single nucleotide polymorphisms with sequence-specific primers.
- Primer - In order to clarify the Indian idea, the following terms are distinguished in this example: a) In the present case, a “primer” is generally understood to mean a population of primer molecules with a uniform structure Understand primer molecules that have different structures. C) A “primer molecule” means a single oligonucleotide molecule. D) ⁇ Several primer molecules ⁇ j ⁇ mean several individual oligonucleotide molecules; they can have a uniform or different structure.
- SNP position - a position in NSK that is checked for the presence or absence of SNP.
- Target Sequence Part of an overall sequence that is sequenced / determined using a specific primer in the sequencing reaction.
- a Entire sequence can contain several target sequences.
- a target sequence is long enough to ensure that it is very likely to be positioned within the overall sequence.
- Target sequences can contain one or more SNP sites, for example.
- Detection Sequence Part of the target sequence that is used to assign this target sequence to the overall sequence.
- SNP analysis For SNP analysis, several potential SNP positions in the reference sequence are selected which are examined in an NSK to be analyzed. Accordingly, different, sequence-specific primers are provided for these positions. These primers can form a standardized primer set for SNP analysis for a specific question and can be used as a kit for the relevant analyzes.
- the preparation of the material to be analyzed has the aim according to the invention of creating a population of relatively small, between 30 and 2000 NT long, single-stranded nucleic acid chain fragments (NSKFs) from one or more long nucleic acid chains (overall sequence) form.
- NNKFs single-stranded nucleic acid chain fragments
- NSKF molecules happen to be random on a flat surface with a density between 10 and 1,000,000 per 100 ⁇ m 2 , preferably 10 and 100 NSKFs per 100 ⁇ m 2 , 100 to 10,000 per 100 ⁇ m 2 or 10,000 to 1,000,000 per 100 ⁇ m 2 immobilized.
- Primers are hybridized to the NSKFs bound on the surface, so that the density of the NSKF primer complexes capable of extension is approximately 10-100 per 100 ⁇ m 2 . After hybridization, unbound primers are removed and the sequencing reaction started.
- Sections of the overall sequence examine what reduces the amount of irrelevant information and shortens the analysis time.
- SNPs single nucleotide polymorphisms
- each SNP site to be examined either occupies the next position in the 3 g direction from the primer or within 2 to 100, preferably 2 to 50, ideally 2 to 20 positions in the 3 ⁇ direction from the primer.
- the SNP site is thus within the target sequence that is determined during the sequencing reaction.
- Several SNP sites are preferably analyzed simultaneously, so that several specific primers have to be used.
- the primers are preferably selected so that they have annealing temperatures that are as uniform as possible, i.e. Differences between melting temperatures of individual primer populations are, for example, within a range of approximately 4 degrees, better within 2 degrees, even better within 1 degree.
- NNKFs Short nucleic acid chain fragments
- NSKF molecules are arranged in a random order immobilized on the surface.
- a cyclic sequencing reaction is carried out, a target sequence being determined for each NSKF molecule involved in the reaction.
- the sequencing reaction takes place on many molecules simultaneously.
- the determined target sequences contain information about the affiliation to a certain section in the overall sequence and about the SNP in this section for the sample to be examined.
- the length of the target sequences and thus the number of cycles should be selected so that an identification of the sequences can be guaranteed.
- the determined target sequences are compared with the reference sequence and assigned by sequence matching. If the target sequence is sufficiently long, it is very likely that it can be assigned to a specific position in the reference sequence. For example, a sequence of 10 NTs can form more than 10 ⁇ different combinations and can therefore be identified with a high probability in an NSC of only 100,000 NT. After assigning the determined target sequence to the specific position within the reference sequence, differences in the sequences, the SNPs, become visible.
- the already known number of primers, their composition and an already known sequence section of the reference sequence which adjoins the primer binding site are used to identify the target sequences used.
- the target sequences determined are analyzed according to their affiliation with the primers, only the sequences close to the primer binding site having to be taken into account. If, for example, only 1000 primers are used, fewer than 10 NTs of the determined target sequences are sufficient to enable an assignment to the corresponding primers.
- the sample to be analyzed usually contains several identical total sequence molecules, e.g. multiple copies of genomic DNA from cells of a tissue or multiple identical mRNA populations from cells of a tissue.
- the method according to the invention can be used to analyze known SNP positions as well as to determine new SNP positions. Any position in the NSK can appear as a potential SNP position. The selection depends on the question, e.g. SNP analysis in genes whose products are associated with certain diseases, or SNP analysis in conserved, coding sections of the genes that code for membrane receptors, or checking known SNP sites in regulatory sequences of genes that are important for cell division.
- An SNP site to be analyzed lies within a target sequence that is determined during the sequencing reaction. You can identify multiple SNP locations within a target sequence. On the other hand, one can also select several target sequences, for example within one gene. It is important that the target sequences are at a sufficient distance from one another in the overall sequence. This distance is necessary so that only one sequence-specific primer hybridizes per NSKF, and it depends on the average NSKF length: the shorter the NSKFs, the closer the target sequences can be. With an appropriate choice of primers, the SNP sites can do both Strands of a double-stranded nucleic acid chain are analyzed.
- the method also offers the possibility, for example, of controlling several SNP positions from many individuals (as a sample of a population) at the same time. This can e.g. the SNP profile of a population are examined.
- Sequencing reaction on a single NSKF molecule is made possible by a primer molecule.
- a sequence-specific primer is necessary in order to be able to carry out the sequencing reaction in each case on a specific / specific target sequence within the overall sequence.
- the sequence-specific primer to be used for the analysis of an SNP site or a target sequence represents a population of primer molecules with an identical structure. Several different primer populations are necessary for the analysis of several different target sequences.
- sequence-specific primers By using sequence-specific primers, only the relevant sequence segments, the target sequences, are analyzed. In the method according to the invention, the length of the sequences to be sequenced is kept as short as possible so that the speed of the analysis increases.
- a sequence-specific primer binds to a specific primer binding site in the sequence to be analyzed, PBS.
- the composition and length of the primers are optimized for each potential SNP site or target sequence. Examples of optimization steps are in Rychlik et al. NAR 1990 v.18 p.6409 shown.
- the SNP position to be analyzed should either be the same after the 3 'end of the primer or within the next 2 to 50 NTs, preferably 2 to 20 NTs.
- the positioning (the choice of the sequence length and the composition) of the PBS to the SNP site should take place in such a way that the different PBS sequences and the corresponding primer sequences have as similar as possible "annealing temperatures" in order to achieve the most uniform hybridization conditions possible tie. This can be done, for example, by changing the PBS position in relation to the respective SNP site to be analyzed or by changing the length of the primer sequence (Rychlik et al. NAR 1990 v.18 p.6409).
- the minimum distance between primers that bind to the same strand in the overall sequence should not be less than the average NSKF length.
- Primers can be used for both strands of a double-strand NSK. This makes it possible, for example, to record SNP points that are close to one another, or to check an SNP point in both lines.
- the length of the primer is preferably between 6 and 100 NTs, optimally between 10-30 or 30-40 or 40-50. Primers with different lengths can be used for different SNP sites or target sequences.
- primers are hybridized according to the invention in a hybridization solution to the NSKFs immobilized on the reaction surface (annealing reaction).
- the NSKF primer complexes are bound to the surface exclusively via the NSKFs (direct binding of NSKFs to the Surface), the NSKF molecules provided being bound to the flat surface in a random arrangement.
- the NSKFs are preferably immobilized at one of the two chain ends (see above).
- the immobilization can also be achieved by a non-specific binding, for example by drying out the sample containing NSKFs on the flat surface.
- the density of the immobilization can be between 10 and 100, 100 and 10,000, 10,000 and 1,000,000 NSKFs per 100 ⁇ m 2 .
- the bound NSKFs and the primers are incubated under stringent hybridization conditions which allow the primers to be linked as selectively as possible (annealing) to the corresponding primer binding sites of the NSKFs.
- Optimal hybridization conditions depend on the exact structure of the primer binding sites and the respective primer structures and can be determined, for example, according to Rychlik et al. Calculate NAR 1990 v.18 p.6409.
- the primers are preferably a mixture of primers.
- concentrations of individual sequence-specific primers are, for example, between 10 pmol / l and 1 mmol / 1, preferably between 0.1 ⁇ mol / l and 10 ⁇ mol / l.
- the total concentration of primers in the primer mixture is preferably between 1 mol / 1 and 10 mmol / l.
- the ratio between individual primer populations can vary. Primers can be added in a significant excess over the immobilized NSKFs, so that the hybridization time is short.
- the density of NSKF primer complexes which can be extended is necessary for the detection to be approximately 10 to 100 per 100 ⁇ m 2 . It can be achieved before, during or after hybridization of the primers.
- the immobilization conditions can be selected such that the NSKFs are bound in a density of approx. 10 to 1000 molecules per 100 ⁇ m 2 . NSKFs thus determine the density of the NSKF primer complexes.
- the density of the immobilized NSKFs can be substantially higher than 1000 NSKFs per 100 ⁇ m 2 , for example 1,000,000 per 100 ⁇ m 2 .
- the density of the NSKF primer complexes required for optical detection is achieved during the primer hybridization.
- the hybridization conditions eg temperature, time, buffer
- the hybridization (annealing) of primers to the NSKFs can lead to a higher than optimal density of NSKF-primer complexes.
- part of the sample containing NSKFs is used to determine the optimal density.
- This part is immobilized on a reaction surface, the primers are hybridized to the NSKFs and the resulting NSKF-primer complexes are marked by the incorporation of fluorescent dye-bearing NT * s (eg Cy3-dCTP, Amersham Pharmacia Biotech).
- the density determined can be used to calculate the dilution or concentration of the original sample that may be necessary for the final sequencing approach (the hybridization conditions are maintained).
- necessary changes in the hybridization conditions can be calculated from this, for example a shortening of the hybridization time, the NSKF immobilization density remaining constant.
- the quantity ratio between primer populations can be different or the same size. A higher primer concentration makes it possible to bind certain, for example rarer, sequences with a higher probability in a certain period of time.
- the great advantage of the described process arrangement compared to a process arrangement with sequence-specific primers immobilized on a surface and a subsequent hybridization of samples to these primers is the significant reduction in the time for hybridization (annealing) between the sequence-specific primers and the samples to be analyzed on the reaction surface.
- NNKs long nucleic acid sequences
- the sequencing and reconstruction of long nucleic acid sequences is based on the shotgun principle.
- the sequence of a long piece of DNA is determined by sequencing small fragments (NSKFs) and a subsequent reconstruction.
- a uniform primer binding site (PBS) is coupled to the 3 'end of the NSKFs and a uniform primer binds to this PBS.
- PBS uniform primer binding site
- a double-stranded oligonucleotide complex (3a) which has, for example, a modification on both strands (3b), is located on the double-stranded NSKFs (3c).
- NSKFs double-stranded NSKFs with uniform PBS (3d) are created.
- PBS uniform primer binding site
- NTs are coupled to the 3 'end of the single-strand NSKFs (a so-called "tailing").
- a uniform NT By using a uniform NT, a uniform PBS is created.
- a gel layer (2) adheres to a solid base (1), e.g. a polyacrylamide gel (Fig. 5a), or many gel beads (5), e.g. Agarose beads (Fig. 5b).
- NSKFs (4) are bound to the surface of the gel.
- the NSKFs have a functional group, e.g. Biotin, and are bound to the gel via streptavidin or avidin (3).
- a gel-like reaction surface (1) is attached to a solid base (2) which is permeable to the excitation and fluorescent light. Together they form the lid of the flow cell.
- the liquids in the flow cell can be exchanged in a controlled manner, the flow cell forming a flow device together with the reservoir (3), pump (4) and valve (5).
- NSKF primer complexes are bound to the reaction surface (not shown here).
- the signals of the installed NT * s are detected with the detection apparatus (6).
- the analysis is based on the sequencing of short sections of mRNA.
- mRNA - the mRNA population to be analyzed in this example consisting of two different mRNA molecule populations (thin and thick strips represent mRNA molecules)
- a wide-field optical detection system is shown. After installing marked NT * s, the surface (7) scanned, the fluorescence signals being detected by individual dye molecules coupled to the NTs.
- Fig. 8a Schematic representation of a portion of the reaction surface (gray) that is scanned.
- the circles each correspond to the recording of a 2D image and represent the areas from which the fluorescence signals are detected.
- Several signals (for example 100 to 10,000) of individual molecules are registered simultaneously per exposure.
- the reaction surface is scanned in each cycle, several images being taken from different locations on the surface during the scanning process. Up to several million signals can be recorded by built-in NT * s.
- the high degree of parallelism is the basis for the speed of the process.
- Fig. 8b A recording (a 2D image) with signals from individual, built-in NT * s.
- Fig. 8c detail from Figure 8b.
- the section shows signals from four built-in NTs. Each signal has characteristic properties of the single molecule signals (see description) and can be identified on the basis of these (preferably with the aid of a computer program). The corresponding X, Y coordinates are assigned to each of the identified signals.
Landscapes
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Health & Medical Sciences (AREA)
- Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Immunology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Biophysics (AREA)
- Analytical Chemistry (AREA)
- Physics & Mathematics (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2002337030A AU2002337030A1 (en) | 2001-08-29 | 2002-08-28 | Method for analyzing nucleic acid sequences and gene expression |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10142256 | 2001-08-29 | ||
DE10142256.3 | 2001-08-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2003020968A2 true WO2003020968A2 (fr) | 2003-03-13 |
WO2003020968A3 WO2003020968A3 (fr) | 2003-12-31 |
Family
ID=7696956
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2002/009614 WO2003020968A2 (fr) | 2001-08-29 | 2002-08-28 | Procede d'analyse de sequences d'acides nucleiques et de l'expression de genes |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU2002337030A1 (fr) |
DE (1) | DE10239504A1 (fr) |
WO (1) | WO2003020968A2 (fr) |
Cited By (60)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005044836A2 (fr) | 2003-11-05 | 2005-05-19 | Genovoxx Gmbh | Composes nucleotidiques macromoleculaires et leurs procedes d'utilisation |
WO2005080605A3 (fr) * | 2004-02-19 | 2005-11-03 | Helicos Biosciences Corp | Procedes et kits pour analyser des sequences de polynucleotides |
WO2005116262A1 (fr) * | 2004-05-24 | 2005-12-08 | Helicos Biosciences Corporation | Procedes et dispositifs destines au sequençage d'acides nucleique |
US7169560B2 (en) | 2003-11-12 | 2007-01-30 | Helicos Biosciences Corporation | Short cycle methods for sequencing polynucleotides |
WO2006007207A3 (fr) * | 2004-05-25 | 2007-02-22 | Helicos Biosciences Corp | Procedes et dispositifs pour la determination de sequence d'acides nucleiques |
JP2007509629A (ja) * | 2003-10-29 | 2007-04-19 | バイオアレイ ソリューションズ リミテッド | 二本鎖dnaの切断による複合核酸分析 |
US7220549B2 (en) | 2004-12-30 | 2007-05-22 | Helicos Biosciences Corporation | Stabilizing a nucleic acid for nucleic acid sequencing |
WO2007100637A3 (fr) * | 2006-02-22 | 2007-11-15 | Helicos Biosciences Corp | Procédés de détection de mutations |
US7297518B2 (en) | 2001-03-12 | 2007-11-20 | California Institute Of Technology | Methods and apparatus for analyzing polynucleotide sequences by asynchronous base extension |
US7397546B2 (en) | 2006-03-08 | 2008-07-08 | Helicos Biosciences Corporation | Systems and methods for reducing detected intensity non-uniformity in a laser beam |
US7427673B2 (en) | 2001-12-04 | 2008-09-23 | Illumina Cambridge Limited | Labelled nucleotides |
US7462449B2 (en) | 1999-06-28 | 2008-12-09 | California Institute Of Technology | Methods and apparatuses for analyzing polynucleotide sequences |
US7476734B2 (en) | 2005-12-06 | 2009-01-13 | Helicos Biosciences Corporation | Nucleotide analogs |
US7482120B2 (en) | 2005-01-28 | 2009-01-27 | Helicos Biosciences Corporation | Methods and compositions for improving fidelity in a nucleic acid synthesis reaction |
US7501245B2 (en) | 1999-06-28 | 2009-03-10 | Helicos Biosciences Corp. | Methods and apparatuses for analyzing polynucleotide sequences |
US7645596B2 (en) | 1998-05-01 | 2010-01-12 | Arizona Board Of Regents | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US7666593B2 (en) | 2005-08-26 | 2010-02-23 | Helicos Biosciences Corporation | Single molecule sequencing of captured nucleic acids |
WO2011057061A1 (fr) | 2009-11-06 | 2011-05-12 | The Board Of Trustees Of The Leland Stanford Junior University | Diagnostic non invasif de rejet de greffe chez des patients ayant subi une greffe d'organes |
EP2366801A1 (fr) | 2006-06-14 | 2011-09-21 | Verinata Health, Inc | Procédés pour le diagnostic d'anomalies foetales |
WO2014026032A2 (fr) | 2012-08-08 | 2014-02-13 | Apprise Bio, Inc. | Accroissement de la plage dynamique pour identifier de multiples épitopes dans des cellules |
WO2015070086A1 (fr) | 2013-11-07 | 2015-05-14 | The Board Of Trustees Of The Leland Stanford Junior University | Utilisation d'acides nucléiques acellulaires pour l'analyse du microbiome chez l'homme et de ses composants |
US9035035B2 (en) | 2004-11-05 | 2015-05-19 | Genovoxx Gmbh | Macromolecular nucleotide compounds and methods for using the same |
US9051612B2 (en) | 2006-09-28 | 2015-06-09 | Illumina, Inc. | Compositions and methods for nucleotide sequencing |
WO2015089243A1 (fr) | 2013-12-11 | 2015-06-18 | The Regents For Of The University Of California | Procédés de marquage de fragments d'adn pour la recontruction de liaison physique et de phase |
US9096898B2 (en) | 1998-05-01 | 2015-08-04 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US9411930B2 (en) | 2013-02-01 | 2016-08-09 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US9605310B2 (en) | 2001-12-04 | 2017-03-28 | Illumina Cambridge Limited | Labelled nucleotides |
US9637777B2 (en) | 2003-10-28 | 2017-05-02 | Bioarray Solutions, Ltd. | Optimization of gene expression analysis using immobilized capture probes |
US9670539B2 (en) | 2007-10-19 | 2017-06-06 | The Trustees Of Columbia University In The City Of New York | Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis |
US20170166961A1 (en) | 2013-03-15 | 2017-06-15 | Illumina Cambridge Limited | Modified nucleosides or nucleotides |
US9689032B2 (en) | 2011-04-01 | 2017-06-27 | Centrillion Technology Holdings Corporation | Methods and systems for sequencing long nucleic acids |
US9708358B2 (en) | 2000-10-06 | 2017-07-18 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US9709559B2 (en) | 2000-06-21 | 2017-07-18 | Bioarray Solutions, Ltd. | Multianalyte molecular analysis using application-specific random particle arrays |
US9715573B2 (en) | 2015-02-17 | 2017-07-25 | Dovetail Genomics, Llc | Nucleic acid sequence assembly |
US9719139B2 (en) | 2000-10-06 | 2017-08-01 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
WO2017197300A1 (fr) | 2016-05-13 | 2017-11-16 | Dovetail Genomics Llc | Récupération d'informations de liaison de longue portée à partir d'échantillons conservés |
US10072287B2 (en) | 2009-09-10 | 2018-09-11 | Centrillion Technology Holdings Corporation | Methods of targeted sequencing |
US10089437B2 (en) | 2013-02-01 | 2018-10-02 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
WO2018195091A1 (fr) | 2017-04-18 | 2018-10-25 | Dovetail Genomics, Llc | Caractéristiques d'acide nucléique utilisées en tant que guides pour l'assemblage de séquence |
US10144950B2 (en) | 2011-01-31 | 2018-12-04 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
WO2018237209A1 (fr) | 2017-06-21 | 2018-12-27 | Bluedot Llc | Systèmes et procédés d'identification d'acides nucléiques dans un échantillon |
US10174368B2 (en) | 2009-09-10 | 2019-01-08 | Centrillion Technology Holdings Corporation | Methods and systems for sequencing long nucleic acids |
US10260094B2 (en) | 2007-10-19 | 2019-04-16 | The Trustees Of Columbia University In The City Of New York | DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators |
WO2019152543A1 (fr) | 2018-01-31 | 2019-08-08 | Dovetail Genomics, Llc | Préparation d'échantillon pour récupération de liaison d'adn |
US10415081B2 (en) | 2001-10-15 | 2019-09-17 | Bioarray Solutions Ltd. | Multiplexed analysis of polymorphic loci by concurrent interrogation and enzyme-mediated detection |
US10457934B2 (en) | 2015-10-19 | 2019-10-29 | Dovetail Genomics, Llc | Methods for genome assembly, haplotype phasing, and target independent nucleic acid detection |
US10487102B2 (en) | 2002-08-23 | 2019-11-26 | Illumina Cambridge Limited | Labelled nucleotides |
US10513731B2 (en) | 2002-08-23 | 2019-12-24 | Illumina Cambridge Limited | Modified nucleotides |
US10526641B2 (en) | 2014-08-01 | 2020-01-07 | Dovetail Genomics, Llc | Tagging nucleic acids for sequence assembly |
US10648026B2 (en) | 2013-03-15 | 2020-05-12 | The Trustees Of Columbia University In The City Of New York | Raman cluster tagged molecules for biological imaging |
US10722858B2 (en) | 2013-03-15 | 2020-07-28 | Lineage Biosciences, Inc. | Methods and compositions for tagging and analyzing samples |
WO2020264185A1 (fr) | 2019-06-27 | 2020-12-30 | Dovetail Genomics, Llc | Procédés et compositions pour ligature de proximité |
US10975417B2 (en) | 2016-02-23 | 2021-04-13 | Dovetail Genomics, Llc | Generation of phased read-sets for genome assembly and haplotype phasing |
US10995111B2 (en) | 2003-08-22 | 2021-05-04 | Illumina Cambridge Limited | Labelled nucleotides |
WO2021103695A1 (fr) * | 2019-11-25 | 2021-06-03 | 齐鲁工业大学 | Procédé de séquençage ciblé de type flux à extension continue à base unique |
EP3836149A1 (fr) | 2011-11-07 | 2021-06-16 | QIAGEN Redwood City, Inc. | Procédés et systèmes pour l'identification de variants génomiques causals |
US11098353B2 (en) | 2006-12-01 | 2021-08-24 | The Trustees Of Columbia University In The City Of New York | Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators |
US11166996B2 (en) | 2018-12-12 | 2021-11-09 | Flagship Pioneering Innovations V, Inc. | Anellovirus compositions and methods of use |
US11781959B2 (en) | 2017-09-25 | 2023-10-10 | Freenome Holdings, Inc. | Methods and systems for sample extraction |
US11807896B2 (en) | 2015-03-26 | 2023-11-07 | Dovetail Genomics, Llc | Physical linkage preservation in DNA storage |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2290364A1 (fr) | 1996-04-25 | 2011-03-02 | BioArray Solutions Ltd. | Ensemble électrocinétique contrôlé par la lumière de particules de surfaces proches |
US7262063B2 (en) | 2001-06-21 | 2007-08-28 | Bio Array Solutions, Ltd. | Directed assembly of functional heterostructures |
AU2003298655A1 (en) | 2002-11-15 | 2004-06-15 | Bioarray Solutions, Ltd. | Analysis, secure access to, and transmission of array images |
CA2539824C (fr) | 2003-09-22 | 2015-02-03 | Xinwen Wang | Polyelectrolyte immobilise en surface a plusieurs groupes fonctionnels pouvant se lier de maniere covalente a des biomolecules |
US7848889B2 (en) | 2004-08-02 | 2010-12-07 | Bioarray Solutions, Ltd. | Automated analysis of multiplexed probe-target interaction patterns: pattern matching and allele identification |
GB0517097D0 (en) | 2005-08-19 | 2005-09-28 | Solexa Ltd | Modified nucleosides and nucleotides and uses thereof |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5547839A (en) * | 1989-06-07 | 1996-08-20 | Affymax Technologies N.V. | Sequencing of surface immobilized polymers utilizing microflourescence detection |
US5302509A (en) * | 1989-08-14 | 1994-04-12 | Beckman Instruments, Inc. | Method for sequencing polynucleotides |
WO1993005183A1 (fr) * | 1991-09-09 | 1993-03-18 | Baylor College Of Medicine | Procede et dispositif pour la determination rapide du sequençage d'adn ou d'arn au moyen d'une methode de sequençage par addition de base |
GB9208733D0 (en) * | 1992-04-22 | 1992-06-10 | Medical Res Council | Dna sequencing method |
DE69928265T3 (de) * | 1998-07-30 | 2013-11-28 | Illumina Cambridge Ltd. | Matrizen von biomolekülen und ihre verwendung in sequenzierung |
AU2002231934A1 (en) * | 2001-01-30 | 2002-08-12 | Solexa Ltd. | Arrayed polynucleotides and their use in genome analysis |
DE10120797B4 (de) * | 2001-04-27 | 2005-12-22 | Genovoxx Gmbh | Verfahren zur Analyse von Nukleinsäureketten |
-
2002
- 2002-08-28 AU AU2002337030A patent/AU2002337030A1/en not_active Abandoned
- 2002-08-28 WO PCT/EP2002/009614 patent/WO2003020968A2/fr not_active Application Discontinuation
- 2002-08-28 DE DE10239504A patent/DE10239504A1/de not_active Ceased
Cited By (164)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9096898B2 (en) | 1998-05-01 | 2015-08-04 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US9212393B2 (en) | 1998-05-01 | 2015-12-15 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US9458500B2 (en) | 1998-05-01 | 2016-10-04 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US9725764B2 (en) | 1998-05-01 | 2017-08-08 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US7645596B2 (en) | 1998-05-01 | 2010-01-12 | Arizona Board Of Regents | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US9957561B2 (en) | 1998-05-01 | 2018-05-01 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US10208341B2 (en) | 1998-05-01 | 2019-02-19 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US10214774B2 (en) | 1998-05-01 | 2019-02-26 | Life Technologies Corporation | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US7462449B2 (en) | 1999-06-28 | 2008-12-09 | California Institute Of Technology | Methods and apparatuses for analyzing polynucleotide sequences |
US7501245B2 (en) | 1999-06-28 | 2009-03-10 | Helicos Biosciences Corp. | Methods and apparatuses for analyzing polynucleotide sequences |
US9709559B2 (en) | 2000-06-21 | 2017-07-18 | Bioarray Solutions, Ltd. | Multianalyte molecular analysis using application-specific random particle arrays |
US10633700B2 (en) | 2000-10-06 | 2020-04-28 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US9868985B2 (en) | 2000-10-06 | 2018-01-16 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10407458B2 (en) | 2000-10-06 | 2019-09-10 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10457984B2 (en) | 2000-10-06 | 2019-10-29 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10407459B2 (en) | 2000-10-06 | 2019-09-10 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10570446B2 (en) | 2000-10-06 | 2020-02-25 | The Trustee Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10577652B2 (en) | 2000-10-06 | 2020-03-03 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10669577B2 (en) | 2000-10-06 | 2020-06-02 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10435742B2 (en) | 2000-10-06 | 2019-10-08 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10669582B2 (en) | 2000-10-06 | 2020-06-02 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10648028B2 (en) | 2000-10-06 | 2020-05-12 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10662472B2 (en) | 2000-10-06 | 2020-05-26 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US9725480B2 (en) | 2000-10-06 | 2017-08-08 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US9718852B2 (en) | 2000-10-06 | 2017-08-01 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US9719139B2 (en) | 2000-10-06 | 2017-08-01 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US10428380B2 (en) | 2000-10-06 | 2019-10-01 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US9708358B2 (en) | 2000-10-06 | 2017-07-18 | The Trustees Of Columbia University In The City Of New York | Massive parallel method for decoding DNA and RNA |
US7297518B2 (en) | 2001-03-12 | 2007-11-20 | California Institute Of Technology | Methods and apparatus for analyzing polynucleotide sequences by asynchronous base extension |
US10415081B2 (en) | 2001-10-15 | 2019-09-17 | Bioarray Solutions Ltd. | Multiplexed analysis of polymorphic loci by concurrent interrogation and enzyme-mediated detection |
US9605310B2 (en) | 2001-12-04 | 2017-03-28 | Illumina Cambridge Limited | Labelled nucleotides |
US10519496B2 (en) | 2001-12-04 | 2019-12-31 | Illumina Cambridge Limited | Labelled nucleotides |
US7427673B2 (en) | 2001-12-04 | 2008-09-23 | Illumina Cambridge Limited | Labelled nucleotides |
US10480025B2 (en) | 2001-12-04 | 2019-11-19 | Illumina Cambridge Limited | Labelled nucleotides |
US10487102B2 (en) | 2002-08-23 | 2019-11-26 | Illumina Cambridge Limited | Labelled nucleotides |
US10513731B2 (en) | 2002-08-23 | 2019-12-24 | Illumina Cambridge Limited | Modified nucleotides |
US11008359B2 (en) | 2002-08-23 | 2021-05-18 | Illumina Cambridge Limited | Labelled nucleotides |
US11028116B2 (en) | 2003-08-22 | 2021-06-08 | Illumina Cambridge Limited | Labelled nucleotides |
US10995111B2 (en) | 2003-08-22 | 2021-05-04 | Illumina Cambridge Limited | Labelled nucleotides |
US11028115B2 (en) | 2003-08-22 | 2021-06-08 | Illumina Cambridge Limited | Labelled nucleotides |
US9637777B2 (en) | 2003-10-28 | 2017-05-02 | Bioarray Solutions, Ltd. | Optimization of gene expression analysis using immobilized capture probes |
EP1694859A4 (fr) * | 2003-10-29 | 2007-08-08 | Bioarray Solutions Ltd | Analyse multiplexee d'acide nucleique par fragmentation d'adn bicatenaire |
AU2004287069B2 (en) * | 2003-10-29 | 2009-07-16 | Bioarray Solutions, Ltd. | Multiplexed nucleic acid analysis by fragmentation of double-stranded DNA |
JP2007509629A (ja) * | 2003-10-29 | 2007-04-19 | バイオアレイ ソリューションズ リミテッド | 二本鎖dnaの切断による複合核酸分析 |
US7790380B2 (en) | 2003-10-29 | 2010-09-07 | Bioarray Solutions, Ltd. | Kits for multiplexed nucleic acid analysis by capture of single-stranded DNA produced from double-stranded target fragments |
WO2005044836A3 (fr) * | 2003-11-05 | 2007-07-19 | Genovoxx Gmbh | Composes nucleotidiques macromoleculaires et leurs procedes d'utilisation |
WO2005044836A2 (fr) | 2003-11-05 | 2005-05-19 | Genovoxx Gmbh | Composes nucleotidiques macromoleculaires et leurs procedes d'utilisation |
US8637650B2 (en) | 2003-11-05 | 2014-01-28 | Genovoxx Gmbh | Macromolecular nucleotide compounds and methods for using the same |
US7169560B2 (en) | 2003-11-12 | 2007-01-30 | Helicos Biosciences Corporation | Short cycle methods for sequencing polynucleotides |
US7897345B2 (en) | 2003-11-12 | 2011-03-01 | Helicos Biosciences Corporation | Short cycle methods for sequencing polynucleotides |
US9657344B2 (en) | 2003-11-12 | 2017-05-23 | Fluidigm Corporation | Short cycle methods for sequencing polynucleotides |
EP1692312A4 (fr) * | 2003-11-12 | 2007-10-17 | Helicos Biosciences Corp | Procedes courts de sequen age de polynucleotides |
US9012144B2 (en) | 2003-11-12 | 2015-04-21 | Fluidigm Corporation | Short cycle methods for sequencing polynucleotides |
US7491498B2 (en) | 2003-11-12 | 2009-02-17 | Helicos Biosciences Corporation | Short cycle methods for sequencing polynucleotides |
US7981604B2 (en) | 2004-02-19 | 2011-07-19 | California Institute Of Technology | Methods and kits for analyzing polynucleotide sequences |
EP2248911A1 (fr) * | 2004-02-19 | 2010-11-10 | Helicos Biosciences Corporation | Procès et compositions pour l'analyse des séquences de polynucléotides |
WO2005080605A3 (fr) * | 2004-02-19 | 2005-11-03 | Helicos Biosciences Corp | Procedes et kits pour analyser des sequences de polynucleotides |
WO2005116262A1 (fr) * | 2004-05-24 | 2005-12-08 | Helicos Biosciences Corporation | Procedes et dispositifs destines au sequençage d'acides nucleique |
WO2006007207A3 (fr) * | 2004-05-25 | 2007-02-22 | Helicos Biosciences Corp | Procedes et dispositifs pour la determination de sequence d'acides nucleiques |
US7635562B2 (en) | 2004-05-25 | 2009-12-22 | Helicos Biosciences Corporation | Methods and devices for nucleic acid sequence determination |
US9035035B2 (en) | 2004-11-05 | 2015-05-19 | Genovoxx Gmbh | Macromolecular nucleotide compounds and methods for using the same |
US7220549B2 (en) | 2004-12-30 | 2007-05-22 | Helicos Biosciences Corporation | Stabilizing a nucleic acid for nucleic acid sequencing |
US7482120B2 (en) | 2005-01-28 | 2009-01-27 | Helicos Biosciences Corporation | Methods and compositions for improving fidelity in a nucleic acid synthesis reaction |
US7666593B2 (en) | 2005-08-26 | 2010-02-23 | Helicos Biosciences Corporation | Single molecule sequencing of captured nucleic acids |
US9868978B2 (en) | 2005-08-26 | 2018-01-16 | Fluidigm Corporation | Single molecule sequencing of captured nucleic acids |
US7476734B2 (en) | 2005-12-06 | 2009-01-13 | Helicos Biosciences Corporation | Nucleotide analogs |
WO2007100637A3 (fr) * | 2006-02-22 | 2007-11-15 | Helicos Biosciences Corp | Procédés de détection de mutations |
US7397546B2 (en) | 2006-03-08 | 2008-07-08 | Helicos Biosciences Corporation | Systems and methods for reducing detected intensity non-uniformity in a laser beam |
EP4170042A1 (fr) | 2006-06-14 | 2023-04-26 | Verinata Health, Inc. | Procédés pour le diagnostic d'anomalies f tales |
EP3406736A1 (fr) | 2006-06-14 | 2018-11-28 | Verinata Health, Inc | Procédés pour le diagnostic d'anomalies f tales |
EP2366801A1 (fr) | 2006-06-14 | 2011-09-21 | Verinata Health, Inc | Procédés pour le diagnostic d'anomalies foetales |
US9051612B2 (en) | 2006-09-28 | 2015-06-09 | Illumina, Inc. | Compositions and methods for nucleotide sequencing |
US11939631B2 (en) | 2006-12-01 | 2024-03-26 | The Trustees Of Columbia University In The City Of New York | Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators |
US11098353B2 (en) | 2006-12-01 | 2021-08-24 | The Trustees Of Columbia University In The City Of New York | Four-color DNA sequencing by synthesis using cleavable fluorescent nucleotide reversible terminators |
US10144961B2 (en) | 2007-10-19 | 2018-12-04 | The Trustees Of Columbia University In The City Of New York | Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis |
US11208691B2 (en) | 2007-10-19 | 2021-12-28 | The Trustees Of Columbia University In The City Of New York | Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis |
US12331352B2 (en) | 2007-10-19 | 2025-06-17 | The Trustees Of Columbia University In The City Of New York | DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators |
US11242561B2 (en) | 2007-10-19 | 2022-02-08 | The Trustees Of Columbia University In The City Of New York | DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators |
US12180544B2 (en) | 2007-10-19 | 2024-12-31 | The Trustees Of Columbia University In The City Of New York | Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis |
US9670539B2 (en) | 2007-10-19 | 2017-06-06 | The Trustees Of Columbia University In The City Of New York | Synthesis of cleavable fluorescent nucleotides as reversible terminators for DNA sequencing by synthesis |
US10260094B2 (en) | 2007-10-19 | 2019-04-16 | The Trustees Of Columbia University In The City Of New York | DNA sequencing with non-fluorescent nucleotide reversible terminators and cleavable label modified nucleotide terminators |
US10174368B2 (en) | 2009-09-10 | 2019-01-08 | Centrillion Technology Holdings Corporation | Methods and systems for sequencing long nucleic acids |
US10072287B2 (en) | 2009-09-10 | 2018-09-11 | Centrillion Technology Holdings Corporation | Methods of targeted sequencing |
US11597966B2 (en) | 2009-11-06 | 2023-03-07 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US10968479B2 (en) | 2009-11-06 | 2021-04-06 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US9845497B2 (en) | 2009-11-06 | 2017-12-19 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US10494669B2 (en) | 2009-11-06 | 2019-12-03 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US10982275B2 (en) | 2009-11-06 | 2021-04-20 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US8703652B2 (en) | 2009-11-06 | 2014-04-22 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US10329607B2 (en) | 2009-11-06 | 2019-06-25 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US11390918B2 (en) | 2009-11-06 | 2022-07-19 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
EP3719140A1 (fr) | 2009-11-06 | 2020-10-07 | The Board Of Trustees Of The Leland Stanford Junior University | Diagnostic non invasif de rejet de greffe chez des patients ayant subi une greffe d'organes |
US11384389B2 (en) | 2009-11-06 | 2022-07-12 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US10988804B2 (en) | 2009-11-06 | 2021-04-27 | The Board Of Trustees Of The Leland Stanford Junior University | Nucleic acid sequencing apparatus for monitoring status of a transplant recipient |
WO2011057061A1 (fr) | 2009-11-06 | 2011-05-12 | The Board Of Trustees Of The Leland Stanford Junior University | Diagnostic non invasif de rejet de greffe chez des patients ayant subi une greffe d'organes |
US11098350B2 (en) | 2009-11-06 | 2021-08-24 | The Board Of Trustees Of The Leland Stanford Junior University | Non-invasive diagnosis of graft rejection in organ transplant patients |
US12065690B2 (en) | 2011-01-31 | 2024-08-20 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US12129512B2 (en) | 2011-01-31 | 2024-10-29 | Roche Sequencing Solutions, Inc. | Composition comprising cell origination barcodes for the analysis of both protein and nucleic acid from the same cell |
US11939624B2 (en) | 2011-01-31 | 2024-03-26 | Roche Sequencing Solutions, Inc. | Method for labeling ligation products with cell-specific barcodes II |
US11932903B2 (en) | 2011-01-31 | 2024-03-19 | Roche Sequencing Solutions, Inc. | Kit for split-pool barcoding target molecules that are in or on cells or cell organelles |
US10626442B2 (en) | 2011-01-31 | 2020-04-21 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11932902B2 (en) | 2011-01-31 | 2024-03-19 | Roche Sequencing Solutions, Inc. | Barcoded beads and method for making the same by split-pool synthesis |
US12110536B2 (en) | 2011-01-31 | 2024-10-08 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11692214B2 (en) | 2011-01-31 | 2023-07-04 | Roche Sequencing Solutions, Inc. | Barcoded beads and method for making the same by split-pool synthesis |
US11566278B2 (en) | 2011-01-31 | 2023-01-31 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US10144950B2 (en) | 2011-01-31 | 2018-12-04 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11926864B1 (en) | 2011-01-31 | 2024-03-12 | Roche Sequencing Solutions, Inc. | Method for labeling ligation products with cell-specific barcodes I |
US11634752B2 (en) | 2011-01-31 | 2023-04-25 | Roche Sequencing Solutions, Inc. | Kit for split-pool barcoding target molecules that are in or on cells or cell organelles |
US11859240B2 (en) | 2011-01-31 | 2024-01-02 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11512341B1 (en) | 2011-01-31 | 2022-11-29 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11781171B1 (en) | 2011-01-31 | 2023-10-10 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US12129513B1 (en) | 2011-01-31 | 2024-10-29 | Roche Sequencing Solutions, Inc. | Method for barcoding |
US11667956B2 (en) | 2011-01-31 | 2023-06-06 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11732290B2 (en) | 2011-01-31 | 2023-08-22 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US11708599B2 (en) | 2011-01-31 | 2023-07-25 | Roche Sequencing Solutions, Inc. | Methods of identifying multiple epitopes in cells |
US12173353B1 (en) | 2011-01-31 | 2024-12-24 | Roche Sequencing Solutions, Inc. | Composition comprising cell origination barcodes for the analysis of both genomic DNA and CDNA from the same cell |
US9689032B2 (en) | 2011-04-01 | 2017-06-27 | Centrillion Technology Holdings Corporation | Methods and systems for sequencing long nucleic acids |
US10801062B2 (en) | 2011-04-01 | 2020-10-13 | Centrillion Technology Holdings Corporation | Methods and systems for sequencing long nucleic acids |
EP3836149A1 (fr) | 2011-11-07 | 2021-06-16 | QIAGEN Redwood City, Inc. | Procédés et systèmes pour l'identification de variants génomiques causals |
EP4450644A2 (fr) | 2012-08-08 | 2024-10-23 | F. Hoffmann-La Roche AG | Augmentation de la plage dynamique pour identifier de multiples épitopes dans des cellules |
WO2014026032A2 (fr) | 2012-08-08 | 2014-02-13 | Apprise Bio, Inc. | Accroissement de la plage dynamique pour identifier de multiples épitopes dans des cellules |
EP3578669A1 (fr) | 2012-08-08 | 2019-12-11 | F. Hoffmann-La Roche AG | Accroissement de la plage dynamique pour identifier de multiples épitopes dans des cellules |
US10174310B2 (en) | 2012-08-08 | 2019-01-08 | Roche Sequencing Solutions, Inc. | Increasing dynamic range for identifying multiple epitopes in cells |
EP3885446A1 (fr) | 2013-02-01 | 2021-09-29 | The Regents of The University of California | Procédés pour assemblage du génome et phasage d'haplotype |
US11081209B2 (en) | 2013-02-01 | 2021-08-03 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US9411930B2 (en) | 2013-02-01 | 2016-08-09 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US11935626B2 (en) | 2013-02-01 | 2024-03-19 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US10825553B2 (en) | 2013-02-01 | 2020-11-03 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US9910955B2 (en) | 2013-02-01 | 2018-03-06 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US10089437B2 (en) | 2013-02-01 | 2018-10-02 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US10529443B2 (en) | 2013-02-01 | 2020-01-07 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
US10648026B2 (en) | 2013-03-15 | 2020-05-12 | The Trustees Of Columbia University In The City Of New York | Raman cluster tagged molecules for biological imaging |
US20170166961A1 (en) | 2013-03-15 | 2017-06-15 | Illumina Cambridge Limited | Modified nucleosides or nucleotides |
US10407721B2 (en) | 2013-03-15 | 2019-09-10 | Illumina Cambridge Limited | Modified nucleosides or nucleotides |
US10722858B2 (en) | 2013-03-15 | 2020-07-28 | Lineage Biosciences, Inc. | Methods and compositions for tagging and analyzing samples |
US11161087B2 (en) | 2013-03-15 | 2021-11-02 | Lineage Biosciences, Inc. | Methods and compositions for tagging and analyzing samples |
US10982277B2 (en) | 2013-03-15 | 2021-04-20 | Illumina Cambridge Limited | Modified nucleosides or nucleotides |
WO2015070086A1 (fr) | 2013-11-07 | 2015-05-14 | The Board Of Trustees Of The Leland Stanford Junior University | Utilisation d'acides nucléiques acellulaires pour l'analyse du microbiome chez l'homme et de ses composants |
EP4130350A1 (fr) | 2013-11-07 | 2023-02-08 | The Board of Trustees of the Leland Stanford Junior University | Acides nucléiques acellulaires pour l'analyse du microbiome humain et ses composants |
EP3540074A1 (fr) | 2013-12-11 | 2019-09-18 | The Regents of the University of California | Procédé de marquage de régions internes de molécules d'acid nucléique |
WO2015089243A1 (fr) | 2013-12-11 | 2015-06-18 | The Regents For Of The University Of California | Procédés de marquage de fragments d'adn pour la recontruction de liaison physique et de phase |
EP4219710A2 (fr) | 2014-08-01 | 2023-08-02 | Dovetail Genomics, LLC | Marquage d'acides nucléiques pour l'assemblage de séquences |
US10526641B2 (en) | 2014-08-01 | 2020-01-07 | Dovetail Genomics, Llc | Tagging nucleic acids for sequence assembly |
US12180535B2 (en) | 2014-08-01 | 2024-12-31 | Dovetail Genomics, Llc | Tagging nucleic acids for sequence assembly |
US10318706B2 (en) | 2015-02-17 | 2019-06-11 | Dovetail Genomics, Llc | Nucleic acid sequence assembly |
US11600361B2 (en) | 2015-02-17 | 2023-03-07 | Dovetail Genomics, Llc | Nucleic acid sequence assembly |
US9715573B2 (en) | 2015-02-17 | 2017-07-25 | Dovetail Genomics, Llc | Nucleic acid sequence assembly |
US11807896B2 (en) | 2015-03-26 | 2023-11-07 | Dovetail Genomics, Llc | Physical linkage preservation in DNA storage |
US10457934B2 (en) | 2015-10-19 | 2019-10-29 | Dovetail Genomics, Llc | Methods for genome assembly, haplotype phasing, and target independent nucleic acid detection |
US10975417B2 (en) | 2016-02-23 | 2021-04-13 | Dovetail Genomics, Llc | Generation of phased read-sets for genome assembly and haplotype phasing |
US12065691B2 (en) | 2016-05-13 | 2024-08-20 | Dovetail Genomics, Llc | Recovering long-range linkage information from preserved samples |
US10947579B2 (en) | 2016-05-13 | 2021-03-16 | Dovetail Genomics, Llc | Recovering long-range linkage information from preserved samples |
EP3954771A1 (fr) | 2016-05-13 | 2022-02-16 | Dovetail Genomics, LLC | Récupération d'informations de liaison de longue portée à partir d'échantillons conservés |
WO2017197300A1 (fr) | 2016-05-13 | 2017-11-16 | Dovetail Genomics Llc | Récupération d'informations de liaison de longue portée à partir d'échantillons conservés |
WO2018195091A1 (fr) | 2017-04-18 | 2018-10-25 | Dovetail Genomics, Llc | Caractéristiques d'acide nucléique utilisées en tant que guides pour l'assemblage de séquence |
WO2018237209A1 (fr) | 2017-06-21 | 2018-12-27 | Bluedot Llc | Systèmes et procédés d'identification d'acides nucléiques dans un échantillon |
US11781959B2 (en) | 2017-09-25 | 2023-10-10 | Freenome Holdings, Inc. | Methods and systems for sample extraction |
US12247905B2 (en) | 2017-09-25 | 2025-03-11 | Freenome Holdings, Inc. | Methods and systems for sample extraction |
WO2019152543A1 (fr) | 2018-01-31 | 2019-08-08 | Dovetail Genomics, Llc | Préparation d'échantillon pour récupération de liaison d'adn |
US11166996B2 (en) | 2018-12-12 | 2021-11-09 | Flagship Pioneering Innovations V, Inc. | Anellovirus compositions and methods of use |
US11446344B1 (en) | 2018-12-12 | 2022-09-20 | Flagship Pioneering Innovations V, Inc. | Anellovirus compositions and methods of use |
WO2020264185A1 (fr) | 2019-06-27 | 2020-12-30 | Dovetail Genomics, Llc | Procédés et compositions pour ligature de proximité |
EP4545965A2 (fr) | 2019-06-27 | 2025-04-30 | Dovetail Genomics, LLC | Procédés et compositions pour ligature de proximité |
WO2021103695A1 (fr) * | 2019-11-25 | 2021-06-03 | 齐鲁工业大学 | Procédé de séquençage ciblé de type flux à extension continue à base unique |
Also Published As
Publication number | Publication date |
---|---|
WO2003020968A3 (fr) | 2003-12-31 |
AU2002337030A1 (en) | 2003-03-18 |
DE10239504A1 (de) | 2003-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2003020968A2 (fr) | Procede d'analyse de sequences d'acides nucleiques et de l'expression de genes | |
DE10120798B4 (de) | Verfahren zur Bestimmung der Genexpression | |
DE10120797B4 (de) | Verfahren zur Analyse von Nukleinsäureketten | |
DE69837913T2 (de) | Verfahren zur vervielfältigung von nukleinsäure | |
DE69928683T2 (de) | Verfahren zur amplifikation und zur sequenzierung von nukleinsäuren | |
DE10246005A1 (de) | Gerät zur Sequenzierung von Nukleinsäuremolekülen | |
DE69713599T2 (de) | Verfahren zur bestimmung von nukleinsäure-sequenzen und diagnostische anwendungen derselben | |
DE69421277T2 (de) | NUKLEINSäURE-SEQUENZANALYSE DURCH DIE METHODE DER PARALLELEN PRIMEREXTENSION | |
DE60208278T2 (de) | System und Verfahren zum Testen von Nukleinsäuremolekülen | |
EP1204765B1 (fr) | Procede de quantification relative de la methylation des bases du type cytosine dans des echantillons d'adn | |
DE10214395A1 (de) | Verfahren zur Analyse von Einzelnukleotidpolymorphismen | |
DE10149786B4 (de) | Oberfläche für Untersuchungen aus Populationen von Einzelmolekülen | |
EP2109499A2 (fr) | Installation améliorée de traitement biologique moléculaire | |
DE69924140T2 (de) | Bestimmung der länge von repetitiven nukleinsäure-sequenzen durch eine diskontinuierliche primerverlängerung | |
DE102004025695A1 (de) | Verfahren und Oberfläche zur parallelen Sequenzierung von Nukleinsäureketten | |
DE102004025694A1 (de) | Verfahren und Oberfläche zu hochparallelen Analysen von Nukleinsäureketten | |
EP1294946B1 (fr) | PROCEDE DE SEQUENçAGE MULTIPLEX | |
DE102004025744A1 (de) | Oberfläche für die Analysen an einzelnen Nukleinsäuremolekülen | |
DE102004025745A1 (de) | Oberfläche für die Analysen an einzelnen Molekülen | |
DE60038109T2 (de) | Verfahren zur Analyse von AFLP Reaktionsmischungen unter Verwendung von Primer Verlängerungstechniken | |
EP1458892B1 (fr) | Procede de sequen age multiplex base sur le principe de l'evanescence | |
WO2003052136A2 (fr) | Sequençage au moyen de membranes perforees | |
EP1234056B1 (fr) | Determination dynamique d'analytes en utilisant une puce localisee sur une surface interne | |
DE102006062089A1 (de) | Verbesserte molekularbiologische Prozessanlage | |
DE102007018833A1 (de) | Verbesserte molekularbiologische Prozessanlage |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BY BZ CA CH CN CO CR CU CZ DE DM DZ EE ES FI GB GD GE GH GM HU ID IL IN IS JP KE KG KP KR KZ LK LR LS LT LU LV MA MD MG MK MW MX MZ NO NZ PL PT RO RU SD SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ UG ZM ZW AM AZ BY KG KZ RU TJ TM AT BE BG CH CY CZ DK EE ES FI FR GB GR IE IT LU MC PT SE SK TR BF BJ CF CG CI GA GN GQ GW ML MR NE SN TD TG Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase |
Ref country code: JP |
|
WWW | Wipo information: withdrawn in national office |
Country of ref document: JP |