WO2018069450A1 - Methylation biomarkers for lung cancer - Google Patents
Methylation biomarkers for lung cancer Download PDFInfo
- Publication number
- WO2018069450A1 WO2018069450A1 PCT/EP2017/076076 EP2017076076W WO2018069450A1 WO 2018069450 A1 WO2018069450 A1 WO 2018069450A1 EP 2017076076 W EP2017076076 W EP 2017076076W WO 2018069450 A1 WO2018069450 A1 WO 2018069450A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- hist1
- methylation
- lung cancer
- small cell
- cell lung
- Prior art date
Links
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6883—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material
- C12Q1/6886—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for diseases caused by alterations of genetic material for cancer
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q2600/00—Oligonucleotides characterized by their use
- C12Q2600/154—Methylation markers
Definitions
- the present invention relates to DNA methylation biomarkers for non-small cell lung cancer.
- Lung cancer is the most common type of cancer and each year, the disease is responsible for approximately 1 .5 million deaths worldwide.
- SCLC small cell lung cancer
- NSCLC non-small cell lung cancer
- Lung adenocarcinoma (LAC) is the most common subtype of NSCLC, which account for approximately 40% of all lung cancers.
- the overall 5-year survival rate for lung cancer is 15%, but the prognosis is highly dependent on the stage, at which the disease is diagnosed. If the disease is localized at the time of diagnosis, the 5-year survival rate is approximately 50%, compared to approximately 25% for cases with regional disease, and less than 5% for patients that already suffer from metastatic disease.
- CGI CpG islands
- DNA methylation biomarkers have already been established in all aspects of clinical cancer management, including risk assessment, early disease detection, prognostication and treatment personalization.
- biomarkers for clinical implementation is a challenging process that includes biomarker candidate discovery and evaluation of biomarker specificity and sensitivity in large- scale validation studies.
- biomarker candidate discovery includes biomarker candidate discovery and evaluation of biomarker specificity and sensitivity in large- scale validation studies.
- novel methylation biomarkers have been identified for use in clinical lung cancer
- the biomarkers display cancer-specific methylation changes and have been validated and evaluated for sensitivity and specificity.
- the invention relates to methylation biomarkers for non-small cell lung cancer.
- the invention provides a number of methylation markers, which can be used to distinguish between lung tumour tissue and healthy tissue. A plurality of individual methylation biomarkers are identified, which show high sensitivity and specificity.
- a method is provided for determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, the prognosis of a non-small cell lung cancer, and/or monitoring a non-small cell lung cancer in a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ),
- HIST1 H3E HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
- Another aspect pertains to a method of assessing whether a human subject is likely to develop non-small cell lung cancer, said method comprising
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- HIST1 H2AJ/HIST1 H2BM HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ), and iii) on the basis of said methylation status identifying a human subject that is more likely to develop non-small cell lung cancer.
- a third aspect pertains to a method for categorizing or predicting the clinical outcome of a non-small cell lung cancer of a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- a fourth aspect relates to a method of evaluating the risk for a subject of contracting cancer, said method comprising in a sample from said subject determining the methylation status of a gene locus selected from the group consisting of SIM1 ,
- a method is provided of treating a non-small cell lung cancer in a human subject, said method comprising the steps of
- determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, or the prognosis of a non-small cell lung cancer in a subject by a method according to the present disclosure which involve determining the methylation status of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ),
- HIST1 H3E HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
- step iii. subjecting said subjects identified in step ii. to a suitable treatment for non-small cell lung cancer.
- the sample may be a lung tissue sample, or a bodily fluid such as blood or plasma (for example peripheral blood), and methylation status may be determined by any method selected from the group consisting of Methylation- Sensitive High Resolution Melting (MS-HRM), Methylation - Sensitive Melting Curve Analysis (MS-MCA), EpiTyper, Methylation-Specific PCR (MSP), DNA methylation specific qPCR (qMSP), Pyrosequencing, Methyl Light, Amplicon bisulfite sequencing (AmpliconBS), Enrichment bisulfite sequencing (EnrichmentBS), Whole genome bisulfite sequencing (BS-Seq), HELP assays, and Methyl Sensitive Southern Blotting, and determination may involve Methylated DNA immunoprecipitation (MeDIP).
- MS-HRM Methylation- Sensitive High Resolution Melting
- MS-MCA Methylation - Sensitive Melting Curve Analysis
- MSP Me
- the methylation status is determined by a method comprising the steps of
- the methylation status is determined by a method comprising the steps of
- nucleic acid modifying said nucleic acid using an agent which modifies unmethylated cytosine or cleaves nucleic acid sequences in a methylation-dependent manner, iii) amplifying at least one portion of said gene using primers, which span or comprise at least one CpG dinucleotide in said gene in order to obtain an amplification product, and
- the amplified CpG-containing nucleic acid is preferably analyzed by melting curve analysis.
- methylation status may also be determined by methylation specific PCR, bisulfite sequencing, COBRA, endonucleolytic digestion, or DNA methylation arrays.
- kits are also provided in a sixth aspect for determining non-small cell lung cancer, predisposition to non-small cell lung cancer, or categorizing or predicting the clinical outcome of a non-small cell lung cancer, said kit comprising in a package
- oligonucleotide primers that specifically hybridizes under amplification conditions to a region of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ), such as at least one primer selected from the group consisting of SEQ ID NO: 37 to 72.
- oligonucleotide primers comprising a sequence, which is a subsequence of a gene loci selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 ,
- a method for identifying a therapeutically effective agent for treatment of non-small cell lung cancer comprising
- non-small cell lung cancer cell line comprising one or more genetic loci selected from the group consisting of SIM1 , Chr6(p22.1 ),
- HIST1 H3E HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ),
- determining methylation status of said one or more genetic loci v. comparing said methylation status of said treated non-small cell lung cancer cells with the methylation status of said non-small cell lung cancer cells, when untreated, wherein a decreased level of methylation positive alleles is indicative of a therapeutic agent.
- Figure 1 Differentially methylated regions in LAC.
- the methylation level of 18 DMRs was investigated in 52 LAC primary tumors and 32 tumor-adjacent normal lung samples using MS-HRM analysis.
- the results of the methylation assessment are shown as stacked bar percentage plots for each DMR in a-r).
- the relative proportion of samples in each category with 0-1 % methylated templates are shown in white, 1 -10% methylated templates in white with light grey stripes, 10-50% methylated templates in dark grey and 50-100% methylated templates in black.
- the statistical significance of the detected differences in methylation between groups was assessed using a Mann-Whitney test of ranks and two-tailed p-values ⁇ 0.05 were considered statistically significant.
- Figure 2 Examples of melting profiles observed for the HOXD3, OSR1 and HIST1 H3E regions.
- the methylation level of the 18 DMRs was determined using MS-HRM analysis.
- Representative normalized melting profiles for 10 tumor-adjacent normal lung samples and 10 LAC tumors are shown in black in a-b) for HOXD3, c-d) for OSR1 and in e-f) for HIST1 H3E.
- the DNA methylation standards were generated as a serial dilution of fully methylated DNA into an unmethylated background. The 100% methylated standard is shown in red, 50% methylated standard in light blue, 10% methylated standard in green, 1 % methylated standard in dark blue and the 0% methylated standard in orange.
- Figure 3 Methylation levels in metastasizing and non-metastasizing LAC tumors.
- the methylation levels of the 18 DMRs were compared between 26 metastasizing and 26 non-metastasizing LAC primary tumors.
- the results are shown as stacked bar percentage plots in a) for HOXB3/HOXB4, b) for LOC648987 and in c) for HOXA5.
- Genome Browser on Human Dec. 2013 (GRCh38/hg38) Assembly), primer sequences and assay-specific PCR cycling and HRM protocols are shown in the sequences section.
- 180 bp of regional genomic sequence (top strand) and corresponding bisulfite-modified sequence (bottom strand) is shown.
- the locations of the primers are indicated in red.
- Normalized melting profiles for the DNA methylation standards are shown for each assay in technical duplicates.
- the DNA methylation standards were generated as a serial dilution of fully methylated DNA into an unmethylated background. The 100% methylated standard is indicated in red, 50% methylated standard in light blue, 10% methylated standard in green, 1 % methylated standard in dark blue and the 0% methylated standard in orange.
- the present invention relates to methylation biomarkers for use in the diagnosis and treatment of non-small cell lung cancer.
- the methylation markers of the invention can be used in methods for identifying subjects, which are predisposed to non-small cell lung cancer; i.e. subjects having an increased likelihood of developing non-small cell lung cancer.
- the methylation markers of the invention can also be used in methods for identifying subjects having non-small cell lung cancer, and in this case, the markers allow early diagnosis, Further, the markers provide prognostic information with respect to non-small cell lung cancer, and this, the markers can be used to identify a subject having non-small cell lung cancer, and the cancer DNA can be tested for predictive prognostic information based on the methylation markers of the invention, as well as information on which curative and/or ameliorative treatment to provide for the non-small cell lung cancer.
- the methylation status of the methylation markers of the invention may also be used to monitor a treatment provided for the curing and/or ameliorating a non-small cell lung cancer. Additionally, the marker methylation status can be used to monitor relapse of non-small cell lung cancer for subject previously cured for non-small cell lung cancer.
- aspects of the present invention relates to i) methods for identifying subjects, which are predisposed to non-small cell lung cancer, and/or which have a non-small cell lung cancer, including early stages, such as asymptomatic stages of non-small cell lung cancer, ii) methods for providing prognostic information of a non-small cell lung cancer and/or inferring a suitable treatment based thereupon, iii) methods of monitoring a treatment of a non-small cell lung cancer, and/or monitoring relapse of a non-small cell lung cancer.
- Amplification according to the present invention is the process wherein a plurality of exact copies of one or more gene loci or gene portions (template) is synthesised.
- amplification of a template comprises the process wherein a template is copied by a nucleic acid polymerase or polymerase homologue, for example a DNA polymerase or an RNA polymerase.
- templates may be amplified using reverse transcription, the polymerase chain reaction (PCR), ligase chain reaction (LCR), in vivo amplification of cloned DNA, isothermal amplification techniques, and other similar procedures capable of generating a complementing nucleic acid sequence.
- PCR polymerase chain reaction
- LCR ligase chain reaction
- Amplified copies of a targeted genetic region are sometimes referred to as an amplicon.
- PCR bias refers to conditions, wherein PCR more efficiently amplifies templates with a specific methylation status. It has been reported that at least some unmethylated nucleic acid templates are more efficiently amplified than methylated nucleic acid template.
- a double stranded nucleic acid contains two strands that are complementary in sequence and capable of hybridizing to one another.
- a gene is defined in terms of its coding strand, but in the context of the present invention, an oligonucleotide primer, which hybridize to a gene as defined by the sequence of its coding strand, also comprise oligonucleotide primers, which hybridize to the complement thereof.
- a nucleotide is herein defined as a monomer of RNA or DNA.
- a nucleotide is a ribose or a deoxyribose ring attached to both a base and a phosphate group. Both mono-, di-, and tri-phosphate nucleosides are referred to as nucleotides.
- oligonucleotide comprises oligonucleotides of both natural and/or non-natural nucleotides, including any combination thereof.
- the natural and/or non-natural nucleotides may be linked by natural phosphodiester bonds or by non-natural bonds.
- oligonucleotides comprise only natural nucleotides linked by phosphodiester bonds.
- the oligomer or polymer sequences of the present invention are formed from the chemical or enzymatic addition of monomer subunits.
- the term "oligonucleotide” as used herein includes linear oligomers of natural or modified monomers or linkages, including deoxyribonucleotides, ribonucleotides, anomeric forms thereof, peptide nucleic acid monomers (PNAs), locked nucleotide acid monomers (LNA), and the like, capable of specifically binding to a single stranded polynucleotide tag by way of a regular pattern of monomer-to-monomer interactions, such as Watson-Crick type of base pairing, base stacking, Hoogsteen or reverse Hoogsteen types of base pairing, or the like.
- oligonucleotides ranging in size from a few monomeric units, e.g. 3-4, to several tens of monomeric units, e.g. 40-60.
- ATGCCTG an oligonucleotide is represented by a sequence of letters, such as "ATGCCTG,” it will be understood that the nucleotides are in 5' ⁇ 3' order from left to right and the "A” denotes deoxyadenosine, "C” denotes deoxycytidine, “G” denotes deoxyguanosine, and "T” denotes thymidine, unless otherwise noted.
- oligonucleotides of the invention comprise the four natural nucleotides; however, they may also comprise methylated or non-natural nucleotide analogs.
- dinucleotide refers to two sequential nucleotides.
- the dinucleotide may be comprised in an oligonucleotide or a nucleic acid sequence.
- the dinucleotide CpG which denotes a cytosine linked to a guanine by a phosphodiester bond, may be comprised in an oligonucleotide according to the present invention, and also comprised in a targeted gene locus sequence according to the present invention.
- a CpG dinucleotide is also herein referred to as a CpG site. CpG sites are targets for methylation of the cytosine residue.
- Methylation status refers to the presence or absence of methylation in a specific nucleic acid region.
- the present invention relates to detection of methylated cytosine (5-methylcytosine).
- a nucleic acid sequence e.g. a gene locus of the invention, may comprise one or more CpG methylation sites.
- the nucleic acid sequence of the gene locus may be methylated on all methylation sites (i.e. 100% methylated), or unmethylated on all methylation sites (i.e. 0% methylated).
- the nucleic acid sequence may also be methylated on a subset of its potential methylation sites (CpG-sites). In this latter case, the nucleic acid molecule is heterogeneously methylated.
- the gene loci methylation markers of the present invention can be used to infer non- small cell lung cancer based on the relative amount of methylation positive (fully methylated) and methylation negative (fully unmethylated) alleles in a sample comprising in a mixture of nucleic acid molecules from a subject.
- the methylation status of a specific gene locus marker of the present invention may be that at least 50%, such as on at least 60%, such as on at least 70%, for example on at least 80%, such as on at least 90%, such as on at least 95%, for example on at least 99%, such as least 99.9% of the nucleic acid sequence molecules (alleles) in a sample are methylation positive (fully methylated).
- Gene locus The term "gene locus” as sued herein, such as the gene loci defined by the genes SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ) is meant to include all regions relevant for expression of a given gene, both the coding region and upstream and downstream regions, which may comprise cis-acting activating signals.
- a gene locus specifically is meant to include at least 1000 bp upstream and/or downstream of the open reading frame of an encoded gene, such as at least 900 bp, such as at least 800 bp, such as at least 700 bp, such as at least 600 bp, such as at least 500 bp, such as at least 400 bp, such as at least 300 bp, such as at least 200 bp, such as at least 100 bp upstream and/or downstream of the open reading frame of an encoded gene.
- a gene locus is also meant to include any intronic sequences in the open reading frame. It is also understood that specific subregions of a gene locus specified herein can be of particular importance for the methods described herein. In particular CG-rich regions also known as CpG islands are particularly relevant, because CG-dinucleotides are targets for methylation.
- a number of methods are provided herein for analysing a human subject with respect to non-small cell lung cancer.
- methods are provided for determining non- small cell lung cancer in a human subject, methods for determining a predisposition to non-small cell lung cancer for a human subject, methods for determining the prognosis of a non-small cell lung cancer in a subject and/or inferring a suitable treatment, methods for categorizing or staging a non-small cell lung cancer of a human subject, methods for monitoring a non-small cell lung cancer, such as monitoring the treatment of a non-small cell lung cancer and/or relapse of a non-small cell lung cancer.
- the methylation biomarkers for non-small cell lung cancer are described in more detailed herein below.
- the one or more methylation biomarkers for non-small cell lung cancer are selected from a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- a method for determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, the prognosis of a non-small cell lung cancer, and/or monitoring a non-small cell lung cancer in a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene including regulatory sequences of said gene, wherein said gene locus is selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- a method for categorizing or predicting the clinical outcome of a non-small cell lung cancer of a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- a method for evaluating the risk for a human subject of developing non-small cell lung cancer, or for monitoring relapse of a non-small cell lung cancer, said method comprising in a sample from said subject determining the methylation status of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
- a further aspect relates to a method for assessing whether a human subject is likely to develop non-small cell lung cancer, said method comprising
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- the methods thus involve determining the methylation status of one or more gene loci as defined herein.
- methylation status may be determined for multiple gene loci, for example methylation status for at least two gene loci are determined, such as at least three gene loci, such as at least four gene loci, or five or more gene loci.
- the plurality of gene loci is preferably selected from a marker gene loci of the invention, i.e. a gene loci selected from the group consisting of SIM1 , Chr6(p22.1 ),
- HIST1 H3E HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
- increased levels of methylation positive alleles of the respective marker gene locus relative to methylation levels of a predetermined control sample of non- cancer cells is indicative of the presence of a non-small cell lung cancer, higher likelihood of developing cancer, decreased overall survival, negative outcome, different stage cancer, different grade cancer, and/or higher risk of contracting cancer.
- the provided methods may preferably comprise steps of comparing the methylation status of the respective gene locus determined for a subject with a predetermined methylation status for the corresponding gene of a reference sample comprising non-cancer cells, for example tumour adjacent normal lung tissue cells, and/or comprising a different stage cancer cells.
- the predetermined status is preferably determined from non-cancer cells of other subjects, which do not have non-small cell lung cancer and/or are not predisposed to non-small cell lung cancer.
- predetermined methylation status differs between the different methylation markers of the invention.
- Table A Overview of sensitivities and specificities for each marker for preferred cut-off values.
- Table A shows an overview of cut-off values, which are preferred for each of the representative marker loci.
- any level of methylation positive alleles above 1 % is indicative of a non-small cell lung cancer, higher likelihood of developing cancer, decreased overall survival, negative outcome, different stage cancer, different grade cancer, and/or higher risk of contracting cancer for a human subject.
- the methylation marker loci identified as SIM1 , Chr6(p22.1 ) is indicative of a non-small cell lung cancer, higher likelihood of developing cancer, decreased overall survival, negative outcome, different stage cancer, different grade cancer, and/or higher risk of contracting cancer for a human subject.
- methylation marker loci identified as HIST1 H3E, HOXA3 and HOXA5 a level of methylation positive alleles above 50%, such as above 55%, such as above 60%, such as above 65%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
- aspects provided herein also relates to methods for determining the prognosis of a non-small cell lung cancer in a subject and/or inferring a suitable treatment, as well as for monitoring a non-small cell lung cancer, and in particular monitoring the treatment of a non-small cell lung cancer and/or monitoring relapse of a non-small cell lung cancer.
- a method for treatment of non-small cell lung cancer in a human subject comprises the steps of
- non-small cell lung cancer i. determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, or the prognosis of a non-small cell lung cancer in a subject by a method of the present invention, as defined elsewhere herein,
- step iii. subjecting said subjects identified in step ii. to a suitable treatment for non-small cell lung cancer.
- the step of determining non-small cell lung cancer by a method of the present invention allows early detection of non-small cell lung cancer, and therefore allows treatment of the cancer to be initiated before developing into later stages and/or before forming metastases. This allows the use of less serious types of therapeutic interventions, and may for example avoid the need for surgery, such as surgical removal of the entire lung.
- the selected human subject is subjected to a treatment selected form surgery, chemotherapy, immunotherapy and/or radiotherapy, however, in a preferred embodiment, the treatment is radiotherapy.
- the treatment is a combination of surgery, chemotherapy and radiotherapy, for example surgery followed by chemotherapy and/or radiotherapy.
- the invention provides a method for personalized treatment of a non-small cell lung cancer of a human subject, said method comprising
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- HIST1 H2AJ/HIST1 H2BM HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ) ii) providing a treatment of non-small cell lung cancer to said human subject, iii) after a sufficient amount of time having provided the treatment, in a sample from said human subject, determining the methylation status of said gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- the invention also provides specific oligonucleotide primers and kits for use in determining methylation status of specific gene loci, which are
- methylation biomarkers for non-small cell lung cancer include SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- HIST1 H2AJ/HIST1 H2BM HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ), such as those identified by SEQ ID NO: 1 -36.
- the methylation status is determined for at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
- the methylation status is determined for a gene locus selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21.1 ) and/or HOXB3/HOXB4.
- the methylation status is determined for a gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or HOXB3/HOXB4.
- the methylation status is determined for a gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI and/or Chr6(p22.1 ).
- the methylation status is determined for a gene locus selected from the group consisting of SIM1 and HIST1 H3G/HIST1 H2BI; or the group consisting of SIM1 and Chr6(p22.1 ); or the group consisting of HIST1 H3G/HIST1 H2BI and Chr6(p22.1 ).
- the methylation status is determined for SIM1 and/or Chr6(p22.1 ) and/or HIST1 H3G/HIST1 H2BI and/or HOXB3/HOXB4 and/or OSR1 and/or HOXD10 and/or OTX2 and/or LOC648987 and/or HIST1 H2AJ/HIST1 H2BM.
- the methylation status is determined for one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- the methylation status is determined for that one gene locus and at least one gene locus selected from the remainder of the group.
- methylation status is determined in the SIM1 gene locus and at least one additional gene locus selected from the group consisting of Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
- methylation status is determined in the Chr6(p22.1 ) gene locus and at least one additional gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the
- HIST1 H3G/HIST1 H2BI gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the
- HOXB3/HOXB4 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
- methylation status is determined in the OSR1 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, GHSR, OTX2,
- methylation status is determined in the GHSR gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , OTX2,
- methylation status is determined in the OTX2 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR,
- methylation status is determined in the LOC648987 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the HIST1 H3E gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the
- HIST1 H2AJ/HIST1 H2BM gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the HOXD10 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the HOXD3 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXA3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the HOXA3 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA5 and Chr1 (q21.1 ).
- methylation status is determined in the HOXA5 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, and Chr1 (q21.1 ).
- methylation status is determined in the Chr1 (q21.1 ) gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3 and HOXA5.
- DNA sequences of specific gene loci are provided herein below.
- the methylation status is determined in a gene locus identified by SEQ ID NO: 1 -18.
- the methylation status is determined by a method comprising amplifying a gene locus of the invention using at least one primer selected from the group consisting of SEQ ID NO: 37-72. Methylation status is preferably determined for a gene locus mentioned in table B using the respective forward primer and/or reverse primer identified in table B; i.e.
- SIM1 forward primer SEQ ID NO: 51 and/or reverse primer SEQ ID NO: 52;
- the methylation status is determined in a genetic region of a gene locus of the invention, wherein said region is delineated by the primer pairs identified in table B for each respective gene; i.e.
- HIST1 H3G/HIST1 H2BI primers SEQ ID NO: 47 and/or 48;
- HIST1 H3E primers SEQ ID NO: 71 and/or 72.
- the methylation status of one or more gene loci is determined in a sample from a human subject.
- the sample of the invention comprises biological material, in particular genetic material comprising nucleic acid molecules.
- the nucleic acid molecules may be extracted from the sample prior to the analysis.
- the sample may be obtained or provided from any human source.
- determination of methylation status of a gene locus or genetic region of the invention is performed on samples selected from the group consisting of lung tissue, hematopoietic tissue, bone marrow, expiration air, stem cells, including cancer stem cell, and body fluids, such as sputum, bronchial lavage, urine, blood and sweat.
- the sample is or comprises lung tissue, such as lung cells and/or genetic material of lung cells. It is well-known that tumor DNA may leak to the blood stream or other bodily fluids, so in one preferred embodiment, the sample is a body fluid, such as sputum, urine, blood and sweat. In particular, it is preferred that the sample is a blood or plasma sample. Body fluids are often retrievable by less invasive methods than lung tissue, which must be obtained surgically for example by biopsies.
- the provided sample is in one embodiment a formalin-fixed paraffin-embedded (ffpe) sample, for example an ffpe sample, wherein prestages to non-small cell lung cancer can be seen.
- the sample used for predetermining methylation status can be an ffpe sample.
- Many ffpe samples may be provided, which can give rise to statistically strong predetermined values with respect to evaluation of non-small cell lung cancer risk, categorizing or staging a non-small cell lung cancer of a human subject, methods for monitoring a non-small cell lung cancer, such as monitoring the treatment of a non-small cell lung cancer and/or relapse of a non-small cell lung cancer.
- the nucleic acid to be analysed for the presence of methylated CpG may be extracted from the samples by a variety of techniques such as that described by Maniatis, et al (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp 280, 281 , 1982). However, the sample may be used directly. Any nucleic acid, in purified or nonpurified form, can be utilized as the starting nucleic acid or acids, provided it contains, or is suspected of containing, the specific nucleic acid sequence containing the methylation target site (e.g., CpG).
- the specific nucleic acid sequence which is to be amplified may be a part of a larger molecule or is present initially as a discrete molecule.
- the nucleic acid sequence to be amplified need not to be present in a pure form, it may for example be a fraction of a complex mixture of other DNA molecules, and/or RNA.
- the nucleic acid sequence is a fraction of a genomic nucleic acid preparation.
- Extremely low amounts of nucleic acid may be used as target sequence according to the methods of the present invention. It is appreciated by the person skilled in the art that in practical terms no upper limit for the amount of nucleic acid to be analysed exists. The problem that the skilled person may encounter is that the amount of sample to be analysed is limited. Therefore, it is beneficial that the method of the present invention can be performed on a small amount of sample and thus a limited amount of nucleic acid in said sample.
- the present methods allow the detection of only very few nucleic acid copies.
- the amount of the nucleic acid to be analysed is in one
- At least 0.01 ng such as 0.1 ng, such as 0.5 ng, for example 1 ng, such as at least 10 ng, for example at least 25 ng, such as at least 50 ng, for example at least 75 ng, such as at least 100 ng, for example at least 125 ng, such as at least 150 ng, for example at least 200 ng, such as at least 225 ng, for example at least 250 ng, such as at least 275 ng, for example at least 300 ng, 400 ng, for example at least 500 ng, such as at least 600 ng, for example at least 700 ng, such as at least 800, ng, for example at least 900 ng or such as at least 1000 ng.
- the amount of nucleic acid as the starting material for the method of the present invention is approximately 50 ng, alternatively 100 ng or 200 ng.
- the methods of the present invention for determining non-small cell lung cancer in a human subject include a step of providing or obtaining a sample from the human subject, and in that sample determining the methylation status of at least one genetic locus selected from the group consisting of SIM1 ,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- Methylation status of the target gene loci or genetic regions of the present invention may be determined by any suitable method available to the skilled person for detecting methylation status.
- methylation status is determined by a quantitative method, such as a semi-quantitative method, which is capable of detecting levels of methylation positive alleles and/or methylation negative alleles in a population of target molecules present in a sample.
- the quantitative method is preferably capable of detecting different levels of methylation positive alleles of a given target locus sequence, such as detecting whether 0%, less than 1 %, more that 1 %, such as approximately 10%, 25%, 50%, 75% or 100% of the alleles of a given marker locus are methylation positive.
- a semi- quantitative method provides categorical data, such as the level of 1 -10% methylated templates or 10-50% methylated templates.
- Some techniques in the art merely detect the presence of one or more methylation positive and/or methylation negative alleles of a given target sequence without providing quantitative data, and without providing information of the relative levels of methylation positive and methylation negative alleles.
- preferred methods of the present invention provide a quantitative measure of the relative level of methylation positive alleles of a specific target region.
- methylation status refers to the extent to which a nucleic acid region and/or in particular a CpG methylation site is methylated or unmethylated, which may be expressed as the methylation level of a given sample.
- the methylation status of a single CpG methylation site can be either methylated or unmethylated.
- a nucleic acid sequence comprising multiple potential methylation (CpG) sites may be methylated on only a subset of those CpG sites. Such nucleic acid molecules/alleles are heterogeneous methylated.
- methylation status thus, refers to whether a nucleic acid sequence is methylation positive (methylated on all CpG sites), is methylation negative (all CpG sites of the sequence is unnmethylated), or is
- heterogeneous methylated (a subset of CpG sites of the sequence is methylated.
- the methods for inferring non-small cell lung cancer of the present invention thus determine methylation status of specific methylation markers by determining whether a specific methylation marker in a sample obtained or provided from a subject is methylation positive, methylation negative or heterogeneously methylated, as well as detecting the relative level of methylated alleles of a given locus.
- the methods may also include detecting marker sequences with low methylation, which defines methylation of less than 1 % of the alleles of a sample.
- Methylation status may be determined by any suitable method available to those of skill in the art.
- method may be selected from the group consisting of
- MS-HRM Methylation-Sensitive High Resolution Melting
- MS-MCA Methylation- Sensitive Melting Curve Analysis
- MSP Methylation-Specific PCR
- qMSP DNA methylation specific qPCR
- Pyrosequencing Methyl Light, Amplicon bisulfite sequencing (AmpliconBS), Enrichment bisulfite sequencing (EnrichmentBS), Whole genome bisulfite sequencing (BS-Seq), HELP assays, and Methyl Sensitive Southern Blotting, and determination may involve Methylated DNA immunoprecipitation
- the method is selected from the group consisting of AmpliconBS 1 , AmpliconBS 2, AmpliconBS 3, AmpliconBS 4, EnrichmentBS 1 ,
- EnrichmentBS 2 EpiTyper 1 , EpiTyper 3, Infinium, Pyroseq 1 , Pyroseq 1 (replicate), Pyroseq 2, Pyroseq 3, Pyroseq 4, Pyroseq 5, MethyLight, MS-HRM, MS-MCA, qMSP (preamp), qMSP (standard), DNA-methylation-specific amplification by qPCR, HPLC- MS, Immunoquant, Pyroseq AluYb8, Bisulfite pyrosequencing using primers that amplify AluYb8 repetitive DNA, Pyroseq D4Z4, Pyroseq LINE1 , Pyroseq NBL2 and ClonalBS; cf. Bock et al, 2016, nature biotech. 34 (7).
- the method is selected from the group consisting of High- performance liquid chromatography (HPLC), High-performance capillary
- HPCE high-density polymerase chain reaction
- Sssl assay Gene specific Methylation-specific PCR
- MSRE-PCR Methyl-sensitive restriction enzyme PCR
- MethyLight MethyLight
- MS- SNuPE Methylation-sensitive single nucleotide Primer extension
- COBRA Combined bisulfite restriction analysis
- MS-HRM Methylation sensitive-high resolution melting
- MS-MLPA Methylation-specific multiplex ligationdependent probe amplification
- Mass ARRAY EpiTYPER Restriction landmark genomic scanning (RLGS), Differential methylation hybridization (DMH), Methylated DNA immunoprecipitation and microarray chip (MeDIPchip), Bead arrays (lllumina) Bisulfite, Whole-genome bisulfite sequencing, Single molecule real time (SMRT) sequencing and MethylCap sequencing; cf. Syedmoradi et al, 2016, Royal Soc of Chem. (DOI: 10.1039/c6an01649a).
- the methylation status is determined by use of methylation-sensitive restriction enzymes. Many restriction enzymes are sensitive to the DNA methylation states.
- Cleavage can be blocked or impaired when a particular base in the recognition site is modified.
- the MspJI family of restriction enzymes has been found to be dependent on methylation and hydroxymethylation for cleavage to occur. These enzymes excise ⁇ 32 base pair fragments containing a centrally located 5-hmC or 5-mC modified residue that can be extracted and sequenced. Due to the known position of this epigenetic modification, bisulfite conversion is not required prior to downstream analysis.
- Methylation-sensitive enzymes are well-known in the art and include:
- the digested nucleic acid sample is subsequently analysed by for example gel electrophoresis.
- methylation status is determined by a method comprising the steps of
- a sample such as a lung tissue sample or a blood or plasma sample from said subject comprising nucleic acid material comprising said gene
- processing said nucleic acid sequence using one or more methylation- sensitive restriction endonuclease enzymes, iii) optionally, amplifying said processed nucleic acid sequence in order to obtain an amplification product
- the methodology employed for determining methylation status is determined by a method, which comprises at least the steps of modifying the DNA with an agent which targets either methylated or unmethylated sequences, amplifying the DNA, and analysing the amplification products.
- amplification product is analysed by detecting the presence or absence of amplification product, wherein the presence of amplification product indicates that the target nucleic acid has not been cleaved by the restriction enzymes, and wherein the absence of amplification product indicates that the target nucleic acid has been cleaved by the restriction enzymes.
- methylation status is determined by a method comprising the steps of
- a sample such as a lung tissue sample or a blood or plasma sample from said subject comprising nucleic acid material comprising a gene locus of the invention
- the method comprises the steps of
- a sample such as a lung tissue sample, from said subject comprising nucleic acid material comprising said gene locus
- the amplification product can be analysed for nucleic acid substitutions resulting from conversion of modified cytosine residues, preferably wherein the presence of converted cytosine residues are indicative of unmethylated cytosine residues, and presence of unconverted cytosine residues is indicative of methylated cytosine residues.
- unmethylated cytosine is converted to thymidine after bisulphite treatment and amplification, while methylated cytosine is left unchanged after same treatment.
- the amplification product is analysed by melting curve analysis; cf. herein below.
- the amplification product, the amplicon is in a preferred embodiment a genetic region of a gene of the invention, wherein said region is delineated by the primer pairs identified in table B for each respective gene; i.e.
- For SIM1 primers SEQ ID NO: 51 and/or 52;
- HIST1 H3G/HIST1 H2BI primers SEQ ID NO: 47 and/or 48;
- HIST1 H3E primers SEQ ID NO: 71 and/or 72.
- the method for determining methylation status in the present invention preferably comprise a step of modifying the nucleic acids comprised in the sample, or extracted from the sample, using an agent which specifically modifies unmethylated cytosine in the nucleic acid.
- modify refers the specific modification of either an unmethylated cytosine or a methylated cytosine, for example the specific conversion of an unmethylated cytosine to another nucleotide which will distinguish the modified unmethylated cytosine from a methylated cytosine.
- an agent modifies unmethylated cytosine to uracil.
- Such an agent may be any agent conferring said conversion, wherein unmethylated cytosine is modified, but not methylated cytosine.
- the agent for modifying unmethylated cytosine is sodium bisulfite.
- Sodium bisulfite (NaHS0 3 ) reacts readily with the 5,6-double bond of cytosine, but only poorly with methylated cytosine.
- the cytosine reacts with the bisulfite ion, forming a reaction intermediate in the form of a sulfonated cytosine which is prone to deamination, eventually resulting in a sulfonated uracil.
- Uracil can subsequently be formed under alkaline conditions which removes the sulfonate group.
- uracil will by the Taq polymerase be recognised as a thymidine.
- the product upon PCR amplification of a Sodium bisulfite modified nucleic acid contains cytosine at the position where a methylated cytosine (5- methylcytosine) occurred in the starting template DNA of the sample.
- the product upon PCR amplification of a Sodium bisulfite modified nucleic acid contains thymidine at the position where an unmethylated cytosine (5-methylcytosine) occurred in the starting template DNA of the sample.
- an unmethylated cytosine is converted into a thymidine residue upon amplification of a bisulfite modified nucleic acid.
- the nucleic acids are modified using an agent which modifies unmethylated cytosine in the nucleic acid.
- an agent is a bisulfite, hydrogen sulfite, and/or disulfite reagent, for example sodium bisulfite.
- an agent is used, which specifically modifies methylated cytosine in the nucleic acid and does not modify unmethylated cytosine.
- the specific genetic region selected for determination of methylation status is preferably amplified in order to generate and thereby obtain multiple copies (amplicons) of the respective genetic regions, which can allow its further analysis with respect to methylation status.
- the amplification is preferably preformed using at least one oligonucleotide primer, which targets the specific genetic region comprising methylation markers for non-small cell lung cancer according to the present invention. Most preferably amplification is performed using two oligonucleotide primers, which delineates the analysed region. The skilled person may use his common general knowledge in designing suitable primers.
- At least one, and preferably two methylation-independent oligonucleotide primers are employed for amplification of the modified nucleic acid.
- the nature of methylation-independent primers is described on more detail herein below.
- the amplifying step is a polymerisation reaction wherein an agent for polymerisation is involved, effecting an oligonucleotide primer extension.
- the agent for polymerization may be any compound or system which will function to accomplish the synthesis of primer extension products, including enzymes. Enzymes that are suitable for this purpose include, for example, E. coli DNA polymerase I, Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available DNA polymerases, polymerase muteins, reverse transcriptase, and other enzymes, including heat-stable enzymes (i.e., those enzymes which perform primer extension after being subjected to temperatures sufficiently elevated to cause denaturation also known as Taq polymerases).
- Suitable enzymes will facilitate combination of the nucleotides in the proper manner to form the primer extension products which are complementary to each locus nucleic acid strand.
- the synthesis will be initiated at the 3' end of each primer and proceed in the 5' direction along the template strand, until synthesis terminates, producing molecules of different lengths.
- a preferred method for amplifying the modified nucleic acid by means of at least one methylation-independent oligonucleotide primer is by the polymerase chain reaction (PCR), as described herein and as is commonly used by those skilled in the art.
- PCR polymerase chain reaction
- PCR amplification requires a set of oligonucleotide primers, one forward primer and one reverse primer.
- the forward primer is a methylation independent primer.
- the reverse primer is in another embodiment a methylation independent primer.
- both reverse and forward primer may be methylation independent oligonucleotide primers according to the definitions herein.
- the amplification product may be of any length, however in one preferred embodiment, the amplification product comprise between 15 and 1000 nucleotides, such as between 15 and 500 nucleotides, such as between 50 and 120 nucleotides, preferably between 80 and 100 nucleotides.
- the amplicon is delineated by the primers identified in table B for each respective gene, cf. herein above.
- the PCR reaction is characterised by three steps a) melting a nucleic acid template, b) annealing at least one methylation-independent oligonucleotide primer to said nucleic acid template, and c) elongating said at least one methylation-independent
- the melting of a CpG-containing nucleic acid template may also be referred to as strand separation. Melting is necessary where the target nucleic acid contains two complementary strands bound together by hydrogen bonds. This strand separation can be accomplished using various suitable denaturing conditions, including physical, chemical, or enzymatic means.
- One physical method of separating nucleic acid strands involves heating the nucleic acid until it is denatured. The denaturation by heating is the preferred procedure for melting in the present invention. Heat denaturation involves temperatures ranging from about 60 degrees Celsius to 100 degrees Celsius. The time for melting may be in the range of 5 seconds to 10 minutes or even longer for initial melting of the template.
- the melting temperature is typically between 80 and 90 degrees Celsius, such as at least 81 , for example at least 82, such as at least 84, preferably at least 85, at least 86, such as at least 87, for example at least 88 degrees Celsius.
- the PCR reaction mixture is incubated at the melting temperature for at least 5 seconds, alternatively at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or at least 100 seconds.
- Separated strands are used as a template for the synthesis of additional nucleic acid strands. It is understood that the separated strands may result from the separation of complementary strands in an originally double stranded nucleic acid. However, separated strands originally single stranded are also used as templates according to the present invention.
- the synthesis of additional nucleic acid strands is performed under conditions that allow the hybridisation of oligonucleotide primers to templates. Such a step is herein referred to as annealing.
- the oligonucleotide primers form hydrogen bonds with the template.
- the annealing temperature is between 40 and 75 degrees Celsius, such as at least 40, at least 45, for example at least 50, at least 52, at least 54, at least 56, at least 57, at least 58, at least 59 preferably at least 60, at least 61 , at least 62, at least 63, at least 64, at least 65, at least 66, at least 67, for example at least 68, at least 69, at least 70, at least 72, at least 73, at least 75 degrees Celsius.
- the PCR reaction mixture is incubated at the annealing temperature for 1 to 100 seconds, such as at least 1 , at least 2, at least 3, at least 4, preferably at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, alternatively at least 1 1 , at least 13, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or at least 100 seconds.
- the annealing temperature is between at least 15 degrees Celsius above the optimal annealing temperature, such as at least 1 , at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 1 1 , at least 12, at least 13, at least 14, at least 15 degrees Celsius above the optimal annealing temperature.
- the optimal annealing temperature can be calculated by standard algorithms, as known to people skilled within the art.
- the optimal primer annealing temperature (Tm) is calculated as:
- the annealing temperature should be empirically determined in respect of each specific primer. The modulation of the annealing temperature is used to adjust hybridization stringency as described elsewhere herein. Thus, the optimal annealing temperature should be set at a level, wherein the PCR bias towards amplification of unmethylated nucleic acid template is balanced by the less efficient annealing of methylation-independent oligonucleotide primer according to the present invention to unmethylated nucleic acid target sequence.
- the choice of annealing temperature depends on the sensitivity of the assay, and the composition of the sample with respect to the relative levels of methylation positive and methylation negative alleles. Thus, optimal annealing temperatures should preferably be determined for each sample. However, in one embodiment, the annealing temperature in respect of specific methylation-independent oligonucleotide primer according to the present invention is as specified below for each assay; cf. sequences.
- the oligonucleotide primer is e.g. SEQ ID NO: 51 and 52 (assay 8), and the annealing temperature is 55°C.
- specific annealing temperatures as well as PCR cycling conditions for preferred oligonucleotide primers of the invention is indicated below for each assay.
- the oligonucleotide primers annealed to the template is elongated to form an amplification product.
- the elongating temperature depends on optimum temperature for the polymerase, and is usually between 30 and 80 degrees Celsius. Typically, the elongating temperature is between 60 and 80 degrees Celsius, such as at least 60, at least 65, at least 68, at least 69, at least 70, preferably at least 71 , at least 72, at least 73, at least 74, alternatively at least 75, at least 76, at least 77, at least 78, at least 79, at least 80 degrees Celsius.
- the PRC reaction mixture is incubated at the elongating temperature for 1 to 100 seconds, such as at least 1 , at least 2, at least 3, at least 4, preferably at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, alternatively at least 1 1 , at least 13, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or at least 100 seconds.
- Elongation occurs in a buffered aqueous solution, preferably at a pH of 7-9.
- the two oligonucleotide primers are added to the reaction mixture in a molar excess of primer: template especially when the template is genomic DNA which will ensure an improved efficiency.
- Deoxyribonucleoside triphosphates dATP, dCTP, dGTP, and dTTP are added to the reaction mixture, either separately or together with the primers.
- An appropriate agent for effecting the primer extension reaction referred to and described elsewhere herein as an agent for polymerization is added to the reaction mixture. It is appreciated by a person skilled in the art that for PCR the agent for polymerisation preferable is a heat-stable polymerase enzyme, such as Taq polymerase.
- the PCR method comprises incubating the nucleic acid at a cycle of different specific temperatures in order to control the steps of amplification.
- the amplification buffer and polymerase required for PCR are well known to people of skill within the art.
- the PCR reaction mixture is incubated sequentially at the melting temperature, the annealing temperature and the elongating temperature, respectively, for a number of cycles.
- the PCR reaction may run between 10 and 70 cycles.
- the PCR reaction run between 25 and 55 cycles, such as at least 25, at least 30, at least 35, at least 40, preferably at least 45, at least 50 or at least 55 cycles.
- cycles of melting, annealing and elongation consist of 10-80, 10- 80 and 10-80 seconds, respectively.
- Optimal cycling intervals are easily determined by those of skill in the art. Specific embodiments of cycle intervals for melting, annealing and elongation are indicated in for each assay below together with preferred annealing temperature for the respective primers; i.e.:
- Assay 1 Primers SEQ ID NO: 37 and 38: 1 ,1 ,1 minutes, respectively;
- Assay 18 Primers SEQ ID NO: 71 and 72: 1 ,1 ,1 minutes, respectively;
- PCR can be performed on a PCR machine, which is also known as a thermal cycler.
- the thermal cycler may be coupled to a fluorometer, thus allowing the monitoring of the nucleic acid amplification in real time by use of intercalating fluorescent dyes, or other fluorescent probes.
- Applicable dyes according to the present invention include any DNA intercalating dye.
- Suitable dyes include ethidium bromide, EvaGreen, LC Green, Syto9, SYBR Green, SensiMix HRMTM kit dye, however many dies are available for this same purpose.
- Real-time PCR allows for easy performance of quantitative PCR (qPCR), which is usually aided by algorithms comprised in the software, which is usually supplied with the PCR machines.
- the fluorometer can furthermore be equipped with software that will allow interpretation of the results.
- software for data analyses may also be supplied with the kit of the present invention.
- Another variant of the PCR technique multiplex PCR, enables the simultaneous amplification of many targets of interest in one reaction by using more than one pair of primers.
- PCR according to the present invention comprise all known variants of the PCR technique known to people of skill within the art.
- the PCR technology comprise real-time PCR, qPCR, multiplex PCR.
- the oligonucleotide primer employed for amplification of modified nucleic acid is preferably a methylation-independent primer.
- methylation-independent primer refers to an oligonucleotide primer, which is capable of hybridizing to both methylated and unmethylated nucleic acid alleles and modified as well as unmodified alleles.
- a methylation-independent primer may not anneal with the exact same affinity to methylated/unmethylated nucleic acid alleles or modified/unmodified alleles.
- the oligonucleotide primers of the present invention are capable of being employed in amplification reactions, wherein the primers are used in amplification of template DNA originating from either a methylation positive or amethylation negative strand.
- the preferred methylation-independent primers of the present invention comprise at least one CpG dinucleotide, as described below. Accordingly, in a methylation positive and bisulfite modified nucleic acid target sequence, the primer sequence will anneal to the nucleic acid template with a perfect match, wherein all of the nucleotides in a consecutive region of the primer forms base pairs with a complementary region in the nucleic acid target.
- the methylation-independent primers of the present invention will anneal to the nucleic acid template with an imperfect match, wherein the primer sequence comprise a mis-match (i.e. the primer and template does not form base pairs) at the position of the unmethylated Cytosine at a CpG site in the nucleic acid template.
- the primer sequence comprise a mis-match (i.e. the primer and template does not form base pairs) at the position of the unmethylated Cytosine at a CpG site in the nucleic acid template.
- the primers of the present invention are methylation-independent, the primers will hybridize to both methylation negative and methylation positive nucleic acid sequences after bisulfite modification, and the primers will form a perfect match with the target sequence of a methylated nucleic acid target and an imperfect match, where the primers and target nucleic acid sequence does not form base pairing at the positions of unmethylated Cytosine (which is converted by bisulfite to Uracil) at CpG sites.
- the methylation-independent primers of the present invention will, due to the mismatch after bisulfite modification at positions of unmethylated cytosine of a CpG-site in the nucleic acid target sequence, hybridize less efficiently to a methylation negative nucleic acid sequence.
- the methylation-independent primers of the present invention are able to anneal to the nucleic acid target, also when the nucleic acid target comprise unmethylated CpG- sites, which have been modified by for example bisulfite treatment.
- the stringency is reduced by reducing the annealing temperature as described elsewhere herein.
- oligonucleotide primers suitable for nucleic acid amplification techniques such as PCR
- the design of such primers involves analysis of the primer's melting temperatures and ability to form duplexes, hairpins or other secondary structures. Both the sequence and the length of the oligonucleotide primers are relevant in this context.
- oligonucleotide primers according to the present invention comprise between 10 and 200 consecutive nucleotides, such as at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 1 10, at least 120, at least 130, at least 140, at least 150, at least 160, at least 180 or at least 200 nucleotides.
- the oligonucleotide primers comprise between 15 and 60 consecutive nucleotides, such as 15, 16, 17, 18, 19, 20, preferably 21 , 22, 23, 24, 25, 26, 27, 28, 29, 30, such as 31 , 32, 33, 34, 35, 36, 37, 38, 39, 40, alternatively at least 41 , at least 42, at least 44, at least 46, at least 48, at least 50, at least 52, at least 54, at least 56, at least 58, or at least 60 consecutive nucleotides.
- the methods employed for determining the methylation status of a nucleic acid according to the present invention preferably comprise amplification of a modified nucleic acid by use of a methylation independent oligonucleotide primer.
- the oligonucleotide primers of the present invention are able to hybridize to a nucleic acid sequence comprising CpG islands.
- at least one of the oligonucleotide primers according to the present invention comprises at least one CpG dinucleotide.
- the oligonucleotide primers comprise 2, alternatively 3, 4, 5, 6, 7, 8, 9 or 10 CpG
- the oligonucleotide primers of the present invention comprise at least 10 CpG dinucleotides.
- the at least one methylation-independent oligonucleotide primer comprises one CpG dinucleotide at the 5 '-end of the primer.
- the CpG dinucleotide may be located anywhere within the oligonucleotide primer sequence. However, in a preferred embodiment of the present invention, the at least one CpG dinucleotide is located in the 5'-end of the oligonucleotide primer.
- the at least one CpG dinucleotide constitute the first two nucleotides of the 5'-end.
- the at least one CpG dinucleotide is located within the first 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides of the 5'-terminus.
- the at least one CpG dinucleotide is located within the first 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or 120 nucleotides of the 5'-terminus.
- at least two CpG dinucleotides are located within the first 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides of the 5'-terminus, or at least two
- the primers of the present invention may in one preferred embodiment comprise at least one CpG site, whereby annealing with a higher efficiency to a methylated than to an unmethylated template upon modification of unmethylated cytosine is achieved.
- the primers of the present invention comprise at least one CpG site. However, the primers comprise also for example two CpG sites. The at least one CpG site is positioned in the 5' end of the primer.
- the CpG site is introduced immediately after the first nucleotide of the 5' end of the primer.
- Specific hybridization typically is accomplished by a primer having at least 10, for example at least 12, such as at least 14, for example at least 16, such as at least 18, for example at least 20, such as at least 22, for example at least 24, such as at least 26, for example at least 28, or such as at least 30 contiguous nucleotides, which are complementary to the target template. Often the primer will be close to 100% identical to the target template.
- the primer may also be 98% identical to the target template or for example at least 97%, such as at least 96%, for example at least 95%, such as at least 94%, for example at least 93%, such as at least 92%, for example at least 91 %, such as at least 90%, for example at least 89%, such as at least 88%, for example at least 87%, such as at least 86%, for example at least 85%, such as at least 84%, for example at least 83%, such as at least 82%, for example at least 81 %, such as at least 80%, for example at least 79%, such as at least 78%, for example at least 77%, such as at least 76%, for example at least 75%, such as at least 74%, for example at least 73%, such as at least 72%, for example at least 71 %, such as at least 70%, for example at least 68%, such as at least 66%, for example at least 64%, such as at least 62% or for example at least 60% identical to the target template.
- the primer may also contain additional nucleotide residues that do not interfere with hybridization but may be useful for other manipulations.
- residues may be sites for restriction endonuclease cleavage, for ligand binding or for factor binding or linkers.
- the methylation-independent oligonucleotide primer of the present invention is designed to hybridize to nucleic acids in a sample.
- the nucleic acids sample are treated with an agent, which modifies unmethylated cytosine in said nucleic acid.
- an agent which modifies unmethylated cytosine in said nucleic acid.
- any unmethylated Cytosine of CpG dinucleotides comprised in the nucleic acid are converted to Uracil as explained elsewhere herein. Consequently, in primers comprising a CpG dinucleotide, designed to hybridize with the complementary CpG dinucleotide of the nucleic acid of the sample, the CpG dinucleotide will only hybridize to the methylated CpG dinucleotide fraction of the nucleic acid.
- Cytosine are modified to uracil which does not hybridize with the CpG dinucleotide of the oligonucleotide primer.
- methylation-independent oligonucleotide primers are designed to comprise sufficient nucleotides for specific hybridization to the target nucleic acid sequence regardless of its original methylation status.
- the oligonucleotide primers also comprise one or more CpG
- oligonucleotide primers can still be functionally used for amplification of both originally methylated and unmethylated nucleic acids.
- the CpG dinucleotides are typically comprised in the 5'-terminus of the oligonucleotide primers, as described elsewhere herein.
- a primer-template mismatch within the 5'-terminus of the primer usually allow the primers to hybridize with the target nucleic acid, and still function as primers in an amplification reaction.
- the presence of one or more mismatches between the primer and template affects the optimal annealing temperature of said oligonucleotide primer for use in amplification reactions.
- the PCR bias towards amplification of unmethylated alleles of a nucleic acid template is reversed by amplification of said nucleic acid template at a relatively higher annealing temperature, which favours oligonucleotide primer binding and priming of the methylated allele.
- a relatively higher annealing temperature which favours oligonucleotide primer binding and priming of the methylated allele.
- annealing temperature Besides annealing temperature, other factors also affect hybridisation to a target sequence of a methylation-independent primer. At highly stringent conditions, hybridization between perfect matching primer and target sequences are favoured, such as hybridization between a methylation-independent primer according to the present invention and a methylated target sequence upon cytosine modification. Less stringent conditions will tend to favour oligonucleotide primer binding, priming and amplification of the unmethylated allele. Modulation of temperature is one way of adjusting the stringency of hybridization, but the stringency of hybridization may also be modulated by adjusting buffer composition, and/or salt concentrations in the
- the present invention comprises any such method of modulating hybridization stringency to balance the PCR bias towards amplification of unmethylated template.
- modulation of temperature is preferred.
- the oligonucleotide primer of the present invention is selected from the group consisting of SEQ ID NO: 37-72.
- Methylation status is preferably determined for a gene mentioned in table B using the respective forward primer and reverse primer identified in table B; i.e.
- HIST1 H3G/HIST1 H2BI primers SEQ ID NO: 47 and/or 48;
- HIST1 H3E primers SEQ ID NO: 71 and/or 72.
- an oligonucleotide primer of the present invention specifically hybridizes to regions within 1 kb of the gene loci of the present invention.
- the oligonucleotide primers hybridize to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI,
- HIST1 H2AJ/HIST1 H2BM HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), or the complement thereof.
- the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- HIST1 H3G/HIST1 H2BI HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), or the complement thereof.
- the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21.1 ) and/or
- HOXB3/HOXB4 or the complement thereof.
- the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or HOXB3/HOXB4, or the complement thereof.
- the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI and/or Chr6(p22.1 ), or the complement thereof.
- the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 and HIST1 H3G/HIST1 H2BI; or the group consisting of SIM1 and Chr6(p22.1 ); or the group consisting of HIST1 H3G/HIST1 H2BI and Chr6(p22.1 ), or the complement thereof.
- the oligonucleotide primer hybridizes to a target nucleic acid sequence of SIM1 and/or Chr6(p22.1 ) and/or
- the at least one oligonucleotide primer hybridizes to a target nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 -18, or the complement thereof (non-modified strand); and/or the oligonucleotide prime hybridizes to a target nucleic acid sequence selected from the group consisting of SEQ ID NO: 19-36, or the complement thereof (modified strand).
- an oligonucleotide primer of the present invention specifically comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a subsequence of a gene loci selected from the group consisting of SIM1 ,
- HOXB3/HOXB4 OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- the present invention relates to oligonucleotide primer pairs, which span or comprise at least one CpG dinucleotide in a gene locus of the invention.
- the term "span” as used in this context is meant to indicated the at least one CpG site is located in the nucleic acid region between the primer pairs; i.e. the amplified nucleic acid region comprise at least one CpG dinucleotide.
- the term “comprising" as used in connection with "primers comprising at least one CpG dinucleotide is meant to specify that the oligonucleotide primer itself comprise a CpG site.
- the oligonucleotide primers comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- the oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21 .1 ) and/or HOXB3/HOXB4, or the complement thereof.
- the oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of SIM1 ,
- the oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of SIM1 ,
- HIST1 H3G/HIST1 H2BI Chr6(p22.1 ) and/or the complement thereof.
- the at least one oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 -18, or the complement thereof (non-modified strand); and/or the oligonucleotide prime comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence selected from the group consisting of SEQ ID NO: 19-36, or the complement thereof (modified strand).
- methylation status is preferably determined by amplifying at least one portion of a gene loci selected from SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI,
- HIST1 H2AJ/HIST1 H2BM HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), using at least one primer pair selected from the nucleic acid sequences set forth in table B (SEQ ID NO: SEQ ID NO: 37/38, 39/40, 41/42, 43/44, 45/46, 47/48, 49/50, 51/52, 53/54, 55/56, 57/58, 59/60, 61/62, 63/64, 65/66, 67/68, 69/70 and 71/72, respectively).
- Detection of an amplification product can be performed by hybridizing the amplification product to an oligonucleotide probe, as described below.
- methylation status is determined by amplifying at least one portion of the respective at least one gene loci, and further employing at least one oligonucleotide probe which hybridizes to an amplification product selected from the group consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
- the oligonucleotide probe comprise 10-100 consecutive nucleic acids selected from the group of sequences consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
- One aspect of the invention also relates to the use of oligonucleotide primers of the present invention for determining or prognosing a non-small cell lung cancer, determining a predisposition to non-small cell lung cancer, categorizing or predicting non-small cell lung cancer, or evaluating the risk of contracting a non-small cell lung cancer.
- the present invention provides a use of oligonucleotide primers comprising a subsequence of a loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
- HIST1 H3G/HIST1 H2BI HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ) or the complement thereof for diagnosing non- small cell lung cancer in a method of the invention as defined elsewhere herein.
- the primers are selected from the group set forth in table B (SEQ ID NO: SEQ ID NO: 37-72).
- the oligonucleotide primers comprise a sequence selected from the group consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
- the oligonucleotide primers comprising a subsequence selected from a gene loci selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21 .1 ) and/or HOXB3/HOXB4, or the group consisting of SIMI , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or
- the nucleic acid (target) sample is subjected to an agent that converts an unmethylated cytosine to another nucleotide which will distinguish the unmethylated from the methylated cytosine.
- the agent modifies unmethylated cytosine to uracil.
- the modifying agent can be sodium bisulphite.
- uracil will be converted to thymidine.
- the subsequent PCR amplification converts uracils to thymine.
- G:C base pairs are converted to A:T base pairs at positions, where the cytosine was methylated.
- the difference in nucleic acid sequence at previously methylated (methylation positive) or unmethylated (methylation negative) cytosines allows for the analysis of methylation status in a sample.
- This analysis can comprise identifying cytosine residues, which have been converted to thymidine after amplification, as unmethylated cytosine residues, and identifying cytosine residues, which has not been converted under as methylated cytosine residues.
- the method for determining methylation status of a nucleic acid according to the present invention further comprises a step of analyzing the amplified nucleic acids.
- the subsequent analysis can be selected from the group consisting of melting curve analysis, high resolution melting analysis, nucleic acid sequencing, primer extension, denaturing gradient gel electrophoresis, southern blotting, restriction enzyme digestion, methylation-sensitive single-strand conformation analysis (MS- SSCA) and denaturing high performance liquid chromatography (DHPLC).
- the methylation status of the amplified containing nucleic acid is determined by any method selected from the group consisting of Methylation-Specific PCR (MSP), Whole genome bisulfite sequencing (BS-Seq), HELP assays, ChlP-on- chip assays, Restriction landmark genomic scanning, Methylated DNA
- MSP Methylation-Specific PCR
- BS-Seq Whole genome bisulfite sequencing
- HELP assays HELP assays
- ChlP-on- chip assays ChlP-on- chip assays
- Restriction landmark genomic scanning Methylated DNA
- MeDIP Methyl Sensitive Southern Blotting
- the methylation status of the amplified containing nucleic acid is determined by a method selected from the group consisting methylation specific PCR, bisulfite sequencing, COBRA, melting curve analysis, or DNA methylation arrays.
- the analysis of the amplified nucleic acid region is melting curve analysis.
- the analysis of the amplified nucleic acid is high resolution melting analysis (HRM).
- Melting curve analysis or high resolution melting analysis exploits the fact that methylated and unmethylated alleles are predicted to differ in thermal stability because of the difference in GC contents after bisulphite treatment and PCR-amplification, which converts methylated C:G base pairs to A:T base pairs. This means that the melting curve profile of methylated (methylation positive) and unmethylated
- (methylation negative) alleles of PCR products originating from bisulfite modified methylated and unmethylated can be distinguished.
- the level of fluorescence changes, depending on the relative amount template; i.e. the relative amount of methylation positive and methylated negative alleles.
- the relative amount of methylation positive and methylation negative alleles of the unknown sample can be determined.
- the melting curve profile of an amplification product according to the present invention is determined by the composition of methylated and unmethylated alleles in the nucleic acid sample. If the nucleic acid molecules of a sample are all methylation negative, all cytosines are converted to thymines, and the resulting PCR product will have a relatively low melting temperature compared to a methylated nucleic acid, which can be seen in its melting curve. If on the other hand, the nucleic acids comprised in the sample are methylation positive, the melting temperature of the PCR product will be relatively higher, and the melting curve is shifted, as fluorescence is observed at higher temperatures.
- the nucleic acid sample comprises a mixture of methylated and unmethylated alleles
- bisulphite treatment followed by amplification will result in two distinct amplification products.
- the unmethylated alleles will display a low melting temperature and the methylated alleles a high melting temperature, and the melting curve profile of such a sample shows fluorescence from both PCR products
- the amplification product represents a pool of molecules, which are present in different cells of the tumor, with different melting temperatures, which leads to an overall intermediate melting temperature.
- Melting curve analysis is performed by incubating the nucleic acid amplification product at a range of increasing temperatures.
- the temperature is increased from a starting temperature of at least 50 degrees Celsius, alternatively at least 55, at least 60, at least 62, at least 64, preferably at least 65, at least 66, at least 67, at least 68, at least 69, at least 70, at least 71 , at least 72, at least 73, at least 74, at least 75, for example at least 76, at least 78, at least 80, at least 82, at least 84 degrees Celsius.
- the temperature is then increased to a final temperature of at least 70, at least 72, at least 74, at least 76, at least 78, at least 80, at least 82, at least 84, at least 86, preferably at least 88, at least 89, at least 90, at least 91 , at least 92, at least 93, at least 94, at least 95, at least 96, at least 97, at least 98, at least 99, at least 100 degrees Celsius.
- a final temperature of at least 70, at least 72, at least 74, at least 76, at least 78, at least 80, at least 82, at least 84, at least 86, preferably at least 88, at least 89, at least 90, at least 91 , at least 92, at least 93, at least 94, at least 95, at least 96, at least 97, at least 98, at least 99, at least 100 degrees Celsius.
- the temperature transitions from the starting temperature to the final temperature are a linear function of time.
- the linear transitions are at least 0.05 degrees Celsius per second, alternatively at least 0.01 , at least 0.02, at least 0.03, at least 0.04, at least 0.06, at least 0.07, at least 0.08, at least 0.09, at least 0.1 , at least 0.2, at least 0.3, at least 0.4, at least 0.5, at least 0.6, at least 0.7, at least 0.8, at least 0.9, at least 1.0, at least 1.1 , at least 1.2, at least 1.3, at least 1.4, at least 1.5, at least 1 .6, at least 1 .7, at least 1.8, at least 1.9, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 degrees Celsius per second.
- the melting curve analysis is performed by incubating the nucleic acid amplification product at increasing temperatures, from 70 to 95 degrees Celsius
- the melting of the nucleic acid can be measured by a number of methods, which are known to people within skill of the art.
- One method involves use of agents, which fluoresce when bound to a nucleic acid in its double stranded conformation.
- agents include fluorescent probes or dyes, such as ethidium bromide, EvaGreen, LC Green, Syto9, SYBR Green, SensiMix HRMTM kit dye.
- the melting curve analysis is performed by measurement of fluorescence.
- the melting of the nucleic acid amplification product according to the present invention can then be monitored as a decrease in the level of fluorescence from the sample. After measurement of the fluorescence the melting curves can be generated by plotting fluorescence as a function of temperature.
- the melting curves for data collected in HRM can be normalized, as described in the examples of the present invention.
- Such normalization methods are known to people of skill in the art.
- One preferred means of normalization include calculation of the 'line of best fit' in between two normalization regions before and after the major fluorescence decrease representing the melting of the amplification product.
- the 'line of best fit' is a statistical measure, designating a line plotted on a scatter plot of data (using a least-squares method) which is closest to most points of the plot. Calculation of the line of best fit is performed differentially on LightCycler and
- a platform with a combined thermal cycler and a fluorescence detector is ideal to perform in-tube melting analyses.
- the melting curve analysis is performed on a thermal cycler coupled to a fluorometre, such as the Ligthcycler, LC480 (Roche) or the Rotorgene 6000 (Corbett Research).
- a fluorometre such as the Ligthcycler, LC480 (Roche) or the Rotorgene 6000 (Corbett Research).
- the measurement of fluorescence corresponding to the melting of the double stranded nucleic acid template, can be monitored in real time.
- the melting curve analysis is performed immediately after amplification. This allows an in-tube
- methylation assay wherein the amplification and melting curve analysis is performed sequentially without transferring the sample from the tube. This procedure reduces the risk of contamination of the sample as a result from handling during the methylation assay.
- Melting curve analysis allows the determination of the relative amount of methylated nucleic acid in a sample.
- the relative amount of methylated CpG- containing nucleic acid can be estimated.
- the present invention relates to a method, wherein the relative amount of methylated nucleic acid is estimated by comparison the melting curve of at least one standard sample comprising said nucleic acid with a control level of methylation.
- said standard sample comprise any combination of methylated and unmethylated nucleic acid.
- said standard sample comprise 100% methylated nucleic acid. In another specific embodiment, said standard sample comprise 100% unmethylated nucleic acid. In yet another specific embodiment, said standard sample comprise 50% methylated nucleic acid and 50% unmethylated nucleic acid. In even another specific embodiment, said standard sample comprise 0.1 % 0.5%, 1 %, 2%, 3%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 100% methylated nucleic acid.
- the relative amount of methylated nucleic acid in the nucleic acid sample is between 40-60%. In another embodiment, the relative amount of methylated nucleic acid in the nucleic acid sample is below 50%. In yet another embodiment, the relative amount of methylated nucleic acid in the nucleic acid sample is below 10%, below 1 % or below 0.1 %.
- the term "presence of methylation” and/or the term “methylation status” as used herein includes the relative amount of methylated nucleic acid in the nucleic acid sample of at least 0.1 %, such as at least 1 %, for example at least 10%, such as at least 20%, for example at least 30%, such as at least 50%, for example at least 70%, such as at least 90%, or for example at least 99%.
- the relative amount of that specific allele (methylation positive or methylation negative) in the unknown sample is also higher than the relative amount of that allele in the standard sample.
- the relative level of menthylation positive alleles in the unknown sample can be inferred to be around 80%.
- the amount of methylation positive alleles can be inferred to be less than 80%, and if the fluorescence is higher than the standard, then the unknown sample comprise more than 80% methylation positive alleles.
- the level of methylated alleles of the unknown sample can be inferred.
- a higher fluorescence level at the peak melting temperature of the amplified nucleic acid sample than of a standard sample comprising a specific allele is indicative of a higher relative amount of that specific allele in that sample than in the standard sample.
- a lower fluorescence level at the peak melting temperature of the amplified nucleic acid sample than of the standard sample comprising a specific allele is indicative of a lower relative amount of that specific allele in that sample than in the standard sample.
- the "peak melting temperature” is mathematical a derivative of melting curve and refers to the temperature at which the largest discrete melting step occurs.
- the top of the peak corresponds to the major drop in fluorescence on the melting curve.
- the level of fluorescence at the peak melting temperature reflects the level of methylation for a given amplicon.
- two amplicons may have peak melting temperatures of for example be 70°C, while having different fluorescence at this temperature, which then reflects that the amplicons have different methylation levels.
- the peak melting temperature corresponds to the highest level of the negative derivative of fluorescence (-dF/dT) over temperature versus temperature (T).
- a nucleic acid sample subjected to melting curve analysis may display more than one peak melting temperature.
- the melting curve analysis display at least 1 , 2 or 3 peak melting temperatures.
- the method for analysis of the amplified nucleic acid is sequencing of the nucleic acid.
- nucleic acid sequencing the order of nucleotides (base sequences) in the nucleic acid is determined. Sequencing is usually performed by extending a primer, which anneals to the nucleic acid sequence of interest. The primer is extended by a polymerase in the presence of
- deoxynucleonucleotides Sequencing may also be performed by pyrosequencing.
- 2,3-Dideoxyribose - a deoxyribose sugar lacking the 3 hydroxyl group is incorporated into the extended nucleic acid chain.
- 2,3- Dideoxyribose is incorporated into a nucleic acid chain, it blocks further chain elongation. This method is also known as the Sanger method or chain termination method.
- the primer is extended in the presence of the normal dNTPs (A, T, G, C) and a small amount of 2,3-DideoxyriboseNTPs (ddNTP).
- the reactions are either performed in four separate reactions, one for each of the ddNTPs (ddATP, ddTTP, ddCTP, ddGTP), or in a joint reaction, wherein ddATP, ddTTP, ddCTP and ddGTP are coupled to different fluorescent dyes.
- the primers are then extended to variable lengths, each transcript being terminated upon incorporation of a ddNTP.
- the sequence of the nucleic acid of interest can then by read after denaturing
- the method for analysis of the amplified nucleic acid is primer extension.
- the primer extension method uses primers designed to hybridize with a target.
- the primers may end one base upstream of the position of the putative single nucleotide polymorphism, in this method, the C of a CpG dinucleotide.
- a single chain-ending nucleotide such as a ddNTP, is added.
- the only one of the four nucleotides that will extend the primer is the one that is complementary.
- the identity of the added nucleotide which reflects the methylation status, is determined in a variety of ways known to people of general skill within the art.
- the chain-ending nucleotide may be radioactively labelled or coupled to a fluorescent dye, which can subsequently be identified. Restriction enzyme digestion
- the method according to the present invention for analysis of the amplified nucleic acid is restriction enzyme digestion. Restriction enzymes can be divided into exonucelases and endonucleases. In a specific embodiment, the analysis of the amplified nucleic acid is restriction endonuclease digestion.
- the method of the present invention results in the specific conversion of unmethylated cytosines to thymines, i.e. G:C base pairs are converted to A:T base pairs at positions, where a cytosine was methylated.
- the modified and amplified nucleic acid is analyzed for disruption of a site specific for the endonuclease Acil, BstUI, Hhal, HinPI I, Hpall, HpyCH4IV, Mspl, Taqal, Fnu4HI, Hpy188l, HpyCH4lll, Neil, ScrFI, BssKI, Hpy99l, Nt.CviPII.
- StyD4l Aatll, Accl, Acll, Afel, Afllll, Agel, Aval, Banl, BmgBI, BsaAI, BsaHI, BsaJI, BsaWI, BsiEI, BsiWI, BsoBI, BspDI, BspEI, BsrBI, BsrFI, BssHII, BssSI, BstBI, Btgl, Cac8l, Clal, Eael, Eagl, Fspl, Haell, Hindi, Hpy188lll, Kasl, Mlul, MspAI I, Nael, Narl, NgoMIV, NlalV, Nrul, PaeR7l, Pmll, Pvul, Sacll, Sail, Sfol, Smal, Smll, SnaBI, Tlil, TspMI, Xhol, Xmal, Zral, Rsrll, Ascl, AsiS
- the method for analysis of the amplified nucleic acid is denaturing gradient gel electrophoresis (DGGE).
- DGGE denaturing gradient gel electrophoresis
- the modified and amplified nucleic acid is loaded on a denaturing gel.
- This techniques allows the resolution of nucleic acids with different melting temperatures, which is based on the conversion of C:G base pairs to A:T base pairs, explained elsewhere herein.
- the nucleic acid is subjected to denaturing polyacrylamide gel electrophoresis, wherein the gel contain an increasing gradient of denaturants, such as for example a combination of urea and formamide.
- the increasing denaturant concentration corresponds to increased temperature, and therefore, a gradient of denaturants mimics a temperature gradient within the gel.
- the gel is immersed in an electrophoresis buffer kept at 54-60 degrees Celsius.
- an electrophoresis buffer kept at 54-60 degrees Celsius.
- a nucleic acid molecule reaches a level of denaturant that matches the melting temperature of the lowest melting domain, a partially melted intermediate will be formed that moves very slowly. Small shifts in the melting temperature of the low melting domain induced by differences in G:C content will cause the domain to unwind at different concentrations of denaturant.
- the modified and amplified nucleic acid of the present invention will be retarded at different positions in the gel, providing the basis for physical separation between species with different G:C contents, which is reflective of methylation status.
- the method for analysis of the amplified nucleic acid is Southern blotting.
- the nucleic acid to be analysed are separated by gel electrophoresis and transferred to a nitrocellulose filter, whereto it is immobilized.
- the transferred nucleic acids can be identified by hybridization with specific probes comprising a complementary nucleic acid. After hybridization and removal of excess unbound probe, the amount of hybridized indicate whether the sequence of interest was represented in the nucleic acids immobilized on the nitrocellulose membrane.
- the probes are usually
- MS-SSCA Methylation-sensitive, single-strand conformation analysis
- MS-SSCA is a method of screening for methylation changes.
- MS-SSCA uses single- strand conformation analysis for the screening of an amplified region of bisulfite- modified nucleic acid.
- the amplified products are denatured and electrophoresed on a nondenaturing polyacrylamide gel, whereby the sequence differences between unmethylated and methylated sequences lead to the formation of different secondary structures (conformers) with different mobilities. Once the normal mobility pattern is established, any variation would indicate some degree of methylation.
- DPLC Denaturing high performance liquid chromatography
- DHPLC is yet another technique for methylation screening of bisulfite-modified PCR products.
- DHPLC identifies single nucleotide polymorphisms, which are arise after bisulfite treatment of unmethylated alleles of the CpG containing nucleic acid.
- the optimum temperature for DHPLC can be predicted by the sequence of the fully methylated product. Subsequently, the temperature is verified to obtain tight peaks.
- the retention time of the peak reflects methylation status, because the more unmethylated the target is, the less GC rich the PCR product is and the lower the retention time is.
- kits for the detection of methylation status of a nucleic acid in a sample will typically comprise both a forward and a reverse primer to be used in the amplifying step of the present invention.
- the forward primer, the reverse primer or both may be a methylation-independent oligonucleotide primer as described herein.
- one aspect of the invention relates to a kit for determining non-small cell lung cancer, predisposition to non-small cell lung cancer, or categorizing or predicting the clinical outcome of a non-small cell lung cancer, or monitoring the treatment of a non-small cell lung cancer, and/or monitoring relapse of a previously treated non-small cell lung cancer.
- the kit of the invention comprise
- an agent that (a) modifies methylated cytosine residues but not non- methylated cytosine residues; or (b) modifies non-methylated cytosine residues but not methylated cytosine residues; or (c) modifies a nucleic acid sequence in a methylation-dependent manner, and
- oligonucleotide primers that specifically hybridizes under amplification conditions to a region of a gene locus selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ).
- the agent is preferably a methylation-dependent endonuclease as described elsewhere herein, and/or an agent capable of modifying non-methylated cytosine residues but not methylated cytosine residues, such as a bisulphite compound as decribed elsewhere heren, for example sodium bisulphite.
- the kit preferably comprises at least one oligonucleotide primer of probe of the present invention, as defined herein above.
- the kit comprise at least one oligonucleotide primer selected from the group consisting of SEQ ID NO: 37-72.
- the kit comprises at least one primer pair selected from table B; i.e.
- the kit may also comprise one or more reference sample, in particular reference samples comprising a nucleic acid sequence selected from a gene locus selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ).
- reference sample in particular reference samples comprising a nucleic acid sequence selected from a gene locus selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HO
- the kit may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19- 36 and/or the complement thereof (modified strand).
- the at least one reference sample comprises 100% methylation positive reference nucleic acid, and/or 100% methylation negative reference nucleic acid.
- the kit comprises at least two reference samples, wherein one of said reference samples comprises 100% methylation positive reference nucleic acid and a second reference sample comprises 100% methylation negative reference nucleic acid.
- the methylation positive and methylation negative reference nucleic acids may be mixed, by a person employing the kit, in ratios that are suitable for the detection of methylation in a particular sample. It is understood that reference samples in different ratios of methylation positive to methylation negative CpG-containing nucleic acids may be comprised in the kit.
- the kit may comprise at least one reference sample comprising 50% methylated and 50% non-methylated nucleic acid alleles of the respective genetic locus marker.
- the nucleic acid comprised on the reference sample of the kit is preferably methylated (methylation positive) or non-methylated (methylation negative), and the kit preferably comprise two or more reference samples with different methylation status; i.e. different levels of methylation positive and methylation negative alleles.
- the specific nucleic acid alleles (e.g. alleles of the gene locus OSR1 ) of the reference sample may be unmethylated (0% methylated), 1 %, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 100% methylated.
- the kit may thus comprise one reference sample with a nucleic acid sequence as defined above, which is
- the kit preferably comprises the following combinations of one or more reference samples and primer pairs:
- the kit may also comprise at least one probe. Probes of the invention are defined herein above, and in a preferred embodiment, the kit comprise at least one
- the kit of the invention preferably comprise at least one oligonucleotide probe which hybridizes to a nucleic acid sequence selected from the group consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group of sequences consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
- the kit of the invention preferably comprise at least one oligonucleotide probe which hybridizes to an amplification product generated by a primer pair selected from the group consisting of SEQ ID NO: SEQ ID NO: 37/38, 39/40, 41/42, 43/44, 45/46, 47/48, 49/50, 51/52, 53/54, 55/56, 57/58, 59/60, 61/62, 63/64, 65/66, 67/68, 69/70 and 71/72.
- a primer pair selected from the group consisting of SEQ ID NO: SEQ ID NO: 37/38, 39/40, 41/42, 43/44, 45/46, 47/48, 49/50, 51/52, 53/54, 55/56, 57/58, 59/60, 61/62, 63/64, 65/66, 67/68, 69/70 and 71/72.
- the kit may also comprise additional reagents used in the amplifying step of the detection method as disclosed herein.
- the kit may further comprise
- kits further comprises an agent that modifies unmethylated cytosine nucleotides.
- an agent may for example be bisulfite, hydrogen sulfite, and/or disulfite reagent.
- the kit may also comprise other components suitable for detection of methylation status.
- the kit may comprise a methylation-sensitive restriction enzyme.
- the instructions for performing the method of the kit comprises for example information of particular annealing temperatures to be used for the at least one methylation- independent primers, as well as for example information on PCR cycling parameters.
- the kit may further comprise instructions for the interpretation of the results obtained by the method. For example how to interpret the amplified products subsequently analysed by melting curve analysis or methods as described elsewhere herein.
- the kit may in preferred embodiments further comprise software comprising an algorithm for calculation of primer annealing temperature and interpretation of results.
- kit may be used for evaluating a non-small cell lung cancer in a human subject based on methylation status of specific genes as specified elsewhere herein.
- the invention relates to methylation biomarkers for non-small cell lung cancer.
- Candidate marker genes were first identified by micro array analysis. Then, the ability of each of these methylation markers to distinguish between lung tumour tissue and healthy tissue was evaluated. From this analysis, 19 highly sensitive and specific methylation biomarkers were identified.
- This example shows a genome-wide methylation screening, which identifies novel methylation biomarkers that can be used in clinical lung cancer management. Cancer- specific methylation changes are identified and the sensitivity and specificity of the most promising biomarkers are validated and evaluated. Results
- Methylation 3x720K CpG Island Plus RefSeq Promoter Array which interrogates 15,980 CpG islands and 20,404 reference gene promoter regions .
- a total of 346 probes oligonucleotides of 60 bp spotted on the array
- the mapping of the probes to genomic regions revealed no enrichment bias in any specific parts of the genome.
- DMRs including 07X2, OSR1 and GHSR, have previously been reported differentially methylated in LAC.
- the DMRs were sorted according to the probe with highest differential methylation score in each region and the asterisks denote the DMRs that were selected for further validation.
- the genomic location of the MS-HRM assays is shown in Table 1 and the technical specifications for each assay are indicated for each of the oligonucleotide sequences below.
- the results of the MS-HRM-based methylation assessment are summarized in Table 2 and displayed as stacked bar percentage plots in Fig. 1. Using this approach, we were able to confirm a significant increase in methylated templates in the tumor samples for 15 of the 18 selected DMRs corresponding to the hypermethylation indicated by the array analysis. Normalized melting curves for representative tumor and normal lung samples are shown in Fig.
- an elevated methylation level was detected in 75% of the tumor samples and 0% of the normal lung samples for the HOXB3/HOXB4 MS-HRM assay, as shown in Fig. 1 c, and in 87.9% of the tumor samples and only 3.2% of the normal lung samples for the OSR1 MS-HRM assay shown in Fig. 1 i.
- a high methylation frequency was also observed in the brain and adrenal gland metastases for all 15 confirmed DMRs. For the majority of the DMRs, the detected increase in methylation was even more prominent in the metastases compared to primary tumors, but due to the considerable difference in average tumor content between the primary tumors and metastases samples, these groups are not directly comparable.
- 15 DMRs that can be targeted as novel biomarkers in LAC.
- Sensitivity and specificity are the most important parameters when describing the potential diagnostic applicability of a biomarker. In order to calculate these values, we determined an unambiguous consensus for when a sample was considered methylation positive or negative for each assay. MS-HRM is a semi-quantitative method capable of determining the relative amount of methylated alleles in a sample, and we therefore determined a specific cutoff value for each of the assays based on the relative amount of methylated alleles that is detected. For each potential cutoff value, we calculated the corresponding sensitivity and specificity, and the cutoff was then set to achieve maximal sensitivity without compromising a specificity limit of 0.8.
- the determined cutoff value, sensitivity and specificity for each candidate biomarker are shown in Table 3.
- the cutoff was set at 1 % methylation and all samples containing more than 1 % methylated templates were therefore considered positive.
- the cutoff was set at >10% methylation for the HOXD3,
- the SIM1 assay correctly identified 91 % of the samples and showed a sensitivity of 0.92 and a specificity of 0.90 and similarly, the HOXB3/HOXB4 assay provided a sensitivity of 0.75 and a specificity of 1 .00 and therefore correctly identified 85% of the tested samples.
- the OSR1, SIM1, HOXB3/HOXB4 DMRs therefore show high clinical potential as biomarkers in LAC.
- Lung cancer has the highest mortality rates among cancers, but the prognosis for the individual patient varies considerably depending on the stage at which the disease is diagnosed. Efficient diagnostic tools that allow early and accurate disease detection are therefore of critical importance in clinical lung cancer management. Compelling evidence supporting the utility of methylation biomarkers in various aspects of cancer management, such as risk assessment, disease detection and personalization of treatment, has accumulated during the last decades. In this study, we aimed to identify and validate novel DMRs in LAC that can be targeted as biomarkers. Using a microarray-based genome-wide methylation screening approach, we identified 74 genomic regions that demonstrated differential methylation in tumor and tumor- adjacent normal lung tissue.
- the HIST1H3E region was only targeted by 3 probes and only showed a differential methylation score of 2.961 and we still confirmed a significant increase in methylation (p ⁇ 0.0001 ) in the tumor samples as illustrated in Fig. 1 r and Fig. 2e and f. Moreover, Rauch et al.
- MS-HRM analysis allows implementation of assay-specific cutoff values, which can be useful when investigating methylation changes in regions with frequent low-level methylation in the surrounding non-cancerous tissue.
- assay-specific cutoff values is challenging for clinical purposes, as a tumor- related increase in methylation can be easily masked by the normal methylation level in contaminating normal cells, which are inevitably present in surgical resections and biopsies.
- the tumor cell content in clinical specimens vary extensively between samples and the biomarker assessment assays therefore require a high dynamic range in order to successfully test samples with both high and low tumor content and this is difficult to achieve when introducing higher cutoff values. While this can be overcome through macro- or microdissection of each specimen prior to biomarker assessment, it greatly reduces the time-efficiency and increases the cost of the individual experiment and thus limits the clinical potential of a candidate biomarker. It is therefore highly preferential to target regions that do not show methylation in normal tissue, as any increase in methylation, regardless of the magnitude, can be attributed to the presence of cancerous cells regardless of the tumor content in the clinical specimen.
- the single-minded homolog 1 (SIM1) region also showed high potential as a biomarker in LAC.
- SIM1 is frequently methylated in astrocytoma and breast cancer, but this study is the first to describe hypermethylation in lung cancer.
- a substantial subset of the 74 identified DMRs including 5/15 of the validated DMRs, HOXD3, HOXB3/HOXB4, HOXD10, HOXA3 and HOXA5, were associated with homeobox genes.
- Hypermethylation of homeobox genes is a common observation in genome-wide methylation screening studies and have been reported in several cancers, including lung cancer. While the homeobox genes lack tumor subtype specificity, they may still be useful in combination with other diagnostic biomarkers in LAC.
- the HOXB3/HOXB4 region showed a tendency towards increased methylation in metastasizing compared to non-metastasizing tumors.
- CGI CpG Islands
- Table 2 DNA methylation frequencies in tumor-adjacent normal lung and primary lung tumors.
- Min-Max (Average) 5-60% (27.0%) 5-80% (39.4%) Metastases, Min-Max (Average) 5-90% (68.7%)
- the first cohort comprises FFPE primary tumor and paired metastatic tissue (20 brain and 4 adrenal gland) from 26 patients that suffered from distant metastatic disease at the time of diagnosis.
- the second cohort comprises FFPE primary tumor tissue from 26 distant metastases-free patients with a minimum of 5 years recurrence-free survival following surgical resection.
- Table 4 The histological and clinical characteristics for the 52 patients are shown in Table 4.
- FFPE tumor-adjacent normal lung tissue was selected by an experienced pathologist from 32 LAC patients and used as a control cohort. Peripheral blood samples obtained from healthy medical students of both sexes were used to generate unmethylated control DNA. Written informed consent was obtained from the subjects.
- PB peripheral blood
- 10 ml blood was incubated for 30 min at 4°C with 40 ml Triton lysis buffer (1 % Triton X-100, 10 mM Tris, 0.32 M sucrose, 5 mM MgCI 2 ) and spun for 30 min at 3-4000 rpm (4°C). The supernatant was then removed and the nuclei were washed using 0.9% NaCI.
- nuclei were lysed using 3 ml nuclei lysis buffer (24 mM EDTA, 75 mM NaCI), 230 ⁇ 10% SDS and 25 ⁇ pronase
- DNA concentrations were measured using a NanoDrop 1000 spectrophotometer (Thermo Scientific, Waltham, MA, USA).
- 500 ng genomic DNA from each sample was subjected to sodium bisulfite treatment using the EZ-96 DNA Methylation- GoldTM kit (Zymo Research, Irvine, USA) according to the manufacturer ' s instructions and eluted in a final volume of 52 ⁇ .
- DMRs differentially methylated regions
- DNA was extracted from primary tumor and tumor-adjacent normal lung tissue from four LAC patients.
- a methylated DNA immunoprecipitation (MeDIP) was performed in order to enrich the methylated fragments.
- MeDIP protocol can be found in 34 .
- Two fractions from each sample (MeDIP enriched and input) were subsequently labeled with Cy5 and Cy3 and cohybridized to the NimbleGen Human DNA Methylation 3x720K CpG Island Plus RefSeq Promoter Array (Roche/NimbleGen, Madison, Wl, USA).
- the arrays were processed using NimbleScan software (Roche/NimbleGen, Madison, Wl, USA) to generate log2 signal ratios for each probe. The ratios were then averaged within each group (tumor and normal lung) and subsequently processed by the NimbleScan software to generate a relative enrichment score for each group. The enrichment scores for each group were then subtracted to produce a differential methylation score indicating an enrichment or depletion of signal in the tumor group relative to the normal lung group. Hence, negative and positive differential methylation scores indicate potentially hypo- and hypermethylated loci in lung cancer, respectively. A threshold of 2 was applied to the differential methylation score.
- DMRs differentially methylated regions
- MS-HRM Methylation-Sensitive High-Resolution Melting
- the methylation status of each DMR was determined by comparing the melting profiles of each sample with a standard dilution series of fully methylated DNA (Universal Methylated Human DNA Standard, Zymo Research, Irvine, CA USA) into unmethylated DNA, which was generated by subjecting DNA extracted from PB to whole genome amplification (WGA) using the lllustra GenomiPhi V2 DNA Amplification Kit (GE Healthcare Life Sciences, Piscataway, NJ, USA) according to the manufacturer's instructions. All analyses were performed in duplicates. The technical specifications for each of the 18 assays, including the genomic location of the used primers, PCR cycling and HRM protocol, as well as melting profiles of the standards are included as Supplementary Information.
- SEQ ID NO: 37 OTX2 F: 5' - GAG CGG TAG TGG GAG AGA GG- SEQ ID NO: 38: OTX2 R: 5' - CAC CCC AAA TTA CAT TCG CAA C Assay 2: HOXD3
- SEQ ID NO: 39 HOXD3 F: 5' - CGG AGG AAT AGG GTA AGT TTG- 3'
- SEQ ID NO: 41 HOXB3/HOXB4 F: 5' - AGG CGG TTA GTA GTA GTT T - 3'
- SEQ ID NO: 43 HOXD10 F: 5' - TAA AGG TCG GTA TGA GTA GAG TTG TT - 3'
- SEQ ID NO: 44 HOXD10 R: 5' - ACC CCG ACA CTC CCT CTC TA- 3'
- SEQ ID NO: 45 Chr1 (q21 .1 ).
- a F 5' - GGT AGC GTT TAT AGA GTG TGG AAT- 3'
- SEQ ID NO: 46 Chr1 (q21 .1 ).
- a R 5' - TTT CCC ACC ACG CAA ACC TTA TA- 3'
- SEQ ID NO: 47 HIST1 H3G/HIST1 H2BI F: 5' - GAT CGT TAA GAT TTT GGA GTT GG - 3'
- SEQ ID NO: 48 HIST1 H3G/HIST1 H2BI R: 5' - CGA TAA TAA CTT TAC CCA ACA ACT T - 3'
- SEQ ID NO: 51 SIM1 F: 5' - GTT TAT TGG TTA ATA GGG TTG AGT GAT - 3'
- SEQ ID NO: 52 SIM1 R: 5' - ATC CCG CGA ATA ATA AAA TTC AAA ACC - 3'
- SEQ ID NO: 53 OSR1 F: 5' - GCG TTG GAG GGG ATT AGT AG- 3'
- SEQ ID NO: 54 OSR1 R: 5' - TCA TCC GAC TAC ACT TAA ATA TCC-
- SEQ ID NO: 56 FRG1 B R: 5' - CGA ATA CAC TCC ACC CCC - 3' Assay 1 1 : Chr6(p22.1 )
- SEQ I D NO: 59 HOXA3 F: 5' - ACG TTG GGG AAG GTT GTA GAG- 3'
- SEQ I D NO: 61 LY75-CD302 F: 5' - GTT GGT CGG AGA ATT GAG GGA- 3'
- SEQ I D NO: 62 LY75-CD302 R: 5' - CCC GCT TCT CAT TTC AAC CC- 3
- SEQ ID NO: 63 CTAGE15 F: 5' - GGT TAT CGT AGT AAT ATT GGT TAT A
- SEQ ID NO: 64 CTAGE15 R: 5' - CAA CCC GCT CCA CAA TAA TTC - 3'
- SEQ ID NO: 69 HOXA5 F: 5' - TGG TTC GGA TTA TTA GTT GTA TAA T - 3'
- SEQ ID NO: 70 HOXA5 R: 5' - TAC CTA CCG AAA TAC ATA CTC - 3'
Landscapes
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Engineering & Computer Science (AREA)
- Immunology (AREA)
- Pathology (AREA)
- Analytical Chemistry (AREA)
- Zoology (AREA)
- Genetics & Genomics (AREA)
- Wood Science & Technology (AREA)
- Physics & Mathematics (AREA)
- Biotechnology (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Hospice & Palliative Care (AREA)
- Biophysics (AREA)
- Oncology (AREA)
- Biochemistry (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
Abstract
A method is provided for determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, or the prognosis of a non-small cell lung cancer in a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus including regulatory sequences of said gene locus, wherein said gene locus is selected from the group consisting of SIM1, Chr6(p22.1), HIST1H3G/HIST1H2BI, HOXB3/HOXB4, OSR1, GHSR, OTX2, LOC648987, HIST1H3E, HIST1H2AJ/HIST1H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1(q21.1).
Description
METHYLATION BIOMARKERS FOR LUNG CANCER Field of invention
The present invention relates to DNA methylation biomarkers for non-small cell lung cancer.
Background of invention
Lung cancer is the most common type of cancer and each year, the disease is responsible for approximately 1 .5 million deaths worldwide. There are two major types of lung cancer; small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC) accounting for 10% and 85%, of all newly diagnosed lung cancers, respectively. Lung adenocarcinoma (LAC) is the most common subtype of NSCLC, which account for approximately 40% of all lung cancers. The overall 5-year survival rate for lung cancer is 15%, but the prognosis is highly dependent on the stage, at which the disease is diagnosed. If the disease is localized at the time of diagnosis, the 5-year survival rate is approximately 50%, compared to approximately 25% for cases with regional disease, and less than 5% for patients that already suffer from metastatic disease. Most early stage lung cancers are asymptomatic and consequently, only 15% of lung cancers are diagnosed at a local stage and more than 50% are diagnosed at an advanced stage. Thus, new efficient diagnostic tools for early and accurate disease detection are needed in order to improve the poor prognosis of lung cancer.
Methylation of the carbon-5 position of cytosine residues within CpG dinucleotides is a well-established epigenetic mechanism involved in the regulation of gene expression. Most CpG dinucleotides cluster in CpG rich regions in the genome, known as CpG islands (CGI), and these regions are often located within gene regulatory elements. In fact, the promoter region of more than half of all protein encoding genes contain a CGI and the methylation status of this sequence is instrumental in regulating the
transcriptional activity of the gene. Consequently, disruption of the cell's normal methylation pattern can have severe consequences and contribute to neoplastic transformation. Genome-wide studies have shown that aberrant DNA methylation is a common feature in human cancer and hundreds of tumor suppressor genes have been shown to be subject to DNA-methylation mediated silencing. Gene expression changes as a consequence of aberrant methylation have also been reported for multiple genes in lung cancer, especially hypermethylation-mediated silencing of tumor suppressor
genes such as RASSF1A, APC, RARβ, DAPK and MGMT, and to a lesser extent, hypomethylation-mediated overexpression of proto-oncogenes such as ELM03.
The utility of DNA methylation biomarkers has already been established in all aspects of clinical cancer management, including risk assessment, early disease detection, prognostication and treatment personalization. However, the development of biomarkers for clinical implementation is a challenging process that includes biomarker candidate discovery and evaluation of biomarker specificity and sensitivity in large- scale validation studies. Now, based on a genome-wide methylation screening novel methylation biomarkers have been identified for use in clinical lung cancer
management. The biomarkers display cancer-specific methylation changes and have been validated and evaluated for sensitivity and specificity.
Summary of invention
The invention relates to methylation biomarkers for non-small cell lung cancer. The invention provides a number of methylation markers, which can be used to distinguish between lung tumour tissue and healthy tissue. A plurality of individual methylation biomarkers are identified, which show high sensitivity and specificity. In one aspect, a method is provided for determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, the prognosis of a non-small cell lung cancer, and/or monitoring a non-small cell lung cancer in a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
Another aspect pertains to a method of assessing whether a human subject is likely to develop non-small cell lung cancer, said method comprising
i) providing a sample from said human subject,
ii) determining in said sample the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ), and
iii) on the basis of said methylation status identifying a human subject that is more likely to develop non-small cell lung cancer.
A third aspect pertains to a method for categorizing or predicting the clinical outcome of a non-small cell lung cancer of a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
A fourth aspect relates to a method of evaluating the risk for a subject of contracting cancer, said method comprising in a sample from said subject determining the methylation status of a gene locus selected from the group consisting of SIM1 ,
Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
In a fifth aspect, a method is provided of treating a non-small cell lung cancer in a human subject, said method comprising the steps of
i. determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, or the prognosis of a non-small cell lung cancer in a subject by a method according to the present disclosure, which involve determining the methylation status of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
Chr1 (q21 .1 ),
ii. selecting human subjects having non-small cell lung cancer, a
predisposition to non-small cell lung cancer, or a negative or positive prognosis of a non-small cell lung cancer,
iii. subjecting said subjects identified in step ii. to a suitable treatment for non-small cell lung cancer.
In the methods of the invention, the sample may be a lung tissue sample, or a bodily fluid such as blood or plasma (for example peripheral blood), and methylation status may be determined by any method selected from the group consisting of Methylation-
Sensitive High Resolution Melting (MS-HRM), Methylation - Sensitive Melting Curve Analysis (MS-MCA), EpiTyper, Methylation-Specific PCR (MSP), DNA methylation specific qPCR (qMSP), Pyrosequencing, Methyl Light, Amplicon bisulfite sequencing (AmpliconBS), Enrichment bisulfite sequencing (EnrichmentBS), Whole genome bisulfite sequencing (BS-Seq), HELP assays, and Methyl Sensitive Southern Blotting, and determination may involve Methylated DNA immunoprecipitation (MeDIP).
In one embodiment, the methylation status is determined by a method comprising the steps of
i) providing a lung tissue sample from said subject comprising nucleic acid material comprising said gene,
ii) processing said nucleic acid sequence using one or more methylation- sensitive restriction endonuclease enzymes,
iii) optionally, amplifying said processed nucleic acid sequence in order to obtain an amplification product, and
iv) analyzing said processed nucleic acid sequence or said amplification product for the presence of processed and/or unprocessed nucleic acid sequences, thereby inferring the presence of methylated and/or unmethylated nucleic acid sequences.
In a preferred embodiment of the methods, the methylation status is determined by a method comprising the steps of
i) providing a lung tissue sample from said subject comprising nucleic acid material comprising said gene,
ii) modifying said nucleic acid using an agent which modifies unmethylated cytosine or cleaves nucleic acid sequences in a methylation-dependent manner, iii) amplifying at least one portion of said gene using primers, which span or comprise at least one CpG dinucleotide in said gene in order to obtain an amplification product, and
iv) analyzing said amplification product for the presence of modified and/or unmodified cytosine residues, wherein the presence of modified cytosine residues are indicative of methylated cytosine residues.
The amplified CpG-containing nucleic acid is preferably analyzed by melting curve analysis. However, methylation status may also be determined by methylation specific
PCR, bisulfite sequencing, COBRA, endonucleolytic digestion, or DNA methylation arrays.
A kit is also provided in a sixth aspect for determining non-small cell lung cancer, predisposition to non-small cell lung cancer, or categorizing or predicting the clinical outcome of a non-small cell lung cancer, said kit comprising in a package
i. an agent that (a) modifies methylated cytosine residues but not non- methylated cytosine residues; or (b) modifies non-methylated cytosine residues but not methylated cytosine residues; or (c) modifies a nucleic acid sequence in a methylation- dependent manner,
ii. and at least one pair of oligonucleotide primers that specifically hybridizes under amplification conditions to a region of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ), such as at least one primer selected from the group consisting of SEQ ID NO: 37 to 72.
In a seventh aspect of the present disclosure provides a use of oligonucleotide primers comprising a sequence, which is a subsequence of a gene loci selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 ,
GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ) or the complement thereof for diagnosing non-small cell lung cancer in any of the methods provided in the present disclosure. In yet an eighth aspect, a method is provided for identifying a therapeutically effective agent for treatment of non-small cell lung cancer, said method comprising
i. providing a non-small cell lung cancer cell line comprising one or more genetic loci selected from the group consisting of SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ),
ii. providing one or more potential therapeutic agents,
iii. treating said non-small cell lung cancer cells by bringing said agents in contact with said non-small cell lung cancer cells,
iv. determining methylation status of said one or more genetic loci
v. comparing said methylation status of said treated non-small cell lung cancer cells with the methylation status of said non-small cell lung cancer cells, when untreated, wherein a decreased level of methylation positive alleles is indicative of a therapeutic agent.
Description of Drawings
Figure 1 : Differentially methylated regions in LAC.
The methylation level of 18 DMRs was investigated in 52 LAC primary tumors and 32 tumor-adjacent normal lung samples using MS-HRM analysis. The results of the methylation assessment are shown as stacked bar percentage plots for each DMR in a-r). The relative proportion of samples in each category with 0-1 % methylated templates are shown in white, 1 -10% methylated templates in white with light grey stripes, 10-50% methylated templates in dark grey and 50-100% methylated templates in black. The statistical significance of the detected differences in methylation between groups was assessed using a Mann-Whitney test of ranks and two-tailed p-values <0.05 were considered statistically significant.
Figure 2: Examples of melting profiles observed for the HOXD3, OSR1 and HIST1 H3E regions.
The methylation level of the 18 DMRs was determined using MS-HRM analysis.
Representative normalized melting profiles for 10 tumor-adjacent normal lung samples and 10 LAC tumors are shown in black in a-b) for HOXD3, c-d) for OSR1 and in e-f) for HIST1 H3E. The DNA methylation standards were generated as a serial dilution of fully methylated DNA into an unmethylated background. The 100% methylated standard is shown in red, 50% methylated standard in light blue, 10% methylated standard in green, 1 % methylated standard in dark blue and the 0% methylated standard in orange.
Figure 3: Methylation levels in metastasizing and non-metastasizing LAC tumors. The methylation levels of the 18 DMRs were compared between 26 metastasizing and 26 non-metastasizing LAC primary tumors. Three regions demonstrated a tendency towards increased methylation in the metastasizing tumors and a similar increase was observed in the 24 paired metastases (20 brain and 4 adrenal gland). The results are shown as stacked bar percentage plots in a) for HOXB3/HOXB4, b) for LOC648987 and in c) for HOXA5. The relative proportion of samples in each category with 0-1 %
methylated templates are shown in white, 1 -10% methylated templates in white with light grey stripes, 10-50% methylated templates in dark grey and 50-100% methylated templates in black. The statistical significance of the detected differences in methylation between groups was assessed using a Mann-Whitney test of ranks and two-tailed p-values <0.05 were considered statistically significant.
Figure 4: MS-HRM Assays
The technical specifications for 18 MS-HRM assays. Genomic location (UCSC
Genome Browser on Human Dec. 2013 (GRCh38/hg38) Assembly), primer sequences and assay-specific PCR cycling and HRM protocols are shown in the sequences section. For each assay, 180 bp of regional genomic sequence (top strand) and corresponding bisulfite-modified sequence (bottom strand) is shown. The locations of the primers are indicated in red. Normalized melting profiles for the DNA methylation standards are shown for each assay in technical duplicates. The DNA methylation standards were generated as a serial dilution of fully methylated DNA into an unmethylated background. The 100% methylated standard is indicated in red, 50% methylated standard in light blue, 10% methylated standard in green, 1 % methylated standard in dark blue and the 0% methylated standard in orange. Detailed description of the invention
The present invention relates to methylation biomarkers for use in the diagnosis and treatment of non-small cell lung cancer. Generally, the methylation markers of the invention can be used in methods for identifying subjects, which are predisposed to non-small cell lung cancer; i.e. subjects having an increased likelihood of developing non-small cell lung cancer. The methylation markers of the invention can also be used in methods for identifying subjects having non-small cell lung cancer, and in this case, the markers allow early diagnosis, Further, the markers provide prognostic information with respect to non-small cell lung cancer, and this, the markers can be used to identify a subject having non-small cell lung cancer, and the cancer DNA can be tested for predictive prognostic information based on the methylation markers of the invention, as well as information on which curative and/or ameliorative treatment to provide for the non-small cell lung cancer. The methylation status of the methylation markers of the invention may also be used to monitor a treatment provided for the curing and/or ameliorating a non-small cell lung cancer. Additionally, the marker methylation status
can be used to monitor relapse of non-small cell lung cancer for subject previously cured for non-small cell lung cancer.
Thus, aspects of the present invention relates to i) methods for identifying subjects, which are predisposed to non-small cell lung cancer, and/or which have a non-small cell lung cancer, including early stages, such as asymptomatic stages of non-small cell lung cancer, ii) methods for providing prognostic information of a non-small cell lung cancer and/or inferring a suitable treatment based thereupon, iii) methods of monitoring a treatment of a non-small cell lung cancer, and/or monitoring relapse of a non-small cell lung cancer.
In order to facilitate the understanding of the invention a number of definitions are provided below.
Definitions
Amplification according to the present invention is the process wherein a plurality of exact copies of one or more gene loci or gene portions (template) is synthesised. In one preferred embodiment of the present invention, amplification of a template comprises the process wherein a template is copied by a nucleic acid polymerase or polymerase homologue, for example a DNA polymerase or an RNA polymerase. For example, templates may be amplified using reverse transcription, the polymerase chain reaction (PCR), ligase chain reaction (LCR), in vivo amplification of cloned DNA, isothermal amplification techniques, and other similar procedures capable of generating a complementing nucleic acid sequence. Amplified copies of a targeted genetic region are sometimes referred to as an amplicon.
The term "PCR bias" as used herein refers to conditions, wherein PCR more efficiently amplifies templates with a specific methylation status. It has been reported that at least some unmethylated nucleic acid templates are more efficiently amplified than methylated nucleic acid template.
A double stranded nucleic acid contains two strands that are complementary in sequence and capable of hybridizing to one another. In general, a gene is defined in terms of its coding strand, but in the context of the present invention, an oligonucleotide
primer, which hybridize to a gene as defined by the sequence of its coding strand, also comprise oligonucleotide primers, which hybridize to the complement thereof.
A nucleotide is herein defined as a monomer of RNA or DNA. A nucleotide is a ribose or a deoxyribose ring attached to both a base and a phosphate group. Both mono-, di-, and tri-phosphate nucleosides are referred to as nucleotides.
The term oligonucleotide comprises oligonucleotides of both natural and/or non-natural nucleotides, including any combination thereof. The natural and/or non-natural nucleotides may be linked by natural phosphodiester bonds or by non-natural bonds.
Preferred oligonucleotides comprise only natural nucleotides linked by phosphodiester bonds. The oligomer or polymer sequences of the present invention are formed from the chemical or enzymatic addition of monomer subunits. The term "oligonucleotide" as used herein includes linear oligomers of natural or modified monomers or linkages, including deoxyribonucleotides, ribonucleotides, anomeric forms thereof, peptide nucleic acid monomers (PNAs), locked nucleotide acid monomers (LNA), and the like, capable of specifically binding to a single stranded polynucleotide tag by way of a regular pattern of monomer-to-monomer interactions, such as Watson-Crick type of base pairing, base stacking, Hoogsteen or reverse Hoogsteen types of base pairing, or the like. Usually monomers are linked by phosphodiester bonds or analogs thereof to form oligonucleotides ranging in size from a few monomeric units, e.g. 3-4, to several tens of monomeric units, e.g. 40-60. Whenever an oligonucleotide is represented by a sequence of letters, such as "ATGCCTG," it will be understood that the nucleotides are in 5'→ 3' order from left to right and the "A" denotes deoxyadenosine, "C" denotes deoxycytidine, "G" denotes deoxyguanosine, and "T" denotes thymidine, unless otherwise noted. When a double stranded DNA molecule is shown, the nucleotides of the top strand are in 5'→ 3' order from left to right and the nucleotides of the bottom strand are then in 3'→ 5' order from left to right. Usually, oligonucleotides of the invention comprise the four natural nucleotides; however, they may also comprise methylated or non-natural nucleotide analogs.
The term "dinucleotide" as used herein refers to two sequential nucleotides. The dinucleotide may be comprised in an oligonucleotide or a nucleic acid sequence. In particular, the dinucleotide CpG, which denotes a cytosine linked to a guanine by a phosphodiester bond, may be comprised in an oligonucleotide according to the present
invention, and also comprised in a targeted gene locus sequence according to the present invention. A CpG dinucleotide is also herein referred to as a CpG site. CpG sites are targets for methylation of the cytosine residue. Methylation status: the term "methylation status" as used herein, refers to the presence or absence of methylation in a specific nucleic acid region. In particular, the present invention relates to detection of methylated cytosine (5-methylcytosine). A nucleic acid sequence, e.g. a gene locus of the invention, may comprise one or more CpG methylation sites. The nucleic acid sequence of the gene locus may be methylated on all methylation sites (i.e. 100% methylated), or unmethylated on all methylation sites (i.e. 0% methylated). However, the nucleic acid sequence may also be methylated on a subset of its potential methylation sites (CpG-sites). In this latter case, the nucleic acid molecule is heterogeneously methylated.
The gene loci methylation markers of the present invention can be used to infer non- small cell lung cancer based on the relative amount of methylation positive (fully methylated) and methylation negative (fully unmethylated) alleles in a sample comprising in a mixture of nucleic acid molecules from a subject. For example, the methylation status of a specific gene locus marker of the present invention may be that at least 50%, such as on at least 60%, such as on at least 70%, for example on at least 80%, such as on at least 90%, such as on at least 95%, for example on at least 99%, such as least 99.9% of the nucleic acid sequence molecules (alleles) in a sample are methylation positive (fully methylated).
Gene locus: The term "gene locus" as sued herein, such as the gene loci defined by the genes SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ) is meant to include all regions relevant for expression of a given gene, both the coding region and upstream and downstream regions, which may comprise cis-acting activating signals. A gene locus, specifically is meant to include at least 1000 bp upstream and/or downstream of the open reading frame of an encoded gene, such as at least 900 bp, such as at least 800 bp, such as at least 700 bp, such as at least 600 bp, such as at least 500 bp, such as at least 400 bp, such as at least 300 bp, such as at least 200 bp, such as at least 100 bp upstream and/or downstream of the open reading frame of an encoded gene. A gene locus is also meant to include any intronic sequences in the open reading frame. It is also
understood that specific subregions of a gene locus specified herein can be of particular importance for the methods described herein. In particular CG-rich regions also known as CpG islands are particularly relevant, because CG-dinucleotides are targets for methylation.
Method of determining non-small cell lung cancer
A number of methods are provided herein for analysing a human subject with respect to non-small cell lung cancer. In particular, methods are provided for determining non- small cell lung cancer in a human subject, methods for determining a predisposition to non-small cell lung cancer for a human subject, methods for determining the prognosis of a non-small cell lung cancer in a subject and/or inferring a suitable treatment, methods for categorizing or staging a non-small cell lung cancer of a human subject, methods for monitoring a non-small cell lung cancer, such as monitoring the treatment of a non-small cell lung cancer and/or relapse of a non-small cell lung cancer. The methylation biomarkers for non-small cell lung cancer are described in more detailed herein below. Generally, the one or more methylation biomarkers for non-small cell lung cancer according to the methods herein are selected from a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
Thus in one aspect, a method is provided for determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, the prognosis of a non-small cell lung cancer, and/or monitoring a non-small cell lung cancer in a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene including regulatory sequences of said gene, wherein said gene locus is selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
In another aspect, a method is provided for categorizing or predicting the clinical outcome of a non-small cell lung cancer of a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chi (q21 .1 ).
In another aspect, a method is provided for evaluating the risk for a human subject of developing non-small cell lung cancer, or for monitoring relapse of a non-small cell lung cancer, said method comprising in a sample from said subject determining the methylation status of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
A further aspect relates to a method for assessing whether a human subject is likely to develop non-small cell lung cancer, said method comprising
i) providing a sample from said human subject,
ii) determining in said sample the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ), iii) on the basis of said methylation status identifying a human subject that is more likely to develop non-small cell lung cancer.
The methods thus involve determining the methylation status of one or more gene loci as defined herein. Thus, methylation status may be determined for multiple gene loci, for example methylation status for at least two gene loci are determined, such as at least three gene loci, such as at least four gene loci, or five or more gene loci. The plurality of gene loci is preferably selected from a marker gene loci of the invention, i.e. a gene loci selected from the group consisting of SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
Chr1 (q21 .1 ).
Generally, increased levels of methylation positive alleles of the respective marker gene locus relative to methylation levels of a predetermined control sample of non- cancer cells is indicative of the presence of a non-small cell lung cancer, higher
likelihood of developing cancer, decreased overall survival, negative outcome, different stage cancer, different grade cancer, and/or higher risk of contracting cancer.
Thus, the provided methods may preferably comprise steps of comparing the methylation status of the respective gene locus determined for a subject with a predetermined methylation status for the corresponding gene of a reference sample comprising non-cancer cells, for example tumour adjacent normal lung tissue cells, and/or comprising a different stage cancer cells. The predetermined status is preferably determined from non-cancer cells of other subjects, which do not have non-small cell lung cancer and/or are not predisposed to non-small cell lung cancer. The
predetermined methylation status differs between the different methylation markers of the invention.
Table A: Overview of sensitivities and specificities for each marker for preferred cut-off values.
ID Cutoff Value Sensitivity Specificity
Methylation
Level
OSR1 >1 % 0.98 0.97
SIM1 >1 % 0.92 0.90
HOXD10 >1 % 0.90 0.81
HIST1H3E >50% 0.90 0.91
HOXD3 >10% 0.86 0.97
GHSR >10% 0.83 0.97
Chr1 (q21.1 ).A >10% 0.79 1.00
HOXB3/HOXB4 >1 % 0.75 1.00
Chr6(p22.1 ) >1 % 0.67 1.00
HIST1H3G/HIST1
>1 % 0.63 1.00
H2BI
OTX2 >1 % 0.51 1.00
LOC648987 >1 % 0.24 1.00
HIS T1 H2AJ/HIS T
>1 % 0.22 1.00
1H2BM
HOXA3 >50% 0.14 1.00
HOXA5 >50% 0.12 1.00
Table A shows an overview of cut-off values, which are preferred for each of the representative marker loci.
For example, for loci ID SIM1 , any level of methylation positive alleles above 1 % is indicative of a non-small cell lung cancer, higher likelihood of developing cancer, decreased overall survival, negative outcome, different stage cancer, different grade cancer, and/or higher risk of contracting cancer for a human subject. The same applies to the other genetic loci listed in table A above. Thus, for the methylation marker loci identified as SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , HOXD10, OTX2, LOC648987 and HIST1 H2AJ/HIST1 H2BM, a level of methylation positive alleles above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus OSR1 above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus SIM1 above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus HOXD10 above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient. Specifically, a methylation level of the methylation marker locus HOXB3/HOXB4 above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus Chr6(p22.1 ) above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus HIST1 H3G/HIST1 H2BI above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus OTX2 above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or
relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus LOC648987 above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Specifically, a methylation level of the methylation marker locus
HIST1 H2AJ/HIST1 H2BM above 1 %, such as above 2%, 3%, 4%, 5%, such as above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
For the methylation marker loci identified as HOXD3, GHSR and Chr1 (q21.1 ).A, a level of methylation positive alleles above 10%, such as above 15%, such as above 20%, such as above 25%, such as above 30%, such as above 35%, such as above 40%, such as above 45%, such as above 50%, such as above 55%, such as above 60%, such as above 65%, such as preferably above 65.3%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased
risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient. Indeed for methylation marker loci identified as HIST1 H3E, HOXA3 and HOXA5, a level of methylation positive alleles above 50%, such as above 55%, such as above 60%, such as above 65%, such as above 70%, such as above 75%, such as above 80%, such as above 85%, such as above 90%, such as above 95%, such as above 96%, 97%, 98%, or 99%, such as 100% is indicative of non-small cell lung cancer, a predisposition to non-small cell lung cancer, increased risk of non-small cell lung cancer, the prognosis of non-small cell lung cancer, and/or relapse of non-small cell lung cancer, and thus indicates that a given treatment being monitored is inefficient.
Method for treatment of non-small cell lung cancer
Aspects provided herein also relates to methods for determining the prognosis of a non-small cell lung cancer in a subject and/or inferring a suitable treatment, as well as for monitoring a non-small cell lung cancer, and in particular monitoring the treatment of a non-small cell lung cancer and/or monitoring relapse of a non-small cell lung cancer.
So in one aspect, a method is provided for treatment of non-small cell lung cancer in a human subject, the method comprises the steps of
i. determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, or the prognosis of a non-small cell lung cancer in a subject by a method of the present invention, as defined elsewhere herein,
ii. selecting human subjects having non-small cell lung cancer, a predisposition to non-small cell lung cancer, or a relapse of a non-small cell lung cancer,
iii. subjecting said subjects identified in step ii. to a suitable treatment for non-small cell lung cancer.
The step of determining non-small cell lung cancer by a method of the present invention allows early detection of non-small cell lung cancer, and therefore allows treatment of the cancer to be initiated before developing into later stages and/or before forming metastases. This allows the use of less serious types of therapeutic
interventions, and may for example avoid the need for surgery, such as surgical removal of the entire lung. In one embodiment, the selected human subject is subjected to a treatment selected form surgery, chemotherapy, immunotherapy and/or radiotherapy, however, in a preferred embodiment, the treatment is radiotherapy. In one embodiment, the treatment is a combination of surgery, chemotherapy and radiotherapy, for example surgery followed by chemotherapy and/or radiotherapy.
The methylation markers also allow monitoring relapse of non-small cell lung cancer, as well as offering a personalized treatment of non-small cell lung cancer by surveillance and quality of control of the treatment offered, thereby allowing terminating ineffective treatments and offering alternative treatments. Thus, in another aspect, the invention provides a method for personalized treatment of a non-small cell lung cancer of a human subject, said method comprising
i) in a sample from said human subject, determining the methylation status of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ) ii) providing a treatment of non-small cell lung cancer to said human subject, iii) after a sufficient amount of time having provided the treatment, in a sample from said human subject, determining the methylation status of said gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ), iv) comparing the methylation status of said gene locus before and after treatment, and v) if methylation of said genetic locus is similar to the methylation before treatment, terminating said provided treatment and preferably offering an alternative treatment, or vi) if methylation of said genetic locus is reduced relative to the methylation before treatment, continuing said provided treatment or even terminating the treatment. Methylation biomarkers for non-small cell lung cancer
As described herein above, a number of different methods are provided herein for evaluating non-small cell lung cancer in a human subject based on methylation status of specific gene loci. The invention also provides specific oligonucleotide primers and kits for use in determining methylation status of specific gene loci, which are
methylation biomarkers for non-small cell lung cancer according to the present
invention. These gene loci include SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ), such as those identified by SEQ ID NO: 1 -36.
Generally, in the methods of the invention, the methylation status is determined for at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ). In one embodiment, the methylation status is determined for a gene locus selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21.1 ) and/or HOXB3/HOXB4. In another embodiment, the methylation status is determined for a gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or HOXB3/HOXB4. In another embodiment, the methylation status is determined for a gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI and/or Chr6(p22.1 ).
In one embodiment, the methylation status is determined for a gene locus selected from the group consisting of SIM1 and HIST1 H3G/HIST1 H2BI; or the group consisting of SIM1 and Chr6(p22.1 ); or the group consisting of HIST1 H3G/HIST1 H2BI and Chr6(p22.1 ).
In one embodiment, the methylation status is determined for SIM1 and/or Chr6(p22.1 ) and/or HIST1 H3G/HIST1 H2BI and/or HOXB3/HOXB4 and/or OSR1 and/or HOXD10 and/or OTX2 and/or LOC648987 and/or HIST1 H2AJ/HIST1 H2BM.
In one embodiment, the methylation status is determined for one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ). In further embodiments, the methylation status is determined for that one gene locus and at least one gene locus selected from the remainder of the group.
Thus, in specific one embodiment, methylation status is determined in the SIM1 gene locus and at least one additional gene locus selected from the group consisting of
Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ). In another specific embodiment, methylation status is determined in the Chr6(p22.1 ) gene locus and at least one additional gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the
HIST1 H3G/HIST1 H2BI gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the
HOXB3/HOXB4 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
In another specific embodiment, methylation status is determined in the OSR1 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, GHSR, OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the GHSR gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the OTX2 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the LOC648987 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the HIST1 H3E gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the
HIST1 H2AJ/HIST1 H2BM gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ). In another specific embodiment, methylation status is determined in the HOXD10 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD3, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the HOXD3 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXA3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the HOXA3 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA5 and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the HOXA5 gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, and Chr1 (q21.1 ).
In another specific embodiment, methylation status is determined in the Chr1 (q21.1 ) gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3 and HOXA5. DNA sequences of specific gene loci are provided herein below. Thus, in a preferred embodiment, the methylation status is determined in a gene locus identified by SEQ ID NO: 1 -18. The corresponding sequences of the same loci after bisulfite modification and amplification are identified by SEQ ID NO: 19-36, respectively. In a preferred embodiment, the methylation status is determined by a method comprising amplifying a gene locus of the invention using at least one primer selected from the group consisting of SEQ ID NO: 37-72. Methylation status is preferably determined for a gene locus mentioned in table B using the respective forward primer and/or reverse primer identified in table B; i.e.
SIM1 : forward primer SEQ ID NO: 51 and/or reverse primer SEQ ID NO: 52;
Chr6(p22.1 ): forward primer SEQ ID NO: 57 and/or reverse primer SEQ ID NO: 58; etc.
Thus, in one preferred embodiment, the methylation status is determined in a genetic region of a gene locus of the invention, wherein said region is delineated by the primer pairs identified in table B for each respective gene; i.e.
for OTX2: primers SEQ ID NO: 37 and/or 38;
for HOXD3: primers SEQ ID NO: 39 and/or 40;...
For SIM1 : primers SEQ ID NO: 51 and/or 52;
For Chr6(p22.1 ): primers SEQ ID NO: 57 and/or 58;
For HIST1 H3G/HIST1 H2BI: primers SEQ ID NO: 47 and/or 48;
etc .; and
for HIST1 H3E: primers SEQ ID NO: 71 and/or 72.
Table B. Markers Sequence table
Marker gene/Loci ID Non- Modified Forward Reverse
modified Gene/loci primer primer
Gene/loci SEQ ID SEQ ID SEQ ID
SEQ ID NO NO NO
NO
OTX2 1 19 37 38
HOXD3 2 20 39 40
HOXB3/HOXB4 3 21 41 42
HOXD10 4 22 43 44
Chr1 (q21 .1 ) 5 23 45 46
HIST1 H3G/HIST1 H2BI 6 24 47 48
GHSR 7 25 49 50
SIM1 8 26 51 52
OSR1 9 27 53 54
FRG1 B 10 28 55 56
Chr6(p22.1 ) 11 29 57 58
HOXA3 12 30 59 60
LY75-CD302 13 31 61 62
CTAGE15 14 32 63 64
LOC648987 15 33 65 66
HIST1 H2AJ/HIST1 H2BM 16 34 67 68
HOXA5 17 35 69 70
HIST1 H3E 18 36 71 72
Sample
According to the present invention, the methylation status of one or more gene loci is determined in a sample from a human subject. Thus, the sample of the invention comprises biological material, in particular genetic material comprising nucleic acid molecules. The nucleic acid molecules may be extracted from the sample prior to the analysis. The sample may be obtained or provided from any human source. In one embodiment, determination of methylation status of a gene locus or genetic region of the invention is performed on samples selected from the group consisting of lung tissue, hematopoietic tissue, bone marrow, expiration air, stem cells, including cancer stem cell, and body fluids, such as sputum, bronchial lavage, urine, blood and sweat.
In preferred embodiments the sample is or comprises lung tissue, such as lung cells and/or genetic material of lung cells. It is well-known that tumor DNA may leak to the blood stream or other bodily fluids, so in one preferred embodiment, the sample is a body fluid, such as sputum, urine, blood and sweat. In particular, it is preferred that the sample is a blood or plasma sample. Body fluids are often retrievable by less invasive methods than lung tissue, which must be obtained surgically for example by biopsies.
The provided sample is in one embodiment a formalin-fixed paraffin-embedded (ffpe) sample, for example an ffpe sample, wherein prestages to non-small cell lung cancer can be seen. In particular, the sample used for predetermining methylation status can be an ffpe sample. Many ffpe samples may be provided, which can give rise to statistically strong predetermined values with respect to evaluation of non-small cell lung cancer risk, categorizing or staging a non-small cell lung cancer of a human subject, methods for monitoring a non-small cell lung cancer, such as monitoring the treatment of a non-small cell lung cancer and/or relapse of a non-small cell lung cancer.
The nucleic acid to be analysed for the presence of methylated CpG may be extracted from the samples by a variety of techniques such as that described by Maniatis, et al (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, N.Y., pp 280, 281 , 1982). However, the sample may be used directly.
Any nucleic acid, in purified or nonpurified form, can be utilized as the starting nucleic acid or acids, provided it contains, or is suspected of containing, the specific nucleic acid sequence containing the methylation target site (e.g., CpG). The specific nucleic acid sequence which is to be amplified may be a part of a larger molecule or is present initially as a discrete molecule. The nucleic acid sequence to be amplified need not to be present in a pure form, it may for example be a fraction of a complex mixture of other DNA molecules, and/or RNA. In one example, the nucleic acid sequence is a fraction of a genomic nucleic acid preparation. Extremely low amounts of nucleic acid may be used as target sequence according to the methods of the present invention. It is appreciated by the person skilled in the art that in practical terms no upper limit for the amount of nucleic acid to be analysed exists. The problem that the skilled person may encounter is that the amount of sample to be analysed is limited. Therefore, it is beneficial that the method of the present invention can be performed on a small amount of sample and thus a limited amount of nucleic acid in said sample. The present methods allow the detection of only very few nucleic acid copies. The amount of the nucleic acid to be analysed is in one
embodiment at least 0.01 ng, such as 0.1 ng, such as 0.5 ng, for example 1 ng, such as at least 10 ng, for example at least 25 ng, such as at least 50 ng, for example at least 75 ng, such as at least 100 ng, for example at least 125 ng, such as at least 150 ng, for example at least 200 ng, such as at least 225 ng, for example at least 250 ng, such as at least 275 ng, for example at least 300 ng, 400 ng, for example at least 500 ng, such as at least 600 ng, for example at least 700 ng, such as at least 800, ng, for example at least 900 ng or such as at least 1000 ng.
In one preferred embodiment the amount of nucleic acid as the starting material for the method of the present invention is approximately 50 ng, alternatively 100 ng or 200 ng.
Methylation status
The methods of the present invention for determining non-small cell lung cancer in a human subject, methods for determining a predisposition to non-small cell lung cancer for a human subject, methods for determining the prognosis of a non-small cell lung cancer in a subject and/or inferring a suitable (personalized) treatment, methods for categorizing or staging a non-small cell lung cancer of a human subject and methods for monitoring a non-small cell lung cancer, all include a step of providing or obtaining a
sample from the human subject, and in that sample determining the methylation status of at least one genetic locus selected from the group consisting of SIM1 ,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), as well as subregions thereof, in particular GC- rich subregions, such as those delineated by the respective primer pairs identified in table B.
Methylation status of the target gene loci or genetic regions of the present invention may be determined by any suitable method available to the skilled person for detecting methylation status. However, in a preferred embodiment, methylation status is determined by a quantitative method, such as a semi-quantitative method, which is capable of detecting levels of methylation positive alleles and/or methylation negative alleles in a population of target molecules present in a sample. For example, the quantitative method is preferably capable of detecting different levels of methylation positive alleles of a given target locus sequence, such as detecting whether 0%, less than 1 %, more that 1 %, such as approximately 10%, 25%, 50%, 75% or 100% of the alleles of a given marker locus are methylation positive. A semi- quantitative method provides categorical data, such as the level of 1 -10% methylated templates or 10-50% methylated templates. Some techniques in the art merely detect the presence of one or more methylation positive and/or methylation negative alleles of a given target sequence without providing quantitative data, and without providing information of the relative levels of methylation positive and methylation negative alleles. However, preferred methods of the present invention provide a quantitative measure of the relative level of methylation positive alleles of a specific target region.
The term "methylation status" as used herein, refers to the extent to which a nucleic acid region and/or in particular a CpG methylation site is methylated or unmethylated, which may be expressed as the methylation level of a given sample. The methylation status of a single CpG methylation site can be either methylated or unmethylated. A nucleic acid sequence comprising multiple potential methylation (CpG) sites, may be methylated on only a subset of those CpG sites. Such nucleic acid molecules/alleles are heterogeneous methylated. The term "methylation status", thus, refers to whether a nucleic acid sequence is methylation positive (methylated on all CpG sites), is
methylation negative (all CpG sites of the sequence is unnmethylated), or is
heterogeneous methylated (a subset of CpG sites of the sequence is methylated.
The methods for inferring non-small cell lung cancer of the present invention, thus determine methylation status of specific methylation markers by determining whether a specific methylation marker in a sample obtained or provided from a subject is methylation positive, methylation negative or heterogeneously methylated, as well as detecting the relative level of methylated alleles of a given locus. The methods may also include detecting marker sequences with low methylation, which defines methylation of less than 1 % of the alleles of a sample.
Methylation status may be determined by any suitable method available to those of skill in the art. For example, method may be selected from the group consisting of
Methylation-Sensitive High Resolution Melting (MS-HRM), Methylation - Sensitive Melting Curve Analysis (MS-MCA), EpiTyper, Methylation-Specific PCR (MSP), DNA methylation specific qPCR (qMSP), Pyrosequencing, Methyl Light, Amplicon bisulfite sequencing (AmpliconBS), Enrichment bisulfite sequencing (EnrichmentBS), Whole genome bisulfite sequencing (BS-Seq), HELP assays, and Methyl Sensitive Southern Blotting, and determination may involve Methylated DNA immunoprecipitation
(MeDIP). In one embodiment, the method is selected from the group consisting of AmpliconBS 1 , AmpliconBS 2, AmpliconBS 3, AmpliconBS 4, EnrichmentBS 1 ,
EnrichmentBS 2, EpiTyper 1 , EpiTyper 3, Infinium, Pyroseq 1 , Pyroseq 1 (replicate), Pyroseq 2, Pyroseq 3, Pyroseq 4, Pyroseq 5, MethyLight, MS-HRM, MS-MCA, qMSP (preamp), qMSP (standard), DNA-methylation-specific amplification by qPCR, HPLC- MS, Immunoquant, Pyroseq AluYb8, Bisulfite pyrosequencing using primers that amplify AluYb8 repetitive DNA, Pyroseq D4Z4, Pyroseq LINE1 , Pyroseq NBL2 and ClonalBS; cf. Bock et al, 2016, nature biotech. 34 (7).
In another embodiment, the method is selected from the group consisting of High- performance liquid chromatography (HPLC), High-performance capillary
electrophoresis (HPCE), Sssl assay , Gene specific Methylation-specific PCR (MSP- PCR), Methyl-sensitive restriction enzyme PCR (MSRE-PCR), MethyLight,
Pyrosequencing , Methylation-sensitive single nucleotide Primer extension (MS- SNuPE), Combined bisulfite restriction analysis (COBRA), Methylation sensitive-high resolution melting (MS-HRM), Methylation-specific multiplex ligationdependent probe amplification (MS-MLPA), Mass ARRAY EpiTYPER , Restriction landmark genomic
scanning (RLGS), Differential methylation hybridization (DMH), Methylated DNA immunoprecipitation and microarray chip (MeDIPchip), Bead arrays (lllumina) Bisulfite, Whole-genome bisulfite sequencing, Single molecule real time (SMRT) sequencing and MethylCap sequencing; cf. Syedmoradi et al, 2016, Royal Soc of Chem. (DOI: 10.1039/c6an01649a).
In one embodiment of methods of the present invention for determining non-small cell lung cancer in a human subject, for determining a predisposition to non-small cell lung cancer for a human subject, for determining the prognosis of a non-small cell lung cancer in a subject and/or inferring a suitable treatment, for categorizing or staging a non-small cell lung cancer of a human subject, and/or for monitoring a non-small cell lung cancer, such as monitoring the treatment of a non-small cell lung cancer and/or relapse of a non-small cell lung cancer, the methylation status is determined by use of methylation-sensitive restriction enzymes. Many restriction enzymes are sensitive to the DNA methylation states. Cleavage can be blocked or impaired when a particular base in the recognition site is modified. For example, the MspJI family of restriction enzymes has been found to be dependent on methylation and hydroxymethylation for cleavage to occur. These enzymes excise ~ 32 base pair fragments containing a centrally located 5-hmC or 5-mC modified residue that can be extracted and sequenced. Due to the known position of this epigenetic modification, bisulfite conversion is not required prior to downstream analysis.
Methylation-sensitive enzymes are well-known in the art and include:
Aatll, Accll, Aor13HI, Aor51 HI, BspT104l, BssHII, Cfr10l, Clal Cpol, Eco52l, Haell, Hapll, Hhal, Mlul, Nael, NotI, Nrul, Nsbl, PmaCI, Psp1406l, Pvul, Sacll, Sail, Smal and SnaBI.
The digested nucleic acid sample is subsequently analysed by for example gel electrophoresis.
So, in one embodiment of the methods of the invention, methylation status is determined by a method comprising the steps of
i) providing a sample, such as a lung tissue sample or a blood or plasma sample from said subject comprising nucleic acid material comprising said gene, ii) processing said nucleic acid sequence using one or more methylation- sensitive restriction endonuclease enzymes,
iii) optionally, amplifying said processed nucleic acid sequence in order to obtain an amplification product, and
iv) analyzing said processed nucleic acid sequence or said amplification product for the presence of processed and/or unprocessed nucleic acid sequences, thereby inferring the presence of methylated and/or unmethylated nucleic acid sequences.
In a preferred embodiment of methods of the present invention, the methodology employed for determining methylation status is determined by a method, which comprises at least the steps of modifying the DNA with an agent which targets either methylated or unmethylated sequences, amplifying the DNA, and analysing the amplification products.
For example, amplification product is analysed by detecting the presence or absence of amplification product, wherein the presence of amplification product indicates that the target nucleic acid has not been cleaved by the restriction enzymes, and wherein the absence of amplification product indicates that the target nucleic acid has been cleaved by the restriction enzymes. Thus, generally, the in the methods of the invention methylation status is determined by a method comprising the steps of
i) providing a sample, such as a lung tissue sample or a blood or plasma sample from said subject comprising nucleic acid material comprising a gene locus of the invention,
ii) modifying said nucleic acid material using an agent, which modifies nucleic acid sequences in a methylation-dependent manner,
iii) amplifying at least one portion of said gene locus using primers, which span or comprise at least one CpG dinucleotide in said gene locus in order to obtain an amplification product, and
iv) analyzing said amplification product for the presence of modified and/or unmodified cytosine residues, wherein the presence of modified cytosine residues are indicative of methylated cytosine residues.
For example, the method comprises the steps of
i) providing a sample, such as a lung tissue sample, from said subject comprising nucleic acid material comprising said gene locus,
ii) modifying said nucleic acid using an agent which modifies unmethylated cytosine,
iii) amplifying at least one portion of said gene locus using primers, which span or comprise at least one CpG dinucleotide in said gene locus in order to obtain an amplification product, and
iv) analyzing said amplification product.
The amplification product can be analysed for nucleic acid substitutions resulting from conversion of modified cytosine residues, preferably wherein the presence of converted cytosine residues are indicative of unmethylated cytosine residues, and presence of unconverted cytosine residues is indicative of methylated cytosine residues. Typically, unmethylated cytosine is converted to thymidine after bisulphite treatment and amplification, while methylated cytosine is left unchanged after same treatment.
In a preferred embodiment, the amplification product is analysed by melting curve analysis; cf. herein below.
The amplification product, the amplicon, is in a preferred embodiment a genetic region of a gene of the invention, wherein said region is delineated by the primer pairs identified in table B for each respective gene; i.e.
for OTX2: primers SEQ ID NO: 37 and/or 38;
for HOXD3: primers SEQ ID NO: 39 and/or 40; For SIM1 : primers SEQ ID NO: 51 and/or 52;
For Chr6(p22.1 ): primers SEQ ID NO: 57 and/or 58;
For HIST1 H3G/HIST1 H2BI: primers SEQ ID NO: 47 and/or 48;
etc .; and
for HIST1 H3E: primers SEQ ID NO: 71 and/or 72.
Modification of DNA
The method for determining methylation status in the present invention preferably comprise a step of modifying the nucleic acids comprised in the sample, or extracted from the sample, using an agent which specifically modifies unmethylated cytosine in the nucleic acid. As used herein the term "modifies" refers the specific modification of
either an unmethylated cytosine or a methylated cytosine, for example the specific conversion of an unmethylated cytosine to another nucleotide which will distinguish the modified unmethylated cytosine from a methylated cytosine. In one preferred embodiment, an agent modifies unmethylated cytosine to uracil. Such an agent may be any agent conferring said conversion, wherein unmethylated cytosine is modified, but not methylated cytosine. In one preferred embodiment the agent for modifying unmethylated cytosine is sodium bisulfite. Sodium bisulfite (NaHS03) reacts readily with the 5,6-double bond of cytosine, but only poorly with methylated cytosine. The cytosine reacts with the bisulfite ion, forming a reaction intermediate in the form of a sulfonated cytosine which is prone to deamination, eventually resulting in a sulfonated uracil. Uracil can subsequently be formed under alkaline conditions which removes the sulfonate group.
During a nucleic acid amplification process, uracil will by the Taq polymerase be recognised as a thymidine. The product upon PCR amplification of a Sodium bisulfite modified nucleic acid contains cytosine at the position where a methylated cytosine (5- methylcytosine) occurred in the starting template DNA of the sample. Moreover, the product upon PCR amplification of a Sodium bisulfite modified nucleic acid contains thymidine at the position where an unmethylated cytosine (5-methylcytosine) occurred in the starting template DNA of the sample. Thus, an unmethylated cytosine is converted into a thymidine residue upon amplification of a bisulfite modified nucleic acid.
In a preferred embodiment of the present invention, the nucleic acids are modified using an agent which modifies unmethylated cytosine in the nucleic acid. In a specific embodiment, such an agent is a bisulfite, hydrogen sulfite, and/or disulfite reagent, for example sodium bisulfite.
However, in another embodiment, an agent is used, which specifically modifies methylated cytosine in the nucleic acid and does not modify unmethylated cytosine.
Amplifying step
After modification of the nucleic acids of the sample, the specific genetic region selected for determination of methylation status is preferably amplified in order to generate and thereby obtain multiple copies (amplicons) of the respective genetic
regions, which can allow its further analysis with respect to methylation status. The amplification is preferably preformed using at least one oligonucleotide primer, which targets the specific genetic region comprising methylation markers for non-small cell lung cancer according to the present invention. Most preferably amplification is performed using two oligonucleotide primers, which delineates the analysed region. The skilled person may use his common general knowledge in designing suitable primers. However, in a preferred embodiment, at least one, and preferably two methylation-independent oligonucleotide primers are employed for amplification of the modified nucleic acid. The nature of methylation-independent primers is described on more detail herein below.
The amplifying step is a polymerisation reaction wherein an agent for polymerisation is involved, effecting an oligonucleotide primer extension. The agent for polymerization may be any compound or system which will function to accomplish the synthesis of primer extension products, including enzymes. Enzymes that are suitable for this purpose include, for example, E. coli DNA polymerase I, Klenow fragment of E. coli DNA polymerase I, T4 DNA polymerase, other available DNA polymerases, polymerase muteins, reverse transcriptase, and other enzymes, including heat-stable enzymes (i.e., those enzymes which perform primer extension after being subjected to temperatures sufficiently elevated to cause denaturation also known as Taq polymerases). Suitable enzymes will facilitate combination of the nucleotides in the proper manner to form the primer extension products which are complementary to each locus nucleic acid strand. Generally, the synthesis will be initiated at the 3' end of each primer and proceed in the 5' direction along the template strand, until synthesis terminates, producing molecules of different lengths. There may be agents for polymerization, however, which initiate synthesis at the 5' end and proceed in the other direction, using the same process as described above.
A preferred method for amplifying the modified nucleic acid by means of at least one methylation-independent oligonucleotide primer is by the polymerase chain reaction (PCR), as described herein and as is commonly used by those skilled in the art.
It is appreciated that PCR amplification requires a set of oligonucleotide primers, one forward primer and one reverse primer. According to the present invention, the forward primer is a methylation independent primer. The reverse primer is in another embodiment a methylation independent primer. However, both reverse and forward
primer may be methylation independent oligonucleotide primers according to the definitions herein.
The amplification product (amplicon) may be of any length, however in one preferred embodiment, the amplification product comprise between 15 and 1000 nucleotides, such as between 15 and 500 nucleotides, such as between 50 and 120 nucleotides, preferably between 80 and 100 nucleotides. In a preferred embodiment, the amplicon is delineated by the primers identified in table B for each respective gene, cf. herein above. The PCR reaction is characterised by three steps a) melting a nucleic acid template, b) annealing at least one methylation-independent oligonucleotide primer to said nucleic acid template, and c) elongating said at least one methylation-independent
oligonucleotide primer. Melting
The melting of a CpG-containing nucleic acid template may also be referred to as strand separation. Melting is necessary where the target nucleic acid contains two complementary strands bound together by hydrogen bonds. This strand separation can be accomplished using various suitable denaturing conditions, including physical, chemical, or enzymatic means. One physical method of separating nucleic acid strands involves heating the nucleic acid until it is denatured. The denaturation by heating is the preferred procedure for melting in the present invention. Heat denaturation involves temperatures ranging from about 60 degrees Celsius to 100 degrees Celsius. The time for melting may be in the range of 5 seconds to 10 minutes or even longer for initial melting of the template.
The melting temperature is typically between 80 and 90 degrees Celsius, such as at least 81 , for example at least 82, such as at least 84, preferably at least 85, at least 86, such as at least 87, for example at least 88 degrees Celsius. The PCR reaction mixture is incubated at the melting temperature for at least 5 seconds, alternatively at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or at least 100 seconds.
Annealing
Separated strands are used as a template for the synthesis of additional nucleic acid strands. It is understood that the separated strands may result from the separation of complementary strands in an originally double stranded nucleic acid. However, separated strands originally single stranded are also used as templates according to the present invention. The synthesis of additional nucleic acid strands is performed under conditions that allow the hybridisation of oligonucleotide primers to templates. Such a step is herein referred to as annealing. The oligonucleotide primers form hydrogen bonds with the template.
The annealing temperature is between 40 and 75 degrees Celsius, such as at least 40, at least 45, for example at least 50, at least 52, at least 54, at least 56, at least 57, at least 58, at least 59 preferably at least 60, at least 61 , at least 62, at least 63, at least 64, at least 65, at least 66, at least 67, for example at least 68, at least 69, at least 70, at least 72, at least 73, at least 75 degrees Celsius. The PCR reaction mixture is incubated at the annealing temperature for 1 to 100 seconds, such as at least 1 , at least 2, at least 3, at least 4, preferably at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, alternatively at least 1 1 , at least 13, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or at least 100 seconds.
In a specific embodiment of the present invention, the annealing temperature is between at least 15 degrees Celsius above the optimal annealing temperature, such as at least 1 , at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 1 1 , at least 12, at least 13, at least 14, at least 15 degrees Celsius above the optimal annealing temperature.
The optimal annealing temperature can be calculated by standard algorithms, as known to people skilled within the art. In one embodiment, the optimal primer annealing temperature (Tm) is calculated as: Tm = 4(G + C) + 2(A + T), wherein G, C, A, T designates the number of the respective nucleotides. In another embodiment, the optimal primer annealing temperature (Tm) is calculated as:
Tm = 64.9°C + 41 °C x (number of G's and C's in the primer - 16.4)/N, where N is the length of the primer. However, the annealing temperature should be empirically determined in respect of each specific primer. The modulation of the annealing
temperature is used to adjust hybridization stringency as described elsewhere herein. Thus, the optimal annealing temperature should be set at a level, wherein the PCR bias towards amplification of unmethylated nucleic acid template is balanced by the less efficient annealing of methylation-independent oligonucleotide primer according to the present invention to unmethylated nucleic acid target sequence.
Generally, the choice of annealing temperature depends on the sensitivity of the assay, and the composition of the sample with respect to the relative levels of methylation positive and methylation negative alleles. Thus, optimal annealing temperatures should preferably be determined for each sample. However, in one embodiment, the annealing temperature in respect of specific methylation-independent oligonucleotide primer according to the present invention is as specified below for each assay; cf. sequences.
Thus, in one embodiment of the methods and uses of the invention, the oligonucleotide primer is e.g. SEQ ID NO: 51 and 52 (assay 8), and the annealing temperature is 55°C. Similarly, specific annealing temperatures as well as PCR cycling conditions for preferred oligonucleotide primers of the invention is indicated below for each assay.
Elongation
The oligonucleotide primers annealed to the template is elongated to form an amplification product. The elongating temperature depends on optimum temperature for the polymerase, and is usually between 30 and 80 degrees Celsius. Typically, the elongating temperature is between 60 and 80 degrees Celsius, such as at least 60, at least 65, at least 68, at least 69, at least 70, preferably at least 71 , at least 72, at least 73, at least 74, alternatively at least 75, at least 76, at least 77, at least 78, at least 79, at least 80 degrees Celsius. The PRC reaction mixture is incubated at the elongating temperature for 1 to 100 seconds, such as at least 1 , at least 2, at least 3, at least 4, preferably at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, alternatively at least 1 1 , at least 13, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at least 45, at least 50, at least 55, at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, or at least 100 seconds.
Elongation occurs in a buffered aqueous solution, preferably at a pH of 7-9.
The two oligonucleotide primers are added to the reaction mixture in a molar excess of primer: template especially when the template is genomic DNA which will ensure an improved efficiency. Deoxyribonucleoside triphosphates dATP, dCTP, dGTP, and
dTTP are added to the reaction mixture, either separately or together with the primers. An appropriate agent for effecting the primer extension reaction, referred to and described elsewhere herein as an agent for polymerization is added to the reaction mixture. It is appreciated by a person skilled in the art that for PCR the agent for polymerisation preferable is a heat-stable polymerase enzyme, such as Taq polymerase.
Cycling
The PCR method comprises incubating the nucleic acid at a cycle of different specific temperatures in order to control the steps of amplification. The amplification buffer and polymerase required for PCR are well known to people of skill within the art.
The PCR reaction mixture is incubated sequentially at the melting temperature, the annealing temperature and the elongating temperature, respectively, for a number of cycles. The PCR reaction may run between 10 and 70 cycles. Typically, the PCR reaction run between 25 and 55 cycles, such as at least 25, at least 30, at least 35, at least 40, preferably at least 45, at least 50 or at least 55 cycles.
In one embodiment, cycles of melting, annealing and elongation consist of 10-80, 10- 80 and 10-80 seconds, respectively. Optimal cycling intervals are easily determined by those of skill in the art. Specific embodiments of cycle intervals for melting, annealing and elongation are indicated in for each assay below together with preferred annealing temperature for the respective primers; i.e.:
Assay 1 : Primers SEQ ID NO: 37 and 38: 1 ,1 ,1 minutes, respectively;
Assay 18: Primers SEQ ID NO: 71 and 72: 1 ,1 ,1 minutes, respectively;
PCR can be performed on a PCR machine, which is also known as a thermal cycler. Specifically, the thermal cycler may be coupled to a fluorometer, thus allowing the monitoring of the nucleic acid amplification in real time by use of intercalating fluorescent dyes, or other fluorescent probes. Applicable dyes according to the present invention include any DNA intercalating dye.
Suitable dyes include ethidium bromide, EvaGreen, LC Green, Syto9, SYBR Green, SensiMix HRM™ kit dye, however many dies are available for this same purpose.
Real-time PCR allows for easy performance of quantitative PCR (qPCR), which is usually aided by algorithms comprised in the software, which is usually supplied with the PCR machines.
The fluorometer can furthermore be equipped with software that will allow interpretation of the results. Such software for data analyses may also be supplied with the kit of the present invention. Another variant of the PCR technique, multiplex PCR, enables the simultaneous amplification of many targets of interest in one reaction by using more than one pair of primers.
PCR according to the present invention comprise all known variants of the PCR technique known to people of skill within the art. Thus, the PCR technology comprise real-time PCR, qPCR, multiplex PCR.
Oligonucleotide primers
The oligonucleotide primer employed for amplification of modified nucleic acid is preferably a methylation-independent primer. The term "methylation-independent primer" refers to an oligonucleotide primer, which is capable of hybridizing to both methylated and unmethylated nucleic acid alleles and modified as well as unmodified alleles. A methylation-independent primer may not anneal with the exact same affinity to methylated/unmethylated nucleic acid alleles or modified/unmodified alleles.
The oligonucleotide primers of the present invention are capable of being employed in amplification reactions, wherein the primers are used in amplification of template DNA originating from either a methylation positive or amethylation negative strand. The preferred methylation-independent primers of the present invention comprise at least one CpG dinucleotide, as described below. Accordingly, in a methylation positive and bisulfite modified nucleic acid target sequence, the primer sequence will anneal to the nucleic acid template with a perfect match, wherein all of the nucleotides in a consecutive region of the primer forms base pairs with a complementary region in the nucleic acid target. However, in a methylation negative nucleic acid target after bisulfite modification, the methylation-independent primers of the present invention will anneal
to the nucleic acid template with an imperfect match, wherein the primer sequence comprise a mis-match (i.e. the primer and template does not form base pairs) at the position of the unmethylated Cytosine at a CpG site in the nucleic acid template.
Nonetheless, as the primers of the present invention are methylation-independent, the primers will hybridize to both methylation negative and methylation positive nucleic acid sequences after bisulfite modification, and the primers will form a perfect match with the target sequence of a methylated nucleic acid target and an imperfect match, where the primers and target nucleic acid sequence does not form base pairing at the positions of unmethylated Cytosine (which is converted by bisulfite to Uracil) at CpG sites.
The methylation-independent primers of the present invention will, due to the mismatch after bisulfite modification at positions of unmethylated cytosine of a CpG-site in the nucleic acid target sequence, hybridize less efficiently to a methylation negative nucleic acid sequence. However, by reducing the stringency of hybridization, the methylation-independent primers of the present invention are able to anneal to the nucleic acid target, also when the nucleic acid target comprise unmethylated CpG- sites, which have been modified by for example bisulfite treatment. In one example, the stringency is reduced by reducing the annealing temperature as described elsewhere herein.
The design of oligonucleotide primers suitable for nucleic acid amplification techniques, such as PCR, is known to people skilled within the art. The design of such primers involves analysis of the primer's melting temperatures and ability to form duplexes, hairpins or other secondary structures. Both the sequence and the length of the oligonucleotide primers are relevant in this context. The oligonucleotide primers according to the present invention comprise between 10 and 200 consecutive nucleotides, such as at least 60, at least 65, at least 70, at least 75, at least 80, at least 85, at least 90, at least 95, at least 100, at least 1 10, at least 120, at least 130, at least 140, at least 150, at least 160, at least 180 or at least 200 nucleotides. In a specific embodiment, the oligonucleotide primers comprise between 15 and 60 consecutive nucleotides, such as 15, 16, 17, 18, 19, 20, preferably 21 , 22, 23, 24, 25, 26, 27, 28, 29, 30, such as 31 , 32, 33, 34, 35, 36, 37, 38, 39, 40, alternatively at least 41 , at least 42, at least 44, at least 46, at least 48, at least 50, at least 52, at least 54, at least 56, at least 58, or at least 60 consecutive nucleotides.
The methods employed for determining the methylation status of a nucleic acid according to the present invention, preferably comprise amplification of a modified nucleic acid by use of a methylation independent oligonucleotide primer. In one embodiment, the oligonucleotide primers of the present invention are able to hybridize to a nucleic acid sequence comprising CpG islands. In a preferred embodiment, at least one of the oligonucleotide primers according to the present invention comprises at least one CpG dinucleotide. In another embodiment of the present invention, the oligonucleotide primers comprise 2, alternatively 3, 4, 5, 6, 7, 8, 9 or 10 CpG
dinucleotides. In even further embodiments, the oligonucleotide primers of the present invention comprise at least 10 CpG dinucleotides. In one preferred embodiment the at least one methylation-independent oligonucleotide primer comprises one CpG dinucleotide at the 5 '-end of the primer. The CpG dinucleotide may be located anywhere within the oligonucleotide primer sequence. However, in a preferred embodiment of the present invention, the at least one CpG dinucleotide is located in the 5'-end of the oligonucleotide primer. In another preferred embodiment, the at least one CpG dinucleotide constitute the first two nucleotides of the 5'-end. In an even further preferred embodiments of the present invention, the at least one CpG dinucleotide is located within the first 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides of the 5'-terminus. In alternative embodiments, the at least one CpG dinucleotide is located within the first 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or 120 nucleotides of the 5'-terminus. In yet another embodiment, at least two CpG dinucleotides are located within the first 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides of the 5'-terminus, or at least two
CpG dinucleotides are located within the first 1 1 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 , 22, 23, 24, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 or 120 nucleotides of the 5'-terminus. The primers of the present invention may in one preferred embodiment comprise at least one CpG site, whereby annealing with a higher efficiency to a methylated than to an unmethylated template upon modification of unmethylated cytosine is achieved. The primers of the present invention comprise at least one CpG site. However, the primers comprise also for example two CpG sites.
The at least one CpG site is positioned in the 5' end of the primer. For example within the first 10 nucleotides in the 5' end of the primer, within the first 9 nucleotides in the 5' end of the primer, within the first 8 nucleotides in the 5' end of the primer, within the first 7 nucleotides in the 5' end of the primer, within the first 6 nucleotides in the 5' end of the primer, within the first 5 nucleotides in the 5' end of the primer, within the first 4 nucleotides in the 5' end of the primer or within the first 3 nucleotides in the 5' end of the primer. In a preferred embodiment the CpG site is introduced immediately after the first nucleotide of the 5' end of the primer. Specific hybridization typically is accomplished by a primer having at least 10, for example at least 12, such as at least 14, for example at least 16, such as at least 18, for example at least 20, such as at least 22, for example at least 24, such as at least 26, for example at least 28, or such as at least 30 contiguous nucleotides, which are complementary to the target template. Often the primer will be close to 100% identical to the target template. However, the primer may also be 98% identical to the target template or for example at least 97%, such as at least 96%, for example at least 95%, such as at least 94%, for example at least 93%, such as at least 92%, for example at least 91 %, such as at least 90%, for example at least 89%, such as at least 88%, for example at least 87%, such as at least 86%, for example at least 85%, such as at least 84%, for example at least 83%, such as at least 82%, for example at least 81 %, such as at least 80%, for example at least 79%, such as at least 78%, for example at least 77%, such as at least 76%, for example at least 75%, such as at least 74%, for example at least 73%, such as at least 72%, for example at least 71 %, such as at least 70%, for example at least 68%, such as at least 66%, for example at least 64%, such as at least 62% or for example at least 60% identical to the target template. If there is a sufficient region of complementary nucleotides, e.g., at least 10, such as at least 12, for example at least 15, such as at least 18, or for example at least 20, for example at least 30, such as at least 40, for example at least 50, such as at least 60, for example at least 70 nucleotides, then the primer may also contain additional nucleotide residues that do not interfere with hybridization but may be useful for other manipulations.
Examples of such other residues may be sites for restriction endonuclease cleavage, for ligand binding or for factor binding or linkers.
The methylation-independent oligonucleotide primer of the present invention is designed to hybridize to nucleic acids in a sample. Importantly, the nucleic acids
sample are treated with an agent, which modifies unmethylated cytosine in said nucleic acid. Thereby, any unmethylated Cytosine of CpG dinucleotides comprised in the nucleic acid are converted to Uracil as explained elsewhere herein. Consequently, in primers comprising a CpG dinucleotide, designed to hybridize with the complementary CpG dinucleotide of the nucleic acid of the sample, the CpG dinucleotide will only hybridize to the methylated CpG dinucleotide fraction of the nucleic acid. In the unmethylated fraction of CpG dinucleotides comprised in the nucleic acid of the sample, Cytosine are modified to uracil which does not hybridize with the CpG dinucleotide of the oligonucleotide primer.
The methylation-independent oligonucleotide primers according to the present invention are designed to comprise sufficient nucleotides for specific hybridization to the target nucleic acid sequence regardless of its original methylation status. In some embodiments the oligonucleotide primers also comprise one or more CpG
dinucleotides, as specified elsewhere herein. These CpG dinucleotides only hybridize with the originally methylated alleles of the nucleic acids. Nevertheless, the
oligonucleotide primers can still be functionally used for amplification of both originally methylated and unmethylated nucleic acids. The CpG dinucleotides are typically comprised in the 5'-terminus of the oligonucleotide primers, as described elsewhere herein. A primer-template mismatch within the 5'-terminus of the primer usually allow the primers to hybridize with the target nucleic acid, and still function as primers in an amplification reaction.
The presence of one or more mismatches between the primer and template affects the optimal annealing temperature of said oligonucleotide primer for use in amplification reactions. The more hybridizing nucleotides comprised on the oligonucleotide primers, the higher is the optimal annealing temperature. Consequently, amplification of methylated alleles of nucleic acids by CpG-containing oligonucleotide primers according to the present invention is favoured by increased annealing temperature. Conversely, amplification of unmethylated alleles is favoured by decreased annealing temperature. In the present invention, the PCR bias towards amplification of unmethylated alleles of a nucleic acid template is reversed by amplification of said nucleic acid template at a relatively higher annealing temperature, which favours oligonucleotide primer binding and priming of the methylated allele. By modulation of the primer annealing temperature, the priming of either the unmethylated modified
allele or the methylated allele of the nucleic acid can be favoured. By increasing the annealing temperature below the theoretical optimum, the amplification of the methylated allele is favoured, while a decrease of the annealing temperature will tend to favour amplification of the unmethylated allele.
Besides annealing temperature, other factors also affect hybridisation to a target sequence of a methylation-independent primer. At highly stringent conditions, hybridization between perfect matching primer and target sequences are favoured, such as hybridization between a methylation-independent primer according to the present invention and a methylated target sequence upon cytosine modification. Less stringent conditions will tend to favour oligonucleotide primer binding, priming and amplification of the unmethylated allele. Modulation of temperature is one way of adjusting the stringency of hybridization, but the stringency of hybridization may also be modulated by adjusting buffer composition, and/or salt concentrations in the
hybridization mixture, which is known to those of skill within the art. The present invention comprises any such method of modulating hybridization stringency to balance the PCR bias towards amplification of unmethylated template. However, modulation of temperature is preferred. In one embodiment, the oligonucleotide primer of the present invention is selected from the group consisting of SEQ ID NO: 37-72. Methylation status is preferably determined for a gene mentioned in table B using the respective forward primer and reverse primer identified in table B; i.e.
for OTX2: primers SEQ ID NO: 37 and/or 38;
for HOXD3: primers SEQ ID NO: 39 and/or 40;
For SIM1 : primers SEQ ID NO: 51 and/or 52;
For Chr6(p22.1 ): primers SEQ ID NO: 57 and/or 58;
For HIST1 H3G/HIST1 H2BI: primers SEQ ID NO: 47 and/or 48;
etc .; and
for HIST1 H3E: primers SEQ ID NO: 71 and/or 72.
In one embodiment, an oligonucleotide primer of the present invention specifically hybridizes to regions within 1 kb of the gene loci of the present invention. In one embodiment, the oligonucleotide primers hybridize to a target nucleic acid sequence of
a gene loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21.1 ) and/or
HOXB3/HOXB4, or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or HOXB3/HOXB4, or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI and/or Chr6(p22.1 ), or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer hybridizes to a target nucleic acid sequence of a gene loci selected from the group consisting of SIM1 and HIST1 H3G/HIST1 H2BI; or the group consisting of SIM1 and Chr6(p22.1 ); or the group consisting of HIST1 H3G/HIST1 H2BI and Chr6(p22.1 ), or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer hybridizes to a target nucleic acid sequence of SIM1 and/or Chr6(p22.1 ) and/or
HIST1 H3G/HIST1 H2BI and/or HOXB3/HOXB4 and/or OSR1 and/or HOXD10 and/or OTX2 and/or LOC648987 and/or HIST1 H2AJ/HIST1 H2BM, or the complement thereof.
In a preferred embodiment of the present invention the at least one oligonucleotide primer hybridizes to a target nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 -18, or the complement thereof (non-modified strand); and/or the oligonucleotide prime hybridizes to a target nucleic acid sequence selected from the group consisting of SEQ ID NO: 19-36, or the complement thereof (modified strand).
In one embodiment, an oligonucleotide primer of the present invention specifically comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a subsequence of a gene loci selected from the group consisting of SIM1 ,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ), or the complement thereof. In particular, the present invention relates to oligonucleotide primer pairs, which span or comprise at least one CpG dinucleotide in a gene locus of the invention. The term "span" as used in this context is meant to indicated the at least one CpG site is located in the nucleic acid region between the primer pairs; i.e. the amplified nucleic acid region comprise at least one CpG dinucleotide. The term "comprising" as used in connection with "primers comprising at least one CpG dinucleotide is meant to specify that the oligonucleotide primer itself comprise a CpG site.
In a specific embodiment of the present invention the oligonucleotide primers comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ), or the complement thereof. In another embodiment of the present invention the oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21 .1 ) and/or HOXB3/HOXB4, or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of SIM1 ,
HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or HOXB3/HOXB4, or the complement thereof.
In another embodiment of the present invention the oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence of a gene loci selected from the group consisting of SIM1 ,
HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or the complement thereof.
In a preferred embodiment of the present invention the at least one oligonucleotide primer comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 -18, or the complement thereof (non-modified strand); and/or the oligonucleotide prime comprises or consists of 5-50, such as 5-30, such as 10-20 consecutive nucleotides of a nucleic acid sequence selected from the group consisting of SEQ ID NO: 19-36, or the complement thereof (modified strand). Thus, in the methods of the present invention for determining or prognosing non-small cell lung cancer, determining a predisposition to non-small cell lung cancer, categorizing or predicting non-small cell lung cancer, or evaluating the risk of contracting a non-small cell lung cancer, methylation status is preferably determined by amplifying at least one portion of a gene loci selected from SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21 .1 ) and Chr6(p22.1 ), using at least one primer pair selected from the nucleic acid sequences set forth in table B (SEQ ID NO: SEQ ID NO: 37/38, 39/40, 41/42, 43/44, 45/46, 47/48, 49/50, 51/52, 53/54, 55/56, 57/58, 59/60, 61/62, 63/64, 65/66, 67/68, 69/70 and 71/72, respectively).
Detection of an amplification product can be performed by hybridizing the amplification product to an oligonucleotide probe, as described below. In a preferred embodiment, methylation status is determined by amplifying at least one portion of the respective at least one gene loci, and further employing at least one oligonucleotide probe which
hybridizes to an amplification product selected from the group consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
In a preferred embodiment, the oligonucleotide probe comprise 10-100 consecutive nucleic acids selected from the group of sequences consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
One aspect of the invention also relates to the use of oligonucleotide primers of the present invention for determining or prognosing a non-small cell lung cancer, determining a predisposition to non-small cell lung cancer, categorizing or predicting non-small cell lung cancer, or evaluating the risk of contracting a non-small cell lung cancer. Thus, in one aspect, the present invention provides a use of oligonucleotide primers comprising a subsequence of a loci selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ) or the complement thereof for diagnosing non- small cell lung cancer in a method of the invention as defined elsewhere herein.
In a preferred embodiment of the use of the invention, the primers are selected from the group set forth in table B (SEQ ID NO: SEQ ID NO: 37-72). In a preferred embodiment, the oligonucleotide primers comprise a sequence selected from the group consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand). In a preferred embodiment, the oligonucleotide primers comprising a subsequence selected from a gene loci selected from the group consisting of OSR1 , SIM1 , HOXD10, HIST1 H3E, HOXD3, GHSR, Chr1 (q21 .1 ) and/or HOXB3/HOXB4, or the group consisting of SIMI , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or
HOXB3/HOXB. Analysis of amplified CpG-containing nucleic acids
According to the present invention the nucleic acid (target) sample is subjected to an agent that converts an unmethylated cytosine to another nucleotide which will distinguish the unmethylated from the methylated cytosine. In a preferred embodiment the agent modifies unmethylated cytosine to uracil. The modifying agent can be sodium bisulphite. During the amplification process uracil will be converted to thymidine.
Thus, after conversion of unmethylated cytosines to uracils in the nucleic acid (target) sample, the subsequent PCR amplification converts uracils to thymine. As a
consequence of the sodium bisulfite and PCR-mediated specific conversion of unmethylated cytosines to thymines, G:C base pairs are converted to A:T base pairs at positions, where the cytosine was methylated.
The difference in nucleic acid sequence at previously methylated (methylation positive) or unmethylated (methylation negative) cytosines allows for the analysis of methylation status in a sample. This analysis can comprise identifying cytosine residues, which have been converted to thymidine after amplification, as unmethylated cytosine residues, and identifying cytosine residues, which has not been converted under as methylated cytosine residues.
By this method, analysis of the amplified nucleic acid after treatment with a modifying agent such as sodium bisulphite and subsequent PCR amplification can reveal the methylation status of the target nucleic acid sequence. Thus, in one embodiment, the method for determining methylation status of a nucleic acid according to the present invention further comprises a step of analyzing the amplified nucleic acids. Specifically, the subsequent analysis can be selected from the group consisting of melting curve analysis, high resolution melting analysis, nucleic acid sequencing, primer extension, denaturing gradient gel electrophoresis, southern blotting, restriction enzyme digestion, methylation-sensitive single-strand conformation analysis (MS- SSCA) and denaturing high performance liquid chromatography (DHPLC).
In one embodiment, the methylation status of the amplified containing nucleic acid is determined by any method selected from the group consisting of Methylation-Specific PCR (MSP), Whole genome bisulfite sequencing (BS-Seq), HELP assays, ChlP-on- chip assays, Restriction landmark genomic scanning, Methylated DNA
immunoprecipitation (MeDIP), Pyrosequencing of bisulfite treated DNA, Molecular break light assays, and Methyl Sensitive Southern Blotting.
In another embodiment, the methylation status of the amplified containing nucleic acid is determined by a method selected from the group consisting methylation specific PCR, bisulfite sequencing, COBRA, melting curve analysis, or DNA methylation arrays.
In a preferred embodiment of the present invention, the analysis of the amplified nucleic acid region is melting curve analysis. In another preferred embodiment of the present invention, the analysis of the amplified nucleic acid is high resolution melting analysis (HRM).
Melting curve analysis
Melting curve analysis or high resolution melting analysis exploits the fact that methylated and unmethylated alleles are predicted to differ in thermal stability because of the difference in GC contents after bisulphite treatment and PCR-amplification, which converts methylated C:G base pairs to A:T base pairs. This means that the melting curve profile of methylated (methylation positive) and unmethylated
(methylation negative) alleles of PCR products originating from bisulfite modified methylated and unmethylated can be distinguished. Thus, the level of fluorescence changes, depending on the relative amount template; i.e. the relative amount of methylation positive and methylated negative alleles.
By comparing the melting curve profile of an unknown sample with different mixes of controls having known relative amounts of methylation positive and methylation negative alleles, the relative amount of methylation positive and methylation negative alleles of the unknown sample can be determined.
The melting curve profile of an amplification product according to the present invention is determined by the composition of methylated and unmethylated alleles in the nucleic acid sample. If the nucleic acid molecules of a sample are all methylation negative, all cytosines are converted to thymines, and the resulting PCR product will have a relatively low melting temperature compared to a methylated nucleic acid, which can be seen in its melting curve. If on the other hand, the nucleic acids comprised in the sample are methylation positive, the melting temperature of the PCR product will be relatively higher, and the melting curve is shifted, as fluorescence is observed at higher temperatures. If the nucleic acid sample comprises a mixture of methylated and unmethylated alleles, bisulphite treatment followed by amplification will result in two distinct amplification products. The unmethylated alleles will display a low melting temperature and the methylated alleles a high melting temperature, and the melting curve profile of such a sample shows fluorescence from both PCR products
(methylated and unmethylated). If only a subset of the CpG dinucleotides of the target
sequence contain a methylated cytosine (heterogeneous methylation), the amplification product represents a pool of molecules, which are present in different cells of the tumor, with different melting temperatures, which leads to an overall intermediate melting temperature.
Melting curve analysis is performed by incubating the nucleic acid amplification product at a range of increasing temperatures. The temperature is increased from a starting temperature of at least 50 degrees Celsius, alternatively at least 55, at least 60, at least 62, at least 64, preferably at least 65, at least 66, at least 67, at least 68, at least 69, at least 70, at least 71 , at least 72, at least 73, at least 74, at least 75, for example at least 76, at least 78, at least 80, at least 82, at least 84 degrees Celsius. The temperature is then increased to a final temperature of at least 70, at least 72, at least 74, at least 76, at least 78, at least 80, at least 82, at least 84, at least 86, preferably at least 88, at least 89, at least 90, at least 91 , at least 92, at least 93, at least 94, at least 95, at least 96, at least 97, at least 98, at least 99, at least 100 degrees Celsius. In one
embodiment, the temperature transitions from the starting temperature to the final temperature are a linear function of time. In a specific embodiment of the present invention, the linear transitions are at least 0.05 degrees Celsius per second, alternatively at least 0.01 , at least 0.02, at least 0.03, at least 0.04, at least 0.06, at least 0.07, at least 0.08, at least 0.09, at least 0.1 , at least 0.2, at least 0.3, at least 0.4, at least 0.5, at least 0.6, at least 0.7, at least 0.8, at least 0.9, at least 1.0, at least 1.1 , at least 1.2, at least 1.3, at least 1.4, at least 1.5, at least 1 .6, at least 1 .7, at least 1.8, at least 1.9, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 degrees Celsius per second. In a preferred embodiment, the melting curve analysis is performed by incubating the nucleic acid amplification product at increasing temperatures, from 70 to 95 degrees Celsius, wherein the temperature increases by 0.05 degrees per second.
The melting of the nucleic acid can be measured by a number of methods, which are known to people within skill of the art. One method involves use of agents, which fluoresce when bound to a nucleic acid in its double stranded conformation. Such agents include fluorescent probes or dyes, such as ethidium bromide, EvaGreen, LC Green, Syto9, SYBR Green, SensiMix HRM™ kit dye. Thus, in one embodiment, the melting curve analysis is performed by measurement of fluorescence. The melting of the nucleic acid amplification product according to the present invention can then be
monitored as a decrease in the level of fluorescence from the sample. After measurement of the fluorescence the melting curves can be generated by plotting fluorescence as a function of temperature. For direct comparison of melting curves from samples that have different starting fluorescence levels, the melting curves for data collected in HRM can be normalized, as described in the examples of the present invention. Such normalization methods are known to people of skill in the art. One preferred means of normalization include calculation of the 'line of best fit' in between two normalization regions before and after the major fluorescence decrease representing the melting of the amplification product. The 'line of best fit' is a statistical measure, designating a line plotted on a scatter plot of data (using a least-squares method) which is closest to most points of the plot. Calculation of the line of best fit is performed differentially on LightCycler and
LightScanner, as illustrated in the examples of the present invention.
A platform with a combined thermal cycler and a fluorescence detector is ideal to perform in-tube melting analyses. Thus, in one embodiment, the melting curve analysis is performed on a thermal cycler coupled to a fluorometre, such as the Ligthcycler, LC480 (Roche) or the Rotorgene 6000 (Corbett Research). Thereby, the measurement of fluorescence, corresponding to the melting of the double stranded nucleic acid template, can be monitored in real time. In a specific embodiment, the melting curve analysis is performed immediately after amplification. This allows an in-tube
methylation assay, wherein the amplification and melting curve analysis is performed sequentially without transferring the sample from the tube. This procedure reduces the risk of contamination of the sample as a result from handling during the methylation assay.
Melting curve analysis allows the determination of the relative amount of methylated nucleic acid in a sample. By comparison of the melting curve of an amplification product of nucleic acid for which methylation status in unknown sample with the melting curve of at least one standard sample comprising the corresponding amplification product for which methylation status is known, the relative amount of methylated CpG- containing nucleic acid can be estimated. Thus, the present invention relates to a method, wherein the relative amount of methylated nucleic acid is estimated by comparison the melting curve of at least one standard sample comprising said nucleic
acid with a control level of methylation. In one embodiment of the present invention, said standard sample comprise any combination of methylated and unmethylated nucleic acid. In a specific embodiment, said standard sample comprise 100% methylated nucleic acid. In another specific embodiment, said standard sample comprise 100% unmethylated nucleic acid. In yet another specific embodiment, said standard sample comprise 50% methylated nucleic acid and 50% unmethylated nucleic acid. In even another specific embodiment, said standard sample comprise 0.1 % 0.5%, 1 %, 2%, 3%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 100% methylated nucleic acid.
In one embodiment of the present invention, the relative amount of methylated nucleic acid in the nucleic acid sample is between 40-60%. In another embodiment, the relative amount of methylated nucleic acid in the nucleic acid sample is below 50%. In yet another embodiment, the relative amount of methylated nucleic acid in the nucleic acid sample is below 10%, below 1 % or below 0.1 %. Thus, the term "presence of methylation" and/or the term "methylation status" as used herein includes the relative amount of methylated nucleic acid in the nucleic acid sample of at least 0.1 %, such as at least 1 %, for example at least 10%, such as at least 20%, for example at least 30%, such as at least 50%, for example at least 70%, such as at least 90%, or for example at least 99%.
If the fluorescence level at an allele-specific peak melting temperature in the melting curve of an unknown sample is higher than that of a standard sample, then the relative amount of that specific allele (methylation positive or methylation negative) in the unknown sample is also higher than the relative amount of that allele in the standard sample. Thus, if a standard sample comprise 80% methylation positive alleles, for which the peak melting temperature is 70°C, and the fluorescence of an unknown sample at 70°C is the same as that of the standard sample, then the relative level of menthylation positive alleles in the unknown sample can be inferred to be around 80%. If the fluorescence of the unknown sample is less than the standard, the amount of methylation positive alleles can be inferred to be less than 80%, and if the fluorescence is higher than the standard, then the unknown sample comprise more than 80% methylation positive alleles. Thus, by comparing a melting curve fluorescence profile of an unknown sample with the profiles of standard samples with different compositions of
methylation positive and methylation negative alleles, the level of methylated alleles of the unknown sample can be inferred.
The more standard samples, the more precise the relative amount of nucleic acids can be determined.
Thus, in one embodiment of the present invention, a higher fluorescence level at the peak melting temperature of the amplified nucleic acid sample than of a standard sample comprising a specific allele is indicative of a higher relative amount of that specific allele in that sample than in the standard sample. Conversely, a lower fluorescence level at the peak melting temperature of the amplified nucleic acid sample than of the standard sample comprising a specific allele is indicative of a lower relative amount of that specific allele in that sample than in the standard sample.
The "peak melting temperature" is mathematical a derivative of melting curve and refers to the temperature at which the largest discrete melting step occurs. The top of the peak corresponds to the major drop in fluorescence on the melting curve. The level of fluorescence at the peak melting temperature reflects the level of methylation for a given amplicon. Thus, two amplicons (derived from two different samples) may have peak melting temperatures of for example be 70°C, while having different fluorescence at this temperature, which then reflects that the amplicons have different methylation levels. The peak melting temperature corresponds to the highest level of the negative derivative of fluorescence (-dF/dT) over temperature versus temperature (T). A nucleic acid sample subjected to melting curve analysis may display more than one peak melting temperature. In a preferred embodiment of the present invention, the melting curve analysis display at least 1 , 2 or 3 peak melting temperatures.
Melting curve analysis is illustrated in the examples herein below, and figures 1 -3.
Nucleic acid sequencing
In another embodiment of the present invention, the method for analysis of the amplified nucleic acid is sequencing of the nucleic acid. By nucleic acid sequencing the order of nucleotides (base sequences) in the nucleic acid is determined. Sequencing is usually performed by extending a primer, which anneals to the nucleic acid sequence of interest. The primer is extended by a polymerase in the presence of
deoxynucleonucleotides. Sequencing may also be performed by pyrosequencing.
In the dideoxy sequencing method 2,3-Dideoxyribose - a deoxyribose sugar lacking the 3 hydroxyl group is incorporated into the extended nucleic acid chain. When 2,3- Dideoxyribose is incorporated into a nucleic acid chain, it blocks further chain elongation. This method is also known as the Sanger method or chain termination method. The primer is extended in the presence of the normal dNTPs (A, T, G, C) and a small amount of 2,3-DideoxyriboseNTPs (ddNTP). The reactions are either performed in four separate reactions, one for each of the ddNTPs (ddATP, ddTTP, ddCTP, ddGTP), or in a joint reaction, wherein ddATP, ddTTP, ddCTP and ddGTP are coupled to different fluorescent dyes. The primers are then extended to variable lengths, each transcript being terminated upon incorporation of a ddNTP. The sequence of the nucleic acid of interest can then by read after denaturing
polyacrylamid gel electrophoresis. Such sequencing techniques are known to people skilled within the art. Additionally, a number of different commercial kits are available for sequencing of nucleic acids.
Primer extension
In yet another embodiment of the present invention, the method for analysis of the amplified nucleic acid is primer extension. The primer extension method uses primers designed to hybridize with a target. The primers may end one base upstream of the position of the putative single nucleotide polymorphism, in this method, the C of a CpG dinucleotide. In the single nucleotide primer extension technique a single chain-ending nucleotide, such as a ddNTP, is added. The only one of the four nucleotides that will extend the primer is the one that is complementary. The identity of the added nucleotide, which reflects the methylation status, is determined in a variety of ways known to people of general skill within the art. For example, the chain-ending nucleotide may be radioactively labelled or coupled to a fluorescent dye, which can subsequently be identified. Restriction enzyme digestion
In a further embodiment, the method according to the present invention for analysis of the amplified nucleic acid is restriction enzyme digestion. Restriction enzymes can be divided into exonucelases and endonucleases. In a specific embodiment, the analysis of the amplified nucleic acid is restriction endonuclease digestion.
The method of the present invention results in the specific conversion of unmethylated cytosines to thymines, i.e. G:C base pairs are converted to A:T base pairs at positions, where a cytosine was methylated. This means that the nucleic acid sequence is changed, which may lead to disruption of a restriction endonuclease site or the change of a site specific for one restriction endonuclease to another restriction endonuclease. In a preferred embodiment of the present invention, the modified and amplified nucleic acid is analyzed for disruption of a site specific for the endonuclease Acil, BstUI, Hhal, HinPI I, Hpall, HpyCH4IV, Mspl, Taqal, Fnu4HI, Hpy188l, HpyCH4lll, Neil, ScrFI, BssKI, Hpy99l, Nt.CviPII. StyD4l, Aatll, Accl, Acll, Afel, Afllll, Agel, Aval, Banl, BmgBI, BsaAI, BsaHI, BsaJI, BsaWI, BsiEI, BsiWI, BsoBI, BspDI, BspEI, BsrBI, BsrFI, BssHII, BssSI, BstBI, Btgl, Cac8l, Clal, Eael, Eagl, Fspl, Haell, Hindi, Hpy188lll, Kasl, Mlul, MspAI I, Nael, Narl, NgoMIV, NlalV, Nrul, PaeR7l, Pmll, Pvul, Sacll, Sail, Sfol, Smal, Smll, SnaBI, Tlil, TspMI, Xhol, Xmal, Zral, Rsrll, Ascl, AsiSI, Fsel, Notl, PspXI, SgrAI, AlwNI, Dralll, PflFI, Tth 1 1 11, Alel, BsaBI, Msll, PshAI, Xmnl, Ahdl, Bgll, Bsll, BstAPI, EcoNI, Mwol, PflMI, BsmBI, Faul, BstXI, Drdl, Sfil, Xcml, Hgal, Ecil, BceAI, BtgZI, Mmel, NmeAIII, BsaXI, Bcgl, CspCI, Bael, Accll, AspLEI, Bsh1236l, BsiSI, BstFNI, BstHHI, Cfol, Hapll, Hin6l, HspAI or Maell. The digested nucleic acid sample is subsequently analysed by for example gel electrophoresis. Denaturing gradient gel electrophoresis
In another embodiment of the present invention, the method for analysis of the amplified nucleic acid is denaturing gradient gel electrophoresis (DGGE). In this technique, the modified and amplified nucleic acid is loaded on a denaturing gel. This techniques allows the resolution of nucleic acids with different melting temperatures, which is based on the conversion of C:G base pairs to A:T base pairs, explained elsewhere herein. For DGGE analysis the nucleic acid is subjected to denaturing polyacrylamide gel electrophoresis, wherein the gel contain an increasing gradient of denaturants, such as for example a combination of urea and formamide. The increasing denaturant concentration corresponds to increased temperature, and therefore, a gradient of denaturants mimics a temperature gradient within the gel. The concentrations of denaturants alone, however, are not sufficient to induce DNA melting. Therefore, the gel is immersed in an electrophoresis buffer kept at 54-60 degrees Celsius. When a nucleic acid molecule reaches a level of denaturant that matches the melting temperature of the lowest melting domain, a partially melted intermediate will be formed that moves very slowly. Small shifts in the melting temperature of the low
melting domain induced by differences in G:C content will cause the domain to unwind at different concentrations of denaturant. Accordingly, the modified and amplified nucleic acid of the present invention will be retarded at different positions in the gel, providing the basis for physical separation between species with different G:C contents, which is reflective of methylation status.
Southern blotting
In another embodiment of the present invention, the method for analysis of the amplified nucleic acid is Southern blotting. In this procedure, the nucleic acid to be analysed are separated by gel electrophoresis and transferred to a nitrocellulose filter, whereto it is immobilized. After immobilization, the transferred nucleic acids can be identified by hybridization with specific probes comprising a complementary nucleic acid. After hybridization and removal of excess unbound probe, the amount of hybridized indicate whether the sequence of interest was represented in the nucleic acids immobilized on the nitrocellulose membrane. The probes are usually
radioactively labelled for subsequent detection by radiography. The details of the southern blotting technique are well known to people of skill within the art.
Methylation-sensitive, single-strand conformation analysis (MS-SSCA)
MS-SSCA is a method of screening for methylation changes. MS-SSCA uses single- strand conformation analysis for the screening of an amplified region of bisulfite- modified nucleic acid. The amplified products are denatured and electrophoresed on a nondenaturing polyacrylamide gel, whereby the sequence differences between unmethylated and methylated sequences lead to the formation of different secondary structures (conformers) with different mobilities. Once the normal mobility pattern is established, any variation would indicate some degree of methylation.
Denaturing high performance liquid chromatography (DHPLC)
DHPLC is yet another technique for methylation screening of bisulfite-modified PCR products. As for other techniques mentioned herein, DHPLC identifies single nucleotide polymorphisms, which are arise after bisulfite treatment of unmethylated alleles of the CpG containing nucleic acid. The optimum temperature for DHPLC can be predicted by the sequence of the fully methylated product. Subsequently, the temperature is verified to obtain tight peaks. The retention time of the peak reflects methylation status,
because the more unmethylated the target is, the less GC rich the PCR product is and the lower the retention time is.
Kit
One aspect of the present invention relates to a kit for the detection of methylation status of a nucleic acid in a sample. A kit will typically comprise both a forward and a reverse primer to be used in the amplifying step of the present invention. The forward primer, the reverse primer or both may be a methylation-independent oligonucleotide primer as described herein. Thus, one aspect of the invention relates to a kit for determining non-small cell lung cancer, predisposition to non-small cell lung cancer, or categorizing or predicting the clinical outcome of a non-small cell lung cancer, or monitoring the treatment of a non-small cell lung cancer, and/or monitoring relapse of a previously treated non-small cell lung cancer.
The kit of the invention comprise
i. an agent that (a) modifies methylated cytosine residues but not non- methylated cytosine residues; or (b) modifies non-methylated cytosine residues but not methylated cytosine residues; or (c) modifies a nucleic acid sequence in a methylation- dependent manner, and
ii. at least one pair of oligonucleotide primers that specifically hybridizes under amplification conditions to a region of a gene locus selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ). The agent is preferably a methylation-dependent endonuclease as described elsewhere herein, and/or an agent capable of modifying non-methylated cytosine residues but not methylated cytosine residues, such as a bisulphite compound as decribed elsewhere heren, for example sodium bisulphite. Generally, the kit preferably comprises at least one oligonucleotide primer of probe of the present invention, as defined herein above. In a preferred embodiment, the kit comprise at least one oligonucleotide primer selected from the group consisting of SEQ ID NO: 37-72. In a more preferred embodiment, the kit comprises at least one primer pair selected from table B; i.e. at least one primer pair identified as SEQ ID NO: 37/38,
39/40, 41/42, 43/44, 45/46, 47/48, 49/50, 51/52, 53/54, 55/56, 57/58, 59/60, 61/62, 63/64, 65/66, 67/68, 69/70 and 71/72.
The kit may also comprise one or more reference sample, in particular reference samples comprising a nucleic acid sequence selected from a gene locus selected from the group consisting of SIM1 , HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H3G/HIST1 H2BI, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5, Chr1 (q21.1 ) and Chr6(p22.1 ). Thus, the kit may comprise a nucleic acid sequence selected from the group consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19- 36 and/or the complement thereof (modified strand).
For example, the at least one reference sample comprises 100% methylation positive reference nucleic acid, and/or 100% methylation negative reference nucleic acid. In a preferred embodiment the kit comprises at least two reference samples, wherein one of said reference samples comprises 100% methylation positive reference nucleic acid and a second reference sample comprises 100% methylation negative reference nucleic acid. The methylation positive and methylation negative reference nucleic acids may be mixed, by a person employing the kit, in ratios that are suitable for the detection of methylation in a particular sample. It is understood that reference samples in different ratios of methylation positive to methylation negative CpG-containing nucleic acids may be comprised in the kit. For example the kit may comprise at least one reference sample comprising 50% methylated and 50% non-methylated nucleic acid alleles of the respective genetic locus marker.
In particular, the nucleic acid comprised on the reference sample of the kit is preferably methylated (methylation positive) or non-methylated (methylation negative), and the kit preferably comprise two or more reference samples with different methylation status; i.e. different levels of methylation positive and methylation negative alleles. Thus, the specific nucleic acid alleles (e.g. alleles of the gene locus OSR1 ) of the reference sample may be unmethylated (0% methylated), 1 %, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 100% methylated. The kit may thus comprise one reference sample with a nucleic acid sequence as defined above, which is
unmethylated; and another reference sample with the same nucleic acid sequence, which is 100% methylated; and one or more samples comprising the same nucleic acid
sequence having different levels of intermediate methylation status, e.g. 10%, 50% and/or 90% methylation.
The kit preferably comprises the following combinations of one or more reference samples and primer pairs:
Reference sequence comprising gene locus OTX2: forward primer = SEQ ID NO: 37 and reverse primer = SEQ ID NO: 38;
Reference sequence comprising gene locus HOXD3: forward primer = SEQ ID NO: 39 and reverse primer = SEQ ID NO: 40;
Reference sequence comprising gene locus HOXB3/HOXB4: forward primer = SEQ ID NO: 41 and reverse primer = SEQ ID NO: 42;
Reference sequence comprising gene locus HOXD10: forward primer = SEQ ID NO: 43 and reverse primer = SEQ ID NO: 44;
Reference sequence comprising gene locus Chr1 (q21 .1 ): forward primer = SEQ ID NO: 45 and reverse primer = SEQ ID NO: 46;
Reference sequence comprising gene locus HIST1 H3G/HIST1 H2BI: forward primer = SEQ ID NO: 47 and reverse primer = SEQ ID NO: 48;
Reference sequence comprising gene locus GHSR: forward primer = SEQ ID NO: 49 and reverse primer = SEQ ID NO: 50;
Reference sequence comprising gene locus SIM1 : forward primer = SEQ ID NO: 51 and reverse primer = SEQ ID NO: 52;
Reference sequence comprising gene locus OSR1 : forward primer = SEQ ID NO: 53 and reverse primer = SEQ ID NO: 54;
Reference sequence comprising gene locus FRG1 B: forward primer = SEQ ID NO: 55 and reverse primer = SEQ ID NO: 56;
Reference sequence comprising gene locus Chr6(p22.1 ): forward primer = SEQ ID: 57 and reverse primer = SEQ ID NO: 58;
Reference sequence comprising gene locus HOXA3: forward primer = SEQ ID NO: 59 and reverse primer = SEQ ID NO: 60;
Reference sequence comprising gene locus LY75-CD302: forward primer = SEQ ID NO: 61 and reverse primer = SEQ ID NO: 62;
Reference sequence comprising gene locus CTAGE15: forward primer = SEQ ID NO: 63 and reverse primer = SEQ ID NO: 64;
Reference sequence comprising gene locus LOC648987: forward primer = SEQ ID NO: 65 and reverse primer = SEQ ID NO: 66;
Reference sequence comprising gene locus HIST1 H2AJ/HIST1 H2BM: forward primer = SEQ ID NO: 67 and reverse primer = SEQ ID NO: 68;
Reference sequence comprising gene locus HOXA5: forward primer = SEQ ID NO: 69 and reverse primer = SEQ ID NO: 70
Reference sequence comprising gene locus HIST1 H3E: forward primer = SEQ ID NO: 71 and reverse primer = SEQ ID NO: 72
The kit may also comprise at least one probe. Probes of the invention are defined herein above, and in a preferred embodiment, the kit comprise at least one
oligonucleotide probe comprising 10-100 consecutive nucleic acids selected from the group of sequences consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group of sequences consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand). Thus, the kit of the invention preferably comprise at least one oligonucleotide probe which hybridizes to a nucleic acid sequence selected from the group consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group of sequences consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
The kit of the invention preferably comprise at least one oligonucleotide probe which hybridizes to an amplification product generated by a primer pair selected from the group consisting of SEQ ID NO: SEQ ID NO: 37/38, 39/40, 41/42, 43/44, 45/46, 47/48, 49/50, 51/52, 53/54, 55/56, 57/58, 59/60, 61/62, 63/64, 65/66, 67/68, 69/70 and 71/72.
The kit may also comprise additional reagents used in the amplifying step of the detection method as disclosed herein. Thus, the kit may further comprise
deoxyribonucleoside triphosphates, DNA polymerase enzyme and/or nucleic acid amplification buffer. In another embodiment the kit further comprises an agent that modifies unmethylated cytosine nucleotides. Such an agent may for example be bisulfite, hydrogen sulfite, and/or disulfite reagent.
The kit may also comprise other components suitable for detection of methylation status. For example, the kit may comprise a methylation-sensitive restriction enzyme.
The kit may in preferred embodiments further comprise instructions for the
performance of the detection method of the kit and for the interpretation of the results.
The instructions for performing the method of the kit comprises for example information of particular annealing temperatures to be used for the at least one methylation- independent primers, as well as for example information on PCR cycling parameters. The kit may further comprise instructions for the interpretation of the results obtained by the method. For example how to interpret the amplified products subsequently analysed by melting curve analysis or methods as described elsewhere herein.
Information of the interpretation of melting curve analysis is described elsewhere herein.
The kit may in preferred embodiments further comprise software comprising an algorithm for calculation of primer annealing temperature and interpretation of results. Preferred embodiments for the CpG-containing nucleic acid for which the methylation
It is appreciated that the kit may be used for evaluating a non-small cell lung cancer in a human subject based on methylation status of specific genes as specified elsewhere herein.
Examples
The invention relates to methylation biomarkers for non-small cell lung cancer.
Candidate marker genes were first identified by micro array analysis. Then, the ability of each of these methylation markers to distinguish between lung tumour tissue and healthy tissue was evaluated. From this analysis, 19 highly sensitive and specific methylation biomarkers were identified.
Example 1
This example shows a genome-wide methylation screening, which identifies novel methylation biomarkers that can be used in clinical lung cancer management. Cancer- specific methylation changes are identified and the sensitivity and specificity of the most promising biomarkers are validated and evaluated. Results
Identification of differentially methylated regions (DMRs) in LAC
In order to identify novel genomic regions with disease-specific changes in methylation, we performed a genome-wide methylation screening of tumor and tumor-adjacent normal lung tissue from four LAC patients using the NimbleGen Human DNA
Methylation 3x720K CpG Island Plus RefSeq Promoter Array, which interrogates
15,980 CpG islands and 20,404 reference gene promoter regions . After data processing, a total of 346 probes (oligonucleotides of 60 bp spotted on the array) demonstrated a significant change in methylation levels between the tumor and tumor- adjacent normal lung tissue with 288 (83.2%) and 58 (16.7%) reporting hyper- and hypomethylation in the tumor tissue, respectively. The mapping of the probes to genomic regions revealed no enrichment bias in any specific parts of the genome. Out of the 346 probes, 131 (37.9%) were located within known CGIs, 164 (47.4%) at CGI shores, 6 (1.7%) in CGI shelves and 45 (13.0%) were not associated with any known CGIs. When mapping to genes, 176 (50.9%) probes were intragenic, 128 (37.0%) were located <5kb upstream of known genes, 3 (0.9%) were located <5kb downstream of known genes, and 39 (1 1 .2%) were not associated with any known gene. The 346 probes were sorted by differential methylation score, which indicates the magnitude of the detected change in methylation (See Methods section: Microarray analyses). To locate the most informative differentially methylated regions (DMRs) in LAC, we grouped the probes with a maximum inter-probe distance of 5 kb. Using this approach, we identified a total of 74 DMRs of which 63 (85.1 %) showed hypermethylation and 1 1 (14.9%) hypomethylation in LAC. Each DMR was on average targeted by 4.1 probes (Range: 2-13 probes) and spanned 547.3 bp (Range: 126 - 4245 bp). Of the 74 DMRs, 65 (87.8%) and 66 (89.2%) were located in association with known genes and CGIs, respectively. Several of the identified DMRs, including 07X2, OSR1 and GHSR, have previously been reported differentially methylated in LAC. The DMRs were sorted according to the probe with highest differential methylation score in each region and the asterisks denote the DMRs that were selected for further validation. Validation of candidate DNA methylation biomarkers
To evaluate our findings potential for clinical application, we selected 18 DMRs based upon highest differential methylation score and the number of probes targeting the region to undergo validation in a LAC validation cohort, comprising 52 primary lung tumors, 24 paired distant metastases (20 brain and 4 adrenal gland) and 32 tumor- adjacent normal lung samples. The histological and clinical characteristics of the patients are shown in Table 4. The array data showed that 16 of the 18 selected DMRs were hypermethylated and two regions, FRG1BP and CTAGE15, were hypomethylated in primary tumors compared to tumor-adjacent normal lung tissue. We then assessed the methylation status of each of the 18 candidate regions in our patient samples using Methylation-Sensitive High Resolution Melting (MS-HRM) analysis. The genomic
location of the MS-HRM assays is shown in Table 1 and the technical specifications for each assay are indicated for each of the oligonucleotide sequences below. The results of the MS-HRM-based methylation assessment are summarized in Table 2 and displayed as stacked bar percentage plots in Fig. 1. Using this approach, we were able to confirm a significant increase in methylated templates in the tumor samples for 15 of the 18 selected DMRs corresponding to the hypermethylation indicated by the array analysis. Normalized melting curves for representative tumor and normal lung samples are shown in Fig. 2 for the HOXD3, OSR1 and HIST1H3E MS-HRM assays, where the gain in methylation is seen as a relative shift in the melting curves towards the 100% methylated standard. We were unable to confirm the array results for three DMRs, including one hypermethylated region, LY75-CD302, and both hypomethylated regions, FR1GB and CTAGE15, as shown in Fig. 1j, 1 m and 1 n. For 12 of the 15 DMRs with concordant MS-HRM and array results, the difference in methylation frequency between tumor and normal lung tissue was very pronounced as indicated by the p- values (p<0.0001 ) in Fig. 1 . As an example, an elevated methylation level was detected in 75% of the tumor samples and 0% of the normal lung samples for the HOXB3/HOXB4 MS-HRM assay, as shown in Fig. 1 c, and in 87.9% of the tumor samples and only 3.2% of the normal lung samples for the OSR1 MS-HRM assay shown in Fig. 1 i. A high methylation frequency was also observed in the brain and adrenal gland metastases for all 15 confirmed DMRs. For the majority of the DMRs, the detected increase in methylation was even more prominent in the metastases compared to primary tumors, but due to the considerable difference in average tumor content between the primary tumors and metastases samples, these groups are not directly comparable. In conclusion, we have identified and validated 15 DMRs that can be targeted as novel biomarkers in LAC.
The candidate DNA methylation biomarkers and metastases formation in LAC
The presence of metastatic disease greatly reduces the overall survival of LAC patients. To determine if hypermethylation of the 15 identified candidate biomarkers are predictive of metastases formation in LAC, we compared the methylation status of the 15 DMRs between metastases-free patients with a minimum of 5 years recurrence-free survival, and patients that suffered from distant metastatic disease at the time of diagnosis. Three of the 15 DMRs, HOXB3/HOXB4, LOC648987 and HOXA5, showed a trend towards an increase in methylation in the metastasizing tumors as illustrated in Fig. 3a-c. At the HOXB3/HOXB4 region, we detected increased methylation in 15 of 24
(62.5%) non-metastasizing tumors compared to in 21 of 24 (87.5%) metastasizing tumors. Similarly, increased methylation in 3 out of 21 (14.3%) non-metastasizing tumors and in 8 out of 24 (33.3%) metastasizing tumors were detected for the
LOC648987 region, as well as in 19 out of 26 (73.1 %) non-metastasizing tumors compared to 23 out of 26 (88.4%) metastasizing tumors for the HOXA5 region. For all three DMRs, a similar increase in methylation was detected in the paired brain and adrenal gland metastases, as shown in Table 2.
Evaluation of the clinical potential of the candidate DNA methylation biomarkers Sensitivity and specificity are the most important parameters when describing the potential diagnostic applicability of a biomarker. In order to calculate these values, we determined an unambiguous consensus for when a sample was considered methylation positive or negative for each assay. MS-HRM is a semi-quantitative method capable of determining the relative amount of methylated alleles in a sample, and we therefore determined a specific cutoff value for each of the assays based on the relative amount of methylated alleles that is detected. For each potential cutoff value, we calculated the corresponding sensitivity and specificity, and the cutoff was then set to achieve maximal sensitivity without compromising a specificity limit of 0.8. The determined cutoff value, sensitivity and specificity for each candidate biomarker are shown in Table 3. For 9/15 assays, the cutoff was set at 1 % methylation and all samples containing more than 1 % methylated templates were therefore considered positive. Similarly, the cutoff was set at >10% methylation for the HOXD3,
Chr1 (q21.1 ).A and GHSR assays and at >50% methylation for the HOXA3, HOXA5 and HIST1H3E assays. Using this approach, we achieved a specificity of >0.90 for all assays, except HOXD10, and 9/15 assays reached a specificity of 1 .00, which translates into a false positive rate of 0%. The sensitivity ranged from 0.12 to 0.98 with 8/15 assays demonstrating a sensitivity of >0.75.
While it is possible to successfully employ cutoff values when using MS-HRM analysis, it is still preferential to use biomarkers that do not show any methylation in the corresponding normal tissue, as this allows for more accurate and stringent analyses. To identify the most promising candidate biomarkers, we therefore applied a lower sensitivity limit of 0.75 and a lower specificity limit of 0.90 to the assays with a cutoff of >1 % methylation. Three candidate biomarkers fulfilled these criteria, OSR1, SIM1 and HOXB3/HOXB4, which are indicated in bold print in Table 3. The OSR1 assay
demonstrated a sensitivity of 0.98 and a specificity of 0.97 and correctly identified 97% of all tested samples. The SIM1 assay correctly identified 91 % of the samples and showed a sensitivity of 0.92 and a specificity of 0.90 and similarly, the HOXB3/HOXB4 assay provided a sensitivity of 0.75 and a specificity of 1 .00 and therefore correctly identified 85% of the tested samples. The OSR1, SIM1, HOXB3/HOXB4 DMRs therefore show high clinical potential as biomarkers in LAC.
Discussion
Lung cancer has the highest mortality rates among cancers, but the prognosis for the individual patient varies considerably depending on the stage at which the disease is diagnosed. Efficient diagnostic tools that allow early and accurate disease detection are therefore of critical importance in clinical lung cancer management. Compelling evidence supporting the utility of methylation biomarkers in various aspects of cancer management, such as risk assessment, disease detection and personalization of treatment, has accumulated during the last decades. In this study, we aimed to identify and validate novel DMRs in LAC that can be targeted as biomarkers. Using a microarray-based genome-wide methylation screening approach, we identified 74 genomic regions that demonstrated differential methylation in tumor and tumor- adjacent normal lung tissue. Eighteen DMRs were selected for validation by MS-HRM analysis and we were able to confirm differential methylation in 15/18 DMRs. This yields a true positive rate of 83.3%, which indicates that the obtained microarray data is of high quality, but also emphasizes the importance of a thorough validation process when performing a microarray-based genome-wide methylation screening study. The validity of the data is furthermore supported by the fact that we were able to confirm differential methylation for the HIST1H3E region, which is number 61 out of the 74 identified DMRs when sorted by highest differential methylation score. The HIST1H3E region was only targeted by 3 probes and only showed a differential methylation score of 2.961 and we still confirmed a significant increase in methylation (p<0.0001 ) in the tumor samples as illustrated in Fig. 1 r and Fig. 2e and f. Moreover, Rauch et al.
recently published a similar methylation screening study using 8 LAC patient samples and there are several overlaps in the identified DMRs, which serves to confirm the validity of both studies.
Most of the DMRs identified in our study showed hypermethylation in LAC. In fact, 85.1 % of all of the 74 identified DMRs and all 15 confirmed DMRs were
hypermethylated. This overrepresentation of hypermethylated DMRs can be explained by the fact that the microarray used in our study is designed to specifically target CpG islands and reference gene promoter regions, which are known to frequently undergo de novo DNA methylation during tumorigenesis.
In order for a biomarker to be clinically relevant, it needs to be capable of distinguishing cancerous from healthy tissue with high sensitivity and specificity, as well as deliver unambiguous results. MS-HRM analysis allows implementation of assay-specific cutoff values, which can be useful when investigating methylation changes in regions with frequent low-level methylation in the surrounding non-cancerous tissue. However, the use of assay-specific cutoff values is challenging for clinical purposes, as a tumor- related increase in methylation can be easily masked by the normal methylation level in contaminating normal cells, which are inevitably present in surgical resections and biopsies. The tumor cell content in clinical specimens vary extensively between samples and the biomarker assessment assays therefore require a high dynamic range in order to successfully test samples with both high and low tumor content and this is difficult to achieve when introducing higher cutoff values. While this can be overcome through macro- or microdissection of each specimen prior to biomarker assessment, it greatly reduces the time-efficiency and increases the cost of the individual experiment and thus limits the clinical potential of a candidate biomarker. It is therefore highly preferential to target regions that do not show methylation in normal tissue, as any increase in methylation, regardless of the magnitude, can be attributed to the presence of cancerous cells regardless of the tumor content in the clinical specimen. We have evaluated the sensitivity and specificity for the MS-HRM assays targeting the 15 confirmed DMRs and identified the OSR1, SIM1 and HOXB3/HOXB4 regions, which all showed minor to no methylation in the surrounding normal tissue, as the most promising biomarkers in LAC. The OSR1 region demonstrated a remarkably high sensitivity and specificity, as we detected hypermethylation in 97.9% of the LAC tumors (n=48) and only in 3.2% of the tumor-adjacent normal lung samples (n=31 ). The odd- skipped related 1 (OSR1) gene encodes a zinc-finger transcription factor that was recently shown to function as a tumor suppressor in gastric cancer by activating TP53 transcription ENREF_22. Furthermore, OSR1 was shown to be silenced by promoter hypermethylation in 51.8% (n=164) of gastric cancer patients and was identified as an independent predictor of poor survival. Rauch et al. also reported OSR1
hypermethylation in 100% of the LACs (n=8) tested in their study, which underlines the potential of the region as a diagnostic biomarker in LAC. The single-minded homolog 1 (SIM1) region also showed high potential as a biomarker in LAC. SIM1 is frequently methylated in astrocytoma and breast cancer, but this study is the first to describe hypermethylation in lung cancer. A substantial subset of the 74 identified DMRs, including 5/15 of the validated DMRs, HOXD3, HOXB3/HOXB4, HOXD10, HOXA3 and HOXA5, were associated with homeobox genes. Hypermethylation of homeobox genes is a common observation in genome-wide methylation screening studies and have been reported in several cancers, including lung cancer. While the homeobox genes lack tumor subtype specificity, they may still be useful in combination with other diagnostic biomarkers in LAC. The HOXB3/HOXB4 region showed a tendency towards increased methylation in metastasizing compared to non-metastasizing tumors. We detected higher methylation levels in the metastases compared to the primary tumors for the majority of the DMRs. These results show that the hypermethylation observed in the primary tumors is maintained during the metastatic process, and suggests that it may play an important role in LAC development and progression.
This study was performed using tumor-adjacent normal lung tissue as a control due to the limited availability of lung tissue from healthy individuals. We were able to validate differential methylation in 15 DMRs, but it is possible that the low-level methylation that is observed in a small subset of the normal samples for several of the regions, e.g. HOXD10, HOXD3 and SIM1, is a result of the use of tissue, which have been exposed to the same environmental factors as the tumor tissue. It would therefore be highly relevant to investigate if the low-level methylation observed in these regions is present in normal lung tissue from healthy individuals as well. If this is not the case, then several of the DMRs that were excluded as a result of too low specificity, in particular HOXD3 and HOXD10, will hold a high clinical potential as well.
The Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI and OTX2 regions all demonstrated a specificity of 1 .0, but they were excluded due to their lower sensitivity of 0.67, 0.63 and 0.51 , respectively. It would therefore be interesting to investigate the clinical potential of these regions in combination with the three most promising regions, OSR1, SIM1 and HOXB3/HOXB4, in a larger LAC cohort.
This study has focused on the discovery of differentially methylated regions between primary tumor and tumor-adjacent normal lung tissue and the identified candidate
biomarkers can therefore potentially be applied diagnostically to separate malignant tumors from benign conditions of the lung where biopsy is indicated. Similarly, all primary tumors used in this study were adenocarcinomas and it would therefore be interesting to determine if the candidate biomarkers are specific for this subtype of lung cancer, as any candidate biomarkers with such specificity may be useful diagnostic tools for tumor sub classification. Furthermore, it would be highly relevant to investigate if the candidate biomarkers can be detected in non-invasive patient samples, such as blood or expectorates, as this would allow them to be used in screening programs of high-risk individuals, such as patients suffering from chronic obstructive pulmonary disease (COPD) and thus enable early disease detection. In conclusion, this study has identified 74 DMRs in LAC through a genome-wide methylation screening and confirmed significant changes in methylation for 15 selected regions in a LAC patient cohort using MS-HRM analysis. These 15 DMRs can be targeted as novel diagnostic biomarkers in LAC.
Table 1 : DMRs selected for validation by MS-HRM analysis
MS-HRM assay location relative to known genes
ID DMR
and CpG Islands (CGI)
Diff.
Diff. Methylati
Methyl on Down
No ation Hypo (-) / Upstr Intrage strea Location Score Hyper (+) earn nic m CGI (Hg38)
chr14:
OTX2 1 9.130 + OTX2 Island 56809871- 56809962 chr2:
17616267
HOXD HOXD
2 8.830 + Shore 6- 3 3
17616274
7
HOXB chr17:
HOXB
3/HOX 3 7.625 + HOXB4 Shore 48577876- 3
B4 48577988 chr2:1761
HOXD HOXD1 . . . 17574- 4 7.318 + Q Island
10 1761 1765
1 chr1 :1470
Chr1 (
80162- q21.1 ) 5 6.765 + Island
14708026
.A 0
HIST1 HIST1
chr6:2627 H3G/ H3G,
6 6.749 + Island 2252- HIST1 HIST1
26272379 H2BI H2BI
chr3:1724
48360-
GHSR 7 6.585 GHSR Island
17244844 7 chr6:1004 65031-
SIM1 8 6.191 SIM1 Island
10046512 5 chr2:1935
OSR1 9 6.1 17 OSR1 Island 7150-
19357251 chr20:
FRG1 FRG1B
10 5.838 Island 30377475- BP P
30377573 chr6:2820
Chr6(
13 5.653 Island 7550- p22.1 )
28207669 chr7:
HOXA HOXA
15 5.435 HOXA3 Island 27124232- 3 4
27124343 chr2:
LY75- LY75- 15979775
CD30 17 5.223 CD302, LY75 Island 6- 2 CD302 15979783
5 chr7:
14357168
CTAG CTAG
18 5.217 5- E15 E15
14357179 7 chr5:
LOC6 ANXA LOC64
19 4.790 Island 43040396- 48987 2R 8987
43040521
HIST1
chr6:2781 H2AJ/ HIST1 HIST1H
21 4.635 Island 4577- HIST1 H2BM 2AJ
27814732 H2BM
chr7:
HOXA HOXA
22 4.522 HOXA5 Island 27143474- 5
27143563 chr6:2622
HIST1 HIST1H
61 2.961 Island 5275- H3E 3E
26225421
Table 2: DNA methylation frequencies in tumor-adjacent normal lung and primary lung tumors.
ID Normal Lung Lung Tumor
Methylation Level Methylation Level
50- Tu
1 - 10- 100 1 - 10- 50- mor
0-1 % N 0-1 % N
10% 50% % 10% 50% 100% vs. n (%) n (%)
n (%) n (%) n (%) n (%) n (%) n (%) (%) Nor
(%) mal
51 <0.
31 0 0 0 31 25 26 0 0
OTX2 (10 000
(100) (0) (0) (0) (100) (49.0) (51.0) (0) (0)
0) 1
50 <0.
HOX 23 7 1 0 31 0 7 34 9
(10 000 D3 (74.2) (22.6) (3.2) (0) (100) (0) (14.0) (68.0) (18.0)
0) 1
HOX 48 <0.
31 0 0 0 31 12 16 19 1
B3/H (10 000
(100) (0) (0) (0) (100) (25.0) (33.3) (39.6) (2.1 )
OXB4 0) 1
51 <0.
HOX 26 6 0 0 32 5 27 18 1
(10 000 D10 (81.3) (18.7) (0) (0) (100) (9.8) (52.9) (35.3) (2.0)
0) 1
Chr1 ( 48 <0.
15 15 0 0 30 0 10 32 6
q21.1 (10 000
(50.0) (50.0) 66.7) (12.5)
).A (0) (0) (100) (0) (20.8) (
0) 1
HIST
1H3G 52 <0.
32 0 0 0 32 19 27 6 0
/HIST (10 000
(100) (0) (0) (0) (100) (36.5) (51.9) (1 1.6)
1H2B (0)
0) 1
I
48 <0.
GHS 1 1 19 1 0 31 2 6 29 1 1
(10 000 R (35.5) (61.3) (3.2) (0) (100) (4.2) (12.5) (60.4) (22.9)
0) 1
49 <0.
28 3 0 0 31 4 22 21 2
SIM1 (10 000
(90.3) (9.7) (0) (0) (100) (8.2) (44.9) (42.8) (4.1 )
0) 1
48 <0.
30 1 0 0 31 1 12 30 5
OSR1 (10 000 (96.8) (3.2) (0) (0) (100) (2.1 ) (25.0) (62.5) (10.4)
0) 1
52
FRG1 0 0 32 0 32 0 4 48 0 0.15
(10
BP (0) (0) (100) (0) (100) (0) (7.7) (92.3) (0) 9
0)
Chr6( 49 <0.
32 0 0 0 32 16 13 16 4
p22.1 (10 000
(100) (0) (0) (0) (100) (32.6) (26.6) (32.6) (8.2)
) 0) 1
50 <0.
HOX 0 14 15 0 29 0 3 40 7
(10 000 A3 (0) (48.3) (51.7) (0) (100) (0) (6.0) (80.0) (14.0)
0) 1
LY75- 48
30 1 0 0 31 47 1 0 0 >0. CD30 (10
(96.8) (3.2) 97.9) (2.1 )
2 (0) (0) (100) ( (0) (0) 999
0)
32 52
CTA 0 0 0 32 0 0 0 52 >0.
(10 (10
GE15 (0) (0) (0) (100) (0) (0) (0) (100) 999
0) 0)
LOC6 45
29 0 0 0 29 34 7 4 0 0.00
4898 (10
(100)
7 (0) (0) (0) (100) (75.5) (15.6) (8.9) (0) 5
0)
HIST
51
1H2A 32 0 0 0 32 40 1 1 0 0 0.00
(10
J/HIS (100) (0) (0) (0) (100) (78.4) (21.6) (0) (0) 5 T1H2 0)
BM
Table 3: Performance of MS-HRM assays
H3G/HI 32 0 32 19 33 52
>1 % 0.63 1.00
ST1H2 (100) (0) (100) (36.5) (63.5) (100)
Bl
31 0 31 25 26 51
OTX2 >1 % 0.51 1.00
(100) (0) (100) (49.0) (51.0) (100)
LOC64 29 0 29 34 1 1 45
>1 % 0.24 1.00
8987 (100) (0) (100) (75.5) (24.5) (100)
HIST1
H2AJ/ 32 0 32 40 1 1 51
>1 % 0.22 1.00
HIST1 (100) (0) (100) (78.4) (21.6) (100)
H2BM
HOXA 29 0 29 43 7 50
>50% 0.14 1.00
3 (100) (0) (100) (86.0) (14.0) (100)
HOXA 31 0 31 46 6 52
>50% 0.12 1.00 5 (100) (0) (100) (88.4) (1 1.6) (100)
Table 4: Clinical characteristics for the patient cohorts
Clinical Characteristics LAC LAC w/o metastases w/ metastases
Patients (n)
Cases 26 26
Cases with available metastatic tissue 0 24
Gender (n (%))
Male 10 (38.5%) 12 (46.2%) Female 16 (61 .5%) 14 (53.8%)
Age (years)
Min-Max (Average) 45-76 (62.1 ) 38-76 (61 .2)
Smoking status (n (%))
Current Smoker 18 (69.2%) 16 (61 .5%) Previous Smoker 8 (30.8%) 8 (30.8%) Unknown 0 (0%) 2 (7.7%)
TNM Classification (n (%))
TX 0 (0%) 1 (3.8%)
T1 9 (34.7%) 9 ((34.7%)
T2 16 (61 .5%) 16 (61 .5%)
T3 0 (0%) 0 (0%)
T4 1 (3.8%) 0 (0%)
NX 0 (0%) 2 (7.7%)
NO 21 (80.8%) 12 (46.2%)
N1 5 (19.2) 3 (1 1 .4%)
N2 0 (0%) 9 (34.7%)
N3 0 (0%) 0 (0%)
MO 26 (100%) 0 (0%) M1 0 (0%) 26 (100%)
Tumor content (%)
Tumors, Min-Max (Average) 5-60% (27.0%) 5-80% (39.4%) Metastases, Min-Max (Average) 5-90% (68.7%)
Methods
Patient samples
The regional ethical committee (De Videnskabsetiske Komiteer Region Midtjylland, Permission No.: 1 -10-72-20-14) approved this study and all experiments were performed in accordance with the approved guidelines and regulations. Formalin-fixed, paraffin embedded (FFPE) blocks of surgical resections from lung adenocarcinoma (LAC) patients were selected from the archives at the Institute of Pathology, Aarhus
University Hospital. Primary lung tumor and tumor-adjacent normal lung tissue from four LAC patients were used for the microarray analysis. For validation, 52 LAC patients were selected. These patients were divided into two cohorts matched on gender, age, smoking status, histology, T-stage and proportion of tumor cells. The first cohort comprises FFPE primary tumor and paired metastatic tissue (20 brain and 4 adrenal gland) from 26 patients that suffered from distant metastatic disease at the time of diagnosis. The second cohort comprises FFPE primary tumor tissue from 26 distant metastases-free patients with a minimum of 5 years recurrence-free survival following surgical resection. The histological and clinical characteristics for the 52 patients are shown in Table 4. FFPE tumor-adjacent normal lung tissue was selected by an experienced pathologist from 32 LAC patients and used as a control cohort. Peripheral blood samples obtained from healthy medical students of both sexes were used to generate unmethylated control DNA. Written informed consent was obtained from the subjects.
DNA extraction and Sodium Bisulfite treatment
For each FFPE sample, DNA was extracted from 5 χ 10 μηη sections using the
QIAamp DNA FFPE Tissue Kit (Qiagen, Hilden, Germany) according to the
manufacturer's protocol. DNA was extracted from the peripheral blood (PB) samples using a modified salt precipitation protocol. In brief, 10 ml blood was incubated for 30 min at 4°C with 40 ml Triton lysis buffer (1 % Triton X-100, 10 mM Tris, 0.32 M sucrose, 5 mM MgCI2) and spun for 30 min at 3-4000 rpm (4°C). The supernatant was then removed and the nuclei were washed using 0.9% NaCI. After a 10 min spin 2300 rpm, the remaining supernatant was discarded and the nuclei were lysed using 3 ml nuclei lysis buffer (24 mM EDTA, 75 mM NaCI), 230 μΙ 10% SDS and 25 μΙ pronase
(20mg/ml) and left shaking at room temperature over night. For each 3 ml nuclei lysis buffer, 1 ml saturated NaCI (6 M) was added and the mix was vigorously shaken for 15 sec. The supernatant was collected after a 15 min spin at 3000 rpm (4°C) and transferred to a new tube. After an additional 15 min spin at 3000 rpm (4°C), isopropanol was added (1 : 1 ) to the supernatant and gently shaken until the DNA precipitated. The precipitated DNA was then collected mechanically using a blunt end glass rod and transferred to a tube containing 400 μΙ double-distilled H20. DNA concentrations were measured using a NanoDrop 1000 spectrophotometer (Thermo Scientific, Waltham, MA, USA). For MS-HRM analysis, 500 ng genomic DNA from each sample was subjected to sodium bisulfite treatment using the EZ-96 DNA Methylation-
Gold™ kit (Zymo Research, Irvine, USA) according to the manufacturer's instructions and eluted in a final volume of 52 μΙ.
Microarray Analyses
The microarray based screening for differentially methylated regions (DMRs) was performed as previously described34. Briefly, DNA was extracted from primary tumor and tumor-adjacent normal lung tissue from four LAC patients. After DNA extraction, a methylated DNA immunoprecipitation (MeDIP) was performed in order to enrich the methylated fragments. A detailed description of the MeDIP protocol can be found in34. Two fractions from each sample (MeDIP enriched and input) were subsequently labeled with Cy5 and Cy3 and cohybridized to the NimbleGen Human DNA Methylation 3x720K CpG Island Plus RefSeq Promoter Array (Roche/NimbleGen, Madison, Wl, USA). The arrays were processed using NimbleScan software (Roche/NimbleGen, Madison, Wl, USA) to generate log2 signal ratios for each probe. The ratios were then averaged within each group (tumor and normal lung) and subsequently processed by the NimbleScan software to generate a relative enrichment score for each group. The enrichment scores for each group were then subtracted to produce a differential methylation score indicating an enrichment or depletion of signal in the tumor group relative to the normal lung group. Hence, negative and positive differential methylation scores indicate potentially hypo- and hypermethylated loci in lung cancer, respectively. A threshold of 2 was applied to the differential methylation score. A large fraction of the probes with differential methylation scores > 2 were located in close proximity and the probes that were located within 5000 bp of each other were therefore grouped into differentially methylated regions (DMRs) with at least two probes targeting each DMR. Eighteen DMRs were then selected for validation based on the differential methylation score and the number of probes that mapped to the region. Previously undescribed and hypermethylated regions were prioritized. All validation experiments were performed using MS-HRM.
Methylation-Sensitive High-Resolution Melting (MS-HRM)
Validation of the 18 potential DMRs was performed by MS-HRM analysis35,36. The LightCycler® 480 platform (Roche, Mannheim, Germany) was used for PCR and HRM, and each reaction comprised 1 χ MeltDoctor™ HRM Master Mix (Life Technologies,
Carlsbad, CA, USA), 3 mM MgCI2, 500 nM of each primer and 10 ng of bisulfite modified DNA in a final volume of 10 μΙ. All primers were designed to amplify both methylated and unmethylated DNA as described by Wojdacz et al.37. The methylation status of each DMR was determined by comparing the melting profiles of each sample with a standard dilution series of fully methylated DNA (Universal Methylated Human DNA Standard, Zymo Research, Irvine, CA USA) into unmethylated DNA, which was generated by subjecting DNA extracted from PB to whole genome amplification (WGA) using the lllustra GenomiPhi V2 DNA Amplification Kit (GE Healthcare Life Sciences, Piscataway, NJ, USA) according to the manufacturer's instructions. All analyses were performed in duplicates. The technical specifications for each of the 18 assays, including the genomic location of the used primers, PCR cycling and HRM protocol, as well as melting profiles of the standards are included as Supplementary Information.
Statistical analyses and calculation of sensitivity and specificity
Statistical analyses were done using GraphPad Prism version 6 software (GraphPad Software, La Jolla, CA, USA). A Mann-Whitney Test of Ranks was used to assess the statistical significance for each DMR. To perform this test, all samples were ranked based on the determined level of methylation for each DMR; 0-1 % methylation was ranked 1 , 1 -10% methylation was ranked 2, 10-50% methylation was ranked 3 and 50- 100% methylation was ranked 4. Two-tailed p-values < 0.05 were considered statistically significant. To evaluate the clinical potential of the candidate biomarkers, we calculated the sensitivity and specificity for each region. The sensitivity was calculated as True Positives / (True Positives + False Negatives) and the specificity was calculated as True Negatives / (True Negatives + False Positives).
Sequences
Assay 1 : OTX2
PCR cycling and HRM protocol for the OTX2 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 61°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 92 bp.
Genomic Location Hg38: Chr14: 56809871 -56809962
SEQ ID NO: 1 :
GTAATAACGATCGTTGCAAAAAGAAAAATGTGATCTAGAGATGAGAGCGGTAGTGGGAGA I I I I I I I ++ I I ++ I I I : I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I ++ I I I I I I I I I I I
GTAATAACGATCGTTGTAAAAAGAAAAATGTGATTTAGAGATGAGAGCGGTAGTGGGAGA
GAGGCAGAGAGCGTGCTCCTGGGGGTCGTCGCTTCTGCAAAACGTCGTCGAAACGCTGCG I I I I : I I I I I I ++ I I : I : : I I I I I I I ++ I ++ : I I : I I : I I I I ++ I ++ I ++ I I I ++ : I I ++
GAGGTAGAGAGCGTGTTTTTGGGGGTCGTCGTTTTTGTAAAACGTCGTCGAAACGTTGCG
AATGTAATCTGGGGTGTTTTGGAAGGTTTTGTTTGTGGTTTTGTTTTTATGTCAACGCCG I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I ! ++ : ++
AATGTAATTTGGGGTGTTTTGGAAGGTTTTGTTTGTGGTTTTGTTTTTATGTTAACGTCG (SEQ ID NO : 19)
MS-HRM Primers:
SEQ ID NO: 37: OTX2 F: 5' - GAG CGG TAG TGG GAG AGA GG- SEQ ID NO: 38: OTX2 R: 5' - CAC CCC AAA TTA CAT TCG CAA C Assay 2: HOXD3
PCR cycling and HRM protocol for the HOXD3 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 62°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 72 bp.
Genomic Location Hg38: Chr2: 176162676-176162747
SEQ ID NO: 2 :
ACCCGGGGCCGCCACCCCCTTATGCCTCTAGCTAGATTTTTCTTTTTCCAGCCGCGGAGG
I : : ++ l I I : ++ : : I : : : : : I I I I I : : I : I I I : I I I I I I I I I : I I I I I : : I I : ++++ ! I I I
ATTCGGGGTCGTTATTTTTTTATGTTTTTAGTTAGATTTTTTTTTTTTTAGTCGCGGAGG
AACAGGGTAAGTTTGCGCCTGGGGGTTCCGGGGTGCGCGGTGCGCTTTGAGCTCTTGGCG I I : I I I I I I I I I I I I ++ : : I I I I I I I I : ++ l I I I I ++++ I I I ++ : I I I I I I : I : I I I I ++
AATAGGGTAAGTTTGCGTTTGGGGGTTTCGGGGTGCGCGGTGCGTTTTGAGTTTTTGGCG
TAAGAGGCTTGGGAAGAAGAAAGGAAAGAGGACCCCAAGTTAACCAAAGTTGGACCACCA I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : : : : I I I I I I I : : I I I I I I I I I : : I : : I
TAAGAGG GGGAAGAAGAAAGGAAAGAGGA AAG AA AAAG GGA A A (SEQ ID
NO : 20)
MS-HRM Primers:
SEQ ID NO: 39: HOXD3 F: 5' - CGG AGG AAT AGG GTA AGT TTG- 3'
SEQ ID NO: 40: HOXD3 R: 5' - CTC TTA CGC CAA AAA CTC AAA AC- 3'
Assay 3: HOXB3/HOXB4
PCR cycling and HRM protocol for the HOXB3/HOXB4 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 54°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 1 13 bp.
Genomic Location Hg38: Chr17: 48577876-48577988 SEQ ID NO: 3:
GGGCCCTCCTCCCGGAGCCCGGCCAGCGCTGCGAGGCGGTCAGCAGCAGCCCCCCGCCGC I I I : : : I : : I : : ++ I I I : : ++ I : : I I ++ : I I ++ I I I ++ I I : I I : I I : I I : : : : : ++ : ++ :
GGGTTTTTTTTTCGGAGTTCGGTTAGCGTTGCGAGGCGGTTAGTAGTAGTTTTTCGTCGT CTCCCTGCGCCCAGAACCCCCTGCACCCCAGCCCGTCCCACTCCGCGTGCAAAGAGCCCG
: I : : : I I ++ : : : I I I I : : : : : I I : I : : : : I I : : ++ I : : : I : I : ++++ ! I : I I I I I I : : ++
TTTTTTGCGTTTAGAATTTTTTGTATTTTAGTTCGTTTTATTTCGCGTGTAAAGAGTTCG
TCGTCTACCCCTGGATGCGCAAAGTTCACGTGAGCACGGGTGAGTGCGTGGGCACCCCTT
I ++ I : I I : : : : I I I I I I ++ : I I I I I I : I ++ I I I I : I ++ I I I I I I I I ++ I I I I : I : : : : I I
TCGTTTATTTTTGGATGCGTAAAGTTTACGTGAGTACGGGTGAGTGCGTGGGTATTTTTT ( SEQ ID NO : 21)
MS-HRM Primers:
SEQ ID NO: 41 : HOXB3/HOXB4 F: 5' - AGG CGG TTA GTA GTA GTT T - 3'
SEQ ID NO: 42: HOXB3/HOXB4 R: 5' - AAC TTT ACG CAT CCA AAA ATA AAC
Assay 4: HOXD10
PCR cycling and HRM protocol for the HOXD10 MS-HRM assay; 1 cycle of 95°C for 10 minutes.
1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 54°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 78 bp.
Genomic Location Hg38: Chr2: 1761 17574-1761 17651
SEQ ID NO: 4 :
CTGCCTGGCTGAGGTCTCCGTGTCCAGTCCCGAAGTGCAGGAGAAGGAAAGCAAAGGTCG
: I I : : I I I : I I I I I I : I : ++ l I I : : I I I : : ++ l I I I I : I I I I I I I I I I I I I : I I I I I I ++
TTGTTTGGTTGAGGTTTTCGTGTTTAGTTTCGAAGTGTAGGAGAAGGAAAGTAAAGGTCG
GTATGAGCAGAGTTGCCACCCCAGCGGGGCGCGCAGCCCGGGAACCCGGCAGAGAGGGAG I I I I I I I : I I I I I I I : : I : : : : I I ++ I I | ++++ : I I : : ++ I I I I : : ++ I : I I I I I I I I I I
GTATGAGTAGAGTTGTTATTT AGCGGGGCGCG AGTTCGGGAATTCGGTAGAGAGGGAG
TGCCGGGGTGCCCAGCGCCGAGCCGGAGCCCGACTTGGCAGGTGCTGCTCCGCCTGGTTT
I I : ++ I I I I I : : : I I ++ : ++ I I : ++ I I I : : ++ I : I I I I : I I I I I : I I : I : ++ : : I I I I I I
TGTCGGGGTGTTTAGCGTCGAGTCGGAGTTCGATTTGGTAGGTGTTGTTTCGTTTGGTTT ( SEQ ID NO: 22)
MS-HRM Primers:
SEQ ID NO: 43: HOXD10 F: 5' - TAA AGG TCG GTA TGA GTA GAG TTG TT - 3' SEQ ID NO: 44: HOXD10 R: 5' - ACC CCG ACA CTC CCT CTC TA- 3'
Assay 5: Chr1 (q21.1 ).A
PCR cycling and HRM protocol for the Chr1 (q21 .1 ).A MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 56°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 101 bp.
Genomic Location Hg38: Chr1 : 147080162-147080262 SEQ ID NO: 5:
CCGACTCCCGCCTCCAAGCGCGGCTCTTGGCTCGCTGGCGGGCAGCGTCCACAGAGTGTG
: ++ I : I : : ++ : : I : : I I I ++++ I : I : I I I I : I ++ : I I I ++ I I : I I ++ I : : I : I I I I I I I I
TCGATTTTCGTTTTTAAGCGCGGTTTTTGGTTCGTTGGCGGGTAGCGTTTATAGAGTGTG GAACCGCCGCAGCCGCAGCTCCCGCCCGCTGGCGGGCAGACACTAGCAGGAGAAAGGACA I I I : ++ : ++ : I I : ++ : I I : I : : ++ : : ++ : I I I ++ I I : I I I : I : I I I : I I I I I I I I I I I : I
GAATCGTCGTAGTCGTAGTTTTCGTTCGTTGGCGGGTAGATATTAGTAGGAGAAAGGATA
CAAGGCCTGCGTGGTGGGAAAGCATGGGAGACCTCGCTTTCCCACCGGACGAGAAGGTCT
: I I I I : : I I ++ I I I I I I I I I I I : I I I I I I I I : : I ++ : I I I : : : I : ++ l I ++ I I I I I I I : I
TAAGGTTTGCGTGGTGGGAAAGTATGGGAGATTTCGTTTTTTTATCGGACGAGAAGGTTT ( SEQ ID NO: 23)
MS-HRM Primers:
SEQ ID NO: 45: Chr1 (q21 .1 ).A F: 5' - GGT AGC GTT TAT AGA GTG TGG AAT- 3' SEQ ID NO: 46: Chr1 (q21 .1 ).A R: 5' - TTT CCC ACC ACG CAA ACC TTA TA- 3'
Assay 6: HIST1 H3G/HIST1 H2BI
PCR cycling and HRM protocol for the HIST1H3G/HIST1H2BI MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 62°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a
temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 128 bp.
Genomic Location Hg38: Chr6: 26272252-26272379
SEQ ID NO: 6:
TACCTGGAGGCGGTGCTGGAGTACCTGACCGCCAAGATCCTGGAGTTGGCGGGCAACGCG I I : : I I I I I I ++ I I I : I I I I I I I : : I I I : ++ : : I I I I I : : I I I I I I I I I ++ I I : I I ++++
TATTTGGAGGCGGTGTTGGAGTATTTGATCGTTAAGATTTTGGAGTTGGCGGGTAACGCG
GCCCGAGACAAGAAGACCCGCGTCACCCCCCGACGCCTGCAGCTCGCCATCCACGACGAG
I : : ++ I I I : I I I I I I I : : ++++ I : I : : : : : ++ I ++ : : I I : I I : I ++ : : I I : : I ++ I ++ I I
GTTCGAGATAAGAAGATTCGCGTTATTTTTCGACGTTTGTAGTTCGTTATTTACGACGAG
AAGCTCAACAAGCTGTTGGGCAAAGTTACTATCGCGCAGGGCGGTGTCCTGCCCAATATT I I I : I : I I : I I I : I I I I I I I : I I I I I I I : I I | ++++ : I I I I ++ I I I I : : I I : : : I I I I I I
AAGTTTAATAAGTTGTTGGGTAAAGTTATTATCGCGTAGGGCGGTGTTTTGTTTAATATT ( SEQ ID
NO : 24)
MS-HRM Primers:
SEQ ID NO: 47: HIST1 H3G/HIST1 H2BI F: 5' - GAT CGT TAA GAT TTT GGA GTT GG - 3'
SEQ ID NO: 48: HIST1 H3G/HIST1 H2BI R: 5' - CGA TAA TAA CTT TAC CCA ACA ACT T - 3'
Assay 7: GHSR
PCR cycling and HRM protocol for the GHSR MS-HRM assay; 1 cycle of 95°C for 10 minutes.
1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 62°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 88 bp.
Genomic Location Hg38: Chr3: 172448360-172448447
SEQ ID NO: 7 :
GCTCGTCGCCCAGCGAGTCGTTGCCGGGGGAAGCATCCCAGTCCAGGTCGGCCAGTGTGA
I : I ++ I ++ : : : I I ++ I I I ++ I I I : ++ | I I I I I I : I I : : : I I I : : I I I I ++ I : : I I I I I I I
GTTCGTCGTTTAGCGAGTCGTTGTCGGGGGAAGTATTTTAGTTTAGGTCGGTTAGTGTGA
GGTTGAACCCCGGCTCTTCGCTGGGCGTCGCGTTCCACATGCTGCCGGCTCAGCTGAACA I I I I I I I : : : ++ | : I : I | ++ : I I I | ++ | ++++ | | : : | : | | | : | | : ++ | : | : | | : | | | | : |
GGTTGAATTTCGGTTTTTCGTTGGGCGTCGCGTTTTATATGTTGTCGGTTTAGTTGAATA
GGCTCTGGGACGTGACTGCGCTGGGAGGCTGGACCGAGCTGGCTCCCGAGGAGGTCCGCT
I I : I : I I I I I ++ I I I : I I ++ : I I I I I I I : I I I I : ++ l I : I I I : I : : ++ l I I I I I I : ++ : I
GGTTTTGGGACGTGATTGCGTTGGGAGGTTGGATCGAGTTGGTTTTCGAGGAGGTTCGTT ( SEQ ID NO: 25) MS-HRM Primers:
SEQ ID NO: 49: GHSR F: 5' - GTC GGT TAG TGT GAG GTT GAA TT- 3'
SEQ ID NO: 50: GHSR R: 5' - CAC GTC CCA AAA CCT ATT CAA CTA- 3'
Assay 8: SIM1
PCR cycling and HRM protocol for the SIM1 MS-HRM assay; 1 cycle of 95°C for 10 minutes.
1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 54°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 95 bp.
Genomic Location Hg38: Chr6: 100465031 -100465125
SEQ ID NO: 8 :
GCCCCCTCGCCTCCCATTGGCCTGGCGCGGCCAGGGGCGAGGCGCTCATTGGCCAATAGG
I : : : : : I ++ : : I : : : I I I I I : : I I I ++++ I : : I I I I I ++ I I I ++ : I : I I I I I : : I I I I I I
GTTTTTTCGTTTT ATTGGTTTGGCGCGGT AGGGGCGAGGCGTTT TTGGTTAATAGG
GCTGAGTGACACGAGTCGGCCCCGAGCCGCCCTCCGCGCGCCCAGGCTCCGGGCTCTGAA
I : I I I I I I I : I ++ I I I ++ I : : : ++ I I : ++ : : : I : ++++++ : : : | | | : | : ++ | | : | : | | | |
GTTGAGTGATACGAGTCGGTTTCGAGTCGTTTTTCGCGCGTTTAGGTTTCGGGTTTTGAA
TCTTACTACCCGCGGGACCGCTGGACTCCTAATGAGCTCAGGGTGCCGGGTCGGCGCTGT
I : I I I : I I : : ++++ | I I : ++ : I I I I : I : : I I I I I I I : I : I I I I I I : ++ I I I ++ I ++ : I I I
TTTTATTATTCGCGGGATCGTTGGATTTTTAATGAGTTTAGGGTGTCGGGTCGGCGTTGT (SEQ ID
NO : 26)
MS-HRM Primers:
SEQ ID NO: 51 : SIM1 F: 5' - GTT TAT TGG TTA ATA GGG TTG AGT GAT - 3' SEQ ID NO: 52: SIM1 R: 5' - ATC CCG CGA ATA ATA AAA TTC AAA ACC - 3'
Assay 9: OSR1
PCR cycling and HRM protocol for the OSR1 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 60°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 102 bp.
Genomic Location Hg38: Chr2: 19357150-19357251
SEQ ID NO: 9:
GCTAGGTGTCCCAGGTTCACAACCCCCTAGGGAAGAGAAGCGCTGGAGGGGACTAGCAGC
I : I I I I I I I : : : I I I I I : I : I I : : : : : I I I I I I I I I I I I I ++ : I I I I I I I I I : I I I : I l +
GTTAGGTGT TAGGT A AATTTTT AGGGAAGAGAAGCGTTGGAGGGGATTAGTAGC
GACCGGCGGCGTGTAGATGTTCCATCCCAGGCTCCTCGAGGTAATTGCTGTTTTATTAAG
+ l : ++ l ++ l ++ l I I I I I I I I I : : I I : : : I I I : I : : I ++ I I I I I I I I I : I I I I I I I I I I I I
GATCGGCGGCGTGTAGATGTTTTATTTTAGGTTTTTCGAGGTAATTGTTGTTTTATTAAG
ATTGGGGAATGGATCACCGAGGACGCAGCGGACATTTAAGTGCAGCCGGATGACCGCCTT I I I I I I I I I I I I I I : I : ++ l I I I ++ : I I ++ I I : I I I I I I I I I : I I : ++ l I I I I : ++ : : I I
ATTGGGGAATGGATTATCGAGGACGTAGCGGATATTTAAGTGTAGTCGGATGATCGTTTT (SEQ ID
NO : 27)
MS-HRM Primers:
SEQ ID NO: 53: OSR1 F: 5' - GCG TTG GAG GGG ATT AGT AG- 3'
SEQ ID NO: 54: OSR1 R: 5' - TCA TCC GAC TAC ACT TAA ATA TCC-
Assav 10: FRG1 BP
PCR cycling and HRM protocol for the FRG1B MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 61°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 99 bp.
Genomic Location Hg38: Chr20: 30377475-30377573
SEQ ID NO: 10 :
CCCCTCTCTGCACAGGCGCCAGGAACCGCGGTCCGGCCTCCGTCCAGCCCAGACAGGGTC
: : : : I : I : I I : I : I I I ++ : : I I I I I : ++++ ! I : ++ I : : I : ++ I : : I I : : : I I I : I I I I I :
TTTTTTTTTGTATAGGCGTTAGGAATCGCGGTTCGGTTTTCGTTTAGTTTAGATAGGGTT
AGAGCGAAGCCTGGGAGGCCACAAAGCCGGCTCTCTGCACCACGGCTTCCACCGGATTCG I I I I ++ I I I : : I I I I I I I : : I : I I I I : ++ I : I : I : I I : I : : I ++ I : I I : : I : ++ I I I I ++
AGAGCGAAGTTTGGGAGGTTATAAAGTCGGTTTTTTGTATTACGGTTTTTATCGGATTCG CGGGGGTGGAGTGCATCCGAAAAGAACTGAGGAGGCTCCCACCAGAGCTGCAGGACCCAG
++ I I I I I I I I I I I : I I : ++ l I I I I I I : I I I I I I I I : I : : : I : : I I I I : I I : I I I I : : : I I
CGGGGGTGGAGTGTATTCGAAAAGAATTGAGGAGGTTTTTATTAGAGTTGTAGGATTTAG ( SEQ ID NO : 28) MS-HRM Primers:
SEQ ID NO: 55: FRG1 B F: 5' - CGT TTA GTT TAG ATA GGG TTA G - 3'
SEQ ID NO: 56: FRG1 B R: 5' - CGA ATA CAC TCC ACC CCC - 3'
Assay 1 1 : Chr6(p22.1 )
PCR cycling and HRM protocol for the Chr6(p22.1 ) MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 57°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 120 bp.
Genomic Location Hg38: Chr6: 28207550-28207669
SEQ ID NO: 11 :
CACCCAAATCAGCTGCCCCTGTTTGCAGCCACGAGGGAGTCGGGAACCAAGTCCGGACCA
: I : : : I I I I : I I : I I : : : : I I I I I I : I I : : I ++ I I I I I I I ++ I I I I : : I I I I : ++ l I : : I
TATTTAAATTAGTTGTTTTTGTTTGTAGTTACGAGGGAGTCGGGAATTAAGTTCGGATTA
CCGTCTCCTAGAGAATTACATGTCCCTGCACGCACGGAGAATAAAGCCAGGAACGATAAG
: ++ l : I : : I I I I I I I I I I : I I I I : : : I I : I ++ : I ++ I I I I I I I I I I : : I I I I I ++ I I I I I
TCGTTTTTTAGAGAATTATATGTTTTTGTACGTACGGAGAATAAAGTTAGGAACGATAAG TGGTAAATTATAGGGAATTTGGGCGGTGGTGACATAGTTAACGCCATTCTCGCCCTGCTG I I I I I I I I I I I I I I I I I I I I I I I ++ I I I I I I I : I I I I I I I I ++ : : I I I : I ++ : : : I I : I I
TGGTAAATTATAGGGAATTTGGGCGGTGGTGATATAGTTAACGTTATTTTCGTTTTGTTG ( SEQ ID
NO: 29)
MS-HRM Primers:
SEQ ID NO: 57: Chr6(p22.1 ) F: 5' - CGA GGG AGT CGG GAA TTA AG
SEQ ID NO: 58: Chr6(p22.1 ) R: 5' - CAC CAC CGC CCA AAT TCC C -
Assav 12: HOXA3
PCR cycling and HRM protocol for the HOXA3 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 60°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 1 12 bp.
Genomic Location Hg38: Chr7: 27124232-27124343
SEQ ID NO: 12 :
ACGGATGCGCAGACAGTTGGTAATGTTCTGATTCACGCTGGGGAAGGCTGCAGAGATACC
I ++ I I I I ++ : I I I : I I I I I I I I I I I I I : I I I I I : I ++ : I I I I I I I I I : I I : I I I I I I I : :
ACGGATGCGTAGATAGTTGGTAATGTTTTGATTTACGTTGGGGAAGGTTGTAGAGATATT
ACAGGACGGGCGCGCGGCTTTGTTCAATTTTCCCGGCGTTCATAAATCACCCGCGCCGGG
I : I I I I ++ I I ++++++ I : I I I I I I : I I I I I I : : ++ I ++ I I : I I I I I I : I : : ++++ : ++ | |
ATAGGACGGGCGCGCGGTTTTGTTTAATTTTTTCGGCGTTTATAAATTATTCGCGTCGGG
CGAGCGAGGGAGCAAGCGAGCGCCAAAAACGCGGAGAGAGAGGCCACGGCGGCGGCGGCA
++ I I ++ I I I I I I : I I I ++ I I ++ : : I I I I I ++++ I I I I I I I I I I : : I ++ I ++ I ++ I ++ I : I
CGAGCGAGGGAGTAAGCGAGCGTTAAAAACGCGGAGAGAGAGGTTACGGCGGCGGCGG A (SEQ ID
NO: 30)
MS-HRM Primers:
SEQ I D NO: 59: HOXA3 F: 5' - ACG TTG GGG AAG GTT GTA GAG- 3'
SEQ I D NO: 60: HOXA3 R: 5' - TTA ACG CTC GCT TAC TCC CTC- 3'
Assay 13: LY75-CD302
PCR cycling and HRM protocol for the LY75-CD302 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 62°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 80 bp.
Genomic Location Hg38: Chr2: 159797756-159797835
SEQ ID NO: 13 :
TCCTCACCGCAGACCCCACTCCGCGGGGAGGGAACCCCCAAATTAGGCCAGTTGGCCGGA
I : : I : I : ++ : I I I : : : : I : I : ++++ | I I I I I I I I : : : : : I I I I I I I I : : I I I I I I : ++ I I
TTTT ATCG AGATTT ATTTCGCGGGGAGGGAATTTTTAAA AGGT AGTTGGTCGGA
GAACTGAGGGACTTGGAGTCGCACGACGGGCGCCGTTTCAGGGCAATTTCGGGCTGAAAT I I I : I I I I I I I : I I I I I I I ++ : I ++ I ++ I | ++ : ++ | I I : I I I I : I I I I I ++ I I : I I I I I I
GAATTGAGGGATTTGGAGTCGTACGACGGGCGTCGTTTTAGGGTAATTTCGGGTTGAAAT GAGAAGCGGGGACGTTGGTGGCGATTTCCCCTGCTGGTGCGCGGCCGGAGTGGGGTTGCT I I I I I I ++ I I I I ++ I I I I I I I ++ I I I I : : : : I I : I I I I I ++++ I : ++ I I I I I I I I I I I : I
GAGAAGCGGGGACGTTGGTGGCGATTTTTTTTGTTGGTGCGCGGTCGGAGTGGGGTTGTT ( SEQ ID NO: 31) MS-HRM Primers:
SEQ I D NO: 61 : LY75-CD302 F: 5' - GTT GGT CGG AGA ATT GAG GGA- 3'
SEQ I D NO: 62: LY75-CD302 R: 5' - CCC GCT TCT CAT TTC AAC CC- 3
Assay 14: CTAGE15
PCR cycling and HRM protocol for the CTAGE15 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 56°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature
increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 1 13 bp.
Genomic Location Hg38: Chr7: 143571685-143571797
SEQ ID NO: 14 :
CCGGGCTCCTCCATAGCGTCGAGGCTGCTCTGGCGGTCACCGCAGTAACACTGGCCACAA
: ++ l I : I : : I : : I I I I ++ I ++ I I I : I I : I : I I I ++ I I : I : ++ : I I I I I : I : I I I : : I : I I
TCGGGTTTTTTTATAGCGTCGAGGTTGTTTTGGCGGTTATCGTAGTAATATTGGTTATAA
CAAGCGGTGGAGAACACGCAGCCTTGGGTCTGGAACCCGAATGCGCACGTGACAACCAAC
: I I I ++ I I I I I I I I : I ++ : I I : : I I I I I I : I I I I I : : ++ l I I I ++ : I ++ I I I : I I : : I I :
TAAGCGGTGGAGAATACGTAGTTTTGGGTTTGGAATTCGAATGCGTACGTGATAATTAAT CGGAGCGGACCACTGTGGAGCGGGCTGCGGGGGGAGCTGGGGAACGCGGGCACCCACAGG
++ I I I ++ I I : : I : I I I I I I I ++ I I : I I ++ I I I I I I I : I I I I I I I ++++ I I : I : : : I : I I I
CGGAGCGGATTATTGTGGAGCGGGTTGCGGGGGGAGTTGGGGAACGCGGGTATTTATAGG ( SEQ ID NO: 32)
MS-HRM Primers:
SEQ ID NO: 63: CTAGE15 F: 5' - GGT TAT CGT AGT AAT ATT GGT TAT A
SEQ ID NO: 64: CTAGE15 R: 5' - CAA CCC GCT CCA CAA TAA TTC - 3'
Assay 15: LOC648987
PCR cycling and HRM protocol for the LOC648987 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 60°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 126 bp.
Genomic Location Hg38: Chr5: 43040396-43040521
SEQ ID NO: 15 :
TTTAGCAGCACCCTTTACGGCGCCAAAACGGATATTTGTTTGGCAATACCAGCGCTATCC I I I I I : I I : I : : : I I I I ++ I ++ : : I I I I ++ I I I I I I I I I I I I I : I I I I : : I I ++ : I I I : +
TTTAGTAGTATTTTTTACGGCGTTAAAACGGATATTTGTTTGGTAATATTAGCGTTATTC
GCTAGGTGCCGGCGCTTGCTAAGTTCAACGCGCCAGTTTCTCGTTTGCAAGGTGGTTAGG
+ : I I I I I I : ++ ! ++ : I I I : I I I I I I : I I ++++ : : I I I I I : I ++ I I I I : I I I I I I I I I I I I
GTTAGGTGTCGGCGTTTGTTAAGTTTAACGCGTTAGTTTTTCGTTTGTAAGGTGGTTAGG
GCAGAGCCCTAGCAGACAGTTTTCCGGTGGCAGCAACGCTCATTTCCCGGAAACGGGTGG
I : I I I I : : : I I I : I I I : I I I I I I : ++ l I I I : I I : I I ++ : I : I I I I : : ++ l I I I ++ I I I I I
GTAGAGTTTTAGTAGATAGTTTTTCGGTGGTAGTAACGTTTATTTTTCGGAAACGGGTGG ( SEQ ID NO: 33)
MS-HRM Primers:
SEQ I D NO: 65: LOC648987 F: 5' - ACG GAT ATT TGT TTG GTA ATA TTA G - 3' SEQ I D NO: 66: LOC648987 R: 5' - CTA CCA CCG AAA AAC TAT CTA C - 3'
Assay 16: HIST1 H2AJ/HIST1 H2BM
PCR cycling and HRM protocol for the HIST1 H2AJ/HIST1 H2BM MS-HRM assay; 1 cycle of 95°C for 1 0 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 58°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 156 bp.
Genomic Location Hg38: Chr6: 27814577-27814732
SEQ ID NO: 16:
TGCCATGTCTGGGCGTGGTAAGCAGGGAGGCAAAGCTCGCGCCAAGGCCAAGACCCGCTC I I : : I I I I : I I I I ++ I I I I I I I : I I I I I I I : I I I I : I ++++ : : I I I I : : I I I I : : ++ : I :
TGTTATGTTTGGGCGTGGTAAGTAGGGAGGTAAAGTTCGCGTTAAGGTTAAGATTCGTTT
TTCTCGGGCCGGGCTTCAGTTTCCCGTAGGCCGAGTGCATCGCCTGCTCCGCAAAGGCAA I I : I ++ I I : ++ I I : I I : I I I I I : : ++ I I I I : ++ I I I I : I I ++ : : I I : I : ++ : I I I I I : I I
TTTTCGGGTCGGGTTTTAGTTTTTCGTAGGTCGAGTGTATCGTTTGTTTCGTAAAGGTAA
CTATGCGGAGCGGGTCGGTGCTGGAGCGCCGGTGTACCTGGCGGCGGTGCTGGAGTACCT
: I I I I ++ I I I ++ I I I ++ I I I : I I I I I ++ : ++ ! I I I I : : I I I ++ I ++ I I I : I I I I I I I : : I
TTATGCGGAGCGGGTCGGTGTTGGAGCGTCGGTGTATTTGGCGGCGGTGTTGGAGTATTT ( SEQ ID NO: 34)
MS-HRM Primers:
SEQ I D NO: 67: HIST1 H2AJ/HIST1 H2BM F: 5' - GCG TGG TAA GTA GGG AGG TAA
AG - 3'
SEQ I D NO: 68: HIST1 H2AJ/HIST1 H2BM R: 5' - ACC GAC GCT CCA ACA CC - 3'
Assay 17: HOXA5
PCR cycling and HRM protocol for the HOXA5 MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 54°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 90 bp.
Genomic Location Hg38: Chr7: 27143474-27143563
SEQ ID NO: 17 :
CACACATATCAAAAAACAAATGAGCTCTTATTTTGTAAACTCATTTTGCGGTCGCTATCC
: I : I : I I I I : I I I I I I : I I I I I I I : I : I I I I I I I I I I I I : I : I I I I I I ++ I I ++ : I I I : :
TATATATATTAAAAAATAAATGAGTTTTTATTTTGTAAATTTATTTTGCGGTCGTTATTT
AAATGGCCCGGACTACCAGTTGCATAATTATGGAGATCATAGTTCCGTGAGCGAGCAATT I I I I I I : :++l I : I I : : I I I I I : I I I I I I I I I I I I I I : I I I I I I :++l I I I ++ I I : I I I I
AAATGGTTCGGATTATTAGTTGTATAATTATGGAGATTATAGTTTCGTGAGCGAGTAATT
CAGGGACTCGGCGAGCATGCACTCCGGCAGGTACGGCTACGGCTACAATGGCATGGATCT
: I I I I I : I++I++I I : I I I : I : I :++l : I I I I I++I : I I++I : I I : I I I I I : I I I I I I : I
TAGGGATTCGGCGAGTATGTATTTCGGTAGGTACGGTTACGGTTATAATGGTATGGATTT ( SEQ ID NO: 35)
MS-HRM Primers:
SEQ ID NO: 69: HOXA5 F: 5' - TGG TTC GGA TTA TTA GTT GTA TAA T - 3' SEQ ID NO: 70: HOXA5 R: 5' - TAC CTA CCG AAA TAC ATA CTC - 3'
Assay 18: HIST1 H3E
PCR cycling and HRM protocol for the HIST1H3E MS-HRM assay; 1 cycle of 95°C for 10 minutes. 1 cycle of 95°C for 15 seconds, 40 cycles of 1 minute at 56°C. 95°C for 1 minute, 55°C for 1 minute and a melting phase from 55°C to 95°C with a temperature increase of 0.1 °C/sec and 50 fluorescence acquisition points per °C. 95°C for 1 minute. Amplicon length = 147 bp.
Genomic Location Hg38: Chr6: 26225275-26225421
SEQ ID NO: 18 :
TCCTGCAGCGCCATCACCGCGGAACTCTGGAAGCGCAGGTCGGTCTTGAAGTCCTGAGCT
I : : I I : I I ++ : : I I : I :++++! I I : I : I I I I I I ++ : I I I I ++ I I : I I I I I I I : : I I I I : I
TTTTGTAGCGTTATTATCGCGGAATTTTGGAAGCGTAGGTCGGTTTTGAAGTTTTGAGTT
ATTTCTCGCACCAGGCGCTGAAACGGCAGCTTCCGGATTAGAAGCTCGGTAGACTTCTGG I I I I : I++: I : : I I I ++ : I I I I I ++ I : I I : I I :++l I I I I I I I I : I ++ I I I I I : I I : I I I
ATTTTTCGTATTAGGCGTTGAAACGGTAGTTTTCGGATTAGAAGTTCGGTAGATTTTTGG
TAGCGACGGATCTCGCGCAGAGCCACGGTGCCAGGGCGGTAGCGATGGGGCTTCTTCACG I I I ++ I ++ I I I : I ++++ : I I I I : : I ++ I I I : : I I I I ++ I I I I ++ I I I I I I : I I : I I : I ++
AGCGACGGATTTCGCG AGAGT ACGGTGTTAGGGCGGT GCGATGGGGTTTTTTTACG (SEQ ID
NO: 36)
MS-HRM Primers:
SEQ ID NO: 71 : HIST1 H3E F: 5' - CGC GGA ATT TTG GAA GCG TAG G - 3'
SEQ ID NO: 72: HIST1 H3E R: 5' - CGC TAC CGC CCT AAC ACC - 3'
Claims
Claims
A method of determining non-small cell lung cancer, a predisposition to non- small cell lung cancer, the prognosis of a non-small cell lung cancer, and/or monitoring a non-small cell lung cancer in a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
A method for assessing whether a human subject is likely to develop non-small cell lung cancer, said method comprising
i. providing a sample from said human subject,
ii. determining in said sample the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
iii. on the basis of said methylation status identifying a human subject that is more likely to develop non-small cell lung cancer.
The method according to any of the preceding claims, wherein said method comprises determining the presence of metastatic lung cancer.
The method according to any of the preceding claims, wherein said methylation status is determined in the SIM1 gene locus and at least one additional gene locus selected from the group consisting of Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
The method according to any of the preceding claims, wherein said methylation status is determined in the Chr6(p22.1 ) gene locus and at least one additional gene locus selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
Chr1 (q21 .1 ).
6. The method according to any of the preceding claims, wherein said methylation status is determined in the HIST1 H3G/HIST1 H2BI gene locus and at least one additional gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
Chr1 (q21 .1 ).
7. The method according to any of the preceding claims, wherein said methylation status is determined in one or more gene loci selected from the group consisting of SIM1 , HIST1 H3G/HIST1 H2BI, Chr6(p22.1 ) and/or
HOXB3/HOXB4.
8. The method according to any of the preceding claims, wherein the methylation status of at least two gene loci are determined, such as at least three genes, such as at least four gene loci.
9. The method according to any of the preceding claims, wherein said sample comprise lung tissue, such as lung cells and/or genetic material of lung cells.
10. The method according to any of the preceding claims, wherein the methylation status of primary lung tumour cells and/or distant metastases are compared with tumour adjacent normal lung tissue cells.
1 1 . The method according to any of the preceding claims, wherein said methylation status is determined by any method selected from the group consisting of
Methylation-Specific PCR (MSP), Whole genome bisulfite sequencing (BS- Seq), HELP assays, ChlP-on-chip assays, Restriction landmark genomic scanning, Methylated DNA immunoprecipitation (MeDIP), Pyrosequencing of bisulfite treated DNA, Molecular break light assays, and Methyl Sensitive Southern Blotting.
12. The method according to any of the preceding claims, wherein methylation
status is determined by methylation specific PCR, bisulfite sequencing, COBRA, melting curve analysis, or DNA methylation arrays.
13. The method according to any of the preceding claims, wherein said methylation status is determined by melting curve analysis, such as high resolution melting
(HMR) analysis.
14. The method according to any of the preceding claims, wherein said methylation status is determined by a method comprising the steps of
i) providing a sample, such as a lung tissue sample, from said subject comprising nucleic acid material comprising said gene locus, ii) modifying said nucleic acid using an agent which cleaves nucleic acid sequences in a methylation-dependent manner, iii) amplifying at least one portion of said gene locus using primers, which span or comprise at least one CpG dinucleotide in said gene locus in order to obtain an amplification product, and iv) analyzing said amplification product.
15. The method according to claim 14, wherein said amplification product is
analysed by detecting the presence or absence of amplification product, wherein the presence of amplification product indicates that the target nucleic acid has not been cleaved by said agent, and wherein the absence of amplification product indicates that the target nucleic acid has been cleaved by said agent.
16. The method according to any of the preceding claims, wherein said methylation status is determined by a method comprising the steps of
i) providing a sample, such as a lung tissue sample, from said subject comprising nucleic acid material comprising said gene locus, ii) modifying said nucleic acid using an agent which modifies
unmethylated cytosine,
iii) amplifying at least one portion of said gene locus using primers, which span or comprise at least one CpG dinucleotide in said gene locus in order to obtain an amplification product, and iv) analyzing said amplification product.
17. The method according to claim 16, wherein said amplification product is
analysed for nucleic acid substitutions resulting from conversion of modified cytosine residues, wherein the presence of converted cytosine residues are indicative of unmethylated cytosine residues, and presence of unconverted cytosine residues is indicative of methylated cytosine residues.
18. The method according to any of the preceding claims, wherein said amplified
CpG-containing nucleic acid is analyzed by melting curve analysis
19. The method according to any of the preceding claims, wherein said
unmethylated cytosine is modified by bisulfite.
20. The method according to any of the preceding claims, wherein said methylation status is determined by amplifying at least one portion of said gene locus using at least one primer pair selected from the group consisting of SEQ ID NOs: 37-
72.
21 . The method according to any of the preceding claims, wherein said methylation status is determined by amplifying at least one portion of said at least one gene locus, and wherein the amplified portion is detected using at least one oligonucleotide probe.
22. The method according to any of the preceding claims, wherein said
oligonucleotide probe hybridizes to a sequence selected from the group consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
23. The method according to any of the preceding claims, wherein said
oligonucleotide probe comprises 10-100 consecutive nucleic acids selected from the group of sequences consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
24. A method for categorizing or predicting the clinical outcome of a non-small cell lung cancer of a subject, said method comprising in a sample from said subject determining the methylation status of at least one gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and
Chr1 (q21 .1 ).
25. The method according to claim 24, wherein the presence of methylation is indicative of decreased overall survival, different stage cancer.
26. A method of evaluating the risk for a subject of contracting cancer, said method comprising in a sample from said subject determining the methylation status of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987,
HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chi (q21 .1 ).
27. A method of treating a non-small cell lung cancer in a human subject, said method comprising the steps of
i. determining non-small cell lung cancer, a predisposition to non-small cell lung cancer, or the prognosis of a non-small cell lung cancer in a subject by a method as defined in any of the preceding claims,
ii. selecting human subjects having non-small cell lung cancer, a
predisposition to non-small cell lung cancer, or a negative or positive prognosis of a non-small cell lung cancer,
iii. subjecting said subjects identified in step ii. to a suitable treatment for non-small cell lung cancer.
28. The method according to claim 27, wherein said treatment is surgery,
chemotherapy and/or radiotherapy.
29. A kit for determining non-small cell lung cancer, predisposition to non-small cell lung cancer, or categorizing or predicting the clinical outcome of a non-small cell lung cancer, or monitoring the treatment of a non-small cell lung cancer, said kit comprising
i. an agent that (a) modifies methylated cytosine residues but not non- methylated cytosine residues; or (b) modifies non-methylated cytosine residues but not methylated cytosine residues; or (c) modifies a nucleic acid sequence in a methylation-dependent manner,
ii. and at least one pair of oligonucleotide primers that specifically hybridizes under amplification conditions to a region of a gene locus selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI,
HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E,
HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ).
30. The kit according to claim 29, wherein said at least one primer pair selected from the group consisting of SEQ ID NO: 37/38 to 71/72, for example at least one primer pair identified as SEQ ID NO: SEQ ID NO: 51 and 52 for SIM1 .
31 . The kit according to any of claims 27 to 30, wherein said kit comprise at least one oligonucleotide probe comprising 10-100 consecutive nucleic acids selected from the group of sequences consisting SEQ ID NO: 1 -18 and/or the
complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
32. The kit according to any of claims 27 to 31 , wherein said kit comprise at least one oligonucleotide probe which hybridizes to a sequence selected from the group consisting of SEQ ID NO: 1 -18 and/or the complement thereof (non- modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
33. The kit according to any of claims 27 to 32, said kit further comprising a DNA polymerase.
34. The kit according to any of claims 27 to 33, wherein said agent is a bisulfite, hydrogen sulfite, and/or disulfite reagent, for example sodium bisulfite.
35. The kit according to any of claims 27 to 33, said kit further comprising a
methylation-sensitive restriction enzyme.
36. Use of oligonucleotide primers comprising a sequence, which is a subsequence of a gene loci selected from the group consisting of SIM1 , Chr6(p22.1 ),
HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2, LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ) or the complement thereof for diagnosing non-small cell lung cancer in a method of any of the preceding claims.
37. The use according to any of the preceding claims, wherein said oligonucleotide primers is selected from the group consisting of SEQ ID NO: 37-72 and/or oligonucleotide primers comprising 10-100 consecutive nucleic acids selected from the group of sequences consisting SEQ ID NO: 1 -18 and/or the complement thereof (non-modified strand) or the group consisting of SEQ ID NO: 19-36 and/or the complement thereof (modified strand).
38. The use according to any of the preceding claims, wherein said gene locus is selected from the group consisting of SIM1 , HOXB3/HOXB4 and OSR1 .
39. A method of identifying therapeutically effective agents for treatment of non- small cell lung cancer, said method comprising
i. providing a non-small cell lung cancer cell line comprising one or more genetic loci selected from the group consisting of SIM1 , Chr6(p22.1 ), HIST1 H3G/HIST1 H2BI, HOXB3/HOXB4, OSR1 , GHSR, OTX2,
LOC648987, HIST1 H3E, HIST1 H2AJ/HIST1 H2BM, HOXD10, HOXD3, HOXA3, HOXA5 and Chr1 (q21 .1 ),
ii. providing one or more potential therapeutic agents,
iii. treating said non-small cell lung cancer cells by bringing said agents in contact with said non-small cell lung cancer cells,
iv. determining methylation status of said one or more genetic loci v. comparing said methylation status of said treated non-small cell lung cancer cells with the methylation status of said non-small cell lung cancer cells, when untreated, wherein a decreased level of methylation positive alleles is indicative of a therapeutic agent.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DKPA201670811 | 2016-10-14 | ||
DKPA201670811 | 2016-10-14 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2018069450A1 true WO2018069450A1 (en) | 2018-04-19 |
Family
ID=60201522
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2017/076076 WO2018069450A1 (en) | 2016-10-14 | 2017-10-12 | Methylation biomarkers for lung cancer |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2018069450A1 (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2710065C1 (en) * | 2018-12-14 | 2019-12-24 | Федеральное государственное бюджетное учреждение "Федеральный центр охраны здоровья животных" (ФГБУ "ВНИИЗЖ") | Synthetic oligonucleotide primers and method for detecting dna of asf virus by loop isothermal amplification |
CN111363811A (en) * | 2018-12-25 | 2020-07-03 | 广州市康立明生物科技有限责任公司 | Lung cancer diagnostic agent and kit based on FOXD3 gene |
CN113637745A (en) * | 2020-04-27 | 2021-11-12 | 广州市基准医疗有限责任公司 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
CN113637746A (en) * | 2020-04-27 | 2021-11-12 | 广州市基准医疗有限责任公司 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
WO2021243905A1 (en) * | 2020-06-03 | 2021-12-09 | 广州康立明生物科技股份有限公司 | Lung cancer detection reagent and kit |
EP3945135A1 (en) * | 2020-07-27 | 2022-02-02 | Les Laboratoires Servier | Biomarkers for diagnosing and monitoring lung cancer |
CN114645043A (en) * | 2020-12-17 | 2022-06-21 | 广州市基准医疗有限责任公司 | Methylation molecular marker combination and application for detecting benign and malignant pulmonary nodules |
CN114941029A (en) * | 2022-03-28 | 2022-08-26 | 武汉艾米森生命科技有限公司 | Biomarkers, nucleic acid products and kits for liver cancer |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012023648A1 (en) * | 2010-08-20 | 2012-02-23 | 국립암센터 | Diagnostic composition for non-small-cell lung cancer comprising a preparation for measuring the hoxa11 gene methylation level, and a diagnostic method for non-small-cell lung cancer using the same |
US20140271455A1 (en) * | 2013-03-14 | 2014-09-18 | City Of Hope | Dna methylation biomarkers for small cell lung cancer |
WO2016083360A1 (en) * | 2014-11-25 | 2016-06-02 | Ait Austrian Institute Of Technology Gmbh | Diagnosis of lung cancer |
-
2017
- 2017-10-12 WO PCT/EP2017/076076 patent/WO2018069450A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012023648A1 (en) * | 2010-08-20 | 2012-02-23 | 국립암센터 | Diagnostic composition for non-small-cell lung cancer comprising a preparation for measuring the hoxa11 gene methylation level, and a diagnostic method for non-small-cell lung cancer using the same |
US20140271455A1 (en) * | 2013-03-14 | 2014-09-18 | City Of Hope | Dna methylation biomarkers for small cell lung cancer |
WO2016083360A1 (en) * | 2014-11-25 | 2016-06-02 | Ait Austrian Institute Of Technology Gmbh | Diagnosis of lung cancer |
Non-Patent Citations (7)
Title |
---|
BOCK ET AL., NATURE BIOTECH., vol. 34, no. 7, 2016 |
CORTESE R ET AL: "Correlative gene expression and DNA methylation profiling in lung development nominate new biomarkers in lung cancer", INTERNATIONAL JOURNAL OF BIOCHEMISTRY AND CELL BIOLOGY, PERGAMON, GB, vol. 40, no. 8, 1 January 2008 (2008-01-01), pages 1494 - 1508, XP022655992, ISSN: 1357-2725, [retrieved on 20071227], DOI: 10.1016/J.BIOCEL.2007.11.018 * |
IBEN DAUGAARD ET AL: "Identification and validation of candidate epigenetic biomarkers in lung adenocarcinoma", SCIENTIFIC REPORTS, vol. 6, no. 1, 26 October 2016 (2016-10-26), XP055438917, DOI: 10.1038/srep35807 * |
JI WOONG SON ET AL: "Genome-wide combination profiling of DNA copy number and methylation for deciphering biomarkers in non-small cell lung cancer patients", CANCER LETTERS, vol. 311, no. 1, 26 June 2011 (2011-06-26), pages 29 - 37, XP028278511, ISSN: 0304-3835, [retrieved on 20110624], DOI: 10.1016/J.CANLET.2011.06.021 * |
MANIATIS ET AL.: "Molecular Cloning: A Laboratory Manual", 1982, COLD SPRING HARBOR, pages: 280 - 281 |
RAUCH TIBOR A; WANG ZUNDE; WU XIWEI; KERNSTINE KEMP H; RIGGS ARTHUR D; PFEIFER GERD P: "DNA methylation biomarkers for lung cancer", TUMOR BIOLOGY, vol. 33, no. 2, 1 April 2012 (2012-04-01) - 6 December 2011 (2011-12-06), pages 287 - 296, XP009502625, DOI: 10.1007/s13277-011-0282-2 * |
SYEDMORADI ET AL.: "Royal Soc of Chem.", 2016 |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2710065C1 (en) * | 2018-12-14 | 2019-12-24 | Федеральное государственное бюджетное учреждение "Федеральный центр охраны здоровья животных" (ФГБУ "ВНИИЗЖ") | Synthetic oligonucleotide primers and method for detecting dna of asf virus by loop isothermal amplification |
CN111363811A (en) * | 2018-12-25 | 2020-07-03 | 广州市康立明生物科技有限责任公司 | Lung cancer diagnostic agent and kit based on FOXD3 gene |
CN111363811B (en) * | 2018-12-25 | 2024-02-02 | 广州康立明生物科技股份有限公司 | Lung cancer diagnostic agent and kit based on FOXD3 gene |
CN113637745A (en) * | 2020-04-27 | 2021-11-12 | 广州市基准医疗有限责任公司 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
CN113637746A (en) * | 2020-04-27 | 2021-11-12 | 广州市基准医疗有限责任公司 | Methylated molecular markers for detecting benign and malignant lung nodules or combination and application thereof |
WO2021243905A1 (en) * | 2020-06-03 | 2021-12-09 | 广州康立明生物科技股份有限公司 | Lung cancer detection reagent and kit |
EP3945135A1 (en) * | 2020-07-27 | 2022-02-02 | Les Laboratoires Servier | Biomarkers for diagnosing and monitoring lung cancer |
WO2022023233A1 (en) * | 2020-07-27 | 2022-02-03 | Les Laboratoires Servier | Biomarkers for diagnosing and monitoring lung cancer |
CN114645043A (en) * | 2020-12-17 | 2022-06-21 | 广州市基准医疗有限责任公司 | Methylation molecular marker combination and application for detecting benign and malignant pulmonary nodules |
CN114645043B (en) * | 2020-12-17 | 2022-12-09 | 广州市基准医疗有限责任公司 | Methylated molecular marker combination for detecting benign and malignant lung nodules and application |
CN114941029A (en) * | 2022-03-28 | 2022-08-26 | 武汉艾米森生命科技有限公司 | Biomarkers, nucleic acid products and kits for liver cancer |
CN114941029B (en) * | 2022-03-28 | 2023-08-29 | 武汉艾米森生命科技有限公司 | Biomarker, nucleic acid product and kit for liver cancer |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102223014B1 (en) | Method for detecting precancerous lesions | |
WO2018069450A1 (en) | Methylation biomarkers for lung cancer | |
WO2018087129A1 (en) | Colorectal cancer methylation markers | |
US20160251730A1 (en) | Detection of bisulfite converted nucleotide sequences | |
JP2016500521A (en) | Gastric polyp and gastric cancer detection method using gastric polyp and gastric cancer-specific methylation marker gene | |
CN111670254B (en) | Improved detection of microsatellite instability | |
US20180163263A1 (en) | High-sensitivity method for detecting target nucleic acid | |
US9315870B2 (en) | Method for detecting methylation of colorectal cancer specific methylation marker gene for colorectal cancer diagnosis | |
US20090186360A1 (en) | Detection of GSTP1 hypermethylation in prostate cancer | |
JP2023524067A (en) | Methods for identification and relative quantification of nucleic acid sequence, mutation, copy number or methylation changes using nuclease, ligation, deamination, DNA repair and polymerase reactions in combination with carryover prevention and marker | |
US9752197B2 (en) | Method for detecting methylation of colorectal cancer specific methylation marker gene for colorectal cancer diagnosis | |
US20150299809A1 (en) | Biomarkers for Clinical Cancer Management | |
JP2016506755A (en) | Methods and kits for identifying and adjusting bias in sequencing polynucleotide samples | |
WO2024056008A1 (en) | Methylation marker for identifying cancer and use thereof | |
CN113493835A (en) | Method and kit for screening large intestine tumor by detecting methylation state of BCAN gene region | |
US20090176655A1 (en) | Methylation detection | |
CN104685070B (en) | The method for detecting BRAF and PI3K mutation | |
US8377657B1 (en) | Primers for analyzing methylated sequences and methods of use thereof | |
WO2022170984A1 (en) | Screening, risk assessment, and prognosis method and kit for advanced colorectal adenomas | |
KR101145406B1 (en) | Method for Detecting Methylation of Colorectal Cancer Specific Methylation Marker Gene for Colorectal Cancer Diagnosis | |
CN105441558A (en) | Primer probe system for MGMT (O<6>-methylguanine-DNA methyhransferase) gene methylation detection and kit adopting primer probe system | |
US20130309667A1 (en) | Primers for analyzing methylated sequences and methods of use thereof | |
WO2014134548A2 (en) | Assay, methods and compositions for diagnosing cancer | |
CN105441554A (en) | Primer probe system for SEPT9 (septin-9) gene methylation detection and kit adopting primer probe system | |
US20080213781A1 (en) | Methods of detecting methylation patterns within a CpG island |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 17791972 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 17791972 Country of ref document: EP Kind code of ref document: A1 |