JP6161607B2

JP6161607B2 - How to determine the presence or absence of different aneuploidies in a sample

Info

Publication number: JP6161607B2
Application number: JP2014522796A
Authority: JP
Inventors: ピーラバリチャード; エーコムストックデイビッド; ケーリースブライアン
Original assignee: ベリナタヘルスインコーポレイテッド
Priority date: 2011-07-26
Filing date: 2011-07-26
Publication date: 2017-07-12
Anticipated expiration: 2031-07-26
Also published as: GB2485635B; GB201114713D0; CN103003447B; HK1174063A1; CN103003447A; KR20140050032A; GB2485635A; EP2563937A1; CA2840418A1; KR101974492B1; AU2011373694A1; WO2013015793A1; JP2014521334A; CA2840418C

Description

本発明は、概して診断の分野に関し、また異なるゲノムに由来する核酸混合物における核酸配列の総量の変動（多型）を決定する方法を提供する。とくに、この方法は、非侵襲性出生前診断、並びにがん患者の転移性進行の診断及びモニタリングに適用できる。 The present invention relates generally to the field of diagnostics and provides a method for determining variations (polymorphisms) in the total amount of nucleic acid sequences in a mixture of nucleic acids derived from different genomes. In particular, this method is applicable to non-invasive prenatal diagnosis and diagnosis and monitoring of metastatic progression in cancer patients.

人間医学研究における主な努力の１つは、健康上の悪影響の元をなす遺伝的異常の発見である。多くの場合、特定遺伝子及び／又は重要診断マーカーは、ゲノムの異常コピー数が存在する部分で識別されてきた。例えば、出生前診断において、染色体全体における過剰又は消失コピーはしばしば現れる病変である。がんにおいて、染色体全体若しくは染色体断片（セグメント）におけるコピーの欠失若しくは重複、及びゲノムの特定領域における高レベルの増幅は共通して現れる。 One of the major efforts in human medical research is the discovery of genetic abnormalities that cause adverse health effects. In many cases, specific genes and / or important diagnostic markers have been identified in portions where there are abnormal copy numbers of the genome. For example, in prenatal diagnosis, excess or missing copies across the chromosome are often appearing lesions. In cancer, copy deletions or duplications in whole chromosomes or chromosome fragments (segments) and high levels of amplification in specific regions of the genome appear in common.

コピー数多型（変動）に関する多くの情報は、構造異常の認識を可能にする細胞遺伝学的解明によって得てきた。遺伝学的スクリーニング及び生物学的線量測定のための普通の手順は、核型解析用に細胞を採取するのに侵襲的手順、例えば、羊水穿刺を使用してきた。細胞培養を必要としない、より迅速なテストの必要性を認識して、蛍光その場ハイブリダイゼーション（ハイブリッド形成法）［ＦＩＳＨ：fluorescence in situ hybridization］、定量的蛍光ＰＣＲ（ＱＦ−ＰＣＲ）及びアレイ−比較ゲノムハイブリダイゼーション（ＣＧＨ：Comparative Genomic Hybridization）が、コピー数多型を解析するための分子細胞遺伝学的解析法として開発されている。 Much information on copy number variation (variation) has been obtained by cytogenetic elucidation that allows recognition of structural abnormalities. Common procedures for genetic screening and biological dosimetry have used invasive procedures, such as amniocentesis, to collect cells for karyotyping. Recognizing the need for faster tests that do not require cell culture, fluorescent in situ hybridization (FISH), quantitative fluorescent PCR (QF-PCR) and arrays Comparative genomic hybridization (CGH) has been developed as a molecular cytogenetic analysis method for analyzing copy number variation.

ゲノム全体を比較的短時間にシークエンシング（配列決定）を可能にする技術の出現、及び循環細胞フリー（circulating cell-free）ＤＮＡ（ｃｆＤＮＡ）は、侵襲性サンプル採取方法に関連する危険性なく、比較すべきある１つの染色体に由来する遺伝子材料を他の遺伝子材料の染色体と比較する機会を提供してきた。しかし、ｃｆＤＮＡの限界のあるレベルからステミング処理する不十分な感度、及びゲノム情報における固有の性質からステミング処理する技術のシークエンシングバイアスという限界を含む、既存の方法の限界が、特異度、感度及び利用可能性のうちいずれか、又はすべてを、種々の臨床背景における信頼性高いコピー数多型診断に与える、非侵襲性方法に対する永続的な要望の根底にある。 The advent of technology that allows sequencing of the entire genome in a relatively short time, and circulating cell-free DNA (cfDNA), without the risks associated with invasive sampling methods, It has provided an opportunity to compare genetic material from one chromosome to be compared with chromosomes from other genetic material. However, the limitations of existing methods, including the limited sensitivity of stemming from a limited level of cfDNA and the sequencing bias of the technique of stemming due to the inherent nature of genomic information, Underlying the persistent desire for non-invasive methods that provide any or all of the availability for reliable copy number polymorphism diagnosis in various clinical contexts.

本発明は、上述の要望のうち若干を満たし、またとくに、少なくとも非侵襲性出生前診断に、並びにがん患者の転移性進行の診断及びモニタリングに適用できる信頼性高い方法を提供する利点をもたらす。 The present invention fulfills some of the above needs and provides the advantage of providing a reliable method that can be applied, at least, to non-invasive prenatal diagnosis and to the diagnosis and monitoring of metastatic progression in cancer patients. .

本発明は、１つ又はそれ以上の関心対象である配列の総量が既知である、又はその配列の総量に違いがあると懸念される核酸混合物を有するテストサンプルにおける、関心対象である配列のコピー数多型（ＣＮＶ：copy number variations）を決定する方法を提供する。この方法は、処理に関連する染色体間変動及びシークエンシング間変動からの見越し変動ステミング処理を担う統計学的アプローチをなす。この方法は、任意の胎児異数性におけるＣＮＶ、及び種々の内科的疾患に関連すると既知である若しくは懸念されるＣＮＶを決定するのに適用可能である。本発明方法によって決定できるＣＮＶには、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１つ又はそれ以上におけるトリソミー及びモノソミー、他の染色体ポリソミー、及び染色体のうち１つ又はそれ以上における断片（セグメント）の欠失及び／又は重複が含まれ、こらはテストサンプルの核酸を１回だけシークエンシングすることによって決定することができる。いかなる異数性もテストサンプルの核酸を１回だけシークエンシングすることによって得られる配列情報から決定することができる。 The present invention relates to a copy of a sequence of interest in a test sample having a nucleic acid mixture in which the total amount of one or more sequences of interest is known or concerned that there is a difference in the total amount of the sequences. A method for determining copy number variations (CNV) is provided. This method represents a statistical approach that is responsible for the accrual variation stemming process from interchromosomal and inter-sequence variation associated with the process. This method is applicable to determine CNVs in any fetal aneuploidy, and CNVs known or concerned about various medical disorders. CNVs that can be determined by the method of the present invention include trisomy and monosomy in any one or more of chromosomes 1-22, X and Y, other chromosomal polysomy, and one or more of the chromosomes. Fragment deletions and / or duplications are included, which can be determined by sequencing the nucleic acid of the test sample only once. Any aneuploidy can be determined from the sequence information obtained by sequencing the nucleic acid of the test sample only once.

一実施形態において、本発明は、胎児及び母体の核酸を含む母体テストサンプルにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。本発明方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の４つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の４つ又はそれ以上の関心対象染色体それぞれの正規化染色体配列の配列タグ数を同定するステップと、(ｃ)前記任意の４つ又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化染色体配列に対して同定した前記配列タグ数を使用して、前記任意の４つ又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の４つ又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の４つ又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有する。ステップ(ａ)は、テストサンプルにおける核酸分子の少なくとも一部分をシークエンシングし、テストサンプルにおける胎児及び母体の前記核酸分子に関する配列情報を得る。若干の実施形態において、ステップ(ｃ)は、前記関心対象染色体それぞれの単独染色体ドースを、前記関心対象染色体それぞれに対して同定した前記配列タグ数と、前記関心対象染色体それぞれの前記正規化染色体配列に対して同定した前記配列タグ数との比として計算するステップを有する。若干の実施形態において、前記ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化染色体配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化染色体それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化染色体配列における前記配列タグ密度比との比として計算するステップを有する。 In one embodiment, the present invention provides a method for determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method of the present invention comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) using the sequence information from chromosomes 1 to 22, X chromosome and Y chromosome. Identifying the number of sequence tags for each of any four or more selected chromosomes of interest and identifying the number of sequence tags for each of the four or more selected chromosomes of interest of normalized chromosome sequences; (C) using the sequence tag number identified for each of the any four or more chromosomes of interest and the sequence tag number identified for the normalized chromosome sequence, Calculating a single chromosome dose for each of the four or more chromosomes of interest; and (d) the single stain of each of the four or more of the chromosomes of interest. Each somatic dose is compared to a threshold value for each of the any four or more chromosomes of interest, thereby determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in the maternal test sample. Determining. Step (a) sequences at least a portion of the nucleic acid molecules in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample. In some embodiments, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each of the chromosomes of interest, and the normalized chromosomal sequence of each of the chromosomes of interest. And calculating as a ratio to the number of sequence tags identified for. In some embodiments, step (c) comprises (i) associating the number of sequence tags identified for each of the chromosomes of interest in step (b) with the length of each of the chromosomes of interest. Calculating a sequence tag density ratio for each chromosome of interest; (ii) associating the number of sequence tags identified for the normalized chromosome sequence in step (b) with the length of each normalized chromosome Calculating the sequence tag density ratio of each normalized chromosome, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii), respectively, Calculating the single-chromosome dose of the sequence tag density ratio of the chromosome of interest and the chromosome of interest respectively. Calculating as a ratio of the sequence tag density ratio in the normalized chromosomal sequence.

他の実施形態において、胎児及び母体の核酸を含む母体テストサンプルにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。本発明方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の４つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の４つ又はそれ以上の関心対象染色体それぞれの正規化染色体配列の配列タグ数を同定するステップと、(ｃ)前記任意の４つ又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化染色体配列に対して同定した前記配列タグ数を使用して、前記任意の４つ又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の４つ又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の４つ又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有し、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した前記任意の４つ又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した少なくとも２０の染色体を含むものとし、少なくとも２０の異なった完全胎児染色体異数性の有無を決定する。前記ステップ(ａ)は、前記テストサンプルにおける前記核酸分子の少なくとも一部分をシークエンシングし、前記テストサンプルにおける前記胎児及び母体の前記核酸分子に関する配列情報を得る。若干の実施形態において、前記ステップ(ｃ)は、前記関心対象染色体それぞれの単独染色体ドースを、前記関心対象染色体それぞれに対して同定した前記配列タグ数と、前記関心対象染色体それぞれの前記正規化染色体配列に対して同定した前記配列タグ数との比として計算するステップを有する。若干の他の実施形態において、ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化染色体配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化染色体それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化染色体配列における前記配列タグ密度比との比として計算するステップを有する。 In another embodiment, a method is provided for determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method of the present invention comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) using the sequence information from chromosomes 1 to 22, X chromosome and Y chromosome. Identifying the number of sequence tags for each of any four or more selected chromosomes of interest and identifying the number of sequence tags for each of the four or more selected chromosomes of interest of normalized chromosome sequences; (C) using the sequence tag number identified for each of the any four or more chromosomes of interest and the sequence tag number identified for the normalized chromosome sequence, Calculating a single chromosome dose for each of the four or more chromosomes of interest; and (d) the single stain of each of the four or more of the chromosomes of interest. Each somatic dose is compared to a threshold value for each of the any four or more chromosomes of interest, thereby determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in the maternal test sample. The arbitrary four or more chromosomes of interest selected from chromosomes 1-22, X, and Y were selected from chromosomes 1-22, X, and Y. At least 20 chromosomes shall be included and the presence or absence of at least 20 different complete fetal chromosomal aneuploidies is determined. The step (a) sequences at least a portion of the nucleic acid molecule in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample. In some embodiments, the step (c) comprises the step of identifying a single chromosomal dose of each chromosome of interest, the number of sequence tags identified for each of the chromosomes of interest, and the normalized chromosome of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the sequence. In some other embodiments, step (c) comprises (i) associating the number of sequence tags identified for each of the chromosomes of interest in step (b) with the length of each of the chromosomes of interest. (Ii) calculating the sequence tag density ratio of each chromosome of interest by (ii) the number of the sequence tags identified for the normalized chromosome sequence in step (b) to the length of each normalized chromosome Calculating the sequence tag density ratio of each of the normalized chromosomes by associating, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii), Calculating each single chromosomal dose, the chromosomal dose comprising the sequence tag density ratio in the chromosome of interest and each of the chromosomes of interest. Calculating as a ratio of the sequence tag density ratio in the normalized chromosomal sequence.

他の実施形態において、胎児及び母体の核酸を含む母体テストサンプルにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。本発明方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の４つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の４つ又はそれ以上の関心対象染色体それぞれの正規化染色体配列の配列タグ数を同定するステップと、(ｃ)前記任意の４つ又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化染色体配列に対して同定した前記配列タグ数を使用して、前記任意の４つ又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の４つ又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の４つ又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有し、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した前記任意の４つ又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべてとし、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべての異なった完全胎児染色体異数性の有無を決定する。前記ステップ(ａ)は、前記テストサンプルにおける前記核酸分子の少なくとも一部分をシークエンシングし、前記テストサンプルにおける前記胎児及び母体の前記核酸分子に関する配列情報を得る。若干の実施形態において、前記ステップ(ｃ)は、前記関心対象染色体それぞれの単独染色体ドースを、前記関心対象染色体それぞれに対して同定した前記配列タグ数と、前記関心対象染色体それぞれの前記正規化染色体配列に対して同定した前記配列タグ数との比として計算するステップを有する。他の若干の実施形態において、前記ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化染色体配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化染色体それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化染色体配列における前記配列タグ密度比との比として計算するステップを有する。 In another embodiment, a method is provided for determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method of the present invention comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) using the sequence information from chromosomes 1 to 22, X chromosome and Y chromosome. Identifying the number of sequence tags for each of any four or more selected chromosomes of interest and identifying the number of sequence tags for each of the four or more selected chromosomes of interest of normalized chromosome sequences; (C) using the sequence tag number identified for each of the any four or more chromosomes of interest and the sequence tag number identified for the normalized chromosome sequence, Calculating a single chromosome dose for each of the four or more chromosomes of interest; and (d) the single stain of each of the four or more of the chromosomes of interest. Each somatic dose is compared to a threshold value for each of the any four or more chromosomes of interest, thereby determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in the maternal test sample. And any four or more chromosomes of interest selected from chromosomes 1-22, X and Y are all chromosomes 1-22, X and Y. The presence or absence of all different complete fetal chromosome aneuploidies of chromosomes 1-22, X and Y. The step (a) sequences at least a portion of the nucleic acid molecule in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample. In some embodiments, the step (c) comprises the step of identifying a single chromosomal dose of each chromosome of interest, the number of sequence tags identified for each of the chromosomes of interest, and the normalized chromosome of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the sequence. In some other embodiments, step (c) associates the number of sequence tags identified for each of the chromosomes of interest in (i) step (b) with the length of each of the chromosomes of interest. Calculating a sequence tag density ratio for each of the chromosomes of interest, (ii) determining the number of the sequence tags identified for the normalized chromosome sequence in step (b) by the length of each normalized chromosome Calculating a sequence tag density ratio for each of the normalized chromosomes, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii), Calculating a single chromosome dose for each chromosome, the chromosome dose comprising the sequence tag density ratio in the chromosome of interest and the chromosome of interest Calculating as a ratio of each normalized chromosomal sequence to the sequence tag density ratio.

上述のいずれかの実施形態において、前記正規化染色体配列は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した単独染色体とする。代案として、前記正規化染色体配列は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した染色体グループとする。 In any of the above-described embodiments, the normalized chromosome sequence is a single chromosome selected from chromosomes 1 to 22, the X chromosome, and the Y chromosome. As an alternative, the normalized chromosomal sequence is a chromosome group selected from chromosomes 1-22 and X and Y chromosomes.

他の実施形態において、胎児及び母体の核酸を含む母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。この方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の１つ又はそれ以上の関心対象染色体それぞれの正規化断片配列の配列タグ数を同定するステップと、(ｃ)前記任意の１つ又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化断片配列に対して同定した前記配列タグ数を使用して、前記任意の１つ又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の１つ又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の１つ又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有する。前記ステップ(ａ)は、前記テストサンプルにおける前記核酸分子の少なくとも一部分をシークエンシングし、前記テストサンプルにおける前記胎児及び母体の前記核酸分子に関する配列情報を得る。若干の実施形態において、前記ステップ(ｃ)は、前記関心対象染色体それぞれの単独染色体ドースを、前記関心対象染色体それぞれに対して同定した前記配列タグ数と、前記関心対象染色体それぞれの前記正規化断片配列に対して同定した前記配列タグ数との比として計算するステップを有する。前記ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化断片配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化断片配列それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化断片配列における前記配列タグ密度比との比として計算するステップを有する。 In another embodiment, a method is provided for determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) selecting from chromosomes 1-22, X chromosome and Y chromosome using the sequence information. Identifying the number of sequence tags for each of any one or more of the chromosomes of interest and identifying the number of sequence tags of each of the normalized fragment sequences of each of the one or more of the chromosomes of interest; (c) using the number of sequence tags identified for each of the one or more chromosomes of interest and the number of sequence tags identified for the normalized fragment sequence, Calculating a single chromosomal dose for each of the one or more chromosomes of interest; and (d) the single chromosomal dose of each of the any one or more chromosomes of interest. Is compared to the threshold of each of the one or more chromosomes of interest to determine the presence or absence of any one or more different complete fetal chromosomal aneuploidies in the maternal test sample. A step of performing. The step (a) sequences at least a portion of the nucleic acid molecule in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample. In some embodiments, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each of the chromosomes of interest, and the normalized fragment of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the sequence. The step (c) includes (i) associating the number of the sequence tags identified for each of the chromosomes of interest in the step (b) with the length of each of the chromosomes of interest, thereby Calculating the sequence tag density ratio of: (ii) correlating the number of sequence tags identified for the normalized fragment sequence in step (b) with the length of each normalized chromosome, Calculating a sequence tag density ratio for each of the fragment fragments, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii) to calculate a single chromosome dose for each of the chromosomes of interest. And calculating the chromosome dose in the sequence tag density ratio in the chromosome of interest and the normalized fragment sequence of each of the chromosomes of interest. And calculating the ratio to the sequence tag density ratio.

他の実施形態において、胎児及び母体の核酸を含む母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。この方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の１つ又はそれ以上の関心対象染色体それぞれの正規化断片配列の配列タグ数を同定するステップと、(ｃ)前記任意の１つ又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化断片配列に対して同定した前記配列タグ数を使用して、前記任意の１つ又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の１つ又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の１つ又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有し、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した少なくとも２０の染色体を含むものとし、少なくとも２０の異なった完全胎児染色体異数性の有無を決定する。前記ステップ(ａ)は、前記テストサンプルにおける前記核酸分子の少なくとも一部分をシークエンシングし、前記テストサンプルにおける前記胎児及び母体の前記核酸分子に関する配列情報を得る。若干の実施形態において、前記ステップ(ｃ)は、前記関心対象染色体それぞれの単独染色体ドースを、前記関心対象染色体それぞれに対して同定した前記配列タグ数と、前記関心対象染色体それぞれの前記正規化断片配列に対して同定した前記配列タグ数との比として計算するステップを有する。若干の他の実施形態において、前記ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化断片配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化断片配列それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化断片配列における前記配列タグ密度比との比として計算するステップを有する。 In another embodiment, a method is provided for determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) selecting from chromosomes 1-22, X chromosome and Y chromosome using the sequence information. Identifying the number of sequence tags for each of any one or more of the chromosomes of interest and identifying the number of sequence tags of each of the normalized fragment sequences of each of the one or more of the chromosomes of interest; (c) using the number of sequence tags identified for each of the one or more chromosomes of interest and the number of sequence tags identified for the normalized fragment sequence, Calculating a single chromosomal dose for each of the one or more chromosomes of interest; and (d) the single chromosomal dose of each of the any one or more chromosomes of interest. Is compared to the threshold of each of the one or more chromosomes of interest to determine the presence or absence of any one or more different complete fetal chromosomal aneuploidies in the maternal test sample. Any one or more chromosomes of interest selected from Chromosomes 1-22, X and Y are at least 20 selected from Chromosomes 1-22, X and Y The presence or absence of at least 20 different complete fetal chromosome aneuploidies is determined. The step (a) sequences at least a portion of the nucleic acid molecule in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample. In some embodiments, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each of the chromosomes of interest, and the normalized fragment of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the sequence. In some other embodiments, step (c) associates the number of sequence tags identified for each of the chromosomes of interest in (i) step (b) with the length of each of the chromosomes of interest. Calculating the sequence tag density ratio of each of the chromosomes of interest, (ii) the number of the sequence tags identified for the normalized fragment sequence in step (b) is the length of each normalized chromosome (Iii) calculating the sequence tag density ratio of each of the normalized fragment sequences, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii), Calculating a single chromosomal dose for each of the target chromosomes, the chromosomal dose comprising the sequence tag density ratio in the chromosome of interest and the chromosome of interest; Calculating as a ratio of each normalized fragment sequence to the sequence tag density ratio.

他の実施形態において、胎児及び母体の核酸を含む母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。この方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の１つ又はそれ以上の関心対象染色体それぞれの正規化断片配列の配列タグ数を同定するステップと、(ｃ)前記任意の１つ又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化断片配列に対して同定した前記配列タグ数を使用して、前記任意の１つ又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の１つ又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の１つ又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有し、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した前記任意の１つ又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべてとし、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべての異なった完全胎児染色体異数性の有無を決定する。前記ステップ(ａ)は、前記テストサンプルにおける前記核酸分子の少なくとも一部分をシークエンシングし、前記テストサンプルにおける前記胎児及び母体の前記核酸分子に関する配列情報を得る。若干の実施形態において、前記ステップ(ｃ)は、前記関心対象染色体それぞれの単独染色体ドースを、前記関心対象染色体それぞれに対して同定した前記配列タグ数と、前記関心対象染色体それぞれの前記正規化断片配列に対して同定した前記配列タグ数との比として計算するステップを有する。他の若干の実施形態において、前記ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化断片配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化断片配列それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化断片配列における前記配列タグ密度比との比として計算するステップを有する。上述のいずれか１つの実施形態において、前記異なった完全胎児染色体異数性は、完全染色体トリソミー、完全染色体モノソミー、及び完全染色体ポリソミーから選択する。例えば、前記異なった完全胎児染色体異数性は、２番トリソミー、８番トリソミー、９番トリソミー、２１番トリソミー、１３番トリソミー、１６番トリソミー、１８番トリソミー、２２番トリソミー、４７,ＸＸＹ、４７,ＸＸＸ、４７,ＸＹＹ、及びＸモノソミーから選択する。 In another embodiment, a method is provided for determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) selecting from chromosomes 1-22, X chromosome and Y chromosome using the sequence information. Identifying the number of sequence tags for each of any one or more of the chromosomes of interest and identifying the number of sequence tags of each of the normalized fragment sequences of each of the one or more of the chromosomes of interest; (c) using the number of sequence tags identified for each of the one or more chromosomes of interest and the number of sequence tags identified for the normalized fragment sequence, Calculating a single chromosomal dose for each of the one or more chromosomes of interest; and (d) the single chromosomal dose of each of the any one or more chromosomes of interest. Is compared to the threshold of each of the one or more chromosomes of interest to determine the presence or absence of any one or more different complete fetal chromosomal aneuploidies in the maternal test sample. And any one or more chromosomes of interest selected from chromosomes 1-22, X and Y are all chromosomes 1-22, X and Y, Determine the presence or absence of all different complete fetal chromosomal aneuploidies of chromosomes 1-22, X and Y. The step (a) sequences at least a portion of the nucleic acid molecule in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample. In some embodiments, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each of the chromosomes of interest, and the normalized fragment of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the sequence. In some other embodiments, step (c) associates the number of sequence tags identified for each of the chromosomes of interest in (i) step (b) with the length of each of the chromosomes of interest. Calculating the sequence tag density ratio of each of the chromosomes of interest, (ii) the number of the sequence tags identified for the normalized fragment sequence in step (b) is the length of each normalized chromosome (Iii) calculating the sequence tag density ratio of each of the normalized fragment sequences, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii), Calculating a single chromosomal dose for each of the target chromosomes, the chromosomal dose comprising the sequence tag density ratio in the chromosome of interest and the chromosome of interest; Calculating as a ratio of each normalized fragment sequence to the sequence tag density ratio. In any one of the above embodiments, the different complete fetal chromosomal aneuploidy is selected from complete chromosomal trisomy, complete chromosomal monosomy, and complete chromosomal polysomy. For example, the different complete fetal chromosomal aneuploidies are trisomy 2, trisomy 8, trisomy 9, trisomy 21, trisomy 13, trisomy 16, trisomy 18, trisomy 22, 47, XXY, 47 , XXX, 47, XYY, and X monosomy.

上述したいずれか１つの実施形態において、ステップ(ａ)〜(ｄ)は、異なる母体検体からのテストサンプルに対して繰り返して行い、前記方法は、前記テストサンプルそれぞれにおける任意の４つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する。上述したいずれか１つの実施形態において、本発明方法は、さらに、正規化染色体値（ＮＣＶ：normalized chromosome value）を計算するＮＣＶ計算ステップであって、前記ＮＣＶは、次式のように、前記染色体ドースを、適格サンプルセットにおける対応の染色体ドースの平均に関連付ける値とし、
ここで、
は、それぞれ適格サンプルセットにおけるｊ番染色体ドースに対する推定した平均及び標準偏差であり、ｘ_ijはテストサンプルｉにおける観測したｊ番染色体ドースである、該ＮＣＶ計算ステップを有する。 In any one of the embodiments described above, steps (a)-(d) are repeated for test samples from different maternal specimens, and the method comprises any four or more in each of the test samples Determine the presence or absence of different complete fetal chromosomal aneuploidies. In any one of the above-described embodiments, the method of the present invention further includes an NCV calculation step of calculating a normalized chromosome value (NCV), wherein the NCV is expressed by the following equation: Let the dose relate to the average of the corresponding chromosomal doses in the eligible sample set,
here,
Are the estimated mean and standard deviation for chromosome jose in the eligible sample set, respectively, and x _ij is the observed NC chromosome dose in test sample i.

他の実施形態において、胎児及び母体の核酸を含む母体テストサンプルにおける異なった部分的胎児染色体異数性の有無を決定する方法を提供する。この方法は、(ａ)前記母体テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体における任意の１つ又はそれ以上の断片それぞれの配列タグ数を同定し、また前記任意の１つ又はそれ以上の関心対象染色体における任意の１つ又はそれ以上の断片それぞれの正規化断片配列の配列タグ数を同定するステップと、(ｃ)前記任意の１つ又はそれ以上の関心対象染色体における任意の１つ又はそれ以上の断片それぞれに対して同定した前記配列タグ数、及び前記正規化断片配列に対して同定した前記配列タグ数を使用して、前記任意の１つ又はそれ以上の関心対象染色体における任意の１つ又はそれ以上の断片それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の１つ又はそれ以上の関心対象染色体における任意の１つ又はそれ以上の断片それぞれの前記単独断片ドースそれぞれを、前記任意の１つ又はそれ以上の関心対象染色体における任意の１つ又はそれ以上の染色体断片それぞれの閾値と比較し、これにより前記母体テストサンプルにおける任意の１つ又はそれ以上の異なった部分的胎児染色体異数性の有無を決定するステップとを有する。前記ステップ(ａ)は、前記テストサンプルにおける前記核酸分子の少なくとも一部分をシークエンシングし、前記テストサンプルにおける前記胎児及び母体の前記核酸分子に関する配列情報を得る。 In another embodiment, a method is provided for determining the presence or absence of different partial fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acids. The method comprises (a) obtaining sequence information relating to fetal and maternal nucleic acids in the maternal test sample, and (b) selecting from chromosomes 1-22, X chromosome and Y chromosome using the sequence information. Identifying the number of sequence tags for each of any one or more fragments in any one or more of the chromosomes of interest, and any one or more of said one or more of the chromosomes of interest Identifying the number of sequence tags of the normalized fragment sequence of each of the further fragments, and (c) identifying for each of any one or more fragments in the any one or more chromosomes of interest Using the sequence tag number and the sequence tag number identified for the normalized fragment sequence, any one in the one or more chromosomes of interest. Calculating a single chromosome dose for each of the one or more fragments, and (d) each of the single fragment doses for each of any one or more fragments in any one or more of the chromosomes of interest. Comparing any one or more chromosomal fragments of each of the one or more chromosomes of interest to a threshold, thereby providing any one or more different partials in the maternal test sample. Determining the presence or absence of fetal chromosomal aneuploidy. The step (a) sequences at least a portion of the nucleic acid molecule in the test sample to obtain sequence information regarding the fetal and maternal nucleic acid molecules in the test sample.

他の実施形態において、前記ステップ(ｃ)は、前記関心対象染色体における任意の１つ又はそれ以上の染色体断片それぞれの単独断片ドースを、前記関心対象染色体における任意の１つ又はそれ以上の染色体断片それぞれに対して同定した前記配列タグ数と、前記関心対象染色体における任意の１つ又はそれ以上の染色体断片それぞれの前記正規化断片配列に対して同定した前記配列タグ数との比として計算するステップを有する。若干の実施形態において、前記ステップ(ｃ)は、(i)前記ステップ(ｂ)で前記関心対象染色体それぞれに対して同定した前記配列タグ数を前記関心対象染色体それぞれの長さに関連付けすることによって、前記関心対象染色体それぞれの配列タグ密度比を計算するステップ、(ii)前記ステップ(ｂ)で前記正規化断片配列に対して同定した前記配列タグ数を前記正規化染色体それぞれの長さに関連付けすることによって、前記正規化断片配列それぞれの配列タグ密度比を計算するステップ、及び(iii)前記ステップ(i)及び(ii)で計算した前記配列タグ密度比を使用して、前記関心対象染色体それぞれの単独染色体ドースを計算するステップであって、前記染色体ドースは、前記関心対象染色体における前記配列タグ密度比と、前記関心対象染色体それぞれの前記正規化断片配列における前記配列タグ密度比との比として計算するステップを有する。この方法は、さらに、正規化断片値（ＮＳＶ：normalized segment value）を計算するＮＳＶ計算ステップであって、前記ＮＳＶは、次式のように、前記断片ドースを、適格サンプルセットにおける対応の断片ドースの平均に関連付ける値とし、
ここで、
は、それぞれ適格サンプルセットにおけるｊ番断片ドースに対する推定した平均及び標準偏差であり、ｘ_ijはテストサンプルｉにおける観測したｊ番断片ドースである、該ＮＳＶ計算ステップを有する。 In another embodiment, the step (c) comprises the step of dividing a single fragment dose of each of any one or more chromosome fragments in the chromosome of interest into any one or more chromosome fragments in the chromosome of interest. Calculating as a ratio between the number of sequence tags identified for each and the number of sequence tags identified for each normalized fragment sequence of any one or more chromosome fragments in the chromosome of interest Have In some embodiments, step (c) comprises (i) associating the number of sequence tags identified for each of the chromosomes of interest in step (b) with the length of each of the chromosomes of interest. Calculating a sequence tag density ratio for each chromosome of interest; (ii) associating the number of sequence tags identified for the normalized fragment sequence in step (b) with the length of each normalized chromosome Calculating the sequence tag density ratio of each of the normalized fragment sequences, and (iii) using the sequence tag density ratio calculated in steps (i) and (ii), Calculating each single chromosomal dose, the chromosomal dose comprising the sequence tag density ratio in the chromosome of interest and each of the chromosomes of interest. Calculating the ratio of the normalized tag sequence to the sequence tag density ratio. The method further includes an NSV calculation step of calculating a normalized segment value (NSV), wherein the NSV determines the fragment dose as a corresponding fragment dose in an eligible sample set as follows: The value associated with the average of
here,
Are the estimated mean and standard deviation for the jth fragment dose in the qualifying sample set, respectively, and x _ij is the observed jth fragment dose in test sample i.

染色体ドース又は断片ドースは正規化断片配列を使用して決定する上述した方法の実施形態において、前記正規化断片配列は、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１つ又はそれ以上における単独断片とする。代案として、前記正規化断片配列は、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１つ又はそれ以上における断片グループとする。 Chromosome dose or fragment dose is determined using a normalized fragment sequence In the above-described method embodiment, the normalized fragment sequence is any one or more of chromosomes 1-22 and X and Y. It is set as the single fragment | piece in the above. As an alternative, the normalized fragment sequence is a fragment group in any one or more of chromosomes 1-22 and X and Y.

部分的胎児染色体異数性の有無を決定する方法のステップ(ａ)〜(ｄ)は、異なる母体検体からのテストサンプルに対して繰り返して行う。本発明方法によって決定できる部分的胎児染色体異数性は、任意の染色体における任意の断片の部分的異数性を含む。部分的胎児染色体異数性は、部分的重複、部分的増殖、部分的挿入及び部分的欠失から選択することができる。本発明方法により決定できる部分異数性の例としては、１番染色体の部分モノソミー、４番染色体の部分モノソミー、５番染色体の部分モノソミー、７番染色体の部分モノソミー、１１番染色体の部分モノソミー、１５番染色体の部分モノソミー、１７番染色体の部分モノソミー、１８番染色体の部分モノソミー、及び２２番染色体の部分モノソミーがある。 Steps (a)-(d) of the method for determining the presence or absence of partial fetal chromosomal aneuploidy are repeated for test samples from different maternal specimens. Partial fetal chromosomal aneuploidy that can be determined by the method of the present invention includes partial aneuploidy of any fragment in any chromosome. Partial fetal chromosomal aneuploidy can be selected from partial duplication, partial growth, partial insertion and partial deletion. Examples of partial aneuploidy that can be determined by the method of the present invention include partial monosomy of chromosome 1, partial monosomy of chromosome 4, partial monosomy of chromosome 5, partial monosomy of chromosome 7, partial monosomy of chromosome 11, There is a partial monosomy of chromosome 15, a partial monosomy of chromosome 17, a partial monosomy of chromosome 18, and a partial monosomy of chromosome 22.

上述した実施形態のうち任意の１つにおいて、前記テストサンプルは、血液、血漿、血清、尿及び唾液のサンプルから選択した母体サンプルとする。任意な１つの実施形態において、テストサンプルは血漿サンプルとすることができる。母体サンプルの核酸分子は、胎児及び母体の無細胞ＤＮＡ分子の混合物とする。核酸のシークエンシングは、次世代シークエンシング（ＮＧＳ）を使用して実施することができる。若干の実施形態において、前記シークエンシングは、可逆色素ターミネーターによるシークエンシング・バイ・シンセシスを使用する。他の実施形態において、前記シークエンシングは、シークエンシング・バイ・リゲーションとする。さらに他の実施形態において、前記シークエンシングは、単独分子シークエンシングとする。随意的に、増幅ステップをシークエンシングの前に行うものとする。 In any one of the embodiments described above, the test sample is a maternal sample selected from blood, plasma, serum, urine and saliva samples. In any one embodiment, the test sample can be a plasma sample. The nucleic acid molecule of the maternal sample is a mixture of fetal and maternal cell-free DNA molecules. Nucleic acid sequencing can be performed using next generation sequencing (NGS). In some embodiments, the sequencing uses sequencing by synthesis with a reversible dye terminator. In another embodiment, the sequencing is sequencing by ligation. In yet another embodiment, the sequencing is a single molecule sequencing. Optionally, the amplification step shall be performed prior to sequencing.

他の実施形態において、本発明は、胎児及び母体の無細胞ＤＮＡ分子の混合物を含む母体血漿テストサンプルにおける任意の２０又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。この方法は、(ａ)前記母体血漿テストサンプルにおける胎児及び母体の無細胞ＤＮＡ分子に関する配列情報を取得するよう、無細胞ＤＮＡ分子の少なくとも一部をシークエンシングするステップと、(ｂ)前記配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の２０又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、また前記任意の２０又はそれ以上の関心対象染色体それぞれの正規化染色体の配列タグ数を同定するステップと、(ｃ)前記任意の２０又はそれ以上の関心対象染色体それぞれに対して同定した前記配列タグ数、及び前記正規化染色体に対して同定した前記配列タグ数を使用して、前記任意の２０又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び(ｄ)前記任意の２０又はそれ以上の関心対象染色体それぞれの前記単独染色体ドースそれぞれを、前記任意の２０又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより前記母体血漿テストサンプルにおける任意の２０又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有する。 In another embodiment, the present invention provides a method for determining the presence or absence of any 20 or more different complete fetal chromosomal aneuploidies in a maternal plasma test sample comprising a mixture of fetal and maternal cell-free DNA molecules. To do. The method comprises (a) sequencing at least a portion of a cell-free DNA molecule to obtain sequence information regarding fetal and maternal cell-free DNA molecules in the maternal plasma test sample; and (b) the sequence information. To identify the number of sequence tags for each of any 20 or more chromosomes of interest selected from chromosomes 1-22 and X and Y chromosomes, and any 20 or more of the chromosomes of interest Identifying the number of sequence tags for each normalized chromosome, and (c) identifying the number of sequence tags identified for each of the 20 or more of the chromosomes of interest, and identifying the normalized chromosomes Calculating a single chromosomal dose for each of the any 20 or more chromosomes of interest using the sequence tag number; and ( d) comparing each single chromosomal dose of each of the any 20 or more chromosomes of interest to a threshold of each of the any of the 20 or more chromosomes of interest, thereby any of the maternal plasma test samples; Determining the presence or absence of 20 or more different complete fetal chromosomal aneuploidies.

他の実施形態において、本発明は、テストサンプルにおける関心対象配列、すなわち、臨床的に関連する配列のコピー数多型（変動）を同定する方法を提供し、この本発明方法は、(ａ)テストサンプル及び複数個の適格サンプルを採取するサンプル採取ステップであって、テストサンプルはテスト核酸分子を含み、複数個の適格サンプルは適格核酸分子を含むものとした、該サンプル採取ステップと、(ｂ)テストサンプルにおける胎児及び母体の核酸に関する配列情報を取得するステップと、(ｃ)適格核酸分子のシークエンシングに基づいて、前記複数個の適格サンプルにおける適格関心対象配列の適格配列ドースを計算する適格配列ドース計算ステップであり、適格配列ドースの計算は、適格関心対象配列のための、少なくとも１つの適格正規化配列におけるパラメータを決定することを含む、該適格配列ドース計算ステップと、(ｄ)適格配列ドースに基づいて、少なくとも１つの適格正規化配列を同定する適格正規化配列同定ステップであって、少なくとも１つの適格正規化配列は、前記複数個の適格サンプルにおける配列ドースに関して最小変動性及び／又は最大弁別可能性を有するものとした、該適格正規化配列同定ステップと、(ｅ)前記テストサンプルにおける核酸分子の配列に基づいて、関心対象テスト配列のテスト配列ドースを計算するテスト配列ドース計算ステップであって、テスト配列ドースの計算は、関心対象テスト配列のための少なくとも１つの正規化テスト配列におけるパラメータを決定することを含み、また少なくとも１つの正規化テスト配列は少なくとも１つの適格正規化配列に対応するものとした、該テスト配列ドース計算ステップと、(ｆ)前記テスト配列ドースを少なくとも１つの閾値に比較するステップと、(ｇ)ステップ(ｆ)における結果に基づいて、テストサンプルにおける関心対象配列のコピー数多型を評価するステップとを有する。一実施形態において、適格関心対象配列及び少なくとも１つの適格正規化配列のパラメータは、適格関心対象配列にマッピングした配列タグ数を適格正規化配列にマッピングしたタグ数に関連付け、また関心対象テスト配列及び少なくとも１つの正規化テスト配列のパラメータは、関心対象テスト配列にマッピングした配列タグ数を正規化テスト配列にマッピングしたタグ数に関連付けるものとする。若干の実施形態において、ステップ(ｂ)は、適格核酸分子及びテスト核酸分子の少なくとも一部をシークエンシングするステップであって、このシークエンシングにより、関心対象テスト配列及び適格配列のための、並びに少なくとも１つのテスト正規化配列及び少なくとも１つの適格正規化配列のための複数のマッピングした配列タグを得るものとしたステップ、テストサンプルの核酸分子の少なくとも一部をシークエンシングしてテストサンプルにおける胎児及び母体の核酸分子の配列情報を得るステップを含むものとする。若干の実施形態において、シークエンシングステップは、次世代シークエンシングとする。若干の実施形態において、シークエンシング方法は、可逆色素ターミネーターによるシークエンシング・バイ・シンセシスを使用する、大量並列シークエンシングとする。他の実施形態において、シークエンシング方法は、シークエンシング・バイ・リゲーションとする。若干の実施形態において、シークエンシングは、増幅を含むものとする。他の実施形態において、シークエンシングは、単独分子シークエンシングとする。関心対象配列のＣＮＶは、染色体異数性又は部分異数性の異数性とする。若干の実施形態において、染色体異数性は、２番トリソミー、８番トリソミー、９番トリソミー、１６番トリソミー、２１番トリソミー、１３番トリソミー、１８番トリソミー、２２番トリソミー、４７,ＸＸＹ、４７,ＸＸＸ、４７,ＸＹＹ、及びＸモノソミーから選択する。他の実施形態において、部分異数性は、部分的染色体欠失又は部分的染色体挿入である。若干の実施形態において、本発明方法によって同定されるＣＮＶは、がんに関連する染色体異数性又は部分異数性である。若干の実施形態において、テストサンプル及び適格サンプルは、生体液サンプル、例えば、血漿サンプルとし、妊娠したヒト検体のような妊娠検体から採取する。他の実施形態において、テスト生体液サンプル及び適格生体液サンプル、例えば、血漿サンプルは、がんであることが既知である又は疑われている検体から採取する。 In another embodiment, the present invention provides a method of identifying a sequence of interest in a test sample, ie, a copy number variation (variation) of a clinically relevant sequence, the method of the present invention comprising: A sample collection step for collecting a test sample and a plurality of eligible samples, wherein the test sample comprises a test nucleic acid molecule and the plurality of eligible samples comprise a qualified nucleic acid molecule; (b ) Obtaining sequence information regarding fetal and maternal nucleic acids in the test sample; and (c) qualifying the qualifying sequence dose of the qualifying sequence of interest in the plurality of qualifying samples based on sequencing of qualifying nucleic acid molecules. A sequence dose calculation step, wherein the calculation of a qualified sequence dose comprises at least one qualified positive for the sequence of qualified interest. Determining a parameter in the normalized sequence, the qualified sequence dose calculation step, and (d) a qualified normalized sequence identification step that identifies at least one qualified normalized sequence based on the qualified sequence dose, comprising: A qualified normalized sequence identification step, wherein said qualified normalized sequence identification step has minimum variability and / or maximum discriminability with respect to sequence dose in said plurality of qualified samples; and (e) in said test sample A test sequence dose calculation step for calculating a test sequence dose for a test sequence of interest based on a sequence of nucleic acid molecules, the calculation of the test sequence dose in at least one normalized test sequence for the test sequence of interest Determining parameters, and at least one normalized test sequence is at least The test sequence dose calculation step corresponding to one eligible normalized sequence, (f) comparing the test sequence dose to at least one threshold, and (g) based on the results in step (f). Evaluating a copy number variation of the sequence of interest in the test sample. In one embodiment, the qualified sequence of interest and at least one qualified normalized sequence parameter relate the number of sequence tags mapped to the qualified sequence of interest to the number of tags mapped to the qualified normalized sequence, and the test sequence of interest and The parameter of at least one normalized test sequence shall relate the number of sequence tags mapped to the test sequence of interest to the number of tags mapped to the normalized test sequence. In some embodiments, step (b) is a step of sequencing at least a portion of the qualified nucleic acid molecule and the test nucleic acid molecule, whereby the sequencing is for the test sequence of interest and the qualified sequence, and at least Obtaining a plurality of mapped sequence tags for one test normalization sequence and at least one qualified normalization sequence, sequencing at least a portion of the nucleic acid molecules of the test sample, and fetus and mother in the test sample The step of obtaining the sequence information of the nucleic acid molecule. In some embodiments, the sequencing step is a next generation sequencing. In some embodiments, the sequencing method is massively parallel sequencing using sequencing by synthesis with a reversible dye terminator. In another embodiment, the sequencing method is sequencing by ligation. In some embodiments, sequencing shall include amplification. In other embodiments, the sequencing is a single molecule sequencing. The CNV of the sequence of interest is chromosomal or partial aneuploid. In some embodiments, the chromosomal aneuploidy is trisomy 2, trisomy 8, trisomy 9, trisomy 16, trisomy 21, trisomy 13, trisomy 18, trisomy 18, 47, XXY, 47, Select from XXX, 47, XYY, and X monosomy. In other embodiments, the partial aneuploidy is a partial chromosomal deletion or partial chromosomal insertion. In some embodiments, the CNV identified by the method of the invention is a chromosomal aneuploidy or partial aneuploidy associated with cancer. In some embodiments, the test sample and eligible sample are biological fluid samples, eg, plasma samples, taken from a pregnant specimen, such as a pregnant human specimen. In other embodiments, test biological fluid samples and qualified biological fluid samples, eg, plasma samples, are collected from specimens that are known or suspected of having cancer.

本明細書における実施例はヒトに関連し、専門用語も主にヒトに関するものであるが、本発明の概念は任意の植物又は動物のゲノムに適用できる。 Although the examples herein relate to humans and the terminology primarily relates to humans, the concepts of the present invention can be applied to the genome of any plant or animal.

参照による組入れ
すべての特許、特許出願、及び本明細書に引用した参照文献に記載のあらゆる配列を含む他の文献は、参照によって、各個別の刊行物、特許又は特許出願が特別にまた個別に記載したのと同程度に、本明細書にはっきりと組入れたものとする。引用したすべての文献の関連部分を参照によって本明細書に組入れたものとする。しかし、任意の文献の引用は、本発明の従来技術である旨の了解と解釈すべきではない。 INCORPORATION BY REFERENCE All patents, patent applications, and other documents, including any sequences described in the references cited herein, are specifically and individually listed by reference for each individual publication, patent or patent application. It is to be clearly incorporated herein as much as described. The relevant parts of all cited documents are incorporated herein by reference. However, citation of any document should not be construed as an admission that it is prior art of the present invention.

本発明の新規な特徴を添付の特許請求の範囲に特別に記載する。本発明の特徴及び利点をよりよい理解は、本発明の原理を利用する例示的な実施形態について、図面につき行う以下の詳細な説明を参照することによって得られるであろう。 The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized.

核酸混合物を有するテストサンプルにおけるコピー数多型の有無を決定する方法１００のフローチャートである。2 is a flowchart of a method 100 for determining the presence or absence of copy number polymorphism in a test sample having a nucleic acid mixture. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定した２１番染色体に関する染色体ドース（量）の分布であって、２１番染色体として適格な、すなわち、正常なもの（normal）に対して（○）を付し、２１番トリソミーのテストサンプルを１〜１２番染色体及びＸ染色体に（△）を付して示す。Distribution of chromosomal doses for chromosome 21 determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human male or female fetus, A test sample of No. 21 trisomy is shown with (Δ) attached to the chromosomes 1 to 12 and the X chromosome. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定した２１番染色体に関する染色体ドース（量）の分布であって、２１番染色体として適格な、すなわち、正常なもの（normal）に対して（○）を付し、２１番トリソミーのテストサンプルを１〜２２番染色体及びＸ染色体に（△）を付して示す。Distribution of chromosomal dose (chromosome) for chromosome 21 determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, A test sample of No. 21 trisomy is shown with (Δ) attached to chromosomes 1 to 22 and X chromosome, which is qualified as chromosome 21 (ie, normal). ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定した１８番染色体に関する染色体ドース（量）の分布であって、１８番染色体として適格な、すなわち、正常なもの（normal）に対して（○）を付し、１８番トリソミーのテストサンプルを１〜１２番染色体及びＸ染色体に（△）を付して示す。A distribution of chromosomal doses for chromosome 18 determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, A test sample of No. 18 trisomy is shown with (Δ) attached to the chromosomes 1 to 12 and the X chromosome. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定した１８番染色体に関する染色体ドース（量）の分布であって、１８番染色体として適格な、すなわち、正常なもの（normal）に対して（○）を付し、１８番トリソミーのテストサンプルを１〜２２番染色体及びＸ染色体に（△）を付して示す。A distribution of chromosomal doses for chromosome 18 determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, A test sample of trisomy 18 is shown with (Δ) attached to chromosomes 1 to 22 and X chromosome (◯) with respect to normal that is qualified as chromosome 18, ie, normal. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定した１３番染色体に関する染色体ドース（量）の分布であって、１３番染色体として適格な、すなわち、正常なもの（normal）に対して（○）を付し、１３番トリソミーのテストサンプルを１〜１２番染色体及びＸ染色体に（△）を付して示す。Distribution of chromosomal dose (chromosome) for chromosome 13 determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, A test sample of No. 13 trisomy is shown with (Δ) attached to the chromosomes 1 to 12 and the X chromosome that are qualified as chromosome 13 (ie, normal). ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定した１３番染色体に関する染色体ドース（量）の分布であって、１３番染色体として適格な、すなわち、正常なもの（normal）に対して（○）を付し、１３番トリソミーのテストサンプルを１〜２２番染色体及びＸ染色体に（△）を付して示す。Distribution of chromosomal dose (chromosome) for chromosome 13 determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, A test sample of No. 13 trisomy is shown with (Δ) attached to chromosomes 1 to 22 and X chromosome, which is qualified as chromosome 13 (ie, normal). ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定したＸ染色体に関する染色体ドース（量）の分布であって、男児のＸ染色体ドースに（４６，ＸＹ;（○））を付し、女児のＸ染色体ドースに（４６，ＸＸ;（△））を付し、Ｘモノソミーに（４５，Ｘ；（＋））を付し、複雑核型のサンプルを１〜１２番染色体に（Cplx（Ｘ））を付して示す。Distribution of chromosomal doses (quantities) on the X chromosome determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, (46, XY; (◯)) is attached to the X chromosome dose of the child, (46, XX; (△)) is attached to the X chromosome dose of the girl, and (45, X; (+)) is attached to the X monosomy. A complex karyotype sample is shown with (Cplx (X)) attached to chromosomes 1 to 12. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定したＸ染色体に関する染色体ドース（量）の分布であって、男児のＸ染色体ドースに（４６，ＸＹ;（○））を付し、女児のＸ染色体ドースに（４６，ＸＸ;（△））を付し、Ｘモノソミーに（４５，Ｘ；（＋））を付し、複雑核型のサンプルを１〜２２番染色体に（Cplx（Ｘ））を付して示す。A distribution of chromosomal doses (quantities) for the X chromosome determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, (46, XY; (◯)) is attached to the X chromosome dose of the child, (46, XX; (△)) is attached to the X chromosome dose of the girl, and (45, X; (+)) is attached to the X monosomy. A complex karyotype sample is shown with (Cplx (X)) attached to chromosomes 1 to 22. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定したＹ染色体に関する染色体ドース（量）の分布であって、男児のＹ染色体ドースに（４６，ＸＹ;（△））を付し、女児のＸ染色体ドースに（４６，ＸＸ;（○））を付し、Ｘモノソミーに（４５，Ｘ；（＋））を付し、複雑核型のサンプルを１〜１２番染色体に（Cplx（Ｘ））を付して示す。A distribution of chromosomal doses (quantity) for the Y chromosome determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, (46, XY; (△)) is attached to the Y-chromosome dose of the child, (46, XX; (◯)) is attached to the X-chromosome dose of the girl, and (45, X; (+)) is attached to the X monosomy. A complex karyotype sample is shown with (Cplx (X)) attached to chromosomes 1 to 12. ヒトの男児又は女児の胎児を妊娠した各被検体から採取した４８個の血液サンプルのセットから抽出したｃｆＤＮＡをシークエンシングすることによって決定したＹ染色体に関する染色体ドース（量）の分布であって、男児のＹ染色体ドースに（４６，ＸＹ;（△））を付し、女児のＸ染色体ドースに（４６，ＸＸ;（○））を付し、Ｘモノソミーに（４５，Ｘ；（＋））を付し、複雑核型のサンプルを１〜２２番染色体に（Cplx（Ｘ））を付して示す。A distribution of chromosomal doses (quantity) for the Y chromosome determined by sequencing cfDNA extracted from a set of 48 blood samples taken from each subject pregnant with a human boy or girl fetus, (46, XY; (△)) is attached to the Y-chromosome dose of the child, (46, XX; (◯)) is attached to the X-chromosome dose of the girl, and (45, X; (+)) is attached to the X monosomy. A complex karyotype sample is shown with (Cplx (X)) attached to chromosomes 1 to 22. 図２，３及び４でそれぞれ示したドースから決定された２１番染色体（黒四角）、１８番染色体（●）及び１３番染色体（黒三角）の変動係数（ＣＶ：coefficient of variation）を示す。The coefficient of variation (CV) of chromosome 21 (black square), chromosome 18 (●) and chromosome 13 (black triangle) determined from the doses shown in FIGS. 図５及び６でそれぞれ示したドースから決定されたＸ染色体（黒四角）、及びＹ染色体（●）の変動係数（ＣＶ：coefficient of variation）を示す。The coefficient of variation (CV) of the X chromosome (black square) and Y chromosome (●) determined from the doses shown in FIGS. 5 and 6, respectively, is shown. ヒト染色体によるＧＣ画分の累積分布であり、垂直軸は染色体の頻度を表し、値の下方でＧＣ含有量を水平軸に示す。It is the cumulative distribution of the GC fraction by human chromosomes, the vertical axis represents the frequency of the chromosome, and the GC content is shown on the horizontal axis below the value. 妊娠したヒト被検者から採取した７個の適格サンプル（○）及び１個のテストサンプル（◇）のセットから抽出したｃｆＤＮＡをシークエンシングすることで決定した１１番染色体の断片（81000082-10300013bp）の配列ドース（Ｙ軸）を示し、１１番染色体に部分的異数性を有する胎児を孕んでいる被検者からのサンプルを同定した。Fragment of chromosome 11 (81000082-10300013bp) determined by sequencing cfDNA extracted from a set of 7 eligible samples (◯) and 1 test sample (◇) collected from a pregnant human subject A sample from a subject bearing a fetus having a partial aneuploidy on chromosome 11 was identified. 異常がないサンプルにおける対応染色体に関する平均（Ｙ軸）の標準偏差に対する２１番染色体の正規化した染色体ドースの分布を示す。The distribution of the normalized chromosomal dose of chromosome 21 with respect to the standard deviation of the mean (Y axis) for the corresponding chromosome in a sample with no abnormality is shown. 異常がないサンプルにおける対応染色体に関する平均（Ｙ軸）の標準偏差に対する１８番染色体の正規化した染色体ドースの分布を示す。The distribution of the normalized chromosomal dose of chromosome 18 with respect to the standard deviation of the mean (Y axis) for the corresponding chromosome in a sample with no abnormality is shown. 異常がないサンプルにおける対応染色体に関する平均（Ｙ軸）の標準偏差に対する１３番染色体の正規化した染色体ドースの分布を示す。The distribution of the normalized chromosomal dose of chromosome 13 with respect to the standard deviation of the mean (Y axis) for the corresponding chromosome in a sample with no abnormality is shown. 異常がないサンプルにおける対応染色体に関する平均（Ｙ軸）の標準偏差に対するＸ染色体の正規化した染色体ドースの分布を示す。FIG. 5 shows the normalized chromosome dose distribution of the X chromosome relative to the standard deviation of the mean (Y axis) for the corresponding chromosome in a sample with no abnormalities. 異常がないサンプルにおける対応染色体に関する平均（Ｙ軸）の標準偏差に対するＹ染色体の正規化した染色体ドースの分布を示す。FIG. 6 shows the normalized chromosomal dose distribution of the Y chromosome relative to the standard deviation of the mean (Y axis) for the corresponding chromosome in a sample with no abnormalities. 実施例６に記載した正規化染色体を使用して、トレーニングセット１からのサンプルで決定した２１番染色体（○）、１８番染色体（△）、１３番染色体（□）の正規化染色体値を示す。Using normalized chromosomes described in Example 6, the normalized chromosome values of chromosome 21 (◯), chromosome 18 (Δ), and chromosome 13 (□) determined in the sample from training set 1 are shown. . 実施例６に記載した正規化染色体を使用して、テストセット１からのサンプルで決定した２１番染色体（○）、１８番染色体（△）、１３番染色体（□）の正規化染色体値を示す。Using normalized chromosomes described in Example 6, the normalized chromosome values of chromosome 21 (◯), chromosome 18 (Δ), and chromosome 13 (□) determined in the sample from test set 1 are shown. . チュー氏らによる正規化方法（サンプルにおける残存染色体用に取得した（塩基配列タグ数を有する関心対象である染色体として同定した配列タグ数を正規化する。実施例７参照）を使用して、テストセット１からのサンプルで決定した２１番染色体（○）、１８番染色体（△）、１３番染色体（□）の正規化染色体値を示す。Tested using the normalization method by Chu et al. (Normalized for the number of sequence tags identified for the chromosome of interest with base sequence tag number obtained for the remaining chromosome in the sample; see Example 7) The normalized chromosome values of chromosome 21 (◯), chromosome 18 (Δ), and chromosome 13 (□) determined with the sample from set 1 are shown. （実施例７で説明したように）系統的に決定した正規化染色体を使用して、トレーニングセット１からのサンプルで決定した２１番染色体（○）、１８番染色体（△）、１３番染色体（□）の正規化染色体値を示す。Using normalized chromosomes systematically determined (as described in Example 7), chromosome 21 (◯), chromosome 18 (Δ), chromosome 13 (in the sample from training set 1) □) shows the normalized chromosome value. （実施例７で説明したように）系統的に決定した正規化染色体を使用して、テストセット１からのサンプルで決定した２１番染色体（○）、１８番染色体（△）、１３番染色体（□）の正規化染色体値を示す。Using normalized chromosomes systematically determined (as described in Example 7), chromosomes 21 (◯), 18 (Δ), 13 (determined in samples from test set 1 ( □) shows the normalized chromosome value. （実施例７で説明したように）系統的に決定した正規化染色体を使用して、テストセット１からのサンプルで決定した９番染色体（○）の正規化染色体値を示す。The normalized chromosome values of chromosome 9 (O) determined in the sample from Test Set 1 are shown using systematically determined normalized chromosomes (as described in Example 7). Ｘ染色体（Ｘ軸）及びＹ染色体（Ｙ軸）の正規化した染色体値を示し、実施例７で説明したように、トレーニングセット及びテストセットのそれぞれで同定した５個のＸモノソミーサンプルを矢印が示す。The normalized chromosome values of the X chromosome (X axis) and the Y chromosome (Y axis) are shown, and as described in Example 7, five X monosomy samples identified in each of the training set and the test set are indicated by arrows. Show. Ｘ染色体（Ｘ軸）及びＹ染色体（Ｙ軸）の正規化した染色体値を示し、実施例７で説明したように、トレーニングセット及びテストセットのそれぞれで同定した３個のＸモノソミーサンプルを矢印が示す。The normalized chromosome values of the X chromosome (X axis) and the Y chromosome (Y axis) are shown, and as described in Example 7, three X monosomy samples identified in each of the training set and the test set are indicated by arrows. Show. （実施例７で説明したように）系統的に決定した正規化染色体を使用して、テストセット１からのサンプルで決定した１〜２２番染色体の正規化染色体値を示す。The normalized chromosome values of chromosomes 1-22 determined in samples from test set 1 are shown using systematically determined normalized chromosomes (as described in Example 7).

１個又はそれ以上の関心対象である配列の総量が既知である、又はその配列の総量に違いがあると懸念される核酸混合物を有するテストサンプルにおける、関心対象である配列のコピー数多型（ＣＮＶ：copy number variations）を決定する方法を提供する。関心対象である配列は、遺伝子状態又は病状に関連することが既知又は懸念される染色体全体に関してキロベース（ｋｂ）からメガベース（Ｍｂ）の範囲にわたるゲノム配列を含む。関心対象である配列の例としては、よく知られている異数性、例えば、２１番トリソミーに関連する染色体、及びがんのような疾患で増殖される染色体断片、例えば、急性骨髄性白血病における部分的８番トリソミーに関連する染色体がある。本発明により決定することができるＣＮＶとしては、１〜２２番の染色体、Ｘ及びＹの性染色体のうち任意の１つ又はそれ以上におけるモノソミー及びトリソミー、例えば、４５,Ｘ、４７,ＸＸＸ、４７,ＸＸＹ、及び４７,ＸＹＹ、他の染色体ポリソミー、すなわち限定はしないがＸＸＸＸ、ＸＸＸＸＸ、ＸＸＸＸＹ、ＸＹＹＹＹを含むテトラソミー及びペンタソミー、染色体のうち任意の１つ又はそれ以上における断片（セグメント）の欠失及び／又は重複がある。 A copy number variation of the sequence of interest in a test sample having a nucleic acid mixture in which the total amount of one or more sequences of interest is known or that the total amount of the sequences is of concern A method for determining CNV: copy number variations is provided. Sequences of interest include genomic sequences ranging from kilobases (kb) to megabases (Mb) with respect to the entire chromosome known or concerned to be associated with a genetic condition or disease state. Examples of sequences of interest include the well-known aneuploidy, eg, chromosomes associated with trisomy 21 and chromosome fragments that are propagated in diseases such as cancer, eg, acute myeloid leukemia There is a chromosome associated with partial trisomy 8. CNVs that can be determined according to the present invention include monosomy and trisomy on any one or more of chromosomes 1-22, X and Y sex chromosomes, eg 45, X, 47, XXX, 47 , XXY, and 47, XYY, other chromosomal polysomy, ie, tetrasomy and pentasomy, including but not limited to XXXX, XXXXXX, XXXXY, XYYYY, deletion of segments (segments) in any one or more of the chromosomes and There is an overlap.

本発明方法は、処理に関連する染色体間（ラン内）変動及びシークエンシング間（ラン間）変動からの見越し変動ステミング処理を担う統計的アプローチをなす。この方法は、任意の胎児異数性におけるＣＮＶ、及び種々の内科的疾患に関連すると既知である又は懸念されるＣＮＶを決定するのに適用可能である。 The method of the present invention provides a statistical approach that is responsible for the accrual variability stemming process from inter-chromosome (intra-run) and inter-sequence (inter-run) variability associated with processing. This method is applicable to determine CNV in any fetal aneuploidy, and CNVs known or concerned about various medical disorders.

他に明示しない限り、本発明の実施には、従来の技術範囲にある分子生物学、微生物学、タンパク質精製、タンパク質工学、タンパク質及びＤＮＡシークエンシング（配列決定）及び組換えＤＮＡの分野で共通して使用される普通の技術を含む。このような技術は当業者には既知であり、多くの文書及び参照文献（例えば、Sambrook et al.,”Molecular Cloning:A Laboratory Manual”,Third Edition (Cold Spring Harbor）, [2001]); 及びAusubel et al., “Current Protocols in Molecular Biology” [1987]参照。）に記載されている。 Unless otherwise indicated, the practice of the present invention is common in the fields of molecular biology, microbiology, protein purification, protein engineering, protein and DNA sequencing (sequencing) and recombinant DNA within the prior art. Including ordinary technology used. Such techniques are known to those skilled in the art and many documents and references (eg, Sambrook et al., “Molecular Cloning: A Laboratory Manual”, Third Edition (Cold Spring Harbor), [2001]); See Ausubel et al., “Current Protocols in Molecular Biology” [1987]. )It is described in.

多くの範囲が範囲を画定する数に含まれている。本明細書に記載するあらゆる最大数値限定は、あらゆるより低い数値限定をも、このようなより低い数値限定が本明細書にはっきりと記載されているように含むことを意図する。本明細書に記載するあらゆる最小数値限定は、あらゆるより高い数値限定をも、このようなより高い数値限定が本明細書にはっきりと記載されているように含むことを意図する。本明細書に記載するあらゆる数値範囲は、このようなより広い数値範囲内にあるあらゆる狭い数値範囲をも、このようなより狭い数値範囲が本明細書にはっきりと記載されているように含むことを意図する。 Many ranges are included in the number defining the range. Any maximum numerical limitation set forth herein is intended to include any lower numerical limitation as such lower numerical limitation is expressly set forth herein. Any minimum numerical limitation set forth herein is intended to include any higher numerical limitation as such higher numerical limitation is expressly set forth herein. Every numerical range recited herein includes any narrow numerical range falling within such wider numerical range, as such narrower numerical range is expressly set forth herein. Intended.

本明細書に付された見出し項目は、本明細書を全体的に参照することによって理解できる本発明の種々の態様又は実施形態を限定するものではない。したがって、上述したように、以下に定義する用語は、本明細書全体を参照することによってより十分に定義されるものである。 The headings provided herein are not intended to limit the various aspects or embodiments of the invention that can be understood by reference to the specification as a whole. Accordingly, as described above, the terms defined below are more fully defined by reference to the entire specification.

本明細書に別様に定義しない限り、本明細書に使用するすべての技術的及び科学的な用語は、本発明が属する当業者に共通して理解されるのと同一の意味を有する。本明細書に含まれる用語を含む種々の科学辞書は既知であり、当業者が利用可能である。本明細書に記載のものと類似又は等価な任意の方法及び材料を本発明の実施又はテストに使用できるが、幾つかの好適な方法及び材料を記載する。したがって、以下に定義する用語は、本明細書全体を参照することによってより十分に記述されるものである。本発明は、本明細書に記載される特別な方法論、手順、及び試薬に限定されるものではなく、当業者が使用する文脈に応じて変化し得る。 Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Various scientific dictionaries containing the terms contained herein are known and available to those skilled in the art. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, some suitable methods and materials are described. Accordingly, the terms defined below are more fully described by reference to the entire specification. The present invention is not limited to the particular methodologies, procedures, and reagents described herein and may vary depending on the context used by those skilled in the art.

定義
本明細書に使用する、単数表記の”a”,”an”及び”the”は、他に明示しない限り複数での言及も含むものとする。他に明示しない限り、核酸は左から右に５′から３′に向かう向きに記述し、アミノ酸配列は左から右にアミノからカルボキシに向かう向きでそれぞれ記述する。 Definitions As used herein, the singular forms “a”, “an”, and “the” are intended to include plural references unless the context clearly dictates otherwise. Unless otherwise indicated, nucleic acids are written left to right in 5 'to 3' orientation, and amino acid sequences are written left to right in amino to carboxy orientation, respectively.

本明細書における用語「評価（assessing）」は、「正常（normal）」、「異変あり（affected）」、「ノーコール（no-call）」という３タイプの判定（コール）のうち１つによって染色体異数性の状態を特徴付けることを意味する。例えば、トリソミー有無判定場合、「正常」の判定は、パラメータ、例えばユーザー定義の信頼性閾値未満のテスト染色体ドースの値によって決定し、「異変あり」の判定は、パラメータ、例えばユーザー定義の信頼性閾値を超えるテスト染色体ドースによって決定し、「ノーコール」の判定は、パラメータ、例えば「正常」又は「異変あり」の判定を行うユーザー定義の信頼性閾値間にあるテスト染色体ドースによって決定する。 As used herein, the term “assessing” refers to a chromosome by one of three types of determinations: “normal”, “affected”, and “no-call”. Means characterizing the aneuploidy state. For example, in the case of trisomy presence / absence determination, the determination of “normal” is determined by a parameter, for example, a value of a test chromosome dose that is less than a user-defined reliability threshold, and the determination of “abnormality” is a parameter, for example, a user-defined reliability The determination of “no call” is determined by a test chromosome dose that is between user-defined confidence thresholds that make a determination of a parameter, eg, “normal” or “abnormal”.

本明細書における用語「コピー数多型」は、適格サンプルに存在する核酸配列のコピー数と比較するテストサンプルに存在する１ｋｂ以上の長さを有する核酸配列におけるコピー数の変動を意味する。「コピー数多型（copy number variation）」は、核酸における１ｋｂ以上の長さを有する配列であって、コピー数の差異はテストサンプルにおける関心対象配列を適格サンプルに存在するその関心対象配列と比較することによって見出す。コピー数多型には、微小欠失を含む欠失、微小挿入を含む挿入、重複、増殖、逆位、転座、及び複合多部位変異がある。ＣＮＶは染色体異数性及び部分的異数性を含む。 As used herein, the term “copy number polymorphism” means a variation in copy number in a nucleic acid sequence having a length of 1 kb or more present in a test sample compared to the copy number of a nucleic acid sequence present in a qualified sample. A “copy number variation” is a sequence having a length of 1 kb or more in a nucleic acid, and the copy number difference compares a sequence of interest in a test sample with its sequence of interest in a qualified sample. To find out. Copy number polymorphisms include deletions including microdeletions, insertions including microinsertions, duplication, growth, inversion, translocation, and complex multisite mutations. CNV includes chromosomal aneuploidy and partial aneuploidy.

本明細書における用語「異数性」は、染色体全体又は染色体一部における不足又は過剰によって生ずる遺伝子材料の不均衡を意味する。 As used herein, the term “aneuploidy” means an imbalance of genetic material caused by a deficiency or excess in an entire chromosome or a portion of a chromosome.

本明細書における用語「染色体異数性」及び「完全染色体異数性」は、染色体全体における不足又は過剰によって生ずる遺伝子材料の不均衡を意味し、また生殖細胞系列異数性及びモザイク異数性を含む。 As used herein, the terms “chromosomal aneuploidy” and “complete chromosomal aneuploidy” mean an imbalance of genetic material caused by a deficiency or excess in the entire chromosome, and germline aneuploidy and mosaic aneuploidy including.

本明細書における用語「部分異数性」及び「部分染色体異数性」は、染色体の一部における不足又は過剰、例えば、部分モノソミー及び部分トリソミーによって生ずる遺伝子材料の不均衡を意味し、転座、欠失及び挿入によって生ずる不均衡を含む。 As used herein, the terms “partial aneuploidy” and “partial chromosomal aneuploidy” refer to an imbalance in genetic material caused by a deficiency or excess in part of a chromosome, for example, partial monosomy and partial trisomy, Including imbalances caused by deletions and insertions.

本明細書における用語「異数性サンプル」は、染色体含有量が正倍数性でない検体を表すサンプル、すなわち、染色体のコピー数が異常な検体を表すサンプルを意味する。 As used herein, the term “aneuploid sample” means a sample that represents an analyte whose chromosome content is not euploid, that is, a sample that represents an analyte with an abnormal chromosome copy number.

本明細書における用語「異数性染色体」は、異常コピー数のサンプルに存在することが既知である又は決定された染色体を意味する。 As used herein, the term “aneuploid chromosome” means a chromosome that is known or determined to be present in a sample of an abnormal copy number.

本明細書における用語「複数」は、本発明方法に使用するテストサンプル及び適格サンプルにおいて、コピー数多型における大きな差異を識別するのに十分な多数の核酸分子又は塩基配列タグ（例えば、染色体ドース）を意味する。若干の実施形態において、２０〜４０個の塩基対（ｂｐ）リード（reads）を有する、少なくとも約３×１０^６個の塩基配列タグ、少なくとも約５×１０^６個の塩基配列タグ、少なくとも約８×１０^６個の塩基配列タグ、少なくとも約１０×１０^６個の塩基配列タグ、少なくとも約１５×１０^６個の塩基配列タグ、少なくとも約２０×１０^６個の塩基配列タグ、少なくとも約３０×１０^６個の塩基配列タグ、少なくとも約４０×１０^６個の塩基配列タグ、少なくとも約５０×１０^６個の塩基配列タグを、各テストサンプルから得る。 As used herein, the term “plurality” refers to a large number of nucleic acid molecules or nucleotide sequence tags (eg, chromosomal dose tags) sufficient to distinguish large differences in copy number variations in test and qualified samples used in the methods of the invention. ). In some embodiments, at least about 3 × 10 ⁶ base sequence tags, at least about 5 × 10 ⁶ base sequence tags, having at least about 40 × 40 base pair (bp) reads, at least about 8 × 10 ⁶ base sequence tags, at least about 10 × 10 ⁶ base sequence tags, at least about 15 × 10 ⁶ base sequence tags, at least about 20 × 10 ⁶ base sequence tags, at least about 30 × 10 ^Six base sequence tags, at least about 40 × 10 ⁶ base sequence tags, at least about 50 × 10 ⁶ base sequence tags are obtained from each test sample.

本明細書における用語「ポリヌクレオチド」、「核酸」及び「核酸分子」は、互いに置き換え可能に使用され、ヌクレオチドの共有結合配列（すなわち、ＲＮＡのリボヌクレオチド及びＤＮＡのデオキシリボヌクレオチド）を意味し、１つのヌクレオチドにおけるペントースの３′位置をホスホジエステル基によって、次のヌクレオチドにおけるペントースの５′位置に結合し、限定しないがＲＮＡ，ＤＮＡ，及びｃｆＤＮＡ分子を含む任意の核酸形式の配列を含む。用語「ポリヌクレオチド」は、限定しないが単独らせん及び２重らせんのポリヌクレオチドを含む。 As used herein, the terms “polynucleotide”, “nucleic acid” and “nucleic acid molecule” are used interchangeably and refer to covalent sequences of nucleotides (ie, RNA ribonucleotides and DNA deoxyribonucleotides). The 3 'position of the pentose in one nucleotide is linked by a phosphodiester group to the 5' position of the pentose in the next nucleotide, including sequences of any nucleic acid format, including but not limited to RNA, DNA, and cfDNA molecules. The term “polynucleotide” includes, but is not limited to, single and double helix polynucleotides.

本明細書における用語「部分」は、生物学的サンプルにおける胎児及び母体の核酸分子の配列情報量の合計が１ヒトゲノムの配列情報より少ない量を意味する。 As used herein, the term “portion” means that the total amount of sequence information of fetal and maternal nucleic acid molecules in a biological sample is less than the sequence information of one human genome.

本明細書における用語「テストサンプル」は、コピー数に変動があったことが懸念される少なくとも１つの核酸配列を有する核酸混合物を含むサンプルを意味する。テストサンプルに存在する核酸は「テスト核酸」と称する。 As used herein, the term “test sample” refers to a sample comprising a mixture of nucleic acids having at least one nucleic acid sequence that is of concern for copy number variation. Nucleic acids present in a test sample are referred to as “test nucleic acids”.

本明細書における用語「適格サンプル」は、テストサンプルにおける核酸と比べられるコピー数が既知で存在する核酸混合物を有するサンプルを意味し、正常、すなわち関心対象である配列に異数性がないサンプルであり、例えば、２１番染色体の正規化染色体を識別するのに使用する適格サンプルは２１番トリソミーサンプルではないサンプルである。 As used herein, the term “eligible sample” refers to a sample having a mixture of nucleic acids that is known and present in copy numbers compared to nucleic acids in a test sample, that is, a sample that is normal, ie, the sequence of interest has no aneuploidy. Yes, for example, a qualified sample used to identify a normalized chromosome of chromosome 21 is a sample that is not a trisomy sample.

本明細書における用語「トレーニングセット」は、異変ありサンプル及び異変なしサンプルを含むことができるサンプルセットを意味する。トレーニングセットにおける異変なしサンプルは、正規化配列、例えば、正規化染色体を識別する適格サンプルとして使用し、異変なしサンプルの染色体ドースを使用し、関心対象である各配列、例えば、染色体それぞれの閾値を設定する。トレーニングセットにおける異変ありサンプルは、異変ありテストサンプルを異変なしサンプルから容易に区別できることを検証するのに使用することができる。 As used herein, the term “training set” means a sample set that can include samples with and without anomalies. Non-aberrant samples in the training set are used as qualifying samples to identify normalized sequences, e.g., normalized chromosomes, and the chromosomal doses of invariant samples are used to determine the threshold for each sequence of interest, e.g., each chromosome. Set. Samples with anomalies in the training set can be used to verify that test samples with anomalies can be easily distinguished from samples without anomalies.

本明細書における用語「適格核酸」は、「適格配列」と互いに置き換え可能に使用することができ、「適格配列」はテスト配列又はテスト核酸の総量と比較する配列である。適格配列は既知の表象で、すなわち、適格配列の総量が既知で生物学的サンプルに存在する。「関心対象である適格配列」は、適格サンプルにおいて総量が既知である適格配列であり、内科的疾患のある個人における配列表現の差異に関連する配列である。 As used herein, the term “qualifying nucleic acid” can be used interchangeably with “qualifying sequence”, which is a test sequence or a sequence that is compared to the total amount of test nucleic acid. A qualified sequence is a known representation, ie, the total amount of qualified sequence is known and present in a biological sample. A “qualifying sequence of interest” is a qualifying sequence whose total amount is known in a qualifying sample, and is a sequence associated with a difference in sequence expression in an individual with a medical disorder.

本明細書における用語「関心対象（である）配列」は、健康な個人対疾患のある個人における配列表現の差異に関連する核酸配列を意味する。関心対象配列は、内科的疾患又は遺伝子疾患における誤表現、すなわち、過剰表現又は不足表現されている染色体における配列である。関心対象配列は、さらに、染色体の一部、すなわち染色体断片（セグメント）、又は染色体にもなり得る。例えば、関心対象配列は、異数性症状で過剰表現される染色体、又はがんで不足表現される腫瘍抑制因子をコード化する遺伝子となり得る。関心対象配列は、検体における細胞の全体母集団又は部分母集団における過剰表現又は不足表現になっている配列を含む。「適格関心対象配列」は適格サンプルにおける関心対象配列である。「テスト関心対象配列」はテストサンプルにおける関心対象配列である。 As used herein, the term “sequence of interest” refers to a nucleic acid sequence that is associated with a difference in sequence representation in a healthy individual versus a diseased individual. A sequence of interest is a misrepresentation in a medical or genetic disorder, ie, a sequence in a chromosome that is over- or under-represented. The sequence of interest can also be part of a chromosome, ie a chromosomal fragment (segment), or a chromosome. For example, the sequence of interest can be a gene that encodes a chromosome that is overexpressed in aneuploidy or a tumor suppressor that is underexpressed in cancer. Sequences of interest include sequences that are over- or under-represented in the entire or subpopulation of cells in the specimen. A “qualifying sequence of interest” is a sequence of interest in a qualified sample. A “test sequence of interest” is a sequence of interest in a test sample.

本明細書における用語「正規化配列」は、サンプル間でマッピングされる塩基配列タグ（ときに「配列タグ」と略称する）の数における変動（多型）を呈する配列であって、正規化パラメータとして使用される関心対象配列のそれと最も近似し、また１つ又はそれ以上の異変なしサンプルから異変ありサンプルを最も区別できるシークエンシング実行である。「正規化染色体」又は「正規化染色体配列」は、「正規化配列」の例である。「正規化染色体配列」は、単独染色体又は染色体グループによって構成することができる。「正規化断片」は「正規化配列」の他の例である。「正規化断片配列」は、染色体の単独断片によって構成することができる、又は同一若しくは異なった染色体における２つ又はそれ以上の断片によって構成することができる。 As used herein, the term “normalized sequence” refers to a sequence that exhibits variation (polymorphism) in the number of base sequence tags (sometimes abbreviated as “sequence tags”) mapped between samples, A sequencing run that most closely resembles that of the sequence of interest used as and can most distinguish the anomalous samples from one or more anomalous samples. A “normalized chromosome” or “normalized chromosome sequence” is an example of a “normalized sequence”. A “normalized chromosomal sequence” can be composed of a single chromosome or a group of chromosomes. “Normalized fragment” is another example of “normalized sequence”. A “normalized fragment sequence” can be composed of a single fragment of a chromosome, or it can be composed of two or more fragments on the same or different chromosomes.

本明細書における用語「弁別可能性」は、１つ又はそれ以上の異変なし、すなわち正常サンプルを、１つ又はそれ以上の異変あり、すなわち異数性サンプルから区別できる正規化染色体の特徴を意味する。 As used herein, the term “discriminability” means a normalized chromosomal feature that can distinguish one or more abnormalities, ie, normal samples from one or more abnormalities, ie, aneuploid samples. To do.

本明細書における用語「配列ドース」は、関心対象配列の配列タグ密度を正規化配列のタグ密度に関連付けるパラメータを意味する。「テスト配列ドース」は、関心対象配列、例えば２１番染色体の配列タグ密度を正規化配列、例えばテストサンプルで決定した９番染色体の配列タグ密度に関連付けするパラメータである。同様に、「適格配列ドース」は、関心対象配列の配列タグ密度を適格サンプルで決定した正規化配列の配列タグ密度に関連付けするパラメータである。 As used herein, the term “sequence dose” refers to a parameter that relates the sequence tag density of a sequence of interest to the tag density of a normalized sequence. “Test sequence dose” is a parameter that relates the sequence tag density of a sequence of interest, eg, chromosome 21, to a normalized sequence, eg, the chromosome 9 sequence tag density determined in a test sample. Similarly, “qualifying sequence dose” is a parameter that relates the sequence tag density of a sequence of interest to the sequence tag density of a normalized sequence determined in a qualified sample.

本明細書における用語「配列タグ密度」は、基準ゲノム配列にマッピングされる配列リード（reads)数を意味し、例えば、２１番染色体の配列タグ密度は、基準ゲノムの２１番染色体にマッピングされるようシークエンシング方法によって生じた配列リード数である。本明細書における用語「配列タグ密度比」は、基準ゲノム配列の染色体、例えば、２１番染色体にマッピングされる配列タグ数の、基準ゲノムにおける２１番染色体の長さに対する比を意味する。 As used herein, the term “sequence tag density” refers to the number of sequence reads mapped to a reference genome sequence, eg, the sequence tag density of chromosome 21 is mapped to chromosome 21 of the reference genome. The number of sequence reads generated by the sequencing method. As used herein, the term “sequence tag density ratio” means the ratio of the number of sequence tags mapped to a chromosome of a reference genome sequence, eg, chromosome 21, to the length of chromosome 21 in the reference genome.

本明細書における用語「次世代シークエンシング（ＮＧＳ：Next Generation Sequencing」は、クローン的に増幅された、また単独核酸分子における大量並列シークエンシングができるシークエンシング方法を意味する。ＮＧＳにおける非限定的な例としては、可逆色素ターミネータを使用するシークエンシング・バイ・シンセシス（sequencing-by-synthesis）及びシークエンシング・バイ・リゲーション（sequencing-by-ligation）がある。 As used herein, the term “Next Generation Sequencing” refers to a sequencing method that is clonally amplified and capable of massively parallel sequencing on a single nucleic acid molecule. Examples are sequencing-by-synthesis using reversible dye terminators and sequencing-by-ligation.

本明細書における用語「パラメータ」は、数量的データセット及び／又は数量的データセット相互間の数的関係性を特徴付ける数値を意味する。例えば、染色体にマッピングされる配列タグ数と、配列タグがマッピングされる染色体の長さとの比（又は比の関数）をパラメータとする。 As used herein, the term “parameter” means a numerical value that characterizes a quantitative data set and / or a numerical relationship between quantitative data sets. For example, a ratio (or a function of the ratio) between the number of sequence tags mapped to a chromosome and the length of the chromosome to which the sequence tag is mapped is used as a parameter.

本明細書における用語「閾値」及び「適格閾値」は、適格認定するデータセットを使用して計算し、また有機体におけるコピー数多型、例えば異数性に関する診断上の制限値として作用する任意の数値を意味する。本発明を実施することから得られた結果が閾値を超える場合、被検体はコピー数多型、例えば２１番トリソミーがあると診断することができる。本発明に記載する方法の適切な閾値は、サンプルのトレーニングセット用に計算した正規化値（例えば、染色体ドース、ＮＣＶｓ、又はＮＳＶｓ）を解析することによって同定することができる。閾値は、適格（すなわち、異変なし）サンプル及び異変ありサンプルの双方を有するトレーニングセットにおける適格（すなわち、異変なし）サンプルを使用して同定することができる。トレーニングセットにおける染色体異数性があると分かっているサンプル（すなわち、異変ありサンプル）を使用して、選択した閾値がテストセットにおける異変なしサンプルから異変ありを区別するのに有用であるかを確認することができる（本明細書の実施例参照）。閾値選択は、分類を行わなければならないと希望するユーザーの確信レベルに依存する。若干の実施形態において、適切な閾値を同定するのに使用するトレーニングセットは、少なくとも１０個、少なくとも２０個、少なくとも３０個、少なくとも４０個、少なくとも５０個、少なくとも６０個、少なくとも７０個、少なくとも８０個、少なくとも９０個、少なくとも１００個、少なくとも２００個、少なくとも３００個、少なくとも４００個、少なくとも５００個、少なくとも６００個、少なくとも７００個、少なくとも８００個、少なくとも９００個、少なくとも１０００個、少なくとも２０００個、少なくとも３０００個、少なくとも４０００個、又はそれより多くの個数のサンプルを有するものとする。より多くの適格サンプルのセットを使用することは、閾値の診断有用性を改善するのに有利である。 As used herein, the terms “threshold” and “eligibility threshold” are arbitrary values that are calculated using the qualifying dataset and that serve as diagnostic limits for copy number variation in organisms, eg, aneuploidy Means the numerical value of If the result obtained from practicing the present invention exceeds a threshold, the subject can be diagnosed as having a copy number variation, eg, trisomy 21. Appropriate thresholds for the methods described in the present invention can be identified by analyzing normalized values (eg, chromosomal doses, NCVs, or NSVs) calculated for a training set of samples. The threshold can be identified using qualifying (ie, no anomaly) samples in a training set having both qualifying (ie, no anomaly) samples and anomalous samples. Use samples that are known to have chromosomal aneuploidy in the training set (ie, samples with anomalies) to see if the selected threshold is useful to distinguish the anomalies from the samples without anomalies in the test set (See the examples herein). The threshold selection depends on the confidence level of the user who wishes to perform classification. In some embodiments, the training set used to identify appropriate thresholds is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80. At least 90, at least 100, at least 200, at least 300, at least 400, at least 500, at least 600, at least 700, at least 800, at least 900, at least 1000, at least 2000, It shall have at least 3000, at least 4000, or more samples. Using a larger set of eligible samples is advantageous to improve the diagnostic utility of the threshold.

本明細書における用語「正規化値」は、関心対象配列（例えば、染色体又は染色体断片）に対して同定された配列タグの数を、正規化配列（例えば、正規化染色体又は正規化染色体断片）に対して同定された配列タグの数に関連付けする数値を意味する。例えば、「正規化値」は、本明細書のいたるところに記載した染色体ドースとすることができる、又は本明細書のいたるところに記載したＮＣＶ（Normalized Chromosome Value）又は本明細書のいたるところに記載したＮＳＶ（Normalized Segment Value）とすることができる。 As used herein, the term “normalized value” refers to the number of sequence tags identified for a sequence of interest (eg, a chromosome or chromosome fragment) and the normalized sequence (eg, normalized chromosome or normalized chromosome fragment). Means a numerical value associated with the number of sequence tags identified for. For example, a “normalized value” can be a chromosomal dose described elsewhere in this specification, or an NCV (Normalized Chromosome Value) described elsewhere in this specification, or elsewhere in this specification. The described NSV (Normalized Segment Value) can be used.

本明細書における用語「リード（read）」は、十分な長さ（例えば、少なくとも約３０ｂｐ）のＤＮＡ配列を意味し、このリードを使用してより大きな配列又は領域を同定することができ、例えば、染色体又はゲノム領域又は遺伝子に整列させ、また特別に割り当てることができる。 As used herein, the term “read” refers to a DNA sequence of sufficient length (eg, at least about 30 bp) that can be used to identify larger sequences or regions, Aligned and specially assigned to chromosomes or genomic regions or genes.

本明細書における用語「配列タグ」は、「マッピングされた配列タグ」と互いに置き換えて使用され、より大きな配列、例えば基準ゲノムに整列(alignment)によって特別に割り当てられた、すなわちマッピングされた配列リードを意味する。マッピングされた配列タグは、基準ゲノムに一意的にマッピングされる、すなわち、基準ゲノムに対する単独ロケーションとして割り当てられる。基準ゲノムに対して１ロケーションより多いロケーションでマッピングできるタグ、すなわち、一意的にマッピングされないタグはこの解析には含まれない。 The term “sequence tag” herein is used interchangeably with “mapped sequence tag” and is specifically assigned by a larger sequence, eg, alignment to a reference genome, ie, mapped sequence reads. Means. The mapped sequence tag is uniquely mapped to the reference genome, ie assigned as a single location relative to the reference genome. Tags that can be mapped at more than one location relative to the reference genome, ie, tags that are not uniquely mapped, are not included in this analysis.

本明細書における用語「整列した（aligned）」、「整列(alignment)」、又は「整列する（aligning）」は、基準ゲノムから既知の配列に対する核酸分子の順番における一致として同定される１つ又はそれ以上の配列状態を意味する。このような整列は、手作業で、又は例えば、イルミナ・ゲノミクス・アナリシス（Illumina Genomics Analysis）におけるパイプラインの一部として配給されるヌクレオチドデータの効率的局所的整列（ＥＬＡＮＤ：Efficient Local Alignment of Nucleotide Data）コンピュータアルゴリズムを含むコンピュータアルゴリズムによって行うことができる。 As used herein, the terms “aligned”, “alignment”, or “aligning” are one or more identified as a match in the order of nucleic acid molecules to a known sequence from a reference genome. It means more array state. Such alignment can be done manually or, for example, Efficient Local Alignment of Nucleotide Data (ELAND) of nucleotide data delivered as part of a pipeline in Illumina Genomics Analysis. ) It can be done by computer algorithms including computer algorithms.

本明細書における用語「基準ゲノム」は、検体から同定された配列を参照するのに使用することができる任意の有機体又はウイルスにおける任意の特別な既知ゲノム配列（部分又は全体のいずれか）を意味する。例えば、ヒト検体並びに他の多くの有機体用に使用される基準ゲノムは、バイオテクノロジー情報ナショナルセンター（www.ncbi.nlm.hih.gov.）において見つけることができる。「ゲノム」は、有機体又はウイルスにおける核酸配列で表現される完全遺伝情報を意味する。 As used herein, the term “reference genome” refers to any particular known genomic sequence (either partial or whole) in any organism or virus that can be used to refer to a sequence identified from a specimen. means. For example, reference genomes used for human specimens as well as many other organisms can be found at the National Center for Biotechnology Information (www.ncbi.nlm.hih.gov.). “Genome” means complete genetic information represented by a nucleic acid sequence in an organism or virus.

本明細書における用語「臨床関連配列（clinically-relevant sequence）」は、遺伝子疾患又は病状に関連する又は関与することが既知である、又は疑われる核酸配列を意味する。臨床関連配列の有無を決定することは、内科的疾患の診断を決定する若しくは診断の確認をする上で、又は疾病の進行診断を行う上で有用である。 As used herein, the term “clinically-relevant sequence” refers to a nucleic acid sequence known or suspected to be associated with or involved in a genetic disease or condition. Determining the presence or absence of clinically relevant sequences is useful in determining diagnosis or confirming diagnosis of a medical disease, or in making progress diagnosis of a disease.

本明細書において、核酸又は核酸混合物の文脈で使用される用語「由来する（derived）」は、核酸がその起源となる発生源（ソース）から得られたことを意味する。例えば、一実施形態において、２つの異なったゲノムから由来する核酸混合物とは、核酸、例えばｃｆＤＮＡがネクローシス又はアポトーシスのような自然発生的プロセスによって細胞から自然に放出されたものであることを意味する。他の実施形態においては、２つの異なったゲノムから由来する核酸混合物とは、核酸を検体からの細胞における２つの異なったタイプから抽出したことを意味する。 As used herein, the term “derived” as used in the context of a nucleic acid or nucleic acid mixture means that the nucleic acid was obtained from the source from which it originated. For example, in one embodiment, a nucleic acid mixture derived from two different genomes means that the nucleic acid, eg, cfDNA, is naturally released from the cell by a naturally occurring process such as necrosis or apoptosis. . In other embodiments, a mixture of nucleic acids derived from two different genomes means that the nucleic acids were extracted from two different types of cells from the specimen.

本明細書における用語「混合サンプル（mixed sample）」は、異なったゲノムに由来する核酸の混合物を含むサンプルを意味する。 The term “mixed sample” herein refers to a sample containing a mixture of nucleic acids from different genomes.

本明細書における用語「母体サンプル（maternal sample）」は、妊娠した検体、例えば、女性のヒトから採取した生物学的サンプルを意味する。 As used herein, the term “maternal sample” refers to a biological sample taken from a pregnant specimen, eg, a female human.

本明細書における用語「生体液（biological fluid）」は、生体液源から採取した液体を意味し、例えば、血液、血清、血漿、痰、破出液、脳脊髄液、尿、精液、汗、涙、唾液等がある。本明細書で使用する用語「血液」、「血漿」及び「血清」は、それらの画分又は処理した部分をも包含する。同様に、サンプルを生検、綿棒、塗抹等から採取する場合、「サンプル」は生検、綿棒、塗抹等に由来する処理画分又は部分をも含む。 The term “biological fluid” as used herein means a fluid collected from a biological fluid source, such as blood, serum, plasma, sputum, exudate, cerebrospinal fluid, urine, semen, sweat, There are tears and saliva. As used herein, the terms “blood”, “plasma” and “serum” also include fractions or processed portions thereof. Similarly, when a sample is taken from a biopsy, cotton swab, smear, etc., the “sample” also includes a treated fraction or portion derived from a biopsy, cotton swab, smear, etc.

本明細書における用語「母体核酸」及び「胎児核酸」は、それぞれ妊娠女性検体の核酸及びその妊娠女性検体が孕んでいる胎児の核酸を意味する。 As used herein, the terms “maternal nucleic acid” and “fetal nucleic acid” mean the nucleic acid of a pregnant female specimen and the nucleic acid of a fetus that the pregnant female specimen is in respectively.

本明細書で使用する用語「〜に対応する」は、異なった検体ゲノムに存在し、必ずしもすべてのゲノムにおいて同一配列を有するものではないが、例えば、遺伝子又は染色体である関心対象配列の遺伝情報以外の固有性を与えるのに供される、例えば、遺伝子又は染色体である核酸配列を意味する。 As used herein, the term “corresponds to” exists in different specimen genomes and does not necessarily have the same sequence in all genomes, but for example genetic information of a sequence of interest that is a gene or a chromosome. It means a nucleic acid sequence that is subject to other uniqueness, eg, a gene or a chromosome.

本明細書で使用する用語「ほぼ無細胞」は、所望のサンプルの調合であって、この所望サンプルから通常関連する成分を除去する該所望サンプルを包含する。例えば、血漿サンプルは、血漿に関連する血液細胞、例えば赤色細胞（赤血球）を除去することによってほぼ無細胞状態にしたものである。若干の実施形態においては、ほぼ無細胞のサンプルは、除去処理を行わないと、ＣＮＶのテストをすべき所望遺伝的材料に寄与するであろう細胞を除去する処理を行う。 As used herein, the term “substantially cell-free” encompasses the preparation of a desired sample that removes normally associated components from the desired sample. For example, a plasma sample is made substantially cell-free by removing blood cells associated with plasma, such as red cells (red blood cells). In some embodiments, the nearly cell-free sample is treated to remove cells that would otherwise contribute to the desired genetic material to be tested for CNV if not removed.

本明細書に使用する用語「胎児画分」は、胎児核酸及び母体核酸を含むサンプル内に存在する胎児核酸の画分を意味する。 As used herein, the term “fetal fraction” refers to the fraction of fetal nucleic acid present in a sample containing fetal and maternal nucleic acids.

本明細書に使用する用語「染色体」は、染色質に由来し、またＤＮＡ及びタンパク質成分（ヒストン）を含む生きている細胞の遺伝担持遺伝子キャリアを意味する。本明細書では、従来の国際的に認識されている個体ヒトゲノム染色体番号付け体系を採用する。 As used herein, the term “chromosome” refers to a gene carrier of a living cell that is derived from chromatin and contains DNA and protein components (histones). In this specification, the conventional internationally recognized individual human genome chromosome numbering system is adopted.

本明細書に使用する用語「ポリヌクレオチド長さ」は、基準ゲノムの配列又は領域における核酸分子（ヌクレオチド）の絶対数を意味する。用語「染色体長さ」は、塩基対における染色体の既知の長さを意味し、例えば、ワールド・ワイド・ウェブ上の”genome.ucsc.edu/cgi-bin/hgTracks?Hgsid=167155613&chromInfoPage=”で見つかるヒト染色体のNCBI36/hg18アセンブリに規定されている。 The term “polynucleotide length” as used herein refers to the absolute number of nucleic acid molecules (nucleotides) in a sequence or region of a reference genome. The term “chromosome length” means the known length of a chromosome in base pairs, eg found on the World Wide Web at “genome.ucsc.edu/cgi-bin/hgTracks?Hgsid=167155613&chromInfoPage=” It is defined in the NCBI36 / hg18 assembly of the human chromosome.

本明細書における用語「検体」は、ヒト検体並びにヒトではない検体、例えば、哺乳類、無脊椎動物、脊椎動物、菌類、酵母、細菌、及びウイルスをも意味する。本明細書における例はヒトに関連し、専門用語は主にヒト関連指向であるが、本発明の概念は任意の植物又は動物からのゲノムに適用でき、また獣医学、動物科学、研究所等の分野において有用である。 The term “specimen” as used herein also means human specimens as well as non-human specimens such as mammals, invertebrates, vertebrates, fungi, yeasts, bacteria, and viruses. Although the examples herein relate to humans and the terminology is primarily human-related, the concepts of the present invention can be applied to genomes from any plant or animal, and veterinary, animal science, laboratory, etc. It is useful in the field of

本明細書における用語「病状」は、すべての疾患及び障害を含む広い意味での「内科的疾患」を意味するが、個人の健康に影響を及ぼし、医療支援の恩恵を受ける、又は医療処置の実施を受けることがあるような「傷害」及び妊娠のような正常健康状態をも含むものとする As used herein, the term “disease state” refers to a “medical disease” in a broad sense, including all diseases and disorders, but affects an individual's health and benefits from medical assistance or medical treatment. Includes “injuries” that may be implemented and normal health conditions such as pregnancy.

用語「完全」は、本明細書において全体染色体の過剰又は不足に言及する染色体異数性につき使用する。 The term “complete” is used herein for chromosomal aneuploidy, which refers to a total chromosome excess or deficiency.

染色体異数性につき使用する用語「部分」は、染色体の一部における過剰又は不足に言及する。 The term “portion” used for chromosomal aneuploidy refers to an excess or deficiency in a portion of a chromosome.

本明細書における用語「モザイク」は、単独受精卵から成長した１個体における異なった核型を有する２つの細胞集団が存在することを意味する。モザイク現象は、成長中の突然変異から生じ、この突然変異は成熟細胞の小集団にのみ波及する。 The term “mosaic” as used herein means that there are two cell populations with different karyotypes in one individual grown from a single fertilized egg. Mosaic phenomenon results from a growing mutation that only affects a small population of mature cells.

本明細書における用語「非モザイク」は、１つの核型の細胞よりなる有機体、例えば、ヒト胎児を意味する。 As used herein, the term “non-mosaic” refers to an organism composed of cells of one karyotype, such as a human fetus.

染色体ドースを決定することにつき使用する用語「染色体を使用する」は、本明細書において、染色体のために得る配列情報、すなわち、染色体のために得る配列タグの数を使用することを意味する。 The term “using a chromosome” as used in determining chromosomal dose means herein using the sequence information obtained for a chromosome, ie the number of sequence tags obtained for the chromosome.

本明細書に使用する用語「感度」は、真陽性及び偽陰性の合計で真陽性を除算した数値に等しい。 As used herein, the term “sensitivity” is equal to the value obtained by dividing true positives by the sum of true positives and false negatives.

本明細書に使用する用語「特異度」は、真陰性及び偽陽性の合計で真陰性を除算した数値に等しい。 As used herein, the term “specificity” is equal to the number of true negatives divided by the sum of true negatives and false positives.

本明細書における用語「患者サンプル」は、患者、すなわち医療的注意、ケア又は処置を受ける個人から得た生物学的サンプルを意味する。患者サンプルは、本明細書に記載する任意のサンプルとすることができる。好適には、患者サンプルは、非侵襲性手順で採取し、例えば、末梢血サンプル又は糞便サンプルとする。 As used herein, the term “patient sample” means a biological sample obtained from a patient, ie, an individual who receives medical attention, care or treatment. The patient sample can be any sample described herein. Preferably, the patient sample is collected by a non-invasive procedure, for example a peripheral blood sample or a stool sample.

本明細書における用語「低二倍性」は、種の染色体特徴における正常な半数より１つ又はそれ以上少ない染色体数を意味する。 As used herein, the term “hypodiploidy” means a chromosome number that is one or more than the normal half of the chromosome characteristics of the species.

説明
本発明は、２つの異なったゲノム由来の核酸混合物を有し、１つ又はそれ以上の関心対象配列における総量が異なることが既知である、又は疑われるテストサンプルにおける、異なる関心対象配列のコピー数多型（ＣＮＶ：copy number variation）を決定する方法を提供する。本発明方法によって決定されたコピー数多型としては、全体染色体の過剰又は不足、顕微鏡的に可視の極めて大きな染色体断片を含む変更、キロベース（ｋｂ）からメガベース（Ｍｂ）にも及ぶＤＮＡ断片における超顕微鏡的コピー数多型の多量存在がある。本発明方法は、この方法は、処理に関連する染色体間変動及びシークエンシング間変動からの見越し変動ステミング処理を担う統計的アプローチを含む。この方法は、任意の胎児異数性におけるＣＮＶ、及び種々の内科的疾患に関連すると既知である若しくは懸念されるＣＮＶを決定するのに適用可能である。本発明方法によって決定できるＣＮＶには、１〜２２番染色体、Ｘ及びＹ染色体のうち任意の１つ又はそれ以上におけるトリソミー及びモノソミー、他の染色体ポリソミー、及び染色体のうち１つ又はそれ以上における断片（セグメント）の欠失及び／又は重複が含まれ、こらはテストサンプルの核酸を１回だけシークエンシングすることによって決定することができる。いかなる異数性もテストサンプルの核酸を１回だけシークエンシングすることによって得られる配列情報から決定することができる。 DESCRIPTION The present invention has a mixture of nucleic acids from two different genomes and copies of different sequences of interest in a test sample known or suspected of differing total amounts in one or more sequences of interest. A method of determining copy number variation (CNV) is provided. Copy number polymorphisms determined by the method of the present invention include excess or deficiency of total chromosomes, alterations involving very large chromosomal fragments visible microscopically, DNA fragments ranging from kilobases (kb) to megabases (Mb) There is a large amount of supermicroscopic copy number polymorphism. The method of the present invention includes a statistical approach that is responsible for accrual variation stemming processing from interchromosomal and inter-sequence variation associated with the process. This method is applicable to determine CNVs in any fetal aneuploidy, and CNVs known or concerned about various medical disorders. CNV that can be determined by the method of the present invention includes trisomy and monosomy in any one or more of chromosomes 1-22 and X and Y, other chromosomal polysomy, and fragments in one or more of the chromosomes (Segment) deletions and / or duplications are included, which can be determined by sequencing the nucleic acid of the test sample only once. Any aneuploidy can be determined from the sequence information obtained by sequencing the nucleic acid of the test sample only once.

ヒトゲノムにおけるＣＮＶは、ヒトの及び多様性及び疾病素因に大きな影響を与える（Redon et al., Nature 23:444-454 [2006], Shaikh et al. Genome Res 19:1682-1690 [2009]参照）。ＣＮＶは、異なるメカニズムによる遺伝的疾病に関与し、多くの場合遺伝子量又は遺伝子の乱れによる不均衡に起因することが分かっている。遺伝的疾患に直接関連することの他に、ＣＮＶは、疾患となり得る表現型異常の仲立ちをすることがしられている。近年、幾つかの研究は、正常な対照例と比較すると、自閉症、ＡＤＨＤ及び統合失調症のような複合疾患における、稀な又はデノボ（新）なＣＮＶの増加を報告しており、稀な又はユニークなＣＮＶの潜在的病原性を浮き彫りにしている（Sebat et al., 316:445-449 [2007]; Walsh et al., Science 320:539-543 [2008]参照）。ＣＮＶは、主に欠失、重複、挿入及び不均衡な転座事象に起因するゲノム再編成から生ずる。 CNV in the human genome has a major impact on human and diversity and disease predisposition (see Redon et al., Nature 23: 444-454 [2006], Shaikh et al. Genome Res 19: 1682-1690 [2009]) . CNV is implicated in genetic illnesses by different mechanisms and is often found to be due to an imbalance due to gene dosage or gene disruption. In addition to being directly related to genetic diseases, CNV has been able to mediate phenotypic abnormalities that can lead to disease. In recent years, several studies have reported a rare or de novo (new) CNV increase in complex diseases such as autism, ADHD and schizophrenia, compared to normal controls. It highlights the potential pathogenicity of a unique or unique CNV (see Sebat et al., 316: 445-449 [2007]; Walsh et al., Science 320: 539-543 [2008]). CNV arises from genomic rearrangements mainly due to deletions, duplications, insertions and unbalanced translocation events.

本明細書に記載する方法は、次世代シークエンシング技術（ＮＧＳ：next generation sequencing technology）を採用し、クローン的に増幅したＤＮＡテンプレート又は単独ＤＮＡ分子をフローセル内で大量並列的にシークエンシングする（例えば、Volkerding et al., Clin Chem 55:641-658 [2009]; Metzker M Nature Rev 11: 31-46 [2010]参照）。高いスループットの配列情報の他に、ＮＧＳは、各配列リードが個別クローンＤＮＡテンプレート又は単独ＤＮＡ分子を表現する計数可能な「配列タグ」である点で定量的情報を提供する。ＮＧＳのシークエンシング技術としては、ピロシークエンシング（pyrosequencing）、可逆色素ターミネータを使用するシークエンシング・バイ・シンセシス（sequencing-by-synthesis）、オリゴヌクレオチドのプローブ結紮によるシークエンシング、及びイオン半導体シークエンシングがある。個別サンプルからのＤＮＡを個別にシークエンシングする（すなわち、単独シークエンシング）、又は複数サンプルからのＤＮＡをプールし、１回のシークエンシング作業（ラン）でインデックス付きのゲノム分子としてシークエンシング（すなわち、多重シークエンシング）し、ＤＮＡ配列の数憶個のリードを生ずることができる。本発明方法による配列情報を得るのに使用できるシークエンシング技術の例を以下に説明する。 The methods described herein employ next generation sequencing technology (NGS) to sequence clonally amplified DNA templates or single DNA molecules in parallel in a flow cell (eg, Volkerding et al., Clin Chem 55: 641-658 [2009]; Metzker M Nature Rev 11: 31-46 [2010]). In addition to high throughput sequence information, NGS provides quantitative information in that each sequence read is a countable “sequence tag” that represents an individual clone DNA template or a single DNA molecule. NGS sequencing techniques include pyrosequencing, sequencing-by-synthesis using reversible dye terminators, sequencing by oligonucleotide probe ligation, and ion semiconductor sequencing. is there. Sequencing DNA from individual samples individually (ie, single sequencing), or pooling DNA from multiple samples and sequencing as indexed genomic molecules in a single sequencing run (ie, run) Multiple sequencing) and can generate several hundred reads of the DNA sequence. Examples of sequencing techniques that can be used to obtain sequence information according to the method of the present invention are described below.

シークエンシング方法
若干のシークエンシング技術は市場で入手可能であり、例えば、アフィメトリクス（Affymetrix Inc.[カリフォルニア州サニーベール]）社からのシークエンシング・バイ・ハイブリダイゼーション基盤、及び４５４ライフ・サイエンシズ（Life Sciences [コネチカット州ブラッドフォード]）社、イルミナ／ソレクサ（Illumina/Solexa [カリフォルニア州ヘイワード]）社及びヘリコス・バイオサイエンシズ（Helicos Biosciences [マサチューセッツ州ケンブリッジ]）社からのシークエンシング・バイ・シンセシス基盤、及び、アプライド・バイオシステムズ（Applied Biosystems [カリフォルニア州フォスターシティ]）社からのシークエンシング・バイ・リゲーション基盤があり、これらを以下に説明する。ヘリコス・バイオサイエンシズ社のシークエンシング・バイ・シンセシスを使用して実施する単独分子シークエンシングの他に、他の単独分子シークエンシング技術としては、パシフィック・バイオサイエンシズ（Pacific Biosciences）社のＳＭＲＴ（登録商標）、イオン・トレント（登録商標）社の技術、及び例えば、オックスフォード・ナノポア・テクノロジーズ（Oxford Nanopore Technologies）社が開発したナノポア（細孔）シークエンシングがある。自動化サンガー（Sanger）方法は「第１世代」技術として見られているが、自動化サンガーシークエンシングを含むサンガーシークエンシングも、本発明方法に使用することができる。他のシークエンシング方法としては、核酸画像形成（イメージング）技術、例えば、原子間力顕微鏡（ＡＦＭ：atomic force microscopy）又は透過型電子顕微鏡（ＴＥＭ：transmission electron microscopy）がある。典型的シークエンシング技術を以下に説明する。 Sequencing Methods Some sequencing techniques are commercially available, for example, sequencing by hybridization platforms from Affymetrix Inc. (Sunnyvale, Calif.), And 454 Life Sciences (Life Sciences). [Sequence by synthesis base from [Bradford, Conn.]), Illumina / Solexa (Hayward, Calif.) And Helicos Biosciences (Cambridge, Mass.), And There is a sequencing by ligation platform from Applied Biosystems (Foster City, CA), which is described below. In addition to single molecule sequencing performed using Sequencing by Synthesis from Helicos Biosciences, other single molecule sequencing techniques include SMRT (registered trademark) from Pacific Biosciences. ), Ion Trent (R) technology, and for example, Nanopore (pore) sequencing developed by Oxford Nanopore Technologies. Although the automated Sanger method is seen as a “first generation” technology, Sanger sequencing, including automated Sanger sequencing, can also be used in the method of the present invention. Other sequencing methods include nucleic acid imaging (imaging) techniques, such as atomic force microscopy (AFM) or transmission electron microscopy (TEM). A typical sequencing technique is described below.

一実施形態において、本発明方法は、ヘリコス社の真単独分子シークエンシング（ｔＳＭＳ：true Single Molecule Sequencing）技術（Harris T.D.et al., Science 320:106-109 [2008]参照）を使用して、テストサンプルにおける核酸の配列情報、例えば母体サンプルにおけるｃｆＤＮＡを得る。ｔＳＭＳ技術において、ＤＮＡサンプルを約１００〜２００個のヌクレオチドのストランドに開裂し、またポリＡ配列を各ＤＮＡストランドの３′端部に付加する。各ストランドは、蛍光ラベル付けされたアデノシンヌクレオチドの付加によってラベル付けされる。ＤＮＡストランドをつぎにフローセル内で交配させ、このフローセルは数１００万個のオリゴＴ捕捉部位を収容し、これらオリゴＴ捕捉部位はフローセル表面に不動に固定する。そのテンプレートは約１憶テンプレート／ｃｍ^２の密度にすることができる。このフローセルを機器、例えばＨｅｌｉＳｃｏｐｅ（登録商標）シークエンサーに装着し、レーザーをフローセルの表面に照射し、各テンプレーの位置を明らかにする。ＣＣＤカメラはフローセル表面上のテンプレート位置をマッピングすることができる。テンプレートの蛍光レベルを開裂し、洗い出す。シークエンシング反応はＤＮＡポリメラーゼ及び蛍光ラベル付けヌクレオチドを導入することによって開始する。オリゴＴ核酸ハプライマーとして作用する。ポリメラーゼは、テンプレート指導に従ってラベル付けヌクレオチドをプライマーに組込む。ポリメラーゼ及び組込まれなかったヌクレオチドを除去する。蛍光ラベル付けヌクレオチドの組込みを指導するテンプレートは、フローセル表面の画像形成によって判別される。画像形成後、開裂ステップは、蛍光ラベルを除去し、このプロセスは所望のリード長さを得るまで他の蛍光ラベル付けヌクレオチドに対して繰り返す。各ヌクレオチド付加ステップで配列情報を収集する。単独分子シークエンシング技術による全ゲノムシークエンシングは、シークエンシングライブラリ準備におけるＰＣＲ塩基増幅を排除し、またサンプル調合の直接性により、そのサンプルのコピー測定ではなくサンプル自体の直接測定を可能にする。 In one embodiment, the method of the present invention is tested using Helicos true single molecule sequencing (tSMS) technology (see Harris TD et al., Science 320: 106-109 [2008]). Obtain nucleic acid sequence information in the sample, eg, cfDNA in the maternal sample. In tSMS technology, a DNA sample is cleaved into strands of about 100-200 nucleotides and a poly A sequence is added to the 3 'end of each DNA strand. Each strand is labeled by the addition of a fluorescently labeled adenosine nucleotide. The DNA strands are then mated in the flow cell, which contains millions of oligo T capture sites that are immobilized on the flow cell surface. The template can have a density of about 1 billion templates / cm ² . The flow cell is attached to an instrument such as a HeliScope® sequencer, and a laser is irradiated on the surface of the flow cell to reveal the position of each template. The CCD camera can map the template position on the flow cell surface. Cleave and wash out the fluorescence level of the template. The sequencing reaction is initiated by introducing DNA polymerase and fluorescently labeled nucleotides. Acts as an oligo T nucleic acid primer. The polymerase incorporates labeled nucleotides into the primer according to template instructions. Remove the polymerase and unincorporated nucleotides. Templates that guide the incorporation of fluorescently labeled nucleotides are determined by imaging the flow cell surface. After imaging, the cleavage step removes the fluorescent label and the process is repeated for other fluorescently labeled nucleotides until the desired lead length is obtained. Sequence information is collected at each nucleotide addition step. Whole genome sequencing by single molecule sequencing technology eliminates PCR base amplification in sequencing library preparation and allows direct measurement of the sample itself rather than a copy measurement of the sample due to the directity of the sample preparation.

他の実施形態において、本発明方法は、（ロシュ[Roche]社）４５４シークエンシング（例えば、Margulies, M.et.al. Nature 437:376-380 [2005]参照）を使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。４５４シークエンシングは、２つのステップを有する。第１ステップにおいて、ＤＮＡは、約３００〜８００個の塩基対フラグメントに切り分け、これらフラグメントは端部が粗い。次にオリゴヌクレオチドアダプタをフラグメントの端部に連結する。アダプタはフラグメントの増幅及びシークエンシングのプライマーとして作用する。フラグメントをＤＮＡ捕捉ビード、例えば、ストレプトアビジン被覆ビードに取付け、この取付けには、例えば、５′ビオチンタグを含むアダプタＢを使用する。ビードに取付けたフラグメントは、オイル−水エマルションの液滴内でＰＣＲ増幅される。この結果、各ビードにクローン的に増幅したＤＮＡフラグメントの多数のコピーを生ずる。第２ステップにおいて、ビードをウェル（ピコリットルのサイズ）内に捕捉する。ピロシークエンシングを各フラグメントに対して並列的に行う。１つ又はそれ以上のヌクレオチドの付加は光信号を発生し、この光信号をシークエンシング機器のＣＣＤカメラによって記録する。信号強度は組込まれたヌクレオチドの数に比例する。ピロシークエンシングは、ヌクレオチド付加の際に放出されるピロリン酸塩（ＰＰｉ）を利用する。ＰＰｉは、アデノシン５′ホスホ硫酸の存在下でＡＴＰスルフリル化によってＡＴＰに変換される。ルシフェラーゼ（発光酵素）はＡＴＰを使用して、ルシフェリンをオキシルシフェリンに変換し、またこの反応は光を発生し、この光を測定及び解析する。 In other embodiments, the method of the invention uses (Roche) 454 sequencing (see, eg, Margulies, M. et.al. Nature 437: 376-380 [2005]) to test samples. The sequence information of the nucleic acid, for example, cfDNA in the maternal test sample is obtained. 454 sequencing has two steps. In the first step, the DNA is cut into approximately 300-800 base pair fragments, which are rough at the ends. The oligonucleotide adapter is then ligated to the end of the fragment. The adapter acts as a primer for fragment amplification and sequencing. The fragment is attached to a DNA capture bead, such as a streptavidin-coated bead, for example using adapter B containing a 5 'biotin tag. Fragments attached to the beads are PCR amplified in oil-water emulsion droplets. This results in multiple copies of the clonally amplified DNA fragment in each bead. In the second step, the beads are captured in wells (picoliter size). Pyrosequencing is performed on each fragment in parallel. The addition of one or more nucleotides generates an optical signal that is recorded by the CCD camera of the sequencing instrument. The signal strength is proportional to the number of nucleotides incorporated. Pyrosequencing utilizes pyrophosphate (PPi) that is released upon nucleotide addition. PPi is converted to ATP by ATP sulfurylation in the presence of adenosine 5 'phosphosulfate. Luciferase (a luminescent enzyme) uses ATP to convert luciferin to oxyluciferin, and this reaction generates light, which is measured and analyzed.

他の実施形態において、本発明方法は、ＳＯＬｉＤ（登録商標）技術（アプライド・バイオシステムズ [Applied Biosystems]社）を使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。ＳＯＬｉＤシークエンシング・バイ・リゲーションにおいて、ゲノムＤＮＡをフラグメントに切り分け、フラグメントの５′及び３′端部にアダプタを取付け、フラグメントライブラリを生成する。代案として、フラグメントの５′及び３′端部にアダプタを取付け、フラグメントを円形に配列し、円形に配列されたフラグメントを短縮化して内部アダプタを生成し、またこの結果生じたフラグメントの５′及び３′端部にアダプタを取付けて整合対ライブラリを生成することによって、内部アダプタを導入することができる。つぎに、ビード、プライマー、テンプレート、及びＰＣＲを収容するマイクロリアクタ内でクローンビード集団を調製する。ＰＣＲに続いて、テンプレートを変性させ、ビードを増やして拡張したテンプレートでビードを分離する。選択したビードにおけるテンプレートに３′修飾を加え、これによりスライドガラスへの結合が可能になる。配列は、ランダムなオリゴヌクレオチドを、順次に、特定のフルオロフォア（蛍光色素分子）によって識別される中心的な決定塩基（又は塩基対）と部分的にハイブリダイゼーション（交配）及びリゲーション（連結）することによって決定することができる。色を記録した後、連結されたオリゴヌクレオチドを開裂し、また除去し、つぎに、このプロセスを繰り返す。 In other embodiments, the methods of the present invention use SOLiD® technology (Applied Biosystems) to obtain nucleic acid sequence information in a test sample, eg, cfDNA in a maternal test sample. In SOLiD sequencing by ligation, genomic DNA is cut into fragments and adapters are attached to the 5 'and 3' ends of the fragments to generate a fragment library. Alternatively, adapters are attached to the 5 ′ and 3 ′ ends of the fragments, the fragments are arranged in a circle, the circularly arranged fragments are shortened to produce an internal adapter, and the resulting fragments 5 ′ and An internal adapter can be introduced by attaching an adapter to the 3 'end to create a matched pair library. A clonal bead population is then prepared in a microreactor containing beads, primers, templates, and PCR. Following PCR, the template is denatured and the beads are separated with an expanded template with increased beads. A 3 'modification is added to the template in the selected bead, which allows binding to the glass slide. The sequence consists of random oligonucleotides, in turn, partially hybridized and ligated with a central determinant base (or base pair) identified by a particular fluorophore (fluorescent dye molecule). Can be determined. After recording the color, the ligated oligonucleotide is cleaved and removed, and the process is then repeated.

他の実施形態において、本発明方法は、パシフィック・バイオサイエンシズ社の単独分子リアルタイム（ＳＭＲＴ）シークエンシング技術を使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。ＳＭＲＴシークエンシングにおいて、色素ラベル付きヌクレオチドの連続的組込みをＤＮＡ合成中に画像化する。単独ＤＮＡポリメラーゼ分子は、リンに結合したヌクレオチドが成長するプライマーストランドに組込まれている間に配列情報を取得する、個別ゼロ・モード波長検出器（ＺＭＷ検出器：zero-mode wavelength detector）の底面に取り付く。ＺＭＷは閉じ込め構体であり、ＤＮＡポリメラーゼによる単独ヌクレオチドの組込みを、ＺＭＷに対して激しく（ミリ秒単位で）出入りするよう拡散する蛍光ヌクレオチドを背景として観察することができる。ヌクレオチドが成長ストランドに組込まれるのには数ミリ秒かかる。この時間中、蛍光ラベルは励起して蛍光信号を発生し、また蛍光タグが開裂する。色素の対応する蛍光測定はどの塩基が組込まれたかを示す。このプロセスを繰り返す。 In other embodiments, the method of the present invention uses Pacific Biosciences' single molecule real time (SMRT) sequencing technology to obtain nucleic acid sequence information in a test sample, eg, cfDNA in a maternal test sample. In SMRT sequencing, continuous incorporation of dye-labeled nucleotides is imaged during DNA synthesis. A single DNA polymerase molecule is located on the bottom of an individual zero-mode wavelength detector (ZMW detector) that acquires sequence information while incorporated into the growing primer strand of nucleotides bound to phosphorus. Attach. ZMW is a confinement structure and can be observed against a background of fluorescent nucleotides that diffuse the incorporation of single nucleotides by DNA polymerase into and out of ZMW intensely (in milliseconds). It takes a few milliseconds for the nucleotide to be incorporated into the growing strand. During this time, the fluorescent label is excited to generate a fluorescent signal and the fluorescent tag is cleaved. The corresponding fluorescence measurement of the dye indicates which base has been incorporated. Repeat this process.

他の実施形態において、本発明方法は、ナノポア（細孔）シークエンシング（例えば、Soni GV and Meller A. Clin Chem 53:1996-2001 [2007]参照）を使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。ナノポアシークエンシングＤＮＡ解析技術は、オックスフォード・ナノポア・テクノロジーズ社（英国オックスフォード）を含む多くの会社によって産業的に開発されている。ナノポアシークエンシングは単独分子シークエンシング技術であり、これによってＤＮＡの単独分子をナノポアに通過させるとき直接シークエンシングすることができる。ナノポアは直径１ナノメートルのオーダーの細孔である。ナノポアを導電流体内に浸漬し、電位差（電圧）を印加することによってナノポアにイオンが導通することにより僅かな電流を生ずる。流れる電流の量はナノポアのサイズ及び形状の影響をうける。ＤＮＡ分子がナノポアを通過するとき、ＤＮＡ分子におけるヌクレオチドはナノポアを異なる程度で塞ぎ、ナノポアに流れる電流の大きさを異なる程度で変化する。このようにして、ＤＮＡ分子がナノポアを通過するときの電流変化がＤＮＡ配列のリード（読取り）を表す。 In other embodiments, the method of the invention uses nanopore (pore) sequencing (see, eg, Soni GV and Meller A. Clin Chem 53: 1996-2001 [2007]) to sequence nucleic acids in a test sample. Obtain information, eg, cfDNA in a maternal test sample. Nanopore sequencing DNA analysis technology has been industrially developed by many companies, including Oxford Nanopore Technologies (Oxford, UK). Nanopore sequencing is a single molecule sequencing technique that allows direct sequencing when a single molecule of DNA is passed through the nanopore. Nanopores are pores on the order of 1 nanometer in diameter. By immersing the nanopore in a conductive fluid and applying a potential difference (voltage), ions are conducted to the nanopore to generate a slight current. The amount of current flowing is affected by the size and shape of the nanopore. As the DNA molecule passes through the nanopore, the nucleotides in the DNA molecule plug the nanopore to different extents and change the magnitude of the current flowing through the nanopore to different extents. Thus, the change in current as the DNA molecule passes through the nanopore represents the reading of the DNA sequence.

他の実施形態において、本発明方法は、化学的感知電界効果トランジスタ（chemＦＥＴ）アレイ（例えば、米国特許出願公開第20090026082号参照）を使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。この技術における１つの例において、ＤＮＡ分子を反応チャンバ内に配置し、テンプレート分子をポリメラーゼに結合したシークエンシングプライマーに交配させる。1個又はそれ以上の三リン酸塩がシークエンシングプライマーの３′端部で新たな核酸ストランドに組込まれるのをchemＦＥＴによる電流変化によって判別できる。アレイは多数のchemＦＥＴセンサを有することができる。他の実施例において、単独核酸はビードに取り付くことができ、また核酸はビード上で増幅でき、個別のビードはchemＦＥＴアレイにおける反応チャンバ（各チャンバはchemＦＥＴセンサを有する）に転写することができ、核酸をシークエンシングすることができる。 In other embodiments, the methods of the present invention use chemically sensitive field effect transistor (chemFET) arrays (see, eg, US Patent Application Publication No. 20090026082) to determine the sequence information of nucleic acids in a test sample, such as a maternal test sample. To obtain the cfDNA. In one example of this technique, a DNA molecule is placed in a reaction chamber and a template molecule is mated to a sequencing primer bound to a polymerase. One or more triphosphates can be incorporated into a new nucleic acid strand at the 3 'end of the sequencing primer by a current change by chemFET. An array can have multiple chemFET sensors. In other embodiments, single nucleic acids can be attached to the beads, nucleic acids can be amplified on the beads, individual beads can be transferred to reaction chambers in the chemFET array (each chamber has a chemFET sensor), Nucleic acids can be sequenced.

他の実施形態において、本発明方法は、透過型電子顕微鏡（ＴＥＭ）を使用するハルシオン・モレキュラー（Halcyon Molecular）社の技術を使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。個別分子配置急速ナノ転写（ＩＭＰＲＮＴ：Individual Molecule Placement Nano Transfer）と称されるこの方法は、選択的に重原子マーカーでラベル付けした高分子量（１５０ｋｂ以上）ＤＮＡを撮像する１原子解像度の透過型電子顕微鏡を使用し、これら分子を極薄フィルム上に超密度（3ｎｍのストランド相互間距離）の平行アレイにして塩基相互間の間隔が一定になるよう配列する。電子顕微鏡を使用してフィルム上の分子を撮像し、重原子マーカーの位置を決定し、またＤＮＡから塩基配列情報を抽出する。この方法は国際公開第２００９／０４６４４５号に記載されている。この方法は１０分未満で完全ヒトゲノムをシークエンシングすることができる。 In another embodiment, the method of the invention uses nucleic acid sequence information in a test sample, eg, cfDNA in a maternal test sample, using Halcyon Molecular technology using a transmission electron microscope (TEM). Get. This method, known as Individual Molecule Placement Nano Transfer (IMPRNT), is a single-atom-resolution transmission electron that images high molecular weight (150 kb or more) DNA selectively labeled with heavy atom markers. Using a microscope, these molecules are arranged on an ultrathin film in a parallel array of ultra-dense (3 nm interstrand distance) so that the spacing between bases is constant. An electron microscope is used to image molecules on the film, determine the position of heavy atom markers, and extract base sequence information from DNA. This method is described in WO2009 / 046445. This method can sequence the complete human genome in less than 10 minutes.

他の実施形態において、ＤＮＡシークエンシング技術はイオン・トレント（Ion Torrent）社の単独分子シークエンシングであり、これは半導体技術を単純シークエンシング化学に組合せて化学的コード化情報（Ａ，Ｃ，Ｇ，Ｔ）を半導体チップ上のデジタル情報（０，１）に直接翻訳するものである。実際は、ヌクレオチドをポリメラーゼによってＤＮＡストランドに組込むとき水素イオンが副産物として放出される。イオン・トレントは微細加工したウェルの高密度アレイを使用し、この生化学プロセスを大量並列的に行う。各ウェルは異なるＤＮＡ分子を保持する。ウェルの下方にイオン感知層を配置し、このイオン感知層の下方にイオンセンサを配置する。ヌクレオチド、例えばＣをＤＮＡテンプレートに付加し、次にＤＮＡストランドに組込むとき、水素イオンが放出される。そのイオンからの電荷は溶液のｐＨを変化させ、この変化をイオン・トレントのイオンセンサによって検出することができる。このシークエンサー（基本的には世界最小のソリッドステートｐＨ計）は塩基を判定し、化学的情報からデジタル情報に直接移行する。イオン・トレント社の個人向け（パーソナル）ゲノムマシン（ＰＧＭ［登録商標］）シーケンサーは、チップを順次に１つのヌクレオチドで横溢させる。チップを横溢する次のヌクレオチドが一致しない場合、電圧変化が記録されず、塩基が判定されない。ＤＮＡストランドに２つの個別塩基が存在する場合、電圧が倍になり、チップは判定された２つの個別塩基を記録する。直接検出によりヌクレオチド組込みを数秒で記録することができる。 In another embodiment, the DNA sequencing technology is Ion Torrent single molecule sequencing, which combines semiconductor technology with simple sequencing chemistry to provide chemical encoding information (A, C, G , T) is directly translated into digital information (0, 1) on the semiconductor chip. In fact, hydrogen ions are released as a byproduct when nucleotides are incorporated into DNA strands by polymerases. Ion Trent uses a high-density array of microfabricated wells to perform this biochemical process in large quantities in parallel. Each well holds a different DNA molecule. An ion sensing layer is disposed below the well, and an ion sensor is disposed below the ion sensing layer. When a nucleotide, such as C, is added to the DNA template and then incorporated into the DNA strand, hydrogen ions are released. The charge from the ions changes the pH of the solution, and this change can be detected by an ion torrent ion sensor. This sequencer (basically the world's smallest solid state pH meter) determines the base and moves directly from chemical information to digital information. The ION Trent personal genome machine (PGM®) sequencer sequentially floods the chip with one nucleotide. If the next nucleotide that overflows the chip does not match, the voltage change is not recorded and the base is not determined. If there are two individual bases in the DNA strand, the voltage is doubled and the chip records the determined two individual bases. Nucleotide incorporation can be recorded in seconds by direct detection.

他の実施形態において、本発明方法は、シークエンシング・バイ・ハイブリダイゼーションを使用して、テストサンプルにおける核酸の配列情報、例えば母体テストサンプルにおけるｃｆＤＮＡを得る。シークエンシング・バイ・ハイブリダイゼーションは、複数のポリヌクレオチド配列を複数個ポリヌクレオチドプローブに接触させ、複数個のポリヌクレオチドプローブそれぞれは随意的に基板に係留することができる。基板は既知のヌクレオチド配列のアレイを有する平坦表面とすることができる。アレイに対するハイブリダイゼーション（交配）パターンを使用してサンプルに存在するポリヌクレオチド配列を決定することができる。他の実施形態において、各プローブをビード、例えば磁気ビード等に係留する。ビードに対するハイブリダイゼーションを決定し、これを使用してサンプル内の複数のポリヌクレオチド配列を同定することができる。 In other embodiments, the methods of the invention use sequencing-by hybridization to obtain nucleic acid sequence information in a test sample, eg, cfDNA in a maternal test sample. Sequencing by hybridization involves contacting a plurality of polynucleotide sequences with a plurality of polynucleotide probes, each of which can optionally be anchored to a substrate. The substrate can be a flat surface having an array of known nucleotide sequences. Hybridization patterns to the array can be used to determine the polynucleotide sequence present in the sample. In other embodiments, each probe is anchored to a bead, such as a magnetic bead. Hybridization to the bead can be determined and used to identify multiple polynucleotide sequences within the sample.

他の実施形態において、本発明方法は、イルミナ社のシークエンシング・バイ・シンセシス及び可逆ターミネーター塩基シークエンシング化学（例えば、Bentley et al., Nature 6:53-59 [2009]参照）を使用する数１００万個のＤＮＡフラグメントの大量並列シークエンシングによって、テストサンプルにおける核酸の配列情報、例えば母体サンプルにおけるｃｆＤＮＡを得る。テンプレートＤＮＡはゲノムＤＮＡ、例えば、ｃｆＤＮＡとすることができる。若干の実施形態において、ｃｆＤＮＡをテンプレートとして使用し、ｃｆＤＮＡは短いフラグメントとして存在するのでフラグメント化を不要とする。例えば、胎児のｃｆＤＮＡは、長さが約１７０塩基対（ｂｐ）のフラグメントとして血流を循環し（例えば、Fan et al., Clin Chem 56:1279-1286 [2010]参照）、シークエンシング前にＤＮＡのフラグメント化は不要である。イルミナ社のシークエンシング技術は、フラグメント化したゲノムＤＮＡを、オリゴヌクレオチドアンカーを結合する平面状の透光性表面に取付けるのに依存する。テンプレートＤＮＡは端部修復されて５′リン酸化平滑末端を生じ、またクレノウフラグメントのポリメラーゼ活性を使用して単独のＡ塩基を平滑リン酸化ＤＮＡフラグメントの３′端部に付加する。この付加により、オリゴヌクレオチドアダプタに対するリゲーション（連結）のためにＤＮＡフラグメントを調製し、このオリゴヌクレオチドアダプタは３′端部で単独のＴ塩基の張出部を有し、リゲーション効率を高める。アダプタであるオリゴヌクレオチドはフローセルアンカーに対して相補的である。制限的希釈条件の下で、アダプタ修飾し、単独ストランド化したテンプレートＤＮＡをフローセルに付加し、アンカーに対するハイブリダイゼーションによって不動化する。取付けられたＤＮＡフラグメントは拡張し、またブリッジ増幅して、数億個のクラスタを有する超高密度シークエンシングフローセルを生じ、各クラスタは同一テンプレートの１０００個のコピーを含む。一実施形態において、ランダムにフラグメント化したゲノムＤＮＡ、例えばｃｆＤＮＡを、ＰＣＲを用いて増幅してからクラスタ増幅を行う。代案として、増幅しないゲノムライブラリ調製を使用し、クラスタ増幅のみを使用してランダムにフラグメント化したゲノムＤＮＡ、例えばｃｆＤＮＡを富裕にする（例えば、Kozarewa et al., Nature Methods 6:291-295 [2009]参照）。除去可能な蛍光色素を有する可逆ターミネーターを使用する堅牢な４色ＤＮＡシークエンシング・バイ・シンセシス技術を用いてテンプレートをシークエンシングする。高感度蛍光検出は、レーザー励起及び全内部反射光学系を使用することにより達成される。約２０〜４０ｂｐの短配列リードは、反復マスク掛けした基準ゲノムに対して整列し、また基準ゲノムに対する短配列リードの一意的マッピングは、特別に開発したデータ解析パイプラインソフトウェアを使用して同定する。非反復マスク掛け基準ゲノムを使用することもできる。反復マスク掛け又は非反復マスク掛けのいずれかの基準ゲノムを使用して、基準ゲノムに対して一意的にマッピングするリードのみをカウントする。第１リードが完了した後、テンプレートをその場で再生成し、フラグメントの反対側端部から第２リードを可能にする。このようにして、ＤＮＡフラグメントにおけるどちらか一方の単一端部又は修復端部のシークエンシングを使用することができる。サンプルに存在するＤＮＡフラグメントの部分的シークエンシングを行い、例えば３６ｂｐの所定長さのリードを有する配列タグを既知の基準ゲノムに対してマッピングし、これをカウントする。一実施形態において、基準ゲノム配列はNCBI36/hg18配列とし、この配列情報はワールド・ワイド・ウェブ上の”genome.ucsc.edu/cgi-bin/hgTracks?Hgsid=167155613&chromInfoPage=”から入手できる。代案として、基準ゲノム配列はGRCh37/hg19配列とし、この配列情報はワールド・ワイド・ウェブ上の”genome.ucsc.edu/cgi-bin/hgGateway”から入手できる。他の公開配列情報ソースとしては、ＧｅｎＢａｎｋ、ｄｂＥＳＴ、ｄｂＳＴＳ、ＥＭＢＬ（the European Molecular Biology Laboratory）、及びＤＤＢＪ（the DNA Databank of Japan）がある。配列を整列させる多くのコンピュータアルゴリズムが利用可能であり、これらアルゴリズムとしては、限定しないがＢＬＡＳＴ（altschul et al., 1990）、ＢＬＩＴＺ（ＭＰｓｒｃｈ）（Sturrock & Collins, 1993）、ＦＡＳＴＡ（Person & Lipman, 1988）、ＢＯＷＴＩＥ（Langmead et al., Genome Biology 10:R25.1-R25.10 [2009]）、又はＥＬＡＮＤ（Illumina, Inc., San Diego, CA, USA）がある。一実施形態において、血漿ｃｆＤＮＡのクローン的に拡張したコピーの一方の端部を、イルミナ・ゲノム・アナライザ（Illumina Genome Analyzer）用の生物情報解析によってシークエンシングし、また処理するが、これにはヌクレオチドデータベースの効率的大量整列（ＥＬＡＮＤ：Efficient Large-Scale Alignment of Nucleotide Database）ソフトウェアを使用する。本明細書に記載の方法における若干の実施形態において、マッピングされた配列タグは、約２０ｂｐ、約２５ｂｐ、約３０ｂｐ、約３５ｂｐ、約４０ｂｐ、約４５ｂｐ、約５０ｂｐ、約５５ｂｐ、約６０ｂｐ、約６５ｂｐ、約７０ｂｐ、約７５ｂｐ、約８０ｂｐ、約８５ｂｐ、約９０ｂｐ、約９５ｂｐ、約１００ｂｐ、約１１０ｂｐ、約１２０ｂｐ、約１３０ｂｐ、約１４０ｂｐ、約１５０ｂｐ、約２００ｂｐ、約２５０ｂｐ、約３００ｂｐ、約３５０ｂｐ、約４００ｂｐ、約４５０ｂｐ、約５００ｂｐの配列リードを有する。５００ｂｐを超える単独端部リードを可能にし、修復端部リードを生成するとき約１０００ｂｐより多いリードをも可能にする技術的進歩が期待される。一実施形態において、マッピングされた配列タグは３６ｂｐの配列リードを有する。配列タグのマッピングは、タグの配列を基準の配列と比較することによって得られ、これによりシークエンシングした核酸（例えば、ｃｆＤＮＡ）分子の染色体起源を決定し、また特別な遺伝配列情報は不要である。僅かな程度の不一致（配列タグあたり０〜２個の不一致）も、基準ゲノムと混合したサンプルにおけるゲノムとの間に存在し得る少数の多型性の要因となることができる。 In other embodiments, the methods of the present invention use Illumina sequencing by synthesis and reversible terminator base sequencing chemistry (see, eg, Bentley et al., Nature 6: 53-59 [2009]). Mass sequence information of 1 million DNA fragments provides nucleic acid sequence information in the test sample, eg, cfDNA in the maternal sample. The template DNA can be genomic DNA, eg cfDNA. In some embodiments, cfDNA is used as a template and does not require fragmentation because cfDNA exists as a short fragment. For example, fetal cfDNA circulates in the bloodstream as a fragment of approximately 170 base pairs (bp) in length (see, eg, Fan et al., Clin Chem 56: 1279-1286 [2010]) prior to sequencing. DNA fragmentation is not required. Illumina sequencing technology relies on attaching fragmented genomic DNA to a planar, translucent surface that binds oligonucleotide anchors. The template DNA is end repaired to give a 5 'phosphorylated blunt end, and the polymerase activity of the Klenow fragment is used to add a single A base to the 3' end of the blunt phosphorylated DNA fragment. This addition prepares a DNA fragment for ligation to the oligonucleotide adapter, which has a single T base overhang at the 3 'end, increasing ligation efficiency. The adapter oligonucleotide is complementary to the flow cell anchor. Under limiting dilution conditions, adapter-modified, single-stranded template DNA is added to the flow cell and immobilized by hybridization to the anchor. The attached DNA fragments are expanded and bridge amplified to produce ultra-high density sequencing flow cells with hundreds of millions of clusters, each cluster containing 1000 copies of the same template. In one embodiment, randomly fragmented genomic DNA, such as cfDNA, is amplified using PCR before cluster amplification. Alternatively, non-amplified genomic library preparations are used to enrich for randomly fragmented genomic DNA, such as cfDNA, using only cluster amplification (eg, Kozarewa et al., Nature Methods 6: 291-295 [2009 ]reference). Templates are sequenced using a robust four-color DNA sequencing by synthesis technique using a reversible terminator with a removable fluorescent dye. High sensitivity fluorescence detection is achieved by using laser excitation and total internal reflection optics. Short sequence reads of about 20-40 bp align to a repetitively masked reference genome, and a unique mapping of short sequence reads to the reference genome is identified using specially developed data analysis pipeline software . A non-repetitive masked reference genome can also be used. Using either repetitive or non-repetitive masking reference genomes, only those reads that uniquely map to the reference genome are counted. After the first lead is completed, the template is regenerated in situ, allowing a second lead from the opposite end of the fragment. In this way, sequencing of either single end or repair end of the DNA fragment can be used. Partial sequencing of DNA fragments present in the sample is performed, and a sequence tag having a predetermined length of, for example, 36 bp is mapped to a known reference genome and counted. In one embodiment, the reference genomic sequence is the NCBI36 / hg18 sequence, and this sequence information is available from “genome.ucsc.edu/cgi-bin/hgTracks?Hgsid=167155613&chromInfoPage=” on the World Wide Web. As an alternative, the reference genome sequence is the GRCh37 / hg19 sequence, and this sequence information is available from “genome.ucsc.edu/cgi-bin/hgGateway” on the World Wide Web. Other public sequence information sources include GenBank, dbEST, dbSTS, EMBL (the European Molecular Biology Laboratory), and DDBJ (the DNA Databank of Japan). Many computer algorithms for aligning sequences are available, including but not limited to BLAST (altschul et al., 1990), BLITZ (MPsrch) (Sturrock & Collins, 1993), FASTA (Person & Lipman, 1988), BOWTIE (Langmead et al., Genome Biology 10: R25.1-R25.10 [2009]), or ELAND (Illumina, Inc., San Diego, CA, USA). In one embodiment, one end of a clonal expanded copy of plasma cfDNA is sequenced and processed by bioinformatics analysis for the Illumina Genome Analyzer, which includes nucleotides Use Efficient Large-Scale Alignment of Nucleotide Database (ELAND) software. In some embodiments of the methods described herein, the mapped sequence tag is about 20 bp, about 25 bp, about 30 bp, about 35 bp, about 40 bp, about 45 bp, about 50 bp, about 55 bp, about 60 bp, about 65 bp. About 70 bp, about 75 bp, about 80 bp, about 85 bp, about 90 bp, about 95 bp, about 100 bp, about 110 bp, about 120 bp, about 130 bp, about 140 bp, about 150 bp, about 200 bp, about 250 bp, about 300 bp, about 350 bp, about It has a sequence read of 400 bp, about 450 bp, about 500 bp. Technological advances are expected that allow single end leads greater than 500 bp and allow for more than about 1000 bp when generating repair end leads. In one embodiment, the mapped sequence tag has a 36 bp sequence read. Sequence tag mapping is obtained by comparing the sequence of the tag to a reference sequence, thereby determining the chromosomal origin of the sequenced nucleic acid (eg, cfDNA) molecule, and no special genetic sequence information is required. . A slight degree of mismatch (0-2 mismatches per sequence tag) can also contribute to a small number of polymorphisms that may exist between the reference genome and the genome in the mixed sample.

サンプルにつき複数の配列タグが得られる。若干の実施形態において、２０〜４０個の塩基対（ｂｐ）リード（reads）を有する、少なくとも約３×１０^６個の塩基配列タグ、少なくとも約５×１０^６個の塩基配列タグ、少なくとも約８×１０^６個の塩基配列タグ、少なくとも約１０×１０^６個の塩基配列タグ、少なくとも約１５×１０^６個の塩基配列タグ、少なくとも約２０×１０^６個の塩基配列タグ、少なくとも約３０×１０^６個の塩基配列タグ、少なくとも約４０×１０^６個の塩基配列タグ、少なくとも約５０×１０^６個の塩基配列タグを、サンプルあたりリードを基準ゲノムに対してマッピングすることから得る。一実施形態において、すべての配列リードを基準ゲノムの全領域に対してマッピングする。一実施形態において、基準ゲノムにおけるすべての領域、例えばすべての染色体に対してマッピングされたタグをカウントし、また混合ＤＮＡサンプルにおけるＣＮＶ、すなわち関心対象配列、例えば染色体又は染色体の一部における過剰又は不足表現を決定する。この方法は、２つのゲノム間の区別は不要である。 Multiple sequence tags are obtained per sample. In some embodiments, at least about 3 × 10 ⁶ base sequence tags, at least about 5 × 10 ⁶ base sequence tags, having at least about 40 × 40 base pair (bp) reads, at least about 8 × 10 ⁶ base sequence tags, at least about 10 × 10 ⁶ base sequence tags, at least about 15 × 10 ⁶ base sequence tags, at least about 20 × 10 ⁶ base sequence tags, at least about 30 × 10 ^Six base sequence tags, at least about 40 × 10 ⁶ base sequence tags, at least about 50 × 10 ⁶ base sequence tags are obtained from mapping reads per sample to the reference genome. In one embodiment, all sequence reads are mapped to the entire region of the reference genome. In one embodiment, tags mapped to all regions in the reference genome, eg, all chromosomes, are counted, and CNVs in mixed DNA samples, ie sequences of interest, eg, excess or deficiency in a chromosome or part of a chromosome Determine the expression. This method does not require discrimination between the two genomes.

ＣＮＶ、例えば異数性がサンプルに存在する又はしないに関する正確な決定に必要な精度は、１回のシークエンシング作業（ラン）でサンプルあたり基準ゲノムにマッピングする配列タグ数の変動（染色体間変動）、及び異なるシークエンシング作業（ラン）で基準ゲノムにマッピングする配列タグ数の変動（シークエンシング間変動）に基づく。例えば、多型（変動）は、ＧＣリッチ又はＧＣプアな基準配列にマッピングするタグに対して特別に宣告することができる。他の多型（変動）は、核酸の抽出及び純化、シークエンシングライブラリの調製、及び異なるシークエンシング基盤に対して異なる手順を使用することから生じ得る。本発明方法は、正規化配列（正規化染色体配列、又は正規化断片配列）に関する知識をベースにした配列ドース（染色体ドース、又は断片ドース）を使用し、本来的に染色体間（ラン内）及びシークエンシング間（ラン間）からステミング処理する見越し変動性、並びに基盤依存変動性の要因とする。染色体ドースは、単独染色体、又は１〜２２番染色体、Ｘ及びＹ染色体から選択した２個又はそれ以上の染色体により構成することができる正規化染色体配列に関する知識に基づく。代案として、正規化染色体配列は、単独染色体断片、又は１個の染色体における、又は２個若しくはそれ以上の染色体における２個以上の断片により構成することができる。断片ドースは、任意の１個の染色体の単独断片、又は１〜２２番染色体、Ｘ及びＹ染色体のうち、任意の２個若しくはそれ以上の染色体ににおける２個以上の断片より構成することができる正規化断片配列に関する知識に基づく。 The accuracy required for an accurate determination regarding whether CNV, eg aneuploidy is present or absent in a sample, is the variation in the number of sequence tags that map to the reference genome per sample (interchromosomal variation) in a single sequencing operation (run) , And variations in the number of sequence tags (inter-sequencing variation) that map to the reference genome in different sequencing operations (runs). For example, polymorphisms (variations) can be specifically declared for tags that map to GC rich or GC poor reference sequences. Other polymorphisms (variations) can arise from nucleic acid extraction and purification, sequencing library preparation, and using different procedures for different sequencing platforms. The method of the present invention uses sequence doses (chromosome doses, or fragment doses) based on knowledge about normalized sequences (normalized chromosomal sequences, or normalized fragment sequences) and is inherently interchromosomal (intra-run) and It is assumed to be a factor of the foreseeable variability of the stemming process between sequencing (between runs) and the base-dependent variability. Chromosome doses are based on knowledge of a normalized chromosome sequence that can be composed of a single chromosome or two or more chromosomes selected from chromosomes 1-22 and X and Y chromosomes. As an alternative, the normalized chromosomal sequence can consist of a single chromosomal fragment, or two or more fragments on one chromosome or on two or more chromosomes. The fragment dose can be composed of a single fragment of any one chromosome, or two or more fragments in any two or more chromosomes among chromosomes 1-22, X and Y. Based on knowledge of normalized fragment sequences.

適格サンプルにおける正規化配列（正規化染色体配列及び正規化断片）の決定
正規化配列は、関心対象、例えば染色体又は染色体断片における任意の１個の配列に対する正常コピー数を有する細胞で構成されていることが既知の検体から採取した適格サンプルのセットからの配列情報を使用して同定する。正規化配列の決定は、図１に示す本発明方法の実施形態におけるステップ１００，１２０，１３０，１４０及び１４５で説明する。適格サンプルから採取した配列情報は、テストサンプルにおける染色体異数性の統計的に有意な同定を決定するのにも使用する（図１のステップ１５５及び実施例参照）。図１は、生物学的サンプルにおける関心対象、例えば染色体又は染色体断片の配列におけるＣＮＶを決定する本発明方法による実施形態１００のフローチャートである。若干の実施形態において、生物学的サンプルは、検体から採取し、異なるゲノムによって関与される核酸混合物を含む。異なるゲノムは２つの個体によってサンプルに関与し、例えば、異なったゲノムは胎児及びこの胎児を孕んだ母体によって関与される。その外に、ゲノムは、同一検体からの、例えばがん患者の血漿サンプルからの異数性がん細胞及び正倍数性細胞によってサンプルに関与する。 Determination of normalized sequences (normalized chromosomal sequences and normalized fragments) in qualifying samples Normalized sequences are composed of cells of interest, eg cells with normal copy number for any one sequence in a chromosome or chromosomal fragment Identification using sequence information from a set of eligible samples taken from known specimens. The determination of the normalization sequence is described in steps 100, 120, 130, 140 and 145 in the embodiment of the method of the present invention shown in FIG. Sequence information taken from eligible samples is also used to determine a statistically significant identification of chromosomal aneuploidy in the test sample (see step 155 and examples in FIG. 1). FIG. 1 is a flowchart of an embodiment 100 according to the method of the invention for determining a CNV in a sequence of interest, eg, a chromosome or chromosome fragment, in a biological sample. In some embodiments, the biological sample comprises a mixture of nucleic acids taken from an analyte and involved by different genomes. Different genomes are involved in the sample by two individuals, for example, different genomes are involved by the fetus and the mother that held the fetus. In addition, the genome is involved in the sample by aneuploid cancer cells and euploid cells from the same specimen, for example from plasma samples of cancer patients.

適格サンプルのセットは、適格正規化配列を同定するため、またテストサンプルにおけるＣＮＶの統計学的に有意な同定を決定するのに使用する変動値を得るために採取する。ステップ１１０において、複数の生物学的適格サンプルを、関心対象の任意な１つの配列に関して正常なコピー数を有する細胞を含んでいることが既知である複数の検体から採取する。一実施形態において、適格サンプルは、細胞遺伝学的手段を用いて染色体の正常コピー数を有することが確認された胎児を孕んだ母体から採取する。生物学的適格サンプルは、生物学的流体、例えば、血漿、又は以下に説明するような任意の適当なサンプルとすることができる。若干の実施形態において、適格サンプルは、核酸分子、例えばｃｆＤＮＡ分子の混合物を含むものとすることができる。若干の実施形態において、適格サンプルは、胎児及び母体のｃｆＤＮＡ分子の混合物を含む母体血漿サンプルとする。正規化染色体及び／又は正規化染色体の断片のための配列情報は、任意な既知のシークエンシング方法を使用して、例えば、胎児及び母体の核酸の少なくとも一部をシークエンシングすることによって得る。好適には、本明細書のいたるところに記載した次世代シークエンシング（ＮＧＳ）方法のうち任意な１つを使用して、単独分子又はクローン増幅した分子としての胎児及び母体の核酸をシークエンシングする。 A set of qualified samples is taken to identify qualified normalization sequences and to obtain a variation value used to determine a statistically significant identification of CNV in the test sample. In step 110, a plurality of biologically qualified samples are taken from a plurality of analytes known to contain cells having normal copy numbers for any one sequence of interest. In one embodiment, the eligible sample is taken from a maternal body bearing a fetus that has been confirmed to have a normal chromosome copy number using cytogenetic means. The biologically qualified sample can be a biological fluid, such as plasma, or any suitable sample as described below. In some embodiments, a qualifying sample can include a mixture of nucleic acid molecules, eg, cfDNA molecules. In some embodiments, the eligible sample is a maternal plasma sample containing a mixture of fetal and maternal cfDNA molecules. Sequence information for normalized chromosomes and / or fragments of normalized chromosomes is obtained using any known sequencing method, for example, by sequencing at least a portion of fetal and maternal nucleic acids. Preferably, any one of the Next Generation Sequencing (NGS) methods described elsewhere herein are used to sequence fetal and maternal nucleic acids as single molecules or clonal amplified molecules. .

ステップ１２０において、適格サンプルに含まれるすべての適格核酸それぞれの少なくとも一部分をシークエンシングして、例えばｈｇ１８の基準ゲノムに整列する、例えば３６ｂｐリードの配列リードを数１００万生ずる。若干の実施形態において、配列リードは、約２０ｂｐ、約２５ｂｐ、約３０ｂｐ、約３５ｂｐ、約４０ｂｐ、約４５ｂｐ、約５０ｂｐ、約５５ｂｐ、約６０ｂｐ、約６５ｂｐ、約７０ｂｐ、約７５ｂｐ、約８０ｂｐ、約８５ｂｐ、約９０ｂｐ、約９５ｂｐ、約１００ｂｐ、約１１０ｂｐ、約１２０ｂｐ、約１３０ｂｐ、約１４０ｂｐ、約１５０ｂｐ、約２００ｂｐ、約２５０ｂｐ、約３００ｂｐ、約３５０ｂｐ、約４００ｂｐ、約４５０ｂｐ、約５００ｂｐを有する。５００ｂｐを超える単独端部リードを可能にし、修復端部リードを生成するとき約１０００ｂｐより多いリードをも可能にする技術的進歩が期待される。一実施形態において、マッピングされた配列タグは３６ｂｐの配列リードを有する。配列リードは基準ゲノムに整列し、また基準ゲノムに一意的にマッピングされたリードは塩基配列タグ（ときに「配列タグ」と略称する）として知られている。一実施形態において、２０〜４０個の塩基対（ｂｐ）リード（reads）を有する、少なくとも約３×１０^６個の適格配列タグ、少なくとも約５×１０^６個の適格配列タグ、少なくとも約８×１０^６個の適格配列タグ、少なくとも約１０×１０^６個の適格配列タグ、少なくとも約１５×１０^６個の適格配列タグ、少なくとも約２０×１０^６個の適格配列タグ、少なくとも約３０×１０^６個の適格配列タグ、少なくとも約４０×１０^６個の適格配列タグ、少なくとも約５０×１０^６個の適格配列タグを、基準ゲノムに対して一意的にマッピングするリードから得る。 In step 120, at least a portion of each of all qualifying nucleic acids contained in the qualifying sample is sequenced, resulting in millions of sequence reads, eg, 36 bp reads that align, eg, with the reference genome of hg18. In some embodiments, the sequence reads are about 20 bp, about 25 bp, about 30 bp, about 35 bp, about 40 bp, about 45 bp, about 50 bp, about 55 bp, about 60 bp, about 65 bp, about 70 bp, about 75 bp, about 80 bp, about 85 bp, about 90 bp, about 95 bp, about 100 bp, about 110 bp, about 120 bp, about 130 bp, about 140 bp, about 150 bp, about 200 bp, about 250 bp, about 300 bp, about 350 bp, about 400 bp, about 450 bp, about 500 bp. Technological advances are expected that allow single end leads greater than 500 bp, and even greater than about 1000 bp when generating repair end leads. In one embodiment, the mapped sequence tag has a 36 bp sequence read. Sequence reads align with the reference genome, and reads that are uniquely mapped to the reference genome are known as base sequence tags (sometimes abbreviated as “sequence tags”). In one embodiment, at least about 3 × 10 ⁶ qualified sequence tags, at least about 5 × 10 ⁶ qualified sequence tags, at least about 8 ×, having 20-40 base pair (bp) reads. 10 ⁶ qualified sequence tags, at least about 10 × 10 ⁶ qualified sequence tags, at least about 15 × 10 ⁶ qualified sequence tags, at least about 20 × 10 ⁶ qualified sequence tags, at least about 30 × 10 ⁶ One eligible sequence tag, at least about 40 × 10 ⁶ eligible sequence tags, at least about 50 × 10 ⁶ eligible sequence tags are obtained from reads that uniquely map to a reference genome.

ステップ１３０で、適格サンプルにおける核酸をシークエンシングすることから得たすべてのタグをカウントして、適格配列タグ密度を決定する。一実施形態において、配列タグ密度は、基準ゲノムにおける関心対象配列にマッピングされる適格配列タグの数として決定する。他の実施形態において、適格配列タグは、マッピングされる関心対象適格配列の長さに対して正規化した、関心対象配列にマッピングした適格配列タグの数として決定する。関心対象配列長さに対するタグ密度の比として決定された配列タグ密度は、本明細書においてタグ密度比と称する。関心対象配列長さに対する正規化は不要であり、ヒト解釈用に簡素化するための数値における桁数を減らすステップとして含めることができる。すべての適格配列タグをマッピングし、また適格サンプルそれぞれにおいてカウントするとき、適格サンプルにおける関心対象配列、例えば臨床関連配列の配列タグ密度は、後に正規化配列を同定する付加的配列のための配列タグ密度として決定される。 At step 130, all tags resulting from sequencing nucleic acids in the qualified sample are counted to determine the qualified sequence tag density. In one embodiment, the sequence tag density is determined as the number of eligible sequence tags that map to the sequence of interest in the reference genome. In other embodiments, the qualified sequence tag is determined as the number of qualified sequence tags mapped to the sequence of interest, normalized to the length of the targeted sequence of interest to be mapped. The sequence tag density determined as the ratio of tag density to sequence length of interest is referred to herein as the tag density ratio. Normalization to the sequence length of interest is not necessary and can be included as a step to reduce the number of digits in the numerical value for simplification for human interpretation. When all qualifying sequence tags are mapped and counted in each qualifying sample, the sequence tag density of the sequence of interest in the qualifying sample, eg, the clinically relevant sequence, is the sequence tag for additional sequences that later identify the normalized sequence Determined as density.

若干の実施形態において、関心対象配列は、完全染色体異数性に関連する染色体、例えば２１番染色体とし、適格正規化配列は、染色体異数性に関連せず、また配列タグ密度における多型（変動）が関心対象配列の配列（すなわち、染色体）、例えば２１番染色体の多型に最も近似する完全染色体である。１〜２２番染色体、Ｘ及びＹ染色体のうち任意の１個又はそれ以上を関心対象配列とすることができ、また１個又はそれ以上の染色体は、適格サンプルにおける１〜２２番染色体、Ｘ及びＹ染色体のうち任意の１個それぞれに対する正規化配列として同定することができる。正規化染色体は個別染色体とするか、又は本明細書のいたるところに記載するように染色体のグループとすることができる。 In some embodiments, the sequence of interest is a chromosome associated with complete chromosomal aneuploidy, such as chromosome 21, and the qualified normalized sequence is not associated with chromosomal aneuploidy and is a polymorphism in sequence tag density ( Variation) is the complete chromosome that most closely approximates the sequence of interest sequence (ie, chromosome), eg, polymorphism of chromosome 21. Any one or more of chromosomes 1-22 and X and Y may be sequences of interest, and one or more chromosomes may be chromosomes 1-22, X and It can be identified as a normalized sequence for any one of the Y chromosomes. Normalized chromosomes can be individual chromosomes or can be a group of chromosomes as described elsewhere herein.

他の実施形態において、関心対象配列は、部分的異数性、例えば、染色体欠失若しくは挿入、又は不均衡染色体転座に関連する染色体断片であり、正規化配列は、部分異数性に関連せず、また配列タグ密度における多型（変動）が部分異数性に関連する染色体断片のの多型に最も近似する染色体断片である。１〜２２番染色体、Ｘ及びＹ染色体のうち任意の１個又はそれ以上における任意な１個又はそれ以上の断片を関心対象配列とすることができる。 In other embodiments, the sequence of interest is a partial aneuploidy, eg, a chromosomal fragment associated with a chromosomal deletion or insertion, or an unbalanced chromosomal translocation, and the normalized sequence is associated with a partial aneuploidy. In addition, the polymorphism (variation) in sequence tag density is the chromosome fragment that most closely resembles the polymorphism of the chromosome fragment associated with partial aneuploidy. Any one or more fragments in any one or more of chromosomes 1-22 and X and Y can be used as the sequence of interest.

全ての実施形態において、適格サンプルで単独配列又は配列グループが、任意な１個又はそれ以上の関心対象配列のための正規化配列として同定されるかどうかによって、適格正規化配列は、適格サンプルにおいて決定された関心対象配列の配列タグ密度に最も近似する配列タグ密度に変動を有する。例えば、適格正規化配列は最も小さい変動性（多型性）を有する配列である、すなわち、正規化配列の変動性は関心対象配列の変動性に最も近似する。 In all embodiments, a qualified normalized sequence is defined in a qualified sample depending on whether a single sequence or sequence group in the qualified sample is identified as a normalized sequence for any one or more sequences of interest. There is a variation in the sequence tag density that most closely approximates the determined sequence tag density of the sequence of interest. For example, a qualified normalized sequence is the sequence with the least variability (polymorphism), ie, the variability of the normalized sequence most closely approximates the variability of the sequence of interest.

若干の実施形態において、正規化配列は、１個又はそれ以上の異変ありサンプルから１個又はそれ以上の適格サンプルを最も区別される配列であり、正規化配列は最も弁別可能性の高い配列であることを意味し、すなわち、正規化配列の弁別可能性は、異変ありテストサンプルにおける関心対象配列に対して最適な弁別を行い、異変ありテストサンプルを他の異変なしサンプルから容易に区別することができる。他の実施形態において、正規化配列は最も小さい変動性及び最も大きい弁別可能性を有する配列とする。弁別可能性のレベルは、以下に実施例で説明するように、適格サンプルの母集団における配列ドース、例えば染色体ドース又は断片ドースと、１個又はそれ以上のテストサンプルにおける染色体ドースとの間における統計学的相違として決定することができる。例えば、弁別可能性は、Ｔ検定値として数値的に表すことができ、このＴ検定値は、適格サンプルの母集団における染色体ドースと、１個又はそれ以上のテストサンプルにおける染色体ドースとの間の統計学的な相違を表す。代案として、弁別可能性は、正規化染色体値（ＮＣＶ：Normalized Chromosome Value）として数値表現することができ、これはＮＣＶの分布が標準的である限り染色体ドースに対するｚスコアである。同様に、弁別可能性は、Ｔ検定値として数値表現することができ、これは適格サンプルの母集団における断片ドースと、１個又はそれ以上のテストサンプルにおける断片ドースとの間の統計学的相違を表す。代案として、断片ドースの弁別可能性は、正規化断片値（ＮＳＶ：Normalized Segment Value）として数値表現することができ、これはＮＳＶの分布が標準的である限り染色体ドースに対するｚスコアである。ｚスコアを決定する際に、適格サンプルのセットにおける染色体ドース又は断片ドースの平均及び標準偏差を使用することができる。代案として、適格サンプル及び異変ありサンプルを含むトレーニングセットにおける染色体ドース又は断片ドースの平均及び標準偏差を使用することができる。他の実施形態において、正規化配列は最小変動性及び最大弁別可能性を有する配列とする。 In some embodiments, the normalization sequence is the sequence that most distinguishes one or more eligible samples from one or more anomalous samples, and the normalization sequence is the most discriminative sequence. Meaning, i.e. the discriminability of the normalized sequence makes it possible to optimally discriminate against the sequence of interest in the anomaly test sample and easily distinguish the anomaly test sample from the other anomaly samples Can do. In other embodiments, the normalization sequence is the sequence with the least variability and the greatest discriminability. The level of discriminability is the statistics between sequence doses in the qualifying sample population, such as chromosomal or fragment doses, and chromosomal doses in one or more test samples, as described in the examples below. It can be determined as a scientific difference. For example, the discriminability can be expressed numerically as a T-test value, which is between the chromosomal dose in the population of eligible samples and the chromosomal dose in one or more test samples. Represents statistical differences. Alternatively, the discriminability can be expressed numerically as a Normalized Chromosome Value (NCV), which is a z-score for chromosomal doses as long as the NCV distribution is standard. Similarly, discriminability can be expressed numerically as a T-test value, which is the statistical difference between a fragment dose in a population of eligible samples and a fragment dose in one or more test samples. Represents. As an alternative, the discriminability of a fragment dose can be expressed numerically as a Normalized Segment Value (NSV), which is a z-score for a chromosomal dose as long as the NSV distribution is standard. In determining the z-score, the mean and standard deviation of chromosomal or fragment doses in the set of eligible samples can be used. Alternatively, the mean and standard deviation of chromosomal or fragment doses in a training set that includes eligible and anomalous samples can be used. In other embodiments, the normalization sequence is an sequence with minimal variability and maximum discriminability.

本発明方法は、本来的に類似特性を有し、またサンプル間及びシークエンシングの作業（ラン）間で類似の多型（変動）を受け易い配列を同定し、これはテストサンプルにおける配列ドースを決定するのに有用である。 The method of the present invention identifies sequences that have inherently similar characteristics and are susceptible to similar polymorphisms (variations) between samples and between sequencing tasks (runs), which can reduce sequence doses in test samples. Useful for making decisions.

適格サンプルにおける配列ドース（すなわち、染色体ドース又は断片ドース）の決定
ステップ１４０において、計算した適格タグ密度に基づいて、関心対象配列用の適格配列ドース、すなわち染色体ドース又は断片ドースを、関心対象配列の配列タグ密度と、後のステップ１４５で正規化配列を同定する付加的配列の適格配列タグ密度との比として決定する。これに続いて同定した正規化配列を使用し、テストサンプルにおける配列ドースを決定する。 Determining Sequence Dose (i.e. Chromosome Dose or Fragment Dose) in a Qualified Sample In step 140, based on the calculated qualifying tag density, a qualified sequence dose for the sequence of interest, i. Determined as the ratio of the sequence tag density to the qualifying sequence tag density of the additional sequence that identifies the normalized sequence in a later step 145 This is followed by using the identified normalized sequence to determine the sequence dose in the test sample.

一実施形態において、適格サンプルにおける配列ドースは、関心対象染色体に関する配列タグ数と、適格サンプルにおける正規化染色体配列に関する配列タグ数との比として計算される染色体ドースである。正規化染色体配列は、単独染色体、染色体グループ、１個の染色体の断片、又は異なる染色体からの断片グループとすることができる。したがって、関心対象染色体に関する染色体ドースは、適格サンプルにおいて、（ｉ）関心対象染色体に関するタグ数と、単独染色体により構成される正規化染色体配列に関するタグ数との比、（ii）関心対象染色体に関するタグ数と、２個又はそれ以上の染色体により構成される正規化染色体配列に関するタグ数との比、（iii）関心対象染色体に関するタグ数と、単独染色体断片により構成される正規化断片配列に関するタグ数との比、（iv）関心対象染色体に関するタグ数と、１個の染色体からの２個又はそれ以上の断片により構成される正規化断片配列に関するタグ数との比、又は（ｖ）関心対象染色体に関するタグ数と、２個又はそれ以上の染色体における２個又はそれ以上の染色体断片により構成される正規化断片配列に関するタグ数との比、として決定する。（ｉ）〜（ｖ）に従って関心対象である２１番染色体に関する染色体ドースを決定する実施例を以下に示す。関心対象染色体、例えば２１番染色体に関する染色体ドースを、２１番染色体の配列タグ密度と、残りのすべての染色体、すなわち１〜２０番染色体、２２番染色体、Ｘ染色体及びＹ染色体のそれぞれに関する配列タグ密度との比として決定する（ｉ）、関心対象染色体、例えば２１番染色体に関する染色体ドースを、２１番染色体の配列タグ密度と、残りの染色体における２個又はそれ以上のあり得るすべての組合せに関する配列タグ密度との比として決定する（ii）、関心対象染色体、例えば２１番染色体に関する染色体ドースを、２１番染色体の配列タグ密度と、他の染色体、例えば９番染色体の断片に関する配列タグ密度との比として決定する（iii）、関心対象染色体、例えば２１番染色体に関する染色体ドースを、２１番染色体の配列タグ密度と、他の１個の染色体の２個の断片、例えば９番染色体の２個の断片に関する配列タグ密度との比として決定する（iv）、及び関心対象染色体、例えば２１番染色体に関する染色体ドースを、２１番染色体の配列タグ密度と、２個の異なる染色体の２個の断片に関する、例えば９番染色体の断片及び１４番染色体の断片に関する配列タグ密度との比として決定する。 In one embodiment, the sequence dose in the eligible sample is a chromosome dose calculated as the ratio of the number of sequence tags for the chromosome of interest to the number of sequence tags for the normalized chromosomal sequence in the eligible sample. The normalized chromosomal sequence can be a single chromosome, a chromosomal group, a fragment of a single chromosome, or a group of fragments from different chromosomes. Therefore, the chromosomal dose related to the chromosome of interest is the ratio of the number of tags related to the chromosome of interest and the number of tags related to the normalized chromosome sequence composed of a single chromosome in the eligible sample, and (ii) the tag related to the chromosome of interest. The ratio of the number of tags to a normalized chromosome sequence composed of 2 or more chromosomes, (iii) the number of tags related to the chromosome of interest and the number of tags related to a normalized fragment sequence composed of a single chromosome fragment (Iv) the ratio of the number of tags for the chromosome of interest to the number of tags for a normalized fragment sequence composed of two or more fragments from one chromosome, or (v) the chromosome of interest A tag relating to a normalized fragment sequence composed of two or more chromosome fragments in two or more chromosomes As a ratio to the number. An example of determining the chromosomal dose for the chromosome 21 of interest according to (i) to (v) is shown below. Chromosome dose for the chromosome of interest, eg chromosome 21, the sequence tag density for chromosome 21 and the sequence tag density for each of the remaining chromosomes, ie, chromosomes 1-20, 22, 22 and X. (I) the chromosome dose for the chromosome of interest, eg chromosome 21, the sequence tag density for chromosome 21 and the sequence tag for all 2 or more possible combinations in the remaining chromosomes (Ii) determine the chromosome dose for the chromosome of interest, for example chromosome 21, as the ratio to the density, the ratio of the sequence tag density for chromosome 21 to the sequence tag density for fragments of other chromosomes, for example chromosome 9 (Iii) determine the chromosomal dose for the chromosome of interest, eg chromosome 21, Determined as the ratio of the row tag density to the sequence tag density for two fragments of another chromosome, for example two fragments of chromosome 9, (iv), and for the chromosome of interest, for example chromosome 21 Chromosome dose is determined as the ratio of the sequence tag density of chromosome 21 to the sequence tag density for two fragments of two different chromosomes, for example, the fragment of chromosome 9 and the fragment of chromosome 14.

他の実施形態において、適格サンプルにおける配列ドースは、関心対象断片に関する配列タグ数と適格サンプルにおける正規化断片配列に関する配列タグ数との比として計算される断片ドースとする。正規化断片配列は、１個の染色体断片、又は異なる染色体からの断片グループとすることができる。したがって、関心対象断片に関する断片ドースは適格サンプルにおいて、（ｉ）関心対象断片に関するタグ数と、染色体の単独断片により構成される正規化断片配列に関するタグ数との比、（ii）関心対象断片に関するタグ数と、１個の染色体における２個又はそれ以上の断片により構成される正規化断片配列に関するタグ数との比、又は（iii）関心対象断片に関するタグ数と、２個又はそれ以上の異なる染色体における２個又はそれ以上の断片により構成される正規化断片配列に関するタグ数との比、として決定する。 In other embodiments, the sequence dose in the qualified sample is the fragment dose calculated as the ratio of the number of sequence tags for the fragment of interest to the number of sequence tags for the normalized fragment sequence in the qualified sample. The normalized fragment sequence can be a single chromosomal fragment or a group of fragments from different chromosomes. Therefore, the fragment dose for the fragment of interest is the ratio of the number of tags for the fragment of interest to the number of tags for the normalized fragment sequence composed of a single fragment of the chromosome in the eligible sample, and (ii) for the fragment of interest. The ratio of the number of tags to the number of tags for a normalized fragment sequence composed of two or more fragments on one chromosome, or (iii) the number of tags for a fragment of interest is two or more different It is determined as the ratio to the tag number for a normalized fragment sequence composed of two or more fragments in the chromosome.

関心対象の１個又はそれ以上の染色体に関する染色体ドースはすべての適格サンプルにおいて決定し、また正規化染色体配列をステップ１４５で同定する。同様に、１個又はそれ以上の関心対象断片に関する断片ドースをすべての適格サンプルで決定し、また正規化断片配列をステップ１４５で同定する。 Chromosome doses for one or more chromosomes of interest are determined in all eligible samples, and normalized chromosomal sequences are identified in step 145. Similarly, fragment doses for one or more fragments of interest are determined in all eligible samples, and normalized fragment sequences are identified in step 145.

適格配列ドースからの正規化配列の同定
ステップ１４５において、正規化配列を関心対象配列用に計算した配列ドースに基づく配列として同定し、すなわち、すべての適格サンプルにわたり関心対象配列に関する配列ドースにおける変動性が最小となる。本発明方法は、本来的に類似特性を有し、またサンプル間及びシークエンシングの作業（ラン）間で類似の多型（変動）を受け易い配列を同定し、これはテストサンプルにおける配列ドースを決定するのに有用である Identifying normalized sequences from eligible sequence doses In step 145, the normalized sequence is identified as a sequence based on the sequence dose calculated for the sequence of interest, i.e. the variability in the sequence dose with respect to the sequence of interest across all eligible samples. Is minimized. The method of the present invention identifies sequences that have inherently similar characteristics and are susceptible to similar polymorphisms (variations) between samples and between sequencing tasks (runs), which can reduce sequence doses in test samples. Useful to decide

１個又はそれ以上の関心対象配列に関する正規化配列は、適格サンプルのセットで同定することができ、またその後適格サンプルで同定される配列を使用して、各テストサンプルで１個又はそれ以上の関心対象配列に関する配列ドースを計算し（ステップ１５０）、各テストサンプルにおける異数性の有無を決定する。関心対象の染色体又は断片のために同定した正規化配列は、異なるシークエンシング基盤を使用するとき、及び／又はシークエンシングすべき核酸の純化に差があるとき、及び／又はシークエンシングライブラリの調製に差があるとき、異なることがあり得る。本発明方法による正規化配列を使用することによって、使用するサンプル調製及び／又はシークエンシング基盤に無関係に、染色体又は断片におけるコピー数多型に関する特別なまた感度のよい評価基準をもたらす。 Normalized sequences for one or more sequences of interest can be identified in a set of eligible samples, and then one or more in each test sample using the sequences identified in the eligible samples. The sequence dose for the sequence of interest is calculated (step 150) and the presence or absence of aneuploidy in each test sample is determined. Normalized sequences identified for the chromosome or fragment of interest may be used when using different sequencing platforms and / or when there is a difference in the purification of the nucleic acids to be sequenced and / or in the preparation of sequencing libraries. When there is a difference, it can be different. The use of normalized sequences according to the method of the present invention provides a special and sensitive criterion for copy number variation in chromosomes or fragments regardless of the sample preparation and / or sequencing platform used.

若干の実施形態において、１個より多い正規化配列を同定する、すなわち、異なった正規化配列を１個の関心対象配列に関して決定することができ、また複数の配列ドースを１個の関心対象配列に関して決定することができる。例えば、関心対象である２１番染色体に関する染色体ドースにおける多型、例えば、変動係数は、１４番染色体の配列タグ密度を使用するとき最小となる。しかし、２個、３個、４個、５個、６個、７個、８個又はそれ以上の正規化配列は、テストサンプルにおける関心対象配列に関する配列ドース決定に使用して同定することができる。例えば、任意な１つのテストサンプルにおける２１番染色体に関する第２ドースを、正規化染色体配列として、７番染色体、９番染色体、１１番染色体又は１２番染色体を使用して決定することができ、なぜならこれら染色体はすべて１４番染色体のＣＶに近似するＣＶを有するからである（実施例２、表２参照）。好適には、単独染色体を、関心対象染色体に関する正規化染色体配列として選択するとき、正規化染色体配列は、テストされるすべてのサンプル、例えば適格サンプルにわたり最小の変動性を有する関心対象染色体に関する染色体ドースになる染色体とする。 In some embodiments, more than one normalized sequence can be identified, i.e., different normalized sequences can be determined for one sequence of interest, and multiple sequence doses can be determined for one sequence of interest. Can be decided on. For example, polymorphisms in the chromosomal dose for chromosome 21 of interest, eg, coefficient of variation, are minimal when using the sequence tag density of chromosome 14. However, two, three, four, five, six, seven, eight or more normalized sequences can be identified using the sequence dose determination for the sequence of interest in the test sample. . For example, the second dose for chromosome 21 in any one test sample can be determined using chromosome 7, 9, 11, or 12 as the normalized chromosome sequence, because This is because all these chromosomes have CVs similar to those of chromosome 14 (see Example 2, Table 2). Preferably, when a single chromosome is selected as the normalized chromosomal sequence for the chromosome of interest, the normalized chromosomal sequence is the chromosomal dose for the chromosome of interest that has the least variability across all samples being tested, eg, eligible samples. It becomes the chromosome which becomes.

染色体の正規化配列としての正規化染色体配列
他の実施形態において、正規化染色体配列は単独配列とするか、又は配列グループとすることができる。例えば、若干の実施形態において、正規化配列は、配列グループ、例えば染色体グループとし、これは１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１個又はそれ以上に関する正規化配列として同定する。関心対象染色体に関する正規化配列、すなわち、正規化染色体配列を含む染色体のグループは、２個、３個、４個、５個、６個、７個、８個、９個、１０個、１１個、１２個、１３個、１４個、１５個、１６個、１７個、１８個、１９個、２０個、２１個、２２個の染色体よりなるグループとすることができ、またＸ染色体及びＹ染色体のうち一方又は双方を含む又は含まないものとすることができる。正規化染色体配列として同定される染色体のグループは、テストされるすべてのサンプル、例えば適格サンプルにわたり最小の変動性を有する関心対象染色体に関する染色体ドースになる染色体グループとする。好適には、個別染色体及び染色体グループを、正規化染色体配列として選択される関心対象配列の挙動に最も近似する能力に関して、一緒にテストする。 Normalized Chromosomal Sequence as a Chromosomal Normalized Sequence In other embodiments, the normalized chromosomal sequence can be a single sequence or a sequence group. For example, in some embodiments, the normalized sequence is a sequence group, eg, a chromosomal group, which is identified as a normalized sequence for any one or more of chromosomes 1-22, X and Y. . Normalized sequence for the chromosome of interest, ie, the group of chromosomes containing the normalized chromosomal sequence is 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 , 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, and 22 chromosomes, and the X and Y chromosomes One or both of them may or may not be included. The group of chromosomes identified as a normalized chromosomal sequence is the chromosome group that becomes the chromosomal dose for the chromosome of interest with minimal variability across all samples being tested, eg, eligible samples. Preferably, individual chromosomes and chromosome groups are tested together for the ability to best approximate the behavior of the sequence of interest selected as the normalized chromosomal sequence.

一実施形態において、２１番染色体の正規化配列は、９番染色体、１番染色体、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、８番染色体、１０番染色体、１１番染色体、１２番染色体、１３番染色体、１４番染色体、１５番染色体、１６番染色体、及び１７番染色体から選択する。代案として、２１番染色体の正規化配列を、９番染色体、１番染色体、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、８番染色体、１０番染色体、１１番染色体、１２番染色体、１３番染色体、１４番染色体、１５番染色体、１６番染色体、及び１７番染色体から選択した染色体グループとする。他の実施形態において、染色体グループは、９番染色体、１番染色体、２番染色体、１１番染色体、１２番染色体、及び１４番染色体から選択したグループとする。 In one embodiment, the normalized sequence of chromosome 21 is chromosome 9, chromosome 1, chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, Select from chromosome 10, chromosome 11, chromosome 12, chromosome 13, chromosome 14, chromosome 15, chromosome 16, and chromosome 17. As an alternative, the normalized sequence of chromosome 21 is changed to chromosome 9, chromosome 1, chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, chromosome 10 A chromosome group selected from chromosome, chromosome 11, chromosome 12, chromosome 13, chromosome 14, chromosome 15, chromosome 16, and chromosome 17. In another embodiment, the chromosome group is a group selected from chromosome 9, chromosome 1, chromosome 2, chromosome 11, chromosome 12, and chromosome 14.

若干の実施形態において、本発明方法は、さらに、各染色体を個別に、また残りの染色体とのあり得るすべての組合せを使用してすべての染色体ドースを系統的に計算することによって決定した正規化配列を使用することにより改善する（実施例７参照）。例えば、系統的に決定した正規化染色体は、各関心対象染色体に関して、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意な１個、及び１〜２２番染色体、Ｘ染色体及びＹ染色体のうち２個又はそれ以上の組合せを使用してすべてのあり得る染色体ドースを系統的に計算することによって決定し、どの単独染色体又はどの染色体グループを正規化染色体とするかを決定することができ、これにより適格サンプルのセットにわたる関心対象染色体に関する染色体ドースの変動性が最小となる（実施例７参照）。したがって、一実施形態において、２１番染色体に関する系統的に計算した正規化染色体配列は、４番染色体、１４番染色体、１６番染色体、２０番染色体、及び２２番染色体よりなる染色体グループとする。単独染色体又は染色体グループは、ゲノムにおけるすべての染色体に関して決定することができる。 In some embodiments, the method further comprises normalization determined by systematically calculating all chromosomal doses using each chromosome individually and all possible combinations with the remaining chromosomes. This is improved by using a sequence (see Example 7). For example, the normalized chromosomes systematically determined are, for each chromosome of interest, any one of chromosomes 1-22 and X and Y, and chromosomes 1-22, X and Y. It can be determined by systematically calculating all possible chromosomal doses using two or more combinations to determine which single chromosome or which chromosomal group is a normalized chromosome, Minimizes chromosomal dose variability with respect to the chromosome of interest across the set of eligible samples (see Example 7). Therefore, in one embodiment, the normalized chromosome sequence calculated systematically for chromosome 21 is a chromosome group consisting of chromosome 4, chromosome 14, chromosome 16, chromosome 20, and chromosome 22. A single chromosome or group of chromosomes can be determined for all chromosomes in the genome.

一実施形態において、１８番染色体の正規化配列は、８番染色体、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、９番染色体、１０番染色体、１１番染色体、１２番染色体、１３番染色体、１４番染色体から選択する。好適には、１８番染色体用の正規化配列を、８番染色体、２番染色体、３番染色体、５番染色体、６番染色体、１２番染色体、及び１４番染色体から選択する。代案として、１８番染色体の正規化配列を、８番染色体、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、９番染色体、１０番染色体、１１番染色体、１２番染色体、１３番染色体、及び１４番染色体から選択した染色体グループとする。好適には、染色体グループは、８番染色体、２番染色体、３番染色体、５番染色体、６番染色体、１２番染色体、及び１４番染色体から選択したグループとする。 In one embodiment, the normalized sequence of chromosome 18 is chromosome 8, chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 9, chromosome 10, Select from chromosome 11, chromosome 12, chromosome 13, and chromosome 14. Preferably, the normalization sequence for chromosome 18 is selected from chromosome 8, chromosome 2, chromosome 3, chromosome 5, chromosome 6, chromosome 12, and chromosome 14. As an alternative, the normalized sequence of chromosome 18 is changed from chromosome 8, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 9, chromosome 10, chromosome 11, A chromosome group selected from chromosome, chromosome 12, chromosome 13, and chromosome 14. Preferably, the chromosome group is a group selected from chromosome 8, chromosome 2, chromosome 3, chromosome 5, chromosome 6, chromosome 12, and chromosome 14.

他の実施形態において、１８番染色体の正規化配列は、（本明細書のあらゆるところで説明したように）あり得る各正規化染色体を個別に、また正規化染色体のあり得るすべての組合せを使用してあり得るすべての染色体ドースを系統的に計算することによって、決定する。したがって、一実施形態において、１８番染色体に関する正規化配列は、２番染色体、３番染色体、５番染色体、及び７番染色体よりなる染色体グループで構成される正規化染色体とする。 In other embodiments, the normalized sequence of chromosome 18 uses each possible normalized chromosome individually (as described elsewhere herein) and all possible combinations of normalized chromosomes. It is determined by systematically calculating all possible chromosomal doses. Therefore, in one embodiment, the normalized sequence for chromosome 18 is a normalized chromosome composed of a chromosome group consisting of chromosome 2, chromosome 3, chromosome 5, and chromosome 7.

一実施形態において、Ｘ染色体の正規化配列は、１番染色体、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、８番染色体、９番染色体、１０番染色体、１１番染色体、１２番染色体、１３番染色体、１４番染色体、１５番染色体、及び１６番染色体から選択する。好適には、Ｘ染色体の正規化配列を、２番染色体、３番染色体、５番染色体、６番染色体、及び８番染色体から選択する。代案として、Ｘ染色体の正規化配列を、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、８番染色体、９番染色体、１０番染色体、１１番染色体、１２番染色体、１３番染色体、１４番染色体、１５番染色体、及び１６番染色体から選択した染色体グループとする。好適には、染色体グループは、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、及び８番染色体から選択したグループとする。 In one embodiment, the normalized sequence of chromosome X is chromosome 1, chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, chromosome 9, 10 Chromosome, chromosome 11, chromosome 12, chromosome 13, chromosome 14, chromosome 15 and chromosome 16 are selected. Preferably, the X chromosome normalization sequence is selected from chromosome 2, chromosome 3, chromosome 5, chromosome 6 and chromosome 8. As an alternative, the normalized sequence of chromosome X is the chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, chromosome 9, chromosome 10, chromosome 11. Chromosome group selected from chromosome 12, chromosome 13, chromosome 14, chromosome 15, and chromosome 16. Preferably, the chromosome group is a group selected from chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, and chromosome 8.

他の実施形態において、Ｘ染色体の正規化配列は、（本明細書のいたるところに記載したように）正規化染色体のうちあり得る正規化染色体及びあり得る正規化染色体の組合せのそれぞれを使用して、すべてのあり得る染色体ドースを系統的に計算することによって決定する。したがって、一実施形態において、Ｘ染色体の正規化配列は４番染色体及び８番染色体のグループよりなる正規化染色体とする。 In other embodiments, the normalization sequence of the X chromosome uses each of the possible normalization chromosomes and possible normalization chromosome combinations of normalization chromosomes (as described elsewhere herein). Thus, all possible chromosomal doses are determined by systematic calculation. Therefore, in one embodiment, the normalized sequence of the X chromosome is a normalized chromosome consisting of a group of chromosomes 4 and 8.

一実施形態において、１３番染色体の正規化配列は、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、８番染色体、９番染色体、１０番染色体、１１番染色体、１２番染色体、１４番染色体、１８番染色体、及び２１番染色体から選択する。好適には、１３番染色体用の正規化配列を、２番染色体、３番染色体、５番染色体、６番染色体、及び８番染色体から選択した染色体とする。他の実施形態において、１３番染色体の正規化配列を、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、７番染色体、８番染色体、９番染色体、１０番染色体、１１番染色体、１２番染色体、１４番染色体、１８番染色体、及び２１番染色体から選択した染色体グループとする。好適には、染色体グループは、２番染色体、３番染色体、４番染色体、５番染色体、６番染色体、及び８番染色体から選択したグループとする。 In one embodiment, the normalized sequence of chromosome 13 is chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, chromosome 9, chromosome 10, Select from chromosome 11, chromosome 12, chromosome 14, chromosome 18, and chromosome 21. Preferably, the normalized sequence for chromosome 13 is a chromosome selected from chromosome 2, chromosome 3, chromosome 5, chromosome 6, and chromosome 8. In another embodiment, the normalized sequence of chromosome 13 is the chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, chromosome 7, chromosome 8, chromosome 9, chromosome 10. Chromosome group selected from the 11th chromosome, the 12th chromosome, the 14th chromosome, the 18th chromosome, and the 21st chromosome. Preferably, the chromosome group is a group selected from chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, and chromosome 8.

他の実施形態において、１３番染色体の正規化配列は、（本明細書のいたるところに記載したように）正規化染色体のうちあり得る正規化染色体及びあり得る正規化染色体の組合せのそれぞれを使用して、すべてのあり得る染色体ドースを系統的に計算することによって決定する。したがって、一実施形態において、１３番染色体の正規化配列は４番染色体及び５番染色体のグループを有する正規化染色体とする。他の実施形態において、１３番染色体の正規化配列は４番染色体及び５番染色体のグループよりなる正規化染色体とする。 In other embodiments, the normalization sequence of chromosome 13 uses each of the possible normalization chromosomes and the possible normalization chromosome combinations of normalization chromosomes (as described elsewhere herein). Thus, all possible chromosomal doses are determined by systematic calculation. Accordingly, in one embodiment, the normalized sequence of chromosome 13 is a normalized chromosome having a group of chromosomes 4 and 5. In another embodiment, the normalized sequence of chromosome 13 is a normalized chromosome consisting of a group of chromosomes 4 and 5.

Ｙ染色体の染色体ドースにおける多型は３０個より多く、これらの正規化染色体をそれぞれ独立的に使用してＹ染色体ドースを決定する。したがって、１〜２２番染色体及びＸ染色体から選択した任意の１個の染色体又は２個若しくはそれ以上の染色体のグループをＹ染色体の正規化配列として使用する。一実施形態において、少なくとも１個の正規化染色体は、１〜２２番染色体及びＸ染色体よりなる染色体グループとする。他の実施形態において、染色体グループは、２番染色体、３番染色体、４番染色体、５番染色体、及び６番染色体よりなる染色体グループとする。 There are more than 30 polymorphisms in the chromosomal dose of the Y chromosome, and each of these normalized chromosomes is used independently to determine the Y chromosome dose. Therefore, any one chromosome or a group of two or more chromosomes selected from chromosomes 1-22 and X is used as the normalization sequence for the Y chromosome. In one embodiment, the at least one normalized chromosome is a chromosome group consisting of chromosomes 1 to 22 and the X chromosome. In another embodiment, the chromosome group is a chromosome group consisting of chromosome 2, chromosome 3, chromosome 4, chromosome 5, and chromosome 6.

他の実施形態において、Ｙ染色体の正規化配列は、（本明細書のいたるところに記載したように）正規化染色体のうちあり得る正規化染色体及びあり得る正規化染色体の組合せのそれぞれを使用して、すべてのあり得る染色体ドースを系統的に計算することによって決定する。したがって、一実施形態において、Ｙ染色体の正規化配列は４番染色体及び６番染色体よりなる染色体グループを有する正規化染色体とする。他の実施形態において、Ｙ染色体の正規化配列は４番染色体及び６番染色体よりなる染色体グループで構成した正規化染色体とする。 In other embodiments, the normalized sequence of the Y chromosome uses each of the possible normalized chromosomes and possible normalized chromosome combinations of the normalized chromosomes (as described elsewhere herein). Thus, all possible chromosomal doses are determined by systematic calculation. Therefore, in one embodiment, the normalized sequence of the Y chromosome is a normalized chromosome having a chromosome group consisting of chromosomes 4 and 6. In another embodiment, the normalized sequence of the Y chromosome is a normalized chromosome composed of a chromosome group consisting of chromosomes 4 and 6.

関心対象の異なる染色体又は関心対象の異なる断片におけるドースを計算するのに使用される正規化配列は同一の正規化配列とする、又は関心対象の異なる染色体又は関心対象の異なる断片に対してそれぞれ異なる正規化配列とすることができる。例えば、関心対象染色体Ａの正規化配列（単独若しくはグループ）は同一とするか、又は例えば関心対象染色体Ｂの正規化配列（単独若しくはグループ）とは異なるものとすることができる。 The normalization sequence used to calculate the dose on different chromosomes of interest or different fragments of interest should be the same normalization sequence or different for different chromosomes of interest or different fragments of interest It can be a normalized array. For example, the normalized sequence (single or group) of chromosome C of interest can be the same or different from the normalized sequence (single or group) of chromosome C of interest, for example.

完全染色体の正規化配列は完全染色体若しくは完全染色体グループとするか、又は染色体断片若しくは１個以上の染色体断片のグループとすることができる。 The normal sequence of a complete chromosome can be a complete chromosome or a complete chromosome group, or it can be a chromosome fragment or a group of one or more chromosome fragments.

染色体の正規化配列としての正規化断片配列
他の実施形態において、染色体の正規化配列は正規化断片配列とすることができる。正規化断片配列は単独断片とする若しくは１個の染色体の断片グループとすることができ、又は２個以上の異なる染色体からの断片とすることができる。正規化断片配列はゲノムにおける断片配列のすべての組合せを系統的に計算することによって決定することができる。例えば、２１番染色体の正規化断片配列は、２番染色体のサイズより大きい又は小さい単独断片とすることができ、２番染色体は約４７Ｍｂｐ（million base pairs）であり、約１４０Ｍｂｐの９番染色体とはサイズが異なる。代案として、２１番染色体のための正規化配列は、１番染色体からの配列と、１２番染色体からの配列との組合せとすることができる。 Normalized Fragment Sequence as a Chromosomal Normalized Sequence In other embodiments, the chromosomal normalized sequence can be a normalized fragment sequence. The normalized fragment sequence can be a single fragment or a group of fragments of one chromosome or can be fragments from two or more different chromosomes. Normalized fragment sequences can be determined by systematically calculating all combinations of fragment sequences in the genome. For example, the normalized fragment sequence of chromosome 21 can be a single fragment larger or smaller than the size of chromosome 2, chromosome 2 is about 47 Mbp (million base pairs), and about 140 Mbp of chromosome 9 and Are different in size. As an alternative, the normalized sequence for chromosome 21 can be a combination of the sequence from chromosome 1 and the sequence from chromosome 12.

一実施形態において、２１番染色体の正規化配列は、１〜２０番染色体、２２番染色体、Ｘ染色体及びＹ染色体のうちの１断片、又は２個以上の断片グループの正規化断片配列とする。他の実施形態において、１８番染色体の正規化配列は、１〜１７番染色体、１９〜２２番染色体、Ｘ染色体及びＹ染色体のうちの１断片、又は断片グループとする。他の実施形態において、１３番染色体の正規化配列は、１〜１２番染色体、１４〜２２番染色体、Ｘ染色体及びＹ染色体のうちの１断片、又は断片グループとする。他の実施形態において、Ｘ染色体の正規化配列は、１〜２２番染色体、及びＹ染色体のうちの１断片、又は断片グループとする。他の実施形態において、Ｙ染色体の正規化配列は、１〜２２番染色体、及びＸ染色体のうちの１断片、又は断片グループとする。単独断片又は断片グループの正規化断片配列はゲノムにおけるすべての染色体に対して決定することができる。正規化断片配列の２個以上の断片は１個の染色体からの断片とするか、又は２個以上の断片は２個以上の染色体の断片とすることができる。正規化染色体配列につき説明したように、正規化断片配列は２個以上の異なった染色体に対して同一とすることができる。 In one embodiment, the normalized sequence of chromosome 21 is a fragment of chromosomes 1 to 20, chromosome 22, X and Y, or a normalized fragment sequence of two or more fragment groups. In another embodiment, the normalization sequence of chromosome 18 is a fragment or group of chromosomes 1 to 17, chromosomes 19 to 22, X and Y. In another embodiment, the normalized sequence of chromosome 13 is a fragment or group of chromosomes 1-12, 14-22, X and Y. In another embodiment, the normalized sequence of the X chromosome is a fragment or group of chromosomes 1 to 22 and the Y chromosome. In another embodiment, the normalized sequence of the Y chromosome is a fragment or group of chromosomes 1 to 22 and the X chromosome. A normalized fragment sequence of a single fragment or group of fragments can be determined for every chromosome in the genome. Two or more fragments of the normalized fragment sequence can be fragments from one chromosome, or two or more fragments can be fragments of two or more chromosomes. As explained for the normalized chromosomal sequence, the normalized fragment sequence can be the same for two or more different chromosomes.

染色体断片のための正規化配列としての正規化断片配列
関心対象配列のＣＮＶ有無は、関心対象配列が染色体断片であるとき決定することができる。染色体断片のコピー数多型は、部分的染色体異数性の有無を決定することができる。以下に説明するのは、種々の胎児異常及び病状に関連する部分的染色体異数性の例である。染色体断片は任意の長さとすることができる。例えば、その長さはキロベースから数１００メガベースの範囲にわたることがあり得る。ヒトゲノムは３０億個のＤＮＡ塩基にわたって存在し、このＤＮＡ塩基を、数１０個、数１０００個、数１０万、数１００万の異なるサイズの断片に分割し、これら異なるサイズの断片を本発明方法によって決定することができる。染色体断片の正規化配列は、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意な１個からの単独断片、又は１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意な１個以上から断片グループとすることができる正規化断片配列である。 Normalized fragment sequence as a normalized sequence for a chromosomal fragment The presence or absence of CNV of a sequence of interest can be determined when the sequence of interest is a chromosomal fragment. Chromosomal fragment copy number polymorphism can determine the presence or absence of partial chromosomal aneuploidy. Described below are examples of partial chromosomal aneuploidy associated with various fetal abnormalities and medical conditions. Chromosome fragments can be of any length. For example, the length can range from kilobases to hundreds of megabases. The human genome exists over 3 billion DNA bases, and the DNA bases are divided into tens, thousands, hundreds of thousands, and millions of fragments of different sizes, and these fragments of different sizes are processed by the method of the present invention Can be determined by. The normalized sequence of the chromosomal fragment is a single fragment from any one of chromosomes 1-22 and X and Y, or any one or more of chromosomes 1-22, X and Y. A normalized fragment sequence that can be a fragment group.

関心対象断片の正規化配列は、染色体にわたり変動性を有する、また関心対象断片の配列に近接するサンプルにわたり変動性を有する配列である。正規化配列の決定は、正規化配列が１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意な１個又はそれ以上の断片グループであるとき、関心対象染色体の正規化配列を決定することに関して説明したとおりに実行することができる。１個の断片又は断片グループの正規化断片配列は、１個の断片及び２個以上の断片におけるすべてのあり得る組合せを使用して、適格サンプル、すなわち、関心対象断片が2倍体であると分かっているサンプルのセットにおける各サンプルの関心対象断片の正規化配列として断片ドースを計算することによって同定することができ、また正規化配列は、正規化染色体配列につき上述したように、すべての適格サンプルにわたる関心対象断片の変動性が最低の断片ドースを提供するように決定する。 A normalized sequence of a fragment of interest is a sequence that has variability across the chromosome and variability across samples that are close to the sequence of the fragment of interest. Normalized sequence determination relates to determining the normalized sequence of the chromosome of interest when the normalized sequence is any one or more fragment groups of chromosomes 1-22, X and Y. It can be executed as described. A normalized fragment sequence of a fragment or fragment group uses all possible combinations in one fragment and two or more fragments to determine that the eligible sample, ie the fragment of interest, is diploid. Can be identified by calculating the fragment dose as the normalized sequence of the fragment of interest in each sample in the set of known samples, and the normalized sequence is all qualified as described above for the normalized chromosomal sequence. Determine that the fragment variability of interest across the sample provides the lowest fragment dose.

例えば、１Ｍｂ（megabase）の関心対象断片に関して、約３Ｇｂのヒトゲノムにおける残りの３００万個（１ｍｇの関心対象断片を差し引いた）の断片を個別に、又は互いに組合せて使用し、適格サンプルのセットにおける関心対象断片の断片ドースを計算し、どの１個の断片又はどの断片グループが適格サンプル及びテストサンプルの正規化断片配列として作用するかを決定することができる。関心対象断片は約１０００個の塩基から数１０メガベースもの塩基にわたり変化し得る。正規化断片配列は、関心対象配列と同一サイズの１個又はそれ以上の断片により構成することができる。他の実施形態において、正規化断片配列は、関心対象配列の断片とは異なった、及び／又は互いに異なった断片により構成することができる。例えば、１０，０００塩基長さを有する配列の正規化断片配列は２０，０００塩基長さとすることができ、異なる長さ、例えば、７，０００＋８，０００＋５，０００塩基の配列組合せを有するものとすることができる。正規化染色体配列につき本明細書のいたるところに記載したように、正規化断片配列は、正規化断片におけるあり得る正規化染色体断片それぞれを個別に、またあり得るすべての組合せを使用して（正規化染色体配列につき本明細書のいたるところに記載したように）、すべてのあり得る染色体ドース及び／又は断片ドースを系統的に計算することによって、決定することができる。単独断片又は断片グループは、ゲノムにおけるすべての断片及び／又は染色体に対して決定することができる。 For example, for a fragment of interest of 1 Mb (megabase), the remaining 3 million fragments (minus 1 mg of the fragment of interest) in the approximately 3 Gb human genome can be used individually or in combination with each other in a set of eligible samples The fragment dose of the fragment of interest can be calculated to determine which single fragment or group of fragments will serve as the normalized fragment sequence for eligible and test samples. Fragments of interest can vary from about 1000 bases to as much as tens of megabases. A normalized fragment sequence can be composed of one or more fragments of the same size as the sequence of interest. In other embodiments, the normalized fragment sequence may be composed of fragments that are different from and / or different from the fragments of the sequence of interest. For example, a normalized fragment sequence of a sequence having a length of 10,000 bases can be 20,000 bases in length, and has a sequence combination of different lengths, eg, 7,000 + 8,000 + 5,000 bases. be able to. As described elsewhere herein for normalized chromosomal sequences, normalized fragment sequences are used to identify each possible normalized chromosomal fragment in the normalized fragment individually and in all possible combinations (normalized). Can be determined by systematically calculating all possible chromosomal and / or fragment doses (as described elsewhere herein for chromosomal sequences). A single fragment or fragment group can be determined for every fragment and / or chromosome in the genome.

関心対象の異なる染色体断片におけるドースを計算するのに使用する正規化配列は同一とするか、又は関心対象の異なる染色体断片に異なった正規化配列とすることができる。例えば、関心対象染色体断片Ａの正規化断片（単独若しくはグループ）の正規化配列は同一とするか、又は例えば関心対象染色体断片Ｂの正規化断片（単独若しくはグループ）とは異なる正規化配列とすることができる。 The normalization sequences used to calculate the dose in different chromosomal fragments of interest can be the same or can be different normalization sequences for different chromosomal fragments of interest. For example, the normalization sequence of the normalization fragment (single or group) of the chromosome fragment A of interest is the same, or the normalization sequence is different from the normalization fragment (single or group) of the chromosome fragment B of interest, for example. be able to.

テストサンプルにおける異数性決定
適格サンプルにおける正規化配列の同定に基づいて、１個又はそれ以上の関心対象配列に違いがあるゲノムに由来する核酸混合物を有するテストサンプルにおける関心対象配列に対して配列ドースを決定する。 Aneuploidy determination in a test sample Based on the identification of normalized sequences in a qualified sample, sequence against a sequence of interest in a test sample having a nucleic acid mixture derived from a genome that differs in one or more sequences of interest Determine the dose.

ステップ１１５において、テストサンプルを関心対象配列に臨床関連ＣＮＶがあることが疑われる又は既知の検体から採取する。このテストサンプルは、生体液、例えば、血漿、又は以下に説明するような任意の適当なサンプルとすることができる。若干の実施形態において、テストサンプルは、核酸分子、例えばｃｆＤＮＡ分子の混合物を含むものとする。若干の実施形態において、テストサンプルは、胎児及び母体のｃｆＤＮＡ分子混合物を含む母体血漿サンプルとする。 In step 115, a test sample is taken from a specimen suspected or known to have clinically relevant CNV in the sequence of interest. The test sample can be a biological fluid, such as plasma, or any suitable sample as described below. In some embodiments, the test sample will comprise a mixture of nucleic acid molecules, eg, cfDNA molecules. In some embodiments, the test sample is a maternal plasma sample containing a mixture of fetal and maternal cfDNA molecules.

ステップ１２５で、テストサンプルにおけるテスト核酸の少なくとも一部を、適格サンプルにつき説明したように、シークエンシングし、例えば、３６ｂｐリードの配列リードを数１００万個生成する。ステップ１２０と同様に、テストサンプルにおける核酸をシークエンシングすることから生じたリードを基準ゲノムに対して一意的にマッピングする。ステップ１２０につき説明したように、２０〜４０個の塩基対（ｂｐ）リード（reads）を有する、少なくとも約３×１０^６個の適格配列タグ、少なくとも約５×１０^６個の適格配列タグ、少なくとも約８×１０^６個の適格配列タグ、少なくとも約１０×１０^６個の適格配列タグ、少なくとも約１５×１０^６個の適格配列タグ、少なくとも約２０×１０^６個の適格配列タグ、少なくとも約３０×１０^６個の適格配列タグ、少なくとも約４０×１０^６個の適格配列タグ、少なくとも約５０×１０^６個の適格配列タグを、基準ゲノムに対して一意的にマッピングするリードから得る。 At step 125, at least a portion of the test nucleic acid in the test sample is sequenced as described for the qualified sample to generate millions of sequence reads, eg, 36 bp reads. Similar to step 120, the reads resulting from sequencing the nucleic acids in the test sample are uniquely mapped to a reference genome. As described for step 120, at least about 3 × 10 ⁶ qualified sequence tags, at least about 5 × 10 ⁶ qualified sequence tags, having at least 20-40 base pair (bp) reads, at least About 8 × 10 ⁶ qualified sequence tags, at least about 10 × 10 ⁶ qualified sequence tags, at least about 15 × 10 ⁶ qualified sequence tags, at least about 20 × 10 ⁶ qualified sequence tags, at least about 30 X10 ⁶ qualified sequence tags, at least about 40 x 10 ⁶ qualified sequence tags, at least about 50 x 10 ⁶ qualified sequence tags are obtained from reads that uniquely map to a reference genome.

ステップ１３５で、テストサンプルにおける核酸をシークエンシングすることから得るすべてのタグをカウントして、テスト配列タグ密度を決定する。一実施形態において、関心対象配列にマッピングされたテスト配列タグ数は、テスト配列タグ密度比を得るようマッピングされる関心対象配列の既知長さに正規化する。適格サンプルにつき説明したように、関心対象配列の既知長さに正規化する必要はなく、ヒト解釈用に簡素化するための数値における桁数を減らすステップとして含めることができる。すべてのマッピングされたテスト配列タグをテストサンプルにおいてカウントするとき、テストサンプルにおける関心対象配列、例えば臨床関連配列の配列タグ密度は、適格サンプルで同定される少なくとも１個の正規化配列に対応する付加的配列のための配列タグ密度として決定される。 At step 135, all tags resulting from sequencing nucleic acids in the test sample are counted to determine the test sequence tag density. In one embodiment, the number of test sequence tags mapped to the sequence of interest is normalized to the known length of the sequence of interest mapped to obtain a test sequence tag density ratio. As described for the qualified sample, it is not necessary to normalize to the known length of the sequence of interest and can be included as a step to reduce the number of digits in the number for simplification for human interpretation. When counting all mapped test sequence tags in a test sample, the sequence tag density of a sequence of interest in the test sample, eg, a clinically relevant sequence, is added corresponding to at least one normalized sequence identified in a qualified sample Determined as the sequence tag density for the target sequence.

ステップ１５０で、適格サンプルにおける少なくとも１個の正規化配列の同定に基づいて、テスト配列ドースを、テストサンプルにおける関心対象配列に対して決定する。本明細書のいたるところに記載したように、少なくとも１個の正規化配列は、単独配列又は配列グループとすることができる。テストサンプルにおける関心対象配列の配列ドースは、テストサンプルにおける関心対象配列に関して決定した配列タグ密度と、テストサンプルで決定した少なくとも１個の正規化配列の配列タグ密度との比であり、この場合、テストサンプルにおける正規化配列は、特別な関心対象配列のために適格サンプルにおいて同定した正規化配列に対応する。例えば、適格サンプルにおいて２１番染色体に対して同定した正規化配列が染色体、例えば１４番染色体であると決定される場合、２１番染色体（関心対象配列）のテスト配列ドースは、２１番染色体の配列タグ密度と、テストサンプルでそれぞれ決定された１４番染色体の配列タグ密度との比として決定される。同様に、１３番染色体、１８番染色体、Ｘ染色体、Ｙ染色体及び染色体異数性に関連する他の染色体の染色体ドースを決定する。関心対象染色体の正規化配列は、１個の染色体若しくは染色体グループ、又は１個の染色体断片若しくは染色体断片グループとすることができる。上述したように、関心対象配列は、染色体の一部、例えば染色体断片の一部とすることができる。したがって、染色体断片のドースは、テストサンプルにおける断片に関して決定された配列タグ密度と、テストサンプルにおける正規化染色体断片の配列タグ密度との比として決定することができ、この場合、テストサンプルにおける正規化断片は、特別な関心対象断片のために適格サンプルにおいて同定された正規化断片（単独断片又は断片グループ）に対応する。染色体断片は寸法がキロベースからメガベースにわたる。 At step 150, a test sequence dose is determined for the sequence of interest in the test sample based on the identification of at least one normalized sequence in the eligible sample. As described throughout the specification, at least one normalized sequence can be a single sequence or a sequence group. The sequence dose of the sequence of interest in the test sample is the ratio of the sequence tag density determined for the sequence of interest in the test sample to the sequence tag density of at least one normalized sequence determined in the test sample, where The normalized sequence in the test sample corresponds to the normalized sequence identified in the eligible sample for the particular sequence of interest. For example, if the normalized sequence identified for chromosome 21 in a qualified sample is determined to be a chromosome, eg, chromosome 14, the test sequence dose for chromosome 21 (the sequence of interest) is the sequence of chromosome 21 It is determined as a ratio between the tag density and the sequence tag density of chromosome 14 determined in each test sample. Similarly, chromosome doses of chromosome 13, chromosome 18, X chromosome, Y chromosome and other chromosomes related to chromosome aneuploidy are determined. The normalized sequence of the chromosome of interest can be a single chromosome or group of chromosomes, or a single chromosome fragment or group of chromosome fragments. As mentioned above, the sequence of interest can be part of a chromosome, for example part of a chromosome fragment. Thus, the chromosomal fragment dose can be determined as the ratio of the sequence tag density determined for the fragment in the test sample to the normalized chromosomal fragment sequence tag density in the test sample, in which case the normalized in the test sample Fragments correspond to normalized fragments (single fragments or fragment groups) identified in a qualified sample for a particular fragment of interest. Chromosome fragments range in size from kilobases to megabases.

ステップ１５５で、複数個の適格サンプルで決定された適格配列ドース、及び関心対象配列に関して異数性が既知のサンプルに対して決定された配列ドースのために確立された標準偏差から閾値を導き出す。正確な分類は、異なるクラス、すなわち、異数性のタイプに関する確率分布間の相違に依存する。好適には、閾値は、異数性の各タイプ、例えば２１番トリソミーの経験分布から選択する。胎児及び母体の核酸混合物を含む母体サンプルから抽出したｃｆＤＮＡをシークエンシングすることによって染色体異数性を決定するのに本発明方法を使用することを説明する実施例で記載するように、１３番トリソミー、１８番トリソミー、２１番トリソミー及びＸモノソミーの異数性を分類するのに確立された可能な閾値がある。或る染色体異数性の異変サンプルを区別するよう決定される閾値は、異なる異数性の異変サンプルを区別するよう決定される閾値と同一又は異なるものとすることができる。実施例で示すように、各関心対象染色体の閾値は、サンプル間及びシークエンシングラン間にわたる染色体ドースの変動性から決定する。任意の関心対象染色体の染色体ドースの変動性が少なければ少ないほど、異なった異数性を決定するための閾値を設定するのに使用されるすべての異変なしサンプルにわたる関心対象染色体のドースにおける分散は狭くなる。 In step 155, a threshold is derived from the qualifying sequence dose determined for the plurality of qualifying samples and the standard deviation established for the sequence dose determined for the sample with known aneuploidy with respect to the sequence of interest. The exact classification depends on the difference between the probability distributions for different classes, i.e. aneuploidy types. Preferably, the threshold is selected from an empirical distribution of each type of aneuploidy, eg trisomy 21. Trisomy 13 as described in the examples illustrating the use of the method of the invention to determine chromosomal aneuploidy by sequencing cfDNA extracted from a maternal sample containing a mixture of fetal and maternal nucleic acids. There are possible thresholds established to classify aneuploidy of trisomy 18, trisomy 21, trisomy 21 and X monosomy. The threshold value determined to distinguish anomalous samples of a certain chromosomal aneuploidy may be the same or different from the threshold value determined to distinguish anomalous samples of different aneuploidy. As shown in the examples, the threshold for each chromosome of interest is determined from the variability of the chromosomal dose across samples and between sequencing runs. The less variability of the chromosomal dose of any chromosome of interest is, the less the variance in the chromosomal dose of interest across all non-invariant samples used to set a threshold for determining different aneuploidies. Narrow.

ステップ１６０で、関心対象配列のコピー数多型を、関心対象配列のテスト配列ドースを適格配列ドースから確立した少なくとも１個の閾値と比較することによって決定する。 At step 160, the copy number variation of the sequence of interest is determined by comparing the test sequence dose of the sequence of interest with at least one threshold established from the qualified sequence dose.

ステップ１６５で、関心対象テスト配列の計算したドースを、ユーザー定義の信頼性閾値に従って選択した閾値として設定したドースと比較し、サンプルを「正常」、「異変あり」、「ノーコール」として分類する。「ノーコール」サンプルは、断定的診断を信頼性を持って下せないサンプルである。 In step 165, the calculated dose of the test sequence of interest is compared to the dose set as the threshold selected according to the user-defined confidence threshold, and the sample is classified as “normal”, “abnormal”, “no call”. A “no call” sample is a sample that cannot make a definitive diagnosis reliably.

本発明の他の実施形態は、胎児及び母体の核酸分子を有する生物学的サンプルにおける胎児の染色体異数性を出生前診断する方法を提供する。この診断は、生物学的テストサンプル、例えば母体血漿サンプルから採取した胎児及び母体の核酸分子混合物の少なくとも一部をシークエンシングする配列情報を取得し、配列データから１個又はそれ以上の関心対象染色体の正規化ドース及び／又は１個又はそれ以上の関心対象断片の正規化断片ドースをコンピュータ計算し、テストサンプルにおける関心対象染色体の染色体ドース及び／又は関心対象断片の断片ドースと、複数個の適格（正常）サンプルで確立された閾値との統計学的有意差を決定し、及び統計学的有意差に基づいて出生前診断を行うことに基づく。本発明方法のステップ１６５で説明するように、正常又は異変ありの診断を行う。「ノーコール」は、正常又は異変ありの診断が確信を持って行うことができない場合に下す。 Other embodiments of the invention provide methods for prenatal diagnosis of fetal chromosomal aneuploidy in biological samples having fetal and maternal nucleic acid molecules. This diagnosis involves obtaining sequence information that sequences at least a portion of a mixture of fetal and maternal nucleic acid molecules taken from a biological test sample, eg, a maternal plasma sample, and from the sequence data one or more chromosomes of interest. Computerized normalization doses and / or normalized fragment doses of one or more fragments of interest, and chromosomal doses of interest chromosomes and / or fragment doses of fragments of interest in the test sample Based on determining a statistically significant difference from the established threshold in the (normal) sample and making a prenatal diagnosis based on the statistically significant difference. As described in step 165 of the method of the present invention, a normal or abnormal diagnosis is performed. “No call” is given when a normal or abnormal diagnosis cannot be made with certainty.

サンプル
ＣＮＶ、例えば染色体異数性及び部分的異数性を決定するのに使用するサンプルは、細胞内に存在する、又は「無細胞」の核酸を有する。本発明の若干の実施形態において、無細胞核酸、例えば無細胞（cell-free）ＤＮＡ(ｃｆＤＮＡ)を採取するのが有利である。無細胞ＤＮＡを含む無細胞核酸は、生物学的サンプル、例えば限定しないが血漿及び血清から従来既知の種々の方法によって採取することができる（Chen et al., Nature Med.2:1033-1035 [1996]; Lo et al., Lancet 350:485-487 [1997]参照）。細胞から無細胞ＤＮＡを分離するため、分別、遠心分離（例えば、密度勾配遠心分離）、ＤＮＡ特異的沈降、又は高スループットの細胞ソート及び／又は分離方法を使用することができる。 Sample CNV, eg, a sample used to determine chromosomal aneuploidy and partial aneuploidy, is present in a cell or has “cell-free” nucleic acid. In some embodiments of the invention, it is advantageous to harvest cell-free nucleic acids, such as cell-free DNA (cfDNA). Cell-free nucleic acids, including cell-free DNA, can be collected from biological samples such as, but not limited to, plasma and serum by various methods known in the art (Chen et al., Nature Med. 2: 1033-1035 [ 1996]; Lo et al., Lancet 350: 485-487 [1997]). Fractionation, centrifugation (eg, density gradient centrifugation), DNA-specific precipitation, or high throughput cell sorting and / or separation methods can be used to separate cell-free DNA from cells.

本発明に記載の方法を適用する核酸混合物を有するサンプルは、例えば組織サンプル、生体液サンプル、又は細胞サンプルのような生物学的サンプルである。若干の実施形態において、核酸混合物を生物学的サンプルから既知の方法のうち任意な１つのによって、精製又は分離する。サンプルは精製又は分離したポリヌクレオチドにより構成されるか、又は組織サンプル、生体液サンプル、又は細胞サンプルのような生物学的サンプルを含むものとすることができる。生体液としては、限定しないが、血液、血漿、血清、汗、涙、痰、尿、精液、内耳液、リンパ液、唾液、脳脊髄液、破出液、骨髄懸濁液、膣液、経頸管洗浄液、脳液、腹水、母乳、呼吸器官、腸管及び泌尿生殖器管の分泌物、羊水、及び白血球共生サンプルがある。若干の実施形態において、サンプルは、非侵襲的手順によって容易に採取できるサンプル、例えば、血液、血漿、血清、汗、涙、痰、尿、精液、内耳液、唾液、又は糞便とする。好適には、生物学的サンプルは、末梢血液サンプル、又は血漿及び血清画分とする。他の実施形態において、生物学的サンプルは、綿棒若しくは塗抹採取試料、生検試料、又は細胞培養とする。他の実施形態において、サンプルは、２種類以上の生物学的サンプルの混合物とし、例えば、生物学的サンプルは、生体液サンプル、組織サンプル及び細胞培養サンプルのうち２つ又はそれ以上を有するものとすることができる。本明細書に使用する用語「血液」、「血漿」及び「血清」は、それらの画分又は処理した部分をも包含する。同様に、サンプルを生検、綿棒、塗抹等から採取する場合、「サンプル」は生検、綿棒、塗抹等に由来する処理画分又は部分をも含む。 The sample with the nucleic acid mixture to which the method according to the invention is applied is a biological sample such as a tissue sample, a biological fluid sample or a cell sample. In some embodiments, the nucleic acid mixture is purified or separated from the biological sample by any one of the known methods. The sample can be composed of purified or separated polynucleotides, or can include biological samples such as tissue samples, biological fluid samples, or cell samples. Biological fluids include, but are not limited to, blood, plasma, serum, sweat, tears, sputum, urine, semen, inner ear fluid, lymph fluid, saliva, cerebrospinal fluid, exudate, bone marrow suspension, vaginal fluid, transcervical canal There are lavage fluid, brain fluid, ascites, breast milk, respiratory organs, intestinal and urogenital secretions, amniotic fluid, and leukocyte symbiosis samples. In some embodiments, the sample is a sample that can be easily collected by non-invasive procedures, such as blood, plasma, serum, sweat, tears, sputum, urine, semen, inner ear fluid, saliva, or feces. Preferably, the biological sample is a peripheral blood sample or a plasma and serum fraction. In other embodiments, the biological sample is a swab or smear sample, a biopsy sample, or a cell culture. In other embodiments, the sample is a mixture of two or more biological samples, eg, the biological sample comprises two or more of a biological fluid sample, a tissue sample, and a cell culture sample. can do. As used herein, the terms “blood”, “plasma” and “serum” also include fractions or processed portions thereof. Similarly, when a sample is taken from a biopsy, cotton swab, smear, etc., the “sample” also includes a treated fraction or portion derived from a biopsy, cotton swab, smear, etc.

若干の実施形態において、サンプルはソースから採取することができ、限定はしないが、異なる個体、同一又は異なる個体の異なる発達段階、異なる疾患の個体（例えば、がんを有する又は遺伝性疾患を有することが疑われる個体）、正常個体、個体における疾患の異なる段階で採取したサンプル、異なる環境因子を受けた個体からのサンプル、又は病理学的素因を有する個体、又は感染性疾患作用因子（ＨＩＶ）に被曝した個体からのサンプルがある。 In some embodiments, the sample can be taken from a source, including but not limited to different individuals, different developmental stages of the same or different individuals, individuals with different diseases (eg, having cancer or having a genetic disease Individuals suspected to be), normal individuals, samples taken at different stages of the disease in individuals, samples from individuals who have received different environmental factors, or individuals with a pathological predisposition, or infectious disease agent (HIV) There are samples from individuals exposed to.

一実施形態において、サンプルは、妊娠した雌体、例えば、妊娠した女性から採取した母体サンプルとする。この場合、サンプルは本明細書に記載した方法を使用して解析し、胎児の潜在的染色体異常の出生前診断を行う。母体サンプルは組織サンプル、生体液サンプル、又は細胞サンプルとすることができる。生体液は、非限定的な例として、血液、血漿、血清、汗、涙、痰、尿、精液、内耳液、リンパ液、唾液、脳脊髄液、破出液、骨髄懸濁液、膣液、経頸管洗浄液、脳液、腹水、母乳、呼吸器官、腸管及び泌尿生殖器管の分泌物、羊水、及び白血球共生サンプルがある。他の実施形態において、母体サンプルは、２つ異常の生物学的サンプルの混合物とすることができ、例えば、生物学的サンプルは、生体液サンプル、組織サンプル及び細胞培養サンプルのうち２つ又はそれ以上を有するものとすることができる。若干の実施形態において、サンプルは、非侵襲的手順によって容易に採取できるサンプル、例えば、血液、血漿、血清、汗、涙、痰、尿、精液、内耳液、唾液、又は糞便とする。好適には、生物学的サンプルは、末梢血液サンプル、又は血漿及び血清画分とする。他の実施形態において、生物学的サンプルは、綿棒若しくは塗抹採取試料、生検試料、又は細胞培養とする。本明細書に使用する用語「血液」、「血漿」及び「血清」は、それらの画分又は処理した部分をも包含する。同様に、サンプルを生検、綿棒、塗抹等から採取する場合、「サンプル」は生検、綿棒、塗抹等に由来する処理画分又は部分をも含む。 In one embodiment, the sample is a pregnant female body, eg, a maternal sample taken from a pregnant woman. In this case, the sample is analyzed using the methods described herein for prenatal diagnosis of fetal potential chromosomal abnormalities. The maternal sample can be a tissue sample, a biological fluid sample, or a cell sample. Non-limiting examples of biological fluids include blood, plasma, serum, sweat, tears, sputum, urine, semen, inner ear fluid, lymph fluid, saliva, cerebrospinal fluid, rupture fluid, bone marrow suspension, vaginal fluid, There are transcervical lavage fluid, brain fluid, ascites, breast milk, respiratory organs, intestinal and urogenital secretions, amniotic fluid, and leukocyte symbiosis samples. In other embodiments, the maternal sample can be a mixture of two abnormal biological samples, eg, the biological sample is two or more of a biological fluid sample, a tissue sample, and a cell culture sample. It can have the above. In some embodiments, the sample is a sample that can be easily collected by non-invasive procedures, such as blood, plasma, serum, sweat, tears, sputum, urine, semen, inner ear fluid, saliva, or feces. Preferably, the biological sample is a peripheral blood sample or a plasma and serum fraction. In other embodiments, the biological sample is a swab or smear sample, a biopsy sample, or a cell culture. As used herein, the terms “blood”, “plasma” and “serum” also include fractions or processed portions thereof. Similarly, when a sample is taken from a biopsy, cotton swab, smear, etc., the “sample” also includes a treated fraction or portion derived from a biopsy, cotton swab, smear, etc.

サンプルは、試験管内培養したソースを含む組織、細胞、又はポリヌクレオチドから採取することができる。培養したサンプルは、限定しないが、異なる培地及び条件（例えば、ｐＨ、圧力、若しくは温度）に維持した培養物（例えば、組織若しくは細胞）、異なる長さの期間にわたり維持した培養物（例えば、組織若しくは細胞）、異なる因子若しくは試薬（例えば、薬剤候補若しくは調節因子）で処理した培養物（例えば、組織若しくは細胞）、又は組織若しくは細胞における異なるタイプの培養物を含むソースから採取することができる。 Samples can be taken from tissues, cells, or polynucleotides containing sources cultured in vitro. Cultured samples include, but are not limited to, cultures (eg, tissues or cells) maintained at different media and conditions (eg, pH, pressure, or temperature), cultures (eg, tissue maintained for different lengths of time) Or cells), cultures (eg, tissues or cells) treated with different factors or reagents (eg, drug candidates or modulators), or sources containing different types of cultures in tissues or cells.

生物学的ソースから核酸を単離する方法は、既知であり、またソースの性質に基づいて異なってくる。当業者であれば、方法に記載の方法に必要なソースから核酸を容易に単離することができる。若干の実施形態において、核酸サンプルにおける核酸分子をフラグメント化するのは有利である。フラグメント化を行うとき、ランダムとするか、又は例えば制限エンドヌクレアーゼを使用する特別なものとすることができる。ランダムフラグメント化の方法は従来既知であり、例えば、デオキシリボヌクレアーゼ温浸、アルカリ処理、及び物理的剪断がある。一実施形態において、サンプル核酸は、フラグメント化を受けていないｃｆＤＮＡから採取する。他の実施形態において、サンプル核酸は、約５００以上の塩基対のフラグメントにフラグメント化された、またＮＧＳ方法を容易に適用できるゲノムＤＮＡから採取する。 Methods for isolating nucleic acids from biological sources are known and will vary based on the nature of the source. One skilled in the art can readily isolate nucleic acids from the sources required for the methods described in the methods. In some embodiments, it is advantageous to fragment nucleic acid molecules in a nucleic acid sample. When performing fragmentation, it can be random or special, for example using a restriction endonuclease. Random fragmentation methods are known in the art and include, for example, deoxyribonuclease digestion, alkaline treatment, and physical shearing. In one embodiment, the sample nucleic acid is taken from unfragmented cfDNA. In other embodiments, the sample nucleic acid is taken from genomic DNA fragmented into fragments of about 500 base pairs or more and to which the NGS method can be easily applied.

出生前診断のためのＣＮＶ決定
母体血液内で循環する無細胞胎児ＤＮＡ及びＲＮＡは、母体管理及び生殖意思決定支援双方を行うため、多くの遺伝子疾患の初期出生前診断（ＮＩＰＤ：non-invasive prenatal diagnosis）に使用することができる。血流内で循環する無細胞ＤＮＡの存在は５０年にわたり知られてきた。より最近では、循環する少量の胎児ＤＮＡの存在が妊娠期間中の母体血流で発見された（Lo et al., Lancet 350:485-487 [1997]参照）。死にかけている胎盤細胞に由来すると考えられている、無細胞（cell-free）胎児ＤＮＡ（ｃｆＤＮＡ）は、典型的には２００ｂｐより少ない長さの短いフラグメントにより構成されていることが分かっており（Chan et al., Clin Chem 50:88-92 {2004]参照）、このｃｆＤＮＡは、妊娠４週目の早期に確認することができ（Illanes et al., Early Human Dev 83:563-566 [2007]参照）、また数時間の配給内で母体循環から除去されることがわかっている（Lo et al., Am J Hum Genet64:218-224 [1999]参照）。ＣｆＤＮＡの他に、無細胞胎児ＲＮＡ（ｃｆＲＮＡ）のフラグメントは、母体血流内に確認することができ、これは胎児又は胎盤内で転写された遺伝子に由来する。母体血液サンプルからのこれら胎児遺伝要素の抽出及びその後の解析により、新規なＮＩＰＤ機会をもたらす。 CNV Determination for Prenatal Diagnosis Cell-free fetal DNA and RNA circulating in maternal blood provide both maternal management and reproductive decision support, and therefore, early prenatal diagnosis (NIPD) for many genetic diseases diagnosis). The existence of cell-free DNA circulating in the bloodstream has been known for 50 years. More recently, the presence of small amounts of circulating fetal DNA has been found in maternal blood flow during pregnancy (see Lo et al., Lancet 350: 485-487 [1997]). Cell-free fetal DNA (cfDNA), believed to be derived from dying placental cells, has been found to be composed of short fragments, typically less than 200 bp in length ( Chan et al., Clin Chem 50: 88-92 [2004]) and this cfDNA can be confirmed early in the fourth week of pregnancy (Illanes et al., Early Human Dev 83: 563-566 [2007] ) And have been found to be removed from the maternal circulation within a few hours of distribution (see Lo et al., Am J Hum Genet 64: 218-224 [1999]). In addition to CfDNA, a cell-free fetal RNA (cfRNA) fragment can be identified in the maternal bloodstream, which is derived from a gene transcribed in the fetus or placenta. Extraction of these fetal genetic elements from maternal blood samples and subsequent analysis provides new NIPD opportunities.

本発明方法は、ＮＩＰＤに使用され、また胎児ｃｆＤＮＡを母体ｃｆＤＮＡから区別する必要がなく、胎児の異数性を決定することができる多型性独立方法である。若干の実施形態において、異数性は、完全染色体トリソミー若しくはモノソミー又は部分的トリソミー若しくはモノソミーである。部分的異数性は、染色体の一部の不足又は過剰によって生じ、また不安定な転座、不安定な反転、欠失及び挿入から引き起こされる染色体不均衡を含む。これまで、最も一般的に知られている生存可能な異数性は、２１番トリソミー、すなわちダウン症候群（ＤＳ）であり、これは２１番染色体の一部又はすべての存在によって生ずる。ＤＳは遺伝的又は散発性の異常によって引き起こされ、２１番染色体のすべて又は一部の過剰コピーが、他の染色体（通常、１４番染色体）に取り付いて、単独の変形した染色体を形成する。ＤＳは、知能障害、深刻な学習困難性、及び心臓病などの長期健康問題で生ずる超過死亡に関連する。臨床的に有意であることが知られている他の異数性としては、エドワーズ症候群（１８番トリソミー）及びパトー症候群（１３番トリソミー）があり、しばしば最初の２，３か月の寿命で死亡する。性染色体の数に関連する異常性も既知であり、Ｘモノソミー、例えばターナー症候群（ＸＯ）、及び女児出生におけるトリプルＸ症候群（ＸＸＸ）、クラインフェルター症候群（ＸＸＹ）、及び男児出生におけるＸＹＹ症候群があり、知的技能の欠落及び減退を含む種々の表現型に関連する。本発明方法を使用して、これら及び他の染色体異常を出生前に診断することができる。 The method of the present invention is a polymorphism-independent method that can be used for NIPD and can determine fetal aneuploidy without the need to distinguish fetal cfDNA from maternal cfDNA. In some embodiments, the aneuploidy is full chromosomal trisomy or monosomy or partial trisomy or monosomy. Partial aneuploidy is caused by a deficiency or excess of part of the chromosome and includes chromosomal imbalances caused by unstable translocations, unstable inversions, deletions and insertions. To date, the most commonly known viable aneuploidy is trisomy 21 or Down's syndrome (DS), which is caused by the presence of some or all of chromosome 21. DS is caused by genetic or sporadic abnormalities, and an extra copy of all or part of chromosome 21 attaches to another chromosome (usually chromosome 14) to form a single deformed chromosome. DS is associated with excess mortality resulting from intellectual disability, serious learning difficulties, and long-term health problems such as heart disease. Other aneuploidies known to be clinically significant include Edwards syndrome (Trisomy 18) and Pateau syndrome (Trisomy 13), often dying at the first few months of life To do. Abnormalities related to the number of sex chromosomes are also known, including X monosomy, such as Turner syndrome (XO), and triple X syndrome (XXX), Kleinfelter syndrome (XXY), and XYY syndrome in boys Related to various phenotypes, including lack and decline of intellectual skills These and other chromosomal abnormalities can be diagnosed prenatally using the methods of the present invention.

本発明の若干の実施形態によれば、本発明方法により決定されるトリソミーは、２１番トリソミー（Ｔ２１;ダウン症候群）、１８番トリソミー（Ｔ１８;エドワーズ症候群）、１６番トリソミー（Ｔ１６）、２２番トリソミー（Ｔ２２；キャットアイ症候群）、１５番トリソミー（Ｔ１５；プラダーウィリ症候群）、１３番トリソミー（Ｔ１３；パトー症候群）、８番トリソミー（Ｔ８；ワーカニー症候群）、ＸＸＹ（クラインフェルター症候群）、ＸＹＹ、又はＸＸＸトリソミーに限定しない。本発明が教示することによれば、様々な他の完全トリソミー及び部分的トリソミーを胎児ｃｆＤＮＡで決定することができる。部分的トリソミーの例としては、限定しないが、部分トリソミーｌｑ３２〜４４、９ｐトリソミー、４番トリソミーモザイク、１７ｐトリソミー、部分トリソミー４ｑ２６−ｑｔｅｒ、９番トリソミー、部分２ｐトリソミー、部分トリソミー１ｑ、及び／又は部分トリソミー６ｐ／モノソミー６ｑがある。 According to some embodiments of the present invention, the trisomy determined by the method of the present invention is Trisomy 21 (T21; Down's Syndrome), Trisomy 18 (T18; Edwards Syndrome), Trisomy 16 (T16), 22 Trisomy (T22; Cat Eye Syndrome), Trisomy 15 (T15; Praderwilli Syndrome), Trisomy 13 (T13; Patou Syndrome), Trisomy 8 (T8; Workany Syndrome), XXY (Kleinfelter Syndrome), XYY, or XXX Not limited to trisomy. According to the teachings of the present invention, various other complete and partial trisomy can be determined in fetal cfDNA. Examples of partial trisomy include, but are not limited to, partial trisomy lq32-44, 9p trisomy, 4th trisomy mosaic, 17p trisomy, partial trisomy 4q26-qter, 9th trisomy, partial 2p trisomy, partial trisomy 1q, and / or There is partial trisomy 6p / monosomy 6q.

本発明方法は、Ｘ染色体モノソミー、及び部分的モノソミーを決定することができ、部分的モノソミーとしては、例えば１３番モノソミー、１５番モノソミー、１６番モノソミー、２１番モノソミー、２２番モノソミーがあり、これらは流産となる妊娠に見られることが知られている。一般的に完全異数性に見られる染色体の部分的モノソミーも本発明方法によって決定することができる。モノソミー１８ｐは、まれな染色体障害であり、これは１８番染色体の短アームのすべて又は一部を欠失している（一染色体性である）。この障害は、典型的には小人症、程度に幅がある知能発育不全、発話発達遅滞、頭蓋骨及び顔（頭蓋顔面）領域奇形、及び／又は付加的な身体的異常で特徴付けされる。頭蓋顔面に関連する障害は、範囲及び重篤度に関して、ケースごとに大きく変動がある。１５番染色体の構造又はコピー数多型によって生ずる症状としては、アンジェルマン症候群及びプラダーウィリ症候群があり、これらは１５番染色体の同一部分、１５ｑ１１〜ｑ１３における遺伝子活性欠乏がある。幾つかの転座及び微小欠失は、キャリヤである親には無症状であり得るが、それでも子孫に大きな遺伝的障害を発症し得る。例えば、１５ｑ１１〜ｑ１３微小欠失を有する健康な母親は、アンジェルマン症候群、重篤な神経変性疾患を持つ子供を産むことがあり得る。本発明方法を使用して胎児におけるこのような部分的欠失及び他の欠失を同定することができる。部分モノソミー１３ｑはまれな染色体障害であり、１３番染色体における長アームの一部が消失している（一染色体性である）結果である。部分モノソミー１３ｑを持って生まれた胎児は、低出産時体重、頭部及び顔（頭蓋顔面領域）の奇形、骨格異常（とくに、手及び足）、及び他の身体的異常を呈する。精神発達遅滞はこの症状の特徴である。幼年期中の死亡率はこの障害を持って産まれた個体が高い。部分モノソミー１３ｑのほとんどすべてのケースは、はっきりとした理由なくランダムに（散発的に）発生する。２２ｑ１１.２欠失症候群（ディジョージ症候群としても知られている）は、２２番染色体の小さいピースの欠失によって発症する症候群である。（２２ｑ１１.２）欠失は、その染色体対のうち一方における長アームにある染色体中間部近傍で生ずる。この症候群の特徴は、同一症候群メンバー内でも広範囲に変動があり、身体の多くの部分に異変を及ぼす。特徴的兆候及び症候は、出生異常、例えば、先天性心臓疾患、最も一般的には神経筋の閉止問題に関連する口蓋における欠陥（口蓋帆咽頭不全）、学習障害、顔の特徴における軽度の相違、及び反復性感染がある。染色体領域２２ｑ１１.２での微小欠失は、統合失調症を２０〜３０倍にも増大させるリスクに関連する。一実施形態において、本発明方法を使用して、限定はしないが、モノソミー１８ｐ、１５番染色体の部分モノソミー（１５ｑ１１〜ｑ１３）、部分モノソミー１３ｑを含む部分モノソミーを決定し、また２２番染色体の部分モノソミーも本発明方法を用いて決定することができる。 The method of the present invention can determine X-chromosome monosomy and partial monosomy. Examples of the partial monosomy include 13 monosomy, 15 monosomy, 16 monosomy, 21 monosomy, and 22 monosomy. Is known to be found in miscarriage pregnancy. A partial monosomy of a chromosome generally found in complete aneuploidy can also be determined by the method of the present invention. Monosomy 18p is a rare chromosomal disorder that lacks all or part of the short arm of chromosome 18 (monochromic). This disorder is typically characterized by dwarfism, varying degrees of intellectual developmental dysfunction, delayed speech development, skull and facial (craniofacial) region malformations, and / or additional physical abnormalities. The craniofacial disorders vary widely from case to case with respect to scope and severity. Symptoms caused by the structure of chromosome 15 or copy number polymorphism include Angelman syndrome and Praderwilli syndrome, which have a deficiency in gene activity in the same part of chromosome 15, 15q11 to q13. Some translocations and microdeletions may be asymptomatic for the carrier parent, but still develop large genetic disorders in the offspring. For example, a healthy mother with 15q11-q13 microdeletions can give birth to a child with Angelman syndrome, a severe neurodegenerative disease. The method of the present invention can be used to identify such partial and other deletions in the fetus. Partial monosomy 13q is a rare chromosomal disorder, and is a result of the loss of part of the long arm in chromosome 13 (monochromic). A fetus born with partial monosomy 13q exhibits low birth weight, head and face deformities (craniofacial region), skeletal abnormalities (particularly hands and feet), and other physical abnormalities. Mental retardation is a characteristic of this symptom. Mortality during childhood is higher for individuals born with this disorder. Almost all cases of partial monosomy 13q occur randomly (sporadic) for no obvious reason. The 22q11.2 deletion syndrome (also known as DiGeorge syndrome) is a syndrome that develops due to the deletion of a small piece of chromosome 22. The (22q11.2) deletion occurs near the middle of the chromosome in the long arm of one of the chromosome pairs. The characteristics of this syndrome vary widely among members of the same syndrome, and affect many parts of the body. Characteristic signs and symptoms are birth defects, such as congenital heart disease, most commonly related to defects in the palate associated with neuromuscular closure problems (palatopharyngeal insufficiency), learning disabilities, minor differences in facial features And recurrent infections. A microdeletion in chromosomal region 22q11.2 is associated with a risk of increasing schizophrenia by 20 to 30 fold. In one embodiment, the method of the present invention is used to determine a partial monosomy including, but not limited to, monosomy 18p, partial monosomy of chromosome 15 (15q11-q13), partial monosomy 13q, and part of chromosome 22 Monosomy can also be determined using the method of the present invention.

本発明方法は、片方の親がこのような異常性のキャリヤであることが分かっている場合に何らかの異数性を決定するのにも使用することができる。これら異数性としては、限定しないが、小さい過剰マーカー染色体（ＳＭＣ：supernumerary marker chromosome）のモザイク、t(11;14)(p15;p13)転座、不均衡転座t(8;11)(p23.2;p15.5)、１１ｑ２３微小欠失、１７ｐ１１.２欠失、２２ｑ１３.３欠失、Ｘｐ２２.３微小欠失、１０ｐ１４欠失、２０ｐ微小欠失、ディジョージ症候群[del(22)(q11.2q11.23]、ウィリアムズ症候群(7q11.23及び7q36欠失)、１ｐ３６欠失、２ｐ微小欠失、神経線維腫症１型（17q11.2微小欠失）、Ｙｑ欠失、ウォルフ・ヒルシュホーン（Wolf-Hirschhorn）症候群（WHS, 4p16.3微小欠失）、１ｐ３６.２微小欠失、１１ｑ１４欠失、１９ｑ１３.２微小欠失、ルビンシュタイン・タイビ（16p13.3微小欠失）７ｐ２１微小欠失、ミラー・ディーカー（Miller-Dieker）症候群、１７ｐ１１.２欠失、及び２ｑ３７微小欠失がある。 The method of the invention can also be used to determine any aneuploidy when one parent is known to be such an abnormal carrier. These aneuploidies include, but are not limited to, a mosaic of small supermarker marker chromosomes (SMC), t (11; 14) (p15; p13) translocation, and unbalanced translocation t (8; 11) ( p23.2; p15.5), 11q23 microdeletion, 17p11.2 deletion, 22q13.3 deletion, Xp22.3 microdeletion, 10p14 deletion, 20p microdeletion, DiGeorge syndrome [del (22) (q11.2q11.23), Williams syndrome (7q11.23 and 7q36 deletion), 1p36 deletion, 2p microdeletion, neurofibromatosis type 1 (17q11.2 microdeletion), Yq deletion, Wolf Wolf-Hirschhorn syndrome (WHS, 4p16.3 microdeletion), 1p36.2 microdeletion, 11q14 deletion, 19q13.2 microdeletion, Rubinstein Taibi (16p13.3 microdeletion) 7p21 microdeletion Deletion, Miller-Dieker syndrome, 17p11.2 deletion, and 2q37 microdeletion .

完全胎児染色体異数性の決定
一実施形態において、本発明は、胎児及び母体の核酸分子を含む母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法を提供する。好適には、本発明方法は、任意の４つ又はそれ以上の完全染色体異数性の有無を決定する。本発明方法のステップは、（ａ）母体テストサンプルにおける胎児及び母体核酸の配列情報を取得するステップと、及び（ｂ）配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体それぞれにおける配列タグ数を同定し、また任意の１個又はそれ以上の関心対象染色体それぞれにおける正規化染色体配列の配列タグ数を同定するステップとを有する。正規化染色体配列は単独染色体とするか、又は１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した染色体グループとすることができる。本発明方法は、さらに、ステップ（ｃ）において、任意の１個又はそれ以上の関心対象染色体それぞれに対して同定した配列タグ数、及び各正規化染色体配列に対して同定した配列タグ数を使用して、任意の１個又はそれ以上の関心対象染色体それぞれにおける単独染色体ドースを計算し、及びステップ（ｄ）で任意の１個又はそれ以上の関心対象染色体それぞれにおける各単独染色体ドースを任意の１個又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児異数性の有無を決定する。 Determination of Complete Fetal Chromosome Aneuploidy In one embodiment, the present invention determines the presence or absence of any one or more different complete fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acid molecules. Provide a method. Suitably, the method of the invention determines the presence or absence of any four or more complete chromosomal aneuploidies. The steps of the method of the present invention include (a) obtaining sequence information of fetal and maternal nucleic acids in a maternal test sample, and (b) using chromosome information from chromosomes 1-22 and X and Y chromosomes. Identifying the number of sequence tags in each of any one or more selected chromosomes of interest and identifying the number of sequence tags of the normalized chromosomal sequence in each of any one or more chromosomes of interest. Have. The normalized chromosomal sequence can be a single chromosome or a chromosome group selected from chromosomes 1-22 and X and Y chromosomes. The method of the invention further uses in step (c) the number of sequence tags identified for each of any one or more chromosomes of interest and the number of sequence tags identified for each normalized chromosomal sequence. A single chromosomal dose in each of any one or more chromosomes of interest, and each single chromosomal dose in each of any one or more of the chromosomes of interest in step (d) Compare to the threshold of each of the one or more chromosomes of interest, thereby determining the presence or absence of any one or more different complete fetal aneuploidies in the maternal test sample.

若干の実施形態において、ステップ（ｃ）は、各関心対象染色体の単独染色体ドースを、各関心対象染色体に対して同定した配列タグ数と、各関心対象染色体の正規化染色体に対して同定した配列タグ数との比として計算する。 In some embodiments, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each chromosome of interest, and the sequence identified for the normalized chromosome of each chromosome of interest. Calculate as a ratio to the number of tags.

他の実施形態において、ステップ（ｃ）は、関心対象染色体それぞれの単独染色体ドースを、関心対象染色体それぞれに対して同定した配列タグ数と、関心対象染色体それぞれの正規化染色体に対して同定した配列タグ数との比として計算する。他の実施形態において、ステップ（ｃ）は、関心対象染色体の配列タグ比の計算を、関心対象染色体に対して得た配列タグ数を関心対象染色体の長さに関連付けし、また関心対象染色体の対応する正規化染色体配列のタグ数を正規化染色体配列の長さに関連付けし、また関心対象染色体の染色体ドースを、関心対象染色体の配列タグ密度と正規化配列の配列タグ密度との比として計算することによって行う。すべての関心対象染色体それぞれに対してこの計算を繰り返す。ステップ（ａ）〜（ｄ）を、異なる母体検体からのテストサンプルに対して繰り返すことができる。 In another embodiment, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each chromosome of interest, and the sequence identified for the normalized chromosome for each chromosome of interest. Calculate as a ratio to the number of tags. In other embodiments, step (c) comprises calculating the sequence tag ratio of the chromosome of interest by associating the number of sequence tags obtained for the chromosome of interest with the length of the chromosome of interest, Corresponding normalized chromosomal sequence tag number is related to normalized chromosomal sequence length, and chromosomal dose of the chromosome of interest is calculated as the ratio of the sequence tag density of the chromosome of interest to the sequence tag density of the normalized sequence By doing. Repeat this calculation for every chromosome of interest. Steps (a)-(d) can be repeated for test samples from different maternal specimens.

４つ又はそれ以上の完全胎児染色体異数性を、胎児及び母体の無細胞ＤＮＡの混合物を含む母体テストサンプルにおいて決定する実施形態の例は、（ａ）テストサンプルにおける胎児及び母体の無細胞ＤＮＡ分子の少なくとも一部をシークエンシングして配列情報を取得するステップと、（ｂ）この配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の２０個又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、またその２０個又はそれ以上の関心対象染色体それぞれの正規化染色体の配列タグ数を同定するステップと、（ｃ）その２０個又はそれ以上の関心対象染色体それぞれに対して同定した配列タグ数及び各正規化染色体に対して同定した配列タグ数を使用して、その２０個又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び（ｄ）その２０個又はそれ以上の関心対象染色体それぞれの各単独染色体ドースを、２０個又はそれ以上の関心対象染色体それぞれの閾値と比較し、またこれによりテストサンプルにおける任意の２０又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有する。 Examples of embodiments in which four or more complete fetal chromosomal aneuploidies are determined in a maternal test sample comprising a mixture of fetal and maternal cell-free DNA include: (a) fetal and maternal cell-free DNA in the test sample Sequencing at least a portion of the molecule to obtain sequence information; (b) using this sequence information, any 20 or more selected from chromosomes 1-22, X and Y chromosomes Identifying the number of sequence tags for each of the chromosomes of interest and identifying the number of sequence tags for the normalized chromosome of each of the 20 or more chromosomes of interest; (c) the 20 or more of the interests Using the number of sequence tags identified for each of the target chromosomes and the number of sequence tags identified for each normalized chromosome, the 20 or more Calculating a single chromosome dose for each of the target chromosomes; and (d) comparing each single chromosome dose for each of the 20 or more chromosomes of interest to a threshold of each of the 20 or more chromosomes of interest. And thereby determining the presence or absence of any 20 or more different complete fetal chromosomal aneuploidies in the test sample.

他の実施形態において、上述の母体テストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定する方法は、関心対象染色体におけるドースを決定する正規化断片配列を使用する。この場合、この方法は、（ａ）テストサンプルにおける胎児及び母体の核酸の配列情報を取得するステップと、（ｂ）この配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、またその１つ又はそれ以上の関心対象染色体それぞれの正規化断片配列の配列タグ数を同定するステップとを有する。正規化断片配列は、染色体の単独断片とするか、又は任意の１個又はそれ以上の異なった染色体からの断片グループとすることができる。この方法は、さらに、（ｃ）その任意の１個又はそれ以上の関心対象染色体それぞれに対して同定した配列タグ数及び各正規化断片配列に対して同定した配列タグ数を使用して、その任意の１個又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び（ｄ）その任意の１個又はそれ以上の関心対象染色体それぞれの各単独染色体ドースを、任意の１個又はそれ以上の関心対象染色体それぞれの閾値と比較し、またこれによりテストサンプルにおける任意の１つ又はそれ以上の異なった完全胎児染色体異数性の有無を決定するステップとを有する。 In other embodiments, a method for determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in the maternal test sample described above uses a normalized fragment sequence that determines the dose in the chromosome of interest. To do. In this case, the method comprises: (a) obtaining sequence information of fetal and maternal nucleic acids in the test sample; and (b) using this sequence information from chromosomes 1-22, X and Y. Identifying the number of sequence tags for each of any one or more selected chromosomes of interest and identifying the number of sequence tags for the normalized fragment sequence of each of the one or more selected chromosomes of interest. . A normalized fragment sequence can be a single fragment of a chromosome or a group of fragments from any one or more different chromosomes. The method further comprises (c) using the number of sequence tags identified for each of the one or more chromosomes of interest and the number of sequence tags identified for each normalized fragment sequence, Calculating a single chromosome dose for each of any one or more chromosomes of interest; and (d) any one single chromosome dose for each of the one or more chromosomes of interest Or comparing to a threshold value for each of the more or less chromosomes of interest and thereby determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in the test sample.

若干の実施形態において、ステップ（ｃ）は、関心対象染色体それぞれの単独染色体ドースを、関心対象染色体それぞれに対して同定した配列タグ数と、関心対象染色体それぞれの正規化断片配列に対して同定した配列タグ数との比として計算する。 In some embodiments, step (c) has identified a single chromosomal dose for each chromosome of interest relative to the number of sequence tags identified for each chromosome of interest and the normalized fragment sequence for each chromosome of interest. Calculated as a ratio to the number of sequence tags.

他の実施形態において、ステップ（ｃ）は、関心対象染色体の配列タグ比の計算を、関心対象染色体に対して得た配列タグ数を関心対象染色体の長さに関連付けし、また関心対象染色体の対応する正規化断片配列のタグ数を正規化断片配列の長さに関連付けし、また関心対象染色体の染色体ドースを、関心対象染色体の配列タグ密度と正規化断片配列の配列タグ密度との比として計算することによって行う。すべての関心対象染色体それぞれに対してこの計算を繰り返す。ステップ（ａ）〜（ｄ）を、異なる母体検体からのテストサンプルに対して繰り返すことができる。 In other embodiments, step (c) comprises calculating the sequence tag ratio of the chromosome of interest by associating the number of sequence tags obtained for the chromosome of interest with the length of the chromosome of interest, The number of tags in the corresponding normalized fragment sequence is related to the length of the normalized fragment sequence, and the chromosomal dose of the chromosome of interest is expressed as the ratio of the sequence tag density of the chromosome of interest to the sequence tag density of the normalized fragment sequence. Do by calculating. Repeat this calculation for every chromosome of interest. Steps (a)-(d) can be repeated for test samples from different maternal specimens.

異なるサンプルセットの染色体ドースを比較する手段は、テストサンプルにおける染色体ドースを、適格サンプルセットにおける対応の染色体ドースの平均に関連付ける正規化染色体値（ＮＣＶ：normalized chromosome value）を決定することによって得ることができる。ＮＣＶは、次式のように計算する。
ここで、
は、それぞれ適格サンプルセットにおけるｊ番染色体ドースに対する推定した平均及び標準偏差であり、ｘ_ijはテストサンプルｉにおける観測したｊ番染色体ドースである。 Means for comparing chromosomal doses of different sample sets can be obtained by determining a normalized chromosome value (NCV) that relates the chromosomal doses in the test sample to the average of the corresponding chromosomal doses in the qualified sample set. it can. NCV is calculated as follows:
here,
Are the estimated mean and standard deviation for chromosome jose in the eligible sample set, respectively, and x _ij is the observed chromosome jose in test sample i.

若干の実施形態において、少なくとも１つの完全胎児染色体異数性の有無を決定する。他の実施形態において、少なくとも２つ、少なくとも３つ、少なくとも４つ、少なくとも５つ、少なくとも６つ、少なくとも７つ、少なくとも８つ、少なくとも９つ、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４の、完全胎児染色体異数性の有無を１つのサンプル内で決定し、この場合、２２の完全胎児染色体異数性は、任意の１つ又はそれ以上の常染色体における完全染色体異数性に対応し、２３番目の染色体異数性及び２４番目の染色体異数性はＸ染色体及びＹ染色体の完全胎児染色体異数性に対応する。性染色体の異数性はテトラソミー、ペンタソミー及び他のポリソミーを含むため、本発明方法により決定することができる異なる完全染色体異数性の数としては、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、又は少なくとも３０の、完全染色体異数性がある。したがって、決定される異なる完全胎児染色体異数性の数は、解析用に選択される関心対象染色体の数に関連する。 In some embodiments, the presence or absence of at least one complete fetal chromosomal aneuploidy is determined. In other embodiments, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, Determining whether or not at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24 complete fetal chromosomal aneuploidy within a sample; In this case, the 22 complete fetal chromosomal aneuploidy corresponds to the complete chromosomal aneuploidy in any one or more autosomes, the 23rd chromosome aneuploidy and the 24th chromosome aneuploidy are Corresponds to the complete fetal chromosomal aneuploidy of the X and Y chromosomes. Since sex chromosome aneuploidy includes tetrasomy, pentasomy and other polysomy, the number of different complete chromosomal aneuploidies that can be determined by the method of the present invention is at least 24, at least 25, at least 26, at least 27, There are at least 28, at least 29, or at least 30 complete chromosomal aneuploidies. Thus, the number of different complete fetal chromosomal aneuploidies determined is related to the number of chromosomes of interest selected for analysis.

一実施形態において、上述のように母体テストサンプルにおける任意の１個又はそれ以上の異なる完全胎児染色体異数性の有無を決定することは、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した１つの関心対象染色体の正規化断片配列を使用する。他の実施形態において、２個又はそれ以上の関心対象染色体を、１番，２番，３番，４番，５番，６番，７番，８番，９番，１０番，１１番，１２番，１３番，１４番，１５番，１６番，１７番，１８番，１９番，２０番，２１番，２２番染色体、Ｘ染色体及びＹ染色体のうち任意の２つ又はそれ以上から選択する。一実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した少なくとも２０個の染色体を有し、この場合、少なくとも２０の異なった完全胎児染色体異数性の有無を決定する。他の実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべてとし、またこの場合、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべてにおける完全胎児染色体異数性の有無を決定する。決定することができる異なる完全胎児染色体異数性としては、完全染色体トリソミー、完全染色体モノソミー、及び完全染色体ポリソミーがある。完全胎児染色体異数性の例としては、常染色体のうち任意の１個又はそれ以上におけるトリソミー、例えば２番トリソミー、８番トリソミー、９番トリソミー、２１番トリソミー、１３番トリソミー、１６番トリソミー、１８番トリソミー、２２番トリソミーに限定することなく、性染色体のトリソミー、例えば４７,ＸＸＹ、４７ＸＸＸ、及び４７ＸＹＹ；性染色体のテトラソミー、例えば４８,ＸＸＹＹ、４８,ＸＸＸＹ、及び４８ＸＸＸＸ、及び４８ＸＹＹＹ；性染色体のペンタソミー、例えば４９,ＸＸＸＹＹ、４９,ＸＸＸＸＹ、及び４９,ＸＸＸＸＸ、及び４９,ＸＹＹＹＹ；及びＸモノソミーがある。本発明方法によって決定できる他の完全胎児染色体異数性を以下に説明する。 In one embodiment, determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in the maternal test sample as described above was selected from chromosomes 1-22, X and Y. The normalized fragment sequence of one chromosome of interest is used. In other embodiments, two or more chromosomes of interest are assigned to No. 1, No. 2, No. 3, No. 4, No. 5, No. 6, No. 7, No. 8, No. 9, No. 10, Select from any two or more of chromosomes 12, 13, 14, 15, 16, 17, 18, 19, 20, 20, 21, 22, chromosomes X and Y To do. In one embodiment, any one or more chromosomes of interest selected from chromosomes 1-22, X and Y are at least 20 chromosomes selected from chromosomes 1-22, X and Y. Determine the presence or absence of at least 20 different complete fetal chromosomal aneuploidies. In other embodiments, any one or more chromosomes of interest selected from chromosomes 1-22, X and Y are all of chromosomes 1-22, X and Y, and In this case, the presence or absence of complete fetal chromosome aneuploidy in all of chromosomes 1-22 and X and Y is determined. Different complete fetal chromosomal aneuploidies that can be determined include complete chromosomal trisomy, complete chromosomal monosomy, and complete chromosomal polysomy. Examples of complete fetal chromosomal aneuploidy include trisomy in any one or more of the autosomes, such as trisomy 2, trisomy 8, trisomy 9, trisomy 21, trisomy 13, trisomy 16, Without limiting to trisomy 18 and trisomy 22, sex chromosome trisomy, such as 47, XXY, 47XXX, and 47XYY; sex chromosome tetrasomy, such as 48, XXYY, 48, XXY, and 48XXXY, and sex Pentasomy, such as 49, XXXXYY, 49, XXXXY, and 49, XXXXXX, and 49, XYYYY; and X monosomy. Other complete fetal chromosomal aneuploidies that can be determined by the method of the present invention are described below.

部分胎児染色体異数性の決定
他の実施形態において、本発明は、胎児及び母体の核酸分子を含む母体テストサンプルにおける任意の１つ又はそれ以上の異なった部分胎児染色体異数性の有無を決定する方法を提供する。本発明方法のステップとしては、（ａ）母体テストサンプルにおける胎児及び母体核酸の配列情報を取得するステップと、及び（ｂ）配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれにおける配列タグ数を同定し、また任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれにおける正規化断片配列の配列タグ数を同定するステップとを有する。正規化断片配列は染色体の単独断片とするか、又は１個又はそれ以上の異なった染色体からの断片グループとすることができる。本発明方法は、さらに、ステップ（ｃ）において、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれに対して同定した配列タグ数、及び各正規化断片配列に対して同定した配列タグ数を使用して、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれの単独断片ドースを計算し、及びステップ（ｄ）で任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれにおける各単独断片ドースを任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の染色体断片それぞれの閾値と比較し、これにより母体テストサンプルにおける任意の１つ又はそれ以上の異なった部分胎児染色体異数性の有無を決定する。 Determination of Partial Fetal Chromosome Aneuploidy In another embodiment, the present invention determines the presence or absence of any one or more different partial fetal chromosomal aneuploidies in a maternal test sample comprising fetal and maternal nucleic acid molecules. Provide a way to do it. The steps of the method of the present invention are as follows: (a) obtaining sequence information of fetal and maternal nucleic acids in a maternal test sample; and (b) using the sequence information, chromosomes 1-22 and X and Y chromosomes. Identifying the number of sequence tags in each of any one or more fragments in any one or more chromosomes of interest selected from and any one in any one or more chromosomes of interest Or identifying the number of sequence tags of the normalized fragment sequence in each of the more fragments. Normalized fragment sequences can be single fragments of chromosomes or groups of fragments from one or more different chromosomes. The method of the present invention further comprises, in step (c), the number of sequence tags identified for each of any one or more fragments in any one or more chromosomes of interest, and each normalized fragment sequence Calculate the single fragment dose for each of any one or more fragments in any one or more chromosomes of interest using the number of sequence tags identified for and optionally in step (d) Each single fragment dose in any one or more fragments in each of the one or more chromosomes of interest of any one or more chromosome fragments in any one or more of the chromosomes of interest To determine the presence or absence of any one or more different partial fetal chromosomal aneuploidies in the maternal test sample.

若干の実施形態において、ステップ（ｃ）は、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上それぞれの単独断片ドースを、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれに対して同定した配列タグ数と、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれの正規化断片配列に対して同定した配列タグ数との比として計算する。 In some embodiments, step (c) comprises converting any one or more single fragment doses in any one or more chromosomes of interest into any one or more chromosomes of interest. For the number of sequence tags identified for each of any one or more fragments and the normalized fragment sequence of each of any one or more fragments in any one or more chromosomes of interest Calculated as the ratio to the number of sequence tags identified.

他の実施形態において、ステップ（ｃ）は、関心対象断片の配列タグ比の計算を、関心対象断片に対して得た配列タグ数を関心対象断片の長さに関連付けし、また関心対象断片の対応する正規化断片配列のタグ数を正規化断片配列の長さに関連付けし、また関心対象断片の断片ドースを、関心対象断片の配列タグ密度と正規化断片配列の配列タグ密度との比として計算することによって行う。すべての関心対象染色体それぞれに対してこの計算を繰り返す。ステップ（ａ）〜（ｄ）を、異なる母体検体からのテストサンプルに対して繰り返すことができる。 In another embodiment, step (c) comprises calculating the sequence tag ratio of the fragment of interest by associating the number of sequence tags obtained for the fragment of interest with the length of the fragment of interest and The number of tags in the corresponding normalized fragment sequence is related to the length of the normalized fragment sequence, and the fragment dose of the fragment of interest as Do by calculating. Repeat this calculation for every chromosome of interest. Steps (a)-(d) can be repeated for test samples from different maternal specimens.

異なるサンプルセットの断片ドースを比較する手段は、テストサンプルにおける断片ドースを、適格サンプルセットにおける対応の断片ドースの平均に関連付ける正規化断片値（ＮＳＶ：normalized segment value）を決定することによって得ることができる。ＮＳＶは、次式のように計算される。
ここで、
は、それぞれ適格サンプルセットにおけるｊ番断片ドースに対する推定した平均及び標準偏差であり、ｘ_ijはテストサンプルｉにおける観測したｊ番断片ドースである。 Means for comparing fragment doses of different sample sets can be obtained by determining a normalized segment value (NSV) that relates the fragment doses in the test sample to the average of the corresponding fragment doses in the eligible sample set. it can. NSV is calculated as follows:
here,
Are the estimated mean and standard deviation for the jth fragment dose in the eligible sample set, respectively, and x _ij is the observed jth fragment dose in test sample i.

若干の実施形態において、少なくとも１つの部分胎児染色体異数性の有無を決定する。他の実施形態において、２つ、３つ、４つ、５つ、６つ、７つ、８つ、９つ、１０、１５、２０、２５、又はそれ以上の、部分胎児染色体異数性の有無を１つのサンプル内で決定する。一実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１個から選択した１個の関心対象断片は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択する。他の実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した２個又はそれ以上の関心対象断片を、１番，２番，３番，４番，５番，６番，７番，８番，９番，１０番，１１番，１２番，１３番，１４番，１５番，１６番，１７番，１８番，１９番，２０番，２１番，２２番染色体、Ｘ染色体及びＹ染色体のうち任意の２つ又はそれ以上から選択する。一実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象断片は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した少なくとも１個、５個、１０個、１５個、２０個、２５個又はそれ以上の断片を有し、この場合、少なくとも１個、５個、１０個、１５個、２０個、２５個の異なった部分胎児染色体異数性の有無を決定する。決定することができる異なる部分胎児染色体異数性としては、部分重複、部分増殖、部分挿入、及び部分欠失がある。部分胎児染色体異数性の例としては、常染色体の部分モノソミー、及び部分トリソミーがある。常染色体の部分モノソミーとしては、１番染色体の部分モノソミー、４番染色体の部分モノソミー、５番染色体の部分モノソミー、７番染色体の部分モノソミー、１１番染色体の部分モノソミー、１５番染色体の部分モノソミー、１７番染色体の部分モノソミー、１８番染色体の部分モノソミー、及び２２番染色体の部分モノソミー、がある。本発明方法によって決定できる他の部分胎児染色体異数性を以下に説明する。 In some embodiments, the presence or absence of at least one partial fetal chromosomal aneuploidy is determined. In other embodiments, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, or more partial fetal chromosomal aneuploidies Presence / absence is determined within one sample. In one embodiment, one fragment of interest selected from any one of chromosomes 1-22, X, and Y is selected from chromosomes 1-22, X, and Y. In another embodiment, two or more fragments of interest selected from chromosomes 1-22, X chromosome and Y chromosome are numbered 1, 2, 3, 4, 5, 6, 7 No. 8, No. 9, No. 10, No. 11, No. 12, No. 13, No. 14, No. 15, No. 16, No. 17, No. 18, No. 19, No. 20, No. 21, No. 22 chromosome, X chromosome And any two or more of the Y chromosomes. In one embodiment, any one or more fragments of interest selected from Chromosomes 1-22, X and Y are at least one selected from Chromosomes 1-22, X and Y, Have five, ten, fifteen, twenty, twenty-five or more fragments, in which case at least one, five, ten, fifteen, twenty, twenty-five different partial fetal chromosomes Determine the presence or absence of aneuploidy. Different partial fetal chromosomal aneuploidies that can be determined include partial duplication, partial growth, partial insertion, and partial deletion. Examples of partial fetal chromosomal aneuploidies include autosomal partial monosomy and partial trisomy. As the autosomal partial monosomy, the partial monosomy of chromosome 1, the partial monosomy of chromosome 5, the partial monosomy of chromosome 7, the partial monosomy of chromosome 7, the partial monosomy of chromosome 11, the partial monosomy of chromosome 15, There is a partial monosomy of chromosome 17, a partial monosomy of chromosome 18, and a partial monosomy of chromosome 22. Other partial fetal chromosome aneuploidies that can be determined by the method of the present invention are described below.

上述した実施形態の任意の１つにおいて、テストサンプルは、血液、血漿、血清、尿、及び唾液のサンプルから選択した母体サンプルとする。若干の実施形態において、母体サンプルは血漿サンプルとする。母体サンプルの核酸分子は、胎児及び母体の無細胞ＤＮＡ分子の混合物である。核酸のシークエンシングは、本明細書のいたるところに記載したように、次世代シークエンシング（ＮＧＳ）を使用して行うことができる。他の実施形態において、シークエンシングは、可逆色素ターミネーターによるシークエンシング・バイ・シンセシスを使用する大量並列シークエンシングとする。他の実施形態において、シークエンシングは、シークエンシング・バイ・リゲーションとする。さらに他の実施形態において、シークエンシングは単独分子シークエンシングとする。随意的に増幅ステップをシークエンシングに先立って行う。 In any one of the embodiments described above, the test sample is a maternal sample selected from blood, plasma, serum, urine, and saliva samples. In some embodiments, the maternal sample is a plasma sample. The nucleic acid molecule of the maternal sample is a mixture of fetal and maternal cell-free DNA molecules. Nucleic acid sequencing can be performed using Next Generation Sequencing (NGS), as described elsewhere herein. In other embodiments, the sequencing is massively parallel sequencing using sequencing by synthesis with a reversible dye terminator. In other embodiments, the sequencing is sequencing by ligation. In still other embodiments, the sequencing is single molecule sequencing. Optionally, an amplification step is performed prior to sequencing.

臨床的障害のＣＮＶ決定
出生異常の早期決定の他に、本明細書に記載する方法は、ゲノムにおける遺伝子配列の表現における何らかの異常性決定に適用することができる。 CNV determination of clinical disorders In addition to the early determination of birth defects, the methods described herein can be applied to any abnormality determination in the representation of gene sequences in the genome.

がん患者からの血液における血漿及び血清ＤＮＡは測定可能な量の腫瘍ＤＮＡを含み、これら腫瘍ＤＮＡを回収し、腫瘍ＤＮＡの代理ソースとして使用することができ、また腫瘍には、遺伝子配列若しくは全体染色体においてさえも、異数性又は不適切な数に特徴がある。個体からのサンプルにおける所定配列、すなわち、関心対象配列の総量における相違決定を内科的疾患の診断に使用することができる。若干の実施形態において、本発明方法を使用して、がんを患っていることが疑われる又は既知の患者における染色体異数性の有無を決定することができる。さらに、本発明方法は、疾患状態の有無を決定する、病原体、例えばウイルスの核酸の有無を決定する、移植片対宿主拒絶反応（ＧＶＨＤ：graft versus host disease）に関連する染色体異常を決定する、及び法医学解析における個体の関与を決定するのにも使用することができる。 Plasma and serum DNA in blood from cancer patients contains measurable amounts of tumor DNA, which can be recovered and used as a surrogate source of tumor DNA, and tumors can contain gene sequences or whole Even in the chromosome, it is characterized by aneuploidy or inappropriate numbers. A determination of the difference in a given sequence in a sample from an individual, i.e. the total amount of the sequence of interest, can be used to diagnose a medical disease. In some embodiments, the methods of the invention can be used to determine the presence or absence of chromosomal aneuploidy in a patient suspected or known to suffer from cancer. Furthermore, the method of the invention determines the presence or absence of a disease state, determines the presence or absence of a pathogen, such as a viral nucleic acid, determines a chromosomal abnormality associated with graft versus host disease (GVHD), And can also be used to determine individual involvement in forensic analysis.

本発明の実施形態は、異なる２つのゲノム由来の核酸混合物を含み、１個又はそれ以上の関心対象配列の総量に違いがあることが既知又は疑われるテストサンプルにおける関心対象配列、例えば、臨床関連配列のコピー数多型を評価する方法を提供する。核酸混合物は２つ又はそれ以上の細胞型に由来する。一実施形態において、核酸混合物は、内科的疾患、例えばがんを患っている検体から採取した正常細胞及びがん性細胞に由来する。 Embodiments of the present invention comprise a mixture of nucleic acids from two different genomes and include a sequence of interest in a test sample known or suspected of having a difference in the total amount of one or more sequences of interest, eg, clinically relevant A method for evaluating copy number variation of a sequence is provided. The nucleic acid mixture is derived from two or more cell types. In one embodiment, the nucleic acid mixture is derived from normal and cancerous cells taken from a specimen suffering from a medical disease, such as cancer.

がんの発症には、しばしば全体染色体の数における変動、すなわち完全染色体異数性、及び／又は染色体断片の数の変動、すなわち部分異数性を伴うことがよくあり、これらは染色体不安定性（ＣＩＮ：chromosome instability）として知られているプロセスによって生ずる（Thoma et al.,Swiss Med Weekly 2011:141:w13170参照）。多くの固形腫瘍、例えば乳がんは、初期位置から数個の遺伝子異常の蓄積による転移として進行する（Sato et al., Cancer Res., 50:7184-7189 [1990]; Jongsma et.al., J Clin Pathol: Mol Path 55:305-309 [2002]参照）。このような遺伝子異常は、蓄積すると、増殖性優位、遺伝的不安定性、及び薬物抵抗性が急激に進展する付随能力、亢進する血管形成、タンパク質分解及び転移をもたらす。この遺伝子異常は、減退する「腫瘍抑制遺伝子」又は優勢に作用する腫瘍遺伝子のいずれかの影響をもたらす。ヘテロ接合性喪失（ＬＯＨ：loss of heterozygosity）にいたる欠失及び遺伝子組換えは、突然変異腫瘍抑制対立遺伝子をカバーしないことによって、腫瘍進行に大きな役割を果たすと信じられている。 The onset of cancer is often accompanied by variations in the number of whole chromosomes, ie, complete chromosomal aneuploidy, and / or changes in the number of chromosome fragments, ie, partial aneuploidy, which are chromosomal instability ( It is caused by a process known as CIN (chromosome instability) (see Thomas et al., Swiss Med Weekly 2011: 141: w13170). Many solid tumors, such as breast cancer, progress as metastases from the initial location by the accumulation of several genetic abnormalities (Sato et al., Cancer Res., 50: 7184-7189 [1990]; Jongsma et.al., J Clin Pathol: Mol Path 55: 305-309 [2002]). Such genetic abnormalities, when accumulated, result in proliferative predominance, genetic instability, and the accompanying ability to rapidly develop drug resistance, increased angiogenesis, proteolysis and metastasis. This genetic abnormality results in the effect of either a “tumor suppressor gene” that declines or a predominantly acting oncogene. Deletions and genetic recombination leading to loss of heterozygosity (LOH) are believed to play a major role in tumor progression by not covering mutant tumor suppressor alleles.

ｃｆＤＮＡは、悪性腫瘍と診断された患者の血液循環内で見つかっており、悪性腫瘍としては、限定しないが、肺がん（Pathak et al., Clin Chem 52:1833-1842 {2006]参照）、前立腺がん（Schwartzenbach et al., Clin Cancer Res 15:1032-8 [2009]参照）、及び乳がん（Schwartzenbach et al., breast-cancer-research.com/content/11/5/R7 [2009]参照）で見つかっている。がん患者における循環ｃｆＤＮＡで決定することができるがんに関連するゲノム不安定性の同定は、潜在的な診断及び予測ツールである。一実施形態において、本発明方法は、がん、例えば、上皮性悪性腫、非上皮性悪性腫瘍、リンパ腫、白血病、胚細胞腫瘍、及び芽細胞腫に罹患していることが疑われる又は既知である検体に由来する核酸混合物を有するサンプルにおける関心対象ＣＮＶを評価する。一実施形態において、サンプルは末梢血液から採取（処理）した血漿サンプルとし、この血漿サンプルは正常細胞及びがん性細胞に由来するｃｆＤＮＡの混合物を含む。他の実施形態において、ＣＮＶが存在するか否かを決定するのに必要な生物学的サンプルは、他の生体液、例えば、限定しないが、血清、汗、涙、痰、尿、精液、内耳液、リンパ液、唾液、脳脊髄、破出液、骨髄懸濁液、膣液、経頸管洗浄液、脳液、腹水、母乳、呼吸器官、腸管及び泌尿生殖器管の分泌物、羊水、及び白血球共生サンプルからの、又は組織生検、綿棒若しくは塗抹採取試料における、がん性細胞及び非がん性細胞の混合物に由来するものとする。 cfDNA has been found in the blood circulation of patients diagnosed with malignant tumors, including but not limited to lung cancer (see Pathak et al., Clin Chem 52: 1833-1842 {2006]), prostate (See Schwartzenbach et al., Clin Cancer Res 15: 1032-8 [2009]) and breast cancer (see Schwartzenbach et al., Breast-cancer-research.com/content/11/5/R7 [2009]) Has been found. Identification of cancer-related genomic instabilities that can be determined with circulating cfDNA in cancer patients is a potential diagnostic and predictive tool. In one embodiment, the method of the invention is suspected or known to be afflicted with cancer, eg, epithelial malignant, non-epithelial malignant, lymphoma, leukemia, germ cell tumor, and blastoma. Evaluate the CNV of interest in a sample having a nucleic acid mixture derived from an analyte. In one embodiment, the sample is a plasma sample collected (processed) from peripheral blood, the plasma sample comprising a mixture of cfDNA derived from normal and cancerous cells. In other embodiments, the biological sample required to determine whether CNV is present is other biological fluid, such as, but not limited to, serum, sweat, tears, sputum, urine, semen, inner ear Fluid, lymph, saliva, cerebrospinal fluid, rupture fluid, bone marrow suspension, vaginal fluid, transcervical lavage fluid, brain fluid, ascites, breast milk, respiratory organs, intestinal and genitourinary tract secretions, amniotic fluid, and leukocyte symbiosis samples Or from a mixture of cancerous and non-cancerous cells from a tissue biopsy, swab or smear sample.

関心対象配列は、がんの発症及び／又は進行に役割を果たすことが既知である、又は疑われる核酸配列である。関心対象配列の例としては、核酸配列、すなわち、完全染色体及び／又は染色体断片があり、これらは以下に説明するように、がん性細胞内で増幅又は欠失される。 A sequence of interest is a nucleic acid sequence that is known or suspected to play a role in the development and / or progression of cancer. Examples of sequences of interest are nucleic acid sequences, ie complete chromosomes and / or chromosomal fragments, which are amplified or deleted in cancerous cells as described below.

一実施形態において、本発明方法は、染色体増幅の有無を決定するのに使用することができる。若干の実施形態において、染色体増幅は、１個又はそれ以上の染色体の過剰生成である。他の実施形態において、染色体増幅は、１個又はそれ以上の染色体断片の過剰生成である。さらに他の実施形態において、染色体増幅は、２個又はそれ以上の染色体における２個又はそれ以上の断片の過剰生成である。染色体増幅は、１個又はそれ以上のがん遺伝子の過剰生成である。 In one embodiment, the methods of the invention can be used to determine the presence or absence of chromosomal amplification. In some embodiments, chromosomal amplification is the overproduction of one or more chromosomes. In other embodiments, chromosomal amplification is the overproduction of one or more chromosomal fragments. In still other embodiments, chromosomal amplification is the overproduction of two or more fragments on two or more chromosomes. Chromosomal amplification is the overproduction of one or more oncogenes.

ヒトの固形腫瘍に関連して優勢的に活動する遺伝子は、過剰表現又は異形表現として効果を発揮する。遺伝子増幅は、遺伝子表現の上方調節に向かう共通のメカニズムである。細胞発生に関する研究からの証拠は、ヒトの乳がんの５０％以上にもわたり大きな増幅を生ずることを示している。最も注目すべきは、１７番染色体（17(17q21-q22)）に位置するがん原遺伝子に対するヒトにおける上皮細胞増殖因子受容体２（ＨＥＲ２：human epidermal growth factor receptor 2）の増幅によって、細胞表面におけるＨＥＲ２の過剰表現となり、乳がん及び他の悪性腫瘍における過剰なかつ無調節なシグナルを発生することになる（Park et al., Clinical Breast Cancer 8:392-401 [2008]参照）。様々ながん遺伝子が、ヒトの他の悪性腫瘍で増幅されることが分かってきた。ヒトの腫瘍における細胞性がん遺伝子の増幅は、前骨髄球性白血病細胞株ＨＬ６０及び小細胞肺がん細胞株におけるｃ-ｍｙｃ、原発性神経芽細胞腫（ステージIII及びIV）、神経芽腫細胞株、網膜芽細胞腫細胞株、原発性腫瘍、小細胞肺がん株、及び腫瘍におけるＮ-ｍｙｃ、小細胞肺がん細胞株及び腫瘍におけるＬ-ｍｙｃ、急性骨髄性白血病及び、大腸がん細胞株におけるｃ-ｍｙｂ、表皮がん細胞及び原発性神経膠腫におけるｃ-ｅｒｂｂ、肺、大腸、膀胱、及び直腸の原発性上皮性悪性腫瘍におけるｃ-Ｋ-ｒａｓ-２、乳房上皮性悪性腫瘍細胞株におけるＮ-ｒａｓの増幅を有する（Varmus H., Ann Rev Genetics 18:553-612 (1984)[cited in Watson et al., Molecular Biology of the Gene(4th ed.;Benjamin/Cummings Publishing Co. 1987)]参照）。 Genes that are predominantly associated with human solid tumors are effective as overexpression or phenotypic expression. Gene amplification is a common mechanism towards upregulation of gene expression. Evidence from studies on cell development indicates that over 50% of human breast cancers produce large amplifications. Most notably, the cell surface is amplified by the amplification of human epidermal growth factor receptor 2 (HER2) in humans against the proto-oncogene located on chromosome 17 (17 (17q21-q22)). Overexpression of HER2 in humans and generate excessive and unregulated signals in breast cancer and other malignancies (see Park et al., Clinical Breast Cancer 8: 392-401 [2008]). Various oncogenes have been found to be amplified in other human malignancies. Amplification of cellular oncogenes in human tumors includes c-myc, promyelocytic leukemia cell line HL60 and small cell lung cancer cell lines, primary neuroblastoma (stage III and IV), neuroblastoma cell lines , Retinoblastoma cell lines, primary tumors, small cell lung cancer lines, and N-myc in tumors, small cell lung cancer cell lines and L-myc in tumors, acute myeloid leukemia, and c- in colon cancer cell lines myb, c-erbb in epidermoid carcinoma cells and primary glioma, c-K-ras-2 in primary epithelial malignancies of lung, colon, bladder and rectum, N in breast epithelial malignant cell lines with amplification of -ras (Varmus H., Ann Rev Genetics 18: 553-612 (1984) [cited in Watson et al., Molecular Biology of the Gene (4th ed .; Benjamin / Cummings Publishing Co. 1987)] ).

一実施形態において、本発明方法を使用して染色体欠失の有無を決定することができる。若干の実施形態において、染色体欠失は、１個又はそれ以上の全体染色体の喪失とする。他の実施形態において、染色体欠失は、１個の染色体における１個又はそれ以上の断片の喪失とする。さらに他の実施形態において、染色体欠失は、２個又はそれ以上の染色体における２個又はそれ以上の断片の喪失とする。染色体欠失は、１個又はそれ以上の腫瘍抑制遺伝子の喪失を含むものとすることができる。 In one embodiment, the method of the invention can be used to determine the presence or absence of a chromosomal deletion. In some embodiments, the chromosomal deletion is a loss of one or more whole chromosomes. In other embodiments, the chromosomal deletion is a loss of one or more fragments on one chromosome. In still other embodiments, the chromosomal deletion is a loss of two or more fragments on two or more chromosomes. A chromosomal deletion can include the loss of one or more tumor suppressor genes.

腫瘍抑制遺伝子を含む染色体欠失は、固形腫瘍の発症及び進行に重要な役割を果たす。染色体１３ｑ１４に位置する網膜芽細胞腫抑制遺伝子（Ｒｂ-１）は最も強く特徴付けられた腫瘍抑制遺伝子である。Ｒｂ-１遺伝子産物である１０５ｋＤａ細胞核リンタンパク質は細胞サイクル調節に重要な役割を果たす（Howe et al., Proc Natl Acad Sci (USA) 87:5883-5887 [1990]参照）。Ｒｂタンパク質の変化又は喪失表現は、点（突然）変異又は染色体欠失のいずれかによって、両方の対立遺伝子の不活性化により生ずる。Ｒｂ-ｉ遺伝子の変化は、網膜芽細胞腫だけでなく、他の悪性腫瘍、例えば骨肉腫、小細胞肺がん（Rygaard et al., Cancer Res 50:5312-5317 [1990]参照）、及び乳がんでもあることが分かっている。制限断片長多型（ＲＦＬＰ：restriction fragment length polymorphism）研究は、このような腫瘍タイプは１３ｑでしばしばヘテロ接合性を喪失しており、Ｒｂ-１の対立遺伝子のうち一方が全体的な染色体欠失によって喪失していることを示唆するものであることを示した（Bowcock et al., Am J Hum Genet, 46:12 [1990]参照）。重複、欠失並びに６番染色体及び他のパートナー染色体を巻き込む不均衡転座を含む、１番染色体の異常は、１番染色体の領域、とくに、１ｑ２１〜１ｑ３２及び１ｐ１１〜１３が、骨髄増殖性新生物の慢性及び進行期双方に病因的に関連するがん遺伝子又は腫瘍抑制遺伝子を持っていることを示す（Caramazza et al., Eur J Hematol 84:191-200 [2010]参照）。骨髄増殖性新生物は５番染色体の欠失にも関連する。５番染色体の完全喪失又は間質的欠失は、骨髄異形成症候群（ＭＤＳ：myelodysplastic syndromes）における最も共通する核型異常である。単独del(5q)/5qのＭＤＳ患者は、付加的核型欠陥を有する患者よりも見込みある予後診断を受け、付加的核型欠陥を有する患者は骨髄増殖性新生物（ＭＰＮｓ：myeloproliferative neoplasms）及び急性骨髄性白血病を発症する傾向がある。不均衡となる５番染色体欠失の頻度は、５ｑには造血幹細胞／前駆細胞（ＨＳＣ／ＨＰＣ）の成長制御に基本的な役割を持つ１個又はそれ以上の腫瘍抑制遺伝子があるという着想に至る。５ｑ３１及び５ｑ３２に中心がある共通欠失領域（ＣＤＲ：commonly deleted regions）の細胞発生マッピングは、リボソーム・サブユニットＲＰＳ１４、転写因子Ｅｇｒｌ／Ｋｒｏｘ２０、細胞骨格再構成タンパク質及びαカテニンを含む腫瘍抑制遺伝子候補を同定した（Eisenmann et al., Oncogene 28:3429-3441 [2009]参照）。新鮮腫瘍及び腫瘍細胞株の細胞発生及びアレロタイプ化の研究は、３ｐ２５，３ｐ２１〜２２、３ｐ２１.３、３ｐ１２〜１３及び３ｐ１４を含む３番染色体の幾つかの明確な領域からのアレル欠失（対立遺伝子欠損）が、肺、乳房、腎臓、頭頸部、卵巣、頸部、大腸、膵臓、食道、膀胱、及び他の臓器における深刻な広範囲の上皮がんに見られる、最も初期に最も頻発するゲノム以上である。幾つかの腫瘍抑制遺伝子が染色体の３ｐ領域にマッピングされ、そして考えられることは、間質欠失又はプロモーター過剰メチル化が上皮性悪性腫瘍発症における３ｐ又は３番染色体全体の喪失に先行して生ずるということである（Angeloni D.,Briefings Functional Genomics 6:19-39 [2007]参照）。ダウン症候群（ＤＳ）を持つ新生児及び子供は、先天性一過性白血病を呈することがよくあり、また急性骨髄性白血病及び急性リンパ芽球性白血病のリスクが増大する。約３００個の遺伝子を有する２１番染色体は、多くの構造的異常、例えば、転座、欠失、及び増幅が白血病、リンパ腫、及び固形腫瘍に含まれる。さらに、２１番染色体に位置する遺伝子は腫瘍形成に重要な役割を果たすことが同定された。体細胞に関する並びに構造上の２１番染色体異常は白血病に関連し、また２１ｑに位置するＲＵＮＸ１、ＴＭＰＲＳＳ２及びＴＦＦを含む特定遺伝子が腫瘍形成に役割を果たす（Fonatsch C Gene Chromosomes Cancer 49:497-508 [2010]参照）。 Chromosomal deletions, including tumor suppressor genes, play an important role in the development and progression of solid tumors. The retinoblastoma suppressor gene (Rb-1) located on chromosome 13q14 is the most strongly characterized tumor suppressor gene. The 105 kDa nuclear phosphoprotein, the Rb-1 gene product, plays an important role in cell cycle regulation (see Howe et al., Proc Natl Acad Sci (USA) 87: 5883-5887 [1990]). Rb protein alteration or loss expression results from inactivation of both alleles, either by point (abrupt) mutations or chromosomal deletions. Changes in the Rb-i gene are not only in retinoblastoma but also in other malignant tumors such as osteosarcoma, small cell lung cancer (see Rygaard et al., Cancer Res 50: 5312-5317 [1990]), and breast cancer. I know that there is. Restriction fragment length polymorphism (RFLP) studies have shown that these tumor types are often heterozygous at 13q and one of the Rb-1 alleles is a gross chromosomal deletion (See Bowcock et al., Am J Hum Genet, 46:12 [1990]). Chromosome 1 abnormalities, including duplications, deletions and imbalanced translocations involving Chromosome 6 and other partner chromosomes, include regions of Chromosome 1, particularly 1q21-1q32 and 1p11-13, which are myeloproliferative It shows having an oncogene or tumor suppressor gene that is etiologically related to both the chronic and advanced stages of the organism (see Caramazza et al., Eur J Hematol 84: 191-200 [2010]). Myeloproliferative neoplasms are also associated with deletion of chromosome 5. Complete loss or interstitial deletion of chromosome 5 is the most common karyotypic abnormality in myelodysplastic syndromes (MDS). Individual del (5q) / 5q MDS patients have a better prognosis than patients with additional karyotype defects, and patients with additional karyotype defects are myeloproliferative neoplasms (MPNs) and Tend to develop acute myeloid leukemia. The frequency of unbalanced chromosome 5 deletion is based on the idea that 5q has one or more tumor suppressor genes that have a fundamental role in the growth control of hematopoietic stem / progenitor cells (HSC / HPC). It reaches. Cytogenetic mapping of commonly deleted regions (CDRs) centered on 5q31 and 5q32 is a candidate tumor suppressor gene comprising ribosome subunit RPS14, transcription factor Egrl / Krox20, cytoskeletal reconstitution protein and α-catenin (See Eisenmann et al., Oncogene 28: 3429-3441 [2009]). Cytogenetic and allelotyping studies of fresh tumors and tumor cell lines have shown that allelic deletions from several distinct regions of chromosome 3 including 3p25, 3p21-22, 3p21.3, 3p12-13 and 3p14 (allele) The earliest and most frequent genome is found in severe, wide-ranging epithelial cancers in the lung, breast, kidney, head and neck, ovary, neck, colon, pancreas, esophagus, bladder, and other organs That's it. Several tumor suppressor genes are mapped to the 3p region of the chromosome, and it is believed that stroma deletion or promoter hypermethylation precedes the loss of 3p or entire chromosome 3 in the development of epithelial malignancies (See Angeloni D., Briefings Functional Genomics 6: 19-39 [2007]). Neonates and children with Down syndrome (DS) often present with congenital transient leukemia and an increased risk of acute myeloid leukemia and acute lymphoblastic leukemia. Chromosome 21 with about 300 genes includes many structural abnormalities, such as translocations, deletions, and amplifications, in leukemias, lymphomas, and solid tumors. Furthermore, the gene located on chromosome 21 was identified to play an important role in tumorigenesis. Chromosome 21 abnormalities related to somatic cells and structurally are associated with leukemia, and specific genes including RUNX1, TMPRSS2 and TFF located at 21q play a role in tumorigenesis (Fonatsch C Gene Chromosomes Cancer 49: 497-508 [ 2010]).

一実施形態において、本発明方法は、遺伝子増幅と腫瘍進行度との関連性を評価する手段を提供する。増幅及び／又は欠失とがんのステージ又は悪性度との相関性は予測の上で重要であり、なぜなら、このような情報は遺伝的腫瘍悪性度の定義に寄与し、この悪性度は最悪の予測が持たれるより進行した腫瘍での将来的病気進行をよりよく予測する。さらに、初期増幅及び／又は欠失に関する情報は、将来的病気進行の予測因子としてこれら事象を評価する上で有用である。本発明方法によって同定されるように、遺伝子増幅及び欠失は、他の既知のパラメータ、例えば、腫瘍悪性度、組織構造、Ｂｒｄ／Ｕｒｄラベル付け指標、ホルモン状態、リンパ節の転移、腫瘍サイズ、生存期間、並びに疫学的及び生物統計学的研究から得られる他の腫瘍特性に関連付けることができる。例えば、本発明方法によってテストすべき腫瘍ＤＮＡとしては、異型過形成、非浸潤性乳管がん、ステージＩ〜IIIのがん、及び転移リンパ節があり、増幅及び欠失とステージとの関連性の同定を行うことができる。なされた関連性はできるだけ有効な治療介入を行うことができる。例えば、絶えず増幅される領域は過剰表現された遺伝子を含み、この遺伝子の生成物を治療的に攻撃することができる（例えば、成長因子受容体チロシン・キナーゼ、ｐ１８５^ＨＥＲ２）。 In one embodiment, the method of the present invention provides a means for assessing the association between gene amplification and tumor progression. The correlation between amplification and / or deletion and cancer stage or grade is important for prediction, because such information contributes to the definition of genetic tumor grade, which is the worst Better predict future disease progression in more advanced tumors with the prediction of In addition, information regarding initial amplification and / or deletion is useful in assessing these events as predictors of future disease progression. As identified by the method of the present invention, gene amplification and deletion can be achieved by other known parameters such as tumor grade, tissue structure, Brd / Urd labeling index, hormone status, lymph node metastasis, tumor size, Can be related to survival and other tumor characteristics obtained from epidemiological and biostatistical studies. For example, tumor DNA to be tested by the method of the present invention includes atypical hyperplasia, noninvasive ductal carcinoma, stage I-III cancer, and metastatic lymph nodes, and the relationship between amplification and deletion and stage Sex identification can be performed. The associations that have been made can provide the most effective therapeutic intervention possible. For example, a region that is constantly amplified contains an overexpressed gene, and the product of this gene can be therapeutically attacked (eg, growth factor receptor tyrosine kinase, p185 ^HER2 ).

本発明方法を使用して、原発がんから他の部位に転移したがん細胞の核酸配列のコピー数多型を決定することによって、薬物抵抗性に関連する増幅及び／又は欠失事象同定することができる。遺伝子増幅及び／又は欠失が薬物抵抗性の急速な発展を可能にする核型不安定性の発現である場合、化学療法に感受性のある患者の腫瘍よりも、化学療法に耐性がある患者からの原発腫瘍により多くの増幅及び／又は欠失が予想される。例えば、特定遺伝子の増幅が薬物抵抗性の発達に寄与する場合、その遺伝子の周囲の領域が、化学療法に耐性がある患者の原発腫瘍ではなく、胸膜滲出からの腫瘍細胞で絶えず増幅されていることが予想される。遺伝子増幅及び／又は欠失と薬物耐性発達との間の関連性を発見することは、術後補助（アジュバント）療法が有益か否かの患者同定を可能にする。 Identifying amplification and / or deletion events related to drug resistance by determining the copy number variation of the nucleic acid sequence of cancer cells that have metastasized from the primary cancer to other sites using the method of the present invention. Can do. If gene amplification and / or deletion is the development of karyotypic instability that allows rapid development of drug resistance, the tumor from patients who are resistant to chemotherapy rather than the tumor of patients who are sensitive to chemotherapy More amplification and / or deletion is expected in the primary tumor. For example, if amplification of a particular gene contributes to the development of drug resistance, the region surrounding that gene is constantly amplified with tumor cells from pleural effusion rather than the primary tumor of a patient resistant to chemotherapy It is expected that. Finding an association between gene amplification and / or deletion and drug resistance development allows patient identification of whether postoperative adjuvant therapy is beneficial.

母体サンプルにおける完全及び／又は部分的な胎児染色体異数性の有無を決定することにつき説明したのと同様に、本発明方法を使用して、核酸、例えば、ＤＮＡ又はｃｆＤＮＡを含む任意の患者サンプル（母体サンプルではない患者サンプルを含む）における完全及び／又は部分的な染色体異数性の有無を決定することができる。患者サンプルは、本明細書のいたるところに記載したように、任意の生物学的サンプルとすることができる。好適には、サンプルは非侵襲的手順で採取する。例えば、サンプルは血液サンプル又は血液の血清及び血漿画分とすることができる。代案として、サンプルは尿サンプル又は糞便サンプルとすることができる。さらに他の実施形態において、サンプルは組織生検サンプルとする。すべてのケースで、サンプルは核酸、例えば、ｃｆＤＮＡ又はゲノムＤＮＡを含み、これらＤＮＡを精製し、上述のような任意のＮＧＳシークエンシングを使用してシークエンシングする。 Similar to that described for determining the presence or absence of complete and / or partial fetal chromosomal aneuploidy in a maternal sample, any patient sample containing nucleic acid, eg, DNA or cfDNA, using the method of the present invention. The presence or absence of complete and / or partial chromosomal aneuploidy (including patient samples that are not maternal samples) can be determined. The patient sample can be any biological sample, as described elsewhere herein. Preferably, the sample is taken by a non-invasive procedure. For example, the sample can be a blood sample or blood serum and plasma fraction. Alternatively, the sample can be a urine sample or a stool sample. In yet other embodiments, the sample is a tissue biopsy sample. In all cases, the sample contains nucleic acid, such as cfDNA or genomic DNA, which is purified and sequenced using any NGS sequencing as described above.

フォーメーションに関連する完全及び部分的染色体異数性、並びにがん進行の双方を本発明方法により決定することができる。 Both complete and partial chromosomal aneuploidy associated with formation and cancer progression can be determined by the method of the present invention.

患者サンプルにおける完全染色体異数性の決定
一実施形態において、本発明は、核酸分子を含む患者テストサンプルにおける異なった任意の１つ又はそれ以上の完全染色体異数性の有無を決定する方法を提供する。若干の実施形態において、本発明方法は、任意の１つ又はそれ以上の異なった完全染色体異数性の有無を決定する。本発明方法のステップとしては、（ａ）患者テストサンプルにおける患者核酸の配列情報を取得するステップと、及び（ｂ）配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体における配列タグ数を同定し、また任意の１個又はそれ以上の関心対象染色体における正規化染色体配列の配列タグ数を同定するステップとを有する。正規化染色体配列は単独染色体とするか、又は１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した染色体グループとすることができる。本発明方法は、さらに、ステップ（ｃ）において、任意の１個又はそれ以上の関心対象染色体それぞれに対して同定した配列タグ数、及び各正規化染色体配列に対して同定した配列タグ数を使用して、任意の１個又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算し、及びステップ（ｄ）で任意の１個又はそれ以上の関心対象染色体それぞれにおける各単独染色体ドースを任意の１個又はそれ以上の関心対象染色体それぞれの閾値と比較し、これにより患者テストサンプルにおける任意の１つ又はそれ以上の異なった完全患者異数性の有無を決定する。 Determining Complete Chromosome Aneuploidy in a Patient Sample In one embodiment, the present invention provides a method for determining the presence or absence of any different one or more complete chromosomal aneuploidies in a patient test sample comprising a nucleic acid molecule. To do. In some embodiments, the methods of the invention determine the presence or absence of any one or more different complete chromosomal aneuploidies. The steps of the method of the present invention include (a) obtaining sequence information of patient nucleic acid in a patient test sample, and (b) selecting from chromosomes 1 to 22, X chromosome and Y chromosome using the sequence information. Identifying the number of sequence tags in any one or more chromosomes of interest and identifying the number of sequence tags of normalized chromosomal sequences in any one or more chromosomes of interest. The normalized chromosomal sequence can be a single chromosome or a chromosome group selected from chromosomes 1-22 and X and Y chromosomes. The method of the invention further uses in step (c) the number of sequence tags identified for each of any one or more chromosomes of interest and the number of sequence tags identified for each normalized chromosomal sequence. A single chromosomal dose for each of any one or more chromosomes of interest and a single chromosomal dose for each of any one or more of the chromosomes of interest in step (d). Compare to the threshold of each of the one or more chromosomes of interest, thereby determining the presence or absence of any one or more different complete patient aneuploidies in the patient test sample.

若干の実施形態において、ステップ（ｃ）は、関心対象染色体それぞれの単独染色体ドースを、関心対象染色体それぞれに対して同定した配列タグ数と、関心対象染色体それぞれの正規化染色体に対して同定した配列タグ数との比として計算する。 In some embodiments, step (c) comprises identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each chromosome of interest, and a sequence identified for the normalized chromosome for each chromosome of interest. Calculate as a ratio to the number of tags.

他の実施形態において、ステップ（ｃ）は、関心対象染色体それぞれの単独染色体ドースを、関心対象染色体それぞれに対して同定した配列タグ数と、関心対象染色体それぞれの正規化染色体に対して同定した配列タグ数との比として計算する。他の実施形態において、ステップ（ｃ）は、関心対象染色体の配列タグ比の計算を、関心対象染色体に対して得た配列タグ数を関心対象染色体の長さに関連付けし、また関心対象染色体の対応する正規化染色体配列のタグ数を正規化染色体配列の長さに関連付けし、また関心対象染色体の染色体ドースを、関心対象染色体の配列タグ密度と正規化配列の配列タグ密度との比として計算することによって行う。すべての関心対象染色体それぞれに対してこの計算を繰り返す。ステップ（ａ）〜（ｄ）を、異なる患者からのテストサンプルに対して繰り返すことができる。 In another embodiment, step (c) comprises the step of identifying a single chromosomal dose for each chromosome of interest, the number of sequence tags identified for each chromosome of interest, and the sequence identified for the normalized chromosome for each chromosome of interest. Calculate as a ratio to the number of tags. In other embodiments, step (c) comprises calculating the sequence tag ratio of the chromosome of interest by associating the number of sequence tags obtained for the chromosome of interest with the length of the chromosome of interest, Corresponding normalized chromosomal sequence tag number is related to normalized chromosomal sequence length, and chromosomal dose of the chromosome of interest is calculated as the ratio of the sequence tag density of the chromosome of interest to the sequence tag density of the normalized sequence By doing. Repeat this calculation for every chromosome of interest. Steps (a)-(d) can be repeated for test samples from different patients.

１つ又はそれ以上の完全染色体異数性を、無細胞ＤＮＡ分子を含むがん患者テストサンプルにおいて決定する実施形態の例は、（ａ）テストサンプルにおける患者の無細胞ＤＮＡ分子の少なくとも一部をシークエンシングして配列情報を取得するステップと、（ｂ）この配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の２０個又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、またその２０個又はそれ以上の関心対象染色体それぞれの正規化染色体の配列タグ数を同定するステップと、（ｃ）その２０個又はそれ以上の関心対象染色体それぞれに対して同定した配列タグ数及び各正規化染色体に対して同定した配列タグ数を使用して、その２０個又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び（ｄ）その２０個又はそれ以上の関心対象染色体それぞれの各単独染色体ドースを、２０個又はそれ以上の関心対象染色体それぞれの閾値と比較し、またこれにより患者テストサンプルにおける任意の２０又はそれ以上の異なった完全染色体異数性の有無を決定するステップとを有する。 Examples of embodiments for determining one or more complete chromosomal aneuploidies in a cancer patient test sample comprising a cell-free DNA molecule include: (a) at least a portion of a patient's cell-free DNA molecule in the test sample; Sequencing to obtain sequence information; (b) using this sequence information, each of any 20 or more selected chromosomes of interest selected from chromosomes 1-22, X and Y. Identifying a sequence tag number and identifying a normalized chromosome sequence tag number for each of the 20 or more chromosomes of interest; (c) for each of the 20 or more chromosomes of interest Using each identified sequence tag number and the identified sequence tag number for each normalized chromosome, each of the 20 or more chromosomes of interest alone Calculating a chromosomal dose; and (d) comparing each single chromosomal dose of each of the 20 or more chromosomes of interest to a threshold of each of the 20 or more chromosomes of interest, and thereby Determining the presence or absence of any 20 or more different complete chromosomal aneuploidies in the patient test sample.

他の実施形態において、上述の患者テストサンプルにおける任意の１つ又はそれ以上の異なった完全染色体異数性の有無を決定する方法は、関心対象染色体におけるドースを決定する正規化断片配列を使用する。この場合、この方法は、（ａ）テストサンプルにおける核酸の配列情報を取得するステップと、（ｂ）この配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１つ又はそれ以上の関心対象染色体それぞれの配列タグ数を同定し、またその１つ又はそれ以上の関心対象染色体それぞれの正規化断片配列の配列タグ数を同定するステップとを有する。正規化断片配列は、染色体の単独断片とするか、又は任意の１個又はそれ以上の異なった染色体からの断片グループとすることができる。この方法は、さらに、（ｃ）その任意の１個又はそれ以上の関心対象染色体それぞれに対して同定した配列タグ数及び各正規化断片配列に対して同定した配列タグ数を使用して、その任意の１個又はそれ以上の関心対象染色体それぞれの単独染色体ドースを計算するステップと、及び（ｄ）その任意の１個又はそれ以上の関心対象染色体それぞれの各単独染色体ドースを、任意の１個又はそれ以上の関心対象染色体それぞれの閾値と比較し、またこれにより患者テストサンプルにおける任意の１つ又はそれ以上の異なった完全染色体異数性の有無を決定するステップとを有する。 In other embodiments, the method for determining the presence or absence of any one or more different complete chromosomal aneuploidies in the patient test sample described above uses a normalized fragment sequence that determines the dose in the chromosome of interest. . In this case, the method comprises (a) obtaining the sequence information of the nucleic acid in the test sample, and (b) using the sequence information, any one selected from chromosomes 1 to 22, the X chromosome and the Y chromosome. Identifying the number of sequence tags for each of the one or more chromosomes of interest and identifying the number of sequence tags for the normalized fragment sequence of each of the one or more chromosomes of interest. A normalized fragment sequence can be a single fragment of a chromosome or a group of fragments from any one or more different chromosomes. The method further comprises (c) using the number of sequence tags identified for each of the one or more chromosomes of interest and the number of sequence tags identified for each normalized fragment sequence, Calculating a single chromosome dose for each of any one or more chromosomes of interest; and (d) any one single chromosome dose for each of the one or more chromosomes of interest Or comparing to a threshold for each of the more or less chromosomes of interest and thereby determining the presence or absence of any one or more different complete chromosomal aneuploidies in the patient test sample.

他の実施形態において、ステップ（ｃ）は、関心対象染色体の配列タグ比の計算を、関心対象染色体に対して得た配列タグ数を関心対象染色体の長さに関連付けし、また関心対象染色体の対応する正規化断片配列のタグ数を正規化断片配列の長さに関連付けし、また関心対象染色体の染色体ドースを、関心対象染色体の配列タグ密度と正規化断片配列の配列タグ密度との比として計算することによって行う。すべての関心対象染色体それぞれに対してこの計算を繰り返す。ステップ（ａ）〜（ｄ）を、異なる患者からのテストサンプルに対して繰り返すことができる。 In other embodiments, step (c) comprises calculating the sequence tag ratio of the chromosome of interest by associating the number of sequence tags obtained for the chromosome of interest with the length of the chromosome of interest, The number of tags in the corresponding normalized fragment sequence is related to the length of the normalized fragment sequence, and the chromosomal dose of the chromosome of interest is expressed as the ratio of the sequence tag density of the chromosome of interest to the sequence tag density of the normalized fragment sequence. Do by calculating. Repeat this calculation for every chromosome of interest. Steps (a)-(d) can be repeated for test samples from different patients.

異なるサンプルセットの染色体ドースを比較する手段は、テストサンプルにおける染色体ドースを、適格サンプルセットにおける対応の染色体ドースの平均に関連付ける正規化染色体値（ＮＣＶ：normalized chromosome value）を決定することによって得ることができる。ＮＣＶは、次式のように計算される。
ここで
は、それぞれ適格サンプルセットにおけるｊ番染色体ドースに対する推定した平均及び標準偏差であり、ｘ_ijはテストサンプルｉにおける観測したｊ番染色体ドースである。 Means for comparing chromosomal doses of different sample sets can be obtained by determining a normalized chromosome value (NCV) that relates the chromosomal doses in the test sample to the average of the corresponding chromosomal doses in the qualified sample set. it can. NCV is calculated as follows:
here
Are the estimated mean and standard deviation for chromosome jose in the eligible sample set, respectively, and x _ij is the observed chromosome jose in test sample i.

若干の実施形態において、少なくとも１つの完全染色体異数性の有無を決定する。他の実施形態において、少なくとも２つ、少なくとも３つ、少なくとも４つ、少なくとも５つ、少なくとも６つ、少なくとも７つ、少なくとも８つ、少なくとも９つ、少なくとも１０、少なくとも１１、少なくとも１２、少なくとも１３、少なくとも１４、少なくとも１５、少なくとも１６、少なくとも１７、少なくとも１８、少なくとも１９、少なくとも２０、少なくとも２１、少なくとも２２、少なくとも２３、少なくとも２４の、完全染色体異数性の有無を１つのサンプル内で決定し、この場合、２２の完全染色体異数性は、任意の１つ又はそれ以上の常染色体における完全染色体異数性に対応し、２３番目の染色体異数性及び２４番目の染色体異数性はＸ染色体及びＹ染色体の完全染色体異数性に対応する。異数性はトリソミー、テトラソミー、ペンタソミー及び他のポリソミーを含み、また完全染色体異数性の数は、異なる疾患、同一疾患における異なるステージで変化するため、本発明方法により決定することができる異なる完全染色体異数性の数としては、少なくとも２４、少なくとも２５、少なくとも２６、少なくとも２７、少なくとも２８、少なくとも２９、少なくとも３０、少なくとも４０、少なくとも５０、少なくとも６０、少なくとも７０、少なくとも８０、少なくとも９０、少なくとも１００、又はそれ以上の染色体異数性がある。腫瘍の系統的染色体解析によれば、がん細胞における染色体数は、低二倍性（４６個の染色体よりも相当少ない）から４倍性及び４倍性を超える倍数性（２００個の染色体）ものわたり広範囲に変化することがある（Storchova and Kuffer J Cell Sci 121:3859-3866 [2008]参照）。若干の実施形態において、本発明方法は、がん、例えば、大腸がんを患っていることが疑われる又は既知である患者からのサンプルにおいて、２００以上もの染色体異数性の有無を決定する。染色体異数性としては、１個又はそれ以上の染色体喪失（低二倍性）、トリソミー、テトラソミー、ペンタソミー及び他のポリソミーを含む過剰完全染色体がある。染色体断片の過剰及び／又は喪失も、本明細書のいたるところに記載したように決定することができる。本発明方法は、本明細書のいたるところに記載したように、任意のがんを患っていることが疑われる又は既知である患者からのサンプルにおける異なる異数性の有無を決定するのに適用することができる。 In some embodiments, the presence or absence of at least one complete chromosomal aneuploidy is determined. In other embodiments, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, Determining the presence or absence of at least 14, at least 15, at least 16, at least 17, at least 18, at least 19, at least 20, at least 21, at least 22, at least 23, at least 24 complete aneuploidy within a sample; In this case, the 22 complete chromosomal aneuploidy corresponds to the complete chromosomal aneuploidy in any one or more autosomes, and the 23rd and 24th chromosomal aneuploidy is the X chromosome. Corresponding to the complete aneuploidy of the Y chromosome Aneuploidy includes trisomy, tetrasomy, pentasomy, and other polysomy, and the number of complete chromosomal aneuploidy varies at different stages in different diseases, same disease, so different completeness that can be determined by the method of the present invention The number of chromosomal aneuploidies includes at least 24, at least 25, at least 26, at least 27, at least 28, at least 29, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100. Or more chromosomal aneuploidy. According to systematic chromosomal analysis of tumors, the number of chromosomes in cancer cells ranges from hypodiploidy (much less than 46 chromosomes) to tetraploidy and more than tetraploidy (200 chromosomes). It can vary widely (see Storchova and Kuffer J Cell Sci 121: 3859-3866 [2008]). In some embodiments, the methods of the invention determine the presence or absence of more than 200 chromosomal aneuploidies in a sample from a patient suspected or known to suffer from cancer, eg, colorectal cancer. Chromosomal aneuploidy includes excess complete chromosomes including one or more chromosome loss (hypodiploidy), trisomy, tetrasomy, pentasomy and other polysomy. Excess and / or loss of chromosomal fragments can also be determined as described elsewhere herein. The method of the present invention is applied to determine the presence or absence of different aneuploidies in samples from patients suspected or known to suffer from any cancer, as described elsewhere herein. can do.

若干の実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１つを、上述のような患者テストサンプルにおける任意の１つ又はそれ以上の異なった完全染色体異数性の有無を決定する際の関心対象染色体とすることができる。他の実施形態において、２個又はそれ以上の関心対象染色体を、１番，２番，３番，４番，５番，６番，７番，８番，９番，１０番，１１番，１２番，１３番，１４番，１５番，１６番，１７番，１８番，１９番，２０番，２１番，２２番染色体、Ｘ染色体及びＹ染色体のうち任意の２つ又はそれ以上から選択する。一実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した少なくとも２０個の染色体を有し、この場合、少なくとも２０の異なった完全染色体異数性の有無を決定する。他の実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体は、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべてとし、またこの場合、１〜２２番染色体、Ｘ染色体及びＹ染色体のすべてにおける完全染色体異数性の有無を決定する。決定することができる異なる完全染色体異数性としては、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１個又はそれ以上における完全染色体モノソミー、Ｘ染色体及びＹ染色体のうち任意の１個又はそれ以上における完全染色体トリソミー、Ｘ染色体及びＹ染色体のうち任意の１個又はそれ以上における完全染色体テトラソミー、Ｘ染色体及びＹ染色体のうち任意の１個又はそれ以上における完全染色体ペンタソミー、及びＸ染色体及びＹ染色体のうち任意の１個又はそれ以上における完全染色体ポリソミーがある。 In some embodiments, any one of chromosomes 1-22, X and Y chromosomes is present in any one or more different complete chromosomal aneuploidies in a patient test sample as described above. Can be the chromosome of interest in determining. In other embodiments, two or more chromosomes of interest are assigned to No. 1, No. 2, No. 3, No. 4, No. 5, No. 6, No. 7, No. 8, No. 9, No. 10, Select from any two or more of chromosomes 12, 13, 14, 15, 16, 17, 18, 19, 20, 20, 21, 22, chromosomes X and Y To do. In one embodiment, any one or more chromosomes of interest selected from chromosomes 1-22, X and Y are at least 20 chromosomes selected from chromosomes 1-22, X and Y. Having chromosomes, in this case, determining the presence or absence of at least 20 different complete chromosomal aneuploidies In other embodiments, any one or more chromosomes of interest selected from chromosomes 1-22, X and Y are all of chromosomes 1-22, X and Y, and In this case, the presence or absence of complete chromosomal aneuploidy in all of chromosomes 1-22 and X and Y is determined. Different complete chromosomal aneuploidies that can be determined include complete chromosome monosomy in any one or more of chromosomes 1-22 and X and Y, and any one of X and Y. Or a complete chromosome trisomy in any one or more of the X and Y chromosomes, a complete chromosome pentasomy in any one or more of the X and Y chromosomes, and the X chromosome and There is a complete chromosomal polysomy on any one or more of the Y chromosomes.

患者サンプルにおける部分染色体異数性の決定
他の実施形態において、本発明は、核酸分子を含む患者テストサンプルにおける任意の１つ又はそれ以上の異なった部分染色体異数性の有無を決定する方法を提供する。本発明方法のステップとしては、（ａ）テストサンプルにおける患者核酸の配列情報を取得するステップと、及び（ｂ）配列情報を使用して、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれにおける配列タグ数を同定し、また任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれにおける正規化断片配列の配列タグ数を同定するステップとを有する。正規化断片配列は染色体の単独断片とするか、又は１個又はそれ以上の異なった染色体からの断片グループとすることができる。本発明方法は、さらに、ステップ（ｃ）において、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれに対して同定した配列タグ数、及び各正規化断片配列に対して同定した配列タグ数を使用して、任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれの単独断片ドースを計算し、及びステップ（ｄ）で任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の断片それぞれにおける各単独断片ドースを任意の１個又はそれ以上の関心対象染色体における任意の１個又はそれ以上の染色体断片それぞれの閾値と比較し、これによりテストサンプルにおける任意の１つ又はそれ以上の異なった部分染色体異数性の有無を決定する。 Determination of Partial Chromosome Aneuploidy in a Patient Sample In another embodiment, the present invention provides a method for determining the presence or absence of any one or more different subchromosomal aneuploidies in a patient test sample comprising a nucleic acid molecule. provide. As the steps of the method of the present invention, (a) obtaining the sequence information of the patient nucleic acid in the test sample, and (b) using the sequence information, it was selected from chromosomes 1 to 22, the X chromosome and the Y chromosome. Identify the number of sequence tags in each of any one or more fragments in any one or more chromosomes of interest, and any one or more in any one or more chromosomes of interest Identifying the number of sequence tags of the normalized fragment sequence in each of the fragments. Normalized fragment sequences can be single fragments of chromosomes or groups of fragments from one or more different chromosomes. The method of the present invention further comprises, in step (c), the number of sequence tags identified for each of any one or more fragments in any one or more chromosomes of interest, and each normalized fragment sequence Calculate the single fragment dose for each of any one or more fragments in any one or more chromosomes of interest using the number of sequence tags identified for and optionally in step (d) Each single fragment dose in any one or more fragments in each of the one or more chromosomes of interest of any one or more chromosome fragments in any one or more of the chromosomes of interest To determine the presence or absence of any one or more different subchromosomal aneuploidies in the test sample.

他の実施形態において、ステップ（ｃ）は、関心対象断片の配列タグ比の計算を、関心対象断片に対して得た配列タグ数を関心対象断片の長さに関連付けし、また関心対象断片の対応する正規化断片配列のタグ数を正規化断片配列の長さに関連付けし、また関心対象断片の断片ドースを、関心対象断片の配列タグ密度と正規化断片配列の配列タグ密度との比として計算することによって行う。すべての関心対象染色体それぞれに対してこの計算を繰り返す。ステップ（ａ）〜（ｄ）を、異なる患者からのテストサンプルに対して繰り返すことができる。 In another embodiment, step (c) comprises calculating the sequence tag ratio of the fragment of interest by associating the number of sequence tags obtained for the fragment of interest with the length of the fragment of interest and The number of tags in the corresponding normalized fragment sequence is related to the length of the normalized fragment sequence, and the fragment dose of the fragment of interest as Do by calculating. Repeat this calculation for every chromosome of interest. Steps (a)-(d) can be repeated for test samples from different patients.

若干の実施形態において、少なくとも１つの部分染色体異数性の有無を決定する。他の実施形態において、２つ、３つ、４つ、５つ、６つ、７つ、８つ、９つ、１０、１５、２０、２５、又はそれ以上の、部分染色体異数性の有無を１つのサンプル内で決定する。一実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１個から選択した１個の関心対象断片は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択する。他の実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した２個又はそれ以上の関心対象断片を、１番，２番，３番，４番，５番，６番，７番，８番，９番，１０番，１１番，１２番，１３番，１４番，１５番，１６番，１７番，１８番，１９番，２０番，２１番，２２番染色体、Ｘ染色体及びＹ染色体のうち任意の２つ又はそれ以上から選択する。一実施形態において、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した任意の１個又はそれ以上の関心対象断片は、１〜２２番染色体、Ｘ染色体及びＹ染色体から選択した少なくとも１個、５個、１０個、１５個、２０個、２５個、５０個、７５個、１００個又はそれ以上の断片を有し、この場合、少なくとも１個、５個、１０個、１５個、２０個、２５個、５０個、７５個、１００個又はそれ以上の異なった部分染色体異数性の有無を決定する。決定することができる異なる部分染色体異数性としては、部分重複、部分増殖、部分挿入、及び部分欠失がある。 In some embodiments, the presence or absence of at least one subchromosomal aneuploidy is determined. In other embodiments, there are 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, or more subchromosome aneuploidies Are determined within one sample. In one embodiment, one fragment of interest selected from any one of chromosomes 1-22, X, and Y is selected from chromosomes 1-22, X, and Y. In another embodiment, two or more fragments of interest selected from chromosomes 1-22, X chromosome and Y chromosome are numbered 1, 2, 3, 4, 5, 6, 7 No. 8, No. 9, No. 10, No. 11, No. 12, No. 13, No. 14, No. 15, No. 16, No. 17, No. 18, No. 19, No. 20, No. 21, No. 22 chromosome, X chromosome And any two or more of the Y chromosomes. In one embodiment, any one or more fragments of interest selected from Chromosomes 1-22, X and Y are at least one selected from Chromosomes 1-22, X and Y, 5, 10, 15, 20, 25, 50, 75, 100 or more fragments, in this case at least 1, 5, 10, 15, 20 25, 50, 75, 100 or more different subchromosome aneuploidies. Different partial chromosome aneuploidies that can be determined include partial duplication, partial growth, partial insertion, and partial deletion.

患者における染色体異数性（部分又は完全）の有無を決定するのに使用できるサンプルは、本明細書のいたるところに記載したように、任意の生物学的サンプルとすることができる。患者における異数性決定に使用できるサンプルのタイプは、罹患していることが既知である又は疑われる患者の疾患のタイプに基づく。例えば、糞便サンプルをＤＮＡソースとして選択し、結腸直腸がんに関連する異数性の有無を決定することができる。好適には、サンプルは非侵襲性手段によって採取した生物学的サンプル、例えば、血漿サンプルとする。本明細書のいたるところに記載したように、患者サンプルにおける核酸のシークエンシングは、次世代シークエンシング（ＮＧＳ）を使用して実施することができる。若干の実施形態において、シークエンシングは、可逆色素ターミネーターによるシークエンシング・バイ・シンセシスを使用する大量並列シークエンシングとする。他の実施形態において、シークエンシングは、シークエンシング・バイ・リゲーションとする。さらに他の実施形態において、シークエンシングは単独分子シークエンシングとする。随意的に増幅ステップをシークエンシングに先立って行う。 The sample that can be used to determine the presence or absence of chromosomal aneuploidy (partial or complete) in a patient can be any biological sample, as described elsewhere herein. The type of sample that can be used to determine aneuploidy in a patient is based on the type of disease in the patient that is known or suspected to be affected. For example, a stool sample can be selected as a DNA source to determine the presence or absence of aneuploidy associated with colorectal cancer. Preferably, the sample is a biological sample taken by non-invasive means, for example a plasma sample. As described elsewhere herein, sequencing of nucleic acids in patient samples can be performed using next generation sequencing (NGS). In some embodiments, the sequencing is massively parallel sequencing using sequencing by synthesis with a reversible dye terminator. In other embodiments, the sequencing is sequencing by ligation. In still other embodiments, the sequencing is single molecule sequencing. Optionally, an amplification step is performed prior to sequencing.

若干の実施形態において、異数性の有無は、本明細書のいたるところに記載したように、例えば、肺、乳房、腎臓、頭頸部、卵巣、頸部、大腸、膵臓、食道、膀胱、及び他の臓器のがん、並びに血液がんを罹患していることが疑われる患者において決定する。血液がんとしては、骨髄、血液、及びリンパ系のがんがあり、リンパ系は、リンパ節、リンパ管、へんとう腺、胸腺、脾臓、及び消化管のリンパ系組織を含む。骨髄からスタートする白血病及び骨髄腫、及びリンパ系からスタートするリンパ腫は、最も一般的な血液がんのタイプである。 In some embodiments, the presence or absence of aneuploidy is, for example, as described elsewhere herein, for example, lung, breast, kidney, head and neck, ovary, neck, colon, pancreas, esophagus, bladder, and Determine in patients suspected of having cancer of other organs, as well as blood cancer. Hematologic cancers include bone marrow, blood, and lymphoid cancers, which include lymph nodes, lymph vessels, condyles, thymus, spleen, and lymphatic tissues of the gastrointestinal tract. Leukemia and myeloma starting from the bone marrow and lymphoma starting from the lymphatic system are the most common types of blood cancer.

ＣＮＶを決定するための装置及びシステム Apparatus and system for determining CNV

シークエンシングデータの解析及びこの解析から導く診断は、一般的には種々のコンピュータアルゴリズム及びプログラムを使用して行う。一実施形態において、本発明は、テストサンプルにおける胎児異数性の有無を示す出力を発生するコンピュータプログラム製品を提供する。コンピュータ製品としては、プロセッサに対して胎児異数性を診断させるよう媒体に記録したロジックを有するコンピュータ可読媒体を含み、ロジックは、母体の生物学的サンプルからの核酸分子の少なくとも一部から、計算した染色体を含むシークエンシングデータを受け取る受取り手順、受取ったデータから胎児異数性を解析するコンピュータ支援ロジック、及びこの胎児異数性の有無又は種類を示す出力を発生する出力手順を有する。 Analysis of sequencing data and diagnostics derived from this analysis are typically performed using various computer algorithms and programs. In one embodiment, the present invention provides a computer program product that generates an output indicating the presence or absence of fetal aneuploidy in a test sample. The computer product includes a computer readable medium having logic recorded in the medium to cause the processor to diagnose fetal aneuploidy, the logic calculating from at least a portion of the nucleic acid molecules from the maternal biological sample. And a computer-aided logic for analyzing fetal aneuploidy from the received data, and an output procedure for generating an output indicating the presence or type of the fetal aneuploidy.

本発明方法は、任意のＣＮＶ、例えば、染色体異数性又は部分異数性を同定する方法を実行するコンピュータ可読命令を格納したコンピュータ可読媒体を使用して行うことができる。したがって、一実施形態において、本発明は、完全及び部分染色体異数性、例えば、胎児異数性を同定する方法を実施するコンピュータ可読命令を格納したコンピュータ可読媒体を提供する。 The methods of the invention can be performed using any CNV, for example a computer readable medium having stored thereon computer readable instructions for performing a method for identifying chromosomal aneuploidy or partial aneuploidy. Accordingly, in one embodiment, the present invention provides a computer readable medium having stored thereon computer readable instructions for performing a method for identifying complete and partial chromosomal aneuploidy, eg, fetal aneuploidy.

本発明方法は、さらに、任意のＣＮＶ、例えば、染色体異数性又は部分異数性を同定する方法を実行するよう構成したコンピュータ処理システムを使用して実施することができる。したがって、本発明は、上述の方法を実施するよう構成したコンピュータ処理システムを提供する。一実施形態において、本発明装置は、サンプルにおける核酸分子の少なくとも一部をシークエンシングして、本明細書のいたるところに記載したように、配列情報のタイプを取得するよう構成したシークエンシング装置を有する。 The methods of the present invention can further be performed using a computer processing system configured to perform any CNV, eg, a method for identifying chromosomal aneuploidy or partial aneuploidy. Accordingly, the present invention provides a computer processing system configured to implement the above-described method. In one embodiment, the device of the invention comprises a sequencing device configured to sequence at least a portion of a nucleic acid molecule in a sample to obtain a type of sequence information as described elsewhere herein. Have.

本発明を、以下の実施例で詳細に説明するが、これら実施例は、本発明の特許請求の範囲を限定することを意図しない。添付図面は、本発明の明細書と一体の部分として見なすべきである。以下の実施例は説明であって特許請求の範囲を限定するものではない。 The invention will be described in detail in the following examples, which are not intended to limit the scope of the claims of the invention. The accompanying drawings are to be regarded as an integral part of the specification of the invention. The following examples are illustrative and do not limit the scope of the claims.

実験実施例
実施例１
サンプル処理及びＤＮＡ抽出
末梢血液サンプルを、妊娠第一期又は第二期にあり、胎児異数性のリスクがあると見なされた妊婦から採取した。血液採取前に各当事者からインフォームド・コンセントを得た。血液を採取してから、羊水穿刺又は絨毛膜絨毛のサンプリングを行った。核型解析は、絨毛膜絨毛又は羊水穿刺のサンプルを使用して行い、胎児の核型を確認した。 Experimental example
Example 1
Sample Processing and DNA Extraction Peripheral blood samples were taken from pregnant women who were in the first or second trimester and considered to be at risk for fetal aneuploidy. Informed consent was obtained from each party prior to blood collection. After collecting blood, amniocentesis or chorionic villi were sampled. Karyotype analysis was performed using chorionic villi or amniocentesis samples to confirm fetal karyotype.

各検体から採取した末梢血液はＡＣＤ管に収集した。１個の血液サンプル管（６〜９ｍＬ／管）を、１５ｍＬの低速遠心管に移し入れた。血液を、ベックマン・コールター社のアレグラ６Ｒ遠心分離及びロータモデルＧＡ３.８を使用して、２６４０ｒｐｍ、４゜Ｃにして１０分間にわたり、遠心分離した。無細胞血漿抽出のため、上澄みの血漿層を１５ｍＬの高速遠心分離管に移し入れ、ベックマン・コールター社のアヴァンチＪ−Ｅ遠心分離器及びＪＡ１４ロータを使用して、１６０００ｘｇ、４゜Ｃにして１０分間にわたり遠心分離した。２段階の遠心分離ステップを血液採取後７２時間内に行った。無細胞血漿を−８０゜Ｃで保存し、ＤＮＡ抽出前に１回だけ解凍した Peripheral blood collected from each specimen was collected in an ACD tube. One blood sample tube (6-9 mL / tube) was transferred to a 15 mL low speed centrifuge tube. The blood was centrifuged at 2640 rpm, 4 ° C. for 10 minutes using a Beckman Coulter Allegra 6R centrifuge and a rotor model GA3.8. For cell-free plasma extraction, transfer the supernatant plasma layer to a 15 mL high-speed centrifuge tube and use a Beckman Coulter Avanti J-E centrifuge and JA14 rotor to 16000 × g, 4 ° C. to 10 ° C. Centrifuge for minutes. Two centrifugation steps were performed within 72 hours after blood collection. Cell-free plasma was stored at -80 ° C and thawed only once before DNA extraction

無細胞ＤＮＡは、キアゲン（Qiagen）社のQIAmp ＤＮＡ血液ミニキットを使用し、製造業者の取扱説明書に従って抽出した。５ミリリットルのバッファＡＬ及び５００μｌのキアゲン社のプロテアーゼを４.５ｍｌ〜５ｍｌの無細胞血漿に添加した。分量をリン酸緩衝生理食塩水（ＰＢＳ：phosphate buffered saline）で１０ｍｌに調整した。複数カラムを使用して、ベックマン社のマイクロ遠心分離器において８，０００ＲＰＭにして遠心分離することによって溶液から沈殿したｃｆＤＮＡを分離した。カラムをＡＷ１及びＡＷ２バッファで洗浄し、ｃｆＤＮＡを５５μｌのヌクレアーゼのない水で溶離した。約３.５〜７ｎｇのｃｆＤＮＡを血漿サンプルから抽出した。 Cell-free DNA was extracted using Qiagen QIAmp DNA blood mini kit according to the manufacturer's instructions. 5 ml of buffer AL and 500 μl of Qiagen protease were added to 4.5 ml to 5 ml of cell-free plasma. The amount was adjusted to 10 ml with phosphate buffered saline (PBS). Using multiple columns, the precipitated cfDNA was separated from the solution by centrifugation at 8,000 RPM in a Beckman microcentrifuge. The column was washed with AW1 and AW2 buffer and cfDNA was eluted with 55 μl of nuclease-free water. About 3.5-7 ng cfDNA was extracted from plasma samples.

すべてのシークエンシングライブラリを、母体血漿から抽出した約２ｎｇの精製ｃｆＤＮＡから調製した。ライブラリ調製は、以下に説明するようなイルミナ（Illumina:登録商標）用のＮＥＢＮｅｘｔ（登録商標）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１（Part No.E6000L; New England Biolabs, Ipswich, MA）の試薬を使用して行った。無細胞血漿ＤＮＡは元々フラグメント（断片）化されているため、噴霧又は超音波処理による更なるフラグメント化は血漿ＤＮＡサンプルでは行わない。４０μｌに含まれる約２ｎｇの精製ｃｆＤＮＡのオーバーハング部分は、ＮＥＢＮｅｘｔ（登録商標）末端修復モジュールによってリン酸化した平滑末端に転換させられ、この転換は、ｃｆＤＮＡを１.５ｍｌの微量遠心管内で、ＮＥＢＮｅｘｔ（登録商標）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１に供給した、５μｌの１０Ｘリン酸化バッファ、２μｌのデオキシヌクレオチド混合溶液（各ｄＮＴＰ毎に１０ｍＭ）、１μｌの１：５希釈ＤＮＡポリメラーゼ、１μｌのＴ４ＤＮＡポリメラーゼ、及び１μｌのＴ４ポリヌクレオチドキナーゼとともに１５分間にわたり２０゜Ｃで培養することによって行った。次にこの反応混合物を７５゜Ｃで５分間にわたり培養することによって、酵素を加熱不活性化した。この混合物を４゜Ｃに冷却し、また平滑末端化したＤＮＡのｄＡ尾端化（tailing）を、クレノウフラグメント（３′〜５′エキソマイナス）（ＮＥＢＮｅｘｔ（商標名）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１）を含む１０μｌのｄＡ尾端化マスター混合物を使用し、また１５分間にわたり３７゜Ｃで培養することによって行った。つぎに、クレノウフラグメントを加熱不活性化し、この不活性化は反応混合物を７５゜Ｃで５分間にわたり培養することによって行った。クレノウフラグメントの不活性化後に、１μｌのイルミナ・ゲノミック・アダプタ・オリゴ・ミックス（Part No. 1000521; Illumina Inc., Hayward, CA）１：５希釈液を使用して、イルミナアダプタ（Non-Index Y-Adaptors）を、ＮＥＢＮｅｘｔ（登録商標）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１に供給された４μｌのＴ４ＤＮＡリガーゼを使用して反応混合物を１５分間にわたり２５゜Ｃで培養することによって、ｄＡ尾端化したＤＮＡに結合させた。この混合物を４゜Ｃに冷却し、アダプタ結合したｃｆＤＮＡを、未結合アダプタ、アダプタ二量体、及びエージェンコートAMPure XP PCRの精製システム（Part No. A63881; Beckman Coulter Genomics, Danvers, MA）に供給された磁気ビードを使用する他の試薬から精製した。ＰＣＲの１８サイクルを行って、選択的にアダプタ結合ｃｆＤＮＡを富裕化し、この富裕化は、フュージョン（Phusion：登録商標）・ハイ−フィデリティ・マスター・ミックス（Finnzymes, Woburn, MA）及びアダプタに相補的なイルミナ社のＰＣＲプライマー（Part No. 1000537及び1000537）を使用して行った。アダプタ結合ＤＮＡに対してＰＣＲ（９８゜Ｃで３０秒間；９８゜Ｃで１０秒間、６５゜Ｃで３０秒間及び７２゜Ｃで３０秒間を１８サイクル、７２゜Ｃで５分間の最終延長、及び４゜Ｃで保持）を加え、この場合、製造業者の取扱説明書に従って、ＮＥＢＮｅｘｔ（登録商標）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１に供給されたイルミナゲノミックＰＣＲプライマー（Part No. 100537及び1000538）及びフュージョン・ＨＦ・ＰＣＲマスター・ミックスを使用した。増幅した生成物を、エージェンコートAMPure XP PCRの精製システム（Agencourt Bioscience Corporation, Berverly, MA）を使用して精製し、この精製は、製造業者の取扱説明書（www.beckmangenomics.com/products/AMPureXPProtocol_000387v001.pdfで入手可能）に従って行った。精製した増幅生成物は、４０μｌのキアゲン・ＥＢ・バッファで溶離し、増幅したライブラリの濃度及びサイズ分布を、２１００バイオアナライザ（Agilent technologies Inc., Santana Clara, CA）のためのアジレント・ＤＮＡ・1000・キットを使用して解析した。 All sequencing libraries were prepared from approximately 2 ng of purified cfDNA extracted from maternal plasma. Library preparation uses reagents from NEBNext® DNA Sample Prep DNA Reagent Set 1 (Part No. E6000L; New England Biolabs, Ipswich, Mass.) For Illumina® as described below. I went. Since cell-free plasma DNA is originally fragmented, no further fragmentation by spraying or sonication is performed on the plasma DNA sample. About 2 ng of the overhang portion of purified cfDNA contained in 40 μl was converted to blunt ends phosphorylated by the NEBNext® end repair module, which converted the cfDNA into NEBNext in a 1.5 ml microfuge tube. (Registered trademark) DNA sample Prep 5 μl of 10 × phosphorylated buffer, 2 μl of deoxynucleotide mixed solution (10 mM for each dNTP), 1 μl of 1: 5 diluted DNA polymerase, 1 μl of T4 DNA polymerase supplied to DNA reagent set 1 And incubated with 1 μl T4 polynucleotide kinase for 15 minutes at 20 ° C. The enzyme was then heat inactivated by incubating the reaction mixture at 75 ° C. for 5 minutes. The mixture was cooled to 4 ° C and blunt ended DNA dA tailing was performed using Klenow fragment (3'-5 'exominus) (NEBNext ™ DNA sample Prep DNA reagent set. This was done by using 10 μl of dA tailed master mix containing 1) and incubating at 37 ° C. for 15 minutes. The Klenow fragment was then heat inactivated and this inactivation was performed by incubating the reaction mixture at 75 ° C. for 5 minutes. After inactivation of the Klenow fragment, 1 μl of Illumina Genomic Adapter Oligo Mix (Part No. 1000521; Illumina Inc., Hayward, Calif.) 1: 5 dilution was used to make Illumina Adapter (Non-Index Y-Adaptors) was dA tailed by incubating the reaction mixture at 25 ° C. for 15 minutes using 4 μl of T4 DNA ligase supplied to NEBNext® DNA sample Prep DNA reagent set 1. Bound to DNA. The mixture is cooled to 4 ° C. and adapter-bound cfDNA is fed to the unbound adapter, adapter dimer, and agent-coated AMPure XP PCR purification system (Part No. A63881; Beckman Coulter Genomics, Danvers, Mass.). The purified magnetic beads were used to purify from other reagents. Perform 18 cycles of PCR to selectively enrich adapter-bound cfDNA, which is complementary to Fusion (High) Fidelity Master Mix (Finnzymes, Woburn, MA) and adapters Illumina PCR primers (Part No. 1000537 and 1000537) were used. PCR for adapter-bound DNA (98 ° C for 30 seconds; 98 ° C for 10 seconds, 65 ° C for 30 seconds and 72 ° C for 30 seconds for 18 cycles, 72 ° C for 5 minutes final extension, and In this case, the Illuminogenomic PCR primers (Part Nos. 100537 and 1000538) and fusion supplied to the NEBNext® DNA sample Prep DNA reagent set 1 according to the manufacturer's instructions. • HF • PCR master mix was used. The amplified product is purified using the purification system of Agencourt AMPure XP PCR (Agencourt Bioscience Corporation, Berverly, Mass.), Which can be obtained from the manufacturer's instructions (www.beckmangenomics.com/products/AMPureXPProtocol_000387v001 (Available in .pdf). The purified amplification product was eluted with 40 μl Qiagen EB buffer, and the concentration and size distribution of the amplified library was determined using an Agilent DNA 1000 for 2100 Bioanalyzer (Agilent technologies Inc., Santana Clara, Calif.). -Analyzed using a kit.

イルミナ・ゲノム・アナライザIIを使用して増幅したＤＮＡをシークエンシングし、３６ｂｐの単独末端リードを得た。ランダム配列情報の約３０ｂｐのみが特定ヒト染色体に属する配列を同定するのに必である。より長い配列は、より特別なターゲットを一意的に同定することができる。サンプルのシークエンシングが完了した際に、イルミナ社「シークエンサ制御ソフトウェア」は画像及び塩基コール（判定）ファイルを、イルミナ社「ゲノム・アナライザ・パイプライン」を実行するユニックスサーバーに転送した。イルミナ社「ジェラルド」プログラムを実行して配列を基準ヒトゲノムに整列させ、この基準ヒトゲノムはバイオテクノロジー情報ナショナルセンターによって規定されたｈｇ１８ゲノムＮＣＢＩ３６／ｈｇ１８）に由来するものである（ワールドワイドウェブ上におけるＵＲＬ;http://genome.ucsc.edu/cgi-bin/hgGateway?Org=Human&db=hg18&hgsid=166260105で入手可能である）。ゲノムに一意的に整列させる上述の手順から生成した配列データを、Linnuxオペレーティングシステムを実行するコンピュータ上で稼働するプログラム（ｃ２ｃ.ｐｌ）によってジェラルドの出力（export.txtファイル）から読んだ。塩基の不一致を有する配列アラインメントは、ゲノムに一意的に整列した場合にのみ整列カウントが可能であり、整列カウントに含める。同一の開始及び末端配位を有する（複製）配列アラインメントは排除した。 The amplified DNA was sequenced using the Illumina Genome Analyzer II to obtain a 36 bp single-ended read. Only about 30 bp of random sequence information is necessary to identify sequences belonging to a specific human chromosome. Longer sequences can uniquely identify more specific targets. When sample sequencing was complete, Illumina “Sequencer Control Software” transferred the image and base call (decision) files to a Unix server running Illumina “Genome Analyzer Pipeline”. Run the Illumina “Gerald” program to align the sequence to the reference human genome, which is derived from the hg18 genome NCBI36 / hg18 defined by the National Center for Biotechnology Information (URL on the World Wide Web) ; available at http://genome.ucsc.edu/cgi-bin/hgGateway?Org=Human&db=hg18&hgsid=166260105). The sequence data generated from the above procedure for uniquely aligning with the genome was read from Gerald's output (export.txt file) by a program (c2c.pl) running on a computer running the Linnux operating system. Sequence alignments with base mismatches can only be counted when aligned uniquely to the genome and are included in the alignment count. (Replication) sequence alignments with identical starting and terminal coordination were excluded.

約５００万〜約１５００万個の３６ｂｐタグ間で２個以下の不一致がヒトゲノムにマッピングされた。マッピングされたすべてのタグをカウントし、またテストサンプル及び適格サンプルの双方において染色体ドースの計算に含めた。塩基０から塩基２×１０^６に、塩基１０×１０^６から塩基１３×１０^６に、塩基２３×１０^６からＹ染色体の末端にわたる領域は、特別に解析から排除し、これは男児又は女児のいずれかの胎児に由来するタグはＹ染色体のこれら領域にマッピングされるからである。同一ラン（稼働）でシークエンシングされたサンプル間で個別染色体にマッピングされる配列タグの総数に若干の変動が示され（染色体間変動）、相当大きな変動が異なるシークエンシングランで生じた（シークエンシングラン間変動）。 Less than 2 mismatches between about 5 million to about 15 million 36 bp tags were mapped to the human genome. All mapped tags were counted and included in the chromosomal dose calculation in both test and eligible samples. The region spanning from the base 0 to base 2 × 10 ⁶ , base 10 × 10 ⁶ to base 13 × 10 ⁶ , base 23 × 10 ⁶ to the end of the Y chromosome is specifically excluded from the analysis, which is the same for boys or girls. This is because tags derived from any fetus are mapped to these regions of the Y chromosome. There was some variation in the total number of sequence tags mapped to individual chromosomes between samples sequenced in the same run (running) (interchromosomal variation), with significant variation occurring in different sequencing runs (sequence sequencing). Fluctuation between Gran).

実施例２
１３番、１８番、２１番、Ｘ及びＹ染色体のドース及び分散
すべての染色体に対してマッピングした配列タグ数の染色体間変動及びシークエンシング間変動の程度を調べるため、４８人のボランティアの妊婦検体における末梢血液に由来する血漿ｃｆＤＮＡを抽出し、実施例１に説明したようにシークエンシングし、以下のように解析した。各染色体にマッピングされた配列タグの総数（配列タグ密度）を決定した。代案として、マッピングした配列タグ数を染色体の長さに正規化し、配列タグ密度比を生成することができる。染色体長さに対する正規化は必要なステップではなく、単にヒトを解釈する上で簡素化するよう数の桁数を減少するために行うことができる。配列タグカウントを正規化するのに使用することができる染色体長さは、ワールドワイドウェブ上におけるgenome.ucsc.edu/goldenPath/stats.html#hg18で提供された長さとすることができる。 Example 2
Dose and dispersion of chromosomes 13, 18, 21, X and Y chromosomes 48 volunteer pregnant women samples to examine the degree of interchromosomal and inter-sequence variation in the number of sequence tags mapped to all chromosomes Plasma cfDNA derived from peripheral blood in was extracted, sequenced as described in Example 1, and analyzed as follows. The total number of sequence tags (sequence tag density) mapped to each chromosome was determined. Alternatively, the number of mapped sequence tags can be normalized to the length of the chromosome to generate a sequence tag density ratio. Normalization to chromosome length is not a necessary step, but can be done simply to reduce the number of digits to simplify the human interpretation. The chromosomal length that can be used to normalize the sequence tag count can be the length provided by genome.ucsc.edu/goldenPath/stats.html#hg18 on the World Wide Web.

各染色体に対して得られた配列タグ密度は、残りの染色体それぞれの配列タグ密度に関連付けして適格染色体ドースを導き出し、この適格染色体ドースは、関心対象染色体、例えば、２１番染色体の配列タグ密度と、残りの染色体、すなわち、１〜２０番染色体、２２番染色体及びＸ染色体それぞれの配列タグ密度との比として計算した。表１は、適格サンプルのうち１つで決定された関心対象の１３番、１８番、２１番染色体、Ｘ及びＹ染色体の計算した適格染色体ドースの例を示す。染色体ドースは、すべてのサンプルにおけるすべての染色体（Ｃｈｒ：chromosome）に対して決定し、また適格サンプルにおける関心対象の１３番、１８番、２１番染色体、Ｘ及びＹ染色体の平均ドースを表２及び表３に示し、図２〜６に示す。図２〜６は、さらに、テストサンプルに対する染色体ドースも示す。適格サンプルにおける関心対象染色体それぞれの染色体ドースは、各関心対象染色体それぞれにおけるマッピングした配列タグ総数における、残りの染色体それぞれにおけるマッピングした配列タグ総数に対する変動の比較尺度を提供する。したがって、適格染色体ドースが、染色体又は染色体グループ、すなわち、関心対象染色体の変動に最も近似するサンプル内の変動を有し、また更なる統計的評価をする上で正規化値用の理想配列として供する正規化染色体を同定することができる。図７及び８は、適格サンプルの母集団で決定した１３番、１８番、２１番染色体、Ｘ及びＹ染色体の計算した平均染色体ドースを示す。 The sequence tag density obtained for each chromosome is related to the sequence tag density of each of the remaining chromosomes to derive a qualified chromosome dose, which is the sequence tag density of the chromosome of interest, eg, chromosome 21. And the ratio of the sequence tag density of each of the remaining chromosomes, that is, chromosomes 1 to 20, chromosome 22, and X. Table 1 shows an example of the calculated eligible chromosomal doses of interest chromosomes 13, 18, 21 and X and Y chromosomes determined in one of the eligible samples. Chromosome doses are determined for all chromosomes (Chr: chromosome) in all samples, and the average doses of chromosomes 13, 18, 21, X and Y of interest in eligible samples are shown in Table 2 and Shown in Table 3 and shown in FIGS. Figures 2-6 also show chromosomal doses for the test samples. The chromosomal dose for each chromosome of interest in the eligible sample provides a comparative measure of variation in the total number of sequence tags mapped for each chromosome of interest relative to the total number of sequence tags mapped for each remaining chromosome. Thus, a qualified chromosomal dose has a variation within the sample that most closely approximates the variation of a chromosome or chromosome group, i.e., a chromosome of interest, and serves as an ideal sequence for normalized values for further statistical evaluation. Normalized chromosomes can be identified. Figures 7 and 8 show the calculated average chromosomal doses for chromosomes 13, 18, 21 and X and Y as determined in the population of eligible samples.

若干の実施形態において、最良の正規化染色体は最も少ない変動ではなく、テストサンプルを適格サンプルから最も区別する適格ドースの分布を有するものであり、すなわち、最良の正規化染色体は最低変動ではなく、最大弁別可能性である。この弁別可能性は、適格サンプルにおける染色体ドースの変動及びドース分布に基づく。 In some embodiments, the best normalized chromosome is not the least variation but has a distribution of qualified doses that most distinguishes the test sample from the qualified sample, i.e., the best normalized chromosome is not the least variation, Maximum discriminability. This discriminability is based on chromosomal dose variation and dose distribution in eligible samples.

表２及び３は、変動性の評価尺度としての変動係数（ＣＶ）、及び１８番、２１番染色体、Ｘ染色体及びＹ染色体の弁別可能性の評価尺度としてのスチューデントｔ検定値を示し、ｔ検定値が小さければ小さいほど、弁別可能性は大きくなる。１３番染色体の弁別可能性は、適格サンプルの平均染色体ドースとＴ１３テストサンプルのみの１３番染色体ドースとの間の差と、適格ドースの平均の標準偏差の比として決定した。 Tables 2 and 3 show the coefficient of variation (CV) as a measure of variability, and the Student t-test values as measures of the discriminability of chromosomes 18, 21 and X and Y. The smaller the value, the greater the discriminability. Chromosome 13 discriminability was determined as the ratio of the difference between the average chromosomal dose of the eligible sample and the chromosome 13 dose of the T13 test sample alone and the standard deviation of the average of the eligible dose.

適格染色体ドースは、以下に説明するように、テストサンプルにおける異数性を同定するとき、閾値を決定する基礎として供する。 A qualified chromosomal dose serves as a basis for determining a threshold when identifying aneuploidy in a test sample, as described below.

正規化染色体、染色体ドース及び関心対象染色体それぞれの弁別可能性を使用して得られたＴ２１，Ｔ１３，Ｔ１８及びターナー症候群の診断例を実施例３で説明する。 A diagnosis example of T21, T13, T18 and Turner syndrome obtained by using the discriminability of the normalized chromosome, the chromosome dose, and the chromosome of interest is described in Example 3.

実施例３
正規化染色体を使用する胎児異数性の診断
異数性を評価するため染色体ドースの使用を生物学的テストサンプルに適用する上で、母体血液テストサンプルをボランティアの妊婦から採取し、ｃｆＤＮＡを調製し、シークエンシングし、また実施例１及び２で記載したように解析した。 Example 3
Diagnosis of fetal aneuploidy using normalized chromosomes Applying the use of chromosomal doses to biological test samples to assess aneuploidy, maternal blood test samples are taken from volunteer pregnant women and cfDNA is prepared And sequenced and analyzed as described in Examples 1 and 2.

２１トリソミー
表４は、例としてのテストサンプル（#11403）における２１番染色体の計算したドースを示す。Ｔ２１異数性の陽性診断のための計算した閾値は適格（正常）サンプルの平均からの標準偏差の２倍より大きい値＞（２標準偏差）に設定した。Ｔ２１のための診断は、テストサンプルにおける染色体ドースが設定した閾値よりも大きいことに基づいて行った。１４番及び１５番染色体を個別の計算における正規化染色体として使用し、最も低い変動性を有する染色体、例えば１４番染色体、又は最も大きい弁別可能性を有する染色体、例えば染色体１５のうちいずれかを使用して異数性を同定することができる。計算した染色体ドースを使用して１３例のＴ２１サンプルを同定し、異数性サンプルがＴ２1であることを核型によって確認した。 Trisomy 21 Table 4 shows the calculated dose of chromosome 21 in an example test sample (# 11403). The calculated threshold for a positive diagnosis of T21 aneuploidy was set to a value greater than twice the standard deviation from the mean of eligible (normal) samples> (2 standard deviations). Diagnosis for T21 was made based on the fact that the chromosomal dose in the test sample was greater than the set threshold. Use chromosomes 14 and 15 as normalized chromosomes in separate calculations and use the chromosome with the lowest variability, eg, chromosome 14 or the chromosome with the highest discriminability, eg, chromosome 15 Thus, aneuploidy can be identified. Using the calculated chromosomal dose, 13 T21 samples were identified and the karyotype confirmed that the aneuploid sample was T21.

１８トリソミー
表５はテストサンプル（#11390）における１８番染色体の計算したドースを示す。Ｔ１８異数性陽性診断のための計算した閾値は適格（正常）サンプルの平均からの標準偏差の２倍（２標準偏差）に設定した。Ｔ１８のための診断は、テストサンプルにおける染色体ドースが設定した閾値よりも大きいことに基づいて行った。８番染色体を正規化染色体として使用した。この場合、８番染色体が最も低い変動性を有し、又は最も大きい弁別可能性を有する染色体であった。染色体ドースを使用して８例のＴ１８サンプルを同定し、Ｔ１８であることを核型によって確認した。 Trisomy 18 Table 5 shows the calculated dose of chromosome 18 in the test sample (# 11390). The calculated threshold for T18 aneuploidy positive diagnosis was set at twice the standard deviation (2 standard deviations) from the mean of eligible (normal) samples. Diagnosis for T18 was based on the chromosomal dose in the test sample being greater than the set threshold. Chromosome 8 was used as a normalized chromosome. In this case, chromosome 8 was the chromosome with the lowest variability or the highest discriminability. Eight T18 samples were identified using chromosomal dose and confirmed to be T18 by karyotype.

これらデータは、正規化染色体が最小変動性及び最大弁別可能性の双方を有することを示している。 These data indicate that normalized chromosomes have both minimum variability and maximum discriminability.

１３トリソミー
表６はテストサンプル（#51236）における１３番染色体の計算したドースを示す。Ｔ１３異数性陽性診断のための計算した閾値は適格（正常）サンプルの平均からの標準偏差の２倍（２標準偏差）に設定した。Ｔ１３のための診断は、テストサンプルにおける染色体ドースが設定した閾値よりも大きいことに基づいて行った。５番染色体又は３番、４番、５番及び６番染色体のグループを正規化染色体として使用して、１３番染色体の染色体ドースを計算した。１例のＴ１３サンプルを同定した。 Trisomy 13 Table 6 shows the calculated dose of chromosome 13 in the test sample (# 51236). The calculated threshold for T13 aneuploidy positive diagnosis was set at twice the standard deviation (2 standard deviations) from the mean of eligible (normal) samples. Diagnosis for T13 was based on the chromosomal dose in the test sample being greater than the set threshold. Chromosome doses of chromosome 13 were calculated using chromosome 5 or a group of chromosomes 3, 4, 5, and 6 as normalized chromosomes. One T13 sample was identified.

３〜６番染色体の配列タグ密度は、３〜６番染色体の平均タグカウント数である。
このデータは、３番、４番、５番及び６番染色体の組合せが、５番染色体よりも低い変動性、及び他の任意な染色体よりも最も大きい弁別可能性をもたらすものであることを示している。 The sequence tag density of chromosomes 3-6 is the average tag count number of chromosomes 3-6.
This data shows that the combination of chromosomes 3, 4, 5, and 6 results in lower variability than chromosome 5 and the greatest discriminability than any other chromosome. ing.

したがって、染色体グループを、染色体ドースを決定し、また異数性を同定するための正規化染色体として使用することができる。 Thus, chromosomal groups can be used as normalized chromosomes to determine chromosomal doses and identify aneuploidy.

ターナー症候群（Ｘモノソミー）
表７は、テストサンプル（#51238）におけるＸ染色体及びＹ染色体の計算したドースを示す。ターナー症候群（Ｘモノソミー）陽性診断のための計算した閾値は、Ｘ染色体に対して適格（正常）サンプルの平均から（−２標準偏差）よりも小さく、Ｙ染色体がない場合に対して適格（正常）サンプルの平均から（−２標準偏差）よりも小さく設定した。 Turner syndrome (X monosomy)
Table 7 shows the calculated doses for the X and Y chromosomes in the test sample (# 51238). The calculated threshold for a Turner syndrome (X monosomy) positive diagnosis is less than (−2 standard deviations) from the mean of eligible (normal) samples for the X chromosome and qualified for the absence of the Y chromosome (normal) ) Set smaller than (−2 standard deviation) from the average of the samples.

設定した閾値の染色体ドースよりも小さいＸ染色体ドースを有するサンプルを１個のＸ染色体よりも少ないものとして同定した。この同一サンプルは設定した閾値より少ないＹ染色体ドースを有すると決定したが、このことはそのサンプルがＹ染色体を持っていないことを示す。このようにして、Ｘ及びＹ染色体の染色体ドースの組合せを使用し、ターナー症候群（Ｘモノソミー）を同定した。したがって、本発明によれば、染色体のＣＮＶを決定することができる。とくに、本発明によれば、母体血漿ｃｆＤＮＡを大量並列シークエンシングすることによって、またシークエンシングデータの統計学的解析のための正規化染色体同定によって、過剰及び不足表現の染色体異数性を決定することができる。本発明方法の感度及び信頼性は、妊娠第１期及び第２期における異数性テストを正確に行うことができる。 Samples with X chromosome doses smaller than the set threshold chromosome dose were identified as less than one X chromosome. This same sample was determined to have less Y chromosome dose than the set threshold, indicating that the sample does not have a Y chromosome. Thus, a combination of chromosomal doses of the X and Y chromosomes was used to identify Turner syndrome (X monosomy). Therefore, according to the present invention, the CNV of a chromosome can be determined. In particular, according to the present invention, chromosomal aneuploidy is determined by massive parallel sequencing of maternal plasma cfDNA and by normalized chromosome identification for statistical analysis of sequencing data. be able to. The sensitivity and reliability of the method of the present invention can accurately perform aneuploidy tests in the first and second trimesters.

実施例４
部分異数性の決定
配列ドース使用を、血液の血漿から調製したｃｆＤＮＡの生物学的テストサンプルにおける部分異数性評価に適用し、実施例１に記載したようにシークエンシングした。サンプルは、染色体解析によって１１番染色体における部分的欠失を有する検体に由来するものと確認した。部分異数性（１１番染色体における部分的、すなわちｑ２１〜ｑ２３の欠失）のためのシークエンシングデータ解析を、上述した実施例で染色体異数性につき説明したように行った。配列タグをテストサンプルにおける１１番染色体にマッピングすることによって、染色体のｑアームにおける塩基対81000082〜103000103間におけるタグカウント数の、適格サンプルにおける１１番染色体の対応配列に関して得られたタグカウント数（データは示さない）に対する顕著な喪失を明らかにした。適格サンプルそれぞれにおける関心対象１１番染色体にマッピングした配列タグ（810000082〜103000103bp）及び適格サンプルにおける全体ゲノムの２０メガベース断片すべてにマッピングした配列タグ、すなわち適格配列タグ密度を使用して、すべての適格サンプルにおけるタグ密度比として適格配列ドースを決定した。平均配列ドース、標準偏差、及び変動係数を、全体ゲノムにおける２０メガベース断片のすべてに対して計算し、最小の変動性を有する２０メガベース配列は、５番染色体における同定された正規化配列（13000014〜33000033)であり（表８参照）、これを使用してテストサンプルにおける関心対象配列のドースを計算した（表９参照）。表８は、テストサンプルにおける１１番染色体の関心対象配列（810000082〜103000103bp）のドースを示し、これは、関心対象配列にマッピングした配列タグと、同定した正規化配列にマッピングした配列タグとの比として計算した。図１０は、７個の適格サンプルにおける関心対象配列の配列ドース（○）、及びテストサンプルにおける対応配列の配列ドース（◇）を示す。平均を実線で示し、平均から５標準偏差に設定した部分異数性陽性診断のための計算した閾値を破線で示す。部分異数性診断は、設定した閾値より少ないテストサンプルの配列ドースに基づいて行った。テストサンプルは、染色体解析によって、１１番染色体におけるｑ２１〜ｑ２３欠失を有することを検証した。 Example 4
Determination of Partial Aneuploidy Sequence dose use was applied to partial aneuploidy evaluation in biological test samples of cfDNA prepared from blood plasma and sequenced as described in Example 1. The sample was confirmed to be derived from a specimen having a partial deletion in chromosome 11 by chromosome analysis. Sequencing data analysis for partial aneuploidy (partial in chromosome 11, ie, deletion of q21-q23) was performed as described for chromosomal aneuploidy in the examples above. By mapping the sequence tag to chromosome 11 in the test sample, the tag count number obtained for the corresponding sequence of chromosome 11 in the qualified sample of the tag count number between base pairs 81000082 to 103000103 in the q-arm of the chromosome (data (Not shown) revealed a significant loss. All eligible samples using a sequence tag (810000082-103000103 bp) mapped to chromosome 11 of interest in each eligible sample and a sequence tag mapped to all 20 megabase fragments of the entire genome in the qualified sample, ie, qualified sequence tag density The qualifying sequence dose was determined as the tag density ratio in. The average sequence dose, standard deviation, and coefficient of variation are calculated for all 20 megabase fragments in the entire genome, and the 20 megabase sequence with minimal variability is the identified normalized sequence on chromosome 5 (13000014- 33000033) (see Table 8), which was used to calculate the dose of the sequence of interest in the test sample (see Table 9). Table 8 shows the dose of the sequence of interest of chromosome 11 (810000082-103000103 bp) in the test sample, which is the ratio of the sequence tag mapped to the sequence of interest and the sequence tag mapped to the identified normalized sequence. As calculated. FIG. 10 shows the sequence dose (◯) of the sequence of interest in 7 eligible samples and the sequence dose (◇) of the corresponding sequence in the test sample. The average is shown as a solid line, and the calculated threshold for partial aneuploidy positive diagnosis set to 5 standard deviations from the average is shown as a dashed line. The partial aneuploidy diagnosis was performed based on the sequence dose of the test sample that was less than the set threshold value. The test sample was verified by chromosome analysis to have a q21-q23 deletion in chromosome 11.

したがって、染色体異数性を同定することの他に、本発明方法を使用して部分異数性を同定することができる。 Thus, in addition to identifying chromosomal aneuploidy, partial aneuploidy can be identified using the methods of the present invention.

実施例５
異数性検出の実証
実施例２及び３で説明し、また図２〜６に示したサンプルで得られた配列データをさらに解析して、母体サンプルにおける異数性をうまく同定することに成功する本発明方法の感度を示した。２１番、１８番、１３番染色体、Ｘ染色体及びＹ染色体のための正規化染色体ドースは、平均の標準偏差に対する分布（Ｙ軸）として解析し、また図１１に示す。使用した正規化染色体は、基準（デノミネータ）として示した（Ｘ軸）。 Example 5
Demonstration of aneuploidy detection The sequence data obtained in the samples described in Examples 2 and 3 and shown in FIGS. 2-6 are further analyzed to successfully identify aneuploidy in the maternal sample. The sensitivity of the method of the present invention is shown. Normalized chromosomal doses for chromosomes 21, 18, 13, X, and Y are analyzed as a distribution (Y axis) with respect to the average standard deviation and are shown in FIG. The normalized chromosome used was indicated as a reference (denominator) (X axis).

図１１Ａは、２１番染色体の正規化染色体として１４番染色体を使用するとき、異変なしサンプル（○）及び２１トリソミーサンプル（Ｔ２１；△）における２１番染色体の平均からの標準偏差に対する染色体ドース分布を示す。図１１Ｂは、１８番染色体の正規化染色体として８番染色体を使用するとき、異変なしサンプル（○）及び１８トリソミーサンプル（Ｔ１８；△）における１８番染色体の平均からの標準偏差に対する染色体ドース分布を示す。図１１Ｃは、１３番染色体の染色体ドースを決定するための正規化染色体として３番、４番、５番及び６番染色体のグループの平均配列タグ密度を使用し、異変なしサンプル（○）及び１３トリソミーサンプル（Ｔ１３；△）における１３番染色体の平均からの標準偏差に対する染色体ドース分布を示す。図１１Ｄは、Ｘ染色体の正規化染色体として４番染色体を使用するとき、異変なし女児サンプル（○）、異変なし男児サンプル（△）及びＸモノソミーサンプル（ＸＯ；＋）におけるＸ染色体の平均からの標準偏差に対する染色体ドース分布を示す。図１１Ｅは、Ｙ染色体の染色体ドースを決定するための正規化染色体として１〜２２番染色体及びＸ染色体のグループの平均配列タグ密度を使用するとき、異変なし男児サンプル（○）、異変なし女児サンプル（△）、及びＸモノソミーサンプル（＋）におけるＹ染色体の平均からの標準偏差に対する染色体ドース分布を示す。 FIG. 11A shows the chromosomal dose distribution with respect to the standard deviation from the average of chromosome 21 in the non-abnormal sample (◯) and trisomy sample (T21; Δ) when chromosome 14 is used as the normalized chromosome of chromosome 21. Show. FIG. 11B shows the chromosomal dose distribution with respect to the standard deviation from the average of chromosome 18 in the unchanged sample (◯) and trisomy sample (T18; Δ) when chromosome 8 is used as the normalized chromosome of chromosome 18. Show. FIG. 11C uses the average sequence tag density of the group of chromosomes 3, 4, 5, and 6 as normalized chromosomes to determine the chromosomal dose of chromosome 13, and samples without change (◯) and 13 The chromosome dose distribution with respect to the standard deviation from the average of the chromosome 13 in a trisomy sample (T13; (triangle | delta)) is shown. FIG. 11D shows from the average of the X chromosome in the non-abnormal girl sample (◯), the non-abnormal boy sample (Δ), and the X monosomy sample (XO; +) when using chromosome 4 as the normalized chromosome of the X chromosome. The chromosomal dose distribution with respect to the standard deviation is shown. FIG. 11E shows that when using the average sequence tag density of the group of chromosomes 1 to 22 and the X chromosome as normalization chromosomes for determining the chromosomal dose of the Y chromosome, no abnormal boy sample (○), no abnormal girl sample (Δ) and chromosomal dose distribution with respect to the standard deviation from the average of the Y chromosome in the X monosomy sample (+).

データは、２１トリソミー、１８トリソミー、１３トリソミーを異変なし（正常）サンプルから明確に区別されたことを示している。Ｘモノソミーサンプルは、異変なし女児サンプルのＸ染色体ドースよりも明らかに低いＸ染色体ドースを有し、また異変なし男児サンプルのＹ染色体ドースよりも明らかに低いＹ染色体ドースを有する（図１１Ｅ参照）ものとして容易に同定した。したがって、本発明方法は、母体血液サンプルにおける染色体異数性の有無を決定するのに感度よく、また特化したものである。 The data show that trisomy 21, trisomy 18 and trisomy 13 were clearly distinguished from the unaltered (normal) sample. The X monosomy sample has a clearly lower X-chromosome dose than the X-chromosome dose of the unabnormal girl sample, and a clearly lower Y-chromosome dose than the Y-chromosome dose of the unabnormal boy sample (see FIG. 11E) As easily identified. Thus, the method of the present invention is sensitive and specialized for determining the presence or absence of chromosomal aneuploidy in a maternal blood sample.

実施例６
母体血液からの無細胞胎児ＤＮＡに関して大量並列ＤＮＡシークエンシングを使用する
胎児異常の決定：トレーニングセット１とは独立したテストセット１
研究は、米国１３か所の臨床施設における有資格現地臨床研究員によって２００９年４月から２０１０年７月までの間に、各機関での機関審査委員会（ＩＲＢ：institutional review board）が承認したヒト検体プロトコルの下で行った。インフォームドコンセントの同意書を、研究関与前に各検体から取得した。プロトコルは、血液サンプル及び臨床データを得て非侵襲性妊婦遺伝子診断方法の発展を支援するよう策定した。１８歳以上の妊婦を有資格者とした。臨床的にＣＶＳ又は羊水穿刺を受ける患者に対して、手順を実施する前に血液した採取し、また胎児の核型結果を収集した。末梢血液サンプル（管２個又は総量２０ｍＬまでの）をすべての検体からクエン酸デキストロース（ＡＣＤ：acid citrate dextrose）管（ベクトン・ディッキンソン社製）内に引き込んだ。すべてのサンプルに対し、身元が判明しないようにし、匿名患者ＩＤ番号を割り当てた。血液サンプルを、研究用に設けた温度制御輸送容器内で一晩かけて研究室に輸送した。血液引込みとサンプル受入れとの間に経過した時間を、サンプル評価の一部として記録した。 Example 6
Use massively parallel DNA sequencing on cell-free fetal DNA from maternal blood
Determination of fetal abnormalities: Test set 1 independent of training set 1
The study was approved by an institutional review board (IRB) at each institution between April 2009 and July 2010 by qualified local clinical researchers at 13 US clinical facilities. Performed under specimen protocol. Informed consent consent was obtained from each specimen prior to study involvement. The protocol was developed to obtain blood samples and clinical data to support the development of noninvasive maternal genetic diagnosis methods. Pregnant women over the age of 18 were qualified. For patients who received CVS or amniocentesis clinically, blood samples were collected prior to performing the procedure, and fetal karyotype results were collected. Peripheral blood samples (2 tubes or up to a total volume of 20 mL) were drawn from all specimens into citrate dextrose (ACD) tubes (Becton Dickinson). All samples were unidentified and assigned an anonymous patient ID number. Blood samples were transported to the laboratory overnight in a temperature controlled transport container provided for research. The time elapsed between blood draw and sample acceptance was recorded as part of the sample evaluation.

現地研究コーディネーターは、匿名患者ＩＤ番号を使用して、患者のその時の妊娠状況及び履歴に関連する臨床データを研究症例報告書（ＣＲＦ：case report forms）に書き込んだ。侵襲性出生前手順によるサンプルからの胎児核型の細胞発生学的解析を、地方の研究所毎に行い、この結果も研究ＣＲＦに記録した。ＣＲＦで得られたすべてのデータを、研究所の臨床データベースに入力した。無細胞血漿を、血管穿刺後２４〜４８時間内に個別の血液管から２段階遠心分離処理を使用して得た。単独の血液管からの血漿でシークエンシング解析に十分である。無細胞ＤＮＡは、製造業者の取扱説明書に従ってＱＩＡｍｐＤＮＡ血液ミニキット（キアゲン社）を使用して無細胞血漿から抽出した。無細胞ＤＮＡフラグメントは、長さが約１７０塩基対（ｂｐ）であることが分かっているため（Fan et al.,Clin Chem 56:1279-1286 [2010]参照）、シークエンシング前にはデータのフラグメント化は不要である。 The field study coordinator used the anonymous patient ID number to write clinical data related to the patient's current pregnancy status and history in a case report forms (CRF). Cytogenetic analysis of fetal karyotypes from samples by invasive prenatal procedures was performed at each local laboratory and the results were also recorded in the study CRF. All data obtained with the CRF was entered into a laboratory clinical database. Cell-free plasma was obtained from individual blood tubes using a two-stage centrifugation process within 24-48 hours after vascular puncture. Plasma from a single blood tube is sufficient for sequencing analysis. Cell-free DNA was extracted from cell-free plasma using the QIAmpDNA blood mini kit (Qiagen) according to the manufacturer's instructions. Cell-free DNA fragments are known to be about 170 base pairs (bp) in length (see Fan et al., Clin Chem 56: 1279-1286 [2010]), so the data before sequencing Fragmentation is not necessary.

トレーニングセットのサンプルのために、ｃｆＤＮＡを、シークエンシングライブラリ調製及びイルミナ・ゲノム・アナライザIIX装置（http://www.illumina.com/参照）により標準製造業者プロトコルを使用するシークエンシングを行うよう、プログノシス・バイオサイエンシズ社（カリフォルニア州ラ・ジョラ）に送った。３６塩基対の単独末端リードを得た。シークエンシングが完了した際に、すべての塩基コールを収集し解析する。テストセットのサンプルに対して、シークエンシングライブラリを調製し、シークエンシングを、イルミナ・ゲノム・アナライザIIX装置上で行った。シークエンシングライブラリの調製は以下のようにして行った。記載された全編プロトコルは、実質的にイルミナ社が規定した標準プロトコルであり、増幅ライブラリの精製においてのみイルミナ社プロトコルから異なる。イルミナ社プロトコルはゲル電気泳動を使用して増幅ライブラリを精製することを指示するが、本明細書に記載するプロトコルは、同一精製ステップに対して磁気ビードを使用する。母体血漿から抽出した精製ｃｆＤＮＡの約２ｎｇを使用して、一次シークエンシングライブラリを調製し、この調製は、製造業者の指示書に従い、イルミナ社のＮＥＢＮｅｘｔ（商標名）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１（Part No.E6000L; New England Biolabs, Ipswich, MA）を使用して行った。精製カラムの代わりにエージェンコート社の磁気ビード及び試薬を使用して行った、アダプタ結合生成物の最終精製を除くすべてのステップを、プロトコルに従って、ゲノムＤＮＡライブラリのサンプル調製用ＮＥＢＮｅｘｔ（商標名）試薬とともに行い、イルミナ（登録商標）ＧＡIIを使用してシークエンシングした。ＮＥＢＮｅｘｔ(商標名）プロトコルは、ほぼイルミナ社が規定したもの(grcf.jhml.edu/hts/protocols/11257047_ChIP_Sample_Prep.pdf.で入手可能)に従う。 For the training set samples, the cfDNA is sequenced using the standard manufacturer protocol with sequencing library preparation and Illumina Genome Analyzer IIX instrument (see http://www.illumina.com/). Sent to Prognosis Biosciences (La Jolla, CA). A 36 base pair single terminal read was obtained. Collect and analyze all base calls when sequencing is complete. Sequencing libraries were prepared for test set samples and sequencing was performed on an Illumina Genome Analyzer IIX instrument. The sequencing library was prepared as follows. The full-length protocol described is substantially a standard protocol defined by Illumina and differs from the Illumina protocol only in the purification of the amplification library. Although the Illumina protocol directs gel electrophoresis to purify the amplification library, the protocol described herein uses magnetic beads for the same purification step. Approximately 2 ng of purified cfDNA extracted from maternal plasma was used to prepare the primary sequencing library, which was prepared according to the manufacturer's instructions by the Illumina NEBNext ™ DNA sample Prep DNA reagent set 1 ( Part No. E6000L; New England Biolabs, Ipswich, MA). All steps, except final purification of the adapter-bound product, performed using Agent Coat magnetic beads and reagents in place of the purification column, were performed according to the protocol according to the NENEXT ™ reagent for sample preparation of genomic DNA libraries. And sequenced using Illumina® GAII. The NEBNext (TM) protocol follows approximately that specified by Illumina (available at grcf.jhml.edu/hts/protocols/11257047_ChIP_Sample_Prep.pdf.).

４０μｌに含まれる約２ｎｇの精製ｃｆＤＮＡフラグメントのオーバーハング（突出）部分を、ＮＥＢＮｅｘｔ（登録商標）末端修復モジュールによってリン酸化した平滑末端に転換し、この転換は、４０μｌのｃｆＤＮＡを、ＮＥＢＮｅｘｔ（登録商標）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１に供給された、５μｌの１０Ｘリン酸化バッファ、２μｌのデオキシヌクレオチド混合溶液（各ｄＮＴＰ毎に１０ｍＭ）、１μｌの１：５希釈ＤＮＡポリメラーゼ、１μｌのＴ４ＤＮＡポリメラーゼ、及び１μｌのＴ４ポリヌクレオチドキナーゼとともに１５分間にわたり２０゜Ｃで培養することによって行った。サンプルを４゜Ｃに冷却し、また以下のように、ＱＩＡＱｕｉｃｋＰＣＲ精製キット（キアゲン社カリフォルニア州バレンシア）に供給されたＱＩＡＱｕｉｃｋカラムを使用して精製した。５０μｌ反応物を１.５ｍｌの微量遠心管内に移し入れ、２５０μｌのキアゲンバッファＰＢを添加した。この結果物３００μｌをＱＩＡＱｕｉｃｋカラム内に移し入れ、微量遠心管内において１分間にわたり１３，０００ＲＰＭで遠心分離した。カラムを７５０μｌのキアゲンバッファＰＥで洗浄し、再び遠心分離した。残留エタノールを、５分間１３，０００ＲＰＭで追加遠心分離することによって除去した。ＤＮＡを、遠心分離によって３９μｌのキアゲンバッファＥＢ内で溶離した。平滑末端化したＤＮＡのｄＡ尾端化（tailing）を、クレノウフラグメント（３′〜５′エキソマイナス）（ＮＥＢＮｅｘｔ（商標名）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１）を含む１６μｌのｄＡ尾端化マスター混合物を使用し、また製造業者のＮＥＢＮｅｘｔ（登録商標）ｄＡ尾端化モジュールに従って、３０分間にわたり３７゜Ｃで培養することによって行った。サンプルを４゜Ｃに冷却し、以下のように、ＭｉｎＥｌｕｔｅＰＣＲ精製キット（キアゲン社、カリフォルニア州バレンシア）に供給されたカラムを使用して精製した。つぎに、クレノウフラグメントを加熱不活性化し、この不活性化は反応混合物を７５゜Ｃで５分間にわたり培養することによって行った。５０μｌ反応物を１.５ｍｌの微量遠心管内に移し入れ、２５０μｌのキアゲンバッファＰＢを添加した。３００μｌをＭｉｎＥｌｕｔｅカラムに移し入れ、微量遠心管内において１分間にわたり１３，０００ＲＰＭで遠心分離した。残留エタノールを、５分間１３，０００ＲＰＭで追加遠心分離することによって除去した。ＤＮＡを、遠心分離によって１５μｌのキアゲンバッファＥＢ内で溶離した。１０マイクロリットルのＤＮＡ溶離物を、１μｌのイルミナ・ゲノミック・アダプタ・オリゴ・ミックス（Part No. 1000521）１：５希釈液、１５μｌの２Ｘクイック・リゲーション反応バッファ、及び４μｌのＴ４ＤＮＡリガーゼとともに、ＮＥＢＮｅｘｔ（登録商標）クイック・リゲーション・モジュールに従って、１５分間２５゜Ｃで培養した。サンプルを４゜Ｃに冷却し、以下のように、ＭｉｎＥｌｕｔｅカラムを使用して精製した。１５０ミリリットルのキアゲンバッファＰＥを３０μｌの反応物に添加し、全体量をＭｉｎＥｌｕｔｅカラムに移し入れ、このカラムを微量遠心管内において１３，０００ＲＰＭで１分間にわたり遠心分離した。カラムを７５０μｌのキアゲンバッファＰＥで洗浄し、また再び遠心分離した。残留エタノールを、５分間１３，０００ＲＰＭで追加遠心分離することによって除去した。ＤＮＡを、遠心分離によって２８μｌのキアゲンバッファＥＢ内で溶離した。２３マイクロリットルのアダプタ結合ＤＮＡに対して１８サイクルのＰＣＲ（９８゜Ｃで３０秒間；９８゜Ｃで１０秒間、６５゜Ｃで３０秒間及び７２゜Ｃで３０秒間を１８サイクル、７２゜Ｃで５分間の最終延長、及び４゜Ｃで保持）を加え、この場合、製造業者の取扱説明書に従って、ＮＥＢＮｅｘｔ（登録商標）ＤＮＡサンプルＰｒｅｐＤＮＡ試薬セット１に供給されたイルミナゲノミックＰＣＲプライマー（Part No. 100537及び1000538）及びフュージョン・ＨＦ・ＰＣＲマスター・ミックスを使用した。増幅した生成物を、エージェンコートAMPure XP PCRの精製システム（Agencourt Bioscience Corporation, Berverly, MA）を使用して精製し、この精製は、製造業者の取扱説明書（www.beckmangenomics.com/products/AMPureXPProtocol_000387v001.pdfで入手可能）に従って行った。エージェンコートAMPure XP PCRの精製システムは、組込まれなかったｄＮＴＰ、プライマー、プライマー二量体、塩、及び他の汚染物を除去し、また１００ｂｐより多い単位複製配列を回収する。精製した増幅生成物は、４０μｌのキアゲン・ＥＢ・バッファで溶離し、増幅したライブラリの濃度及びサイズ分布を、２１００バイオアナライザ（Agilent technologies Inc., Santana Clara, CA）のためのアジレント・ＤＮＡ・1000・キットを使用して解析した。トレーニングサンプルセット及びテストサンプルセットの双方に対して３６塩基対の単独末端リードをシークエンシングした。 The overhang portion of approximately 2 ng of the purified cfDNA fragment contained in 40 μl is converted to blunt ends phosphorylated by the NEBNext® end repair module, which converts 40 μl of cfDNA into NEBNext®. ) 5 μl of 10 × phosphorylated buffer, 2 μl of deoxynucleotide mixed solution (10 mM for each dNTP), 1 μl of 1: 5 diluted DNA polymerase, 1 μl of T4 DNA polymerase, and 1 μl supplied to DNA sample Prep DNA reagent set 1) By culturing at 20 ° C. for 15 minutes with a T4 polynucleotide kinase. Samples were cooled to 4 ° C and purified using a QIAQuick column supplied with the QIAQuick PCR purification kit (Qiagen Valencia, CA) as follows. 50 μl reaction was transferred into a 1.5 ml microfuge tube and 250 μl Qiagen buffer PB was added. 300 μl of this result was transferred into a QIAQuick column and centrifuged at 13,000 RPM for 1 minute in a microcentrifuge tube. The column was washed with 750 μl Qiagen buffer PE and centrifuged again. Residual ethanol was removed by additional centrifugation at 13,000 RPM for 5 minutes. DNA was eluted in 39 μl Qiagen buffer EB by centrifugation. DA tailing of blunt ended DNA was performed using 16 μl of dA tailing master containing Klenow fragment (3′-5 ′ exominus) (NEBNext ™ DNA sample Prep DNA reagent set 1). The mixture was used and incubated at 37 ° C. for 30 minutes according to the manufacturer's NEBNext® dA tail-end module. Samples were cooled to 4 ° C. and purified using columns supplied with the MinElute PCR purification kit (Qiagen, Valencia, Calif.) As follows. The Klenow fragment was then heat inactivated and this inactivation was performed by incubating the reaction mixture at 75 ° C. for 5 minutes. 50 μl reaction was transferred into a 1.5 ml microfuge tube and 250 μl Qiagen buffer PB was added. 300 μl was transferred to a MinElute column and centrifuged at 13,000 RPM for 1 minute in a microcentrifuge tube. Residual ethanol was removed by additional centrifugation at 13,000 RPM for 5 minutes. DNA was eluted in 15 μl Qiagen buffer EB by centrifugation. Ten microliters of DNA eluate, along with 1 μl of Illumina Genomic Adapter Oligo Mix (Part No. 1000521) 1: 5 dilution, 15 μl of 2X quick ligation reaction buffer, and 4 μl of T4 DNA ligase, NEBNext Incubated for 15 minutes at 25 ° C. according to the (registered trademark) quick ligation module. The sample was cooled to 4 ° C. and purified using a MinElute column as follows. 150 milliliters of Qiagen buffer PE was added to 30 μl reaction, the entire volume was transferred to a MinElute column, and the column was centrifuged in a microcentrifuge tube at 13,000 RPM for 1 minute. The column was washed with 750 μl Qiagen buffer PE and centrifuged again. Residual ethanol was removed by additional centrifugation at 13,000 RPM for 5 minutes. DNA was eluted in 28 μl Qiagen buffer EB by centrifugation. 18 cycles of PCR on 23 microliters of adapter-bound DNA (98 ° C for 30 seconds; 98 ° C for 10 seconds, 65 ° C for 30 seconds and 72 ° C for 30 seconds, 18 cycles at 72 ° C) 5 min final extension and hold at 4 ° C), in this case the Illuminogenomic PCR primers (Part No.) supplied to the NEBNext® DNA sample Prep DNA reagent set 1 according to the manufacturer's instructions. 100537 and 1000538) and Fusion HF PCR Master Mix. The amplified product is purified using the purification system of Agencourt AMPure XP PCR (Agencourt Bioscience Corporation, Berverly, Mass.), Which can be obtained from the manufacturer's instructions (www.beckmangenomics.com/products/AMPureXPProtocol_000387v001 (Available in .pdf). The Agent Coat AMPure XP PCR purification system removes unincorporated dNTPs, primers, primer dimers, salts, and other contaminants, and recovers amplicons greater than 100 bp. The purified amplification product was eluted with 40 μl Qiagen EB buffer, and the concentration and size distribution of the amplified library was determined using an Agilent DNA 1000 for 2100 Bioanalyzer (Agilent technologies Inc., Santana Clara, Calif.). -Analyzed using a kit. A 36 base pair single-ended read was sequenced for both the training sample set and the test sample set.

データ解析及びサンプル分類
長さが３６塩基の配列リードをＵＣＳＣデータベースから得られるヒトゲノムアセンブリ（http://hgdownload.cse.ucsc.edu/goldenPath/hg18/bigZips/参照）に整列させた。アラインメントは、アラインメント中に２つの塩基の不一致まで許容するボータイ（Bowtie）ショートリードアライナー（バージョン0.12.5）を利用して行った（Langmead et al., Genome Biol 10:R25 [2009]参照）。単独のゲノム位置にあいまいにマッピングされたリードのみを排除した。リードをマッピングしたゲノム位置をカウントし、また染色体ドースの計算に含めた（後の説明参照）。男児及び女児からの配列タグを区別することなしにマッピングしたＹ染色体の領域（とくに、塩基０から塩基２×１０^６に、塩基１０×１０^６から塩基１３×１０^６に、塩基２３×１０^６からＹ染色体の末端にわたる領域）を解析から排除した。 Data analysis and sample classification Sequence reads of 36 bases in length were aligned to the human genome assembly (see http://hgdownload.cse.ucsc.edu/goldenPath/hg18/bigZips/) obtained from the UCSC database. Alignment was performed using a Bowtie short read aligner (version 0.12.5) that allows for two base mismatches during alignment (see Langmead et al., Genome Biol 10: R25 [2009]). Only reads that were ambiguously mapped to a single genomic location were excluded. The genomic locations to which the reads were mapped were counted and included in the chromosomal dose calculation (see later description). Y chromosome region mapped without distinguishing sequence tags from boys and girls (especially from base 0 to base 2 × 10 ⁶ , base 10 × 10 ⁶ to base 13 × 10 ⁶ , base 23 × 10 ⁶ To the end of the Y chromosome) was excluded from the analysis.

配列リードの染色体分布におけるラン内及びラン間シークエンシング変動は、マッピングされた配列部位分布における胎児異数性の効果を損なう恐れがある。このような変動を補正するため、染色体ドースは、所定正規化染色体配列で観測されたカウントに正規化された所定関心対象染色体に対するマッピング部位のカウントとして計算した。上述したように、正規化された染色体配列は、単独染色体又は染色体グループにより構成することができる。正規化染色体配列は、まず各常染色体を関心対象染色体を有するカウントの比における潜在的なデノミネーター（分母）としてみなし、トレーニングサンプルセットにおける関心対象の２１番染色体、１８番染色体、１３番染色体及びＸ染色体の2倍体核型を有する、異変なしサンプル、すなわち適格サンプルであるサンプルの部分集合内で同定した。デノミネーター染色体、すなわちシークエンシングラン内及びシークエンシングラン間での染色体ドース変動を最少化した正規化染色体配列を選択した。各関心対象染色体は、明確な正規化染色体配列（デノミネーター）を有すると決定した（表１０参照）。どの単独染色体も１３番染色体の正規化染色体配列として同定されず、なぜならどの染色体も、サンプルにわたる１３番染色体のドース変動を減少するとの決定はされなかったから、すなわち、１３番染色体のＮＣＶ値の分散が十分減少されず、Ｔ１３異数性を正確に同定できなかったからである。２〜６番染色体をランダムに選択し、１３番染色体の挙動を擬態するグループとしての能力をテストした。２〜６番染色体のグループは、トレーニングサンプルにおける１３番染色体のドース変動性をほぼ減少させることが分かり、したがって、１３番染色体の正規化染色体配列として選択した。上述したように、Ｙ染色体の染色体ドースにおける変動性は３０より多く、これらから独立的に単独染色体を、Ｙ染色体ドース決定における正規化染色体配列として使用した。２〜６番染色体のグループは、トレーニングサンプルにおけるＹ染色体のドース変動性をほぼ減少させることが分かり、したがって、Ｙ染色体の正規化染色体配列として選択した。 Intra-run and inter-run sequencing variations in the chromosomal distribution of sequence reads can compromise the effect of fetal aneuploidy on the mapped sequence site distribution. To correct for such variations, chromosomal doses were calculated as the number of mapping sites for a given chromosome of interest normalized to the count observed for a given normalized chromosome sequence. As described above, the normalized chromosomal sequence can be composed of a single chromosome or a chromosomal group. The normalized chromosomal sequence first considers each autosome as a potential denominator (denominator) in the ratio of counts having the chromosome of interest, and is the chromosome of interest 21, chromosome 13, chromosome 13 in the training sample set. Identified within a subset of samples that have diploid karyotypes of the X chromosome, that is, eligible samples. Denominator chromosomes, ie normalized chromosomal sequences that minimize chromosomal dose variation within and between sequencing runs, were selected. Each chromosome of interest was determined to have a well-defined normalized chromosomal sequence (denominator) (see Table 10). No single chromosome was identified as a normalized chromosomal sequence of chromosome 13, since no chromosome was determined to reduce the dose variation of chromosome 13 across the sample, ie the variance of NCV values of chromosome 13. This is because the T13 aneuploidy could not be accurately identified. Chromosomes 2-6 were selected at random and tested for ability as a group to mimic the behavior of chromosome 13. The group of chromosomes 2-6 was found to substantially reduce the dose variability of chromosome 13 in the training sample and was therefore selected as the normalized chromosome sequence for chromosome 13. As mentioned above, the variability in chromosome dose of the Y chromosome is greater than 30, and independently from these, a single chromosome was used as the normalized chromosome sequence in the Y chromosome dose determination. The group of chromosomes 2-6 was found to substantially reduce Y chromosome dose variability in the training sample and was therefore selected as the normalized chromosome sequence for the Y chromosome.

適格サンプルにおける関心対象染色体それぞれの染色体ドースは、関心対象染色体それぞれのマッピングした配列タグの総数の、残りの染色体それぞれのマッピングした配列タグの総数に対する変動の判断基準尺度をなす。したがって、適格染色体ドースは、サンプル間変動が関心対象染色体の変動に最も近似し、また更なる統計学的評価のための値を正規化する理想配列として供する染色体又は染色体グループ、すなわち正規化染色体配列を同定することができる。 The chromosomal dose of each chromosome of interest in the eligible sample provides a measure of variation in the total number of mapped sequence tags for each chromosome of interest relative to the total number of mapped sequence tags for each remaining chromosome. Thus, a qualifying chromosomal dose is a chromosome or chromosomal group, i.e., a normalized chromosomal sequence that provides inter-sample variation that most closely approximates the variation of the chromosome of interest and also normalizes values for further statistical evaluation. Can be identified.

トレーニングセットにおけるすべてのサンプル、すなわち適格サンプル及び異変ありサンプルの染色体ドースも、以下に説明するようにテストサンプルにおいて異数性を同定するとき、閾値を決定する基礎として供し得る。 The chromosomal dose of all samples in the training set, i.e. eligible and anomalous samples, can also serve as a basis for determining the threshold when identifying aneuploidy in a test sample, as described below.

テストセットにおける各サンプルの関心対象染色体それぞれに対して正規化値を決定し、異数性有無を決定するのに使用する。正規化値は染色体ドースとして計算することができ、この染色体ドースをさらに計算して正規化された染色体値（ＮＣＶ）を生ずることができる。 A normalized value is determined for each chromosome of interest in each sample in the test set and used to determine the presence or absence of aneuploidy. The normalized value can be calculated as a chromosomal dose, which can be further calculated to yield a normalized chromosomal value (NCV).

テストセットに対して、各サンプルにおける関心対象の２１番染色体、１８番染色体、１３番染色体、Ｘ染色体及びＹ染色体それぞれの染色体ドースを計算した。上述の表１０に記載したように、２１番染色体の染色体ドースは、テストサンプルにおける２１番染色体にマッピングされたテストサンプルにおけるタグ数と、９番染色体にマッピングされたテストサンプルにおけるタグ数との比として計算し、１８番染色体の染色体ドースは、テストサンプルにおける１８番染色体にマッピングされたテストサンプルにおけるタグ数と、８番染色体にマッピングされたテストサンプルにおけるタグ数との比として計算し、１３番染色体の染色体ドースは、テストサンプルにおける１３番染色体にマッピングされたテストサンプルにおけるタグ数と、２〜６番染色体にマッピングされたテストサンプルにおけるタグ数との比として計算し、Ｘ染色体の染色体ドースは、テストサンプルにおけるＸ染色体にマッピングされたテストサンプルにおけるタグ数と、６番染色体にマッピングされたテストサンプルにおけるタグ数との比として計算し、Ｙ染色体の染色体ドースは、テストサンプルにおけるＹ染色体にマッピングされたテストサンプルにおけるタグ数と、２〜６番染色体にマッピングされたテストサンプルにおけるタグ数との比として計算した。 For the test set, the chromosome dose of each of the chromosome 21, chromosome 18, chromosome 13, X chromosome and Y chromosome of interest in each sample was calculated. As described in Table 10 above, the chromosome dose of chromosome 21 is the ratio of the number of tags in the test sample mapped to chromosome 21 in the test sample to the number of tags in the test sample mapped to chromosome 9. The chromosome dose of chromosome 18 is calculated as the ratio of the number of tags in the test sample mapped to chromosome 18 in the test sample and the number of tags in the test sample mapped to chromosome 8; The chromosome dose of the chromosome is calculated as the ratio of the number of tags in the test sample mapped to chromosome 13 in the test sample and the number of tags in the test sample mapped to chromosome 2-6, and the chromosome dose of X chromosome is , Map to the X chromosome in the test sample Calculated as a ratio of the number of tags in the test sample mapped to the number of tags in the test sample mapped to chromosome 6, and the chromosome dose of the Y chromosome is the number of tags in the test sample mapped to the Y chromosome in the test sample. And the ratio of the number of tags in the test sample mapped to chromosomes 2-6.

正規化した染色体値
各テストサンプルにおける関心対象染色体それぞれの染色体ドース、トレーニングセットの適格サンプルにおいて決定した対応の染色体ドースの平均を使用し、正規化した染色体値（ＮＣＶ：normalized chromosome value）は、次式で計算される。
ここで、
は、それぞれトレーニングセットにおけるｊ番染色体ドースに対する推定した平均及び標準偏差であり、ｘ_ijはサンプルｉにおける観測したｊ番染色体ドースである。染色体ドースが正規分布しているとき、ＮＣＶはドースに対して統計学的にｚスコアに等しい。異変なしサンプルからのＮＣＶの変位値−変位値プロットにおける線形性からの大きな逸脱は観測されない。さらに、ＮＣＶの正規性標準検定は、正規性の帰無仮説を却下できない。 Normalized chromosome value Using the average of the chromosome dose of each chromosome of interest in each test sample and the corresponding chromosome dose determined in the eligible sample of the training set, the normalized chromosome value (NCV) is Calculated by the formula.
here,
Are the estimated mean and standard deviation for the j-th chromosome dose in the training set, respectively, and x _ij is the observed j-th chromosome dose in sample i. When the chromosomal dose is normally distributed, NCV is statistically equal to the z-score for the dose. No significant deviations from linearity in the NCV displacement-displacement value plots from the unabated samples are observed. Furthermore, NCV's normality standard test cannot reject the normality null hypothesis.

テストセットに対して、各サンプルにおける関心対象の２１番染色体、１８番染色体、１３番染色体、Ｘ染色体及びＹ染色体それぞれのＮＣＶを計算した。安全かつ有効な分類スキームを保証するため、控えめの境界を異数性分類に選択した。常染色体の異数性状態を分類するため、染色体に異変あり（すなわち、その染色体が異数性である）と分類するのにＮＣＶ＞４であることを必要とし、染色体に異変なしと分類するにはＮＣＶ＜２.５であることを必要とした。ＮＣＶが２.５と４.０との間である常染色体を有するサンプルは「ノーコール」として分類した。 For each test set, the NCV of each of the chromosomes of interest 21, chromosome 18, chromosome 13, X chromosome and Y chromosome in each sample was calculated. To ensure a safe and effective classification scheme, conservative boundaries were chosen for aneuploidy classification. To classify an autosomal aneuploidy state, it requires NCV> 4 to classify the chromosome as anomalous (ie, the chromosome is aneuploid), and classifies the chromosome as anomalous Required NCV <2.5. Samples with autosomes with NCV between 2.5 and 4.0 were classified as “no call”.

この検定における性染色体の分類は、Ｘ染色体及びＹ染色体双方に対して、以下のようにＮＣＶを逐次的に適用することによって行った。
１．男児サンプルの平均からＮＣＶＹ＞-２.０標準偏差である場合、このサンプルは男児（ＸＹ）であると分類した。
２．男児サンプルの平均からＮＣＶＹ＜-２.０標準偏差であり、かつ女児サンプルの平均からＮＣＶＸ＞-２.０標準偏差である場合、このサンプルは女児（ＸＸ）であると分類した。
３．男児サンプルの平均からＮＣＶＹ＜-２.０標準偏差であり、かつ女児サンプルの平均からＮＣＶＸ＜-３.０標準偏差である場合、このサンプルはＸモノソミー、すなわちターナー症候群であると分類した。
４．ＮＣＶが上述の基準のいずれにも当てはまらない場合、そのサンプルは、性に関して「ノーコール」として分類した。 The classification of sex chromosomes in this test was performed by sequentially applying NCV to both the X and Y chromosomes as follows.
1. A sample was classified as a boy (XY) if NCV Y> −2.0 standard deviation from the average of the boy sample.
2. A sample was classified as a girl (XX) if NCV Y <−2.0 standard deviation from the average of the boy sample and NCV X> −2.0 standard deviation from the average of the girl sample.
3. If the average of the boy sample is NCV Y <-2.0 standard deviation and the average of the girl sample is NCV X <-3.0 standard deviation, the sample was classified as X monosomy, ie Turner syndrome. .
4). If NCV did not meet any of the above criteria, the sample was classified as “no call” for gender.

結果
人口統計学的研究
合計１，０１４名の患者を２００９年４月から２０１０年７月にかけて登録した。患者の人口統計学的データ、侵襲的処置タイプ及び核型結果を
に列挙する。研究参加者の平均年齢は３５.６歳（１７〜４７歳の範囲）であり、妊娠期間は、６週間、１日から３８週の間、１日の範囲にわたる（平均は１５週と４日）。異常胎児染色体核型の全体的出現率は６.８％で、そのうちＴ２１の事象は２.５％であった。単胎妊娠及び核型を有する９４６検体のうち、９０６例（９６％）は、出生前手順前の胎児異数性に関して少なくとも１つの臨床的に認識される危険因子を示した。単一兆候としての高齢母体の例を除外したとしても、データは、現在のスクリーニングモダリティに対して極めて高い偽陽性率を示している。増大した項部透過性、滑液嚢水腫、又は他の構造的先天性異常の超音波による発見は、この統計群における異常核型を最も多く予測した。 Results Demographic studies A total of 1,014 patients were enrolled from April 2009 to July 2010. Patient demographic data, invasive treatment types and karyotype results are listed in The average age of the study participants is 35.6 years (range 17-47 years) and the gestation period ranges from 1 week to 6 weeks, 1 day to 38 weeks (average is 15 weeks and 4 days) ). The overall incidence of abnormal fetal chromosome karyotype was 6.8%, of which T21 events were 2.5%. Of 946 specimens with single pregnancy and karyotype, 906 (96%) showed at least one clinically recognized risk factor for fetal aneuploidy prior to prenatal procedures. Even excluding the example of an older maternal as a single sign, the data shows a very high false positive rate for the current screening modality. Ultrasound discovery of increased nodal permeability, synovial edema, or other structural congenital anomalies most predicted abnormal karyotypes in this statistical group.

*複数妊娠期間からの胎児結果を含む、**臨床医が評価及び報告した略記：
AMA=高齢母体年齢(Advanced Maternal Age),NT=項部透過性(nuchal translucency) * Includes fetal results from multiple gestation periods ** Abbreviations evaluated and reported by clinicians:
AMA = Advanced Maternal Age, NT = nuchal translucency

この研究母集団で表される種々の民族的背景の分布も表１１に示す。この研究における患者全体のうち６３％がコーカサス系、１７％がラテン系、６％がアジア系、５％が混血系、４％がアフリカ系アメリカ人であった。民族的多様性は、場所毎に大きく変動することが分かった。例えば、ある１つの場所では登録者の６０％がラテン系、２６％がコーカサス系の検体であり、３か所のクリニックはすべて同一の州に位置してラテン系の検体は登録されなかった。予期したとおり、この研究結果において異なった民族に対して顕著の相違は観察されなかった。 The distribution of the various ethnic backgrounds represented in this study population is also shown in Table 11. Of the total patients in this study, 63% were Caucasian, 17% Latin, 6% Asian, 5% mixed, and 4% African American. It was found that ethnic diversity varies greatly from place to place. For example, in one location, 60% of registrants were Latin and 26% were Caucasian specimens, and all three clinics were located in the same state and no Latin specimens were registered. As expected, no significant differences were observed for different ethnicities in the results of this study.

トレーニングデータセット１
トレーニングセット研究は、２００９年４月から２００９年１２月にかけて収集した４３５サンプルの初期逐次蓄積から７１サンプルを選択した。この一次検体シリーズにおける異変あり胎児（異常核型）を孕んだすべての検体を、シークエンシング及びランダム選択のために、また適正サンプル及びデータを有する異変なし検体数に含めた。トレーニングセット患者の臨床的特性は、表１１に示す全体的研究の人口統計に一致した。トレーニングセットのサンプルにおける妊娠期間は１０週０日〜２３週１日にわたる範囲であった。３８を下回るＣＶＳ、３２を下回る羊水穿刺１名の患者は、侵襲的手順タイプとして特定されなかった（異変なし核型４６，ＸＹ）。患者の７０％はコーカサス系、８.５％はラテン系、８.５％はアジア系、８.５％は混血系であった。シークエンシングした６サンプルは、トレーニング目的のためにこのセットから除外した。すなわち、双子を孕んだ検体（以下にさらに説明する）からの４サンプル、調製中に汚染されたＴ１８を有する１サンプル、及び胎児核型６９，ＸＸＸを有する１サンプルを除外し、他の６５サンプルをトレーニングセットとして残した。 Training data set 1
The training set study selected 71 samples from the initial sequential accumulation of 435 samples collected from April 2009 to December 2009. All specimens harboring anomalous fetuses (abnormal karyotypes) in this primary specimen series were included in the number of specimens without anomaly for sequencing and random selection and with appropriate samples and data. The clinical characteristics of the training set patients were consistent with the demographics of the overall study shown in Table 11. The gestation period in the training set samples ranged from 10 weeks 0 days to 23 weeks 1 day. One patient with CVS below 38, amniocentesis below 32 was not identified as an invasive procedure type (no change karyotype 46, XY). 70% of patients were Caucasian, 8.5% Latin, 8.5% Asian and 8.5% mixed. Six samples that were sequenced were excluded from this set for training purposes. 4 samples from twin-bearing specimens (described further below), 1 sample with T18 contaminated during preparation, and 1 sample with fetal karyotype 69, XXX, and 65 other samples Left as a training set.

特異配列部位（すなわち、ゲノムにおいて特異部位として同定されたタグ）の数は、トレーニングセット研究の初期段階における２.２Ｍから後期段階における１３.７Ｍまで変化し、これは時とともにシークエンシング技術が改良されたことに起因する。特異部位におけるこの６倍もの染色体ドースにおけるいかなる潜在的シフトをもモニタリングするため、研究の開始時及び終了時に異なる異変なしサンプルで稼働した。最初の１５の異変なしサンプルでの稼働に対して、平均特異部位数は３.８Ｍ、２１番染色体及び１８番染色体の平均染色体ドースは、それぞれ０.３１４及び０.５２８であった。最後の１５の異変なしサンプルでの稼働に対して、平均特異部位数は１０.７Ｍ、２１番染色体及び１８番染色体の平均染色体ドースは、それぞれ０.３１６及び０.５２９であった。トレーニングセット研究の時間経過における２１番染色体及び１８番染色体の平均染色体ドース間には統計学的相違はなかった。 The number of unique sequence sites (ie, tags identified as unique sites in the genome) varies from 2.2M in the early stages of the training set study to 13.7M in the late stages, which over time has improved sequencing techniques This is due to the fact that To monitor any potential shift in this 6-fold chromosomal dose at a specific site, we worked with different unaltered samples at the start and end of the study. The average number of unique sites was 3.8M and the average chromosome doses for chromosomes 21 and 18 were 0.314 and 0.528, respectively, for the first 15 non-aberrant samples. The average number of unique sites was 10.7M and the average chromosome doses for chromosomes 21 and 18 were 0.316 and 0.529, respectively, for the last 15 samples with no change. There was no statistical difference between the average chromosomal doses of chromosome 21 and chromosome 18 over the time course of the training set study.

２１番、１８番及び１３番染色体のトレーニングセットＮＣＶを図１２に示す。図１２に示す結果は、2倍体ＮＣＶのおおよそ９９％は、平均の±２.５標準偏差内に入る点で正常性の想定に一致する。この６５サンプルセットのうち、Ｔ２１を示す臨床的核型を有する８サンプルは、６〜２０にわたるＮＣＶを有していた。胎児Ｔ１８を示す臨床的核型を有する４サンプルは、３.３〜１２にわたるＮＣＶを有し、胎児１３トリソミー（Ｔ１３）を示す臨床的核型を有する２サンプルは、２.６〜４にわたるＮＣＶを有していた。異変ありサンプルにおけるＮＣＶの分散は、個別サンプルにおける胎児ｃｆＤＮＡのパーセンテージに依存することに起因する。 A training set NCV for chromosomes 21, 18, and 13 is shown in FIG. The results shown in FIG. 12 are consistent with the normality assumption that approximately 99% of diploid NCVs fall within the mean ± 2.5 standard deviations. Of this 65 sample set, 8 samples with clinical karyotype showing T21 had NCV ranging from 6-20. Four samples with clinical karyotype representing fetal T18 have NCV ranging from 3.3 to 12, and two samples with clinical karyotype representing fetal trisomy (T13) are NCV ranging from 2.6 to 4 Had. The variance of NCV in anomalous samples is due to dependence on the percentage of fetal cfDNA in individual samples.

常染色体と同様に、性染色体の平均及び標準偏差をトレーニングセットで確立した。性染色体閾値は、トレーニングセットにおける男女の胎児を１００％同定できた。 As with autosomes, sex chromosome means and standard deviations were established in the training set. The sex chromosome threshold could identify 100% of male and female fetuses in the training set.

テストデータセット１
トレーニングセットからの染色体ドースの平均及び標準偏差を確立した後、４８サンプルのテストセットを、２０１０年１月から２０１０年６月にかけて収集した合計５７５サンプルから選択した。双子妊娠からのサンプルのうち１例を最終解析から除外し、テストセットとして４７サンプルを残した。シークエンシング及び機器を操作するための職員が調製するサンプルは、臨床的核型情報が分からないようにした。妊娠期間の範囲はトレーニングセットで見たのと同様であった（表１１参照）。侵襲性手順の５８％はＣＶＳであり、手順に関する全体的人口統計よりも高いが、トレーニングセットと同様であった。検体の５０％はコーカサス系、２７％はラテン系、１０.４％はアジア系、６.３％はアフリカ系アメリカ人であった。 Test data set 1
After establishing the average and standard deviation of chromosomal doses from the training set, a 48-sample test set was selected from a total of 575 samples collected from January 2010 to June 2010. One sample from twin pregnancies was excluded from the final analysis, leaving 47 samples as a test set. Samples prepared by personnel for sequencing and instrument operation were kept from having clinical karyotype information. The range of gestational age was similar to that seen in the training set (see Table 11). 58% of invasive procedures were CVS, higher than the overall demographics for the procedure, but similar to the training set. 50% of the specimens were Caucasian, 27% Latin, 10.4% Asian, and 6.3% African American.

テストセットにおいて、特異配列タグ数は、約１３Ｍ〜２６Ｍにわたり変動した。異変なしサンプルに対して、２１番染色体及び１８番染色体の染色体ドースは、それぞれ０.３１３及び０.５２７であった。２１番、１８番及び１３番染色体のテストセットＮＣＶを
図１３に示し、分類を表１２に示す。 In the test set, the number of specific sequence tags varied from about 13M to 26M. Chromosome doses for chromosomes 21 and 18 were 0.313 and 0.527, respectively, for the samples without anomaly. The test set NCV for chromosomes 21, 18 and 13 is shown in FIG.

*ＭＸは、Ｙ染色体の確証がないＸ染色体におけるモノソミーである。 * MX is a monosomy on the X chromosome with no confirmation of the Y chromosome.

テストセットにおいて、胎児Ｔ２１を示した臨床的核型を有する１３／１３検体は、５〜１４の範囲のＮＣＶを有すると正確に同定された。胎児Ｔ１８を示した核型を有する８／８検体は、８.５〜２２の範囲のＮＣＶを有すると正確に同定された。このテストセットにおいてＴ１３として分類された核型を有する１つのサンプルは約３のＮＣＶを有してノーコールとして分類された。 In the test set, 13/13 specimens with clinical karyotype that exhibited fetal T21 were correctly identified as having NCV in the range of 5-14. 8/8 specimens with karyotypes exhibiting fetal T18 were correctly identified as having NCVs in the range of 8.5-22. One sample with a karyotype classified as T13 in this test set had a NCV of about 3 and was classified as no call.

テストデータセットに関して、すべての男児サンプルが、複雑核型、４６,ＸＹ＋マーカー染色体（核型によっては同定不能）を有するサンプルを含めて正確に同定された（表３参照）。２０女児サンプルのうち１９サンプルが正確に同定され、また１女児サンプルがノーコールとして分類された。テストセットにおける核型４５,Ｘを有する３サンプルに関して、３個のうち２サンプルがＸモノソミーとして正確に同定され、１サンプルがノーコールとして分類された（表１２参照）。 For the test data set, all boy samples were correctly identified, including samples with complex karyotypes, 46, XY + marker chromosomes (not identifiable by karyotype) (see Table 3). Of the 20 girl samples, 19 were correctly identified and one girl sample was classified as no call. For 3 samples with karyotype 45, X in the test set, 2 out of 3 samples were correctly identified as X monosomy and 1 sample was classified as no call (see Table 12).

双子
トレーニングセットに関して初期に選択した４サンプル及びテストセットにおける１サンプルは、双子妊娠からのものであった。ここで使用される閾値は、双子妊娠の設定で期待されるｃｆＤＮＡ量が異なることによって混乱を生ずるおそれがある。トレーニングセットにおいて、双子サンプルのうち１サンプルからの核型は、単一絨毛膜性の４７,ＸＹ+２１であった。第２の双子サンプルは二卵性双生児であり、羊水穿刺を各胎児に対して個別に行った。この双子妊娠において、二卵性双生児のうち一方は４７,ＸＹ+２１の核型であり、他方は正常な核型４６,ＸＸであった。これらケース双方において、無細胞分類は、サンプルをＴ２１と分類した上述の方法に基づいて行った。トレーニングセットにおける他の２つの双子妊娠は、Ｔ２１に関して異変なしとして正確に分類された（すべての双子は２１番染色体に関して２倍体の核型を示した）。テストセットにおける双子サンプルに関して、核型は双子Ｂ（４６,ＸＸ）に対してのみ確立し、アルゴリズムはＴ２１に関して異変なしと正確に分類した。 The four samples initially selected for the twin training set and one sample in the test set were from twin pregnancy. The thresholds used here can be confused by different cfDNA amounts expected in twin pregnancy settings. In the training set, the karyotype from one of the twin samples was single chorionic 47, XY + 21. The second twin sample was a dizygotic twin and amniocentesis was performed on each fetus individually. In this twin pregnancy, one of the twins was 47, XY + 21 karyotype and the other was normal karyotype 46, XX. In both cases, cell-free classification was performed based on the method described above, where the sample was classified as T21. The other two twin pregnancies in the training set were correctly classified as unchanged for T21 (all twins showed a diploid karyotype with respect to chromosome 21). For twin samples in the test set, the karyotype was established only for twin B (46, XX) and the algorithm correctly classified as no change with respect to T21.

結論
データは、大量並列シークエンシングを使用して妊婦の血液から複数の異常胎児核型を決定することができることを示す。これらデータは、独立したテストセットデータを使用して２１トリソミー及び１８トリソミーを有するサンプルの１００％正確な分類を同定できることを実証している。異常性染色体核型を有する胎児の場合でも、本発明方法のアルゴリズムによって、不正確に分類されるサンプルはなかった。重要なことに、アルゴリズムは、少なくとも一方が異変ありの胎児である双子妊娠の２セットで、Ｔ２１の存在を決定するのによく機能し、これは従来ではみられなかったことである。さらに、この研究は、複数のセンターからの様々な配列サンプルを審査し、商業的臨床的設定で直面しそうな異常核型の範囲のみならず、共通トリソミーの異変がない妊娠を正確に分類し、出生前スクリーニングで今も残存する容認できない高い疑陽性率に対処する意義を示している。データは、将来この方法を用いる大きな能力に価値ある予想を与える。示した特異ゲノム部位の部分集合解析は分散一貫ポアソン計数統計で増加する。 CONCLUSION The data show that multiple abnormal fetal karyotypes can be determined from pregnant woman's blood using massively parallel sequencing. These data demonstrate that independent test set data can be used to identify a 100% accurate classification of samples with trisomy 21 and trisomy 18. Even in the case of a fetus with an abnormal chromosomal karyotype, no sample was classified incorrectly by the algorithm of the present method. Importantly, the algorithm works well in determining the presence of T21 in two sets of twin pregnancies, at least one of which is an abnormal fetus, which has not been seen before. In addition, the study screened various sequence samples from multiple centers to accurately classify pregnancy not only in the range of abnormal karyotypes likely to be encountered in commercial clinical settings, but also in common trisomy abnormalities, It shows the significance of dealing with the unacceptably high false positive rate that still remains in prenatal screening. The data provides valuable predictions for the great ability to use this method in the future. The subset analysis of the specific genomic sites shown increases with distributed consistent Poisson counting statistics.

大量並列シークエンシングを使用して、母体血漿から胎児異常を非侵襲的出生前決定の感度が計数統計によってのみ制限されることを実証したファン及びクウェイク氏の発見に基づく（Fan and Quake, PLos One 5, e10439 [2010]）。シークエンシング情報は全体ゲノムにわたって収集したため、この方法は、任意の異数性又は挿入及び欠失を含む他のコピー数多型を決定することができる。シークエンシングデータを５００キロベースのビンで解析したとき、サンプルのうち１つからの核型は１１番染色体にｑ２１とｑ２３との間で小さい欠失を有し、これをｑ２１から開始する２５Ｍｂ領域におけるタグの相対数の１０％減少として観測された。さらに、トレーニングセットにおいて、３サンプルが細胞発生解析におけるモザイク現象に起因する複雑性染色体核型を有していた。これら核型は、ｉ）４７,ＸＸＸ[９]／４５,Ｘ[６]、ii）４５,Ｘ[３]／４６,ＸＹ[１７]、及び
iii）４７,ＸＸＸ[１３]／４５,Ｘ[７]であった。若干のＸＹを含む細胞を示したサンプルii）はＸＹと正確に分類された。サンプルｉ（ＣＶＳ処置からの）及びiii（羊水穿刺からの）双方は細胞発生解析によるＸＸＸ及びＸ細胞の混合（モザイク型ターナー症候群に一致する）を示し、これらはそれぞれノーコール及びＸモノソミーとして分類された。 Based on the findings of Fan and Quake, PLos One, who used massively parallel sequencing to demonstrate that the sensitivity of noninvasive prenatal decisions from maternal plasma was limited only by counting statistics (Fan and Quake, PLos One 5, e10439 [2010]). Since sequencing information has been collected across the entire genome, this method can determine any aneuploidy or other copy number polymorphisms including insertions and deletions. When the sequencing data was analyzed in a 500 kilobase bin, the karyotype from one of the samples had a small deletion between chromosomes q21 and q23 on chromosome 11 that started with q21 Was observed as a 10% reduction in the relative number of tags in Furthermore, in the training set, 3 samples had complex chromosomal karyotypes due to mosaicism in cell development analysis. These karyotypes are i) 47, XXX [9] / 45, X [6], ii) 45, X [3] / 46, XY [17], and
iii) 47, XXX [13] / 45, X [7]. Sample ii), which showed cells containing some XY, was correctly classified as XY. Samples i (from CVS treatment) and iii (from amniocentesis) show mixed XXX and X cells by cytogenetic analysis (corresponding to mosaic Turner syndrome), which are classified as no call and X monosomy, respectively. It was.

アルゴリズムを検査するにあたり、他の興味深いデータポイントは、テストセットからの１サンプルに関して、２１番染色体における-５と-６との間にＮＣＶを持つことを観測した（図１３参照）。このサンプルは細胞発生によって２１番染色体に２倍体性を示すが、核型は９番染色体に部分的三倍体性、４７,ＸＸ+９[９]／４６ＸＸ[６]を有するモザイク現象を示した。９番染色体はデノミネーターに使用して２１番染色体の染色体ドースを決定するため（表１０参照）、このことは全体ＮＣＶ値を低下させる。このサンプルにおいて胎児９番トリソミーを決定するのに正規化染色体を使用する能力は、以下の実施例７で得られる結果によって証明される。 In examining the algorithm, we observed that another interesting data point had an NCV between −5 and −6 on chromosome 21 for one sample from the test set (see FIG. 13). This sample shows diploidy on chromosome 21 by cell development, but the karyotype shows a mosaic phenomenon with partial triploidy on chromosome 9, 47, XX + 9 [9] / 46XX [6]. Indicated. Chromosome 9 is used as a denominator to determine the chromosomal dose of chromosome 21 (see Table 10), which reduces the overall NCV value. The ability to use normalized chromosomes to determine fetal trisomy 9 in this sample is demonstrated by the results obtained in Example 7 below.

この方法の感度に関するファン氏らの結論は、利用するアルゴリズムがシークエンシング方法によってもたらされる任意なランダム的又は系統的なバイアスを構成する場合のみ正確である。シークエンシングデータが適正に正規化されていない場合、結果としての解析は計数統計に劣る。チュー氏らは、最近の論文では、大量並列シークエンシング方法を使用する１８番及び１３番染色体の測定は不正確であると記述し、またこの方法をＴ１８及びＴ１３の決定に適用するにはより詳しいリサーチが必要であると結論付けた（Chiu et al., BMJ 342:c7401 [2011]）。チュー氏らの論文で使用される方法は、単に関心対象染色体（彼らのケースでは２１番染色体）における、シークエンシングランでタグ総数によって正規化された配列タグ数を使用するものである。このアプローチの課題は、各染色体におけるタグの分布は、シークエンシングラン毎に変動するおそれがあり、したがって、異数性決定に関する測定基準の全体変動を増大する点にある。チュー氏のアルゴリズムによる結果を本発明の実施例に使用した染色体ドースに比較するため、２１番及び１８番染色体のテストデータを、図１４に示すようなチュー氏らが推奨する方法を使用して、再解析した。全体的には、２１番及び１８番染色体それぞれのＮＣＶ範囲の比較、並びに異数性分類に対して４.０のＮＣＶ閾値を利用して本発明によるテストセットから正確に同定した１０／１３のＴ２１サンプル５／８のＴ１８サンプルの決定率低下を観測した。 Fan's conclusion regarding the sensitivity of this method is accurate only if the algorithm used constitutes any random or systematic bias introduced by the sequencing method. If the sequencing data is not properly normalized, the resulting analysis is inferior to counting statistics. Chu et al. Describe in a recent paper that the measurements of chromosomes 18 and 13 using the massively parallel sequencing method are inaccurate and are more amenable to applying this method to the determination of T18 and T13. We concluded that further research is necessary (Chiu et al., BMJ 342: c7401 [2011]). The method used in Chu et al. Simply uses the number of sequence tags in the chromosome of interest (chromosome 21 in their case) normalized by the total number of tags in the sequencing run. The challenge of this approach is that the distribution of tags on each chromosome can fluctuate from one sequencing run to another, thus increasing the overall variability of metrics for aneuploidy determination. In order to compare the results of Chu's algorithm to the chromosomal doses used in the examples of the present invention, the test data of chromosomes 21 and 18 were used using the method recommended by Chu and others as shown in FIG. Re-analyzed. Overall, a comparison of NCV ranges for chromosomes 21 and 18 and 10/13 accurately identified from the test set according to the present invention using an NCV threshold of 4.0 for aneuploidy classification. A decrease in the determination rate of the T18 sample of the T21 sample 5/8 was observed.

エーリッヒ氏らもＴ２１にのみ注目し、チュー氏らと同一のアルゴリズムを使用した（Ehrich et al., Am J Obstet Gynecol 204:205 el-e11 [2011]）。さらに、彼らのテストセットｚスコア測定基準における外部基準データ、すなわち、トレーニングセットからのずれを観測した後、彼らはテストセットを再トレーニングして、分類境界を規定した。原理的にはこのアプローチは実現可能であるが、実際的には、どの位多くのサンプルがトレーニングを必要とするか、及び分類境界が正確であるという確証を得るのにどの位の頻度で再トレーニングをする必要があるかを決定することに難題がある。この問題を軽減する１つの方法は、各シークエンシングラン毎に、ベースラインを測定し、また定量的な挙動を較正する制御を設けることである。 Erich et al. Only focused on T21 and used the same algorithm as Chu et al. (Ehrich et al., Am J Obstet Gynecol 204: 205 el-e11 [2011]). In addition, after observing the external reference data in their test set z-score metric, ie deviation from the training set, they retrained the test set to define the classification boundaries. In principle this approach is feasible, but in practice, how many samples need training and how often to re-establish confirmation that the classification boundary is accurate. There are challenges in deciding if you need to train. One way to alleviate this problem is to provide a control that measures the baseline and calibrates the quantitative behavior for each sequencing run.

本発明方法を使用して得られたデータは、大量並列シークエンシングは染色体カウントデータを正規化するアルゴリズムが最適化されるとき、妊婦の血漿から複数の胎児染色体異常を決定できることを示している。定量化のための本発明方法は、シークエンシングラン相互間のランダムな及び系統的な変動を最小化するだけでなく、全体ゲノムにわたる、異数性、とくに、Ｔ２１及びＴ１８を効果的に分類することができる。より多くのサンプル収集がＴ１３決定のためのアルゴリズムをテストするのに必要となる。この目的のため、本発明の診断精度をさらに実証する、有望な、手探りの、複数場所での臨床的研究を行っている。 Data obtained using the method of the present invention show that massively parallel sequencing can determine multiple fetal chromosomal abnormalities from the plasma of pregnant women when the algorithm that normalizes the chromosome count data is optimized. The method of the present invention for quantification not only minimizes random and systematic variation between sequencing runs, but also effectively classifies aneuploidy, particularly T21 and T18, across the entire genome. be able to. More sample collection is needed to test the algorithm for T13 determination. To this end, promising, fumbling, clinical research is being conducted at multiple locations to further demonstrate the diagnostic accuracy of the present invention.

実施例７
個別テストサンプルのすべての染色体における少なくとも５つの異なった染色体異数性の
有無決定
１組の母体テストサンプル（テストセット１；実施例６）それぞれにおける任意の染色体異数性の有無を決定する本発明方法の能力を実証するため、系統的に決定した正規化染色体配列を、トレーニングセット（トレーニングセット１;実施例６）の異変なしサンプルにおいて同定し、各テストサンプルにおけるすべての染色体の染色体ドースを計算するのに使用した。テストセット及びトレーニングセットの各サンプルにおける任意の１つ又はそれ以上の異なる完全胎児染色体異数性の有無決定は、各個別サンプルに対する単独シークエンシングランから得たシークエンシング情報によって行った。 Example 7
At least five different chromosomal aneuploidies on all chromosomes of an individual test sample
Existence Determination To demonstrate the ability of the method of the invention to determine the presence or absence of any chromosomal aneuploidy in each of a set of maternal test samples (Test Set 1; Example 6), systematically determined normalized chromosomal sequences are , Identified in the unaltered sample of the training set (Training Set 1; Example 6) and used to calculate the chromosomal dose of all chromosomes in each test sample. The presence or absence of any one or more different complete fetal chromosomal aneuploidies in each sample of the test set and training set was made by sequencing information obtained from a single sequencing run for each individual sample.

実施例６で説明したトレーニングセットの各サンプルにおける各染色体に対して同定された染色体密度、すなわち配列タグ数を使用して、単独染色体又は染色体グループよりなる系統的に決定した正規化染色体配列を、１〜２２染色体、Ｘ染色体及びＹ染色体それぞの単独染色体ドースを計算することによって決定した。１〜２２染色体、Ｘ染色体及びＹ染色体それぞれの系統的に決定した正規化染色体配列は、あり得るすべての染色体の組合せを分子として使用し、各染色体の染色体ドースを系統的に計算することによって決定した。例えば、関心対象染色体としての２１番染色体の染色体ドースは、（ｉ）２１番染色体（関心対象染色体）に得られた配列タグ数と、（ii）残りの染色体それぞれに得られた配列タグ数との比、及び残りの染色体（２１番染色体を除外する）、すなわち、１，２，３，４，５等から２０，２１，２２，Ｘ，及びＹにいたる染色体のあり得るすべての組合せ、例えば、1+2,1+3,1+4,1+5等から1+20,1+22,1+X,及び1+Y;,1+2+3,1+2+4,1+2+5等から1+2+20,1+2+22,1+2+X,及び1+2+Y;1+3+4,1+3+5,1+3+6等から1+3+20,1+3+22,1+3+X,及び1+3+Y;1+2+3+4,1+2+3+5,1+2+3+6等から1+2+3+20,1+2+3+22,1+2+3+X,及び1+2+3+Y等々に得られたタグ数の合計として計算し、染色体１〜２０番染色体、２２番染色体、Ｘ染色体及びＹ染色体のすべてのあり得る組合せを、正規化染色体配列（分子）として使用し、トレーニングセットにおける適格サンプルそれぞれにおける各関心対象染色体のあり得るすべての染色体ドースを決定した。トレーニングサンプルのすべてにおける染色体ドースは２１番染色体と同様にして決定し、２１番染色体に対する系統的に決定した正規化染色体配列は、すべてのトレーニングサンプルにわたり変動性が最も小さい２１番染色体ドースとなる、単独染色体又は染色体グループとして決定した。同一解析を繰り返し、１３番染色体、１８番染色体、Ｘ染色体及びＹ染色体を含む残りの染色体それぞれの系統的に決定した正規化染色体配列として供する単独染色体又は染色体の組合せを決定した、すなわち、あり得るすべての染色体の組合せを使用して、すべてのトレーニングサンプルにおける他のすべての関心対象染色体、１〜１２番染色体、１４〜１７番染色体、１９〜２０番染色体、２２番染色体、Ｘ染色体及びＹ染色体の正規化配列（単独染色体又は染色体グループ）を決定した。このようにして、すべての染色体を関心対象染色体として処理し、系統的に決定した正規化配列を、トレーニングセットの異変なしサンプルそれぞれにおけるすべての染色体それぞれに対して決定した。表１３は、１〜２２番染色体、Ｘ染色体及びＹ染色体それぞれの系統的に決定した正規化配列として同定した単独染色体又は染色体グループを示す。表１３から明らかなように、関心対象である幾つかの染色体に対しては、系統的に決定した正規化染色体配列は単独染色体であると決定し（例えば、４番染色体が関心対象染色体であるとき）、他の関心対象染色体に対しては、系統的に決定した正規化染色体配列は染色体グループであると決定した（例えば、２１番染色体が関心対象染色体であるとき）。 Using the chromosome density identified for each chromosome in each sample of the training set described in Example 6, i.e. the number of sequence tags, a systematically determined normalized chromosomal sequence consisting of a single chromosome or chromosome group, It was determined by calculating the single chromosome dose for chromosomes 1-22, X and Y. Systematically determined normalized chromosomal sequences for each of chromosomes 1-22, X and Y are determined by systematically calculating the chromosomal dose of each chromosome using all possible chromosome combinations as molecules. did. For example, the chromosome dose of chromosome 21 as the chromosome of interest includes (i) the number of sequence tags obtained for chromosome 21 (the chromosome of interest), and (ii) the number of sequence tags obtained for each of the remaining chromosomes. And the remaining chromosomes (excluding chromosome 21), ie all possible combinations of chromosomes from 1, 2, 3, 4, 5 etc. to 20, 21, 22, X, and Y, eg 1 + 2,1 + 3,1 + 4,1 + 5 etc. to 1 + 20,1 + 22,1 + X, and 1 + Y;, 1 + 2 + 3,1 + 2 + 4,1 + From 2 + 5 etc. to 1 + 2 + 20, 1 + 2 + 22, 1 + 2 + X, and 1 + 2 + Y; 1 + 3 + 4, 1 + 3 + 5, 1 + 3 + 6 etc. 1 from + 3 + 20, 1 + 3 + 22, 1 + 3 + X, and 1 + 3 + Y; 1 + 2 + 3 + 4, 1 + 2 + 3 + 5, 1 + 2 + 3 + 6 etc. Calculated as the sum of the number of tags obtained for + 2 + 3 + 20, 1 + 2 + 3 + 22, 1 + 2 + 3 + X, 1 + 2 + 3 + Y, etc., chromosomes 1-20 All possible combinations of chromosome 22, chromosome X, and chromosome Y are used as normalized chromosome sequences (molecules) and training sets All possible chromosomal doses for each chromosome of interest in each eligible sample in were determined. Chromosome doses in all of the training samples are determined in the same manner as chromosome 21, and the systematically determined normalized chromosome sequence for chromosome 21 is the chromosome 21 dose with the least variability across all training samples. Determined as single chromosome or chromosome group. The same analysis was repeated to determine a single chromosome or combination of chromosomes to serve as a systematically determined normalized chromosomal sequence for each of the remaining chromosomes including chromosomes 13, 18, X and Y. Using all chromosome combinations, all other chromosomes of interest in all training samples, chromosomes 1-12, chromosomes 14-17, chromosomes 19-20, chromosome 22, chromosome X and Y The normalization sequence (single chromosome or chromosome group) was determined. In this way, all chromosomes were treated as chromosomes of interest, and a systematically determined normalized sequence was determined for each of all chromosomes in each of the unaltered samples of the training set. Table 13 shows single chromosomes or chromosomal groups identified as systematically determined normalized sequences for chromosomes 1-22 and X and Y chromosomes, respectively. As is apparent from Table 13, for some chromosomes of interest, the systematically determined normalized chromosomal sequence is determined to be a single chromosome (eg, chromosome 4 is the chromosome of interest). For other chromosomes of interest, the systematically determined normalized chromosome sequence was determined to be a chromosome group (eg, when chromosome 21 is the chromosome of interest).

ずべての染色体それぞれに対して決定した系統的に決定した正規化染色体配列の平均、標準偏差（ＳＤ）、及び変動係数（ＣＶ）を表１４に示す。
^ａトリソミーを含む
^ｂ女児 Table 14 shows the average, standard deviation (SD), and coefficient of variation (CV) of systematically determined normalized chromosomal sequences determined for each of all chromosomes.
Includes ^a trisomy
^b girl child

ＣＶの値によって反映されたすべてのトレーニングサンプルにわたる染色体ドースの分散は、大きな信号対ノイズ比及びダイナミックレンジを得る系統的に決定した染色体配列の使用が、以下に示すように異数性決定を高い感度及び高い特異度で行うことができることを実証する。 The variance of chromosomal dose across all training samples as reflected by the CV value indicates that the use of systematically determined chromosomal sequences to obtain a large signal-to-noise ratio and dynamic range results in high aneuploidy determination as shown below Demonstrate that it can be done with sensitivity and high specificity.

本発明方法の感度及び特異度を実証するため、関心対象となる１〜２２番染色体、Ｘ染色体及びＹ染色体すべての染色体ドースを、トレーニングセットの各サンプルにおいて、また実施例５で説明したテストセットのすべてのサンプルそれぞれにおいて、上述の表１３に示した対応する系統的に決定した正規化染色体配列を使用して決定した。 In order to demonstrate the sensitivity and specificity of the method of the present invention, the chromosomal doses of chromosomes 1-22, X and Y of interest are tested in each sample of the training set and in the test set described in Example 5. In each of the samples, the corresponding systematically determined normalized chromosomal sequence shown in Table 13 above was determined.

各関心対象染色体の系統的に決定した正規化染色体配列を使用して、何らかの染色体異数性有無をトレーニングセットにおける各サンプルにおいて、及びテストサンプルそれぞれにおいて決定した、すなわち、各サンプルが１，２，３，４，５，６，７，８，９，１０，１１，１２，１３，１４，１５，１６，１７，１８，１９，２０，２１，２２番染色体、Ｘ染色体及びＹ染色体の完全胎児染色体異数性を含んでいるか否かを決定した。配列情報、すなわち、配列タグ数は、トレーニングセットにおける各サンプルにおける、及びテストサンプルそれぞれにおけるすべての染色体に関して取得し、トレーニングサンプル及びテストサンプルのそれぞれにおける各染色体の単独染色体ドースを、上述したように、トレーニングセットで決定したのに対応する系統的に決定した正規化染色体配列（表１３参照）に関して得た配列タグ数を使用して計算した。系統的に決定した正規化染色体配列用に各トレーニングサンプルにおいて得た配列タグ数を使用して、各トレーニングサンプルにおける各染色体の染色体ドースを決定し、また系統的に決定した正規化染色体配列用に各テストサンプルにおいて得た配列タグ数を使用して、各テストサンプルにおける各染色体の染色体ドースを決定した。安全で効果的な異数性分類を確実にするため、実施例６で説明したのと同一の控えめな境界を選択した。 Using the systematically determined normalized chromosomal sequence of each chromosome of interest, the presence or absence of any chromosomal aneuploidy was determined in each sample in the training set and in each test sample, ie, each sample is 1, 2, Complete fetuses of chromosomes 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 and chromosomes X and Y It was determined whether or not it contained chromosomal aneuploidy. Sequence information, i.e., sequence tag number, is obtained for all chromosomes in each sample in the training set and in each test sample, and the single chromosomal dose for each chromosome in each of the training and test samples, as described above. Calculations were made using the number of sequence tags obtained for systematically determined normalized chromosomal sequences (see Table 13) corresponding to those determined in the training set. Using the number of sequence tags obtained in each training sample for a systematically determined normalized chromosomal sequence, determine the chromosomal dose of each chromosome in each training sample, and for a systematically determined normalized chromosomal sequence The number of sequence tags obtained in each test sample was used to determine the chromosomal dose of each chromosome in each test sample. The same conservative boundaries as described in Example 6 were chosen to ensure a safe and effective aneuploidy classification.

トレーニングセット結果
系統的に決定した正規化染色体配列を使用してのトレーニングセットのサンプルにおける２１番、１８番及び１３番染色体の染色体ドースのプロットを図１５に示す。系統的に決定した正規化染色体配列、すなわち、4+14+16+20+22番の染色体グループを使用するとき、Ｔ２１を示す臨床的核型を有する８個のサンプルは５.４〜２１.５の間のＮＣＶを有していた。系統的に決定した正規化染色体配列、すなわち、2+3+5+7番の染色体グループを使用するとき、Ｔ１８を示す臨床的核型を有する４個のサンプルは３.３〜１５.３の間のＮＣＶを有していた。系統的に決定した正規化染色体配列、すなわち、4+5番の染色体グループを使用するとき、Ｔ１３を示す臨床的核型を有する２個のサンプルは８.０及び１２.４のＮＣＶを有していた。トレーニングセットにおけるＴ２１を有するサンプルは、２１番染色体データの最後の８サンプル（○）として示し、トレーニングセットにおけるＴ１８を有するサンプルは、１８番染色体データの最後の４サンプル（△）として示し、トレーニングセットにおけるＴ１３を有するサンプルは、１３番染色体データの最後の２サンプル（□）として示す。 Training Set Results Plots of chromosome doses for chromosomes 21, 18, and 13 in training set samples using systematically determined normalized chromosome sequences are shown in FIG. When using a systematically determined normalized chromosomal sequence, i.e. chromosome group 4 + 14 + 16 + 20 + 22, 8 samples with clinical karyotype showing T21 are 5.4-21. It had an NCV between 5. When using a systematically determined normalized chromosomal sequence, ie chromosome group 2 + 3 + 5 + 7, four samples with clinical karyotypes showing T18 are between 3.3 and 15.3. Had an NCV in between. When using a systematically determined normalized chromosomal sequence, ie chromosome group 4 + 5, two samples with clinical karyotype showing T13 have an NCV of 8.0 and 12.4 It was. Samples with T21 in the training set are shown as the last 8 samples (◯) of chromosome 21 data, samples with T18 in the training set are shown as the last 4 samples (Δ) of chromosome 18 data, and training set Samples having T13 in are shown as the last two samples (□) of chromosome 13 data.

これらデータは、正規化染色体配列を使用して異なる完全胎児染色体異常を高い確度で決定し、また正確に分類できることを示している。異変あり核型を有するすべてのサンプルは３より大きいＮＣＶを有するため、これらサンプルが異変なしの分布の一部である確率は約０．１％よりも低い。 These data show that normalized chromosomal sequences can be used to determine and accurately classify different complete fetal chromosomal abnormalities. Since all samples with anomalous karyotypes have an NCV greater than 3, the probability that these samples are part of a distribution without anomalies is less than about 0.1%.

常染色体と同様に、Ｘ染色体用の系統的に決定した正規化染色体配列（すなわち、4+8番染色体）を使用するとき、またＹ染色体用の系統的に決定した正規化染色体配列（すなわち、4+6番染色体）を使用するとき、トレーニングセットにおけるすべての男児及び女児の胎児が正確に同定された。さらに、Ｘモノソミーの５サンプルすべてを同定した。図１８Ａは、トレーニングセットにおける各サンプルそれぞれにおける、Ｘ染色体に対して決定したＮＣＶ（Ｘ軸）、及びＹ染色体に対して決定したＮＣＶ（Ｙ軸）をプロットしたものを示す。核型がＸモノソミーであるサンプルすべては、−４.８３未満のＮＣＶ値を有する。４５,Ｘ核型（フル又はモザイク）に一致する核型を有するそれらＸモノソミーのサンプルは、予想どおりゼロに近いＹのＮＣＶを有する。女児サンプルはＸ及びＹ双方ともＮＣＶ＝０の周りに集まる。 As with autosomes, when using a systematically determined normalized chromosomal sequence for the X chromosome (ie, chromosomes 4 + 8), and also a systematically determined normalized chromosomal sequence for the Y chromosome (ie, All boys and girls fetuses in the training set were correctly identified when using chromosomes 4 + 6. In addition, all 5 samples of X monosomy were identified. FIG. 18A shows a plot of NCV determined for the X chromosome (X axis) and NCV determined for the Y chromosome (Y axis) for each sample in the training set. All samples whose karyotype is X monosomy have NCV values less than -4.83. Those X monosomy samples with karyotypes matching the 45, X karyotype (full or mosaic) have a NCV of Y close to zero as expected. The girl samples are collected around NCV = 0 for both X and Y.

テストセット結果
関連の系統的に決定した正規化染色体配列を使用するテストサンプルにおける２１番、１８番及び１３番染色体の染色体ドースのプロットを図１６に示す。系統的に決定した正規化染色体配列（すなわち、4+14+16+20+22番の染色体グループ）を使用するとき、Ｔ２１を示す臨床的核型を有する１３個のサンプルのうち１３個は７.２〜１６.３の間のＮＣＶで同定された。系統的に決定した正規化染色体配列（すなわち、2+3+5+7番の染色体グループを使用するとき）、Ｔ１８を示す臨床的核型を有する８個のサンプルすべては１２.７〜３０.７の間のＮＣＶで同定された。系統的に決定した正規化染色体配列（すなわち、4+5番の染色体グループ）を使用するとき、Ｔ１３を示す臨床的核型を有する１個のみのサンプルは８.６のＮＣＶで同定された。テストセットにおけるＴ２１を有するサンプルは、２１番染色体データの最後の１３サンプル（○）として示し、テストセットにおけるＴ１８を有するサンプルは、１８番染色体データの最後の８サンプル（△）として示し、テストセットにおけるＴ１３を有するサンプルは、１３番染色体データの最後の１サンプル（□）として示す。 Test Set Results Plots of chromosome doses for chromosomes 21, 18, and 13 in test samples using relevant systematically determined normalized chromosome sequences are shown in FIG. When using a systematically determined normalized chromosomal sequence (ie, chromosome group 4 + 14 + 16 + 20 + 22), 13 out of 13 samples with clinical karyotype showing T21 are 7 Identified with an NCV between .2 and 16.3. Systematically determined normalized chromosomal sequence (ie when using chromosome group 2 + 3 + 5 + 7), all 8 samples with clinical karyotype showing T18 are 12.7-30. Identified with an NCV between 7. When using a systematically determined normalized chromosomal sequence (ie chromosome group 4 + 5), only one sample with a clinical karyotype exhibiting T13 was identified with an NCV of 8.6. The sample with T21 in the test set is shown as the last 13 samples (◯) of the chromosome 21 data, the sample with T18 in the test set is shown as the last 8 samples (Δ) of the chromosome 18 data, and the test set The sample having T13 in is shown as the last sample (□) of chromosome 13 data.

これらデータは、系統的に決定した正規化染色体配列を使用して異なる完全胎児染色体異常を高い確度で決定し、また正確に分類できることを示している。トレーニングセットと同様に、異変あり核型を有するすべてのサンプルは７より大きいＮＣＶを有し、このことはこれらサンプルが異変なしの分布の一部である確率は無限小的に低いことを示す（図１６参照）。 These data show that systematically determined normalized chromosomal sequences can be used to accurately determine and accurately classify different complete fetal chromosomal abnormalities. Similar to the training set, all samples with anomalous karyotypes have NCVs greater than 7, indicating that the probability that these samples are part of a distribution without anomalies is infinitely low ( (See FIG. 16).

常染色体と同様に、Ｘ染色体用の系統的に決定した正規化染色体配列（すなわち、4+8番染色体）を使用するとき、またＹ染色体用の系統的に決定した正規化染色体配列（すなわち、4+6番染色体）を使用するとき、テストセットにおけるすべての男児及び女児の胎児が正確に同定された。さらに、Ｘモノソミーの３サンプルすべてを同定した。図１８Ｂは、テストセットにおける各サンプルにおける、Ｘ染色体に対して決定したＮＣＶ（Ｘ軸）、及びＹ染色体に対して決定したＮＣＶ（Ｙ軸）をプロットしたものを示す。 As with autosomes, when using a systematically determined normalized chromosomal sequence for the X chromosome (ie, chromosomes 4 + 8), and also a systematically determined normalized chromosomal sequence for the Y chromosome (ie, When using chromosomes 4 + 6), all male and female fetuses in the test set were correctly identified. In addition, all three samples of X monosomy were identified. FIG. 18B shows a plot of NCV determined for the X chromosome (X axis) and NCV determined for the Y chromosome (Y axis) in each sample in the test set.

上述したように、本発明方法によれば、各サンプルにおける１〜２２番染色体、Ｘ染色体及びＹ染色体それぞれの完全又は部分的な染色体異数性の有無を決定することができる。Ｔ１３，Ｔ１８，Ｔ２１及びＸモノソミーの完全染色体異数性を決定する他に、本発明方法はテストサンプルのうち１つのサンプルで９番染色体トリソミーの存在を決定した。系統的に決定した正規化染色体配列（すなわち、3+4+8+10+17+19+20+22番染色体グループ）を使用するとき、関心対象の９番染色体に対して、１４.４のＮＣＶを有するサンプルを同定した（図１７参照）。このサンプルは、２１番染色体（実施例６ではこの２１番染色体に対して９番染色体が正規化染色体配列として使用された）のドースが異常に低く計算されることで９番染色体の異数性が疑われた実施例６のテストサンプルに対応するものであった。 As described above, according to the method of the present invention, it is possible to determine the presence or absence of complete or partial chromosomal aneuploidy of each of chromosomes 1 to 22, chromosome X and chromosome Y in each sample. In addition to determining the complete chromosomal aneuploidy of T13, T18, T21 and X monosomy, the method of the present invention determined the presence of chromosome 9 trisomy in one of the test samples. When using a systematically determined normalized chromosomal sequence (ie 3 + 4 + 8 + 10 + 17 + 19 + 20 + 22 chromosome group), for chromosome 9 of interest, 14.4 A sample with NCV was identified (see FIG. 17). In this sample, the aneuploidy of chromosome 9 is calculated by calculating the dose of chromosome 21 (in Example 6, chromosome 9 was used as a normalized chromosome sequence for chromosome 21) to be abnormally low. Corresponded to the test sample of Example 6 which was suspected.

このデータは、Ｔ２１，Ｔ１３，Ｔ１８，Ｔ９及びＸモノソミーを示す臨床的核型を有するサンプルの１００％が正確に同定されたことを示す。図１９は、４７個のテストサンプルそれぞれにおける１〜２２番染色体それぞれのＮＣＶのプロットを示す。ＮＣＶの中央値はゼロに正規化した。データは、本発明方法（系統的に決定した正規化染色体配列の使用を含む）が、このテストセットに存在した５タイプの染色体異数性すべての存在を１００％の感度及び１００％の特異度で決定したことを示し、また本発明方法は、いかなるサンプルにおいても、１〜２２番染色体、Ｘ染色体及びＹ染色体のうち任意の１つに関する任意の完全染色体異数性を同定できることを明らかに示している。 This data indicates that 100% of samples with clinical karyotypes exhibiting T21, T13, T18, T9 and X monosomy were correctly identified. FIG. 19 shows NCV plots for chromosomes 1-22 in each of the 47 test samples. The median value of NCV was normalized to zero. The data show that the method of the present invention (including the use of systematically determined normalized chromosomal sequences) shows 100% sensitivity and 100% specificity for the presence of all 5 types of chromosomal aneuploidy present in this test set. And the method of the present invention clearly shows that in any sample, any complete chromosomal aneuploidy with respect to any one of chromosomes 1-22, X and Y can be identified. ing.

実施例８
部分的胎児染色体異数性の有無決定：ネコ眼症候群の決定
ディジョージ症候群（２２ｑ１１.２欠失症候群）、すなわち、２２番染色体における欠失で生ずる障害は、幾つかの身体系統における発育不全を生ずる結果となる。ディジョージ症候群に共通して関連する内科的疾患としては、心臓疾患、免疫系の機能不全、口蓋裂、副甲状腺機能不全及び行動障害がある。ディジョージ症候群に関連する障害の数及び重篤度は大きく変動する。ディジョージ症候群を有するほとんどすべてのヒトは様々な分野の専門家からの治療を必要とする。 Example 8
Determining the presence or absence of partial fetal chromosomal aneuploidy: Determining the feline eye syndrome DiGeorge syndrome (22q11.2 deletion syndrome), a disorder caused by a deletion in chromosome 22, can lead to stunting in some body systems. Will result. Medical diseases commonly associated with DiGeorge syndrome include heart disease, immune system dysfunction, cleft palate, parathyroid dysfunction and behavioral disorders. The number and severity of disorders associated with DiGeorge syndrome vary widely. Almost all humans with DiGeorge syndrome require treatment from experts in various fields.

胎児における２２番染色体の部分的欠失有無を決定するため、血液サンプルを母親の静脈穿刺によって採取し、上述の実施例で説明したようにｃｆＤＮＡを調製する。精製したｃｆＤＮＡをアダプタに結合し、イルミナ社のｃＢｏｔクラスタステーションを使用してクラスタ増幅する。大量並列シークエンシングを、可逆色素ターミネーターを使用して行い、３６ｂｐリードを数１００万生成する。配列リードをヒトｈｇ１９基準ゲノムに整列させ、基準ゲノムに一意的にマッピングされたリードをタグとしてカウントする。 To determine the presence or absence of a partial deletion of chromosome 22 in the fetus, a blood sample is taken by maternal venipuncture and cfDNA is prepared as described in the above examples. Purified cfDNA is bound to an adapter and cluster amplified using an Illumina cBot cluster station. Mass parallel sequencing is performed using a reversible dye terminator, producing millions of 36 bp reads. The sequence reads are aligned with the human hg19 reference genome and the reads uniquely mapped to the reference genome are counted as tags.

すべて２２番染色体の２倍体が既知である１組の適格サンプルのセット、すなわち、２２番染色体又は２２番染色体のいかなる部分も２倍体状態でのみ存在することが既知である適格サンプルセットを先ずシークエンシング及び解析し、３メガベース（ＭＢ）の１０００断片（領域２２ｑ１１.２は除外する）それぞれの配列タグ数を得る。ヒトゲノムが約３０憶個の塩基（３Ｇｂ）を有すると仮定すると、３Ｍｂの１０００断片それぞれはゲノムの残りをほぼ構成する。１０００断片それぞれは、関心対象断片、すなわち、２２ｑ１１.２の３Ｍｂ領域の正規化断片配列を決定するのに個別に又は断片配列グループとして、使用される。単独の１０００ｂｐ断片にそれぞれにマッピングされる配列タグ数を個別に使用して、２２ｑ１１.２の３Ｍｂ領域の断片ドースを計算する。さらに、２個以上の断片のあり得るすべての組合せを使用してすべての適格サンプルにおける関心対象断片の断片ドースを決定する。サンプルにわたり最小の変動性を有する断片となる単独３Ｍｂ断片又は２個以上の３Ｍｂ断片の組合せを、正規化断片配列として選択する。 A set of eligible samples, all known to be chromosome 22 diploids, ie a sample set of chromosomes 22 or any portion of chromosome 22 known to exist only in a diploid state. First, sequencing and analysis are performed to obtain the number of sequence tags for each of 3 megabase (MB) 1000 fragments (excluding region 22q11.2). Assuming that the human genome has about 30 billion bases (3 Gb), each of the 3 Mb 1000 fragments almost constitutes the rest of the genome. Each of the 1000 fragments is used individually or as a fragment sequence group to determine the fragment of interest, ie, the normalized fragment sequence of the 3Mb region of 22q11.2. Using the number of sequence tags individually mapped to a single 1000 bp fragment, calculate the fragment dose of the 22 Mb 11.2 3 Mb region. In addition, all possible combinations of two or more fragments are used to determine the fragment dose of the fragment of interest in all eligible samples. A single 3Mb fragment or a combination of two or more 3Mb fragments that results in a fragment with minimal variability across the sample is selected as the normalized fragment sequence.

各適格サンプルにおける関心対象断片にマッピングされた配列タグ数を使用して、各適格サンプルにおける断片ドースを決定する。すべての適格サンプルにおける断片ドースの平均及び標準偏差を計算し、またテストサンプルで決定した断片ドースと比較する上での閾値を設定するのに使用する。好適には、正規化断片値（ＮＳＶ）をすべての適格サンプルにおけるすべての関心対象断片に対して計算し、閾値設定に使用する。 The number of sequence tags mapped to the fragment of interest in each eligible sample is used to determine the fragment dose in each eligible sample. The mean and standard deviation of the fragment doses in all eligible samples are calculated and used to set a threshold for comparison with the fragment doses determined in the test sample. Preferably, normalized fragment values (NSV) are calculated for all fragments of interest in all eligible samples and used for threshold setting.

これに続いて、対応のテストサンプルにおける正規化断片配列にマッピングしたタグ数を使用してテストサンプルにおける関心対象断片のドースを決定する。正規化断片値（ＮＳＶ）を、上述したように、テストサンプルの断片に対して計算し、テストサンプルにおける関心対象断片のＮＣＶを、適格サンプルを使用して決定した閾値と比較し、テストサンプルにおける２２ｑ１１.２の欠失有無を決定する。 Following this, the number of tags mapped to the normalized fragment sequence in the corresponding test sample is used to determine the dose of the fragment of interest in the test sample. A normalized fragment value (NSV) is calculated for the test sample fragment, as described above, and the NCV of the fragment of interest in the test sample is compared to a threshold determined using a qualified sample, The presence or absence of 22q11.2 deletion is determined.

テストＮＣＶ＜-３は、関心対象断片の喪失、すなわち、２２番染色体の部分的（２２ｑ１１.２の）欠失が検査サンプルに存在することを示す。 A test NCV <−3 indicates that a loss of the fragment of interest, ie a partial (22q11.2) deletion of chromosome 22, is present in the test sample.

実施例９
ステージII大腸がん患者の転帰予測の便ＤＮＡ検査
すべてのステージII大腸がん患者の約３０％は再発し、がん疾患で死に至る。再発を起こした患者のステージII大腸がんは、４，５，１５ｑ、１７ｑ及び１８ｑ番染色体に多くの喪失を示した。とくに、ステージII大腸がん患者の４ｑ２２.１〜４ｑ３５.２における喪失は、より悪い転帰を示した。これらゲノム変化の有無決定は、補助（アジュバント）治療を患者が選択する上での支援となり得る（Brosens et al., Analytical Cellular Pathology/Cellular Oncology 33:95-104 [2010]参照）。 Example 9
Fecal DNA testing to predict outcome in patients with stage II colorectal cancer Approximately 30% of all stage II colorectal cancer patients will relapse and die of cancer disease. Stage II colorectal cancer in patients with relapses showed much loss on chromosomes 4, 5, 15q, 17q and 18q. In particular, the loss at 4q22.1-4q35.2 of patients with stage II colorectal cancer showed worse outcomes. Determining the presence or absence of these genomic alterations can assist patients in choosing adjuvant (adjuvant) therapy (see Brosens et al., Analytical Cellular Pathology / Cellular Oncology 33: 95-104 [2010]).

ステージII大腸がんを有する患者の４ｑ２２.１〜４ｑ３５.２の領域における１つ以上の染色体欠失有無を決定するため、便及び／又は血漿サンプルを患者から採取する。便ＤＮＡは、刊行物（Chen et al., J Natl Cancer Inst 97:1124-1132 {2005]）に記載の方法に従って調製し、また血漿ＤＮＡを上述の実施例で説明した方法に従って調製した。ＤＮＡを本明細書で説明したＮＧＳに従ってシークエンシングし、また患者サンプルの配列情報を使用して４ｑ２２.１〜４ｑ３５.２の領域にわたる１つ以上の断片の断片ドースを計算する。断片ドースは、適格便及び／又は血漿サンプルのセットにおいてそれぞれ事前に決定する正規化断片配列を使用して決定する。検査サンプル（患者サンプル）における断片ドースを計算し、４ｑ２２.１〜４ｑ３５.２の領域内での１つ又はそれ以上の部分的染色体欠失有無は、関心対象断片それぞれのＮＳＶを、適格サンプルセットにおけるＮＳＶからの閾値と比較することによって決定する。 To determine the presence or absence of one or more chromosomal deletions in the region of 4q22.1-4q35.2 of patients with stage II colorectal cancer, stool and / or plasma samples are collected from the patients. Fecal DNA was prepared according to the method described in the publication (Chen et al., J Natl Cancer Inst 97: 1124-1132 [2005]), and plasma DNA was prepared according to the method described in the above examples. DNA is sequenced according to the NGS described herein, and the patient sample sequence information is used to calculate the fragment dose of one or more fragments spanning the region 4q22.1-4q35.2. Fragment doses are determined using normalized fragment sequences that are predetermined in each set of eligible stool and / or plasma samples. The fragment dose in the test sample (patient sample) is calculated, and the presence or absence of one or more partial chromosomal deletions in the region of 4q22.1-4q35.2 determines the NSV of each fragment of interest as the eligible sample set By comparing with the threshold from NSV.

本発明の好適な実施形態を本明細書において示しまた説明したが、このようなすべての実施形態は単なる例であることは当業者には明らかであろう。当業者にとっては、多くの変更、改変及び代用を、本発明から逸脱することなく行うことができるであろう。本明細書に記載した実施形態の種々の代替実施形態を、本発明の実施に使用できる。特許請求の範囲の請求項が本発明の範囲を規定し、また特許請求の範囲内の方法及び構成及び均等物も本発明によってカバーされることを意図する。 While preferred embodiments of the present invention have been shown and described herein, it will be apparent to those skilled in the art that all such embodiments are merely examples. Many alterations, modifications and substitutions will occur to those skilled in the art without departing from the invention. Various alternative embodiments of the embodiments described herein can be used to practice the present invention. It is intended that the following claims define the scope of the invention and that methods, configurations, and equivalents within the scope of the claims be covered by the present invention.

Claims

In a method for determining the presence or absence of any four or more different complete fetal chromosomal aneuploidies in each of any four or more chromosomes of interest in a maternal test sample comprising fetal and maternal nucleic acids,
(a) obtaining sequence information regarding fetal and maternal nucleic acids in the maternal test sample using next generation sequencing (NGS);
(b) using the sequence information, identifying the number of sequence tags of each of the arbitrary four or more chromosomes of interest selected from chromosomes 1 to 22, the X chromosome and the Y chromosome; Identifying a normalized chromosome sequence tag number for each of the four or more chromosomes of interest, wherein the normalized chromosome of each of the four or more chromosomes of interest comprises normal chromosomes of interest Identified using sequence information of eligible samples taken from specimens known to be composed of cells with copy number, and:
(I) exhibit a variation in the number of sequence tags mapped to normalized chromosomes that most closely approximates the variation in the number of sequence tags mapped to the chromosome of interest;
And / or (ii) distribution of chromosomal doses of the chromosome of interest in the qualifying sample, statistically identified based on chromosomal dose variation and dose distribution with the chromosome of interest in the qualifying sample And the largest statistical difference between the distribution of chromosomal doses of the chromosome of interest in the test sample,
When,
(c) using the number of sequence tags identified for each of the any four or more chromosomes of interest and the number of sequence tags identified for the normalized chromosome, Calculating a single chromosomal dose for each of the above chromosomes of interest; and
(d) comparing each of the single chromosomal doses of each of the four or more chromosomes of interest with a threshold of each of the four or more chromosomes of interest, thereby providing any four in the maternal test sample Determining the presence or absence of different complete fetal chromosomal aneuploidies.

The method according to claim 1, wherein the step (c) includes identifying the single chromosome dose of each chromosome of interest as the number of sequence tags identified for each of the chromosomes of interest, and the normality of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the chromosomal sequence.

3. The method of claim 1 or 2, wherein step (c) comprises
(i) calculating the sequence tag density ratio for each chromosome of interest by associating the number of sequence tags identified for each chromosome of interest in step (b) with the length of each chromosome of interest Step to do,
(ii) calculating the sequence tag density ratio of each normalized chromosome by associating the number of sequence tags identified for the normalized chromosome sequence in step (b) with the length of each normalized chromosome Steps to perform, and
(iii) calculating a single chromosomal dose for each chromosome of interest using the sequence tag density ratio calculated in steps (i) and (ii), wherein the chromosomal dose is the subject of interest Calculating the ratio of the sequence tag density ratio in the chromosome and the sequence tag density ratio in the normalized chromosome sequence of each of the chromosomes of interest.

The method according to any one of claims 1 to 3, wherein the arbitrary four or more chromosomes of interest selected from chromosomes 1-22 and X and Y are chromosomes 1-22, A method comprising at least 20 chromosomes selected from the X and Y chromosomes and determining the presence or absence of at least 20 different complete fetal chromosomal aneuploidies.

The method according to any one of claims 1 to 3, wherein the arbitrary four or more chromosomes of interest selected from chromosomes 1-22 and X and Y are chromosomes 1-22, A method for determining the presence or absence of all different complete fetal chromosomal aneuploidies of chromosomes 1 to 22, X chromosome and Y chromosome as all X chromosome and Y chromosome.

The method according to any one of claims 1 to 5, wherein the normalized chromosome sequence is a single chromosome selected from chromosomes 1 to 22, the X chromosome, and the Y chromosome.

The method according to any one of claims 1 to 5, wherein the normalized chromosome sequence is a chromosome group selected from chromosomes 1-22 and X and Y chromosomes.

In a method for determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies in any one or more chromosomes of interest in a maternal test sample comprising fetal and maternal nucleic acids,
(a) obtaining sequence information regarding fetal and maternal nucleic acids in the maternal test sample using next generation sequencing (NGS);
(b) Using the sequence information, identify the number of sequence tags for each of any one or more chromosomes of interest selected from chromosomes 1 to 22, the X chromosome, and the Y chromosome; Identifying the number of sequence tags of the normalized fragment sequence of each of the one or more chromosomes of interest, wherein the normalized fragment sequence of each of the one or more chromosomes of interest normalizes the chromosome of interest Identified using sequence information of eligible samples taken from specimens known to be composed of cells with a large copy number and:
(I) exhibit a variation in the number of sequence tags mapped to a normalized fragment sequence that most closely approximates a variation in the number of sequence tags that map to a chromosome of interest;
And / or (ii) distribution of chromosomal doses of the chromosome of interest in the qualifying sample, statistically identified based on chromosomal dose variation and dose distribution with the chromosome of interest in the qualifying sample And the largest statistical difference between the distribution of chromosomal doses of the chromosome of interest in the test sample,
When,
(c) using the number of sequence tags identified for each of the one or more chromosomes of interest and the number of sequence tags identified for the normalized fragment sequences, Calculating a single chromosomal dose for each of the one or more chromosomes of interest; and
(d) comparing each single chromosomal dose of each of the any one or more chromosomes of interest with a threshold of each of the any one or more chromosomes of interest, thereby determining in the maternal test sample Determining the presence or absence of any one or more different complete fetal chromosomal aneuploidies.

9. The method according to claim 8, wherein the step (c) includes identifying the single chromosome dose of each chromosome of interest as the number of sequence tags identified for each of the chromosomes of interest, and the regularity of each of the chromosomes of interest. Calculating as a ratio to the number of sequence tags identified for the fragmented sequence.

10. The method of claim 8 or 9, wherein the one or more chromosomes of interest selected from chromosomes 1-22 and X and Y are chromosomes 1-22, X and Y. A method comprising at least 20 selected chromosomes and determining the presence or absence of at least 20 different complete fetal chromosomal aneuploidies.

10. The method of claim 8 or 9, wherein said one or more chromosomes of interest selected from chromosomes 1-22 and X and Y are chromosomes 1-22, X and Y. A method for determining the presence or absence of all different complete fetal chromosome aneuploidies of chromosomes 1-22, X and Y.

12. The method according to any one of claims 1 to 11, wherein the different complete fetal chromosomal aneuploidy is selected from complete chromosomal trisomy, complete chromosomal monosomy, and complete chromosomal polysomy.

13. The method according to any one of claims 1 to 12, wherein the different complete fetal chromosomal aneuploidies are: trisomy No. 2, trisomy No. 8, trisomy No. 9, trisomy No. 21, trisomy No. 13, trisomy No. 16 , 18 trisomy, 22 trisomy, 47, XXY, 47, XXX, 47, XYY, and X monosomy.

14. The method according to any one of claims 1 to 13, wherein steps (a) to (d) are repeated for test samples from different maternal specimens, the method comprising: Determining the presence or absence of four or more different complete fetal chromosomal aneuploidies.

15. The method according to any one of claims 1 to 14, further comprising a NCV calculation step of calculating a normalized chromosome value (NCV), wherein the NCV is expressed by the following equation: Let chromosomal dose be the value that correlates to the average of the corresponding chromosomal dose in the eligible sample set,
here,

Are the estimated mean and standard deviation for each chromosome j dose in the eligible sample set, and x _ij is the observed chromosome j dose in test sample i, the NCV calculation step.

In a method for determining the presence or absence of different partial fetal chromosomal aneuploidies of one or more fragments of any one or more chromosomes of interest in a maternal test sample comprising fetal and maternal nucleic acids,
(a) obtaining sequence information regarding fetal and maternal nucleic acids in the maternal test sample using next generation sequencing (NGS);
(b) a sequence tag for each of any one or more fragments in any one or more chromosomes of interest selected from chromosomes 1-22, X and Y using the sequence information; Identifying a number and identifying a sequence tag number of the normalized fragment sequence of each of any one or more fragments in the any one or more chromosomes of interest, wherein the one or The normalized fragment sequence of each of one or more fragments of a further chromosome of interest is obtained from a qualified sample taken from a specimen known to be composed of cells having a normal copy number of the chromosome of interest. Identified using sequence information and:
(I) exhibit a variation in the number of sequence tags mapped to a normalized fragment sequence that most closely approximates a variation in the number of sequence tags that map to one or more fragments of one or more chromosomes of interest ,
And / or (ii) one or more objects of interest in a qualifying sample that are statistically identified based on chromosomal dose variation and dose distribution with a chromosome of interest in a qualifying sample The largest statistic between the distribution of chromosomal doses of one or more fragments of chromosomes and the distribution of chromosomal doses of one or more fragments of one or more chromosomes of interest in the test sample To provide a scientific difference,
When,
(c) the sequence tag number identified for each of any one or more fragments in the any one or more chromosomes of interest, and the sequence tag identified for the normalized fragment sequence Calculating a single chromosomal dose for each of any one or more fragments in the any one or more chromosomes of interest using a number; and
(d) each of the single fragment doses of any one or more fragments in any one or more of the chromosomes of interest is replaced with any one of the one or more of the chromosomes of interest. Comparing to a threshold of each of one or more chromosomal fragments, thereby determining the presence or absence of any one or more different partial fetal chromosomal aneuploidies in the maternal test sample.

17. The method of claim 16, wherein step (c) comprises calculating a single fragment dose for each of any one or more chromosomal fragments in the chromosome of interest by any one or more of the chromosomes of interest. Calculated as the ratio between the number of sequence tags identified for each chromosome fragment and the number of sequence tags identified for each normalized fragment sequence of any one or more chromosome fragments in the chromosome of interest A method comprising the steps of:

18. The method according to claim 16 or 17, further comprising an NSV calculation step of calculating a normalized segment value (NSV), wherein the NSV is obtained by converting the fragment dose into a qualified sample as follows: The value associated with the average of the corresponding fragment doses in the set,

here,

Are the estimated mean and standard deviation for the jth fragment dose in the qualifying sample set, respectively, and x _ij is the observed jth fragment dose in test sample i.

19. The method according to any one of claims 16-18, wherein the different partial fetal chromosomal aneuploidy is selected from partial duplication, partial growth, partial insertion and partial deletion. .

The method according to any one of claims 16 to 19, wherein the different partial fetal chromosomal aneuploidy is a partial monosomy of chromosome 1, a partial monosomy of chromosome 4, a partial monosomy of chromosome 5, A method comprising selecting from a partial monosomy of chromosome 7, a partial monosomy of chromosome 11, a partial monosomy of chromosome 15, a partial monosomy of chromosome 17, a partial monosomy of chromosome 18, and a partial monosomy of chromosome 22.

21. The method according to any one of claims 16 to 20, wherein steps (a) to (d) are repeated for test samples from different maternal specimens, the method being different for each of the test samples. A method for determining the presence or absence of partial fetal chromosomal aneuploidy.

The method according to any one of claims 8 to 21, wherein the normalized fragment sequence is a single fragment in any one or more of chromosomes 1-22 and X and Y. .

The method according to any one of claims 8 to 21, wherein the normalized fragment sequence is a fragment group in any one or more of chromosomes 1-22 and X and Y. .

24. The method according to any one of claims 1 to 23, wherein said step (a) comprises sequencing at least a portion of said nucleic acid in said test sample, said sequence relating to said fetal and maternal nucleic acids in said test sample. How to get information.

25. The method according to any one of claims 1 to 24, wherein the test sample is a maternal sample selected from blood, plasma, serum, urine and saliva samples.

26. The method according to any one of claims 1 to 25, wherein the nucleic acid is a mixture of fetal and maternal cell-free DNA molecules.

27. The method according to any one of claims 1-26, wherein the next generation sequencing is massively parallel sequencing using sequencing by synthesis with a reversible dye terminator.

28. The method according to any one of claims 1 to 27, wherein the next generation sequencing is sequencing by ligation.

29. The method according to any one of claims 1-28, wherein the next generation sequencing comprises amplification.

30. The method according to any one of claims 1-29, wherein the next generation sequencing is a single molecule sequencing.

The method according to any one of claims 1 to 7, wherein the four or more chromosomes of interest are chromosome 13, chromosome 18, chromosome 21, and chromosome X,
(I) the normalized chromosome of chromosome 13 is at least one of chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6 and chromosome 8,
(Ii) the normalized chromosome of chromosome 18 is at least one of chromosome 8, chromosome 2, chromosome 3, chromosome 5, chromosome 6, chromosome 12, and chromosome 14.
(Iii) The normalized chromosome of chromosome 21 is at least one of chromosome 9, chromosome 1, chromosome 2, chromosome 11, chromosome 12, and chromosome 14.
(Iv) The method, wherein the normalized chromosome of the X chromosome is at least one of chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, and chromosome 8.

Method according to any one of claims 8-15, wherein the object of interest chromosomes, chromosome 13, chromosome 18, chromosome 21, and four or more interest from the X chromosome Ru is selected A chromosome ,
(I) the normalized chromosome of chromosome 13 is selected from at least one of chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6 and chromosome 8;
(Ii) the normalized chromosome of chromosome 18 is selected from at least one of chromosome 8, chromosome 3, chromosome 5, chromosome 6, chromosome 12, chromosome 14, and chromosome 14.
(Iii) the normalized chromosome of chromosome 21 is selected from at least one of chromosome 9, chromosome 1, chromosome 2, chromosome 11, chromosome 12, and chromosome 14;
(Iv) The method, wherein the normalized chromosome of chromosome X is selected from at least one of chromosome 2, chromosome 3, chromosome 4, chromosome 5, chromosome 6, and chromosome 8.