JP6777351B2

JP6777351B2 - Programs, information processing equipment and information processing methods

Info

Publication number: JP6777351B2
Application number: JP2020093387A
Authority: JP
Inventors: 邦裕西村; 貴司青木; 俊貴竹内; 純井村
Original assignee: XCOO INC.
Current assignee: XCOO INC.
Priority date: 2020-05-28
Filing date: 2020-05-28
Publication date: 2020-10-28
Anticipated expiration: 2039-03-07
Also published as: JP2020144940A

Description

本発明は、プログラム、情報処理装置および情報処理方法に関する。 The present invention relates to programs, information processing devices and information processing methods.

生検、採血または手術等により患者から採取された検体を用いて病理検査、遺伝子検査等が行なわれる。遺伝子検査においては、シーケンサを用いて読み取った核酸の塩基配列を可視化するゲノム解析装置等が提案されている（特許文献１） Pathological tests, genetic tests, etc. are performed using samples collected from patients by biopsy, blood sampling, surgery, etc. In genetic testing, a genome analyzer or the like that visualizes the base sequence of nucleic acid read by using a sequencer has been proposed (Patent Document 1).

国際公開第２０１６−１７５３３０号International Publication No. 2016-175330

塩基配列の変異状態と、抗がん剤の効果や患者の予後等との関係に関する情報は、多くの研究者および公的機関等により随時更新されている。治療方針を決定する際には、最新の情報に基づいて判断を行なうことが望ましい。一方、過去の治療方針の良否を判断する際には、治療方針を決定した時点の知見を参照する必要がある。 Information on the relationship between the mutation state of the base sequence and the effect of anticancer drugs and the prognosis of patients is updated from time to time by many researchers and public institutions. When deciding on a treatment policy, it is desirable to make a decision based on the latest information. On the other hand, when judging the quality of the past treatment policy, it is necessary to refer to the knowledge at the time when the treatment policy is decided.

しかしながら、特許文献１に記載されたゲノム解析装置では、検体から読み取られた塩基配列と、現在および過去の情報との関連を出力できない。 However, the genome analyzer described in Patent Document 1 cannot output the relationship between the base sequence read from the sample and the current and past information.

プログラムは、検体から検出された遺伝子変異を含む前記検体に関する解析結果を取得し、報告書出力要求を受け付けた場合、遺伝子変異と、複数の情報源から取得した前記遺伝子変異に関する医学情報と、前記医学情報の取得日および根拠情報とを関連づけて統合した統合ＤＢから、取得した前記遺伝子変異をキーとして医学情報を抽出し、前記検体に関する解析結果と、抽出した医学情報と、前記統合ＤＢのバージョンとを関連づけて記録した報告書を出力し、過去の日付および当該日付における報告書出力要求を受け付けた場合、前記日付における前記統合ＤＢから、取得した前記遺伝子変異をキーとして医学情報を抽出し、前記検体に関する解析結果と、抽出した医学情報と、前記統合ＤＢのバージョンとを関連づけて記録した報告書を出力し、遺伝子変異に関する医学情報が追加されることにより前記統合ＤＢが更新された場合、更新された前記統合ＤＢから、取得した前記遺伝子変異をキーとして医学情報を抽出し、前記検体に関する解析結果と、抽出した医学情報と、前記統合ＤＢのバージョンとを関連づけて記録した追加報告書を出力する処理をコンピュータに実行させる。 The program obtains the analysis results for the specimen containing the detected mutation from biopsy material, when receiving a report output request, and gene mutation, and the acquired genetic alterations related to medical information from a plurality of sources, from the integrated DB that integrates in association with an acquisition date and the basis information of the medical information, extracts the medical information the acquired genetic variation as a key, and the analysis results on the specimen, and extracted medical information, of the integrated DB outputting a report recorded in association with the version, extracts medical information when receiving a report output request in a past date and the date, said from the integrated DB before Symbol date, the acquired the gene mutation as a key Then, a report was output in which the analysis result regarding the sample, the extracted medical information, and the version of the integrated DB were recorded in association with each other, and the integrated DB was updated by adding the medical information regarding the gene mutation. In the case, an additional report in which medical information is extracted from the updated integrated DB using the acquired gene mutation as a key, and the analysis result regarding the sample, the extracted medical information, and the version of the integrated DB are recorded in association with each other. Have the computer execute the process of outputting the book .

一つの側面では、検体から読み取られた塩基配列と、現在および過去の情報との関連を出力するプログラム等を提供することを目的とする。 One aspect is to provide a program or the like that outputs the relationship between the base sequence read from the sample and the current and past information.

ゲノム解析システムを用いた処理の流れを説明する説明図である。It is explanatory drawing explaining the flow of processing using a genome analysis system. 学習モデルの生成方法を説明する説明図である。It is explanatory drawing explaining the generation method of the learning model. 統合ＤＢの概要を説明する説明図である。It is explanatory drawing explaining the outline of the integrated DB. ゲノムデータの概要を説明する説明図である。It is explanatory drawing explaining the outline of the genome data. ゲノム解析システムの構成を説明する説明図である。It is explanatory drawing explaining the structure of the genome analysis system. 教師データＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of a teacher data DB. 統合ＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of the integrated DB. 報告書ＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of a report DB. 学習モデルを説明する説明図である。It is explanatory drawing explaining the learning model. 報告書の例を説明する説明図である。It is explanatory drawing explaining the example of a report. コメント欄の例を説明する説明図である。It is explanatory drawing explaining the example of a comment column. 非同義体細胞変異欄の例を説明する説明図である。It is explanatory drawing explaining the example of the non-synonymous cell mutation column. 生殖細胞変異欄の例を説明する説明図である。It is explanatory drawing explaining the example of a germline mutation column. 解析欄の例を説明する説明図である。It is explanatory drawing explaining the example of the analysis column. プログラムの処理の流れを説明するフローチャートである。It is a flowchart explaining the process flow of a program. ＲＮＡ欄の例を説明する説明図である。It is explanatory drawing explaining the example of RNA column. 変更履歴ＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of the change history DB. 実施の形態３の報告書ＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of the report DB of Embodiment 3. 追加報告書を出力するプログラムの処理の流れを説明するフローチャートである。It is a flowchart explaining the process flow of the program which outputs an additional report. 専門家ＤＢのレコードレイアウトを説明する説明図である。It is explanatory drawing explaining the record layout of an expert DB. エキスパートパネルへの参加者を選択する画面の例を説明する説明図である。It is explanatory drawing explaining the example of the screen which selects the participant to the expert panel. エキスパートパネルへの参加依頼を確認する画面の例を説明する説明図である。It is explanatory drawing explaining the example of the screen which confirms the participation request to the expert panel. 実施の形態４の修正受付のサブルーチンの処理の流れを説明するフローチャートである。It is a flowchart explaining the process flow of the subroutine of modification acceptance of Embodiment 4. 統合ＤＢレビュー参加依頼画面の例を説明する説明図である。It is explanatory drawing explaining the example of the integrated DB review participation request screen. 統合ＤＢを更新するプログラムの処理の流れを説明するフローチャートである。It is a flowchart explaining the process flow of the program which updates the integrated DB. ゲノムデータから臨床上意味のある遺伝子変異を予測する段階の情報処理装置の機能ブロック図である。It is a functional block diagram of an information processing apparatus at the stage of predicting a clinically meaningful gene mutation from genomic data. 遺伝子変異と統合ＤＢとに基づいて報告書を作成する段階における情報処理装置の機能ブロック図である。It is a functional block diagram of an information processing apparatus at the stage of preparing a report based on a gene mutation and an integrated DB. 実施の形態７のゲノム解析システムの構成を説明する説明図である。It is explanatory drawing explaining the structure of the genome analysis system of Embodiment 7.

［実施の形態１］
図１は、ゲノム解析システム１０を用いた処理の流れを説明する説明図である。ゲノムは、１つの個体、ここでは一人のヒトの遺伝情報全体を意味する。 [Embodiment 1]
FIG. 1 is an explanatory diagram illustrating a flow of processing using the genome analysis system 10. The genome means the entire genetic information of one individual, here one human.

患者から検体が採取される。検体は、腫瘍部と、正常部との両方からそれぞれ採取されることが望ましい。腫瘍部の検体は、病変部の生検または手術等により採取される。以下の説明では、腫瘍部から採取された検体を腫瘍検体と記載する。血液がん等、血液に異常がある患者を除き、正常部の検体は採血等により採取される場合が多い。血液がんの患者の場合には、血液から腫瘍部の検体が採取され、それ以外の正常組織から正常部の検体が採取される。 Specimens are taken from the patient. It is desirable that the sample be collected from both the tumor part and the normal part, respectively. The sample of the tumor part is collected by biopsy or surgery of the lesion part. In the following description, a sample collected from the tumor site is referred to as a tumor sample. Except for patients with abnormal blood such as blood cancer, samples of normal parts are often collected by blood sampling or the like. In the case of a patient with blood cancer, a sample of the tumor part is collected from the blood, and a sample of the normal part is collected from other normal tissues.

それぞれの検体から核酸、すなわちＤＮＡ（Deoxyribonucleic Acid）またはＲＮＡ（Ribonucleic Acid）が抽出される。以下の説明では、ＤＮＡが抽出される場合を例にして説明する。読取装置３１によりＤＮＡの塩基配列が読み取られ、ゲノムデータが作成される。ゲノムデータの詳細については後述する。以下の説明においては、読取装置３１は次世代シーケンサである場合を例にして説明するが、読取装置３１はＤＮＡマイクロアレイその他塩基配列を読み取る任意の装置または機器であっても良い。 Nucleic acid, that is, DNA (Deoxyribonucleic Acid) or RNA (Ribonucleic Acid) is extracted from each sample. In the following description, a case where DNA is extracted will be described as an example. The base sequence of DNA is read by the reading device 31, and genomic data is created. Details of the genomic data will be described later. In the following description, the reading device 31 will be described by taking the case of a next-generation sequencer as an example, but the reading device 31 may be a DNA microarray or any other device or device that reads a base sequence.

ゲノムデータが学習モデル５３に入力される。学習モデル５３から、臨床上意味のある遺伝子変異の予測が出力される。出力された遺伝子変異と、医学文献等から収集した情報を統合した統合ＤＢ（Database）５２とに基づいて、報告書案が自動的に作成される。学習モデル５３および統合ＤＢ５２の詳細については後述する。 Genome data is input to the learning model 53. From the learning model 53, a prediction of a clinically meaningful gene mutation is output. A draft report is automatically created based on the output gene mutation and the integrated database (Database) 52 that integrates the information collected from the medical literature and the like. Details of the learning model 53 and the integrated DB 52 will be described later.

なお、学習モデル５３から臨床上の意味の有無にかかわらず遺伝子変異の予測が出力されても良い。そのようにする場合、学習モデル５３から出力された遺伝子変異と、統合ＤＢ５２とに基づいて、臨床上意味のある変異が抽出されて、報告書案が自動的に作成される。 In addition, the prediction of the gene mutation may be output from the learning model 53 regardless of the presence or absence of clinical significance. In such a case, clinically meaningful mutations are extracted based on the gene mutations output from the learning model 53 and the integrated DB 52, and a draft report is automatically prepared.

がん専門医および遺伝子学者等の専門家により構成されたエキスパートパネルが、報告書案をレビューし、必要に応じて修正することにより、報告書が完成する。患者の治療を担当する臨床医は、報告書を見て治療方針を判断する。報告書案および報告書の詳細については後述する。なお、エキスパートパネルによるレビューは行なわれなくても良い。このようにする場合、臨床医は、統合ＤＢ５２から出力された報告書案を見て治療方針を判断する。 The report is completed by an expert panel of experts such as oncologists and geneticists reviewing the draft report and modifying it as necessary. The clinician in charge of treating the patient looks at the report and decides the treatment policy. The draft report and the details of the report will be described later. The review by the expert panel does not have to be performed. In this case, the clinician looks at the draft report output from the integrated DB 52 and decides the treatment policy.

図２は、学習モデル５３の生成方法を説明する説明図である。腫瘍部の検体を用いて病理検査が行なわれる。腫瘍部の検体から、腫瘍細胞を含む部分が切り取られる。切り取られた検体から、腫瘍部のＤＮＡが抽出される。正常部の検体から、正常部のＤＮＡが抽出される。正常部のＤＮＡと、腫瘍部のＤＮＡとが読取装置３１に投入されて、ゲノムデータが作成される。 FIG. 2 is an explanatory diagram illustrating a method of generating the learning model 53. A pathological examination is performed using a sample of the tumor site. The part containing the tumor cells is cut out from the sample of the tumor part. The DNA of the tumor part is extracted from the cut sample. The DNA of the normal part is extracted from the sample of the normal part. The DNA of the normal part and the DNA of the tumor part are put into the reading device 31 to create genomic data.

病理検査の結果と、ゲノムデータと、その他の検査数値とに基づいて、腫瘍の良悪性、原発がんであるか否か、腫瘍部検体中の腫瘍含有量、効果を期待できる薬剤等を専門家が判断して、診断データを作成する。 Based on the results of pathological tests, genomic data, and other test values, experts on whether the tumor is benign or malignant, whether it is primary cancer, the tumor content in the tumor sample, and drugs that can be expected to be effective. Judges and creates diagnostic data.

ゲノムデータと診断データとが関連づけられて教師データＤＢ５１（図５参照）に記録される。教師データＤＢ５１の詳細については後述する。教師データＤＢ５１に基づいて教師あり機械学習を行ない、学習モデル５３が生成される。学習モデル５３は、検体に含まれる塩基配列を読み取ったゲノムデータが入力された場合に、検体にかかる遺伝子変異に関する予測を出力する学習済モデルである。 The genomic data and the diagnostic data are associated and recorded in the teacher data DB 51 (see FIG. 5). The details of the teacher data DB 51 will be described later. Supervised machine learning is performed based on the teacher data DB 51, and a learning model 53 is generated. The learning model 53 is a learned model that outputs a prediction regarding a gene mutation in a sample when genomic data obtained by reading a base sequence contained in the sample is input.

図３は、統合ＤＢ５２の概要を説明する説明図である。統合ＤＢ５２は、複数の情報源から取得した遺伝子変異に関する医学情報と、当該医学情報の取得元とを関連づけて統合したＤＢである。情報源は、たとえば医学論文を公開するＤＢ、国または研究機関等が、薬剤または治療法の臨床試験に関する情報を公開するＤＢ、企業または大学等が発行した医療に関するプレスリリース等の公開情報を蓄積したＤＢ等の、種々の医学情報ＤＢ５８である。 FIG. 3 is an explanatory diagram illustrating an outline of the integrated DB 52. The integrated DB 52 is a DB that associates and integrates medical information related to gene mutations acquired from a plurality of information sources with the acquisition source of the medical information. Information sources include, for example, DBs that publish medical papers, DBs that publish information on clinical trials of drugs or treatments by the national government or research institutes, and public information such as medical press releases issued by companies or universities. It is various medical information DB 58 such as the said DB.

医学情報ＤＢ５８は、無償で公開されているＤＢであっても、有償で公開されているＤＢであっても良い。なお、有償で公開されているＤＢを使用する場合には、有償ＤＢの提供元と、統合ＤＢ５２の提供元との間で、適切なライセンス契約を締結する等の、ライセンス処理を行う。 The medical information DB 58 may be a DB that is open to the public free of charge or a DB that is open to the public for a fee. When using a DB that is open to the public for a fee, license processing such as concluding an appropriate license agreement between the provider of the paid DB and the provider of the integrated DB 52 is performed.

それぞれの医学情報ＤＢ５８には、異なるフォーマットで医学情報が記録されており、異なるタイミングで情報が更新される。それぞれの医学情報ＤＢ５８にアクセスして、情報を収集してデータベース化するクローリングにより、統合ＤＢ５２が作成される。 Medical information is recorded in different formats in each medical information DB 58, and the information is updated at different timings. An integrated DB 52 is created by crawling to access each medical information DB 58, collect information, and create a database.

クローリングは適宜行なわれ、更新された統合ＤＢ５２が作成される。それぞれの統合ＤＢ５２は、たとえば更新日または更新日時等が判別できる状態でバージョン管理される。統合ＤＢ５２の詳細については後述する。 Crawling is performed as appropriate, and an updated integrated DB 52 is created. Each integrated DB 52 is version-controlled in a state where, for example, an update date or an update date and time can be determined. Details of the integrated DB 52 will be described later.

なお、それぞれの統合ＤＢ５２には、前のバージョンとの差分、または、任意のバージョンとの差分が記録され、必要に応じて任意の時点における統合ＤＢ５２を構築できるように構成されても良い。差分を記録することにより、統合ＤＢ５２の記録容量を節約できる。 In addition, each integrated DB 52 may be configured so that a difference from the previous version or a difference from an arbitrary version is recorded, and the integrated DB 52 at an arbitrary time can be constructed as needed. By recording the difference, the recording capacity of the integrated DB 52 can be saved.

図４は、ゲノムデータの概要を説明する説明図である。検体に対して前処理が行なわれる。具体的には、前述のとおり検体からＤＮＡが抽出される。抽出されたＤＮＡに対して、精製、断片化および増幅等の処理が行なわれる。断片化により、ＤＮＡは後工程で使用される読取装置３１による読み取りに適した長さの断片に切断される。 FIG. 4 is an explanatory diagram illustrating an outline of genomic data. Pretreatment is performed on the sample. Specifically, as described above, DNA is extracted from the sample. The extracted DNA is subjected to treatments such as purification, fragmentation and amplification. Fragmentation cuts the DNA into fragments of length suitable for reading by the reader 31 used in the subsequent step.

読取装置３１は、断片化されたそれぞれのＤＮＡの塩基配列を順次読み取る。１本のＤＮＡ断片から読み取られた塩基配列に関する情報はリードと呼ばれる。リードには、個々の塩基について読み取りの信頼度を示すクオリティスコアも記録される。 The reading device 31 sequentially reads the base sequence of each fragmented DNA. Information about the base sequence read from one DNA fragment is called a read. The read also records a quality score that indicates the reliability of the reading for each base.

それぞれのリードは、たとえば日本人の基準ゲノム配列（Japanese Reference Genome:JRG）、または、国際ヒトゲノム参照配列等の参照配列にマッピングされる。マッピング結果は、たとえばＢＡＭ形式、ＳＡＭ形式またはＣＲＡＭ形式のファイルに記録される。 Each read is mapped to a reference sequence such as the Japanese Reference Genome (JRG) or the International Human Genome Reference Sequence. The mapping result is recorded in a file of, for example, BAM format, SAM format or CRAM format.

マッピング結果と、参照配列との相違点、すなわち参照配列に対して検体のゲノムが変異している箇所の位置および変異内容等についての情報が、たとえばＶＣＦ形式またはＢＣＦ形式のファイルに記録される。 Information about the difference between the mapping result and the reference sequence, that is, the position where the genome of the sample is mutated with respect to the reference sequence, the content of the mutation, and the like is recorded in a file in, for example, VCF format or BCF format.

なお、ＶＣＦ形式のファイルには、遺伝情報がコードされていないイントロンの変異、および、コードされたアミノ酸に変化を生じない同義変異等、臨床的な重要性の低い変異が多数含まれる。したがって、ＶＣＦ形式のファイルから、治療方針等を定めるための情報を読み取るには、高度な専門知識を要する。 The VCF format file contains many mutations of low clinical importance, such as intron mutations in which the genetic information is not encoded and synonymous mutations in which the encoded amino acids are not changed. Therefore, reading information for determining a treatment policy or the like from a VCF format file requires a high degree of specialized knowledge.

ＦＡＳＴＱ形式のファイルおよび参照配列が与えられれば、公知の解析手法により、ＢＡＭ形式、ＳＡＭ形式、ＣＲＡＭ形式およびＶＣＦ形式のファイルに変換できる。以上に説明した、ＦＡＳＴＱ形式、ＢＡＭ形式、ＳＡＭ形式、ＣＲＡＭ形式、ＶＣＦ形式およびＢＣＦ形式のデータを総称して、ゲノムデータと呼ぶ。ゲノムデータは、ここに例示した形式以外の任意の形式のデータであっても良い。 Given a FASTQ format file and a reference sequence, it can be converted into a BAM format, a SAM format, a CRAM format and a VCF format file by a known analysis method. The data in FASTQ format, BAM format, SAM format, CRAM format, VCF format and BCF format described above are collectively referred to as genomic data. The genomic data may be in any format other than the format exemplified here.

たとえば、読取装置３１がＦＡＳＴＱ形式のファイルを出力し、図示を省略する解析装置がＢＡＭ形式およびＶＣＦ形式のファイルに変換する。読取装置３１が解析装置を内蔵し、直接ＢＡＭ形式およびＶＣＦ形式のファイルを出力しても良い。後述する情報処理装置２０（図５参照）が、ＦＡＳＴＱ形式またはＢＡＭ形式のファイルを取得して、ＶＣＦ形式に変換しても良い。 For example, the reading device 31 outputs a FASTQ format file, and an analysis device (not shown) converts the file into a BAM format file and a VCF format file. The reading device 31 may have a built-in analysis device and directly output BAM format and VCF format files. The information processing device 20 (see FIG. 5), which will be described later, may acquire a file in FASTQ format or BAM format and convert it to VCF format.

ＣＮＡ（Copy Number Alteration：体細胞コピー数異常）解析を行なう場合には、患者から採取した複数の正常部の検体から得られたゲノムデータと、腫瘍部の検体から得られたゲノムデータとを比較する。 When performing CNA (Copy Number Alteration) analysis, the genomic data obtained from multiple normal part samples collected from patients is compared with the genomic data obtained from tumor part samples. To do.

ＣＮＡ解析には、ＰＯＮ（Panel Of Normals）の手法が使用されても良い。ＰＯＮを用いる場合には、複数の人から採取された正常部検体について、たとえばＢＡＭ形式またはＳＡＭ形式のゲノムデータを作成し、保存しておく。患者から採取された腫瘍部の検体から得られたゲノムデータと、保存済のゲノムデータとを比較して、解析を行なう。 A PON (Panel Of Normals) method may be used for CNA analysis. When PON is used, genomic data in, for example, BAM format or SAM format is created and stored for normal part samples collected from a plurality of people. The genomic data obtained from the tumor sample collected from the patient is compared with the preserved genomic data for analysis.

図５は、ゲノム解析システム１０の構成を説明する説明図である。ゲノム解析システム１０は、情報処理装置２０、読取装置３１およびデータサーバ３２を備える。 FIG. 5 is an explanatory diagram illustrating the configuration of the genome analysis system 10. The genome analysis system 10 includes an information processing device 20, a reading device 31, and a data server 32.

情報処理装置２０は、制御部２１、主記憶装置２２、補助記憶装置２３、通信部２４、およびバスを備える。制御部２１は、本実施の形態のプログラムを実行する演算制御装置である。制御部２１は、一もしくは複数のＣＰＵ（Central Processing Unit）、マルチコアＣＰＵまたはＧＰＵ（Graphics Processing Unit）等により構成される。制御部２１は、バスを介して情報処理装置２０を構成するハードウェア各部と接続されている。 The information processing device 20 includes a control unit 21, a main storage device 22, an auxiliary storage device 23, a communication unit 24, and a bus. The control unit 21 is an arithmetic control device that executes the program of the present embodiment. The control unit 21 is composed of one or a plurality of CPUs (Central Processing Units), a multi-core CPU, a GPU (Graphics Processing Unit), and the like. The control unit 21 is connected to each hardware unit constituting the information processing device 20 via a bus.

主記憶装置２２は、ＳＲＡＭ（Static Random Access Memory）、ＤＲＡＭ（Dynamic Random Access Memory）、フラッシュメモリ等の記憶装置である。主記憶装置２２には、制御部２１が行なう処理の途中で必要な情報および制御部２１で実行中のプログラムが一時的に保存される。 The main storage device 22 is a storage device such as a SRAM (Static Random Access Memory), a DRAM (Dynamic Random Access Memory), and a flash memory. The main storage device 22 temporarily stores information necessary in the middle of processing performed by the control unit 21 and a program being executed by the control unit 21.

補助記憶装置２３は、ＳＲＡＭ、フラッシュメモリまたはハードディスク等の記憶装置である。補助記憶装置２３には、教師データＤＢ５１、統合ＤＢ５２、学習モデル５３、報告書案ＤＢ５５、報告書ＤＢ５６、制御部２１に実行させるプログラム、およびプログラムの実行に必要な各種データが保存される。なお、教師データＤＢ５１、統合ＤＢ５２、学習モデル５３、報告書案ＤＢ５５および報告書ＤＢ５６は、情報処理装置２０に接続された外部の大容量記憶装置、または、データサーバ３２等に保存されていても良い。 The auxiliary storage device 23 is a storage device such as a SRAM, a flash memory, or a hard disk. The auxiliary storage device 23 stores a teacher data DB 51, an integrated DB 52, a learning model 53, a report draft DB 55, a report DB 56, a program to be executed by the control unit 21, and various data necessary for executing the program. The teacher data DB 51, the integrated DB 52, the learning model 53, the report draft DB 55, and the report DB 56 may be stored in an external large-capacity storage device connected to the information processing device 20, a data server 32, or the like. ..

通信部２４は、情報処理装置２０とネットワークとの間の通信を行なうインターフェイスである。 The communication unit 24 is an interface for communicating between the information processing device 20 and the network.

前述のとおり、読取装置３１は、次世代シーケンサ、ＤＮＡマイクロアレイその他塩基配列を読み取る任意の装置または機器である。読取装置３１が読み取った塩基配列に基づいて作成されたゲノムデータはデータサーバ３２に記録される。制御部２１は、通信部２４およびネットワークを介してデータサーバ３２に記録されたゲノムデータを取得できる。なお、制御部２１は、データサーバ３２を介さず、読取装置３１から直接ゲノムデータを取得してもよい。 As described above, the reading device 31 is a next-generation sequencer, a DNA microarray, or any other device or device that reads a base sequence. The genomic data created based on the base sequence read by the reading device 31 is recorded in the data server 32. The control unit 21 can acquire the genomic data recorded in the data server 32 via the communication unit 24 and the network. The control unit 21 may acquire the genome data directly from the reading device 31 without going through the data server 32.

本実施の形態の情報処理装置２０は、汎用のパソコン、タブレット、大型計算機、または、大型計算機上で動作する仮想マシンである。情報処理装置２０は、複数のパソコン、タブレットまたは大型計算機等のハードウェアにより構成されても良い。情報処理装置２０は、量子コンピュータにより構成されても良い。情報処理装置２０は、読取装置３１と一体化されていても良い。情報処理装置２０は、いわゆるクラウドコンピューティングにより実現されても良い。 The information processing device 20 of the present embodiment is a general-purpose personal computer, a tablet, a large-scale computer, or a virtual machine that operates on the large-scale computer. The information processing device 20 may be composed of a plurality of personal computers, tablets, hardware such as a large computer, and the like. The information processing device 20 may be configured by a quantum computer. The information processing device 20 may be integrated with the reading device 31. The information processing device 20 may be realized by so-called cloud computing.

図６は、教師データＤＢ５１のレコードレイアウトを説明する説明図である。教師データＤＢ５１は、ゲノムデータと診断データとを関連づけて記録するＤＢである。図６には、教師データＤＢ５１の１つのレコードを示す。 FIG. 6 is an explanatory diagram illustrating a record layout of the teacher data DB 51. The teacher data DB 51 is a DB that records the genomic data and the diagnostic data in association with each other. FIG. 6 shows one record of the teacher data DB 51.

教師データＤＢ５１は、検体フィールド、ゲノムデータフィールドおよび診断データフィールドを有する。検体フィールドは、正常部検体フィールドおよび腫瘍部検体フィールドを有する。ゲノムデータフィールドは、正常部ゲノムフィールドおよび腫瘍部ゲノムフィールドを有する。なお、教師データＤＢ５１は、正常部ゲノムフィールドを有さなくても良い。 The teacher data DB 51 has a sample field, a genome data field, and a diagnostic data field. The sample field includes a normal part sample field and a tumor part sample field. The genomic data field has a normal genomic field and a tumor genomic field. The teacher data DB 51 does not have to have a normal genome field.

診断データフィールドは、非同義体細胞変異フィールド、生殖細胞変異フィールドおよび腫瘍含有量フィールドを有する。非同義体細胞変異フィールドは、遺伝子フィールドおよびＤＮＡ変異フィールドを有する。生殖細胞変異フィールドは、遺伝子フィールドおよびＤＮＡ変異フィールドを有する。教師データＤＢ５１は、１組の教師データについて１つのレコードを有する。なお、診断データフィールドは、腫瘍含有量フィールドを有さなくてもよい。 Diagnostic data fields include non-synonymous cell mutation fields, germline mutation fields and tumor content fields. The non-synonymous cell mutation field has a gene field and a DNA mutation field. The germline mutation field has a gene field and a DNA mutation field. The teacher data DB 51 has one record for one set of teacher data. The diagnostic data field does not have to have a tumor content field.

正常部検体フィールドには、正常部の検体が採取された部位が記録される。腫瘍部検体フィールドには、腫瘍部の検体が採取された部位が記録される。正常部ゲノムフィールドには、正常部検体から取得したゲノムデータのファイル名が記録される。腫瘍部ゲノムフィールドには、腫瘍部検体から取得したゲノムデータのファイル名が記録される。 In the normal part sample field, the part where the normal part sample was collected is recorded. In the tumor part sample field, the site where the tumor part sample was collected is recorded. In the normal part genome field, the file name of the genome data obtained from the normal part sample is recorded. In the tumor part genome field, the file name of the genome data obtained from the tumor part sample is recorded.

非同義体細胞変異フィールドのサブフィールドには、腫瘍部ゲノムに含まれる非同義体細胞変異、すなわちＤＮＡの塩基配列にコードされたアミノ酸に変化を生じさせる体細胞変異を有する遺伝子と、変異内容とが記録される。体細胞変異は、正常部ゲノムには生じていないが、腫瘍部ゲノムには生じている変異を意味する。すなわち非同義体細胞変異は、腫瘍の特性に関する変異である。 In the subfield of the non-synonymous cell mutation field, there are genes having non-synonymous cell mutations contained in the tumor genome, that is, somatic mutations that cause changes in amino acids encoded by the DNA base sequence, and mutation contents. Is recorded. Somatic mutations mean mutations that do not occur in the normal genome but do occur in the tumor genome. That is, non-synonymous cell mutations are mutations related to tumor characteristics.

たとえば、図６の非同義体細胞変異フィールドの１行目は、ＡＲＩＤ１Ａ（AT-rich interactive domain 1A）遺伝子の５１６４番目の塩基がＣ（シトシン）からＴ（チミン）に変異していることを示す。同様に２行目はＴＰ５３遺伝子の７４３番目の塩基がＧ（グアニン）からＡ（アデニン）に変異していることを示す。 For example, the first line of the non-synonymous cell mutation field in FIG. 6 shows that the 5164th base of the ARID1A (AT-rich interactive domain 1A) gene is mutated from C (cytosine) to T (thymine). .. Similarly, the second line shows that the 743rd base of the TP53 gene is mutated from G (guanine) to A (adenine).

生殖細胞変異フィールドのサブフィールドには、正常部ゲノムに含まれる変異を有する遺伝子と、変異内容とが記録される。たとえば、図６の生殖細胞変異フィールドの１行目は、ＢＲＡＦ遺伝子の１７９１番目の塩基がＴからＧに変異していることを示す。 In the subfield of the germline mutation field, a gene having a mutation contained in the normal genome and the content of the mutation are recorded. For example, the first line of the germline mutation field in FIG. 6 shows that the 1791 base of the BRAF gene is mutated from T to G.

非同義体細胞フィールドおよび生殖細胞変異フィールドには、検体から検出された遺伝子変異のうち、教師データに記録する必要がある任意の数の遺伝子が記録される。 In the non-synonymous cell field and germline mutation field, any number of gene mutations detected in the sample that need to be recorded in the teacher data are recorded.

なお、正常部の検体を採取してゲノムデータを取得する代わりに、日本人の基準ゲノム配列等の参照配列を使用する場合がある。このようにする場合には、生殖細胞変異に関する結果は、推定結果である。 In addition, instead of collecting a sample of a normal part and acquiring genomic data, a reference sequence such as a Japanese reference genomic sequence may be used. In this case, the results for germline mutations are putative results.

診断データフィールドは、同義体細胞変異を記録する同義体細胞変異フィールドを有しても良い。非同義体細胞変異フィールドの代わりに体細胞変異フィールドを有し、同義体細胞変異と非同義体細胞変異の両方を記録しても良い。 The diagnostic data field may have a synonymous cell mutation field that records the synonymous cell mutation. It may have a somatic mutation field instead of a non-synonymous cell mutation field and record both a somatic cell mutation and a non-synonymous cell mutation.

腫瘍含有量フィールドには、腫瘍部から採取した検体の腫瘍含有量が記録される。腫瘍含有量は、たとえばヘテロＳＮＰ（Single Nucleotide Polymorphism）数に基づいて算出される。ＢＡＭファイルまたはＳＡＭファイルに記録されたアリル頻度、または、ＢＡＭファイルまたはＳＡＭファイルに記録されたデータから算出されたアリル頻度に基づいて、腫瘍含有量が算出されても良い。 In the tumor content field, the tumor content of the sample collected from the tumor site is recorded. Tumor content is calculated, for example, based on the number of hetero SNPs (Single Nucleotide Polymorphisms). Tumor content may be calculated based on the allele frequency recorded in the BAM file or SAM file, or the allele frequency calculated from the data recorded in the BAM file or SAM file.

病理検査により観察された有核細胞の数と腫瘍細胞の数との比、または、顕微鏡視野内で腫瘍細胞が占める面積に基づいて、腫瘍含有量が算出されても良い。腫瘍含有量の定義は任意であるが、教師データＤＢ５１に含まれるすべての教師データにおいて、統一した定義が用いられていることが望ましい。 Tumor content may be calculated based on the ratio of the number of nucleated cells to the number of tumor cells observed by pathological examination, or the area occupied by the tumor cells in the microscopic field of view. The definition of tumor content is arbitrary, but it is desirable that a unified definition be used for all teacher data included in the teacher data DB 51.

図７は、統合ＤＢ５２のレコードレイアウトを説明する説明図である。統合ＤＢ５２は、複数の情報源から取得した遺伝子変異に関する医学情報と、当該医学情報の取得元とを関連づけて統合したＤＢである。統合ＤＢ５２は、バージョンフィールド、ゲノム変異フィールドおよび知識データフィールドを有する。 FIG. 7 is an explanatory diagram illustrating the record layout of the integrated DB 52. The integrated DB 52 is a DB that associates and integrates medical information related to gene mutations acquired from a plurality of information sources with the acquisition source of the medical information. The integrated DB 52 has a version field, a genomic mutation field and a knowledge data field.

バージョンフィールドには、統合ＤＢ５２のバージョンが記録されている。本実施の形態では、統合ＤＢ５２は更新日付で管理されている。ゲノム変異フィールドは、検体フィールド、遺伝子フィールドおよび変異内容フィールドを有する。知識データフィールドは、発がん性フィールド、臨床的意義フィールド、対応薬剤フィールド、対応疾患フィールド、レベルフィールドおよび根拠情報フィールドを有する。統合ＤＢ５２は、遺伝子変異に関する１件の医学情報について、１つのレコードを有する。 The version of the integrated DB 52 is recorded in the version field. In this embodiment, the integrated DB 52 is managed by the update date. The genomic mutation field has a sample field, a gene field, and a mutation content field. The knowledge data field has a carcinogenicity field, a clinical significance field, a corresponding drug field, a corresponding disease field, a level field and a rationale information field. The integrated DB 52 has one record for one medical piece of information about a gene mutation.

検体フィールドには、検体が採取された部位が記録される。遺伝子フィールドには、変異が検出された遺伝子が記録される。なお、複数の変異の組合せに関する医学情報が記録されたレコードにおいては、遺伝子フィールドに複数の遺伝子が記録される。 In the sample field, the site where the sample was collected is recorded. In the gene field, the gene in which the mutation is detected is recorded. In the record in which medical information regarding a combination of a plurality of mutations is recorded, a plurality of genes are recorded in the gene field.

変異内容フィールドには、非同義体細胞変異または生殖細胞変異等の、変異の内容が記録される。なお、コードされたアミノ酸に変化が生じない同義体細胞変異に関する情報も統合ＤＢ５２に記録される場合がある。 In the mutation content field, the content of the mutation, such as a non-synonymous cell mutation or a germline mutation, is recorded. In addition, information on synonymous cell mutations in which the encoded amino acid does not change may also be recorded in the integrated DB 52.

発がん性フィールドには、ゲノム変異の発がん性のレベルが記録される。臨床的意義フィールドには、ゲノム変異の臨床的意義が記録される。知識データフィールドは、発がん性フィールドと、臨床的意義フィールドは、いずれか一方のみを有してもよい。 In the carcinogenicity field, the carcinogenicity level of the genomic mutation is recorded. The clinical significance of genomic mutations is recorded in the clinical significance field. The knowledge data field may have only one of the carcinogenicity field and the clinical significance field.

対応薬剤フィールドには、ゲノム変異を有する患者に投与した場合に効果がある薬剤が記録される。対応薬剤フィールドに、治験中の薬剤が記録されても良い。対応疾患フィールドには、ゲノム変異に対応する疾患が記録される。レベルフィールドには、ゲノム変異の重要度のレベルが記録される。根拠情報フィールドには、レコードに記載された情報の根拠である文献、データベース名、または、情報に固有に付与されたＩＤ（Identifier）等の、根拠情報にアクセスするための情報が記録される。 In the corresponding drug field, drugs that are effective when administered to patients with genomic mutations are recorded. The drug under investigation may be recorded in the corresponding drug field. Diseases corresponding to genomic mutations are recorded in the corresponding disease field. The level field records the level of importance of the genomic mutation. In the rationale information field, information for accessing the rationale information such as a document, a database name, or an ID (Identifier) uniquely assigned to the information, which is the basis of the information described in the record, is recorded.

知識データフィールドの各サブフィールドにおいて「−」は対応する情報がないことを意味する。 A "-" in each subfield of the knowledge data field means that there is no corresponding information.

図８は、報告書ＤＢ５６のレコードレイアウトを説明する説明図である。報告書ＤＢ５６は、検体に関する情報と、検体に基づく診断データとを関連づけて記録したＤＢである。図８には、報告書ＤＢ５６の１つのレコードを示す。 FIG. 8 is an explanatory diagram illustrating the record layout of the report DB 56. The report DB 56 is a DB that records information about a sample in association with diagnostic data based on the sample. FIG. 8 shows one record of report DB56.

報告書ＤＢ５６は、検体ＩＤフィールド、検体フィールド、ゲノムデータフィールド、統合ＤＢＶｅｒ．フィールド、診断データフィールドおよびエキスパートＩＤフィールドを有する。検体フィールドは、正常部検体フィールドおよび腫瘍部検体フィールドを有する。ゲノムデータフィールドは、正常部ゲノムフィールドおよび腫瘍部ゲノムフィールドを有する。 The report DB56 contains a sample ID field, a sample field, a genome data field, and an integrated DB Ver. It has fields, diagnostic data fields and expert ID fields. The sample field includes a normal part sample field and a tumor part sample field. The genomic data field has a normal genomic field and a tumor genomic field.

診断データフィールドは、非同義体細胞変異フィールド、生殖細胞変異フィールドおよび腫瘍含有量フィールドを有する。非同義体細胞変異フィールドは、診断データフィールドおよび知識データフィールドを有する。診断データフィールドは、遺伝子フィールドおよびＤＮＡ変異フィールドを有する。知識データフィールドは、発がん性フィールド、臨床的意義フィールド、対応薬剤フィールド、対応疾患フィールド、レベルフィールドおよび根拠情報フィールドを有する。 Diagnostic data fields include non-synonymous cell mutation fields, germline mutation fields and tumor content fields. The non-synonymous cell mutation field has a diagnostic data field and a knowledge data field. The diagnostic data field has a gene field and a DNA mutation field. The knowledge data field has a carcinogenicity field, a clinical significance field, a corresponding drug field, a corresponding disease field, a level field and a rationale information field.

生殖細胞変異フィールドは、診断データフィールドおよび知識データフィールドを有する。診断データフィールドは、遺伝子フィールドおよびＤＮＡ変異フィールドを有する。知識データフィールドは、臨床的意義フィールド、レベルフィールドおよび根拠情報フィールドを有する。報告書ＤＢ５６は、１組の検体について、１つのレコードを有する。 The germline mutation field has a diagnostic data field and a knowledge data field. The diagnostic data field has a gene field and a DNA mutation field. The knowledge data field has a clinical significance field, a level field and a rationale information field. Report DB56 has one record for a set of samples.

検体ＩＤフィールドには、１組の検体に固有に付与された検体ＩＤが記録される。検体ＩＤは、電子カルテシステム等と連携して、患者に紐付けられている。正常部検体フィールドには、正常部の検体が採取された部位が記録される。腫瘍部検体フィールドには、腫瘍部の検体が採取された部位が記録される。正常部ゲノムフィールドには、正常部検体から取得したゲノムデータのファイル名が記録される。腫瘍部ゲノムフィールドには、腫瘍部検体から取得したゲノムデータのファイル名が記録される。統合ＤＢＶｅｒ．フィールドには、報告書レコードの作成時に用いられた統合ＤＢ５２のバージョンが記録される。 In the sample ID field, a sample ID uniquely assigned to one set of samples is recorded. The sample ID is associated with the patient in cooperation with an electronic medical record system or the like. In the normal part sample field, the part where the normal part sample was collected is recorded. In the tumor part sample field, the site where the tumor part sample was collected is recorded. In the normal part genome field, the file name of the genome data obtained from the normal part sample is recorded. In the tumor part genome field, the file name of the genome data obtained from the tumor part sample is recorded. Integrated DB Ver. In the field, the version of the integrated DB 52 used when creating the report record is recorded.

非同義体細胞変異フィールド中の診断データフィールドのサブフィールドには、非同義体細胞変異を有する遺伝子と、変異内容とが記録される。知識データフィールドの各サブフィールドには、診断データフィールドに記録された遺伝子変異に関連する医学情報が記録される。各サブフィールドに記録される情報は、図７を使用して説明した統合ＤＢ５２中の同名のサブフィールドに記録される情報と同様であるため、説明を省略する。 In the subfield of the diagnostic data field in the non-synonymous cell mutation field, the gene having the non-synonymous cell mutation and the mutation content are recorded. Each subfield of the knowledge data field records medical information related to the gene mutation recorded in the diagnostic data field. Since the information recorded in each subfield is the same as the information recorded in the subfield of the same name in the integrated DB 52 described with reference to FIG. 7, the description thereof will be omitted.

生殖細胞変異フィールド中の診断データフィールドのサブフィールドには、生殖細胞変異を有する遺伝子と、変異内容とが記録される。知識データフィールドの各サブフィールドには、診断データフィールドに記録された遺伝子変異に関連する医学情報が記録される。各サブフィールドに記録される情報は、図７を使用して説明した統合ＤＢ５２中の同名のサブフィールドに記録される情報と同様であるため、説明を省略する。 In the subfield of the diagnostic data field in the germline mutation field, the gene having the germline mutation and the mutation content are recorded. Each subfield of the knowledge data field records medical information related to the gene mutation recorded in the diagnostic data field. Since the information recorded in each subfield is the same as the information recorded in the subfield of the same name in the integrated DB 52 described with reference to FIG. 7, the description thereof will be omitted.

エキスパートＩＤフィールドには、後述するプログラムにより制御部２１が自動的に作成した報告書案をレビューしたエキスパートパネルを構成した専門家にそれぞれ固有に付与された専門家ＩＤが記録される。複数の専門家が参加する専門家グループに対して、１つのエキスパートＩＤが付与されてもよい。 In the expert ID field, an expert ID uniquely assigned to each expert who constitutes the expert panel that reviews the draft report automatically created by the control unit 21 by the program described later is recorded. One expert ID may be assigned to an expert group in which a plurality of experts participate.

報告書案ＤＢ５５のレコードレイアウトは、エキスパートＩＤフィールドを有さない他は、図８を使用して説明した報告書ＤＢ５６のレコードレイアウトと同一であるため、図示および詳細な説明を省略する。 Since the record layout of the draft report DB 55 is the same as the record layout of the report DB 56 described with reference to FIG. 8 except that it does not have an expert ID field, illustration and detailed description are omitted.

図９は、学習モデル５３を説明する説明図である。学習モデル５３は、入力層５３１、中間層５３２および出力層５３３を備えるニューラルネットワークである。図９においては、学習モデル５３はＣＮＮである場合を例示する。なお、畳み込み層およびプーリング層については、図示を省略する。 FIG. 9 is an explanatory diagram illustrating the learning model 53. The learning model 53 is a neural network including an input layer 531 and an intermediate layer 532 and an output layer 533. In FIG. 9, the case where the learning model 53 is CNN is illustrated. The convolutional layer and the pooling layer are not shown.

学習モデル５３の入力は、腫瘍部のゲノムデータ、正常部のゲノムデータ、腫瘍部検体が採取された部位および正常部検体が採取された部位である。ゲノムデータは、たとえばパイルアップされたアラインメント情報のテンソルであり、塩基配列、ストランド情報、ベースクオリティおよびマップクオリティ等を構成要素に含む。塩基配列は、Ａ、Ｔ、Ｇ、Ｃの各塩基のカウントで表されてもよい。学習モデル５３に入力されたデータは、図示を省略する畳み込み層およびプーリング層の繰り返しを介して、入力層５３１に入力する。 The inputs of the learning model 53 are the genome data of the tumor part, the genome data of the normal part, the part where the tumor part sample was collected, and the part where the normal part sample was collected. Genome data is, for example, a tensor of piled-up alignment information, and includes base sequence, strand information, base quality, map quality, and the like as components. The base sequence may be represented by the count of each base of A, T, G, and C. The data input to the learning model 53 is input to the input layer 531 via the repetition of the convolutional layer and the pooling layer (not shown).

学習モデル５３の出力は、たとえば診断データの各項目の確率である。具体的には、臨床的に意味のある変異それぞれが発生じている確率、および、腫瘍含有量が所定の値である確率である。たとえば図９において一番上の出力ノードには、ＢＲＣＡ遺伝子の６９５２番目の塩基がＣからＴに変異した体細胞変異が生じている確率が、２番目の出力ノードには、ＢＲＣＡ遺伝子の６９５２番目の塩基がＣからＴに変異した生殖細胞変異が生じている確率がそれぞれ出力される。 The output of the learning model 53 is, for example, the probability of each item of the diagnostic data. Specifically, it is the probability that each clinically significant mutation has occurred and the probability that the tumor content is a predetermined value. For example, in FIG. 9, the top output node has a somatic mutation in which the 6952th base of the BRCA gene is mutated from C to T, and the second output node has a 6952th BRCA gene mutation. The probability of a germline mutation in which the base of is mutated from C to T is output.

なお、体細胞は対立遺伝子を含むため、検体の体細胞は父親由来の「ＢＲＣＡ遺伝子の６９５２番目の塩基」と、母親由来の「ＢＲＣＡ遺伝子の６９５２番目の塩基」とを有する。したがって、体細胞の変異には、父親由来の遺伝子と母親由来遺伝子との双方が変異している場合、父親由来の遺伝子のみが変異している場合、および、母親由来の遺伝子のみが変異している場合が含まれる。 Since the somatic cell contains an allele, the somatic cell of the sample has a "base 6952 of the BRCA gene" derived from the father and a "base 6952 of the BRCA gene" derived from the mother. Therefore, somatic cell mutations include mutations in both the father-derived gene and the mother-derived gene, mutations in only the father-derived gene, and mutations in only the mother-derived gene. Includes cases where

たとえば、学習モデル５３の出力は、HomoRef、Hetero、および、HomoAltのスコアであってもよい。HomoRef、Hetero、および、HomoAltは、deepvariant等のゲノム解析用バリアントコーラーで使用される指標である。 For example, the output of the training model 53 may be the scores of HomoRef, Hetero, and Homo Alt. HomoRef, Hetero, and HomoAlt are indicators used in variant callers for genome analysis such as deep variant.

図９の一番下の出力ノードには、腫瘍含有量が１０パーセントである確率が出力される。出力ノードは、たとえば１０パーセント刻み等の任意の腫瘍含有量である確率を出力するノードを含む。 The output node at the bottom of FIG. 9 outputs the probability that the tumor content is 10 percent. Output nodes include nodes that output the probability of any tumor content, eg, in 10 percent increments.

学習モデル５３は、入力層５３１にゲノムデータおよび検体採取部位が入力された場合に、出力層５３３に臨床的に意味のあるそれぞれの変異が生じている、および、所定の腫瘍含有量である確率を出力する。学習段階においては、制御部２１は、ゲノムデータおよび検体採取部位と、臨床上の意味のある変異の有無および腫瘍含有量に関する診断データとを関連づけて記録した教師データＤＢ５１を用いて、誤差逆伝播法等を用いて中間層５３２のパラメータを演算することにより、教師あり機械学習を行なう。 In the learning model 53, when genomic data and sampling sites are input to the input layer 531, it is probable that each clinically meaningful mutation occurs in the output layer 533 and that the tumor content is predetermined. Is output. In the learning stage, the control unit 21 uses the supervised data DB 51, which records the genomic data and the sample collection site in association with the diagnostic data regarding the presence or absence of clinically meaningful mutations and the tumor content, and back-propagates the error. Supervised machine learning is performed by calculating the parameters of the intermediate layer 532 using a method or the like.

教師あり機械学習は、たとえばロジスティック回帰、ＳＶＭ（Support Vector Machine）、ランダムフォレスト、ＣＮＮ、ＲＮＮまたは、ＸＧＢｏｏｓｔ（eXtreme Gradient Boosting）等の任意の手法により行なえる。 Supervised machine learning can be performed by any method such as logistic regression, SVM (Support Vector Machine), random forest, CNN, RNN, or XGBost (eXtreme Gradient Boosting).

学習モデル５３は任意のコンピュータを用いて生成されても良い。生成された学習モデル５３は、ネットワーク等を介して情報処理装置２０に送信されて、補助記憶装置２３に記録される。教師あり学習の代わりに、半教師あり学習が用いられてもよい。 The learning model 53 may be generated using any computer. The generated learning model 53 is transmitted to the information processing device 20 via a network or the like and recorded in the auxiliary storage device 23. Semi-supervised learning may be used instead of supervised learning.

図１０は、報告書６０の例を説明する説明図である。報告書６０は、報告書ＤＢ５６のレコードに記録された情報、および、電子カルテに記録された情報を、ユーザが閲覧しやすい形式に整形して作成される。報告書６０は、書誌事項欄６１、コメント欄６２、非同義体細胞変異欄６３、生殖細胞変異欄６４および解析欄６５を含む。 FIG. 10 is an explanatory diagram illustrating an example of Report 60. The report 60 is created by formatting the information recorded in the record of the report DB 56 and the information recorded in the electronic medical record into a format that is easy for the user to view. The report 60 includes a journal item column 61, a comment column 62, a non-synonymous cell mutation column 63, a germline mutation column 64, and an analysis column 65.

書誌事項欄６１は、ＩＤ欄６１１、患者情報欄６１２、検体欄６１３、病理組織診断欄６１４および検体番号欄６１５を含む。ＩＤ欄６１１には、患者に固有に付与された患者ＩＤが表示される。患者情報欄６１２には、患者の性別および年齢が表示される。なお、患者情報欄６１２は、表示されなくてもよい。 The bibliographic item column 61 includes an ID column 611, a patient information column 612, a sample column 613, a histopathological diagnosis column 614, and a sample number column 615. In the ID column 611, a patient ID uniquely assigned to the patient is displayed. In the patient information column 612, the gender and age of the patient are displayed. The patient information column 612 may not be displayed.

検体欄６１３には、ゲノム解析に用いた正常部検体および腫瘍部検体が表示される。図１０において「ＦＦＰＥ（Formalin Fixed Paraffin Embedded）肺」は、ホルマリン固定パラフィン包埋を行なった肺組織であることを意味する。 In the sample column 613, the normal part sample and the tumor part sample used for the genome analysis are displayed. In FIG. 10, “FFPE (Formalin Fixed Paraffin Embedded) lung” means that the lung tissue is embedded with formalin-fixed paraffin.

病理組織診断欄６１４には、検体を顕微鏡で観察する病理診断による所見が表示される。検体番号欄６１５には、検体に固有に付与された検体番号が表示される。書誌事項欄６１に表示される情報は、図８を使用して説明した報告書レコードの検体ＩＤをキーとして電子カルテシステムから取得される。 In the histopathological diagnosis column 614, findings by pathological diagnosis by observing the sample with a microscope are displayed. In the sample number column 615, a sample number uniquely assigned to the sample is displayed. The information displayed in the bibliographic item column 61 is acquired from the electronic medical record system using the sample ID of the report record described with reference to FIG. 8 as a key.

図１１は、コメント欄６２の例を説明する説明図である。図１１Ａから図１１Ｃは、それぞれ異なる報告書に表示されるコメント欄６２の例を示す。図１１Ａは、「Pathologic」すなわち病原性を有することが確実な生殖細胞変異が発見された検体に関する報告書のコメント欄６２を示す。病原性を有する生殖細胞変異が生じた遺伝子および変異位置と、その根拠、ならびに生殖細胞変異に関する今後の対応についてのアドバイスが表示される。 FIG. 11 is an explanatory diagram illustrating an example of the comment column 62. 11A to 11C show examples of comment fields 62 displayed in different reports. FIG. 11A shows comment section 62 of the report on "Pathologic" or germline mutations found to be pathogenic. It provides advice on the genes and locations of pathogenic germline mutations, their rationale, and future actions regarding germline mutations.

図１１Ｂは、腫瘍含有量が低い、すなわち腫瘍部検体の質に問題がある可能性がある検体に関する報告書のコメントの例を示す。図１１Ｃは、腫瘍部検体にがん化変異が発見された検体に関するコメントの例を示す。がん化に関連する体細胞変異が生じた遺伝子と、その遺伝子に関連する臨床試験についての情報が表示される。 FIG. 11B shows an example of a report comment on a specimen having a low tumor content, i.e., a specimen that may have a problem with the quality of the tumor specimen. FIG. 11C shows an example of comments regarding a sample in which a carcinogenic mutation was found in a tumor part sample. Information about genes with somatic mutations associated with carcinogenesis and clinical trials associated with those genes is displayed.

コメント欄６２に表示される文章は、報告書ＤＢ５６の診断フィールドに記録された情報に基づいて、公知の手法により定型文を組み合わせて作成される。検体に生じている複数の遺伝子変異うち、病原性または発がん性が高い遺伝子変異に関連する定型文を選択して表示することにより、遺伝子検査に関する知識が少ない臨床医であっても重要性の高い情報を速やかに把握できる。 The text displayed in the comment field 62 is created by combining fixed phrases by a known method based on the information recorded in the diagnostic field of the report DB 56. By selecting and displaying a fixed phrase related to a gene mutation that is highly pathogenic or carcinogenic among multiple gene mutations occurring in a sample, it is highly important even for clinicians with little knowledge of genetic testing. Information can be grasped quickly.

図１２は、非同義体細胞変異欄６３の例を説明する説明図である。図１２においては、図８に例示した報告書レコード中の非同義体細胞変異フィールドに基づいて表示される非同義体細胞変異欄６３の例を示す。 FIG. 12 is an explanatory diagram illustrating an example of a non-synonymous cell mutation column 63. FIG. 12 shows an example of a non-synonymous cell mutation column 63 displayed based on a non-synonymous cell mutation field in the report record exemplified in FIG.

非同義体細胞変異欄６３は、遺伝子欄６３１、サイトバンド欄６３２、ＤＮＡ変異欄６３３、アミノ酸変異欄６３４、アリル頻度欄６３５および知識データ欄６３６を含む。遺伝子欄６３１、ＤＮＡ変異欄６３３および知識データ欄６３６には、それぞれ非同義体細胞変異フィールドに記録された情報が表示される。 The non-synonymous cell mutation column 63 includes a gene column 631, a cytoband column 632, a DNA mutation column 633, an amino acid mutation column 634, an allyl frequency column 635, and a knowledge data column 636. The information recorded in the non-synonymous cell mutation field is displayed in the gene column 631, the DNA mutation column 633, and the knowledge data column 636, respectively.

サイトバンド欄６３２には、染色体上の遺伝子の位置が表示される。アミノ酸変異欄６３４には、ＤＮＡ変異に起因するアミノ酸の変異が表示される。アリル頻度欄６３５には、たとえばＢＡＭファイルまたはＳＡＭファイルに記録されたアリル頻度、または、ＢＡＭファイルまたはＳＡＭファイルに記録されたデータから算出されたアリル頻度が表示される。 The position of the gene on the chromosome is displayed in the cytoband column 632. In the amino acid mutation column 634, mutations of amino acids caused by DNA mutations are displayed. In the allyl frequency column 635, for example, the allyl frequency recorded in the BAM file or the SAM file, or the allyl frequency calculated from the data recorded in the BAM file or the SAM file is displayed.

非同義体細胞変異欄６３の上部には、非同義体細胞変異欄６３に記載していない体細胞変異も含めた総体細胞変異数および総体細胞変異頻度が表示される。総体細胞変異数および総体細胞変異頻度は、ＶＣＦ形式のファイルから取得できる。 At the upper part of the non-synonymous cell mutation column 63, the total number of somatic cell mutations including somatic mutations not described in the non-synonymous cell mutation column 63 and the total cell mutation frequency are displayed. The number of total cell mutations and the frequency of total cell mutations can be obtained from a VCF format file.

図１３は、生殖細胞変異欄６４の例を説明する説明図である。図１３においては、図８に例示した報告書レコード中の生殖細胞変異フィールドに基づいて表示される生殖細胞変異欄６４の例を示す。 FIG. 13 is an explanatory diagram illustrating an example of a germline mutation column 64. FIG. 13 shows an example of a germline mutation column 64 displayed based on a germline mutation field in the report record exemplified in FIG.

生殖細胞変異欄６４は、遺伝子欄６４１、サイトバンド欄６４２、ＤＮＡ変異欄６４３、アミノ酸変異欄６４４、正常部アリル頻度欄６４７、腫瘍部アリル頻度欄６４８および知識データ欄６４５を含む。遺伝子欄６４１、ＤＮＡ変異欄６４３および知識データ欄６４５には、それぞれ生殖細胞変異フィールドに記録された情報が表示される。 The germline mutation column 64 includes a gene column 641, a cytoband column 642, a DNA mutation column 643, an amino acid mutation column 644, a normal part allele frequency column 647, a tumor part allyl frequency column 648, and a knowledge data column 645. The information recorded in the germline mutation field is displayed in the gene column 641, the DNA mutation column 643, and the knowledge data column 645, respectively.

サイトバンド欄６４２には、染色体上の遺伝子の位置が表示される。アミノ酸変異欄６４４には、ＤＮＡ変異に起因するアミノ酸の変異が記録される。正常部アリル頻度欄６４７には、たとえばＢＡＭ形式またはＳＡＭ形式のファイルに記録された正常部のアリル頻度が表示される。腫瘍部アリル頻度欄６４８には、たとえばＢＡＭ形式またはＳＡＭ形式のファイルに記録された腫瘍部のアリル頻度が表示される。 The position of the gene on the chromosome is displayed in the sight band column 642. In the amino acid mutation column 644, mutations of amino acids caused by DNA mutations are recorded. In the normal part allyl frequency column 647, for example, the allyl frequency of the normal part recorded in a file of BAM format or SAM format is displayed. In the tumor part allele frequency column 648, for example, the allele frequency of the tumor part recorded in a BAM format or SAM format file is displayed.

図１４は、解析欄６５の例を説明する説明図である。解析欄６５は、推定腫瘍含有量欄６５１および変異頻度相関係数欄６５２を含む。推定腫瘍含有量欄６５１には、学習モデル５３の出力に基づく推定腫瘍含有量が表示される。 FIG. 14 is an explanatory diagram illustrating an example of the analysis column 65. The analysis column 65 includes an estimated tumor content column 651 and a mutation frequency correlation coefficient column 652. In the estimated tumor content column 651, the estimated tumor content based on the output of the learning model 53 is displayed.

変異頻度相関係数欄６５２には、正常部から採取した検体中の遺伝子変異頻度と、腫瘍部から採取した検体中の遺伝子変異頻度との相関係数が表示される。相関係数が高い場合には、正常部と異常部とで、同一の塩基が変異している場合が多く、同一患者由来の検体であると判定される。相関係数が閾値よりも低い場合には、検体の取り違え、または、コンタミネーション等の発生が疑われる。 In the mutation frequency correlation coefficient column 652, the correlation coefficient between the gene mutation frequency in the sample collected from the normal part and the gene mutation frequency in the sample collected from the tumor part is displayed. When the correlation coefficient is high, the same base is often mutated in the normal part and the abnormal part, and it is determined that the sample is derived from the same patient. If the correlation coefficient is lower than the threshold value, it is suspected that the sample is mistaken or contamination occurs.

変異頻度相関係数欄６５２は表示されなくても良い。たとえば、正常部検体を使用せずに解析を行なう場合には、変異頻度相関係数欄６５２は不要である。 The mutation frequency correlation coefficient column 652 may not be displayed. For example, when the analysis is performed without using the normal part sample, the mutation frequency correlation coefficient column 652 is unnecessary.

ユーザが、図１０から図１４を使用して説明した各欄をたとえば右クリック等により選択した場合、制御部２１は、報告書レコードの根拠情報フィールドに記録された情報を表示する。制御部２１は、根拠情報フィールドに基づいて根拠情報へのリンクを表示するか、根拠情報自体を表示しても良い。ユーザは、報告書６０の記載の根拠を閲覧することにより、報告書の信頼性を確認できる。 When the user selects each column described with reference to FIGS. 10 to 14 by, for example, right-clicking, the control unit 21 displays the information recorded in the basis information field of the report record. The control unit 21 may display a link to the rationale information based on the rationale information field, or may display the rationale information itself. The user can confirm the reliability of the report by viewing the basis of the description in the report 60.

報告書６０には、レビューを実施したエキスパートパネルの連絡先等が、表示されても良い。ユーザは、報告書６０に基づいてエキスパートパネルへの質問、相談等を行なえる。 In the report 60, the contact information of the expert panel that carried out the review may be displayed. The user can ask questions, consult, etc. to the expert panel based on the report 60.

報告書は、検体に行なった前処理、読取装置３１が塩基配列を読み取ったリード数、または、参照配列へのマッピング深度等の情報を含んでも良い。遺伝子検査に詳しい臨床医であれば、これらの情報に基づいて報告書の信頼度を判断できる。 The report may include information such as pretreatment performed on the sample, the number of reads read by the reading device 31 on the base sequence, or the mapping depth to the reference sequence. A clinician familiar with genetic testing can use this information to determine the reliability of a report.

図１５は、プログラムの処理の流れを説明するフローチャートである。制御部２１は、報告書作成要求に基づいてデータサーバ３２からゲノムデータを取得する（ステップＳ５０１）。制御部２１は、報告書案ＤＢ５５に新規レコードを作成し、検体ＩＤフィールド、検体フィールドおよびゲノムデータフィールドにそれぞれデータを記録する（ステップＳ５０２）。 FIG. 15 is a flowchart illustrating a process flow of the program. The control unit 21 acquires genomic data from the data server 32 based on the report creation request (step S501). The control unit 21 creates a new record in the draft report DB 55 and records the data in the sample ID field, the sample field, and the genome data field, respectively (step S502).

制御部２１は、取得したゲノムデータを学習モデル５３に入力して、出力層５３３の各ノードの予測確率を取得する（ステップＳ５０３）。制御部２１は、出力層５３３の遺伝子変異にかかるノードから所定の閾値以上の確率が出力された遺伝子変異を抽出する（ステップＳ５０４）。閾値は、遺伝子変異ごとに異なる値であっても、一定の値であっても良い。 The control unit 21 inputs the acquired genomic data into the learning model 53 and acquires the prediction probability of each node of the output layer 533 (step S503). The control unit 21 extracts a gene mutation for which a probability equal to or higher than a predetermined threshold is output from a node involved in the gene mutation in the output layer 533 (step S504). The threshold value may be a different value for each gene mutation or a constant value.

制御部２１は、出力層５３３の腫瘍含有量にかかるノードのうちの、最も確率が高いノードに基づいて、検体中の腫瘍含量を判定する（ステップＳ５０５）。制御部２１は、ステップＳ５０２で作成した報告書案レコードの非同義体細胞変異フィールドまたは生殖細胞変異フィールドの診断データフィールドに、ステップＳ５０４で抽出した変異を、腫瘍含有量フィールドにステップＳ５０５で判定した腫瘍含有量をそれぞれ記録する（ステップＳ５０６）。 The control unit 21 determines the tumor content in the sample based on the node having the highest probability among the nodes related to the tumor content of the output layer 533 (step S505). The control unit 21 puts the mutation extracted in step S504 in the diagnostic data field of the non-synonymous cell mutation field or germline mutation field of the draft report record prepared in step S502, and determines the tumor in the tumor content field in step S505. Each content is recorded (step S506).

なお、腫瘍含有量は、図１５に示すプログラムとは別の独立したプログラムにより算出されてもよい。そのようにする場合には、ステップＳ５０５は不要である。 The tumor content may be calculated by an independent program different from the program shown in FIG. In that case, step S505 is unnecessary.

制御部２１は、報告書案レコードに記録された検体の採取部位と遺伝子変異とをキーとして統合ＤＢ５２を検索し、抽出されたレコードの知識データフィールドから知識データを取得する（ステップＳ５０７）。制御部２１は、報告書レコードに取得した知識データを記録する（ステップＳ５０８）。 The control unit 21 searches the integrated DB 52 using the sample collection site and the gene mutation recorded in the draft report record as keys, and acquires knowledge data from the knowledge data field of the extracted record (step S507). The control unit 21 records the acquired knowledge data in the report record (step S508).

制御部２１は、報告書案レコードに記録されたすべての遺伝子変異の処理を終了したか否かを判定する（ステップＳ５０９）。終了していないと判定した場合（ステップＳ５０９でＮＯ）、制御部２１はステップＳ５０７に戻る。終了したと判定した場合（ステップＳ５０９でＹＥＳ）、制御部２１は報告書レコードに基づいて図１０を使用して説明した報告書６０の案を作成し、補助記憶装置２３またはデータサーバ３２に記録する（ステップＳ５１０）。 The control unit 21 determines whether or not the processing of all the gene mutations recorded in the draft report record has been completed (step S509). If it is determined that the process has not been completed (NO in step S509), the control unit 21 returns to step S507. When it is determined that the report has been completed (YES in step S509), the control unit 21 creates a draft report 60 described with reference to FIG. 10 based on the report record, and records it in the auxiliary storage device 23 or the data server 32. (Step S510).

エキスパートパネルのメンバーである専門家は、定期的または不定期に開催されるエキスパート会議において報告書６０の案をレビューし、必要に応じて修正する。エキスパート会議は、専門家が実際に１室に集合して行なわれても、テレビ会議または電話会議等で行なわれても良い。エキスパート会議は、チャットシステム等を用いた電子会議で行なわれても良い。 Experts who are members of the Expert Panel will review the draft Report 60 at regular or irregular expert meetings and revise it as necessary. The expert meeting may be held by the experts actually gathering in one room, or may be held by a video conference, a telephone conference, or the like. The expert meeting may be held as an electronic meeting using a chat system or the like.

エキスパートパネルは、必要に応じてＦＡＳＴＱ形式、ＢＡＭ形式、ＶＣＦ形式等のゲノムデータを参照する。エキスパートパネルは、病理検査時に撮影された顕微鏡写真等を参照しても良い。エキスパートパネルは病理検査を担当した病理医、または、患者を担当する臨床医から情報収集しても良い。 The expert panel refers to genomic data such as FASTQ format, BAM format, and VCF format as needed. The expert panel may refer to a micrograph or the like taken at the time of pathological examination. The expert panel may collect information from the pathologist in charge of the pathological examination or the clinician in charge of the patient.

制御部２１は、エキスパート会議で決定された修正を受け付ける（ステップＳ５１１）。制御部２１は、報告書案レコードに記録された情報を修正した報告書レコードを報告書ＤＢ５６に記録する（ステップＳ５１２）。制御部２１は、報告書レコードのエキスパートＩＤフィールドに、レビューを行なった専門家に固有に付与されたエキスパートＩＤを記録する。制御部２１は処理を終了する。 The control unit 21 receives the correction decided at the expert meeting (step S511). The control unit 21 records in the report DB 56 a report record in which the information recorded in the draft report record is modified (step S512). The control unit 21 records an expert ID uniquely assigned to the expert who performed the review in the expert ID field of the report record. The control unit 21 ends the process.

制御部２１は、メールその他任意の手段を用いて、臨床医に対して報告書が作成されたことを通知してもよい。制御部２１は、電子カルテシステムに報告書をアップロードしても良い。制御部は、臨床医がゲノム解析システム１０にログインした場合に、新規報告書があることを通知しても良い。 The control unit 21 may notify the clinician that the report has been prepared by e-mail or any other means. The control unit 21 may upload the report to the electronic medical record system. The control unit may notify that there is a new report when the clinician logs in to the genome analysis system 10.

制御部２１は、図１５を使用して説明したプログラムの開始時に、報告書６０を作成する統合ＤＢ５２の日付の指定を受け付けても良い。日付の指定を受け付けた場合、制御部２１はステップＳ５０７において指定した日付における最新の統合ＤＢ５２を使用して、知識データを取得する。ステップＳ５１０において、制御部２１は、指定された日付における最新情報に基づく報告書案を記録する。 The control unit 21 may accept the designation of the date of the integrated DB 52 that creates the report 60 at the start of the program described with reference to FIG. When the date designation is accepted, the control unit 21 acquires the knowledge data by using the latest integrated DB 52 on the date designated in step S507. In step S510, the control unit 21 records a draft report based on the latest information on the designated date.

たとえば、過去に判断された治療方針等の妥当性を検証する場合、その医療行為が行なわれた日付を指定して図１５を使用して説明したプログラムを実行することにより、その日付における最新情報に基づく報告書案を作成できる。 For example, when verifying the validity of a treatment policy determined in the past, the latest information on that date can be obtained by specifying the date on which the medical practice was performed and executing the program described using FIG. Can prepare a draft report based on.

報告書ＤＢ５６に記録された情報、治療後の情報、および、投薬後の情報等に基づいて、教師データＤＢ５１にデータを追加して、学習モデル５３の再学習を行なっても良い。専門家によるレビューが行なわれたデータを教師データに追加することにより、学習モデル５３の精度を高めることができる。 Data may be added to the teacher data DB 51 based on the information recorded in the report DB 56, the post-treatment information, the post-medication information, and the like to relearn the learning model 53. The accuracy of the learning model 53 can be improved by adding the data reviewed by the experts to the teacher data.

本実施の形態によると、検体から読み取られた塩基配列に基づいて、臨床上重要な変異の自動抽出を行なう学習モデル５３を提供できる。学習モデル５３を使用することにより、遺伝子検査に関する高度な専門知識を有さない医師であっても、臨床上重要な遺伝子変異の有無を判断できる。 According to this embodiment, it is possible to provide a learning model 53 that automatically extracts clinically important mutations based on the base sequence read from the sample. By using the learning model 53, even a doctor who does not have a high degree of expertise in genetic testing can determine the presence or absence of a clinically important gene mutation.

本実施の形態によると、統合ＤＢ５２を使用することにより遺伝子変異に関する医学情報をユーザに提示するゲノム解析システム１０を提供できる。遺伝子検査の分野は研究スピードが速く、頻繁に新たな知見が発表されるため、個々の医師が常に最新情報を把握することは困難である。統合ＤＢ５２に基づいて、医学情報を提供されるとともに、その根拠も提示されるため、医師は必要に応じて根拠を確認して、患者に対して適切な医療を提供できる。 According to this embodiment, it is possible to provide a genome analysis system 10 that presents medical information on gene mutations to a user by using the integrated DB 52. In the field of genetic testing, research speed is fast and new findings are frequently announced, so it is difficult for individual doctors to keep up to date with the latest information. Since medical information is provided and the rationale is also presented based on the integrated DB 52, the doctor can confirm the rationale as necessary and provide appropriate medical care to the patient.

報告書案をエキスパートパネルでレビューして、エキスパートパネルによる修正を反映することにより、信頼性の高い報告書６０を作成するゲノム解析システム１０を提供できる。エキスパートパネルがレビューを行なうことにより、教師データＤＢ５１に含まれていない新しい情報に基づいて報告書６０を作成できる。 By reviewing the draft report on the expert panel and reflecting the corrections made by the expert panel, it is possible to provide the genome analysis system 10 for producing a highly reliable report 60. By conducting a review by the expert panel, the report 60 can be created based on new information not included in the teacher data DB 51.

臨床医が、遺伝子検査に関する専門知識を有する場合には、エキスパートパネルによるレビューを省略して、報告書案をそのまま報告書６０に使用しても良い。患者本人または臨床医が報告書案およびゲノムデータを取得し、自ら選択した専門医に意見を求めても良い。 If the clinician has expertise in genetic testing, the draft report may be used as is for Report 60, omitting the review by the expert panel. The patient or clinician may obtain the draft report and genomic data and seek the opinion of a specialist of his choice.

［実施の形態２］
本実施の形態は、ＤＮＡに加えてＲＮＡの塩基配列の解析も行なうゲノム解析システム１０に関する。実施の形態１と共通する部分については、説明を省略する。 [Embodiment 2]
The present embodiment relates to a genome analysis system 10 that analyzes the base sequence of RNA in addition to DNA. The description of the parts common to the first embodiment will be omitted.

本実施の形態においては、腫瘍部から採取された検体は３つに分けられる。１つは病理検査に、１つはＤＮＡの解析に使用される。最後の１つは、前処理にてＲＮＡが抽出されて、読取装置３１によりＲＮＡの塩基配列が読み取られ、ＤＮＡと同様の手法により解析される。 In the present embodiment, the sample collected from the tumor portion is divided into three. One is used for pathological examination and one is used for DNA analysis. In the last one, RNA is extracted by pretreatment, the base sequence of RNA is read by the reading device 31, and the RNA is analyzed by the same method as DNA.

ＲＮＡを解析することにより、腫瘍部で発現している遺伝子異常に関する情報を得ることができる。腫瘍部で発現している遺伝子異常は、たとえば複数のＤＮＡが転座または遺伝子再構成により融合した融合遺伝子、または、ＤＮＡがＲＮＡに転写される際に、一部が脱落するエクソンスキッピングである。本実施の形態の報告書６０には、たとえば非同義体細胞変異欄６３と生殖細胞変異欄６４との間に、ＲＮＡを解析して得た情報を表示するＲＮＡ欄６６が表示される。 By analyzing RNA, information on genetic abnormalities expressed in tumors can be obtained. The gene abnormality expressed in the tumor site is, for example, a fusion gene in which a plurality of DNAs are fused by translocation or gene rearrangement, or exon skipping in which a part of the DNA is dropped when it is transcribed into RNA. In the report 60 of this embodiment, for example, an RNA column 66 displaying information obtained by analyzing RNA is displayed between the non-synonymous cell mutation column 63 and the germline mutation column 64.

図１６は、ＲＮＡ欄６６の例を説明する説明図である。図１６Ａと図１６Ｂとは、それぞれ異なる報告書に表示されるＲＮＡ欄６６の例を示す。図１６Ａは、ＲＮＡに異常が発見されない検体に関するＲＮＡ欄６６の例を示す。図１６Ｂは、融合遺伝子およびエクソンスキッピングが発見された検体に関するＲＮＡ欄６６の例を示す。 FIG. 16 is an explanatory diagram illustrating an example of RNA column 66. 16A and 16B show examples of RNA column 66 displayed in different reports. FIG. 16A shows an example of RNA column 66 for a sample in which no abnormality is found in RNA. FIG. 16B shows an example of RNA column 66 for a sample in which a fusion gene and exon skipping were found.

図１６Ｂに示すＲＮＡ欄６６は、遺伝子欄６６１、変異欄６６７、サイトバンド欄６６２、リード数欄６６８および知識データ欄６６６を含む。遺伝子欄６６１には、ＲＮＡが転写された転写元の遺伝子が表示される。 The RNA column 66 shown in FIG. 16B includes a gene column 661, a mutation column 667, a sight band column 662, a read number column 668, and a knowledge data column 666. In the gene column 661, the gene from which the RNA was transcribed is displayed.

変異欄６６７には、ＲＮＡの変異が表示される。たとえば図１６Ｂの一番上の行には、ＰＡＸ３遺伝子とＦＯＸＯ１遺伝子との融合遺伝子が検出されたことが表示される。図１６Ｂの一番下の行には、ＭＥＴ遺伝子のエクソン１スキッピングが検出されたことが表示される。 Mutations in RNA are displayed in the mutation column 667. For example, the top row of FIG. 16B shows that a fusion gene of the PAX3 gene and the FOXO1 gene was detected. The bottom row of FIG. 16B shows that exon 1 skipping of the MET gene was detected.

サイトバンド欄６６２には、染色体上の遺伝子の位置が表示される。リード数欄６６８には、読取装置３１により読み取られたリードのうち、変異が検出されたリードの数および割合が表示される。リード数欄６６８に表示される情報は、ＦＡＳＴＱ形式のファイルから読み取られる。知識データ欄６６６には、統合ＤＢ５２から取得された情報が表示される。 The position of the gene on the chromosome is displayed in the sight band column 662. In the read number column 668, the number and ratio of the reads in which the mutation is detected among the reads read by the reading device 31 are displayed. The information displayed in the read number column 668 is read from the FASTQ format file. The information acquired from the integrated DB 52 is displayed in the knowledge data column 666.

本実施の形態によると、腫瘍で発現している遺伝子の異常を検出して、報告書６０に表示するゲノム解析システム１０を提供できる。 According to this embodiment, it is possible to provide a genome analysis system 10 that detects an abnormality in a gene expressed in a tumor and displays it in Report 60.

［実施の形態３］
本実施の形態は、統合ＤＢ５２が更新された場合に、過去に出力した報告書６０の変更点を示す追加報告書を出力するゲノム解析システム１０に関する。実施の形態１と共通する部分については、説明を省略する。 [Embodiment 3]
The present embodiment relates to a genome analysis system 10 that outputs an additional report indicating changes in the previously output report 60 when the integrated DB 52 is updated. The description of the parts common to the first embodiment will be omitted.

図１７は、変更履歴ＤＢのレコードレイアウトを説明する説明図である。変更履歴ＤＢは、統合ＤＢ５２に記録された遺伝子変異と、知識データが変更された変更日とを関連づけて記録するＤＢである。変更履歴ＤＢは、ゲノム変異フィールドおよび変更日フィールドを有する。 FIG. 17 is an explanatory diagram illustrating a record layout of the change history DB. The change history DB is a DB that records the gene mutation recorded in the integrated DB 52 in association with the change date in which the knowledge data is changed. The change history DB has a genome mutation field and a change date field.

ゲノム変異フィールドは、腫瘍部検体フィールド、遺伝子フィールドおよび変異内容フィールドを有する。変更日フィールドは、第１変更日フィールド、第２変更日フィールド等、任意の数のサブフィールドを有する。変更履歴ＤＢは、統合ＤＢ５２に記録された１つの医学情報について、１つのレコードを有する。 The genome mutation field has a tumor part sample field, a gene field, and a mutation content field. The change date field has an arbitrary number of subfields such as a first change date field and a second change date field. The change history DB has one record for one medical information recorded in the integrated DB 52.

腫瘍部検体フィールドには、検体が採取された部位が記録される。遺伝子フィールドには、変異が検出された遺伝子が記録される。なお、複数の変異の組合せに関する医学情報が記録されたレコードにおいては、遺伝子フィールドに複数の遺伝子が記録される。 In the tumor part sample field, the site where the sample was collected is recorded. In the gene field, the gene in which the mutation is detected is recorded. In the record in which medical information regarding a combination of a plurality of mutations is recorded, a plurality of genes are recorded in the gene field.

第１変更日フィールドには、ゲノム変異フィールドに記録された遺伝子変異に関するレコードが統合ＤＢ５２に記録された日付が記録される。第２変更日フィールド以降には、統合ＤＢ５２に記録された医学情報が変更された日付が記録される。 In the first modification date field, the date on which the record relating to the gene mutation recorded in the genome mutation field is recorded in the integrated DB 52 is recorded. After the second modification date field, the date on which the medical information recorded in the integrated DB 52 was modified is recorded.

図１８は、実施の形態３の報告書ＤＢ５６のレコードレイアウトを説明する説明図である。本実施の形態の報告書ＤＢ５６は、図８を使用して説明した実施の形態１の報告書ＤＢ５６に確認日フィールドが追加されている。確認日フィールドには、統合ＤＢ５２の更新状況を確認した日付が記録される。 FIG. 18 is an explanatory diagram illustrating a record layout of the report DB 56 of the third embodiment. The report DB 56 of the present embodiment has a confirmation date field added to the report DB 56 of the first embodiment described with reference to FIG. In the confirmation date field, the date on which the update status of the integrated DB 52 is confirmed is recorded.

図１９は、追加報告書を出力するプログラムの処理の流れを説明するフローチャートである。制御部２１は、報告書ＤＢ５６に記録された報告書レコードを取得する（ステップＳ５２１）。制御部２１は、正常部検体フィールドおよび腫瘍部検体フィールドに記録された、検体が採取された部位を取得する（ステップＳ５２２）。制御部２１は、確認日フィールドに記録された確認日を取得する（ステップＳ５２３）。 FIG. 19 is a flowchart illustrating a processing flow of a program that outputs an additional report. The control unit 21 acquires the report record recorded in the report DB 56 (step S521). The control unit 21 acquires the site from which the sample was collected recorded in the normal part sample field and the tumor part sample field (step S522). The control unit 21 acquires the confirmation date recorded in the confirmation date field (step S523).

制御部２１は、非同義体細胞変異フィールドまたは生殖細胞変異フィールドの遺伝子フィールドに記録された遺伝子変異を取得する（ステップＳ５２４）。制御部２１はステップＳ５２２で取得した検体が採取された部位およびステップＳ５２４で取得した遺伝子変異をキーとして変更履歴ＤＢを検索してレコードを抽出する。制御部２１は、抽出したレコードの変更日フィールドに記録された日付と、ステップＳ５２３で取得した確認日とを比較し、確認日以後に知識データが変更されたか否か判定する（ステップＳ５２５）。 The control unit 21 acquires the gene mutation recorded in the gene field of the non-synonymous cell mutation field or the germline mutation field (step S524). The control unit 21 searches the change history DB using the site where the sample acquired in step S522 was collected and the gene mutation acquired in step S524 as a key, and extracts a record. The control unit 21 compares the date recorded in the change date field of the extracted record with the confirmation date acquired in step S523, and determines whether or not the knowledge data has been changed after the confirmation date (step S525).

知識データが変更されていないと判定した場合（ステップＳ５２５でＮＯ）、制御部２１はステップＳ５２４に戻る。知識データが変更されたと判定した場合（ステップＳ５２５でＹＥＳ）、制御部２１はステップＳ５２２で取得した検体が採取された部位およびステップＳ５２４で取得した遺伝子変異をキーとして、最新の統合ＤＢ５２を検索してレコードを抽出する。制御部２１は、抽出したレコードから知識データを取得する（ステップＳ５２６）。 When it is determined that the knowledge data has not been changed (NO in step S525), the control unit 21 returns to step S524. When it is determined that the knowledge data has been changed (YES in step S525), the control unit 21 searches the latest integrated DB 52 using the site where the sample acquired in step S522 was collected and the gene mutation acquired in step S524 as keys. To extract records. The control unit 21 acquires knowledge data from the extracted records (step S526).

制御部２１は、報告書レコードの知識データフィールドに、ステップＳ５２６で取得した知識データを記録する（ステップＳ５２７）。制御部２１は報告書レコードのコピーを作成して、ステップＳ５２６で取得した知識データを記録しても良い。 The control unit 21 records the knowledge data acquired in step S526 in the knowledge data field of the report record (step S527). The control unit 21 may make a copy of the report record and record the knowledge data acquired in step S526.

制御部２１は、ステップＳ５２１で取得した報告書レコードに記録されたすべての変異の処理を終了したか否かを判定する（ステップＳ５２８）。終了していないと判定した場合（ステップＳ５２８でＮＯ）、制御部２１はステップＳ５２４に戻る。 The control unit 21 determines whether or not the processing of all the mutations recorded in the report record acquired in step S521 has been completed (step S528). If it is determined that the process has not been completed (NO in step S528), the control unit 21 returns to step S524.

終了したと判定した場合（ステップＳ５２８でＹＥＳ）、制御部２１はステップＳ５２５で知識データが変更されていると判定した遺伝子変異があるか否かを判定する（ステップＳ５２９）。あると判定した場合（ステップＳ５２９でＹＥＳ）、制御部２１は臨床医に対して、報告書が変更されたことを通知する（ステップＳ５３０）。通知は、たとえば電子メールまたはメッセンジャー等の、任意の手段により行なえる。 When it is determined that the process is completed (YES in step S528), the control unit 21 determines whether or not there is a gene mutation determined in step S525 that the knowledge data has been changed (step S529). If it is determined to be present (YES in step S529), the control unit 21 notifies the clinician that the report has been changed (step S530). Notifications can be made by any means, such as email or messenger.

制御部２１は、ステップＳ５３０においてエキスパートパネルに対して通知を行ない、レビュー結果に基づく修正を受け付けた後に、臨床医、または、病院に対する通知を行なっても良い。知識データが変更されていると判定した遺伝子変異がないと判定した場合（ステップＳ５２９でＮＯ）またはステップＳ５３０の終了後、制御部２１は処理を終了するか否かを判定する（ステップＳ５３１）。 The control unit 21 may notify the clinician or the hospital after notifying the expert panel in step S530 and accepting the correction based on the review result. When it is determined that there is no gene mutation determined that the knowledge data has been changed (NO in step S529) or after the end of step S530, the control unit 21 determines whether or not to end the process (step S531).

終了しないと判定した場合（ステップＳ５３１でＮＯ）、制御部２１はステップＳ５２１に戻る。終了すると判定した場合（ステップＳ５３１でＹＥＳ）、制御部２１は処理を終了する。 If it is determined that the process is not completed (NO in step S531), the control unit 21 returns to step S521. If it is determined to end (YES in step S531), the control unit 21 ends the process.

本実施の形態によると、過去に作成した報告書に関連する新たな医学情報が公開された場合に、追加報告書を出力するゲノム解析システム１０を提供できる。臨床医は、治療中の患者に対して効果が期待できる薬剤、治験または治療法等に関する追加情報を受け取り、治療方針に反映させることができる。 According to this embodiment, it is possible to provide a genome analysis system 10 that outputs an additional report when new medical information related to a report prepared in the past is released. The clinician can receive additional information about drugs, clinical trials, treatments, etc. that are expected to be effective for the patient being treated and reflect them in the treatment policy.

制御部２１は、追加情報を必要としない報告書６０の指定を受け付けても良い。臨床医は、治療を終了した患者に関する報告書６０等について追加報告書を必要としない旨を指定できる。制御部２１は、ステップＳ５２１において、追加情報を必要としない報告書を取得対象から外すことにより、必要とされない追加報告書の作成を回避する。 The control unit 21 may accept the designation of the report 60 that does not require additional information. The clinician can specify that no additional report is required for reports 60 and the like regarding patients who have completed treatment. In step S521, the control unit 21 excludes reports that do not require additional information from the acquisition target, thereby avoiding the creation of unnecessary additional reports.

［実施の形態４］
本実施の形態は、エキスパートパネルに参加した専門家に対してインセンティブを付与するゲノム解析システム１０に関する。実施の形態１と共通する部分については、説明を省略する。 [Embodiment 4]
The present embodiment relates to a genome analysis system 10 that provides incentives to experts who participate in the expert panel. The description of the parts common to the first embodiment will be omitted.

図２０は、専門家ＤＢのレコードレイアウトを説明する説明図である。専門家ＤＢは、エキスパートパネルに参加する専門家に固有に付与されたエキスパートＩＤと、専門分野と、ポイントとを関連づけて記録するＤＢである。 FIG. 20 is an explanatory diagram illustrating a record layout of the expert DB. The expert DB is a DB that records an expert ID uniquely assigned to an expert participating in the expert panel, a specialized field, and points in association with each other.

専門家ＤＢは、エキスパートＩＤフィールド、専門分野フィールドおよびポイントフィールドを有する。エキスパートＩＤフィールドには、エキスパートＩＤが記録される。専門分野フィールドには、専門家の専門分野が記録されている。ポイントフィールドには、専門家に付与されたポイントが記録されている。 The expert DB has an expert ID field, a specialty field, and a point field. The expert ID is recorded in the expert ID field. In the field of specialization, the field of specialization of the specialist is recorded. In the point field, the points given to the expert are recorded.

専門家は、エキスパートパネルに参加して報告書案のレビューを行なうごとに、ポイントを獲得できる。専門家は溜まったポイントをたとえば、金券、報告書６０の作成を依頼する際に利用できる報告書作成依頼券、または、学習モデル５３を利用した遺伝子解析を依頼する際に利用できる学習モデル利用券等と交換できる。ポイントにより、専門家に対してエキスパートパネルに参加するインセンティブを与えることができる。 Experts can earn points each time they participate in an expert panel and review a draft report. Experts can use the accumulated points, for example, a gold ticket, a report creation request ticket that can be used when requesting the creation of report 60, or a learning model usage ticket that can be used when requesting gene analysis using the learning model 53. Can be exchanged for etc. Points can give professionals an incentive to participate in the expert panel.

ポイントは、たとえば１回のレビューに５ポイントのように定められていても良い。エキスパートレビュー時の発言量または意見の内容に基づいて、たとえばエキスパートパネルのリーダが個々の専門家に付与するポイントを決定しても良い。エキスパートパネルへの参加頻度に基づいて、１回のレビューに付与されるポイントが定められても良い。 The points may be set as, for example, 5 points in one review. Based on the amount of remarks or the content of opinions at the time of expert review, for example, the leader of the expert panel may determine the points to be given to individual experts. The points to be awarded for one review may be determined based on the frequency of participation in the expert panel.

図２１は、エキスパートパネルへの参加者を選択する画面の例を説明する説明図である。図２１に示す画面は、エキスパートパネルの事務局担当者が使用するパソコン、タブレットまたはスマートフォン等の情報機器に表示される。事務局担当者が使用する情報機器は、ネットワークを介して情報処理装置２０に接続されている。 FIG. 21 is an explanatory diagram illustrating an example of a screen for selecting a participant to the expert panel. The screen shown in FIG. 21 is displayed on an information device such as a personal computer, tablet, or smartphone used by the secretariat staff of the expert panel. The information device used by the secretariat staff is connected to the information processing device 20 via a network.

エキスパートパネルへの参加者を選択する画面は、検体情報欄７４、絞込条件欄７５、再検索ボタン７６、候補リスト７７、確認ボタン７８および依頼送信ボタン７９を含む。検体情報欄７４には、エキスパートパネルでのレビューを行なう検体に関する情報が表示されている。 The screen for selecting participants in the expert panel includes a sample information field 74, a narrowing condition field 75, a re-search button 76, a candidate list 77, a confirmation button 78, and a request transmission button 79. In the sample information column 74, information regarding a sample to be reviewed by the expert panel is displayed.

絞込条件欄７５には、専門家の絞込を行なう際に使用する項目が表示されている。ユーザは、各項目の先頭に表示されているチェックボックスを選択することにより、絞込条件を選択できる。なお、絞込条件欄７５は、フリーキーワードを受け付ける欄を有しても良い。候補リスト７７には、エキスパートパネルに参加する専門家の候補リストが表示されている。 In the narrowing condition column 75, items to be used when narrowing down experts are displayed. The user can select the narrowing conditions by selecting the check box displayed at the beginning of each item. The narrowing-down condition column 75 may have a column for accepting free keywords. In the candidate list 77, a candidate list of experts participating in the expert panel is displayed.

ユーザは、絞込条件欄７５を使用して、所望の条件を設定して、再検索ボタン７６を選択する。設定された条件が、情報処理装置２０に送信される。制御部２１は、設定された条件に合う専門家を抽出して、ユーザの使用する情報機器に送信する。 The user sets a desired condition using the narrowing condition field 75 and selects the search button 76 again. The set conditions are transmitted to the information processing device 20. The control unit 21 extracts an expert who meets the set conditions and transmits the expert to the information device used by the user.

候補リスト７７に、設定された条件に合致する専門家のリストが表示される。ユーザは、候補リスト７７の右端に表示されたチェックボックスを使用して、エキスパートパネルへの参加を依頼する専門家を選択する。 The candidate list 77 displays a list of experts who meet the set conditions. The user uses the check box displayed at the right end of the candidate list 77 to select an expert to request participation in the expert panel.

候補リスト７７に表示される専門家の数が多すぎる場合、または、少なすぎる場合には、ユーザは絞込条件欄７５の設定を適宜変更して、再検索を行なう。ユーザが確認ボタン７８を選択した場合、選択された専門家の一覧が表示される。ユーザが依頼送信ボタン７９を選択した場合、選択された専門家の一覧が情報処理装置２０に送信される。 If the number of experts displayed in the candidate list 77 is too large or too small, the user appropriately changes the setting of the narrowing condition field 75 and performs a re-search. If the user selects the confirmation button 78, a list of selected experts is displayed. When the user selects the request transmission button 79, a list of selected experts is transmitted to the information processing device 20.

制御部２１は、検体ＩＤと、選択された専門家のエキスパートＩＤとを関連づけて、補助記憶装置２３に記憶する。制御部２１は、それぞれの専門家に対してＵＲＬ（Uniform Resource Locator）を記載した電子メールを送信する。 The control unit 21 associates the sample ID with the expert ID of the selected expert and stores it in the auxiliary storage device 23. The control unit 21 sends an e-mail containing a URL (Uniform Resource Locator) to each expert.

図２２は、エキスパートパネルへの参加依頼を確認する画面の例を説明する説明図である。図２２は、専門家がＵＲＬにより示されたＷＥＢサイトにアクセスした場合に、専門家の使用する情報機器に表示される画面である。 FIG. 22 is an explanatory diagram illustrating an example of a screen for confirming a request for participation in the expert panel. FIG. 22 is a screen displayed on the information device used by the expert when the expert accesses the WEB site indicated by the URL.

エキスパートパネルへの参加依頼を確認する画面は、依頼リスト７２および参加ボタン７１を含む。依頼リスト７２には、専門家に参加を依頼するエキスパートパネルのリストが表示されている。それぞれのエキスパートパネルについて、検体の採取部位、患者情報、報告書６０の作成を依頼した医療機関等の情報が表示されている。 The screen for confirming the participation request to the expert panel includes the request list 72 and the participation button 71. In the request list 72, a list of expert panels for asking experts to participate is displayed. For each expert panel, information such as a sample collection site, patient information, and a medical institution that requested the preparation of report 60 is displayed.

専門家は、依頼リスト７２を見て、参加を希望するエキスパートパネルについて参加ボタン７１を選択する。制御部２１は、参加ボタン７１を選択した専門家が参加する電子会議室を設定し、報告書案をアップロードする。参加者は、電子会議室上で報告書のレビューを行なう。あらかじめ指名されたリーダが結論をまとめて、電子会議室を終了させる。なお、電子会議システムは従来から広く使用されているため、制御部２１が行なう処理の詳細については説明を省略する。 The expert looks at the request list 72 and selects the join button 71 for the expert panel he wishes to join. The control unit 21 sets up an electronic conference room in which an expert who has selected the participation button 71 participates, and uploads a draft report. Participants will review the report in the electronic conference room. A pre-designated leader summarizes the conclusions and terminates the electronic conference room. Since the electronic conferencing system has been widely used in the past, the details of the processing performed by the control unit 21 will be omitted.

電子会議室の終了後、制御部２１はエキスパートパネルに参加した専門家にポイントを付与する。具体的には、制御部２１は、専門家ＤＢからエキスパートパネルに参加した専門家にかかるレコードを抽出し、ポイントフィールドにポイントを加算する。 After the end of the electronic conference room, the control unit 21 gives points to the experts who participated in the expert panel. Specifically, the control unit 21 extracts the record related to the expert who participated in the expert panel from the expert DB, and adds points to the point field.

図２３は、実施の形態４の修正受付のサブルーチンの処理の流れを説明するフローチャートである。修正受付のサブルーチンは、エキスパートパネルへの専門家の参加を受け付け、参加した専門家にポイントを付与するサブルーチンである。修正受付のサブルーチンは、図１５を使用して説明した実施の形態１のプログラムのステップＳ５１１の代わりに起動する。 FIG. 23 is a flowchart illustrating a processing flow of the subroutine of the modification reception of the fourth embodiment. The modification reception subroutine is a subroutine that accepts the participation of experts in the expert panel and gives points to the participating experts. The modification reception subroutine is activated instead of step S511 of the program of the first embodiment described with reference to FIG.

制御部２１は、専門家ＤＢに登録された専門家ごとに図２２を使用して説明したエキスパートパネル参加依頼画面を作成し、ＵＲＬを記載したメールを送信して、参加依頼を通知する（ステップＳ５４１）。 The control unit 21 creates an expert panel participation request screen described using FIG. 22 for each expert registered in the expert DB, sends an e-mail containing the URL, and notifies the participation request (step). S541).

制御部２１は、専門家ＤＢの専門分野フィールドに記録された専門分野に基づいて、どの専門家にどの報告書案のレビューを依頼するかを定めることができる。たとえば制御部２１は、呼吸器から腫瘍部検体が採取された症例、および、呼吸器科から依頼された症例に関するエキスパートパネルについては、専門分野フィールドに呼吸器が登録された専門家に参加依頼を通知する。 The control unit 21 can determine which expert is requested to review which report draft based on the specialized field recorded in the specialized field of the expert DB. For example, the control unit 21 asks an expert whose respiratory organs are registered in the specialized field to participate in the expert panel regarding the cases in which the tumor part sample is collected from the respiratory organs and the cases requested by the respiratory department. Notice.

制御部２１は、専門家ＤＢに登録された専門家をカテゴリごとに選択して、参加依頼を通知しても良い。制御部２１は、専門家ＤＢに登録された専門家全員に、参加依頼を通知しても良い。制御部２１は、専門家による参加ボタン７１の選択を受け付けることにより、エキスパートパネルへの参加を受け付ける（ステップＳ５４２）。制御部２１は、それぞれのエキスパートパネルへの参加者を登録した電子会議室を設定する（ステップＳ５４３）。制御部２１は、電子会議室へのアクセス情報を、それぞれの参加者に送信する。 The control unit 21 may select an expert registered in the expert DB for each category and notify the participation request. The control unit 21 may notify all the experts registered in the expert DB of the participation request. The control unit 21 accepts participation in the expert panel by accepting the selection of the participation button 71 by an expert (step S542). The control unit 21 sets up an electronic conference room in which participants to each expert panel are registered (step S543). The control unit 21 transmits access information to the electronic conference room to each participant.

制御部２１は、電子会議室に報告書案をアップロードし、参加者が閲覧できる状態にする（ステップＳ５４４）。参加者は、電子会議室を通じて他の参加者とのコミュニュケーションを行ない、報告書案をレビューする。 The control unit 21 uploads the draft report to the electronic conference room so that the participants can view it (step S544). Participants will communicate with other participants through the electronic conference room and review the draft report.

あらかじめ指名されたリーダが結論をまとめて、電子会議室を終了する操作を行なう。制御部２１は、終了操作を受け付ける（ステップＳ５４５）。制御部２１は、電子会議室を閉鎖する（ステップＳ５４６）。制御部２１は、専門家ＤＢからエキスパートパネルに参加した専門家にかかるレコードを抽出し、ポイントフィールドにポイントを加算する（ステップＳ５４７）。制御部２１は、処理を終了する。 A pre-designated leader summarizes the conclusions and performs the operation of terminating the electronic conference room. The control unit 21 accepts the end operation (step S545). The control unit 21 closes the electronic conference room (step S546). The control unit 21 extracts the record related to the expert who participated in the expert panel from the expert DB, and adds points to the point field (step S547). The control unit 21 ends the process.

本実施の形態によると、エキスパートパネルへの参加に対するインセンティブを与えるゲノム解析システム１０を提供できる。学習モデル利用料金および報告書作成料金等で得る収益を、ポイントにより専門家に分配することで、エキスパートパネルに参加する専門家を確保しやすいゲノム解析システム１０を提供できる。 According to this embodiment, it is possible to provide a genome analysis system 10 that provides an incentive for participation in an expert panel. By distributing the profits obtained from the learning model usage fee, the report preparation fee, and the like to the experts by points, it is possible to provide the genome analysis system 10 that makes it easy to secure the experts to participate in the expert panel.

それぞれのエキスパートパネルに参加するか否かを、専門家自身が決定できるため、意欲がある参加者を集められるゲノム解析システム１０を提供できる。電子会議室を用いてエキスパートレビューを行なうため、多忙な専門家であってもエキスパートパネルに参加しやすいゲノム解析システム１０を提供できる。 Since the experts themselves can decide whether or not to participate in each expert panel, it is possible to provide a genome analysis system 10 that can attract motivated participants. Since expert reviews are conducted using an electronic conference room, it is possible to provide a genome analysis system 10 that makes it easy for even busy experts to participate in the expert panel.

［実施の形態５］
本実施の形態は、統合ＤＢ５２に記録される情報のレビューを専門家に依頼するゲノム解析システム１０に関する。実施の形態４と共通する部分については、説明を省略する。 [Embodiment 5]
The present embodiment relates to a genome analysis system 10 that asks an expert to review the information recorded in the integrated DB 52. The description of the parts common to the fourth embodiment will be omitted.

図２４は、統合ＤＢレビュー参加依頼画面の例を説明する説明図である。制御部２１は、それぞれの専門家に対してＵＲＬを記載した電子メールを送信する。専門家がパソコンまたはスマートフォン等の情報機器を用いてＵＲＬにより示されたＷＥＢサイトにアクセスした場合に、図２４に示す統合ＤＢレビュー参加依頼画面が情報機器に表示される。 FIG. 24 is an explanatory diagram illustrating an example of an integrated DB review participation request screen. The control unit 21 sends an e-mail containing a URL to each expert. When an expert accesses the WEB site indicated by the URL using an information device such as a personal computer or a smartphone, the integrated DB review participation request screen shown in FIG. 24 is displayed on the information device.

統合ＤＢレビュー参加依頼画面は、依頼リスト７３および参加ボタン７１を含む。依頼リスト７３には、専門家にレビューを依頼する医学情報のリストが表示されている。それぞれの医学情報について、対象の遺伝子、ＤＮＡ変異および情報源が表示されている。統合ＤＢレビューの対象は、図２４のＮｏ．３に例示するように、特定の遺伝子変異に関係しない情報であっても良い。 The integrated DB review participation request screen includes a request list 73 and a participation button 71. The request list 73 displays a list of medical information for which an expert is requested to review. For each medical information, the gene, DNA mutation and source of interest are displayed. The target of the integrated DB review is No. 24 in FIG. As illustrated in 3, the information may not be related to a specific gene mutation.

専門家は、依頼リスト７３を見て自分の専門領域である薬剤、疾患または治験に関する医学情報であるか否かを判断できる。専門家は、レビューへの参加を希望する場合には、参加ボタン７１を選択する。制御部２１は、参加ボタン７１を選択した専門家が参加する電子会議室を設定し、報告書案をアップロードする。参加者は、電子会議室上で報告書のレビューを行なう。あらかじめ指名されたリーダが結論をまとめて、電子会議室を終了させる。 The expert can look at the request list 73 to determine if it is medical information about a drug, disease or clinical trial in his area of expertise. The expert selects the join button 71 if he wishes to participate in the review. The control unit 21 sets up an electronic conference room in which an expert who has selected the participation button 71 participates, and uploads a draft report. Participants will review the report in the electronic conference room. A pre-designated leader summarizes the conclusions and terminates the electronic conference room.

なお、レビューは１名の専門家が単独で実施しても良い。その場合には、電子会議室を使用しなくても良い。 The review may be conducted independently by one expert. In that case, it is not necessary to use the electronic conference room.

制御部２１は、レビュー結果に基づいて、統合ＤＢ５２への新規レコードの追加、または既存レコードの更新を実行する。 The control unit 21 adds a new record to the integrated DB 52 or updates an existing record based on the review result.

図２５は、統合ＤＢ５２を更新するプログラムの処理の流れを説明するフローチャートである。以下の説明では、情報処理装置２０が統合ＤＢ５２の更新を行なう場合を例にして説明する。統合ＤＢ５２の更新は情報処理装置２０以外の情報機器で実行されても良い。 FIG. 25 is a flowchart illustrating a processing flow of a program for updating the integrated DB 52. In the following description, a case where the information processing apparatus 20 updates the integrated DB 52 will be described as an example. The update of the integrated DB 52 may be executed by an information device other than the information processing device 20.

制御部２１は、様々な医学情報ＤＢ５８を巡回して、遺伝子変異に関する新たな医学情報を収集してデータベース化するクローリングを行なう（ステップＳ５５１）。クローリングは、クローラまたはロボットと呼ばれるプログラムにより実行される。クローリングは従来から広く行なわれているため、詳細については説明を省略する。 The control unit 21 patrols various medical information DB 58s, collects new medical information related to gene mutations, and creates a database for crawling (step S551). Crawling is performed by a program called a crawler or robot. Since crawling has been widely used in the past, detailed description thereof will be omitted.

制御部２１は、クローリングにより収集された医学情報を選択して、統合ＤＢ５２に既に記録されている遺伝子変異に関する情報であるか否かを判定する（ステップＳ５５２）。統合ＤＢ５２に記録されている遺伝子変異に関する情報であると判定した場合（ステップＳ５５２でＹＥＳ）、制御部２１は統合ＤＢ５２に記録されている情報と同一の内容であるか否かを判定する（ステップＳ５５３）。 The control unit 21 selects the medical information collected by crawling and determines whether or not the information is related to the gene mutation already recorded in the integrated DB 52 (step S552). When it is determined that the information is related to the gene mutation recorded in the integrated DB 52 (YES in step S552), the control unit 21 determines whether or not the information is the same as the information recorded in the integrated DB 52 (step). S553).

統合ＤＢ５２に記録されている遺伝子変異に関する情報ではないと判定した場合（ステップＳ５５２でＮＯ）、または、統合ＤＢ５２に記録されている情報と同一の内容ではないとト判定した場合（ステップＳ５５３でＮＯ）、制御部２１は、処理中の医学情報がレビュー対象である旨を記録する（ステップＳ５５４）。 When it is determined that the information is not related to the gene mutation recorded in the integrated DB 52 (NO in step S552), or when it is determined that the information is not the same as the information recorded in the integrated DB 52 (NO in step S553). ), The control unit 21 records that the medical information being processed is the subject of the review (step S554).

同一内容であると判定した場合（ステップＳ５５３でＹＥＳ）、またはステップＳ５５４の終了後、制御部２１はステップＳ５５１で収集した医学情報の処理を終了したか否かを判定する（ステップＳ５５５）。終了していないと判定した場合（ステップＳ５５５でＮＯ）、制御部２１はステップＳ５５２に戻る。 If it is determined that the contents are the same (YES in step S553), or after the end of step S554, the control unit 21 determines whether or not the processing of the medical information collected in step S551 is completed (step S555). If it is determined that the process has not been completed (NO in step S555), the control unit 21 returns to step S552.

終了したと判定した場合（ステップＳ５５５でＹＥＳ）、制御部２１は、専門家ＤＢに登録された専門家ごとに図２４を使用して説明した統合ＤＢレビュー参加依頼画面を作成し、ＵＲＬを記載したメールを送信して、参加依頼を通知する（ステップＳ５６１）。 When it is determined that the process has been completed (YES in step S555), the control unit 21 creates an integrated DB review participation request screen described using FIG. 24 for each expert registered in the expert DB, and describes the URL. Is sent to notify the participation request (step S561).

制御部２１は、専門家による参加ボタン７１の選択を受け付けることにより、レビューへの参加を受け付ける（ステップＳ５６２）。制御部２１は、それぞれのレビューへの参加者を登録した電子会議室を設定する（ステップＳ５６３）。制御部２１は、電子会議室へのアクセス情報を、それぞれの参加者に送信する。 The control unit 21 accepts participation in the review by accepting the selection of the participation button 71 by an expert (step S562). The control unit 21 sets up an electronic conference room in which participants for each review are registered (step S563). The control unit 21 transmits access information to the electronic conference room to each participant.

制御部２１は、電子会議室にクローリングにより収集した医学情報をアップロードし、参加者が閲覧できる状態にする（ステップＳ５６４）。参加者は、電子会議室を通じて他の参加者とのコミュニュケーションを行ない、医学情報をレビューする。 The control unit 21 uploads the medical information collected by crawling to the electronic conference room so that the participants can view it (step S564). Participants communicate with other participants through the electronic conference room and review medical information.

あらかじめ指名されたリーダが結論をまとめて、電子会議室を終了する操作を行なう。結論は、参加した専門家の多数決により決定されてもよい。制御部２１は、終了操作を受け付ける（ステップＳ５６５）。制御部２１は、電子会議室を閉鎖する（ステップＳ５６６）。制御部２１は、専門家ＤＢからレビューに参加した専門家にかかるレコードを抽出し、ポイントフィールドにポイントを加算する（ステップＳ５６７）。制御部２１は、それぞれの医学情報に関するレビュー結果に基づいて、統合ＤＢ５２を更新する（ステップＳ５６８）。制御部２１は、処理を終了する。 A pre-designated leader summarizes the conclusions and performs the operation of terminating the electronic conference room. The conclusion may be decided by a majority vote of the participating experts. The control unit 21 accepts the end operation (step S565). The control unit 21 closes the electronic conference room (step S566). The control unit 21 extracts the record related to the expert who participated in the review from the expert DB, and adds points to the point field (step S567). The control unit 21 updates the integrated DB 52 based on the review result regarding each medical information (step S568). The control unit 21 ends the process.

本実施の形態によると、統合ＤＢ５２に登録する情報をクローリングにより自動収集した後に、専門家によるレビューを経て統合ＤＢ５２を更新するゲノム解析システム１０を提供できる。クローリング技術を活用することにより、統合ＤＢ５２に新しい医学情報を適宜反映させるゲノム解析システム１０を提供できる。 According to this embodiment, it is possible to provide a genome analysis system 10 that automatically collects information registered in the integrated DB 52 by crawling and then updates the integrated DB 52 after a review by an expert. By utilizing the crawling technique, it is possible to provide a genome analysis system 10 that appropriately reflects new medical information in the integrated DB 52.

収集した医学情報を統合ＤＢ５２に登録する前に専門家によるレビューを実施することにより、統合ＤＢ５２の信頼度を保ち、正確な報告書６０を出力するゲノム解析システム１０を提供できる。 By conducting a review by an expert before registering the collected medical information in the integrated DB 52, it is possible to provide a genome analysis system 10 that maintains the reliability of the integrated DB 52 and outputs an accurate report 60.

学習モデル利用料金および報告書作成料金等で得る収益を、ポイントにより専門家に分配することで、レビューに参加する専門家を確保しやすいゲノム解析システム１０を提供できる。 By distributing the profits obtained from the learning model usage fee, the report preparation fee, etc. to the experts by points, it is possible to provide the genome analysis system 10 that makes it easy to secure the experts to participate in the review.

それぞれのレビューに参加するか否かを、専門家自身が決定できるため、意欲があるレビュー参加者を集められるゲノム解析システム１０を提供できる。電子会議室を用いてレビューを行なうため、多忙な専門家であってもレビューに参加しやすいゲノム解析システム１０を提供できる。 Since the expert himself can decide whether or not to participate in each review, it is possible to provide a genome analysis system 10 that can attract motivated review participants. Since the review is performed using the electronic conference room, it is possible to provide the genome analysis system 10 that makes it easy for even a busy expert to participate in the review.

［実施の形態６］
図２６は、ゲノムデータから臨床上意味のある遺伝子変異を予測する段階における情報処理装置２０の機能ブロック図である。情報処理装置２０は、ゲノムデータ取得部８１と、ゲノムデータ入力部８２と、出力部８３とを有する。 [Embodiment 6]
FIG. 26 is a functional block diagram of the information processing apparatus 20 at the stage of predicting a clinically meaningful gene mutation from genomic data. The information processing device 20 includes a genome data acquisition unit 81, a genome data input unit 82, and an output unit 83.

ゲノムデータ取得部８１は、検体に含まれる塩基配列を読み取ったゲノムデータを取得する。ゲノムデータ入力部８２は、ゲノムデータを受け付けて遺伝子変異に関する予測を出力する学習モデル５３に、ゲノムデータ取得部８１が取得したゲノムデータを入力する。出力部８３は、ゲノムデータ入力部８２により入力されたゲノムデータに基づいて学習モデル５３から出力された予測を出力する。 The genome data acquisition unit 81 acquires the genome data obtained by reading the base sequence contained in the sample. The genome data input unit 82 inputs the genome data acquired by the genome data acquisition unit 81 into the learning model 53 that receives the genome data and outputs the prediction regarding the gene mutation. The output unit 83 outputs the prediction output from the learning model 53 based on the genome data input by the genome data input unit 82.

図２７は、遺伝子変異と統合ＤＢ５２とに基づいて報告書を作成する段階における情報処理装置２０の機能ブロック図である。情報処理装置２０は、第１受付部８４と、第１出力部８５と、第２受付部８６と、第２出力部８７とを有する。 FIG. 27 is a functional block diagram of the information processing apparatus 20 at the stage of creating a report based on the gene mutation and the integrated DB 52. The information processing device 20 has a first reception unit 84, a first output unit 85, a second reception unit 86, and a second output unit 87.

第１受付部８４は、検体から検出された遺伝子変異を受け付ける。第１出力部８５は、第１受付部８４が受け付けた遺伝子変異と、複数の情報源から取得した遺伝子変異に関する医学情報、医学情報の取得日および根拠情報を関連づけて統合した統合ＤＢ５２とに基づいて、検体に関する解析結果と、統合ＤＢ５２のバージョンとを関連づけて記録した報告書を出力する。 The first reception unit 84 receives the gene mutation detected in the sample. The first output unit 85 is based on the gene mutation received by the first reception unit 84 and the integrated DB 52 that integrates medical information regarding the gene mutation acquired from a plurality of information sources, the acquisition date of the medical information, and the basis information. Then, the report that records the analysis result of the sample and the version of the integrated DB 52 in association with each other is output.

第２受付部８６は、過去の日付、当該日付における報告書出力要求、および、検体から検出された遺伝子変異を受け付ける。第２出力部８７は、第２受付部８６が受け付けた遺伝子変異と、当該日付における統合ＤＢ５２とに基づいて、検体に関する解析結果と、統合ＤＢ５２のバージョンとを関連づけて記録した報告書を出力する。 The second reception unit 86 receives a past date, a report output request on that date, and a gene mutation detected in a sample. The second output unit 87 outputs a report in which the analysis result regarding the sample and the version of the integrated DB 52 are recorded in association with each other based on the gene mutation received by the second reception unit 86 and the integrated DB 52 on the date. ..

［実施の形態７］
本実施の形態は、汎用のコンピュータ９０とプログラム９７とを組み合わせて動作させることにより、本実施の形態のゲノム解析システム１０を実現する形態に関する。図２８は、実施の形態７のゲノム解析システム１０の構成を説明する説明図である。実施の形態１と共通する部分については、説明を省略する。 [Embodiment 7]
The present embodiment relates to a mode in which the genome analysis system 10 of the present embodiment is realized by operating a general-purpose computer 90 and a program 97 in combination. FIG. 28 is an explanatory diagram illustrating the configuration of the genome analysis system 10 of the seventh embodiment. The description of the parts common to the first embodiment will be omitted.

本実施の形態のゲノム解析システム１０は、コンピュータ９０と、読取装置３１と、データサーバ３２とを含む。 The genome analysis system 10 of the present embodiment includes a computer 90, a reading device 31, and a data server 32.

コンピュータ９０は、制御部２１、主記憶装置２２、補助記憶装置２３、通信部２４、読取部２９およびバスを備える。コンピュータ９０は、汎用のパーソナルコンピュータ、タブレットまたはサーバコンピュータ等の情報機器である。 The computer 90 includes a control unit 21, a main storage device 22, an auxiliary storage device 23, a communication unit 24, a reading unit 29, and a bus. The computer 90 is an information device such as a general-purpose personal computer, a tablet, or a server computer.

プログラム９７は、可搬型記録媒体９６に記録されている。制御部２１は、読取部２９を介してプログラム９７を読み込み、補助記憶装置２３に保存する。また制御部２１は、コンピュータ９０内に実装されたフラッシュメモリ等の半導体メモリ９８に記憶されたプログラム９７を読出しても良い。さらに、制御部２１は、通信部２４および図示しないネットワークを介して接続される図示しない他のサーバコンピュータからプログラム９７をダウンロードして補助記憶装置２３に保存しても良い。 The program 97 is recorded on the portable recording medium 96. The control unit 21 reads the program 97 via the reading unit 29 and stores it in the auxiliary storage device 23. Further, the control unit 21 may read the program 97 stored in the semiconductor memory 98 such as the flash memory mounted in the computer 90. Further, the control unit 21 may download the program 97 from the communication unit 24 and another server computer (not shown) connected via a network (not shown) and store the program 97 in the auxiliary storage device 23.

プログラム９７は、コンピュータ９０の制御プログラムとしてインストールされ、主記憶装置２２にロードして実行される。これにより、コンピュータ９０は上述した情報処理装置２０として機能する。 The program 97 is installed as a control program of the computer 90, loaded into the main storage device 22, and executed. As a result, the computer 90 functions as the information processing device 20 described above.

各実施例で記載されている技術的特徴（構成要件）はお互いに組合せ可能であり、組み合わせすることにより、新しい技術的特徴を形成することができる。
今回開示された実施の形態はすべての点で例示であって、制限的なものではないと考えられるべきである。本発明の範囲は、上記した意味ではなく、特許請求の範囲によって示され、特許請求の範囲と均等の意味および範囲内でのすべての変更が含まれることが意図される。 The technical features (constituent requirements) described in each embodiment can be combined with each other, and by combining them, a new technical feature can be formed.
The embodiments disclosed this time should be considered to be exemplary in all respects and not restrictive. The scope of the present invention is indicated by the scope of claims, not the above-mentioned meaning, and is intended to include all modifications within the meaning and scope equivalent to the scope of claims.

１０ゲノム解析システム
２０情報処理装置
２１制御部
２２主記憶装置
２３補助記憶装置
２４通信部
２９読取部
３１読取装置
３２データサーバ
５１教師データＤＢ
５２統合ＤＢ
５３学習モデル
５３１入力層
５３２中間層
５３３出力層
５５報告書案ＤＢ
５６報告書ＤＢ
５８医学情報ＤＢ
６０報告書
６１書誌事項欄
６１１ＩＤ欄
６１２患者情報欄
６１３検体欄
６１４病理組織診断欄
６１５検体番号欄
６２コメント欄
６３非同義体細胞変異欄
６３１遺伝子欄
６３２サイトバンド欄
６３３ＤＮＡ変異欄
６３４アミノ酸変異欄
６３５アリル頻度欄
６３６知識データ欄
６４生殖細胞変異欄
６４１遺伝子欄
６４２サイトバンド欄
６４３ＤＮＡ変異欄
６４４アミノ酸変異欄
６４５知識データ欄
６４７正常部アリル頻度欄
６４８腫瘍部アリル頻度欄
６５解析欄
６５１推定腫瘍含有量欄
６５２変異頻度相関係数欄
６６ＲＮＡ欄
６６１遺伝子欄
６６２サイトバンド欄
６６６知識データ欄
６６７変異欄
６６８リード数欄
７１参加ボタン
７２依頼リスト
７３依頼リスト
７４検体情報欄
７５絞込条件欄
７６再検索ボタン
７７候補リスト
７８確認ボタン
７９依頼送信ボタン
８１ゲノムデータ取得部
８２ゲノムデータ入力部
８３出力部
８４第１受付部
８５第１出力部
８６第２受付部
８７第２出力部
９０コンピュータ
９６可搬型記録媒体
９７プログラム
９８半導体メモリ 10 Genome analysis system 20 Information processing device 21 Control unit 22 Main storage device 23 Auxiliary storage device 24 Communication unit 29 Reading unit 31 Reading device 32 Data server 51 Teacher data DB
52 Integrated DB
53 Learning model 531 Input layer 532 Intermediate layer 533 Output layer 55 Draft report DB
56 Report DB
58 Medical Information DB
60 Report 61 Journal matters column 611 ID column 612 Patient information column 613 Specimen column 614 Histopathological diagnosis column 615 Specimen number column 62 Comment column 63 Non-synonymous cell mutation column 631 Gene column 632 Site band column 633 DNA mutation column 634 Amino acid mutation Column 635 Allyl frequency column 636 Knowledge data column 64 Reproductive cell mutation column 641 Gene column 642 Site band column 643 DNA mutation column 644 Amino acid mutation column 645 Knowledge data column 647 Normal part Allyl frequency column 648 Tumor part Allyl frequency column 65 Analysis column 651 Estimated Tumor content column 652 Mutation frequency correlation coefficient column 66 RNA column 661 Gene column 662 Site band column 666 Knowledge data column 667 Mutation column 668 Read number column 71 Participation button 72 Request list 73 Request list 74 Specimen information column 75 Narrowing condition column 76 Re-search button 77 Candidate list 78 Confirmation button 79 Request send button 81 Genome data acquisition unit 82 Genome data input unit 83 Output unit 84 1st reception unit 85 1st output unit 86 2nd reception unit 87 2nd output unit 90 Computer 96 Portable recording medium 97 Program 98 Semiconductor memory

Claims

Obtain the analysis result of the sample including the gene mutation detected in the sample,
When the report output request is accepted
Extraction and gene mutation, a medical information on the gene mutation acquired from multiple sources, from the integrated DB acquired date and integrated in association with the basis information of the medical information, medical information acquired the gene mutation as a key And
Output a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When a report output request on a past date and that date is accepted
From the integrated DB in the date, extracts medical information acquired the gene mutation as a key,
Output a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When the integrated DB is updated by adding medical information about gene mutation,
Medical information is extracted from the updated integrated DB using the acquired gene mutation as a key.
A program that causes a computer to execute a process of outputting an additional report that records an analysis result of the sample, extracted medical information, and a version of the integrated DB in association with each other.

Obtain the analysis result of the sample including the genomic data obtained by reading the base sequence contained in the sample.
Input the acquired genomic data into a learning model that accepts genomic data and outputs predictions about gene mutations.
Based on the input genomic data, the prediction about the gene mutation output from the learning model is acquired, and the prediction is obtained.
Extraction and gene mutation, a medical information on the gene mutation acquired from multiple sources, before Symbol integrated DB that integrates in association with an acquisition date and the basis information of the medical information, medical information the acquired predicted as a key And
Output a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When the integrated DB is updated by adding medical information about gene mutation,
Medical information is extracted from the updated integrated DB using the acquired prediction as a key.
A program that causes a computer to execute a process of outputting an additional report that records an analysis result of the sample, extracted medical information, and a version of the integrated DB in association with each other.

Send a review request regarding the update of the integrated DB to an expert,
Accept the review results for the submitted review request,
Record incentives for accepted review results in association with the expert
The program according to claim 1 or 2 .

Send a review request for the report to an expert
Accept the review results for the submitted review request,
The program according to any one of claims 1 to 3 , which records an incentive for the received review result in association with the expert.

The incentive is a cash voucher, a report creation request voucher, or a learning model voucher.
The program according to claim 3 or 4 .

The incentive varies based on the review results.
The program according to any one of claims 3 to 5 .

The first reception section that receives the analysis results for the sample containing the gene mutation detected in the sample, and
When the report output request is accepted
A gene mutation in which the first receiving unit has received a medical information on the gene mutation acquired from multiple sources, from the integrated DB that integrates in association with an acquisition date and the basis information of the medical information, said gene obtained The first extraction unit that extracts medical information using mutation as a key,
A first output unit that outputs a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
The second reception department that accepts past dates and report output requests on that date,
From the integrated DB in the date, a second extractor for extracting medical information acquired the gene mutation as a key,
A second output unit that outputs a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other .
When the integrated DB is updated by adding medical information about gene mutation,
A third extraction unit that extracts medical information from the updated integrated DB using the acquired gene mutation as a key.
An information processing device including an additional output unit that outputs an additional report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other .

A first reception unit that accepts analysis results related to the sample, including genomic data obtained by reading the base sequence contained in the sample.
A prediction acquisition unit that inputs the received genome data into the learning model that receives the genome data and outputs the prediction about the gene mutation, and acquires the prediction about the gene mutation output from the learning model based on the input genome data. ,
Medical information is extracted using the acquired prediction as a key from the integrated DB that integrates the gene mutation, the medical information related to the gene mutation acquired from a plurality of information sources, and the acquisition date and the basis information of the medical information. The first extraction unit and
A first output unit that outputs a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When the integrated DB is updated by adding medical information about gene mutation,
A third extraction unit that extracts medical information from the updated integrated DB using the acquired prediction as a key.
An additional output unit that outputs an additional report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
Information processing device equipped with.

Obtain the analysis result of the sample including the gene mutation detected in the sample,
When the report output request is accepted
Extraction and gene mutation, a medical information on the gene mutation acquired from multiple sources, from the integrated DB acquired date and integrated in association with the basis information of the medical information, medical information acquired the gene mutation as a key And
Output a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When a report output request on a past date and that date is accepted
From the integrated DB in the date, extracts medical information acquired the gene mutation as a key,
Output a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When the integrated DB is updated by adding medical information about gene mutation,
Medical information is extracted from the updated integrated DB using the acquired gene mutation as a key.
An information processing method in which a computer executes a process of outputting an additional report recorded by associating an analysis result of the sample with the extracted medical information and a version of the integrated DB .

Obtain the analysis result of the sample including the genomic data obtained by reading the base sequence contained in the sample.
Input the acquired genomic data into a learning model that accepts genomic data and outputs predictions about gene mutations.
Based on the input genomic data, the prediction about the gene mutation output from the learning model is acquired, and the prediction is obtained.
Medical information is extracted using the acquired prediction as a key from the integrated DB that integrates the gene mutation, the medical information related to the gene mutation acquired from a plurality of information sources, and the acquisition date and the basis information of the medical information. ,
Output a report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
When the integrated DB is updated by adding medical information about gene mutation,
Medical information is extracted from the updated integrated DB using the acquired prediction as a key.
Output an additional report that records the analysis result of the sample, the extracted medical information, and the version of the integrated DB in association with each other.
An information processing method that causes a computer to perform processing.