KR20150085459A

KR20150085459A - Novel NTRK1 fusion gene as colon cancer marker and the uses thereof

Info

Publication number: KR20150085459A
Application number: KR1020140143631A
Authority: KR
Inventors: 박중훈; 신은지; 박옥구; 박도윤; 신나리; 최찬; 이재혁; 권채화
Original assignee: 주식회사 엘지생명과학
Priority date: 2014-01-14
Filing date: 2014-10-22
Publication date: 2015-07-23
Also published as: KR102267407B1

Abstract

The present invention relates to a fusion protein of lamin A (LMNA) or tropomyosin 3 (TPM3) and neurotrophic tyrosine kinase receptor type 1 (NTRK1), and a fusion gene polynucleotide encoding the same. The present invention also relates to a colon cancer diagnosis composition including the fusion protein, the fusion gene, or a formulation for measuring a transcription product of the fusion gene, to a kit for diagnosing colon cancer including the same, to a method for providing information needed for colon cancer diagnosis, which includes a step of measuring a level of the NTRK1 fusion gene, a transcription product thereof, or protein thereof, and to a method for providing information needed to make prognosis of colon cancer treatment. In addition, the present invention relates to a method for screening a colon cancer treating agent including a step of measuring an expression level of the NTRK1 fusion gene, and to a pharmaceutical composition including an NTRK1 fusion protein inhibiting agent as an active ingredient for preventing or treating colon cancer. According to the present invention, provided is a diagnosis marker specific to colon cancer, thus it is possible to accurately diagnose colon cancer, to predict of clinical prognosis by classifying colon cancer patients, to predict efficacy of colon cancer treatment, and especially to predict efficacy of colon cancer treatment by the NTRK1 protein inhibiting agent. Accordingly, administration of an agent to a colon cancer patient for whom administration of an agent is considered to be ineffective can be limited. Therefore, efficient and safe colon cancer treatment can be realized.

Description

A novel NTRK1 fusion gene as a colon cancer marker and its use {Novel NTRK1 fusion gene as colon cancer marker and the uses thereof}

LMNA(Lamin A) 또는 TPM3(Tropomyosin 3)와 NTRK1(Neurotrophic Tyrosine Kinase, Receptor, Type 1)와의 융합단백질 및 이를 암호화하는 융합유전자 폴리뉴클레오티드에 관한 것이다. 또한, 상기 융합단백질, 상기 융합유전자, 또는 상기 융합유전자의 전사산물을 측정하는 제제를 포함하는 대장암 진단용 조성물 및 이를 포함하는 대장암 진단용 키트에 관한 것이다. 아울러, NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계를 포함하는 대장암 진단에 필요한 정보를 제공하는 방법 및 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법에 관한 것이다. 또한, NTRK1 융합유전자의 발현 수준을 측정하는 단계를 포함하는 대장암 치료제의 스크리닝 방법 및 NTRK1 융합단백질 저해제를 유효성분으로 포함하는 대장암의 예방 또는 치료용 약제학적 조성물에 관한 것이다.
A fusion protein of LMNA (Lamin A) or TPM3 (Tropomyosin 3) with NTRK1 (Neurotrophic Tyrosine Kinase, Receptor, Type 1) and a fusion gene polynucleotide encoding the same. Further, the present invention relates to a composition for diagnosing colorectal cancer comprising the fusion protein, the fusion gene, or an agent for measuring the transcription product of the fusion gene, and a kit for diagnosing colorectal cancer comprising the same. In addition, the present invention relates to a method for providing information necessary for diagnosis of colon cancer, including the step of measuring the level of the NTRK1 fusion gene, its transcription product or its protein, and a method for providing information necessary for predicting colon cancer treatment prognosis. Further, the present invention relates to a method for screening a therapeutic agent for colorectal cancer, which comprises the step of measuring the expression level of the NTRK1 fusion gene, and a pharmaceutical composition for preventing or treating colorectal cancer comprising the NTRK1 fusion protein inhibitor as an active ingredient.

대장암은 암 사망에 있어서 제2-3위를 차지하는 암으로 전체 암의 약 13% 정도를 차지하고 있다. 미국의 경우, 연간 약 50,310명이 대장암으로 사망하고 136,830명의 새로운 대장암 환자가 발생한다고 보고되고 있다. 우리나라에서는 사망률에 있어서 전체 암에서 차지하는 비율이 대장암이 폐암, 간암, 위암에 이어 제4위를 차지하고 있다. 1980년에 전체 암의 5.8%를 차지하던 것이 1985년에는 6.1%, 1995년에는 8.2%, 2002년에는 11.2%, 2012년에는 12.9%로 지속적인 증가 추세를 보이고 있으며, 1999년도 자료와 비교해 볼 때 2011년에는 위암, 간암 및 자궁경부암의 사망률이 감소한 데 비해 대장암에 의한 사망률은 5.6%의 증가하였다.
Colon cancer is the second most common cancer in cancer deaths, accounting for about 13% of all cancers. In the United States, about 50,310 people die of colorectal cancer annually and 136,830 new colorectal cancer cases are reported to occur. In Korea, colon cancer accounts for the fourth place in lung cancer, liver cancer, and stomach cancer in terms of mortality. In 1980, 5.8% of total cancer cases showed a steadily increasing trend in 6.15% in 1985, 8.2% in 1995, 11.2% in 2002 and 12.9% in 2012. Compared with 1999 data In 2011, mortality from gastric cancer, liver cancer and cervical cancer decreased, while mortality from colon cancer increased by 5.6%.

대장내시경을 이용한 조기진단 및 근치술, 화학요법 등의 발달로 전체 대장암 환자의 5년 생존율이 약 64.7%로 보고되고 있다. 그러나 1기 대장암 환자의 5년 생존율이 89.8%, 2-3기 대장암 환자의 5년 생존율이 70.5%로 향상된 반면, 말기 대장암 환자의 5년 생존율은 12.9%로 지난 30여년 간 정체되었고, 기존 치료법을 통하여 생존율을 향상시키기는 어려운 실정이다. 최근 보험급여가 인정된 얼비툭스(머크사) 및 아바스틴(로슈사) 등의 표적치료제에 의한 말기 대장암 환자의 무병생존 연장기간은 기존의 화학요법 대비 각각 0.9개월(p = 0.048) 및 1.4개월(p = 0.0023)로 임상적 실효성이 불충분한 상황이다. 유방암 및 폐암 등과 비교할 때, 대장암은 유전체학적 연구가 미비하고, 이에 따라 분자수준의 환자계층화가 미비한 실정이다. 따라서, 대장암 환자의 생존율을 높이기 위해서는 무엇보다도 분자수준의 환자계층화 및 새로운 대장암 맞춤 타겟의 발굴이 시급하다.
The 5 - year survival rate of all colorectal cancer patients is reported to be about 64.7% due to early diagnosis, curative surgery and chemotherapy using colonoscopy. However, the 5-year survival rate of primary colorectal cancer patients was 89.8% and the 5-year survival rate of second-stage colorectal cancer patients was improved to 70.5%, while the 5-year survival rate of end stage colorectal cancer patients was 12.9% , And it is difficult to improve the survival rate through existing therapies. The duration of disease-free survival of end-stage colorectal cancer patients treated with targeted therapies such as Erbitux (Merck) and Avastin (Roche), which recently received insurance benefits, was 0.9 months (p = 0.048) and 1.4 months = 0.0023), indicating that clinical efficacy is inadequate. Compared with breast cancer and lung cancer, colon cancer has not been studied in genomic studies, and thus, there is not much stratification of the molecular level. Therefore, in order to increase the survival rate of patients with colorectal cancer, it is urgent to stratify the molecular level and to find a new target for colon cancer.

대장암의 발생과정에는 여러 종류의 암 유전자, 종양 억제유전자 등 다양한 유전자 변화가 관여하는 것으로 알려져 있다. 실제 대장암은 발암과정에서 일어나는 유전적 변화가 가장 많이 밝혀진 암이다. 대장암은 한 개의 암 유전자 또는 종양억제 유전자의 변화가 단독으로 암을 유발시킬 수 있는 것이 아니고 정상 대장 점막세포가 선종의 단계를 거쳐 대장암으로 진행되기 위해서는 수년에 걸친 긴 세월을 통해 여러 개의 암 관련 유전자의 변화가 축적되어야 하는데 이를 대장암 발생에 있어서 유전자의 다단계적 변화라고 한다. 여기서 중요한 것은 각 단계에서의 유전자 변화 순서가 아니라 궁극적으로 누적되는 유전자들의 변화의 총합이다. 대장암 발생과정에 관여하는 유전적 변화로는 암 유전자와 종양 억제유전자의 돌연변이, DNA 메틸화 이상 및 DNA 수리 유전자의 돌연변이도 관계된다.
It is known that various genetic changes such as various kinds of cancer genes and tumor suppressor genes are involved in the development of colon cancer. Actually, colorectal cancer is one of the most common genetic changes in carcinogenesis. In the case of colorectal cancer, the change of one cancer gene or tumor suppressor gene can not induce cancer alone. In order for normal colon mucosal cells to progress to colorectal cancer through adenoma stage, Changes in related genes should be accumulated, which is called multistep change of genes in the development of colorectal cancer. What is important here is not the sequence of gene changes at each step but the sum of the changes of genes that ultimately accumulate. The genetic changes involved in colon cancer development involve the mutation of cancer gene and tumor suppressor gene, DNA methylation abnormality, and mutation of DNA repair gene.

암의 형성은 다양한 유전자들과 이들 유전자들의 상호작용이 복합적으로 연관되어 진행되므로, 전체 유전자의 발현 및 돌연변이를 조사하고 비교하여 새로운 암 유전자 및 치료용 타겟을 발굴하고자 하는 유전체적 접근이 이루어지고 있다. 암세포에서 특이적으로 발현이 증가하거나 감소하는 유전자들은 세포분열, 세포신호전달, 세포 골격, 세포 운동, 세포 방어, 유전자 및 단백질의 발현 그리고 세포 내 물질 대사 등 암세포의 생존 및 증식에 관여하는 것으로 보고되고 있다. 암 유전체학의 발전은 유방암 및 폐암 등의 환자계층화 및 계층별 환자의 맞춤 치료용 타겟 발굴에 기여하였다. 그러나 발굴된 타겟에 대한 임상적 실효성에 대한 평가의 부재는 암 유전체학을 통해 연구개발된 맞춤형 항암 치료제의 임상연구 실패의 주요 원인으로 평가되고 있다.
Since the formation of cancer is caused by a complex combination of various genes and the interaction of these genes, a genetic approach to discover new cancer genes and therapeutic targets has been made by investigating and comparing the expression and mutation of whole genes . Genes that specifically increase or decrease expression in cancer cells are thought to be involved in the survival and proliferation of cancer cells such as cell division, cell signaling, cytoskeleton, cell movement, cell defense, gene and protein expression, and intracellular metabolism . The development of cancer genomics has contributed to the stratification of patients such as breast cancer and lung cancer, and to the development of targets for tailored treatment for individual patients. However, the absence of an assessment of the clinical efficacy of the identified target has been evaluated as a major cause of failure in clinical studies of customized chemotherapeutic drugs developed through cancer genomics.

NTRK1 융합유전자는 다양한 암종에 걸쳐 보고된 바 있다. 대장암과 갑상선 유두암에서는 TPM3-NTRK1의 융합유전자가 보고되었고(Martin-Zanca 등, Nature, 1986년, 319권, 743~748 페이지; Wilton 등, Cytogenetics and cell genetics, 1995년, 68권, 122~124 페이지), 폐암에서는 미오신 탈인산화효소 Rho 상호작용 단백질(myosin phosphatase Rho interacting protein: MPRIP) 또는 CD74-NTRK1 융합유전자가 보고되었으며(Vaishnave 등, Nature medicine, 2013년, 19권, 1469~1472 페이지; 특허번호 WO 2014071358 A2), 스피트조이드(Spitzoid) 흑색종에서는 TP53 또는 라민 A/C (lamin A/C: LMNA)-NTRK1 융합유전자가 보고되었고(Wiesner 등, Nature communications, 2014년 5권, 3116; 특허번호 WO 2013059740 A1), 담낭암에서는 RAB 지티파아제 유사 활성 단백질 1(RAB GTPase activating protein 1-like: RABGAP1L)-NTRK1 융합유전자가 보고되었으며(Ross 등, The oncologist, 2014년, 19권, 235~242 페이지), 다형교아종에서는 뉴로파신(neurofascin: NFASC) 또는 브레비칸(brevican: BCAN)-NTRK1 융합유전자가 보고되었다(Kim 등, PloS one, 2014년, 9권, e91940). 그러나 NTRK1 융합유전자의 암 유전자로서의 역할은 일부 암종에서 규명된 반면, 대장암에서 NTRK1 융합유전자의 암 유전자로서의 역할, 빈도 및 임상적 의의는 포괄적으로 규명된 바 없다.
The NTRK1 fusion gene has been reported across a variety of carcinomas. The fusion gene of TPM3-NTRK1 has been reported in colon cancer and papillary thyroid carcinoma (Martin-Zanca et al., Nature, 1986, 319, 743-748; Wilton et al., Cytogenetics and cell genetics, 124), and myosin phosphatase Rho interacting protein (MPRIP) or CD74-NTRK1 fusion gene has been reported in lung cancer (Vaishnave et al., Nature medicine, 2013, vol. 19, p. 1469-1472; Patent No. WO 2014071358 A2), TP53 or lamin A / C: LMNA) -NTRK1 fusion genes have been reported in Spitzoid melanomas (Wiesner et al., Nature communications, vol. 5, 3116, 2014; RAB GTPase activating protein 1-like (RABGAP1L) -TRK1 fusion gene has been reported in gallbladder carcinoma (Ross et al., The oncologist, 2014, vol. 19, 242 pages), and in multifocal subspecies neurofascin NFASC) or brevican (BCAN) -NTRK1 fusion genes have been reported (Kim et al., PloS one, 2014, vol. 9, e91940). However, the role of the NTRK1 fusion gene as a cancer gene has been elucidated in some carcinomas, while the role, frequency, and clinical significance of the NTRK1 fusion gene as a cancer gene in colorectal cancer has not been elucidated.

본 발명자들은 대장암의 유전체학적 연구를 통해 대장암 환자계층화 및 예후예측, 그리고 치료의 타겟이 될 수 있는 유전자의 발견을 위하여 예의 연구 노력한 결과, 한국인 및 중국인 대장암 조직 내 LMNA-NTRK1 및 TPM3-NTRK1 융합유전자 및 이의 발현이 특정 환자군에서 현저히 증가하며, NTRK1 융합유전자 및 이에 의해 암호화되는 융합전사산물 및 융합단백질은 대장암의 임상예후를 결정하는 인자로, 이를 억제하는 경우 대장암 치료 효능이 있다는 것을 알아냄으로써, 본 발명을 완성하게 되었다.
The present inventors have conducted intensive research for genome-wide study of colon cancer through stratification of colorectal cancer patients, prediction of prognosis, and discovery of genes that can be targeted for treatment. As a result, LMNA-NTRK1 and TPM3- The NTRK1 fusion gene and its expression are significantly increased in certain patient groups, and the NTRK1 fusion gene and the fusion transcription product and fusion protein encoded thereby are the factors that determine the clinical prognosis of colorectal cancer. , Thereby completing the present invention.

본 발명의 하나의 목적은, LMNA(Lamin A) 또는 TPM3(Tropomyosin 3)와 NTRK1(Neurotrophic Tyrosine Kinase, Receptor, Type 1)와의 융합단백질 및 이를 암호화하는 융합유전자 폴리뉴클레오티드를 제공하는 것이다.One object of the present invention is to provide a fusion protein of LMNA (Lamin A) or TPM3 (Tropomyosin 3) with NTRK1 (Neurotrophic Tyrosine Kinase, Receptor, Type 1) and a fusion gene polynucleotide encoding the fusion protein.

본 발명의 다른 목적은, 상기 융합단백질, 상기 융합유전자, 또는 상기 융합유전자의 전사산물을 측정하는 제제를 포함하는 대장암 진단용 조성물을 제공하는 것이다.Another object of the present invention is to provide a composition for diagnosing colorectal cancer comprising the fusion protein, the fusion gene, or the agent for measuring the transcription product of the fusion gene.

본 발명의 또 다른 목적은, 상기 대장암 진단용 조성물을 포함하는 대장암 진단용 키트를 제공하는 것이다. It is still another object of the present invention to provide a colon cancer diagnostic kit comprising the composition for diagnosing colorectal cancer.

본 발명의 또 다른 목적은, 상기 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계를 포함하는 대장암 진단에 필요한 정보를 제공하는 방법을 제공하는 것이다.Yet another object of the present invention is to provide a method for providing information necessary for diagnosis of colon cancer, comprising the step of measuring the level of said NTRK1 fusion gene, its transcription product or its protein.

본 발명의 또 다른 목적은, 상기 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계를 포함하는 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법을 제공하는 것이다.Yet another object of the present invention is to provide a method for providing information necessary for predicting colon cancer therapeutic prognosis, which comprises the step of measuring the level of said NTRK1 fusion gene, its transcription product or its protein.

본 발명의 또 다른 목적은, 상기 NTRK1 융합유전자의 발현 수준을 측정하는 단계를 포함하는 대장암 치료제의 스크리닝 방법을 제공하는 것이다.It is still another object of the present invention to provide a method for screening a therapeutic agent for colorectal cancer comprising the step of measuring the expression level of the NTRK1 fusion gene.

본 발명의 또 다른 목적은, NTRK1 융합단백질 저해제를 유효성분으로 포함하는 대장암의 예방 또는 치료용 약제학적 조성물을 제공하는 것이다.
It is still another object of the present invention to provide a pharmaceutical composition for the prevention or treatment of colorectal cancer comprising an NTRK1 fusion protein inhibitor as an active ingredient.

본 발명자들은, 상기 목적을 달성하기 위해 예의 연구를 거듭한 결과, 150례의 대장암 임상 조직과 50례의 정상 대장 임상 조직으로부터 추출한 전사체의 차세대 전사체 염기서열분석에 의해, TPM3-NTRK1 융합전사산물 및 LMNA-NTRK1 융합전사산물을 검출하였다. TPM3-NTRK1 융합유전자는 1번 염색체 q21.3와 q23.1 영역 간의 역위에 의해서 발생하였고, LMNA-NTRK1 융합유전자는 상기의 염색체 q22와 q23.1 영역 간의 결실에 의해서 발생하였다.
As a result of intensive studies to achieve the above object, the inventors of the present invention have found that by sequencing the next generations of transcripts from 150 colon cancer clinical specimens and 50 normal colon tissues, TPM3-NTRK1 fusion The transcription product and the LMNA-NTRK1 fusion transcription product were detected. The TPM3-NTRK1 fusion gene was generated by the inversion between chromosome 1q21.3 and q23.1 regions, and the LMNA-NTRK1 fusion gene was generated by deletion between chromosome q22 and q23.1 regions.

상기한 신규한 융합유전자 및 이에 암호화 된 융합전사산물 및 융합단백질은 147례의 한국인 대장암 임상 조직 중 3례(2%)에서 검색되었다. 상기의 사실로부터 대장암에서 융합전사산물 및 융합단백질의 발현은 유전자 융합에 의한 것이 밝혀졌다.
The novel fusion gene and its encoded fusion transcript and fusion protein were detected in 3 out of 147 (2%) Korean colon cancer clinical tissues. From the above facts, it has been revealed that expression of fusion transcription products and fusion proteins in colon cancer is due to gene fusion.

NTRK1 융합전사산물이 확인된 3례의 대장암 임상 조직은 KRAS, NRAS 및 PIK3CA 등 공지의 대장암 유전자의 체세포 돌연변이를 갖지 않았다. 또한 상기의 NTRK1 융합유전자를 종양성이 없는 NIH3T3 세포주에서 강제로 발현시켰을 때 체외 및 채내 종양성이 획득되었다. 상기의 사실로부터 NTRK1 유전자융합은 대장암에 있어서 원인 유전자(driver oncogene)인 것이 밝혀졌다.
Three colon cancer clinical tissues in which NTRK1 fusion transcripts were identified did not have somatic mutation of known colon cancer genes such as KRAS, NRAS and PIK3CA. In addition, when the above NTRK1 fusion gene was forcedly expressed in a non-tumorigenic NIH3T3 cell line, positive in vitro and in situ species were obtained. From these facts, it has been found that NTRK1 gene fusion is a driver oncogene in colorectal cancer.

본 발명의 융합단백질은 29.2%의 한국인 대장암 환자에서 확인되었고, 25.6%의 중국인 대장암 환자에서 확인되어, 상기의 융합유전자는 한국인 및 중국인 등 동북아시아인의 대장암에 있어서 높은 빈도로 관찰되는 원인 유전자인 것이 밝혔다. 또한 본 발명의 융합유전자 또는 이에 암호화되는 융합단백질을 갖는 대장암 환자의 생존율은 본 발명의 융합유전자 또는 이에 암호화되는 융합단백질을 갖지 않는 대장암 환자의 생존율보다 낮아, 본 발명의 융합유전자가 대장암 환자의 임상예후를 예측하는 진단 마커로 가능하다는 것을 알아내었다.
The fusion protein of the present invention was confirmed in 29.2% of Korean patients with colorectal cancer and confirmed in 25.6% of Chinese patients with colorectal cancer. The fusion gene was observed in a high frequency in colon cancer of Northeast Asian people including Korean and Chinese It is the causative gene. In addition, the survival rate of a colon cancer patient having the fusion gene of the present invention or a fusion protein encoded by the fusion gene of the present invention is lower than the survival rate of a colorectal cancer patient having no fusion gene or the fusion protein of the present invention, And it is possible to use it as a diagnostic marker to predict the clinical prognosis of the patient.

상기의 유전자 융합은, NTRK1 단백질의 향상된 활성화를 초래하는 것이라고 생각되며, 이러한 활성화가 발생하고 있는 대장암 환자에 있어서는, NTRK1 단백질에 대한 저해제가 치료에 있어서 유효성을 나타내는 것이라고 생각된다. 이 때문에, 본 발명자들은, 대장암에 있어서, 해당 융합유전자를 표적으로 하여 약물에 의한 치료의 유효성을 예측하는 것이 가능하며, 또한, 이 예측에 있어서 약물에 의한 치료가 유효하다고 판정된 환자에 해당 약물을 투여하면, 효율적인 치료가 가능한 것을 알아내어, 본 발명을 완성하기에 이르렀다.
It is believed that the gene fusion results in enhanced activation of the NTRK1 protein. In patients with colon cancer in which such activation is occurring, inhibitors of the NTRK1 protein are considered to be effective in treatment. Therefore, the present inventors have found that, in colorectal cancer, the efficacy of treatment with a drug can be predicted by using the fusion gene as a target, and furthermore, in the prediction, It has been found that effective treatment is possible when a drug is administered, and the present invention has been completed.

따라서, 본 발명은, TPM3 유전자 또는 LMNA 유전자와 NTRK1 유전자간의 융합유전자 및 이에 암호화되는 융합전사산물 및 융합단백질의 존재를 지표로 대장암 환자의 임상예후를 진단하는 방법, NTRK1 단백질 저해제에 의한 대장암 치료의 유효성을 판정하는 방법, 이 유효성의 판정을 이용한 대장암의 치료 방법에 관한 것으로서, 보다 상세하게는, 이하의 발명을 제공하는 것이다.
Accordingly, the present invention provides a method for diagnosing the clinical prognosis of patients with colorectal cancer based on the presence of the TPM3 gene or the fusion gene between the LMNA gene and the NTRK1 gene and the fusion transcription product and the fusion protein encoded thereby, The present invention relates to a method of determining the efficacy of treatment, and a method of treating colon cancer using the determination of efficacy, and more particularly, to provide the following invention.

상기 목적을 달성하기 위한 하나의 양태로서, 본 발명은 LMNA(Lamin A) 또는 TPM3(Tropomyosin 3)와 NTRK1(Neurotrophic Tyrosine Kinase, Receptor, Type 1)와의 융합단백질 및 이를 암호화하는 융합유전자 폴리뉴클레오티드를 제공한다.In one aspect, the present invention provides a fusion protein of LMNA (Lamin A) or TPM3 (Tropomyosin 3) with NTRK1 (Neurotrophic Tyrosine Kinase, Receptor, Type 1) and a fusion gene polynucleotide encoding the fusion protein do.

구체적으로, 본 발명은 하기의 구조식으로 이루어진 융합단백질을 제공한다.Specifically, the present invention provides a fusion protein having the following structural formula.

N-[LMNA(Lamin A)의 N 말단을 포함하는 단편]-[NTRK1(Neurotrophic Tyrosine Kinase, Receptor, Type 1)의 C 말단을 포함하는 단편]-C, 또는A fragment containing the N-terminus of N- [LMNA (Lamin A)] - [a fragment containing the C-terminus of NTRK1 (Neurotrophic Tyrosine Kinase, Receptor, Type 1]

N-[TPM3(Tropomyosin 3)의 N 말단을 포함하는 단편]-[NTRK1의 C 말단을 포함하는 단편]-C.A fragment containing the N-terminus of N- [TPM3 (Tropomyosin 3)] - [a fragment containing the C-terminus of NTRK1] -C.

또한, 본 발명은 하기의 구조식으로 이루어진 융합유전자 폴리뉴클레오티드를 제공한다.The present invention also provides a fusion gene polynucleotide having the following structural formula.

5'-[LMNA 유전자의 5' 말단을 포함하는 단편]-[NTRK1 유전자의 3' 말단을 포함하는 단편]-3', 또는5 '- [fragment containing the 5' end of the LMNA gene] - [fragment containing the 3 'end of the NTRK1 gene] -3'

5'-[TPM3의 5' 말단을 포함하는 단편]-[NTRK1 유전자의 3' 말단을 포함하는 단편]-3'.
5 '- [Fragment containing the 5' end of TPM3] - [Fragment containing the 3 'end of the NTRK1 gene] -3'.

즉, 본 발명은 LMNA의 N 말단을 포함하는 단편 또는 TPM3의 N 말단을 포함하는 단편과 NTRK1의 C 말단을 포함하는 단편이 연결된 융합단백질 및 LMNA 유전자의 5' 말단을 포함하는 단편 또는 TPM3 유전자의 5' 말단을 포함하는 단편과 NTRK1 유전자의 3' 말단을 포함하는 단편이 연결된 융합단백질에 관한 것으로, 상기 LMNA, TPM3 또는 NTRK1은 인간 유래일 수 있다. 본 발명의 융합단백질은 NTRK1 융합단백질, 또는 경우에 따라 LMNA-NTRK1 융합단백질 또는 TPM3-NTRK1 융합단백질과 혼용될 수 있다.That is, the present invention provides a fusion protein comprising a N-terminal fragment of LMNA or a fusion protein comprising a fragment containing N-terminal of TPM3 and a fragment containing C-terminal of NTRK1 and a fragment comprising the 5'- The fusion protein comprising a fragment comprising the 5 'end and a fragment comprising the 3' end of the NTRK1 gene are linked, wherein the LMNA, TPM3 or NTRK1 may be human. The fusion protein of the present invention may be mixed with an NTRK1 fusion protein, or alternatively, a LMNA-NTRK1 fusion protein or a TPM3-NTRK1 fusion protein.

본 발명자들은 본 발명 실시예에 있어서 나타내는 바와 같이, 인간 대장암 환자에서 LMNA 단백질과 NTRK1 단백질과의 융합예를 처음으로 밝혀졌다. 또한 TPM3 유전자 또는 LMNA 유전자와 NTRK1 유전자의 융합 및 이에 암호화된 융합전사산물과 융합단백질을 갖는 대장암 환자의 빈도 및 임상예후를 대규모 대장암 환자 코호트 분석을 통해 처음으로 확인하였다(표 1).
As shown in the examples of the present invention, the present inventors first found an example of fusion of LMNA protein and NTRK1 protein in human colon cancer patients. The frequency and clinical prognosis of patients with colorectal cancer with fusion of the TPM3 gene or the LMNA gene with the NTRK1 gene and the fusion fusion product and fusion protein encoded thereby were first identified through a cohort analysis of large colorectal cancer patients (Table 1).

본 발명에서 TPM3 단백질을 암호화하는 TPM3 유전자는 인간으로부터 유래하는 것일 수 있고 인간의 염색체 1번(q21.3)에 위치하며, 이로부터 암호화되는 TPM3 단백질은 총 아미노산 길이가 285aa인 폴리펩티드이다. TPM3 단백질 또는 TPM3 단백질 단편은 TPM3-NTRK1 융합단백질의 N 말단쪽 융합파트너이다. 예컨대, TPM3 유전자는 GenBank accession no. NM_152263에서 제공되는 염기 서열을 갖는 것일 수 있으며, TPM3 단백질은 NM_152263에 의하여 암호화되는 아미노산 서열을 갖는 단백질일 수 있다. 특히 상기 TPM3 단백질은 서열번호 4의 아미노산 서열로 이루어진 폴리펩티드일 수 있으며, 상기 TPM3 유전자는 서열번호 2의 핵산 서열로 구성된 것일 수 있으나, 이에 제한되지 않는다.In the present invention, the TPM3 gene encoding the TPM3 protein may be derived from a human and is located on human chromosome 1 (q21.3), and the TPM3 protein encoded thereby is a polypeptide having a total amino acid length of 285 aa. The TPM3 protein or TPM3 protein fragment is the N-terminal fusion partner of the TPM3-NTRK1 fusion protein. For example, the TPM3 gene can be obtained from GenBank accession no. NM_152263, and the TPM3 protein may be a protein having an amino acid sequence encoded by NM_152263. In particular, the TPM3 protein may be a polypeptide consisting of the amino acid sequence of SEQ ID NO: 4, and the TPM3 gene may be composed of the nucleic acid sequence of SEQ ID NO: 2, but is not limited thereto.

본 발명에서 TPM3의 N 말단을 포함하는 단편은 NM_152263의 8번째 엑손(1번 염색체 상의 (-) 가닥(strand)을 기준으로 154134289~154142876 염기위치)까지의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열을 갖는 것일 수 있으며, 엑손 8의 3’말단에 코돈(codon)을 이루지 못하는 1개의 뉴클레오타이드(g)가 존재하는 형태일 수 있다(표 2 및 표 3). 특히 상기 TPM3의 N 말단을 포함하는 단편은 서열번호 5의 아미노산 서열로 구성되는 것일 수 있으나, 이에 제한되지 않는다.
In the present invention, the fragment containing the N-terminus of TPM3 has the amino acid sequence encoded by the nucleotide sequence from the eighth exon of NM_152263 (the nucleotide positions 154134289 to 154142876 based on the (-) strand on chromosome 1) , And one nucleotide (g) that does not form a codon at the 3 'end of exon 8 may exist (Table 2 and Table 3). In particular, the fragment containing the N-terminus of TPM3 may be composed of the amino acid sequence of SEQ ID NO: 5, but is not limited thereto.

LMNA 단백질을 암호화하는 LMNA 유전자는 인간으로부터 유래하는 것일 수 있고 인간의 염색체 1번(q22)에 위치하며, 이로부터 암호화되는 LMNA 단백질은 총 아미노산 길이가 664aa인 폴리펩티드이다. LMNA 단백질 또는 LMNA 단백질 단편은 LMNA-NTRK1 융합단백질의 N 말단쪽 융합파트너이다. LMNA 유전자는 GenBank accession no. NM_170707에서 제공되는 뉴클레오타이드 서열을 갖는 것일 수 있으며, LMNA 단백질은 NM_170707에 의하여 암호화되는 아미노산 서열을 갖는 단백질 일 수 있다. 특히 상기 LMNA 단백질은 서열번호 27의 아미노산 서열로 이루어진 폴리펩티드일 수 있으며, 상기 LMNA 유전자는 서열번호 28의 핵산 서열로 구성된 것일 수 있으나, 이에 제한되지 않는다.
The LMNA gene encoding the LMNA protein may be derived from a human and is located on human chromosome 1 (q22), and the encoded LMNA protein is a polypeptide having a total amino acid length of 664 aa. The LMNA protein or LMNA protein fragment is the N-terminal fusion partner of the LMNA-NTRK1 fusion protein. The LMNA gene is available from GenBank accession no. NM_170707, and the LMNA protein may be a protein having an amino acid sequence encoded by NM_170707. In particular, the LMNA protein may be a polypeptide consisting of the amino acid sequence of SEQ ID NO: 27, and the LMNA gene may be composed of the nucleic acid sequence of SEQ ID NO: 28, but is not limited thereto.

본 발명에서 LMNA의 N 말단을 포함하는 단편은 NM_170707의 6번째 엑손(1번 염색체 상의 (+)가닥을 기준으로 156084504~156105740 염기위치)까지의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열을 갖는 것일 수 있으며, 엑손 6의 3’말단에 코돈(codon)을 이루지 못하는 1개의 뉴클레오타이드(c)가 존재하는 형태일 수 있다(표 6 및 표 7). 특히 상기 LMNA의 N 말단을 포함하는 단편은 서열번호 31의 아미노산 서열로 구성되는 것일 수 있으나, 이에 제한되지 않는다.
In the present invention, the fragment containing the N-terminus of LMNA may have an amino acid sequence encoded by a nucleotide sequence from the sixth exon of NM_170707 (base position 156084504 to 156105740 based on the (+) strand on chromosome 1) , And one nucleotide (c) that does not form a codon at the 3 'end of exon 6 (Table 6 and Table 7). In particular, the fragment containing the N-terminus of the LMNA may be composed of the amino acid sequence of SEQ ID NO: 31, but is not limited thereto.

NTRK1 단백질을 암호화하는 NTRK1 유전자는 인간으로부터 유래하는 것일 수 있고 인간의 염색체 1번(q23.1)에 위치하며, 이로부터 암호화되는 NTRK1 단백질은 총 아미노산 길이가 796aa인 단백질이다. NTRK1 단백질 또는 NTRK1 단백질 단편은 TPM3-NTRK1 융합단백질의 C 말단쪽 융합 파트너이다. 예컨대, 상기 NTRK1 유전자는 GenBank accession no. NM_001012331에 제공되는 염기 서열을 갖는 것일 수 있으며, 상기 NTRK1 단백질은 NM_001012331에 의하여 암호화되는 아미노산 서열을 갖는 단백질일 수 있다. 특히 상기 NTRK1 단백질은 서열번호 7의 아미노산 서열로 구성되는 폴리펩티드일 수 있으며, 상기 NTRK1 유전자는 서열번호 8, 서열번호 10 또는 서열번호 13의 핵산 서열로 구성된 것일 수 있으나, 이에 제한되지 않는다.
The NTRK1 gene encoding the NTRK1 protein can be derived from a human and is located on human chromosome 1 (q23.1), and the NTRK1 protein encoded thereby is a protein having a total amino acid length of 796 aa. The NTRK1 protein or NTRK1 protein fragment is the C-terminal fusion partner of the TPM3-NTRK1 fusion protein. For example, the NTRK1 gene can be obtained from GenBank accession no. NM_001012331, and the NTRK1 protein may be a protein having an amino acid sequence encoded by NM_001012331. In particular, the NTRK1 protein may be a polypeptide consisting of the amino acid sequence of SEQ ID NO: 7, and the NTRK1 gene may be composed of the nucleic acid sequence of SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO:

본 발명에서 NTRK1의 C 말단을 포함하는 단편은 NM_001012331의 엑손 9(1번 염색체 상의 (+)가닥을 기준으로 156830671~156844363 염기위치) 또는 엑손 11(1번 염색체 상의 (-)가닥을 기준으로 156830671~156845312 염기위치) 또는 엑손 12(1번 염색체 상의 (-)가닥을 기준으로 156830671~156845872 염기위치)부터 마지막 엑손까지의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열을 갖는 것일 수 있다. 이때, 엑손 9의 3’말단의 처음 시작하는 2개의 뉴클레오타이드(ac)와 엑손 11의 3’말단의 처음 시작하는 2개의 뉴클레오타이드(gc) 및 엑손 12의 3’말단의 처음 시작하는 2개의 뉴클레오타이드(gt)는 코돈을 이루지 못하는 형태일 수 있고, TPM3 단백질의 단편과 융합 시에 앞서 설명한 바와 같이 TPM3 단백질의 단편에 추가로 포함된 1개의 뉴클레오타이드(g)와 연결되어 코돈(gac 또는 ggc 또는 ggt)을 이루어 하나의 아미노산(D 또는 G)을 코딩할 수 있다. 특히 상기 NTRK1의 C 말단을 포함하는 단편은 서열번호 15, 서열번호 17 또는 서열번호 19의 아미노산 서열로 구성되는 것일 수 있으나, 이에 제한되지 않는다.
In the present invention, the fragment containing the C-terminus of NTRK1 has the nucleotide sequence of exon 9 of NM_001012331 (position 156830671 to 156844363 on the (+) strand of chromosome 1) or exon 11 (position 156830671 Or 156845312 nucleotide positions) or exon 12 (156830671 to 156845872 base positions relative to the (-) strand on chromosome 1) to the last exon. (Ac) of the 3 'end of exon 9, 2 nucleotides (gc) starting from the 3' end of exon 11, and 2 nucleotides starting from the 3 'end of exon 12 gt; may be in a form that does not form a codon, and when fused with a fragment of the TPM3 protein, the codon (gac or ggc or ggt) is linked to one nucleotide (g) To encode one amino acid (D or G). In particular, the fragment containing the C-terminus of NTRK1 may be composed of the amino acid sequence of SEQ ID NO: 15, SEQ ID NO: 17 or SEQ ID NO: 19, but is not limited thereto.

본 명세서에 있어서 융합파트너인 단백질의 단편들의 절단점(break point; 또는 융합 부위)은 이들을 암호화하는 유전자의 엑손(exon)을 기준으로 설명되며, 상기 단편의 N말단(C말단 융합파트너의 경우) 또는 C말단(N말단 융합 파트너의 경우) 마지막에 포함되는 엑손과 절단되어 제거되는 엑손 사이의 인트론(intron) 부위 중 어느 지점에서 절단되어도 암호화되는 단백질 단편의 아미노산 서열에는 영향을 미치지 않게 되므로, 실제 절단되는 지점은 상기 인트론 위치 중 어느 지점이어도 무방하다.
In this specification, break points (or fusion sites) of fragments of a fusion partner are described on the basis of the exon of the gene encoding them, and the N-terminus of the fragment (in the case of a C-terminal fusion partner) Or the intron site between the exon at the last (in the case of the N-terminal fusion partner) and the exon which is cleaved to be cleaved does not affect the amino acid sequence of the protein fragment to be encoded even when cleaved at any point in the C- The point to be cut may be any of the intron positions.

한편, 본 발명에서 LMNA-NTRK1 융합유전자는 서열번호 33의 염기서열을 가지는 것일 수 있으며, LMNA-NTRK1 융합단백질은 서열번호 34의 염기서열을 가지는 것일 수 있으나, 이에 제한되지 않는다. 또한, 본 발명에서 TPM3-NTRK1 융합유전자는 서열번호 21,서열번호 22 또는 서열번호 23의 염기서열을 가지는 것일 수 있으며, TPM3-NTRK1 융합단백질은 서열번호 24, 서열번호 25 또는 서열번호 26을 가지는 것일 수 있으나, 이에 제한되지 않는다.
In the present invention, the LMNA-NTRK1 fusion gene may have the nucleotide sequence of SEQ ID NO: 33, and the LMNA-NTRK1 fusion protein may have the nucleotide sequence of SEQ ID NO: 34, but is not limited thereto. Also, the TPM3-NTRK1 fusion gene may have the nucleotide sequence of SEQ ID NO: 21, SEQ ID NO: 22 or SEQ ID NO: 23, and the TPM3-NTRK1 fusion protein may have the nucleotide sequence of SEQ ID NO: 24, SEQ ID NO: But is not limited thereto.

본 발명에서 별다른 언급이 없는 한, 본 명세서에 기재된 유전자, 전사산물 또는 전사산물 유래 cDNA 또는 DNA 분자를 시퀀싱하여 결정된 모든 뉴클레오타이드 서열들은 자동화된 차세대 염기서열 시퀀서 또는 생거법 염기서열 시퀀서를 사용하여 결정될 수 있고, 결정된 뉴클레오타이드 서열에 의하여 암호화되는 모든 아미노산 서열들은 자동화된 펩타이드 시퀀서를 사용하여 결정될 수 있다. 이 자동화된 접근에 의하여 결정된 뉴클레오타이드 서열은 실제 서열과 비교하여 일부 에러를 포함할 수 있다. 예컨대, 자동에 의하여 결정된 뉴클레오타이드 서열들은 시퀀싱된 cDNA 또는 DNA 분자의 실제 뉴클레오타이드 서열과 전형적으로 약 90% 이상, 구체적으로 약 95% 이상, 보다 구체적으로 약 99% 이상, 더욱 구체적으로 약 99.9% 이상의 서열 상동성을 가질 수 있다. 실제 서열과 비교하여 결정된 뉴클레오타이드에서 하나의 삽입 또는 결손은 포함할 수 있으며, 이와 같은 삽입 또는 결손에 의하여 뉴클레오타이드 서열 번역시의 프레임시프트(frameshift)가 야기되어, 암호화되는 아미노산 서열이 실제 아미노산 서열과 완전하게 다르게 될 수 있다.
Unless otherwise stated in the present invention, all nucleotide sequences determined by sequencing cDNA, or DNA molecules derived from a gene, transcription product, or transcription product described herein, can be determined using an automated next generation nucleotide sequence sequencer or a lethal nucleotide sequence sequencer And all amino acid sequences encoded by the determined nucleotide sequence can be determined using an automated peptide sequencer. The nucleotide sequence determined by this automated approach may contain some errors compared to the actual sequence. For example, the nucleotide sequences determined by automated sequencing are typically at least about 90%, specifically at least about 95%, more specifically at least about 99%, more specifically at least about 99.9% sequence identity with the actual nucleotide sequence of the sequenced cDNA or DNA molecule And may have homology. May include one insert or deletion in the nucleotide determined relative to the actual sequence and such insertions or deletions may result in framehifts in translating the nucleotide sequence so that the encoded amino acid sequence is identical to the actual amino acid sequence and complete It can be different.

또 하나의 양태로서, 본 발명은 상기 융합단백질, 상기 융합유전자, 또는 상기 융합유전자의 전사산물을 측정하는 제제를 포함하는 대장암 진단용 조성물을 제공한다.In another embodiment, the present invention provides a composition for diagnosing colorectal cancer comprising the fusion protein, the fusion gene, or an agent for measuring the transcription product of the fusion gene.

본 발명에서 상기 융합단백질 및 융합유전자는 상기 설명한 바와 같다.
In the present invention, the fusion protein and the fusion gene are as described above.

본 발명에서 용어, "진단"은 병리 상태의 존재 또는 특징을 확인하는 것을 의미한다. 본 발명의 목적상, 진단은 대장암 발병 여부를 확인하는 것이다.As used herein, the term "diagnosis" means identifying the presence or characteristic of a pathological condition. For the purpose of the present invention, the diagnosis is to confirm the incidence of colon cancer.

본 발명에서 용어, “대장암(colon cancer)”은 대장의 가장 안쪽 표면인 점막에서 발생한 암으로, 직장암, 결장암 및 항문암을 통칭한 것을 의미한다.In the present invention, the term " colon cancer " refers to a cancer arising from the mucosa, which is the innermost surface of the large intestine, and refers to rectal cancer, colon cancer and anal cancer.

한편, 본 발명에서 LMNA 또는 TPM3와 NTRK1의 융합단백질 및 융합유전자는 대장암을 진단하는 진단용 마커로 사용될 수 있으며, 본 발명에서 용어, “진단용 마커, 진단하기 위한 마커 또는 진단 마커(diagnosis marker)”란 대장암 세포를 정상 세포와 구분하여 진단할 수 있는 물질로, 정상 세포에 비하여 대장암을 가진 세포에서 증가양상을 보이는 폴리펩타이드 또는 핵산(예: mRNA 등), 지질 , 당지질, 당단백질, 당(단당류, 이당류, 올리고당류 등) 등과 같은 유기 생체 분자 등을 포함한다. 본 발명의 목적상, 대장암 진단 마커는 LMNA-NTRK1(Lamin A-Neurotrophic Tyrosine Kinase, Receptor, Type 1) 또는 TPM3-NTRK1(Tropomyosin 3-Neurotrophic Tyrosine Kinase, Receptor, Type 1) 융합유전자로, 대장암 조직 또는 세포에서 발현이 증가하는 유전자들이다. In the present invention, the fusion protein and the fusion gene of LMNA or TPM3 and NTRK1 can be used as diagnostic markers for diagnosing colon cancer. In the present invention, the term " diagnostic markers, markers for diagnosis or diagnosis markers " Is a substance that can distinguish colon cancer cells from normal cells and includes polypeptides or nucleic acids (for example, mRNA) showing an increase pattern in cells having colon cancer as compared with normal cells, lipids, glycolipids, glycoproteins, (Monosaccharides, disaccharides, oligosaccharides, etc.), and the like. For the purpose of the present invention, the colon cancer diagnostic marker is a fusion gene of LMNA-NTRK1 (Lamin A-Neurotrophic Tyrosine Kinase, Receptor, Type 1) or TPM3-NTRK1 (Tropomyosin 3-Neurotrophic Tyrosine Kinase, Receptor, Type 1) Are genes whose expression is increased in tissues or cells.

유의성 있는 진단 마커의 선택과 적용은 진단 결과의 신뢰도를 결정짓는다. 유의성 있는 진단 마커란, 진단하여 얻은 결과가 정확하여 타당도(validity)가 높고 반복 측정시에도 일관된 결과를 나타내도록 신뢰도가(reliability)가 높은 마커를 의미한다. 본 발명의 대장암 진단 마커는, 대장암의 발병과 함께 직접적 또는 간접적 요인으로 발현이 항상 증가하는 유전자들로 반복된 실험에도 동일한 결과를 나타내며, 발현 수준의 차이가 대조군과 비교할 때 매우 커서 잘못된 결과를 내린 확률이 거의 없는 신뢰도가 높은 마커들이다. 그러므로 본 발명의 유의성 있는 진단 마커의 발현 정도를 측정하여 얻은 결과를 토대로 진단된 결과는 타당하게 신뢰할 수 있다. 이때, 정상 대장 상피 세포와 대장암 세포에서 거의 동일한 양으로 발현되는 유전자들은 제외하고, 예를 들어 대조군으로 사용한 정상 조직에서 발현되는 유전자들에 비해 대장암 조직에서 발현이 2배 이상으로 증가되는 유전자들을 선별할 수 있다.
The selection and application of significant diagnostic markers determines the reliability of diagnostic results. Significant diagnostic markers are those markers that have a high degree of validity with high diagnostic accuracy and high reliability to provide consistent results even in repeated measurements. The colon cancer diagnostic markers of the present invention show the same results in repeated experiments with genes whose expression always increases directly or indirectly with the onset of colorectal cancer and the difference in expression level is very large when compared with the control group, Are highly reliable markers with little chance of falling. Therefore, the diagnosis result based on the result of measuring the expression level of the significant diagnostic marker of the present invention can be reasonably reliable. In this case, except for the genes expressing almost the same amount in the normal colon epithelial cells and the colon cancer cells, for example, the genes expressed in the normal tissues used as the control group, Can be selected.

본 발명의 구체적인 일 실시예에서는, TPM3 유전자 또는 LMNA 유전자와 NTRK1 유전자와의 융합유전자는 대장암에 있어서의 원인 유전자이며, 상기의 융합에 의해 NTRK1 단백질 발현의 항진, 나아가서는 NTRK1 단백질의 항상적인 활성화가 생기고, 대장암의 악성화에 기인한 불량예후의 진단 및 NTRK1 단백질 저해제에 의한 치료의 개연성이 높다는 것을 확인하였다.In one embodiment of the present invention, the fusion gene of the TPM3 gene or the LMNA gene and the NTRK1 gene is a causative gene in colorectal cancer. By the above fusion, the expression of the NTRK1 protein is enhanced, and the NTRK1 protein is constantly activated And the diagnosis of poor prognosis caused by malignancy of colon cancer and the possibility of treatment with NTRK1 protein inhibitor are high.

따라서, 본 발명의 NTRK1 융합유전자 또는 이에 암호화되는 융합전사산물 및 융합단백질은 고형암, 구체적으로 대장암 환자에게서 특이적으로 발견되거나 발현되는 것으로 확인되었으므로, 상기 융합유전자 또는 이에 암호화되는 융합전사산물 및 융합단백질은 고형암, 구체적으로 대장암의 예후예측 진단 마커로서 유용하게 사용될 수 있다.
Therefore, the NTRK1 fusion gene of the present invention or the fusion transcription product and fusion protein encoded thereby is found to be specifically detected or expressed in solid tumors, specifically in colorectal cancer patients. Therefore, the fusion gene or the fusion fusion product encoded therewith and fusion Proteins can be usefully used as markers for the prediction of prognosis of solid tumors, specifically colon cancer.

생물학적 시료 중의 본 발명의 융합유전자 발현 수준은 융합유전자의 전사산물, 특히 mRNA 등과 융합단백질의 양을 확인함으로써 확인할 수 있다. 바람직하게는, 상기 융합유전자 mRNA의 수준을 측정하는 제제는 상기 유전자에 특이적으로 결합하는 프라이머를 포함하는 것인 대장암 진단용 조성물일 수 있다.The level of expression of the fusion gene of the present invention in a biological sample can be confirmed by confirming the amount of the transcription product of the fusion gene, particularly mRNA and the amount of the fusion protein. Preferably, the agent for measuring the level of the fusion gene mRNA comprises a primer that specifically binds to the gene.

본 발명에서 “mRNA 발현수준 측정”이란 대장암을 진단하기 위하여 생물학적 시료에서 대장암 마커 유전자들의 mRNA 존재 여부와 발현 정도를 확인하는 과정으로 mRNA의 양을 측정한다. 이를 위한 분석 방법으로는 역전사 중합효소반응(RT-PCR), 경쟁적 역전사 중합효소반응(Competitive RT-PCR), 실시간 역전사 중합효소반응(Real-time RT-PCR), RNase 보호 분석법(RPA; RNase protection assay), 노던 블랏팅(Northern blotting), DNA 칩 등이 있으나 이로 제한되는 것은 아니다. In the present invention, " measurement of mRNA expression level " is used to determine the presence and expression level of mRNA of colon cancer marker genes in a biological sample in order to diagnose colon cancer. RT-PCR, competitive RT-PCR, real-time RT-PCR, RNase protection (RPA), and reverse transcriptase-polymerase chain reaction assay, Northern blotting, DNA chip, and the like.

본 발명에서 상기 융합유전자의 mRNA에 특이적으로 결합하는 프라이머는 상기 융합유전자의 mRNA에 특이적으로 결합하는 프라이머는 프라이머에 의해 증폭되는 영역이 융합 대상인 두 유전자의 융합점을 포함하도록, 융합점을 사이에 두도록 설계된 프라이머일 수 있으며, 상기 융합유전자를 구성하는 융합 대상인 두 유전자가 TPM3 유전자 및 NTRK1 유전자인 경우, TPM3 단백질의 N 말단을 암호화하는 염기서열에 특이적인 프라이머와 NTRK1의 C 말단을 암호화하는 염기서열에 특이적인 프라이머를 포함하는 프라이머쌍이며, 특히, 서열번호 35 및 서열번호 36의 염기서열로 구성된 프라이머쌍일 수 있으나, 이에 제한되지 않는다.In the present invention, the primer that specifically binds to the mRNA of the fusion gene is such that the primer that specifically binds to the mRNA of the fusion gene has a fusing point such that the region amplified by the primer includes the fusion point of two genes to be fused When the two fusion genes constituting the fusion gene are the TPM3 gene and the NTRK1 gene, a primer specific to the nucleotide sequence encoding the N-terminal of the TPM3 protein and a primer coding for the C-terminal of the NTRK1 A primer pair comprising a base sequence specific to a base sequence, and in particular, may be a primer pair consisting of a base sequence of SEQ ID NO: 35 and SEQ ID NO: 36, but is not limited thereto.

또한, 상기 융합유전자를 구성하는 융합 대상인 두 유전자가 LMNA 유전자 및 NTRK1 유전자인 경우, LMNA 단백질의 N 말단을 암호화하는 염기서열에 특이적인 프라이머와 NTRK1의 C 말단을 암호화하는 염기서열에 특이적인 프라이머를 포함하는 프라이머쌍이며, 특히, 서열번호 37 및 서열번호 38의 염기서열로 구성된 프라이머쌍일 수 있으나, 이에 제한되지 않는다.
When the two fusion genes constituting the fusion gene are the LMNA gene and the NTRK1 gene, a primer specific to the nucleotide sequence encoding the N-terminus of the LMNA protein and a primer specific to the nucleotide sequence encoding the C-terminus of the NTRK1 And may be, in particular, a pair of primers consisting of the nucleotide sequence of SEQ ID NO: 37 and SEQ ID NO: 38, but is not limited thereto.

한편, 본 발명의 "프라이머"는 짧은 자유 3말단 수산화기를 가지는 핵산 서열로 상보적인 템플레이트(template)와 염기쌍을 형성할 수 있고 템플레이트 가닥 복사를 위한 시작 지점으로 기능을 하는 짧은 핵산 서열을 의미한다. 프라이머는 적절한 완충용액 및 온도에서 중합반응(즉, DNA 중합효소 또는 역전사효소)을 위한 시약 및 상이한 4가지 뉴클레오사이드 트리포스페이트의 존재하에서 DNA 합성이 개시할 수 있다. 본 발명의 프라이머는, 각 마커 유전자 특이적인 프라이머로 7개 내지 50개의 뉴클레오타이드 서열을 가진 센스 및 안티센스 핵산이다. 프라이머는 DNA 합성의 개시점으로 작용하는 프라이머의 기본 성질을 변화시키지 않는 추가의 특징을 혼입할 수 있다. The term "primer " of the present invention means a short nucleic acid sequence capable of forming a base pair with a complementary template with a short free 3-terminal hydroxyl group and functioning as a starting point for template strand radiation. Primers can initiate DNA synthesis in the presence of reagents and four different nucleoside triphosphates for polymerization reactions (i. E., DNA polymerase or reverse transcriptase) at appropriate buffer solutions and temperatures. The primer of the present invention is a sense and antisense nucleic acid having 7 to 50 nucleotide sequences for each marker gene-specific primer. Primers can incorporate additional features that do not alter the primer's basic properties that serve as a starting point for DNA synthesis.

본 발명의 프라이머는 포스포르아미다이트 고체 지지체 방법, 또는 기타 널리 공지된 방법을 사용하여 화학적으로 합성할 수 있다. 이러한 핵산 서열은 또한 당해 분야에 공지된 많은 수단을 이용하여 변형시킬 수 있다. 이러한 변형의 비제한적인 예로는 메틸화, "캡화", 천연 뉴클레오타이드 하나 이상의 동족체로의 치환, 및 뉴클레오타이드 간의 변형, 예를 들면, 하전되지 않은 연결체(예: 메틸 포스포네이트, 포스포트리에스테르, 포스포로아미데이트, 카바메이트 등) 또는 하전된 연결체(예: 포스포로티오에이트, 포스포로디티오에이트 등)로의 변형이 있다. 핵산은 하나 이상의 부가적인 공유 결합된 잔기, 예를 들면, 단백질(예: 뉴클레아제, 독소, 항체, 시그날 펩타이드, 폴리-L-리신 등), 삽입제(예: 아크리딘, 프소랄렌 등), 킬레이트화제(예: 금속, 방사성 금속, 철, 산화성 금속 등), 및 알킬화제를 함유할 수 있다. 본 발명의 핵산 서열은 또한 검출 가능한 시그날을 직접적으로 또는 간접적으로 제공할 수 있는 표지를 이용하여 변형시킬 수 있다. 표지의 예로는 방사성 동위원소, 형광성 분자, 바이오틴 등이 있다.
The primers of the present invention can be chemically synthesized using the phosphoramidite solid support method, or other well-known methods. Such nucleic acid sequences may also be modified using many means known in the art. Non-limiting examples of such modifications include, but are not limited to, methylation, "capping ", substitution of one or more natural nucleotides into homologues, and modifications between nucleotides, such as uncharged linkers such as methylphosphonate, phosphotriester, (E.g., phosphoramidate, carbamate, etc.) or charged linkages (e.g., phosphorothioate, phosphorodithioate, etc.). The nucleic acid can be in the form of one or more additional covalently linked residues such as a protein such as a nuclease, a toxin, an antibody, a signal peptide, a poly-L-lysine, an intercalator such as acridine, ), Chelating agents (e.g., metals, radioactive metals, iron, oxidizing metals, etc.), and alkylating agents. The nucleic acid sequences of the present invention can also be modified using labels that can directly or indirectly provide a detectable signal. Examples of labels include radioactive isotopes, fluorescent molecules, biotin, and the like.

본 발명에서 "융합유전자를 측정하는 제제"는 대장암을 진단하기 위하여 생물학적 시료에서 대장암 마커 유전자인 본 발명의 융합유전자 존재 자체를 확인하는 과정으로, 예를 들어, 상기 융합유전자에 특이적으로 결합하는 프로브 또는 프라이머를 포함하는 것일 수 있다. 특히, 상기 융합유전자에 특이적으로 결합하는 프로브는 표 8의 특징을 갖는 프로브를 사용할 수 있으나, 이에 제한되지 않는다. 본 발명의 구체적인 실시예에서는 Abnova사에서 판매하는 NTRK1 Split FISH Probe( Cat# FS0024, Taiwan)의 프로브를 사용하여 측정하였다.
In the present invention, "agent for measuring a fusion gene" is a process for identifying the presence of a fusion gene of the present invention, which is a colon cancer marker gene, in a biological sample to diagnose colon cancer. For example, Or a probe or primer that binds to the target. In particular, probes that specifically bind to the fusion gene can use probes having the characteristics of Table 8, but are not limited thereto. In a specific example of the present invention, a probe of the NTRK1 Split FISH Probe (Cat # FS0024, Taiwan) marketed by Abnova was used.

본 발명에서 “단백질 발현수준 측정”이란 대장암을 진단하기 위하여 생물학적 시료에서 대장암 마커 유전자인 본 발명의 융합유전자로부터 발현된 단백질의 존재 여부와 발현 정도를 확인하는 과정으로, 바람직하게는, 상기 융합유전자의 단백질에 대하여 특이적으로 결합하는 항체를 이용하여 단백질의 양을 확인할 수 있다. In the present invention, " measurement of protein expression level " is a process for confirming the presence and expression level of a protein expressed from a fusion gene of the present invention, which is a colon cancer marker gene, in a biological sample in order to diagnose colon cancer, The amount of the protein can be confirmed using an antibody that specifically binds to the protein of the fusion gene.

특히, 본 발명에서 상기 융합유전자의 단백질에 대하여 특이적으로 결합하는 항체는 LMNA 또는 TPM3와 NTRK1의 융합위치 근방 또는 NTRK1의 C 말단에 결합하는 것일 수 있으나, 이에 제한되지 않는다. 이를 위한 분석 방법으로는 웨스턴 블랏, 엘라이자(enzyme linked immunosorbent assay,ELISA), 방사선면역분석(RIA: Radioimmunoassay), 방사 면역 확산법(radioimmunodiffusion), 오우크테로니(Ouchterlony) 면역 확산법, 로케트(rocket) 면역전기영동, 조직면역 염색, 면역침전 분석법(Immunoprecipitation Assay), 보체 고정 분석법(Complement Fixation Assay), 유세포분석(Fluorescence Activated Cell Sorter, FACS), 단백질 칩(protein chip) 등이 있으나 이로 제한되는 것은 아니다. In particular, the antibody that specifically binds to the fusion gene of the present invention may bind to the vicinity of the fusion site of LMNA or TPM3 with NTRK1 or the C-terminus of NTRK1, but is not limited thereto. Methods for this analysis include Western blot, enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA), radioimmunodiffusion, Ouchterlony immunodiffusion, rocket, But are not limited to, immunoelectrophoresis, immunohistochemistry, immunoprecipitation assays, complement fixation assays, fluorescence activated cell sorters (FACS), protein chips, and the like. .

바람직하게는, 상기 단백질의 수준을 측정하는 제제는 상기 단백질에 특이적인 항체를 포함하는 것인 대장암 진단용 조성물일 수 있다.Preferably, the agent for measuring the level of the protein comprises an antibody specific for the protein.

본 발명에서, “항체”란 항원성 부위에 대해서 지시되는 특이적인 단백질 분자를 의미한다. 본 발명의 목적상, 항체는 마커 단백질에 대해 특이적으로 결합하는 항체를 의미하며, 다클론 항체, 단클론 항체 및 재조합 항체를 모두 포함한다.
In the present invention, " antibody " means a specific protein molecule directed against an antigenic site. For purposes of the present invention, an antibody refers to an antibody that specifically binds to a marker protein and includes both polyclonal antibodies, monoclonal antibodies, and recombinant antibodies.

상기한 바와 같이 본 발명에서는 신규한 대장암 마커 융합단백질이 규명되었으므로, 이를 이용하여 항체를 생성하는 것은 당업계에 널리 공지된 기술을 이용하여 용이하게 제조할 수 있다. As described above, since the novel colon cancer marker fusion protein has been identified in the present invention, the antibody can be easily produced using techniques well known in the art.

다클론 항체는 상기한 대장암 마커 단백질 항원을 동물에 주사하고 동물로부터 채혈하여 항체를 포함하는 혈청을 수득하는 당업계에 널리 공지된 방법에 의해 생산할 수 있다. 이러한 다클론 항체는 염소, 토끼, 양, 원숭이, 말, 돼지, 소 개 등의 임의의 동물 종 숙주로부터 제조 가능하다. Polyclonal antibodies can be produced by methods well known in the art for obtaining serum containing antibodies by injection of the colon cancer marker protein antigen into the animal and collection from the animal. Such polyclonal antibodies can be prepared from any animal species host, such as goats, rabbits, sheep, monkeys, horses, pigs, small dogs, and the like.

단클론 항체는 당업계에 널리 공지된 하이브리도마 방법(hybridoma method)(Kohler 및 Milstein (1976) European Jounral of Immunology 6:511-519 참조), 또는 파지 항체 라이브러리(Clackson et al, Nature, 352:624-628, 1991; Marks et al, J. Mol. Biol., 222:58, 1-597, 1991) 기술을 이용하여 제조될 수 있다. 상기 방법으로 제조된 항체는 겔 전지영동, 투석, 염 침전, 이온교환 크로마토그래피, 친화성 크로마토그래피 등의 방법을 이용하여 분리, 정제할 수 있다. Monoclonal antibodies may be obtained from the hybridoma method (see Kohler and Milstein (1976) European Jounal of Immunology 6: 511-519), or the phage antibody library (Clackson et al, Nature, 352: 624 Biol., 222: 58, 1-597, 1991) techniques. The antibody prepared by the above method can be isolated and purified by gel electrophoresis, dialysis, salt precipitation, ion exchange chromatography, affinity chromatography, and the like.

또한 본 발명의 항체는 2개의 전체 길이의 경쇄 및 2개의 전체 길이의 중쇄를 가지는 완전한 형태뿐만 아니라, 항체 분자의 기능적인 단편을 포함한다. 항체 분자의 기능적인 단편이란 적어도 항원 결합 기능을 보유하고 있는 단편을 뜻하며, Fab, F(ab'), F(ab') 2 및 Fv 등이 있다.
The antibodies of the present invention also include functional fragments of antibody molecules as well as complete forms with two full-length light chains and two full-length heavy chains. A functional fragment of an antibody molecule refers to a fragment having at least an antigen binding function, and includes Fab, F (ab ') 2, F (ab') 2 and Fv.

본 발명의 또 다른 양태에서는, 본 발명에 따른 상기 대장암 진단용 조성물을 포함하는 대장암 진단용 키트를 제공한다. In another embodiment of the present invention, there is provided a kit for colon cancer diagnosis comprising the composition for diagnosing colorectal cancer according to the present invention.

구체적으로, 상기 대장암 진단용 키트는 역전자 중합효소-PCR(RT-PCR) 키트, DNA 칩 키트 또는 단백질 칩 키트일 수 있다. 특히, 상기 대장암 진단용 키트는 분석 방법에 적합한 한 종류 또는 그 이상의 다른 구성성분 조성물, 용액 또는 장치를 더 포함하여 구성될 수 있다. Specifically, the colorectal cancer diagnostic kit may be a reverse-electron polymerase-PCR (RT-PCR) kit, a DNA chip kit, or a protein chip kit. In particular, the colon cancer diagnostic kit may further comprise one or more other component compositions, solutions or devices suitable for the assay method.

구체적인 일 양태로서, 상기 진단용 키트는 역전사 중합효소반응을 수행하기 위해 필요한 필수 요소를 포함하는 것을 특징으로 하는 진단용 키트일 수 있다. 역전사 중합효소반응 키트는 마커 유전자에 대한 특이적인 각각의 프라이머 쌍을 포함한다. 프라이머는 각 마커 유전자의 핵산서열에 특이적인 서열을 가지는 뉴클레오타이드로서, 약 7 bp 내지 50 bp의 길이, 보다 바람직하게는 약 10 bp 내지 30 bp 의 길이이다. 또한 대조군 유전자의 핵산 서열에 특이적인 프라이머를 포함할 수 있다. 그 외 역전사 중합효소반응 키트는 테스트 튜브 또는 다른 적절한 컨테이너, 반응 완충액(pH 및 마그네슘 농도는 다양), 데옥시뉴클레오타이드(dNTPs), Taq-폴리머라아제 및 역전사효소와 같은 효소, DNAse, RNAse 억제제 DEPC-수(DEPC-water), 멸균수 등을 포함할 수 있다. In a specific embodiment, the diagnostic kit may be a diagnostic kit characterized by comprising essential elements necessary for performing a reverse transcription-polymerase reaction. The RT-PCR kit contains the respective primer pairs specific for the marker gene. The primer is a nucleotide having a sequence specific to the nucleic acid sequence of each marker gene, and has a length of about 7 bp to 50 bp, more preferably about 10 bp to 30 bp. It may also contain a primer specific for the nucleic acid sequence of the control gene. Other reverse transcription polymerase reaction kits may be used in combination with test tubes or other appropriate containers, reaction buffers (varying in pH and magnesium concentration), deoxynucleotides (dNTPs), enzymes such as Taq polymerase and reverse transcriptase, DNAse, RNAse inhibitor DEPC DEPC-water, sterile water, and the like.

또 다른 양태로는, 바람직하게 DNA 칩을 수행하기 위해 필요한 필수 요소를 포함하는 것을 특징으로 하는 진단 키트일 수 있다. DNA 칩 키트는 유전자 또는 그의 단편에 해당하는 cDNA 또는 올리고뉴클레오티드 (oligonucleotide)가 부착되어 있는 기판, 및 형광표식 프로브를 제작하기 위한 시약, 제제, 효소 등을 포함할 수 있다. 또한 기판은 대조군 유전자 또는 그의 단편에 해당하는 cDNA 또는 올리고뉴클레오티드를 포함할 수 있다. In another embodiment, it may be a diagnostic kit, preferably comprising an essential element necessary for carrying out a DNA chip. The DNA chip kit may include a substrate to which a cDNA or oligonucleotide corresponding to a gene or a fragment thereof is attached, and reagents, preparations, enzymes, and the like for producing a fluorescent-labeled probe. The substrate may also comprise a cDNA or oligonucleotide corresponding to a control gene or fragment thereof.

또한 바람직하게는, 단백질 칩을 수행하거나, ELISA를 수행하기 위해 필요한 필수 요소를 포함하는 것을 특징으로 하는 진단 키트일 수 있다. 단백질 칩 키트나 ELISA 키트는 마커 단백질에 대한 특이적인 항체를 포함한다. 항체는 각 마커 단백질에 대한 특이성 및 친화성이 높고 다른 단백질에 대한 교차 반응성이 거의 없는 항체로, 단클론 항체, 다클론 항체 또는 재조합 항체이다. 또한 ELISA 키트는 대조군 단백질에 특이적인 항체를 포함할 수 있다. 그 외 ELISA 키트는 결합된 항체를 검출할 수 있는 시약, 예를 들면, 표지된 2차 항체, 발색단(chromophores), 효소(예: 항체와 컨주게이트됨) 및 그의 기질 또는 항체와 결합할 수 있는 다른 물질 등을 포함할 수 있다.
Also preferably, it may be a diagnostic kit characterized by comprising essential elements necessary for performing a protein chip or conducting an ELISA. Protein chip kits or ELISA kits contain antibodies specific for the marker protein. Antibodies are monoclonal antibodies, polyclonal antibodies or recombinant antibodies with high specificity and affinity for each marker protein and little cross reactivity to other proteins. The ELISA kit may also include antibodies specific for the control protein. Other ELISA kits can be used to detect antibodies that can bind a reagent capable of detecting the bound antibody, such as a labeled secondary antibody, chromophores, an enzyme (e. G., Conjugated to an antibody) Other materials, and the like.

또 하나의 양태로서, 본 발명은 대장암 진단에 필요한 정보를 제공하기 위하여, NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계를 포함하는 대장암 진단에 필요한 정보를 제공하는 방법을 제공한다. 구체적으로, 대장암이 의심되는 개체의 분리된 시료로부터 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계; 및 상기 측정된 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준을 정상 대조군 시료의 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준과 비교하는 단계를 포함하는, 대장암 진단에 필요한 정보를 제공하는 방법을 제공한다.In another aspect, the present invention provides a method for providing information necessary for diagnosing colorectal cancer, comprising the step of measuring the level of the NTRK1 fusion gene, its transcription product or its protein in order to provide information necessary for diagnosis of colon cancer to provide. Specifically, measuring the level of the NTRK1 fusion gene, its transcription product or its protein from a separate sample of individuals suspected of having colon cancer; And comparing the measured expression level of the fusion gene, its transcription product or its protein with a fusion gene of a normal control sample, its transcription product, or its protein expression level. to provide.

본 발명에서 상기 NTRK1 융합유전자는 LMNA 유전자 또는 TPM3 유전자의 5' 말단을 구성하는 폴리뉴클레오티드; 및 NTRK1 유전자의 3' 말단을 구성하는 폴리뉴클레오티드일 수 있다.
In the present invention, the NTRK1 fusion gene may be a polynucleotide constituting the 5 'end of the LMNA gene or the TPM3 gene; And a polynucleotide that constitutes the 3 ' end of the NTRK1 gene.

본 발명에서 "개체"란 대장암이 생길 수 있는 동물로, 대장암이 의심되는 한 제한되지 않는다. 또한, 상기 방법을 통하여 진단에 필요한 정보를 얻을 수 있는 질환으로는 TPM3 유전자 또는 LMNA 유전자와 NTRK1 유전자와의 융합유전자의 발현, 또는 NTRK1 유전자의 융합에 의존적으로 NTRK1 유전자의 발현이 인정되는 암종이라면 제한되지 않으며, 특히 대장암일 수 있다.
In the present invention, the term "individual" refers to an animal in which colon cancer can occur, so long as colon cancer is suspected. Examples of the disease that can obtain the information necessary for diagnosis through the above method include a TPM3 gene, a fusion gene of the LMNA gene and the NTRK1 gene, or a carcinoma whose expression of the NTRK1 gene is dependent on the fusion of the NTRK1 gene Especially colon cancer.

본 발명에서 용어, 개체의 시료란 대장암 마커인 LMNA-NTRK1 또는 TPM3-NTRK1 융합유전자의 mRNA 또는 이의 단백질의 수준이 차이나는 조직, 예를 들면, 세포, 조직, 장기, 체액(혈액, 림프액 등), 소화액, 객담, 폐포-기관지 세정액, 뇨, 변뿐만 아니라, 이들의 생체 시료로부터 얻어지는 핵산 추출물(유전체 추출물, 전사체 추출물, 전사체 추출물로부터 조제된 cDNA 조제물이나 aRNA 조제물 등)이나 단백질 추출물도 포함등을 포함하며, 이에 제한되지 않는다. 한다. 또한, 상기 시료는, 포르말린 고정 처리, 알코올 고정 처리, 동결 처리 또는 파라핀 포매 처리가 실시되어 있는 것이어도 된다. 또한, 유전자, 전사산물, cDNA 또는 단백질은, 상기 시료의 종류 및 상태 등을 고려하여, 거기에 적합한 공지의 수법을 선택하여 조제하는 것이 가능하다.
In the present invention, the term "sample of an individual" refers to a tissue in which the level of the mRNA of the LMNA-NTRK1 or TPM3-NTRK1 fusion gene, which is a colon cancer marker, or the protein level thereof differs in cells, tissues, organs, body fluids (DNA extracts, transcript extracts, cDNA preparations prepared from transcript extracts or aRNA preparations, etc.) obtained from these biological samples as well as digestive fluids, sputum, alveolar-bronchial washing liquids, urine, Extracts, and the like, but are not limited thereto. do. The sample may be subjected to a formalin fixing treatment, an alcohol fixing treatment, a freezing treatment, or a paraffin embedding treatment. In addition, the gene, transcription product, cDNA or protein can be prepared by selecting a well-known method appropriate for the type and condition of the sample.

상기 검출 방법들을 통하여, 정상 대조군에서의 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 발현 수준을 대장암 의심 개체에서의 발현 수준과 비교함으로써 대장암 의심 개체의 실제 대장암 여부를 진단할 수 있다. 즉, 대장암으로 추측되는 세포에 대해서 본 발명의 마커의 발현 수준을 측정하고, 정상 세포에 대해서 본 발명의 마커의 발현 수준을 측정하여 양자를 비교한 후, 본 발명의 마커의 발현 수준이 정상 세포의 것보다 대장암으로 추측되는 세포 유래에서 더 많이 발현되면 대장암으로 추측되는 세포를 대장암으로 판단할 수 있는 것이다. 즉, 상기 방법에서 대장암이 의심되는 개체의 분리된 시료로부터 측정된 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준이 정상 대조군 시료의 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준보다 높을 경우, 대장암으로 진단하는 것을 특징으로 할 수 있다.
Through the above detection methods, the expression level of the NTRK1 fusion gene, its transcription product or its protein in the normal control group can be compared with the expression level in suspected colon cancer, thereby diagnosing the actual colon cancer of the suspected colon cancer. That is, the expression level of the marker of the present invention was measured for cells suspected to be colon cancer, the expression level of the marker of the present invention was measured for normal cells, and then the expression level of the marker of the present invention was determined to be normal If cells are more expressed in a cell-like state than in a cell, it is possible to judge a cell that is supposed to be a colorectal cancer to be a colon cancer. That is, when the fusion gene, the transcription product thereof, or the protein expression level thereof measured from the separated sample of the subject suspected of having colorectal cancer is higher than the fusion gene of the normal control sample, the transcription product thereof or the protein expression thereof, Cancer. &Lt; / RTI >

LMNA-NTRK1 또는 TPM3-NTRK1 융합유전자의 mRNA 또는 이의 단백질의 수준을 측정하기 위한 분석 방법으로는, 웨스턴 블랏, ELISA, 방사선면역분석, 방사 면역 확산법, 오우크테로니 면역 확산법, 로케트 면역전기영동, 조직면역 염색, 면역침전 분석법, 보체 고정 분석법, FACS, 단백질 칩 등이 있으나 이로 제한되는 것은 아니다. 상기 분석 방법들을 통하여, 정상 대조군에서의 항원-항체 복합체의 형성량과 대장암 의심 개체에서의 항원-항체 복합체의 형성량을 비교할 수 있고, 대장암 마커 유전자에서 LMNA-NTRK1 또는 TPM3-NTRK1 융합단백질으로의 유의한 발현량의 증가여부를 판단하여, 대장암 의심 환자의 실제 대장암 발병 여부를 진단할 수 있다.
Analysis methods for measuring the level of mRNA or a protein of the fusion gene of LMNA-NTRK1 or TPM3-NTRK1 include Western blotting, ELISA, radioimmunoassay, radioimmunoprecipitation, Ouchteroni immunodiffusion, rocket immunoelectrophoresis, Tissue immuno staining, immunoprecipitation assays, complement fixation assays, FACS, protein chips, and the like. Through the above analysis methods, it is possible to compare the amount of the antigen-antibody complex formed in the normal control group with the amount of the antigen-antibody complex formed in the suspected colon cancer, and the LMNA-NTRK1 or TPM3-NTRK1 fusion protein , And thus it is possible to diagnose the incidence of colon cancer in patients with suspected colon cancer.

본 발명에서 용어 항원-항체 복합체란 대장암 마커 단백질과 이에 특이적인 항체의 결합물을 의미하고, 항원-항체 복합체의 형성 량은 검출 라벨(detection label)의 시그널의 크기를 통해서 정량적으로 측정 가능하다.
In the present invention, the term antigen-antibody complex refers to a combination of a colon cancer marker protein and an antibody specific thereto, and the amount of the antigen-antibody complex formed can be quantitatively measured through the size of a signal of a detection label .

이러한 검출 라벨은 효소, 형광물, 리간드, 발광물, 미소입자(microparticle), 레독스 분자 및 방사선 동위원소로 이루어진 그룹중에서 선택할 수 있으며, 반드시 이로 제한되는 것은 아니다. 검출 라벨로서 효소가 사용되는 경우 이용 가능한 효소에는 β-글루쿠로니다제, β-D-글루코시다제, β-D-갈락토시다제, 우레아제, 퍼옥시다아제 또는 알칼라인 포스파타아제, 아세틸콜린에스테라제, 글루코즈 옥시다제, 헥소키나제와 GDPase, RNase, 글루코즈 옥시다제와 루시페라제, 포스포프럭토키나제, 포스포에놀피루베이트 카복실라제, 아스파르테이트 아미노트랜스페라제, 포스페놀피루베이트 데카복실라제, β-라타마제 등이 있으며 이로 제한되지 않는다. 형광물에는 플루오레신, 이소티오시아네이트, 로다민, 피코에리테린, 피코시아닌, 알로피코시아닌, o-프탈데히드, 플루오레스카민 등이 있으며 이로 제한되지 않는다. 리간드에는 바이오틴 유도체 등이 있으며 이로 제한되지 않는다. 발광물에는 아크리디늄 에스테르, 루시페린, 루시퍼라아제 등이 있으며 이로 제한되지 않는다. 미소입자에는 콜로이드 금, 착색된 라텍스 등이 있으며 이로 제한되지 않는다. 레독스 분자에는 페로센, 루테늄 착화합물, 바이올로젠, 퀴논, Ti 이온, Cs 이온, 디이미드, 1,4-벤조퀴논, 하이드로퀴논, K4 W(CN)8 , [Os(bpy) 3 ] 2+ ,[RU(bpy) 3 ] 2+, [MO(CN) 8 ] 4- 등이 포함되며 이로 제한되지 않는다. 방사선동위원소에는 3H, 14C, 32P, 35S, 36Cl, 51Cr, 57Co, 58Co, 59Fe, 90Y, 125I, 131I , 186Re 등이 포함되며 이로 제한되지 않는다.
Such detection labels may be selected from the group consisting of enzymes, chromophores, ligands, emitters, microparticles, redox molecules, and radioisotopes, but are not necessarily limited thereto. When an enzyme is used as the detection label, available enzymes include? -Glucuronidase,? -D-glucosidase,? -D-galactosidase, urease, peroxidase or alkaline phosphatase, acetylcholine Glucoamylase, terazo, glucose oxidase, hexokinase and GDPase, RNase, glucose oxidase and luciferase, phosphofructokutase, phosphoenolpyruvate carboxylase, aspartate aminotransferase, phosphoenolpyruvate decar &Lt; / RTI > beta-lactamase, and the like. The minerals include, but are not limited to, fluorescein, isothiocyanate, rhodamine, picoeriterine, picocyanin, allophycocyanin, o-phthaldehyde, fluororescamine and the like. Ligands include, but are not limited to, biotin derivatives. Emitters include, but are not limited to, acridinium esters, luciferin, luciferase, and the like. Fine particles include, but are not limited to, colloidal gold, colored latex, and the like. Examples of the redox molecules include ferrocene, ruthenium complex, viologen, quinone, Ti ion, Cs ion, diimide, 1,4-benzoquinone, hydroquinone, K4 W (CN) 8, [Os (bpy) [RU (bpy) 3] 2+, [MO (CN) 8] 4-, and the like. Radiation isotopes include, but are not limited to, 3H, 14C, 32P, 35S, 36Cl, 51Cr, 57Co, 58Co, 59Fe, 90Y, 125I, 131I, 186Re.

단백질 발현수준 측정은 바람직하게는, ELISA법을 이용하는 것이다. ELISA는 고체 지지체에 부착된 항원을 인지하는 표지된 항체를 이용하는 직접적 ELISA, 고체 지지체에 부착된 항원을 인지하는 항체의 복합체에서 포획 항체를 인지하는 표지된 항체를 이용하는 간접적 ELISA, 고체 지지체에 부착된 항체와 항원의 복합체에서 항원을 인지하는 표지된 또 다른 항체를 이용하는 직접적 샌드위치 ELISA, 고체 지지체에 부착된 항체와 항원의 복합체에서 항원을 인지하는 또 다른 항체와 반응시킨 후 이 항체를 인지하는 표지된 2차 항체를 이용하는 간접적 샌드위치 ELISA 등 다양한 ELISA 방법을 포함한다. 보다 바람직하게는, 고체 지지체에 항체를 부착시키고 시료를 반응시킨 후 항원-항체 복합체의 항원을 인지하는 표지된 항체를 부착시켜 효소 적으로 발색시키거나 항원-항체 복합체의 항원을 인지하는 항체에 대해 표지된 2차 항체를 부착시켜 효소 적으로 발색시키는 샌드위치 ELISA 방법에 의해서 검출한다. 대장암 마커 단백질과 항체의 복합체 형성 정도를 확인하여, 대장암 발병 여부를 확인할 수 있다.
Measurement of protein expression levels is preferably performed using an ELISA method. ELISAs include direct ELISA using labeled antibodies that recognize the antigen attached to the solid support, indirect ELISA using labeled antibodies that recognize the capture antibody in a complex of antibodies recognizing the antigen attached to the solid support, A direct sandwich ELISA using another labeled antibody that recognizes an antigen in the complex of antibody and antigen, a method of reacting with another antibody recognizing an antigen in a complex of an antibody and an antigen attached to a solid support, Indirect sandwich ELISA using a secondary antibody, and various ELISA methods. More preferably, the antibody is attached to a solid support, the sample is reacted, and the labeled antibody recognizing the antigen of the antigen-antibody complex is adhered to produce an enzyme, or an antibody that recognizes the antigen of the antigen-antibody complex Is detected by a sandwich ELISA method in which a labeled secondary antibody is attached and the enzyme is developed. By checking the degree of complex formation between the colon cancer marker protein and the antibody, it can be confirmed whether or not the colon cancer has developed.

또한, 바람직하게는, 상기 대장암 마커에 대한 하나 이상의 항체를 이용한 웨스턴 블랏을 이용하는 것이다. 시료에서 전체 단백질을 분리하고, 이를 전기영동하여 단백질을 크기에 따라 분리한 다음, 니트로셀룰로즈 막으로 이동시켜 항체와 반응시킨다. 생성된 항원-항체 복합체의 양을 표지된 항체를 이용하여 확인하는 방법으로 유전자의 발현에 의해 생성된 단백질의 양을 확인하여, 대장암 발병 여부를 확인할 수 있다. 상기 검출 방법은 대조군에서의 마커 유전자의 발현 량과 대장암이 발병한 세포에서의 마커 유전자의 발현 량을 조사하는 방법으로 이루어진다. 펩티드 단편의 수준은 상기한 마커 단백질의 절대적(예: ㎍/㎖) 또는 상대적(예: 시그널의 상대 강도) 차이로 나타낼 수 있다.
Further, preferably, Western blotting using one or more antibodies against the colon cancer marker is used. The whole protein is separated from the sample, and the protein is separated according to size by electrophoresis, and then transferred to the nitrocellulose membrane to react with the antibody. The amount of the protein produced by expression of the gene can be confirmed by confirming the amount of the generated antigen-antibody complex using the labeled antibody, thereby confirming whether or not the colon cancer has occurred. The detection method comprises a method of examining the expression level of a marker gene in a control group and the expression level of a marker gene in a cell in which colon cancer has developed. The level of the peptide fragment may be expressed as the absolute (e.g., [mu] g / ml) or relative (e.g., relative intensity of the signal) difference of the marker protein.

또한, 바람직하게는, 상기 대장암 마커에 대한 하나 이상의 항체가 기판 위의 정해진 위치에 배열되어 고밀도로 고정화되어 있는 단백질 칩을 이용하는 것이다. 단백질 칩을 이용하여 시료를 분석하는 방법은, 시료에서 단백질을 분리하고, 분리한 단백질을 단백질 칩과 혼성화시켜서 항원-항체 복합체를 형성시키고, 이를 판독하여, 단백질의 존재 또는 발현 정도를 확인하여, 대장암 발병 여부를 확인할 수 있다.
Preferably, one or more antibodies against the colon cancer markers are arranged at predetermined positions on the substrate to use protein chips immobilized at a high density. A method of analyzing a sample using a protein chip is a method of separating a protein from a sample, hybridizing the separated protein with a protein chip to form an antigen-antibody complex, reading the protein, It is possible to confirm whether or not the colon cancer develops.

구체적으로, 본 발명의 구체적인 대장암 의심 개체의 임상 시료에서 상기 융합단백질, 융합유전자 또는 전사산물을 검출하기 위하여 (a) 상기 임상 시료에 상기 융합단백질, 이를 암호화하는 융합유전자 및 상기 융합유전자에 상응하는 전사산물로 이루어진 군에서 선택된 1종 이상과 상호작용하는 물질을 처리(첨가)하고 반응시키는 단계; 및 (b) 상기 과정을 통해 얻어진 반응물을 검출하는 단계를 포함할 수 있다. 상기 단계 (a) 이전에 임상 시료를 준비하는 단계를 추가로 포함할 수 있으며, 상기 임상 시료를 준비하는 단계는 대장암 의심 개체로부터 임상 시료를 얻는(분리하는) 단계를 포함하는 것일 수 있다. 상기 단계 (a)에서 상기 상호작용하는 물질은 상기 융합단백질(전부 또는 일부; 예컨대 융합 부위)에 특이적으로 결합하는 화합물, 항체, 앱타머 및 상기 융합유전자 또는 전사산물(전부 또는 일부; 예컨대 융합 부위)에 결합하는 핵산 분자(예컨대, 프라이머, 프로브, 앱타머 등), 화합물 등으로 이루어진 군에서 선택된 1종 이상일 수 있다. 이들 상호작용 하는 물질은 자유 래디컬, 방사성-동위원소, 형광염료, 발색기질, 효소, 박테리오파지, 조효소 등으로 이루어진 군에서 선택된 1종 이상의 면역화학적 표지 물질이 부착되거나, 상기 표지 물질과 함께 사용될 수 있다. 상기 단계 (b)에서, 상기 반응물은 단계 (a)에서 얻어진 상기 융합단백질, 융합유전자, 및 전사산물로 이루어진 군에서 선택된 1종 이상과 이와 상호작용하는 물질이 상호작용(결합)하여 생성된 복합체(complex)일 수 있다. 상기 반응물을 검출하는 단계에서 반응물이 검출되면 상기 융합단백질, 융합유전자 또는 전사산물이 존재하는 것으로 판단할 수 있다.
Specifically, in order to detect the fusion protein, fusion gene or transcription product in a clinical specimen of a suspected individual of a colorectal cancer of the present invention, it is preferable that (a) the fusion protein, the fusion gene encoding the fusion protein, (Add) and react with a substance that interacts with at least one selected from the group consisting of transcription products, And (b) detecting the reactant obtained through the process. The method may further include preparing a clinical sample before the step (a), and the step of preparing the clinical sample may include obtaining (separating) a clinical sample from a suspected colon cancer patient. In step (a), the interacting substance may be a compound, an antibody, an aptamer and a fusion gene or a transcription product (all or a part of the fusion protein, such as fusion Nucleic acid molecules (for example, primers, probes, aptamers, etc.) that bind to the target nucleic acid, and compounds. These interacting substances can be attached to, or used in conjunction with, one or more immunochemical labeling substances selected from the group consisting of free radicals, radioactive isotopes, fluorescent dyes, chromogenic substrates, enzymes, bacteriophages, . In the step (b), the reactant is a complex formed by interaction (binding) of at least one member selected from the group consisting of the fusion protein, the fusion gene, and the transcription product obtained in step (a) or complex. When the reactant is detected in the step of detecting the reactant, it can be determined that the fusion protein, the fusion gene or the transcription product is present.

NTRK1 융합단백질을 암호화하는 융합유전자 또는 이에 상응하는 전사산물과 상호작용하는 물질은 상기 융합유전자 또는 전사산물과 혼성화가 가능한 핵산 분자일 수 있다. 예컨대, 상기 핵산 분자는 임상 시료 내의 상기 융합유전자, 상기 융합유전자의 융합 부위 또는 상기 융합유전자 또는 융합 부위에 상응하는 전사산물과 특이적으로 혼성화가 가능한 안티센스 올리고뉴클레오타이드(예컨대, siRNA, 마이크로RNA 등), 프로브(예컨대, 5 내지 100 bp, 5 내지 50bp, 5 내지 30bp, 또는 5 내지 25bp), 앱타머 등으로 이루어진 군에서 선택된 1종 이상일 수 있다. 예컨대, 상기 융합단백질을 암호화하는 융합유전자 또는 융합유전자에 상응하는 전사산물 분자와 혼성화가 가능한 핵산 분자는 상기 융합유전자 내의 융합 부위를 포함하는 연속하는 50 내지 250개, 구체적으로 100 내지 200개의 염기로 이루어진 폴리뉴클레오타이드 단편을 증폭할 수 있도록 상기 폴리뉴클레오타이드 단편의 양 말단에 인접하는 20 내지 100개, 구체적으로 25 내지 50개의 염기서열 또는 이와 상보적인 서열과 혼성화가 가능한 20 내지 100bp, 또는 25 내지 50bp 길이의 프라이머쌍일 수 있다. 상기 '혼성화가 가능하다'함은 검출 대상 염기서열과 완전히 상보적이거나, 80% 이상(예컨대 80-100%), 구체적으로 90% 이상(예컨대 90-100%) 상보적인 염기서열을 가져서 상기 검출 대상 유전자 또는 전사산물과 특이적으로 결합 가능함을 의미한다.
A substance that interacts with a fusion gene encoding NTRK1 fusion protein or a corresponding transcription product may be a nucleic acid molecule capable of hybridizing with the fusion gene or transcription product. For example, the nucleic acid molecule may be an antisense oligonucleotide (e.g., siRNA, microRNA, etc.) that can specifically hybridize with the fusion gene in the clinical sample, the fusion site of the fusion gene, or the transcription product corresponding to the fusion gene or fusion site, , A probe (for example, 5 to 100 bp, 5 to 50 bp, 5 to 30 bp, or 5 to 25 bp), aptamer, and the like. For example, a fusion gene encoding the fusion protein, or a nucleic acid molecule capable of hybridizing with a transcription product molecule corresponding to the fusion gene, is fused to a continuous 50 to 250, particularly 100 to 200 bases containing a fusion site in the fusion gene 20 to 100 bp, or 25 to 50 bp in length, capable of hybridizing with 20 to 100, particularly 25 to 50, or complementary sequences adjacent to both ends of the polynucleotide fragment so as to amplify the polynucleotide fragment made, Lt; / RTI > The 'hybridization is possible' means that the base sequence is completely complementary to the base sequence to be detected, or has a complementary base sequence of 80% or more (for example, 80-100%), specifically 90% or more (for example, 90-100% Means that it can specifically bind to the gene or transcription product of interest.

또 다른 예에서, NTRK1 융합유전자 또는 이의 전사산물과 특이적으로 상호작용하는 물질은 자유 래디컬, 방사성-동위원소, 형광염료, 발색기질, 효소, 박테리오파지, 조효소 등으로 이루어진 군에서 선택된 1종 이상의 면역화학적 표지 물질이 부착되거나, 상기 표지 물질과 함께 사용될 수 있다. 예컨대, 각 융합유전자에 혼성 가능한 형광직접접합법(fluorescence in situ hybridization)용 프로브와 연쇄중합반응(polymerase chain reaction)용 프라이머쌍을 이용할 수 있다(표 8 및 표 9).
In another example, a substance that specifically interacts with the NTRK1 fusion gene or a transcription product thereof may be one or more immunogens selected from the group consisting of free radicals, radioactive isotopes, fluorescent dyes, chromogenic substrates, enzymes, bacteriophages, A chemical labeling substance may be attached or used together with the labeling substance. For example, a probe for fluorescence in situ hybridization capable of hybridizing with each fusion gene and a pair of primers for a polymerase chain reaction can be used (Table 8 and Table 9).

또 다른 예에서, NTRK1 융합단백질의 검출은 상기 융합단백질 또는 대장암에서 NTRK1 유전자의 융합에 의존적으로 발현하는 NTRK1 단백질과 상호작용하는 물질, 예컨대 특이적으로 결합하는 물질(예컨대, 항체, 앱타머 또는 화합물 등)을 이용하여 상기 융합단백질과 상기 물질(예컨대, 항체 또는 앱타머)과의 상호작용, 즉 복합체 형성을 검출하는 통상적인 에세이법, 예컨대, 면역크로마토그래피(immunochromatography), 면역조직학염색(immunohistochemistry), 효소결합 면역흡착 분석(enzyme linked immunosorbent assay), 방사선 면역측정법(radioimmunoassay), 효소 면역분석(enzyme immunoassay), 형광면역분석(fluorescence immunoassay), 발광면역분석(luminescence immunoassay), 웨스턴블라팅(western blotting) 및 유세포분석(flow cytometry cell sorting) 등에 의하여 검출할 수 있다.
In yet another example, detection of a NTRK1 fusion protein may be accomplished by contacting a substance that interacts with the NTRK1 protein that is expressed in a fusion protein or a colon cancer with the NTRK1 gene in a fusion-dependent manner, such as a specifically binding agent (e.g., an antibody, (E. G., Immunochromatography, immunohistochemistry (e. G., Immunohistochemistry) to detect the interaction, i. E. Complex formation, of the fusion protein with the material ), Enzyme linked immunosorbent assay, radioimmunoassay, enzyme immunoassay, fluorescence immunoassay, luminescence immunoassay, Western blotting blotting and flow cytometry cell sorting.

또 하나의 양태로서, 본 발명은 대장암 환자의 분리된 시료로부터 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계를 포함하는 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법을 제공한다.In another aspect, the present invention provides a method for providing information necessary for predicting colon cancer therapeutic prognosis, comprising the step of measuring the level of the NTRK1 fusion gene, its transcription product or its protein from a separate sample of colon cancer patients to provide.

구체적으로, 본 발명은 대장암 환자의 분리된 시료로부터 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계; 및 상기 측정된 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준을 정상 대조군 시료의 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준과 비교하는 단계를 포함하는, 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법Specifically, the present invention provides a method for diagnosing cancer, comprising measuring the level of a NTRK1 fusion gene, a transcription product thereof, or a protein thereof, from a separate sample of a colon cancer patient; And comparing the measured expression level of the fusion gene, its transcription product, or its protein with a fusion gene of a normal control sample, a transcription product thereof or a protein expression level thereof, to provide information necessary for predicting colorectal cancer treatment prognosis How to

상기 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질 수준을 측정하는 단계는 상기 대장암 진단에 필요한 정보를 제공하는 방법에서 설명한 바와 거의 동일하다.The step of measuring the NTRK1 fusion gene, the transcription product thereof or the protein level thereof is almost the same as that described in the method for providing the information necessary for the diagnosis of colon cancer.

다만, 측정하는 시료가 대장암 의심 개체의 분리된 시료가 아닌 대장암 환자의 분리된 시료라는 측면과, 본 발명에서 정상 대조군이란 NTRK1 융합유전자가 발현되지 않은 정상군 및 대장암 환자군이라는 측면에서 차이가 있다.However, in the present invention, the sample to be measured is not a separate sample of suspected colon cancer but a separate sample of colon cancer patients. In the present invention, the normal control group is a normal group in which the NTRK1 fusion gene is not expressed, .

본 발명에서 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법은 대장암 환자의 분리된 시료로부터 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준이 정상 대조군 시료의 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준보다 높을 경우, 같거나 낮을 경우보다 대장암 치료 예후가 나쁠 것으로 예측하는 것을 특징으로 하는 것일 수 있다. 또한, NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준을 비교하는 것에는 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질 발현 여부를 확인하는 것이 포함된다. 즉, NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질 발현되지 않은 대장암 환자보다 발현된 대장암 환자가 대장암 치료 예후가 나쁠 것으로 예측하는 것을 포함한다.
In the present invention, a method for providing information necessary for predicting colorectal cancer treatment prognosis is a method in which a NTRK1 fusion gene, a transcription product thereof, or a protein expression level thereof from a separate sample of colon cancer patients is compared with a fusion gene of a normal control sample, Protein expression level is predicted to be worse than that of the same or lower than that of the protein expression level. Also, comparing the NTRK1 fusion gene, its transcription product, or its protein expression level includes confirming the expression of the NTRK1 fusion gene, its transcription product or its protein. That is, it is predicted that patients with colorectal cancer expressed from NTRK1 fusion gene, its transcription product, or a patient with no colorectal cancer expressing the protein will be inferior in colorectal cancer treatment prognosis.

또한, 본 발명에서 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법은 대장암 환자의 분리된 시료로부터 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준이 정상 대조군 시료의 융합유전자, 이의 전사산물 또는 이의 단백질 발현 수준보다 높을 경우, NTRK1 단백질 저해제에 의한 대장암 치료 유효성이 높다고 예측하는 것을 특징으로 하는 것일 수 있다.
In addition, the present invention provides a method for providing information necessary for predicting the prognosis of colorectal cancer, which comprises detecting a fusion gene, a transcription product thereof, or a protein expression level thereof from a separate sample of a colon cancer patient with a fusion gene, a transcription product thereof, Is higher than its protein expression level, it may be characterized that it is predicted that the NTRK1 protein inhibitor is highly effective for colon cancer treatment.

한편, 본 발명에서 대장암 치료 예후를 예측하는데 필요한 정보를 제공하는 방법은 대장암 환자의 분리된 세포에 NTRK1 융합단백질 저해제를 처리하여, NTRK1 융합유전자의 NTRK1 융합유전자, 이의 전사산물 또는 이의 단백질의 수준을 측정하는 단계를 포함하는 것일 수 있다.Meanwhile, in the present invention, a method for providing information necessary for predicting colorectal cancer treatment prognosis is a method of treating NTRK1 fusion protein inhibitor to isolated cells of colon cancer patients and detecting the NTRK1 fusion gene, its transcription product or its protein And measuring the level.

본 발명에 있어서 대장암 치료의 유효성을 평가하는 대상이 되는 'NTRK1 단백질 저해제'로서는, NTRK1 단백질의 기능을 직접적으로 또는 간접적으로 억제할 수 있는 물질이면 특별한 제한은 없다. NTRK1 단백질의 기능, 구체적으로는 티로신키아나제, 구체적으로는 기타의 티로신키아나제를 저해할 수 있는 물질이어도 된다. 본 발명에 적용할 수 있는 공지의 NTRK1 단백질 저해제로서는, 예를 들면, (5S,6S,8R)-6-하이드록시-6-(하이드록시메틸)-5-메틸-7,8,14,15-테트라하이드로-5H-16-옥사-4b,8a,14-트리아자-5,8-메타노이벤조[b,h]사이클로옥타[jkl]사이클로펜타[e]-아스-인다센-13(6H)-원(일반명: 레스타우르티닙(lestaurtinib): FLT3, JAK2, TRKA, TRKB 및 TRKC를 표적으로 하는 화합물), N-[5-((2R)-2-메톡시-2-페닐에타노일)-1,4,5,6-테트라하이드로피롤로[3,4-c]피파졸-3-일]-4-(4-메틸피페라진-1-일)벤자마이드(일반명: 다누세르팁(danusertib): ARK1, RET, ABL, FGFR1, TRKA를 표적으로 하는 화합물), N,1,4,4-테트라메일-8-((4-(4-메틸피페라진-1-일)페닐)아미노)-4,5-디하이드로-1H-피라졸로[4,3-h]퀴나졸린-3-카복사마이드 (일반명: 미르시클립(milciclib): CDK2, TRKA, TRKB, TRKC를 표적으로 하는 화합물), RXDX-101 (코드명: ROS1, ALK, TRKA를 표적으로 하는 화합물), TSR-011 (코드명: TRKA, TRKB, TRKC, ALK를 표적으로 하는 화합물), PLX-7486 (코드명: CSF1R, TRKA, TRKB, TRKC를 표적으로 하는 화합물), NMS-E628 (코드명: ROS1, JAK2, ALK, TRKA를 표적으로 하는 화합물), 코드명 AZD-7451의 화합물 (TRKA, TRKB, TRKC를 표적으로 하는 화합물), 코드명 NMS-P626의 화합물 (TRKA를 표적으로 하는 화합물), 코드명 RXDX-102의 화합물 (TRKA를 표적으로 하는 화합물), 코드명 ARRY-47의 화합물 (TRKA, TRKB, TRKC를 표적으로 하는 화합물)을 들 수 있다.
In the present invention, the 'NTRK1 protein inhibitor' to be evaluated for the effectiveness of colorectal cancer treatment is not particularly limited as long as it is a substance capable of directly or indirectly inhibiting the function of the NTRK1 protein. A substance capable of inhibiting the function of the NTRK1 protein, specifically, tyrosine kinase, specifically, tyrosine kinase. Examples of known NTRK1 protein inhibitors applicable to the present invention include (5S, 6S, 8R) -6-hydroxy-6- (hydroxymethyl) -Trihydro-5H-16-oxa-4b, 8a, 14-triaza-5,8-methanoibenzo [b, h] cycloocta [jkl] cyclopenta [e] ) - a generic name for lestaurtinib: a compound targeting FLT3, JAK2, TRKA, TRKB and TRKC), N- [5 - ((2R) -2- methoxy- 3-yl] -4- (4-methylpiperazin-1-yl) benzamide (generic name: Tannu 8- ((4- (4-methylpiperazin-1-yl) -1H-pyrazol-3-yl) 4-yl) quinazoline-3-carboxamide (generic name: milciclib: CDK2, TRKA, TRKB, TRKC) Target compounds), RXDX-101 (code name: ROS1, ALK, TRKA) (Compounds targeting TRKA, TRKB, TRKC, ALK), PLX-7486 (a compound targeting CSF1R, TRKA, TRKB and TRKC), NMS-E628 (A compound targeting TRKA, TRKB, TRKC), a compound having the codename NMS-P626 (a compound targeting TRKA), a compound having a nucleotide sequence of SEQ ID NO: (Compounds targeting TRKA), and compounds having the code name ARRY-47 (compounds targeting TRKA, TRKB, and TRKC).

본 발명의 구체적인 일 실시예에서는 NTRK1 융합유전자에 의한 암호화되는 단백질의 발현에 따라 216명의 한국인 대장암 환자 코호트의 10년 생존기간을 분석하여 NTRK1 융합단백질이 발현된 63명의 대장암 환자는 NTRK1 융합단백질이 발현되지 않는 153명의 대장암 환자에 비하여 생존기간이 유의적으로 단축되는 것을 확인하였다(도 12). 이는 대장암 환자에서 NTRK1 융합유전자 및 이로부터 발현되는 융합단백질이 측정될 경우 대장암 치료 예후가 나쁘다는 것을 확인한 것이다.
In a specific embodiment of the present invention, the 10-year survival period of 216 cohorts of Korean colon cancer patients was analyzed according to the expression of the protein encoded by the NTRK1 fusion gene. Sixty-three colon cancer patients expressing the NTRK1 fusion protein were analyzed for NTRK1 fusion protein The survival time was significantly shorter than that of the 153 colorectal cancer patients (Fig. 12). This indicates that the NTRK1 fusion gene and the fusion protein expressed therefrom in the colon cancer patients are inferior in colorectal cancer treatment prognosis.

또 하나의 양태로서, 본 발명은 NTRK1 융합유전자의 발현 수준을 측정하는 단계를 포함하는 대장암 치료제의 스크리닝 방법을 제공하며, 구체적으로 대장암 치료용 후보물질을 NTRK1 융합유전자를 발현하는 세포에 처리하는 단계; 및 상기 NTRK1 융합유전자의 발현 수준을 측정하는 단계를 포함하는, 대장암 치료제의 스크리닝 방법을 제공한다.In another aspect, the present invention provides a method for screening a therapeutic agent for colorectal cancer, comprising the step of measuring the expression level of the NTRK1 fusion gene. Specifically, the present invention provides a method for screening a candidate compound for treating colorectal cancer, ; And measuring the expression level of the NTRK1 fusion gene.

본 발명에서 NTRK1 융합유전자는 상기 설명한 바와 같다.
In the present invention, the NTRK1 fusion gene is as described above.

본 발명에서 NTRK1 융합유전자의 발현 수준을 측정하는 것에는, 융합유전자의 전사산물 및 이의 단백질 등의 수준을 측정하는 방법 등이 있고, 상기 융합유전자의 전사산물의 수준을 측정하는 방법이나 융합단백질의 수준을 측정하는 방법은 상기 설명한 바와 같다.In the present invention, the measurement of the expression level of the NTRK1 fusion gene includes a method of measuring the level of the transcription product of the fusion gene and a method of measuring the level of the transcription product of the fusion gene and the protein thereof. The method of measuring the level is as described above.

본 발명의 대장암 치료제의 스크리닝 방법은 특히 대장암 치료용 후보물질 처리에 의해 NTRK1 융합유전자의 발현 수준이 낮아질 경우, 대장암 치료제로 판단하는 것일 수 있다.The screening method of the therapeutic agent for colorectal cancer of the present invention may be judged to be a therapeutic agent for colorectal cancer when the expression level of the NTRK1 fusion gene is lowered, particularly by treatment with a candidate substance for the treatment of colorectal cancer.

구체적으로, 대장암 치료 후보 물질의 존재 및 부재 하에서 NTRK1 융합유전자의 발현의 증가 또는 감소를 비교하는 방법으로 대장암 치료제를 스크리닝하는데 유용하게 사용할 수 있다. NTRK1 융합유전자(LMNA-NTRK1 또는 TPM3-NTRK1), 또는 이로부터 발현되는 단백질의 농도를 간접적으로 또는 직접적으로 감소시키는 물질은 대장암의 치료제로서 선택할 수 있다. 즉, 대장암 치료 후보 물질의 부재 하에 대장암 세포에서의 본 발명의 마커 NTRK1 융합유전자의 발현 수준을 측정하고, 또한 대장암 치료 후보 물질의 존재 하에서 본 발명의 마커 NTRK1 융합유전자의 발현 수준을 측정하여 양자를 비교한 후, 대장암 치료 후보 물질이 존재할 때의 본 발명의 마커의 발현 수준이 대장암 치료 후보 물질의 부재 하에서의 마커 발현 수준보다 감소시키는 물질을 대장암의 치료제로 예측할 수 있는 것이다.
Specifically, the method can be useful for screening a therapeutic agent for colorectal cancer by comparing the increase or decrease of the expression of the NTRK1 fusion gene in the presence or absence of a colorectal cancer treatment candidate substance. Substances that indirectly or directly reduce the concentration of the NTRK1 fusion gene (LMNA-NTRK1 or TPM3-NTRK1), or proteins expressed therefrom, can be selected as therapeutic agents for colon cancer. That is, the expression level of the marker NTRK1 fusion gene of the present invention in colon cancer cells is measured in the absence of a candidate for colorectal cancer treatment candidate, and the expression level of the marker NTRK1 fusion gene of the present invention is measured And comparing the quantities of the candidate agents with those of the candidate agent for colorectal cancer therapy, the agent can be predicted as a therapeutic agent for colorectal cancer when the level of expression of the marker of the present invention in the presence of the colorectal cancer therapeutic candidate substance is lower than the marker expression level in the absence of the colorectal cancer therapeutic candidate substance.

또 하나의 양태로서, 본 발명은 상기 대장암 치료제의 스크리닝 방법을 통하여 대장암 치료제로 스크리닝된 물질을 제공한다. In another aspect, the present invention provides a substance screened with a therapeutic agent for colorectal cancer through the screening method of the colorectal cancer therapeutic agent.

본 발명의 구체적인 일 실시예에서, 상기 NTRK1 융합단백질의 기능 저해를 통한 대장암 치료제 스크리닝 방법은 대장암 세포주 KM12를 이용하여 확인하였다. 다중 티로신키나아제 저해제인 레스타우르티닙과 크리조티닙은 토크리스 바이오사이언스사 (영국)에서 구입하였고, NTRK1 단백질 특이적인 저해제인 ARRY-470은 LG생명과학에서 합성하였고, 한국과학기술원에서 제공받은 화합물을 상기 본 발명의 스크리닝 방법을 통하여 스크리닝한 결과 대한민국 공개특허 제10-2013-0106186호에 개시된 화합물 8(본 발명에서 5f) 및 화합물 27(본 발명에서 6s)이 스크리닝되었다. 상기 스크리닝된 화합물 8(5f)은 하기 화학식 1로 표시되는 화합물이고 화합물 27(6s)는 하기 화학식 2로 표시되는 화합물이다.In one specific embodiment of the present invention, the method for screening a therapeutic agent for colon cancer by inhibiting the function of the NTRK1 fusion protein was confirmed using a colon cancer cell line KM12. The multi-tyrosine kinase inhibitors Lestaurintinib and Krystotinib were purchased from Torquis Biosciences (UK). ARRY-470, a specific inhibitor of NTRK1 protein, was synthesized by LG Life Sciences and the compounds provided by Korea Advanced Institute of Science and Technology As a result of screening through the screening method of the present invention, Compound 8 (5f in the present invention) and Compound 27 (6s in the present invention) disclosed in Korean Patent Publication No. 10-2013-0106186 were screened. The screened compound 8 (5f) is a compound represented by the following formula (1) and the compound 27 (6s) is a compound represented by the following formula (2).

[화학식 1][Chemical Formula 1]

[화학식 2](2)

상기 각각의 저해제는 0.64 nM에서 10 μM의 농도로 4일간 처리하였고, 세포증식의 억제 정도는 ATP-Glo Bioluminometric Cell Viability Assay kit (바이오티움사, 미국)를 이용하여 확인하였다.
Each of the inhibitors was treated at a concentration of 0.64 nM at 10 μM for 4 days, and the degree of inhibition of cell proliferation was confirmed by using an ATP-Glo Bioluminometric Cell Viability Assay kit (Biotium Co., USA).

또 하나의 양태로서, 본 발명은 NTRK1 융합단백질 저해제를 유효성분으로 포함하는 대장암의 예방 또는 치료용 약제학적 조성물을 제공한다.In another aspect, the present invention provides a pharmaceutical composition for the prevention or treatment of colon cancer, which comprises an NTRK1 fusion protein inhibitor as an active ingredient.

상기 NTRK1 융합단백질 저해제는 융합단백질의 활성이나 융합유전자의 발현을 억제하는 등의 작용을 할 수 있다. 상기 NTRK1 융합단백질 저해제는 상술한 바와 같이, NTRK1 단백질의 기능, 구체적으로 NTRK1 티로신키아나제의 기능을 직접적으로 또는 간접적으로 억제할 수 있는 물질이면 특히 제한은 없다. NTRK1 티로신키나아제를 저해할 수 있는 한, 기타 티로신키나아제를 저해하는 물질이어도 된다. 본 발명에 적용할 수 있는 공지의 RET 티로신키나아제 저해제로서는 상술한 바와 같이, 예를 들면, (5S,6S,8R)-6-하이드록시-6-(하이드록시메틸)-5-메틸-7,8,14,15-테트라하이드로-5H-16-옥사-4b,8a,14-트리아자-5,8-메타노이벤조[b,h]사이클로옥타[jkl]사이클로펜타[e]-아스-인다센-13(6H)-원(일반명: 레스타우르티닙(lestaurtinib): FLT3, JAK2, TRKA, TRKB 및 TRKC를 표적으로 하는 화합물), N-[5-((2R)-2-메톡시-2-페닐에타노일)-1,4,5,6-테트라하이드로피롤로[3,4-c]피파졸-3-일]-4-(4-메틸피페라진-1-일)벤자마이드(일반명: 다누세르팁(danusertib): ARK1, RET, ABL, FGFR1, TRKA를 표적으로 하는 화합물), N,1,4,4-테트라메일-8-((4-(4-메틸피페라진-1-일)페닐)아미노)-4,5-디하이드로-1H-피라졸로[4,3-h]퀴나졸린-3-카복사마이드 (일반명: 미르시클립(milciclib): CDK2, TRKA, TRKB, TRKC를 표적으로 하는 화합물), RXDX-101 (코드명: ROS1, ALK, TRKA를 표적으로 하는 화합물), TSR-011 (코드명: TRKA, TRKB, TRKC, ALK를 표적으로 하는 화합물), PLX-7486 (코드명: CSF1R, TRKA, TRKB, TRKC를 표적으로 하는 화합물), NMS-E628 (코드명: ROS1, JAK2, ALK, TRKA를 표적으로 하는 화합물), 코드명 AZD-7451의 화합물 (TRKA, TRKB, TRKC를 표적으로 하는 화합물), 코드명 NMS-P626의 화합물 (TRKA를 표적으로 하는 화합물), 코드명 RXDX-102의 화합물 (TRKA를 표적으로 하는 화합물), 코드명 ARRY-47의 화합물 (TRKA, TRKB, TRKC를 표적으로 하는 화합물)일 수 있으며, 특히 ARRY-470, 하기 화학식 1로 표시되는 화합물 또는 화학식 2로 표시되는 화합물일 수 있다. The NTRK1 fusion protein inhibitor may act to inhibit the activity of the fusion protein or the expression of the fusion gene. As described above, the NTRK1 fusion protein inhibitor is not particularly limited as long as it is a substance capable of directly or indirectly inhibiting the function of the NTRK1 protein, specifically the function of NTRK1 tyrosine kinase. As long as it can inhibit NTRK1 tyrosine kinase, it may be a substance that inhibits other tyrosine kinases. As a known RET tyrosine kinase inhibitor applicable to the present invention, for example, (5S, 6S, 8R) -6-hydroxy-6- (hydroxymethyl) 8,14,15-tetrahydro-5H-16-oxa-4b, 8a, 14-triaza-5,8-methanoibenzo [b, h] cycloocta [jkl] cyclopenta [e] (2R) -2-methoxy-l, 3-dihydro-lH-pyrazolo [ 3,4-c] pyrazol-3-yl] -4- (4-methylpiperazin-1-yl) benzamide (4-methylpiperazin-1-yl) - (4-methylpiperazin-1-yl) Yl) phenyl) amino) -4,5-dihydro-1H-pyrazolo [4,3-h] quinazoline-3-carboxamide (generic name: milciclib: CDK2, TRKA , TRKB, and TRKC), RXDX-101 TRKA, TRKB, TRKC, and ALK), PLX-7486 (codenamed CSF1R, TRKA, TRKB and TRKC as targets), TSR- (Compound targeting TRKA, TRKB and TRKC), code NMS-E628 (code name: a compound targeting ROS1, JAK2, ALK and TRKA) (A compound targeting TRKA), a compound with a code name of RXDX-102 (a compound targeting TRKA), a compound with a code name ARRY-47 (a compound targeting TRKA, TRKB, or TRKC) , Particularly ARRY-470, a compound represented by the following formula (1) or a compound represented by the following formula (2).

[화학식 1][Chemical Formula 1]

[화학식 2](2)

또한, 상기 NTRK1 융합단백질 저해제는 NTRK1 융합유전자 발현 억제제일 수 있으며, NTRK1 융합유전자의 안티센스 올리고뉴클레오타이드, siRNA, shRNA 및 microRNA로 구성된 군으로부터 선택되는 것일 수 있다. 또한, NTRK1 융합단백질 저해제는 NTRK1 융합단백질에 특이적으로 결합하는 항체일 수 있다.
In addition, the NTRK1 fusion protein inhibitor may be an inhibitor of NTRK1 fusion gene expression, and may be selected from the group consisting of antisense oligonucleotides, siRNA, shRNA and microRNA of the NTRK1 fusion gene. In addition, the NTRK1 fusion protein inhibitor may be an antibody that specifically binds to the NTRK1 fusion protein.

이러한 항체는 폴리클로날 항체, 모노클로날 항체 또는 항원 결합성을 갖는 것이면 그것의 일부도 본 발명의 항체에 포함되고 모든 면역 글로불린 항체가 포함된다. 나아가, 본 발명의 항체에는 인간화 항체 등의 특수 항체도 포함하며, 신규한 항체 외에 이미 당해 기술분야에서 공지된 항체들도 포함될 수 있다. 상기 항체는 상기 목적을 달성하기 위한 또 다른 양태로서, 본 발명은 LMNA-NTRK1 또는 TPM3-NTRK1 융합유전자의 발현 억제제 또는 상기 융합유전자로부터 발현되는 단백질 억제제를 유효성분으로 포함하는 대장암의 예방 또는 치료용 약제학적 조성물을 제공한다.을 특이적으로 인식하는 결합의 특성을 갖는 한, 2개의 중쇄와 2개의 경쇄의 전체 길이를 가지는 완전한 형태뿐만 아니라, 항체 분자의 기능적인 단편을 포함한다. 항체의 분자의 기능적인 단편이란, 적어도 항원 결합 기능을 보유하고 있는 단편을 뜻하며, Fab, F(ab'), F(ab')₂ 및 Fv 등이 있다.Such an antibody may be a polyclonal antibody, a monoclonal antibody, or a part thereof having an antigen binding property, and all immunoglobulin antibodies are included in the antibody of the present invention. Furthermore, the antibodies of the present invention include special antibodies such as humanized antibodies, and in addition to the novel antibodies, antibodies that are already known in the art may also be included. In another aspect of the present invention, the antibody is an agent for inhibiting the expression of LMNA-NTRK1 or TPM3-NTRK1 fusion gene or a protein inhibitor expressed from the fusion gene as an active ingredient. As long as it has the characteristics of a binding that specifically recognizes the full length of two heavy chains and two light chains as well as functional fragments of the antibody molecule. The functional fragment of the molecule of the antibody refers to a fragment having at least an antigen binding function, and includes Fab, F (ab ') 2, F (ab') ₂ and Fv.

본 발명의 상기 대장암 치료용 약제학적 조성물은 NTRK1 융합단백질에 특이적인 항체를 이용하여, 대장암 환자의 혈청 내에서 암의 이동을 증가시키는 활성을 가지는 NTRK1 융합단백질을 제거하는 방법에 이용되어, 대장암을 치료할 수 있다. 본 발명자들은 스크리닝을 통해 NTRK1 융합단백질 억제 효과를 보이는 화합물이 대장암 세포 증식을 억제하는 효과를 가지는 것을 확인하여 NTRK1 융합유전자 또는 이로부터 발현되는 융합단백질이 대장암 치료의 타겟이 될 수 있음을 규명하였다.The pharmaceutical composition for the treatment of colorectal cancer of the present invention is used in a method for removing NTRK1 fusion protein having an activity of increasing cancer migration in serum of colorectal cancer patients using an antibody specific for NTRK1 fusion protein, Colon cancer can be cured. The inventors of the present invention found that a compound showing an inhibitory effect of NTRK1 fusion protein through screening has an effect of inhibiting the proliferation of colon cancer cells, and thus it was confirmed that the NTRK1 fusion gene or a fusion protein expressed therefrom could be a target of colon cancer treatment Respectively.

이와 같은 대장암 치료용 조성물은 NTRK1 융합유전자 또는 이로부터 발현되는 단백질에 특이적으로 결합하여 항암 활성을 촉진할 수 있는 물질은 제한없이 포함하나, 그 예로서 단백질, 앱타머, 펩티드, 탄수화물, 화합물일 수 있다. Such a composition for treating colorectal cancer includes, but is not limited to, a substance capable of specifically binding to the NTRK1 fusion gene or a protein expressed therefrom to promote the anticancer activity. Examples thereof include proteins, aptamers, peptides, carbohydrates, Lt; / RTI >

본 발명의 조성물은 약제학적으로 허용되는 담체와 함께 투여될 수 있고, 경구 투여시에는 결합체, 활택제, 붕해제, 부형제, 가용화제, 분산제, 안정화제, 현탁화제, 색소, 향료 등을 사용할 수 있으며, 주사제의 경우에는 완충제, 보존제, 무통화제, 가용화제, 등장제, 안정화제 등을 혼합하여 사용할 수 있으며, 국소 투여용의 경우에는 기제, 부형제, 윤활제, 보존제 등을 사용할 수 있다. 본 발명의 조성물의 제형은 상술한 바와 같은 약제학적으로 허용되는 담체와 혼합하여 다양하게 제조될 수 있다. 예를 들어, 경구 투여시에는 정제, 트로키, 캡슐, 엘릭시르, 서스펜션, 시럽, 웨이퍼 등의 형태로 제조할 수 있으며, 주사제의 경우에는 단위 투약 앰플 또는 다수회 투약 형태로 제조될 수 있다.
The composition of the present invention may be administered together with a pharmaceutically acceptable carrier. In oral administration, a conjugate, a lubricant, a disintegrant, an excipient, a solubilizer, a dispersant, a stabilizer, a suspending agent, In the case of injections, a buffer, a preservative, an anhydrous agent, a solubilizer, an isotonic agent, a stabilizer and the like may be mixed. In the case of topical administration, a base, excipient, lubricant and preservative may be used. Formulations of the compositions of the present invention may be prepared in a variety of ways by mixing with pharmaceutically acceptable carriers as described above. For example, oral administration may be in the form of tablets, troches, capsules, elixirs, suspensions, syrups, wafers, etc. In the case of injections, unit dosage ampoules or multiple dose forms may be prepared.

또 하나의 양태로서, 본 발명은 상기 NTRK1 융합단백질 저해제를 유효성분으로 포함하는 대장암의 예방 또는 치료용 약제학적 조성물을 개체에 투여하는 단계를 포함하는, 대장암 예방 또는 치료방법을 제공한다.In another aspect, the present invention provides a method of preventing or treating colon cancer, which comprises administering to a subject a pharmaceutical composition for preventing or treating colon cancer comprising the NTRK1 fusion protein inhibitor as an active ingredient.

본 발명의 NTRK1 융합단백질 저해제, 개체 및 조성물은 상기 설명한 바와 같다.
The NTRK1 fusion protein inhibitors, individuals and compositions of the present invention are as described above.

본 발명의 조성물은 단독으로 사용할 수 있지만, 치료 효율을 증가시키기 위해 방사선 요법 또는 화학요법(세포 성장 정지 또는 세포 독성 물질, 항생 물질형 물질, 알킬화제, 항대사성 물질, 호르몬제, 면역제, 인터페론형 물질, 사이클로옥시게나제 억제제, 메탈로매트릭스프로테아제 억제제, 텔로머라제 억제제, 티로신 키나제 억제제, 항성장인자 수용체 물질, 항-HER 물질, 항-EGFR 물질, 항-혈관생성 물질, 파르네실 트랜스퍼라제 억제제, ras-raf 시그날 전도 경로 억제제, 세포 주기 억제제, 기타 cdk 억제제, 튜불린 결합체, 토포이소머라제Ⅰ 억제제, 토포이소머라제 Ⅱ 억제제 등)과 같은 다른 항암 치료법과 병용하여 사용할 수 있다.The composition of the present invention may be used alone, but it may be administered alone or in combination with radiation therapy or chemotherapy (including cell growth arrest or cytotoxic substance, antibiotic substance, alkylating agent, antimetabolite, hormone, An anti-EGFR agent, an anti-angiogenic agent, a paroxetine transferase inhibitor, an anti-angiogenesis inhibitor, a metallo-matrix protease inhibitor, a telomerase inhibitor, a tyrosine kinase inhibitor, an anti-growth factor receptor substance, , ras-raf signaling pathway pathway inhibitors, cell cycle inhibitors, other cdk inhibitors, tubulin binders, topoisomerase I inhibitors, topoisomerase II inhibitors, etc.).

본 발명에서 사용되는 용어, 투여는 어떠한 적절한 방법으로 환자에게 본 발명의 조성물을 도입하는 것을 의미하며, 본 발명의 조성물의 투여경로는 해당 저해제의 종류나 대장암의 종류 등에 따라 선택되지만, 목적 조직에 도달할 수 있는 한 어떠한 일반적인 경로를 통하여 투여될 수 있다. 경구 투여, 복강 내 투여, 정맥 내 투여, 근육 내 투여, 피하 투여, 피내 투여, 비내 투여, 폐내 투여, 직장내 투여, 강내 투여, 복강 내 투여, 경막 내 투여가 이루어질 수 있으나, 이에 제한되지는 않는다.The term used in the present invention means administration of the composition of the present invention to a patient by any suitable method. The route of administration of the composition of the present invention is selected depending on the type of the inhibitor or the type of colon cancer, Lt; RTI ID = 0.0 > normal < / RTI > pathway. But are not limited to, oral administration, intraperitoneal administration, intravenous administration, intramuscular administration, subcutaneous administration, intradermal administration, intranasal administration, intrapulmonary administration, intrarectal administration, intraperitoneal administration, intraperitoneal administration, Do not.

본 발명의 조성물의 유효 투여량의 범위는 성별, 체표면적, 질환의 종류 및 중증도, 연령, 약물에 대한 민감도, 투여경로 및 배출비율, 투여시간, 치료기간, 표적세포, 발현 수준 등 기타 의학 분야에 잘 알려진 다양한 요인에 따라 달라질 수 있으며, 당 분야의 전문가들에 의해 용이하게 결정될 수 있다.
The effective dose range of the composition of the present invention can be varied depending on the sex, body surface area, kind and severity of disease, age, sensitivity to the drug, administration route and discharge rate, administration time, treatment period, target cell, And may be readily determined by one of ordinary skill in the art.

본 발명에 따라 대장암에 특이적인 진단 마커를 제공함으로써, 정확한 대장암 진단을 가능하게 하며, 대장암 환자의 계층화를 통해 임상예후를 예측하는 것, 대장암에 대한 치료의 유효성을 예측하는 것, 특히 NTRK1 단백질 저해제에 의한 대장암 치료의 유효성을 예측하는 것이 가능해진다. 이에 따라, 약물을 투여하는 것이 유효하지 않다고 생각되는 대장암 환자에 대한 약물의 투여를 제한할 수 있으므로, 효율적이고 안전한 맞춤형 대장암 치료를 실현하는 것이 가능해진다.
The present invention provides diagnostic markers specific for colorectal cancer, thereby enabling accurate diagnosis of colorectal cancer, predicting clinical prognosis through stratification of colorectal cancer patients, predicting the efficacy of treatment for colorectal cancer, In particular, it becomes possible to predict the efficacy of treatment of colon cancer with the NTRK1 protein inhibitor. This makes it possible to limit the administration of a drug to a colon cancer patient who is thought to be ineffective in administering the drug, thus making it possible to realize an efficient and safe customized colon cancer treatment.

도 1은 일루미나사의 HiSeq 2000을 이용한 차세대 전사체 염기서열분석을 통해 150명의 대장암 임상 조직 및 50명의 동일 환자 유래의 정상 대장 임상 조직 중 3개의 대장암 임상 조직에서 NTRK1 유전자 융합에 의해 NTRK1 유전자의 전사산물의 발현이 증가되었음을 나타내는 그림과 데이터 분석 결과를 나타내는 그림이다.
도 2는 NTRK1 융합유전자로부터 암호화된 융합전사산물을 생어시퀀싱(Sanger sequencing)에 의해 염기서열을 분석하여 NTRK1이 LMNA 또는 TPM3와 융합되었음을 나타내는 그림과 데이터 분석 결과를 나타내는 그림이다.
도 3은 전사산물 수준에서 NTRK1 융합유전자가 확인 된 대장암 및 정상 대장 임상 조직쌍을 대상으로 NTRK1 단백질의 발현을 분석하여, NTRK1 융합유전자를 갖는 대장암 특이적인 NTRK1 단백질 발현을 나타내는 그림이다.
도 4는 차세대 전사체 염기서열분석을 통해 전사산물 및 단백질 수준에서 NTRK1 융합유전자가 확인 된 (a) 대장암 및; (b) 정상 대장 임상 조직쌍을 대상으로 NTRK1 유전자의 융합여부를 형광직접접합법(fluorescence in situ hybridization)을 통해 DNA 수준에서 분석하여, 대장암 특이적인 NTRK1 유전자의 융합을 나타내는 그림이다. 노란색 화살표는 유전자 융합에 의해 분리된 NTRK1 유전자를 나타낸다. 기준자는 5 마이크로미터를 나타낸다.
도 5는 LMNA 유전자와 NTRK1 유전자의 단일 융합 부위 및 TPM3 유전자와 NTRK1 유전자의 3종류의 융합 부위를 나타내는 그림이다.
도 6은 LMNA 유전자와 NTRK1 유전자의 단일 융합유전자와 TPM3 유전자와 NTRK1 유전자의 3종류의 융합유전자에 의해 암호화되는 융합단백질의 구조를 나타내는 그림이다.
도 7은 차세대 전사체 염기서열분석을 통해 147명의 대장암 임상 조직에서 대장암 특이적인 암 유전자 및 억제 유전자의 체세포 돌연변이를 분석하여 NTRK1 융합유전자가 있는 대장암 임상 조직은 대장암 특이적인 암 유전자의 체세포 돌연변이가 없는 것을 확인한 그림이다.
도 8은 LMNA-NTRK1 및 TPM3-NTRK1 융합유전자를 종양성이 없는 NIH3T3 세포주에서 강제로 발현시켜 세포괴 형성 유무(colony formation assay)를 평가하여 NTRK1 융합유전자의 발현이 NIH3T3 세포주의 체외 종양성을 유도하는 것을 나타낸 그림이다. 음성대조군으로 NTRK1 융합유전자가 없는 공백터(empty vector)를 강제 발현하였고, 양성대조군으로 TPM3-NTRK1 융합유전자가 있는 KM12 대장암 세포주를 사용하였다.
도 9는 LMNA-NTRK1 및 TPM3-NTRK1 융합유전자를 종양성이 없는 NIH3T3 세포주에서 강제로 발현시켜 면역기능이 결핍된 누드마우스의 등쪽 피하에 접종한 후 종양 형성 유무를 평가하여 NTRK1 융합유전자의 발현이 NIH3T3 세포주의 체외 종양성을 유도하는 것을 나타낸 그림이다. 양성대조군으로 TPM3-NTRK1 융합유전자가 있는 KM12 대장암 세포주를 사용하였다.
도 10은 NTRK1 유전자에 의해 암호화되는 단백질의 발현을 216명의 한국인 대장암 환자 코호트 유래의 조직마이크로어레이(tissue microarray) 및 472명의 중국인 대장암 환자 코호트 유래의 조직마이크로어레이에서 분석하여 한국인 대장암 환자 코호트 중 29.1% 및 중국인 대장암 코호트 중 25.6%에서 NTRK1 단백질의 발현을 확인한 그림이다.
도 11은 NTRK1 유전자에 의해 암호화되는 단백질의 발현이 확인된 한국인 대장암 환자 코호트 유래의 임상 조직 중 NTRK1 단백질의 발현 수준이 높을 수록 NTRK1 유전자의 융합 빈도가 높아졌음을 형광직접접합법으로 나타낸 그림이다. 노란색 화살표는 유전자 융합에 의해 분리된 NTRK1 유전자를 나타낸다. 기준자는 10 마이크로미터를 나타낸다.
도 12는 NTRK1 유전자에 의한 암호화되는 단백질의 발현에 따라 216명의 한국인 대장암 환자 코호트의 10년 생존기간을 분석하여 NTRK1 단백질이 발현된 63명의 대장암 환자는 NTRK1 단백질이 발현되지 않는 153명의 대장암 환자에 비하여 생존기간이 유의적으로 단축되는 것을 나타낸 그림이다.
도 13은 TPM3-NTRK1 융합유전자가 있는 KM12 대장암 세포주를 대상으로 티로신키나아제 저해제 중 NTRK1 단백질에 대한 저해능이 있는 레스타우르티닙(lestaurtinib) 및 NTRK1 단백질에 대한 저해능이 없는 크리조티닙(crizotinib)과 NTRK1 단백질 특이적 저해제인 대한민국 공개특허 제10-2013-0106186호에 개시된 화합물 8(5f) 및 화합물 27(5s) 및 ARRY-470을 농도별로 적용하여 KM12 세포증식이 NTRK1 특이적 저해제에서 더욱 감소하는 것을 나타낸 그림이다.
도 14는 차세대 전사체 염기서열분석을 통해 중간값 118.5 백만개의 얼라인된 리드는 생산하였고 100개의 염기 중 1개의 염기에서 오차가 발생할 수 있는 수준은 94.3%로 확인되었음을 나타낸 그림이다.
도 15는 80% 이상의 임상 조직에서 발현이 인정되는 18,725개의 유전자 발현량을 기초로 전체 임상 조직을 PCA (Principle Component Analysis)로 분석하여 이상치(outlier) 임상조직 3개를 확인한 그림이다.Fig. 1 shows the results of sequencing analysis of the NTRK1 gene by NTRK1 gene fusion in 150 colon cancer clinical tissues and 3 colon cancer clinical tissues derived from 50 identical patients from the same patient using HiSeq 2000 of Illumina Co., This figure shows that the expression of the transcription product is increased and the data analysis result.
FIG. 2 is a figure showing the result of analysis and data showing that NTRK1 is fused with LMNA or TPM3 by analyzing the base sequence by Sanger sequencing of the fusion fusion product encoded from the NTRK1 fusion gene.
FIG. 3 is a graph showing the expression of NTRK1 protein having a NTRK1 fusion gene and analyzing expression of NTRK1 protein in colon cancer and normal colon tissue pairs in which the NTRK1 fusion gene has been identified at the transcript level.
FIG. 4 shows (a) colon cancer in which the NTRK1 fusion gene has been identified at the transcription product and protein level through a sequencing analysis of the next generation transcript; (b) Fusion of NTRK1 gene in normal colon tissue pairs is analyzed at the DNA level by fluorescence in situ hybridization, which shows the fusion of NTRK1 gene specific for colon cancer. The yellow arrow represents the NTRK1 gene isolated by gene fusion. The reference represents 5 micrometers.
5 is a diagram showing a single fusion site of the LMNA gene and the NTRK1 gene and three fusion sites of the TPM3 gene and the NTRK1 gene.
6 is a diagram showing the structure of a fusion protein encoded by a fusion gene of LMNA gene and NTRK1 gene, and a fusion gene of TPM3 gene and NTRK1 gene.
FIG. 7 is a graph showing the results of analysis of somatic cell mutations of a cancer gene-specific gene and a repressor gene in 147 colon cancer clinical tissues through a sequencing analysis of a next-generation transcript. The colon cancer clinical tissue having the NTRK1 fusion gene was analyzed for colon cancer- It is confirmed that there is no somatic mutation.
FIG. 8 shows the expression of the NTRK1 fusion gene inducing the in vitro tumorigenicity of the NIH3T3 cell line by forcibly expressing the LMNA-NTRK1 and TPM3-NTRK1 fusion genes in the non-tumorigenic NIH3T3 cell line and evaluating the colony formation assay Fig. An empty vector without the NTRK1 fusion gene was forcibly expressed as a negative control, and a KM12 colon cancer cell line with a TPM3-NTRK1 fusion gene as a positive control was used.
FIG. 9 is a graph showing the expression of NTRK1 fusion gene and the expression of NTRK1 fusion gene after inoculation into the dorsal subcutaneously of nude mice lacking immune function by forcibly expressing the LMNA-NTRK1 and TPM3-NTRK1 fusion genes in the non-tumorigenic NIH3T3 cell line Lt; RTI ID = 0.0 > NIH3T3 < / RTI > cell line. A KM12 colon cancer cell line with a TPM3-NTRK1 fusion gene was used as a positive control.
Figure 10 shows the expression of the protein encoded by the NTRK1 gene in tissue microarrays derived from 216 Korean colon cancer patients cohort and tissue microarrays derived from 472 Chinese colon cancer patient cohorts, And the expression of NTRK1 protein was detected in 25.6% of the Chinese colon cancer cohort.
FIG. 11 is a graph showing the fluorescence direct binding method that the fusion frequency of the NTRK1 gene was increased as the expression level of the NTRK1 protein in the clinical tissue derived from the cohort of the colon cancer patient who was confirmed to have been expressed by the NTRK1 gene was confirmed. The yellow arrow represents the NTRK1 gene isolated by gene fusion. The reference represents 10 micrometers.
FIG. 12 is a graph showing the survival time of 216 Korean colon cancer patient cohorts according to the expression of the protein encoded by the NTRK1 gene. In 63 colon cancer patients expressing NTRK1 protein, 153 colon cancer patients without NTRK1 protein expression The survival time was significantly shorter than that of the patients.
Fig. 13 is a graph showing the effect of the inhibition of NTRK1 protein against lestaurtinib and NTRK1 protein in the KM12 colon cancer cell line with the TPM3-NTRK1 fusion gene and the inhibitory effect of crizotinib and NTRK1 (5f) and Compound 27 (5s) and ARRY-470 disclosed in Korean Patent Laid-Open No. 10-2013-0106186, which is a protein-specific inhibitor, were applied at different concentrations to further reduce KM12 cell proliferation in NTRK1-specific inhibitors Fig.
FIG. 14 is a graph showing that the intermediate level of 118.5 million aligned leads were produced through sequencing analysis of the next generation transcript, and the level at which one base out of 100 bases could have an error was 94.3%.
FIG. 15 shows three outlier clinical tissues obtained by analyzing whole clinical tissues using PCA (Principle Component Analysis) based on the expression amount of 18,725 genes recognized in 80% or more of clinical tissues.

이하 본 발명을 실시예를 통하여 보다 상세하게 설명한다. 그러나, 이들 실시예는 본 발명을 예시적으로 설명하기 위한 것으로 본 발명의 범위가 이들 실시예에 한정되는 것은 아니다.
Hereinafter, the present invention will be described in more detail with reference to examples. However, these examples are for illustrative purposes only, and the scope of the present invention is not limited to these examples.

실시예Example 1 : One : LMNALMNA 또는 or TPM3TPM3 -- NTRK1NTRK1 융합유전자 및 이에 의해 암호화된 The fusion gene and the encoded 전사산Warrior mountain 물 및 융합 단백질Water and fusion protein

본 발명에서는 하기 실시예를 통하여, 대장암에 있어서 LMNA 단백질과 NTRK1 단백질과의 융합예를 본 발명에서 처음으로 밝혔다. 또한 TPM3 유전자 또는 LMNA 유전자와 NTRK1 유전자의 융합 및 이에 암호화된 융합전사산물과 융합단백질을 갖는 대장암 환자의 빈도 및 임상예후가 대규모 대장암 환자 코호트 분석을 통해 본 발명에서 처음으로 확인되었다.
In the present invention, an example of the fusion of the LMNA protein and the NTRK1 protein in colon cancer has been disclosed for the first time in the present invention through the following examples. Also, the frequency and clinical prognosis of patients with colorectal cancer having a fusion of TPM3 gene or LMNA gene with NTRK1 gene and a fusion fusion product and fusion protein encoded thereby are first identified in the present invention through a large-scale colorectal cancer patient cohort analysis.

1-1. 1-1. NTRK1NTRK1 와 Wow 융합단백질을The fusion protein 이루는 유전자 Gene

우선, 본 발명에서 확인된 대장암 조직에서 특이적으로 존재하는 NTRK1 융합유전자를 다음의 표 1에 정리하였다.
First, the NTRK1 fusion gene specifically present in the colon cancer tissue identified in the present invention is summarized in Table 1 below.

대장암 특이적 NTRK1 융합유전자Colon cancer-specific NTRK1 fusion gene 공여유전자 (donor gene)Donor gene 영역domain 수납유전자 (acceptor gene)The acceptor gene 영역domain 거리 (Mb)Distance (Mb) TPM3TPM3 1q21.31q21.3 NTRK1NTRK1 1q23.11q23.1 2.62.6 LMNALMNA 1q221q22 NTRK1NTRK1 1q23.11q23.1 0.60.6

본 발명의 NTRK1 융합유전자는 상기 표 1에서 정리하고 있는 단백질 단편들의 절단점(break point; 또는 융합 부위)에서 절단 및 융합된다. 상기 단편의 N말단(C말단 융합파트너의 경우) 또는 C말단(N말단 융합 파트너의 경우) 마지막에 포함되는 엑손과 절단되어 제거되는 엑손 사이의 인트론(intron) 부위 중 어느 지점에서 절단되어도 암호화되는 단백질 단편의 아미노산 서열에는 영향을 미치지 않게 되므로, 실제 절단되는 지점은 상기 인트론 위치 중 어느 지점이어도 무방하다.
The NTRK1 fusion gene of the present invention is cleaved and fused at the break point (fusion site) of the protein fragments compiled in Table 1 above. Is cleaved at any point in the intron region between the exon contained at the end of the fragment (in the case of a C-terminal fusion partner) or the C-terminal (in the case of an N-terminal fusion partner) and the truncated exon of the fragment The amino acid sequence of the protein fragment is not affected, so that the point at which the protein is actually cleaved may be any of the intron positions.

1-2. 1-2. TPM3TPM3 -- NTRK1NTRK1 융합유전자 및 Fusion genes and TPM3TPM3 -- NTRK1NTRK1 융합단백질Fusion protein

본 발명에서 TPM3 단백질을 암호화하는 TPM3 유전자는 인간 유래 TPM3 유전자를 이용하였으며 인간의 염색체 1번(q21.3)에 위치하며, 이로부터 암호화되는 TPM3 단백질은 총 아미노산 길이가 285aa인 단백질이다. TPM3 단백질 또는 TPM3 단백질 단편은 TPM3-NTRK1 융합단백질의 N 말단쪽 융합파트너이다. In the present invention, the TPM3 gene encoding the TPM3 protein is a human TPM3 gene. The TPM3 protein encoded by the chromosome 1 (q21.3) is a protein having a total amino acid length of 285 aa. The TPM3 protein or TPM3 protein fragment is the N-terminal fusion partner of the TPM3-NTRK1 fusion protein.

본 발명에서 TPM3 단백질은 인간 유래 TPM3를 사용하였고, 이를 암호화하는 NTRK1 유전자는 인간의 염색체 1번(q21.3)에 위치한다. 이로부터 암호화되는 TPM3 단백질은 총 아미노산 길이가 285 aa인 단백질이다. TPM3 단백질 또는 TPM3 단백질 단편은 TPM3-NTRK1 융합단백질의 N 말단쪽 융합 파트너이다. In the present invention, the TPM3 protein uses human-derived TPM3, and the NTRK1 gene encoding the same is located on human chromosome 1 (q21.3). The TPM3 protein encoded by this is a protein with a total amino acid length of 285 aa. The TPM3 protein or TPM3 protein fragment is the N-terminal fusion partner of the TPM3-NTRK1 fusion protein.

본 발명에서는 상기 GenBank accession no. NM_152263의 염기 서열을 갖는 TPM3 유전자를 사용하였다.
In the present invention, the above-mentioned GenBank accession no. The TPM3 gene having the nucleotide sequence of NM_152263 was used.

상기 TPM3 단백질의 단편은 NM_152263의 8번째 엑손(1번 염색체 상의 (-) 가닥(strand)을 기준으로 154134289~154142876 염기위치)까지의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열을 갖는 것을 사용하였다. 본 발명에서 사용한 상기 TPM3 단백질을 암호화하는 엑손 8의 3' 말단에 코돈(codon)을 이루지 못하는 1개의 뉴클레오타이드(g)가 존재하였다. 상기 TPM3 단백질의 단편을 암호화하는 유전자 및 이로부터 암호화되는 TPM3 단백질에 대하여 아래의 표 2 및 3에 정리하였다.
The fragment of the TPM3 protein used was one having an amino acid sequence encoded by a nucleotide sequence from the eighth exon of NM_152263 (positions 154134289 to 154142876 bases on the (-) strand on chromosome 1). There was one nucleotide (g) that did not form a codon at the 3 'end of the exon 8 encoding the TPM3 protein used in the present invention. The genes encoding the TPM3 protein fragments and the TPM3 proteins encoded therefrom are summarized in Tables 2 and 3 below.

TPM3 단백질의 단편을 암호화 하는 유전자A gene encoding a fragment of the TPM3 protein TPM3 유전자
Accession No.TPM3 gene
Accession No. TPM3 단백질 암호화 부위
(CDS)TPM3 protein coding region
(CDS) TPM3 단백질 단편 암호화 부위:
엑손 기준TPM3 protein fragment Encoding site:
Exon basis TPM3 단백질 단편 암호화 부위:
cDNA 기준TPM3 protein fragment Encoding site:
cDNA standard 염색체 상
절단위치Chromosome image
Cutting position 5' 말단 방향 절단위치
염기서열5 'end cutting position
Base sequence NM_152263NM_152263 118~975 (858bp)
(서열번호 1)118 to 975 (858 bp)
(SEQ ID NO: 1) 엑손 1~엑손 8Exon 1 to exon 8 118~891 + 1nt (g) (775bp) (서열번호 2)118 to 891 + 1 nt (g) (775 bp) (SEQ ID NO: 2) chr1: 154142876chr1: 154142876 5'-ATT GAT GAC CTG GAA g-3'
(서열번호 3)5'-ATT GAT GAC CTG GAA g-3 '
(SEQ ID NO: 3)

TPM3 단백질의 단편Fragment of TPM3 protein TPM3 단백질의 전체 길이
(Accession No.)Overall length of TPM3 protein
(Accession No.) TPM3 단백질 단편 부위TPM3 protein fragment site N-말단방향 절단위치
아미노산 서열N-terminal direction cleavage position
Amino acid sequence 285aa (NP_689476)
(서열번호 4)285aa (NP_689476)
(SEQ ID NO: 4) 1~258aa + 1nt (g) (아미노산 서열: 서열번호 5)1 to 258aa + 1nt (g) (amino acid sequence: SEQ ID NO: 5) N-IDDLE + 1nt (g) (아미노산 서열: 서열번호 6)N-IDDLE + 1nt (g) (amino acid sequence: SEQ ID NO: 6)

본 발명에서 NTRK1 단백질은 인간 유래 NTRK1을 사용하였고, 이를 암호화하는 NTRK1 유전자는 인간의 염색체 1번(q23.1)에 위치한다. 이로부터 암호화되는 NTRK1 단백질은 총 아미노산 길이가 796aa인 단백질이다. NTRK1 단백질 또는 NTRK1 단백질 단편은 TPM3-NTRK1 융합단백질의 C 말단쪽 융합 파트너이다. In the present invention, the NTRK1 protein uses human-derived NTRK1, and the NTRK1 gene encoding the same is located on human chromosome 1 (q23.1). The NTRK1 protein encoded thereby is a protein with a total amino acid length of 796 aa. The NTRK1 protein or NTRK1 protein fragment is the C-terminal fusion partner of the TPM3-NTRK1 fusion protein.

본 발명에서는 GenBank accession no. NM_001012331의 염기 서열을 갖는 NTRK1 유전자를 사용하였다.
In the present invention, GenBank accession no. The NTRK1 gene having the nucleotide sequence of NM_001012331 was used.

상기 NTRK1 단백질의 단편은 NM_001012331의 엑손 9(1번 염색체 상의 (+)가닥을 기준으로 156830671~156844363 염기위치) 또는 엑손 11(1번 염색체 상의 (-)가닥을 기준으로 156830671~156845312 염기위치) 또는 엑손 12(1번 염색체 상의 (-)가닥을 기준으로 156830671~156845872 염기위치)부터 마지막 엑손까지의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열을 갖는 것을 사용하였다. 이때, 엑손 9의 3' 말단의 처음 시작하는 2개의 뉴클레오타이드(ac)와 엑손 11의 3' 말단의 처음 시작하는 2개의 뉴클레오타이드(gc) 및 엑손 12의 3' 말단의 처음 시작하는 2개의 뉴클레오타이드(gt)는 코돈을 이루지 못하는 서열을 가지고 있었다. 상기 코돈을 이루지 못하는 서열들은 TPM3 단백질의 단편과 융합 시에 앞서 설명한 바와 같이 TPM3 단백질의 단편에 추가로 포함된 1개의 뉴클레오타이드(g)와 연결되어 코돈(gac 또는 ggc 또는 ggt)을 이루어 하나의 아미노산(D 또는 G)을 암호화하였다. 상기 NTRK1 단백질의 단편을 암호화하는 유전자 및 이로부터 발현되는 NTRK1 단백질에 대하여 아래의 표 4 및 표 5에 정리하였다.
The fragment of the NTRK1 protein is located at the base position of exon 9 of NM_001012331 (position 156830671-156844363 base on the (+) strand of chromosome 1) or exon 11 (position 156830671-156845312 base of the (-) strand on chromosome 1) And those having an amino acid sequence encoded by a nucleotide sequence from exon 12 (position 156830671 to 156845872 base on the (-) strand on chromosome 1) to the last exon were used. (Ac) of the 3 'end of exon 9, 2 nucleotides (gc) starting from the 3' end of exon 11, and 2 nucleotides starting from the 3 'end of exon 12 gt) contained sequences that did not form codons. The sequences that do not form the codon are fused to a fragment of the TPM3 protein to form a codon (gac or ggc or ggt) linked to one nucleotide (g) additionally contained in the fragment of the TPM3 protein, (D or G). The gene encoding the NTRK1 protein fragment and the NTRK1 protein expressed therefrom are summarized in Tables 4 and 5 below.

NTRK1 단백질의 단편을 암호화 하는 유전자A gene encoding a fragment of the NTRK1 protein NTRK1 유전자
(Accession No.)NTRK1 gene
(Accession No.) NTRK1 단백질의 암호화 부위 (CDS)The coding region (CDS) of the NTRK1 protein NTRK1 단백질 단편 암호화 부위:
엑손 기준NTRK1 protein fragment Encoding site:
Exon basis NTRK1 단백질 단편 암호화 부위:
cDNA 기준NTRK1 protein fragment Encoding site:
cDNA standard 염색체 상 절단위치Chromosomal cleavage site 3' 말단 방향 절단위치 염기서열3 'terminal direction cleavage site sequence NM_002529NM_002529 52~2447 (2396bp)
(서열번호 7)52 to 2447 (2396 bp)
(SEQ ID NO: 7) 엑손 9~엑손 17Exon 9 to exon 17 2nt (ac) +1236~2447 (1214bp) (서열번호 8)2nt (ac) + 1236 to 2447 (1214 bp) (SEQ ID NO: 8) chr1:156844363chr1: 156844363 5'- ac ACT AAC AGC ACA TCT -3' (서열번호 9)5'-ac ACT AAC AGC ACA TCT-3 '(SEQ ID NO: 9) NM_002529NM_002529 52~2447 (2396bp)
(서열번호 7)52 to 2447 (2396 bp)
(SEQ ID NO: 7) 엑손 11~엑손 17Exon 11 to exon 17 2nt (gc) + 1393~2447 (1055bp) (서열번호 10)2nt (gc) + 1393 to 2447 (1055 bp) (SEQ ID NO: 10) chr1:156845312chr1: 156845312 5'- gc CCG GCT GTG CTG GCT -3' (서열번호 11)5'-gc CCG GCT GTG CTG GCT-3 '(SEQ ID NO: 11) NM_002529NM_002529 52~2447 (2396bp)
(서열번호 7)52 to 2447 (2396 bp)
(SEQ ID NO: 7) 엑손 12~엑손 17Exon 12 to exon 17 2nt (gt) + 1540~2447 (908bp) (서열번호 12)2nt (gt) + 1540 to 2447 (908 bp) (SEQ ID NO: 12) chr1:156845872chr1: 156845872 5'- gt GTT CAC CAC ATC AAG -3' (서열번호 13)Gt; GTT CAC < / RTI > CAC ATC AAG -3 '(SEQ ID NO: 13)

NTRK1 단백질의 단편Fragment of NTRK1 protein NTRK1 단백질의 전체 길이 (Accession No.)The total length of the NTRK1 protein (Accession No.) NTRK1 단백질 단편 부위NTRK1 protein fragment site C-말단방향 절단위치 아미노산 서열C-terminal direction cleavage site Amino acid sequence 796aa (NP_002520)
(서열번호 14)796aa (NP_002520)
(SEQ ID NO: 14) 2nt (ac) + 400~796 (아미노산 서열: 서열번호 15)2nt (ac) + 400-796 (amino acid sequence: SEQ ID NO: 15) 2nt (ac) + TNSTS-C (아미노산 서열: 서열번호 16)2nt (ac) + TNSTS-C (amino acid sequence: SEQ ID NO: 16) 796aa (NP_002520)
(서열번호 14)796aa (NP_002520)
(SEQ ID NO: 14) 2nt (gc) + 453~796 (아미노산 서열: 서열번호 17)2nt (gc) + 453-796 (amino acid sequence: SEQ ID NO: 17) 2nt (gc) + PAVLA-C (아미노산 서열: 서열번호 18)2nt (gc) + PAVLA-C (amino acid sequence: SEQ ID NO: 18) 796aa (NP_002520)
(서열번호 14)796aa (NP_002520)
(SEQ ID NO: 14) 2nt (gt) + 502~796 (아미노산 서열: 서열번호 19)2nt (gt) + 502-796 (amino acid sequence: SEQ ID NO: 19) 2nt (gt) + VHHIK-C (아미노산 서열: 서열번호 20)2nt (gt) + VHHIK-C (amino acid sequence: SEQ ID NO: 20)

상기 TPM3 단백질 또는 그의 단편과 NTRK1 단백질 또는 그의 단편이 융합된 TPM3-NTRK1 융합단백질을 암호화하는 융합유전자(TPM3-NTRK1 융합유전자)는 5' 말단에 상기한 바와 같은 TPM3 단백질 또는 그의 단편을 암호화하는 뉴클레오타이드 분자 및 3' 말단에 상기한 바와 같은 NTRK1 단백질 또는 그의 단편을 암호화하는 뉴클레오타이드 분자를 포함하도록 제조하였다.
A fusion gene (TPM3-NTRK1 fusion gene) encoding the TPM3-NTRK1 fusion protein in which the NTM1 protein or a fragment thereof is fused with the TPM3 protein or a fragment thereof is a nucleotide encoding a TPM3 protein or a fragment thereof at the 5 ' And a nucleotide molecule encoding the NTRK1 protein or fragment thereof as described above at the 3 ' end.

상기 표에서 확인할 수 있듯이 본 발명에서 TPM3-NTRK1 융합유전자로 5' 말단 쪽에 NM_152263의 엑손 1부터 엑손 8까지의 뉴클레오타이드 서열과 3' 말단 쪽에 NM_002529의 엑손 9 또는 엑손 11 또는 엑손 12부터 엑손 17까지의 뉴클레오타이드 서열이 연결된 융합유전자를 제조하였다. 구체적으로, 본 발명에서 5' 말단 쪽에 NM_152263의 118번째부터 892번째까지의 뉴클레오타이드 서열(서열번호 2)과 3' 말단 쪽에 NM_002529의 1234번째 또는 1391번째 또는 1538번째부터 2447번째까지의 뉴클레오타이드 서열(서열번호 8; 서열번호10; 서열번호 12)이 연결된 TPM3-NTRK1 융합유전자(서열번호 21: 서열번호 3과 서열번호 9를 포함하는 염기서열; 서열번호 22: 서열번호 3과 서열번호 11을 포함하는 염기서열; 서열번호 23: 서열번호 3과 서열번호 13을 포함하는 염기서열)를 제조하였다.As shown in the above table, in the present invention, the nucleotide sequence of exon 1 to exon 8 of NM_152263 and the exon 9 or exon 11 or exon 12 to exon 17 of NM_002529 on the 3 'end of the 5' end of the TPM3-NTRK1 fusion gene A fusion gene to which the nucleotide sequence was linked was prepared. Specifically, in the present invention, the nucleotide sequence from the 118th to 892nd nucleotides of the NM_152263 (SEQ ID NO: 2) and the nucleotide sequence from the 1234th or 1391th or the 1338th to 2447th nucleotides of the NM_002529 (SEQ ID NO: 21: a base sequence comprising SEQ ID NO: 3 and SEQ ID NO: 9; SEQ ID NO: 22: a fusion protein comprising SEQ ID NO: 3 and SEQ ID NO: 11) Nucleotide sequence; SEQ ID NO: 23: a nucleotide sequence comprising SEQ ID NO: 3 and SEQ ID NO: 13).

또한, 본 발명에서 상기 제조된 TPM3-NTRK1 융합유전자로부터 암호화된 TPM3-NTRK1 융합단백질은 N말단 쪽에 NP_689476의 1번째부터 258번째까지의 아미노산 서열(서열번호 5)과 C말단 쪽에 NP_002520의 400번째 또는 453번째 또는 502번째부터 796번째까지의 아미노산 서열(서열번호 16; 서열번호 18; 서열번호 20)이 연결된 융합단백질(서열번호 24: 서열번호 6과 서열번호 16을 포함하는 아미노산 서열; 서열번호 25: 서열번호 6과 서열번호 18을 포함하는 아미노산 서열; 서열번호 26: 서열번호 6과 서열번호 20을 포함하는 아미노산 서열)이였다.
In addition, the TPM3-NTRK1 fusion protein encoded from the TPM3-NTRK1 fusion gene prepared in the present invention has an amino acid sequence (SEQ ID NO: 5) of NP_689476 at the N terminus and an amino acid sequence at the 400th or (SEQ ID NO: 24: an amino acid sequence comprising SEQ ID NO: 6 and SEQ ID NO: 16; SEQ ID NO: 25, SEQ ID NO: : An amino acid sequence comprising SEQ ID NO: 6 and SEQ ID NO: 18; SEQ ID NO: 26: an amino acid sequence including SEQ ID NO: 6 and SEQ ID NO: 20).

1-3. 1-3. NMNANMNA -- NTRK1NTRK1 융합유전자 및 Fusion genes and NMNANMNA -- NTRK1NTRK1 융합단백질Fusion protein

본 발명에서 대표적으로 실시한 LMNA 단백질을 암호화하는 LMNA 유전자는 인간 유래 LMNA 유전자를 이용하였으며 인간의 염색체 1번(q22)에 위치하며, 이로부터 암호화되는 LMNA 단백질은 총 아미노산 길이가 664aa인 단백질이다. LMNA 단백질 또는 LMNA 단백질 단편은 LMNA-NTRK1 융합단백질의 N 말단쪽 융합파트너이다.The LMNA gene encoding the LMNA protein, which is a representative example of the present invention, uses a human-derived LMNA gene and is located on chromosome 1 (q22) of human, and the LMNA protein encoded thereby is a protein having a total amino acid length of 664 aa. The LMNA protein or LMNA protein fragment is the N-terminal fusion partner of the LMNA-NTRK1 fusion protein.

본 발명에서는 GenBank accession no. NM_170707의 염기 서열을 갖는 TPM3 유전자를 사용하였다.
In the present invention, GenBank accession no. The TPM3 gene having the nucleotide sequence of NM_170707 was used.

상기 LMNA 단백질의 단편은 NM_170707의 6번째 엑손(1번 염색체 상의 (+)가닥을 기준으로 156084504~156105740 염기위치)까지의 뉴클레오타이드 서열에 의하여 암호화되는 아미노산 서열을 갖는 것을 사용하였다. 본 발명에서 사용한 상기 LMNA 단백질을 암호화하는 엑손 6의 3' 말단에 코돈(codon)을 이루지 못하는 1개의 뉴클레오타이드(c)가 존재하였다. 상기 LMNA 단백질의 단편을 암호화하는 유전자 및 이로부터 암호화되는 LMNA 단백질에 대하여 아래의 표 6 및 7에 정리하였다.
The fragments of the LMNA protein were those having an amino acid sequence encoded by a nucleotide sequence from the sixth exon of NM_170707 (positions 156084504 to 156105740 bases on the (+) strand of chromosome 1). There was one nucleotide (c) that did not form a codon at the 3 'end of exon 6 that encodes the LMNA protein used in the present invention. The genes encoding the fragments of the LMNA protein and the LMNA proteins encoded therefrom are summarized in Tables 6 and 7 below.

LMNA 단백질의 단편을 암호화 하는 유전자A gene encoding a fragment of LMNA protein LMNA 유전자
(Accession No.)LMNA gene
(Accession No.) LMNA 단백질 암호화 부위 (CDS)The LMNA protein coding region (CDS) LMNA 단백질 단편 암호화 부위: 엑손 기준LMNA protein fragment Encoding site: exon standard LMNA 단백질 단편 암호화 부위: cDNA 기준LMNA protein fragment Encoding site: cDNA standard 염색체 상 절단위치Chromosomal cleavage site 5' 말단 방향 절단위치 염기서열5 'terminal direction cleavage site sequence NM_170707NM_170707 250~2244 (1995bp) (서열번호 27)250 to 2244 (1995 bp) (SEQ ID NO: 27) 엑손 1~엑손 6Exon 1 to exon 6 250~1233 + 1nt (c) (985bp) (서열번호 28)250 to 1233 + 1 nt (c) (985 bp) (SEQ ID NO: 28) chr1:156105740chr1: 156105740 5'-GAG GAC TCA CTG GCC c-3' (서열번호 29)5'-GAG GAC TCA CTG GCC c-3 '(SEQ ID NO: 29)

LMNA 단백질의 단편Fragment of LMNA protein LMNA 단백질의 전체 길이 (Accession No.)The total length of the LMNA protein (Accession No.) LMNA 단백질 단편 부위LMNA protein fragment site N-말단방향 절단위치 아미노산 서열N-terminal direction cleavage site Amino acid sequence 664aa (NP_733821) (서열번호 30)664aa (NP_733821) (SEQ ID NO: 30) 1~328aa + 1nt (c) (아미노산 서열: 서열번호 31)1 to 328aa + 1nt (c) (amino acid sequence: SEQ ID NO: 31) N-EDSLA + 1nt (g) (아미노산 서열: 서열번호 32)N-EDSLA + 1nt (g) (amino acid sequence: SEQ ID NO: 32)

본 발명에서 상기 LMNA 단백질 또는 그의 단편과 NTRK1 단백질 또는 그의 단편이 융합된 LMNA-NTRK1 융합단백질을 암호화하는 융합유전자(LMNA-NTRK1 융합유전자)는 5' 말단에 상기한 바와 같은 LMNA 단백질 또는 그의 단편을 암호화하는 뉴클레오타이드 분자 및 3' 말단에 상기 실시예 1-2에서 설명한 NTRK1 단백질 또는 그의 단편을 암호화하는 뉴클레오타이드 분자를 포함하도록 제조하였다.
In the present invention, a fusion gene (LMNA-NTRK1 fusion gene) encoding the LMNA-NTRK1 fusion protein in which the NTRK1 protein or a fragment thereof is fused with the LMNA protein or a fragment thereof is obtained by introducing the LMNA protein or a fragment thereof at the 5 ' Encoding nucleotide molecule and a nucleotide molecule encoding the NTRK1 protein or fragment thereof described in Example 1-2 above at the 3 'end.

구체적으로 본 발명에서 5' 말단 쪽에 NM_170707의 엑손 1부터 엑손 6까지의 뉴클레오타이드 서열과 3' 말단 쪽에 NM_002529의 엑손 11부터 엑손 17까지의 뉴클레오타이드 서열이 연결된 LMNA-NTRK1 융합유전자를 제조하였다. 구체적으로, 본 발명에서는 5' 말단 쪽에 NM_170707의 250번째부터 1234번째까지의 뉴클레오타이드 서열(서열번호 28)과 3' 말단 쪽에 NM_002529의 1391번째부터 2447번째까지의 뉴클레오타이드 서열(서열번호10)이 연결된 LMNA-NTRK1 융합유전자(서열번호 33: 서열번호 29와 서열번호 11를 포함하는 염기서열)를 제조하였다.Specifically, in the present invention, the LMNA-NTRK1 fusion gene to which the nucleotide sequence from exon 1 to exon 6 of NM_170707 and the nucleotide sequence from exon 11 to exon 17 of NM_002529 were linked to the 3 'end of the 5' end was prepared. Specifically, in the present invention, a nucleotide sequence from the 250th to 1234th nucleotides of NM_170707 (SEQ ID NO: 28) and a nucleotide sequence from the 1391th to 2447th nucleotides of the NM_002529 (SEQ ID NO: 10) -NTRK1 fusion gene (SEQ ID NO: 33: a nucleotide sequence comprising SEQ ID NO: 29 and SEQ ID NO: 11).

또한, 본 발명에서 상기 제조된 LMNA-NTRK1 융합유전자로부터 암호화된 LMNA-NTRK1 융합단백질은 N-말단 쪽에 LMNA 단백질의 1번째부터 328번째까지의 아미노산 서열(서열번호 31)과 C-말단 쪽에 NTRK1 단백질의 453번째부터 796번째까지의 아미노산 서열(서열번호 18)이 연결된 융합단백질(서열번호 34: 서열번호 32와 서열번호 18을 포함하는 아미노산 서열)이었다.
In addition, the LMNA-NTRK1 fusion protein encoded by the LMNA-NTRK1 fusion gene prepared in the present invention has the N-terminal amino acid sequence (SEQ ID NO: 31) of the LMNA protein and the NTRK1 protein (SEQ ID NO: 34: amino acid sequence comprising SEQ ID NO: 32 and SEQ ID NO: 18) was linked to the amino acid sequence of SEQ ID NO: 18 from SEQ ID NO:

실시예Example 2: 2: NTRK1NTRK1 융합유전자 또는 이의 The fusion gene or its 전사산물Warrior product 측정 수단 Measuring means

본 발명에서는 NTRK1 융합유전자(TPM3 유전자 또는 LMNA 유전자와 NTRK1 유전자의 융합유전자) 및 이에 암호화된 융합전사산물과 융합단백질을 갖는 대장암 환자의 빈도 및 임상예후에 대하여 대장암 환자에서 분석하기 위해서, 해당 NTRK1 융합유전자 또는 이의 전사산물을 측정하고자 하였다. 이를 위해서는 다양한 방법이 가능하나 본 발명에서는 NTRK1 융합유전자에 혼성가능한 형광직접접합법(fluorescence in situ hybridization: FISH)용 프로브(NTRK1 Split FISH Probe, Cat# FS0024, Abnova, Taiwan)를 사용하였으며, NTRK1 융합전사산물에 혼성가능한 연쇄중합반응용 프라이머를 제조하였다.
In the present invention, in order to analyze the frequency and clinical prognosis of patients with colorectal cancer having the NTRK1 fusion gene (fusion gene of TPM3 gene or LMNA gene and NTRK1 gene) and the fusion fusion product and fusion protein encoded thereby, NTRK1 fusion gene or its transcription product. However, in the present invention, a probe for fluorescence in situ hybridization (FISH) (NTRK1 Split FISH Probe, Cat # FS0024, Abnova, Taiwan) capable of hybridizing with the NTRK1 fusion gene was used and NTRK1 fusion transcription A hybridizable mid-chain hybrid application primer to the product was prepared.

NTRK1 융합유전자에 혼성 가능한 형광직접접합법용 프로브Probes for fluorescence direct conjugation that can hybridize to the NTRK1 fusion gene 접합 부위Junction 길이Length 표지 물질Labeling substance 서열번호 7의 5' 말단
(chr1: 156390000~156814000)The 5 'end of SEQ ID NO: 7
(chr1: 156390000 to 156814000) 약 420 kbApproximately 420 kb TexRedTexRed 서열번호 7의 3' 말단
(chr1: 156851000~157630000)The 3 ' end of SEQ ID NO: 7
(chr1: 156851000 to 157630000) 약 780 kbApproximately 780 kb FITCFITC

NTRK1 융합전사산물에 혼성 가능한 연쇄중합반응용 프라이머Synthesized conjugate application primers hybridizable to the NTRK1 fusion transcription product 융합유전자Fusion gene 융합 부위Fusion site Forward Primer SequenceForward Primer Sequence Reverse Primer SequenceReverse Primer Sequence TPM3-NTRK1TPM3-NTRK1 서열번호 21;
서열번호 22;
서열번호 23SEQ ID NO: 21;
SEQ ID NO: 22;
SEQ ID NO: 23 5'- AAG AAG ATA AAT ATG AGG -3'
(서열번호 35)5'-AAG AAG ATA AAT ATG AGG-3 '
(SEQ ID NO: 35) 5'- CCG TGC CGC ATA TAC TCA AA -3'
(서열번호 36)5'-CCG TGC CGC ATA TAC TCA AA-3 '
(SEQ ID NO: 36) LMNA-NTRK1LMNA-NTRK1 서열번호 33SEQ ID NO: 33 5'- CCA GGT GGA GCA GTA TAA GAA G -3'
(서열번호 37)5'-CCA GGT GGA GCA GTA TAA GAA G -3 '
(SEQ ID NO: 37) 5'- TGT GGG TTC TCG ATG ATG TG -3'
(서열변호 38)5'-TGT GGG TTC TCG ATG ATG TG -3 '
(Sequence defense 38)

또한, NTRK1 융합단백질의 검출은 상기 융합단백질 또는 대장암에서 NTRK1 유전자의 융합에 의존적으로 발현하는 NTRK1 단백질과 상호작용하는 물질, 예컨대 특이적으로 결합하는 물질(예컨대, 항체, 앱타머 또는 화합물 등)을 이용하여 상기 융합단백질과 상기 물질(예컨대, 항체 또는 앱타머)과의 상호작용, 즉 복합체 형성을 검출하는 통상적인 에세이법, 예컨대, 면역크로마토그래피(immunochromatography), 면역조직학염색(immunohistochemistry), 효소결합 면역흡착 분석(enzyme linked immunosorbent assay), 방사선 면역측정법(radioimmunoassay), 효소 면역분석(enzyme immunoassay), 형광면역분석(fluorescence immunoassay), 발광면역분석(luminescence immunoassay), 웨스턴블라팅(western blotting) 및 유세포분석(flow cytometry cell sorting) 등에 의하여 검출할 수 있다. 예컨대, NTRK1 단백질을 검출하는 면역조직학검사용 항체를 아래 표 10에 예시하였다. 본 발명에서는 NTRK1 단백질을 검출하는 면역조직학검사용 항체(Anti-NTRK1 Antibody EP1058Y (Cat# TA301109, OriGene, USA))를 이용하였다.
In addition, the detection of the NTRK1 fusion protein may be accomplished by using a substance that specifically interacts with the NTRK1 protein, such as an antibody, an aptamer or a compound, that expresses in a fusion protein or a colon cancer in a manner dependent on the fusion of the NTRK1 gene, Such as immunochromatography, immunohistochemistry, enzyme-linked immunosorbent assay (ELISA), and the like, to detect the interaction of the fusion protein with the material (e.g., antibody or aptamer) An enzyme-linked immunosorbent assay, a radioimmunoassay, an enzyme immunoassay, a fluorescence immunoassay, a luminescence immunoassay, a western blotting, Can be detected by flow cytometry cell sorting or the like. For example, antibodies for immunohistochemistry for detecting NTRK1 protein are shown in Table 10 below. In the present invention, an antibody for immunohistochemistry (Anti-NTRK1 Antibody EP1058Y (Cat # TA301109, OriGene, USA)) for detecting NTRK1 protein was used.

NTRK1 융합단백질과 상호작용하는 면역조직학검사용 항체Antibodies for immunohistochemistry that interact with NTRK1 fusion proteins 항원antigen 작용 부위Action site 음성대조Negative contrast 양성대조Positive contrast 기타 적용Other applications NTRK1 단백질 아미노산 서열 중 791번째 타이로신 및 그 주변의 펩타이드The 791st tyrosine of the NTRK1 protein amino acid sequence and its surrounding peptides NTRK1 단백질의 카이네이즈 도메인 C-말단The kinase domain C-terminus of the NTRK1 protein 임파구(lymphocytes)Lymphocytes 신경절 세포(ganglion cells)Ganglion cells 웨스턴블라팅, 형광면역분석, 유세포분석Western blotting, fluorescence immunoassay, flow cytometry

실시예Example 3: 임상 조직을 이용한 실험 3: Experiment using clinical tissue

본 발명에 사용된 신선 동결(fresh frozen) 임상 조직은 2008년부터 2012년에 부산대학교병원 및 화순전남대학교병원에서 외과적 절제를 받은 대장암 환자로부터 수집되어 한국인체자원은행 부산대학교병원 거점은행 및 화순전남대학교병원 거점은행에서 보관 후 분양되었다. 대장암 조직은 국제 암게놈 컨소시움(International Cancer Genome Consortium: ICGC)의 지침에 따라 선발되었다. 예컨대, 임상조직은 병리학적 진단 후 60% 이상의 암세포 및 20% 미만의 괴사세포(necrotic cells) 또는 정상세포로 구성된 임상 조직만을 사용하였다. 대장암 가족력이 있는 대장암 환자로부터 수집된 대장암 임상 조직은 제외되었다. 최종적으로 150례의 대장암 조직 및 50례의 정상 대장 조직이 사용되었다.
The fresh frozen clinical tissue used in the present invention was collected from patients with colorectal cancer who underwent surgical resection at Pusan National University Hospital and Chonnam National University Hospital from 2008 to 2012, It was preserved at Chonnam National University Hospital Hwasoon branch of Hwasun city and sold. Colon cancer tissues were selected according to the guidelines of the International Cancer Genome Consortium (ICGC). For example, clinical tissues used only clinical tissues composed of more than 60% cancer cells and less than 20% necrotic cells or normal cells after pathological diagnosis. Colon cancer clinical specimens collected from colon cancer patients with family history of colon cancer were excluded. Finally, 150 colon cancer tissues and 50 normal colon tissues were used.

실시예Example 4: 4: 현미부수체Brown rice 불안정성 분석( Instability analysis microsatellitemicrosatellite instabilityinstability analysis분석 : : MSIMSI ))

상기 실시예 3의 대장암 임상조직 중 포르말린 고정 후 파라핀으로 포매된(formalin-fixed paraffin-embedded) 조직에서 전체 DNA를 추출한 후 베데스다 마커(BAT26, D5S346, BAT25, D17S250, D2S123)를 이용하여 현미부수체 불안정성을 조사하였다. The whole DNA was extracted from formalin-fixed paraffin-embedded tissues after formalin fixation in the colon cancer clinical tissue of Example 3, and then analyzed using Bethesda marker (BAT26, D5S346, BAT25, D17S250, D2S123) Body instability.

구체적으로, 전체 DNA는 QIAmp DNA FFPE Tissue kit (키아젠사, 독일)을 이용하여 추출하였다. DNA 순도 및 농도는 ND-100 분광광도계(spectrophotometer) (나노드롭테크놀로지사, 미국)로 측정하였다. 다중(multiplex) 연쇄중합반응 후 획득한 증폭산물의 길이와 신호강도에 따라 현미부수체 불안정성을 판단하였다. 추출된 DNA의 불완전성 등으로 연쇄중합반응이 실패한 경우 DNA중합요소 엡실론(DNA polymerase epsilon: POLE)를 포함한 주요 유전자의 체세포 돌연변이의 총 수를 고려하여 현미부수체 불안전성을 판단하였다.
Specifically, the whole DNA was extracted using a QIAmp DNA FFPE Tissue kit (KIAJEN, Germany). DNA purity and concentration were determined by ND-100 spectrophotometer (NanoDrop Technologies, USA). Microsatellite instability was determined according to the amplification product length and signal intensity obtained after multiplex polymerisation. When chain polymerization failed due to incompleteness of the extracted DNA, brown solid anxiety was judged by considering the total number of somatic mutations of the major gene including DNA polymerase epsilon (POLE).

실시예Example 5: 차세대 5: Next generation 전사체Transcript 염기서열 분석 ( Sequence analysis ( RNARNA -- seqseq ))

전체 RNA는 RNAiso Plus(타카다 바이오사, 일본)로 추출하여 DNase I(키아젠사)로 전처리 한 다음 RNeasy Mini kit(키아젠사)로 정제하였다. RNA 완전성은 Bioanalyer (아질런트사, 미국)로 분석하였고, RNA 완전성 지표(RNA Integrity Number: RIN)가 6 이상인 대장암 임상 조직 유래 RNA를 RNA-seq에 적용하였다. RNA-seq을 위한 라이브러리는 TruSeq RNA sample preparation kit (일루미나사, 미국)으로 제작하였다. 예컨대, mRNA은 폴리-T 올리고가 부착된 마그네틱 비드(magnetic beads)로 회수한 후 음향전단(acoustic shearing) 방식으로 파편화하였다. 파편화 된 mRNA로부터 역전사효소(reverse transcriptase) 및 랜덤핵사머(random hexamer)를 이용하여 1차 cDNA를 제작하였고, DNA 중합효소 I(DNA polymerase I)과 RNase H를 이용하여 2차 cDNA를 제작하였다. 2차 cDNA는 어뎁터를 접합 한 후 연쇄중합반응을 통해 cDNA 라이브러리를 완성하였다. cDNA 라이브러리는 HiSeq 2000 (일루미나사)를 이용하여 염기서열을 분석하였고, 약 100백만개의 101bp 길이의 페어드엔드 리드(paired-end reads)가 생산하였다.
Total RNA was extracted with RNAiso Plus (Takada Bioscience, Japan), pre-treated with DNase I (Kiagen), and purified with RNeasy Mini kit (KIAJEN). RNA integrity was analyzed by Bioanalyer (Agilent, USA) and RNA-derived RNA from colon cancer clinical tissue with an RNA Integrity Number (RIN) of 6 or greater was applied to the RNA-seq. The library for RNA-seq was prepared with a TruSeq RNA sample preparation kit (Illuminas, USA). For example, mRNA was recovered with magnetic beads attached with poly-T oligo and then fragmented by acoustic shearing. Primary cDNA was prepared from the fragmented mRNA using reverse transcriptase and random hexamer and secondary cDNA was prepared using DNA polymerase I and RNase H. Secondary cDNA was ligated to the adapter and then subjected to a chain polymerization to complete a cDNA library. The cDNA library was sequenced using HiSeq 2000 (Illuminosa) and produced approximately 100 million paired-end reads of 101 bp in length.

실시예Example 6: 염기서열 분석 6: Sequence analysis

상기 실시예 5에서 차세대 전사체 염기서열 분석을 통해 생산한 페어드엔드 리드는 GSNAP 및 TopHat을 이용하여 5% 불일치를 허용하는 조건에서 국립생명공학정보센터(National Center for Biotechnology Information: NCBI, 미국)에서 제공하는 인간참조유전체(hg 19)에 얼라인(align) 하였다. 페어드엔드 리드는 또한 공공데이터베이스로부터 추출한 161,250개의 mRNA (RefSeq 데이터베이스로부터 36,742개의 mRNA, USCS 데이터베이스로부터 73,671개의 mRNA 및 Ensembl 데이터베이스로부터 161,214개의 mRNA)로 구성한 인간참조전사체에 얼라인하여 mRNA 편집(mRNA splicing)에 기인한 미스얼라인(misalignment)을 최소화하였다.
The paired end leads produced by the sequencing analysis of the next generation transcript in Example 5 were obtained from National Center for Biotechnology Information (NCBI, USA) under conditions permitting 5% discrepancy using GSNAP and TopHat. Lt; / RTI > (hg 19), which is provided by < RTI ID = 0.0 > Fairend Reid also mRNA splicing by aligning to a human reference transcript consisting of 161,250 mRNAs extracted from the public database (36,742 mRNA from the RefSeq database, 73,671 mRNA from the USCS database, and 161,214 mRNA from the Ensembl database) To minimize misalignment.

실시예Example 7: 융합유전자 분석 7: Fusion gene analysis

인프레임(in-frame) 융합유전자는 GFP 알고리즘을 통해 검색하였고, deFuse 및 FusionMap 알고리즘으로 교차검증하였다. 예컨대, GFP를 통한 융합유전자는 다음의 조건을 통해 검색하였다.The in-frame fusion gene was searched through the GFP algorithm and cross-validated with the deFuse and FusionMap algorithms. For example, fusion genes through GFP were searched under the following conditions.

(1) 융합전사체를 구성하는 상이한 공여유전자(donor gene)와 수납유전자(acceptor gene)에 얼라인 된 리드쌍(discordant read pairs) 및 융합 부위(exonic fusion breakpoint)에 얼라인된 리드(exon-spanning reads)가 존재하는 경우 수락한다.(1) The discordant read pairs and the exon-fusion breakpoints are ligated to different donor genes and acceptor genes constituting the fusion body, spanning reads, if any.

(2) 공여 및 수납유전자 DNA 가닥의 방향성(strand orientation)이 염색체 전좌 또는 역위 또는 결실을 통해 유전자융합을 설명하는 경우 수락한다.(2) Donor and housekeeping genes If the strand orientation of the DNA strand describes gene fusion through chromosomal translocation or inversion or deletion, accept it.

(3) 염기서열의 상동성이 높은 경우 (E값 < 0.01) 제외한다.(3) When the homology of the nucleotide sequence is high (E value <0.01).

(4) 거리가 10bp 미만의 비논리적 리드쌍인 경우 제외한다.
(4) Excludes cases where the distance is less than 10bp for illogical lead pairs.

융합유전자는 다음의 조건을 통해 확정하였다.The fusion gene was confirmed by the following conditions.

(1) GFP 또는 deFuse 또는 FusionMap 중 2개의 알고리즘에서 검색된 융합유전자를 수락한다.(1) Accept fusion genes found in two algorithms: GFP or deFuse or FusionMap.

(2) 공여유전자와 수납유전자에 얼라인된 리드쌍이 10개 이상인 경우 수락한다.(2) Accept if there are more than 10 lead pairs aligned in the donor gene and in the storage gene.

(3) 융합 부위에 얼라인된 리드가 10개 이상인 경우 수락한다.(3) Accept if more than 10 leads are aligned at the fusion site.

(4) 공여유전자와 수납유전자가 동일한 염색체 내에 존재하고 유전자간 거리가 100Kb 이상인 경우 수락한다.(4) Accepts when the donor gene and the storage gene are present in the same chromosome and the intergenic distance is 100 Kb or more.

(5) 아웃프레임(out-frame) 융합유전자는 제외한다.(5) The out-frame fusion gene is excluded.

(6) 융합 부위 이후에 존재하는 수납유전자의 엑손들이 융합되지 않은 수납유전자의 엑손들과 비교하여 과발현된 경우 수락한다.
(6) The exon of the stored gene existing after the fusion site is accepted when compared with the exon of the unfused storage gene.

실시예Example 8: 이상발현유전자( 8: abnormal expression gene ( differentiallydifferentially expressedexpressed genegene : : DEGDEG ) 분석) analysis

이상발현유전자는 미국 브로드연구소(Broad Institute)의 GenePattern 소프트웨어를 이용하여 분석하였다. 차세대 전사체 염기서열분석을 통해 생산하고 TopHat으로 얼라인된 리드는 Cufflinks를 이용하여 전사체로 조합하였다. 각각의 유전자별로 얼라인된 리드의 수는 FPKM (Fragments Per Kilobase of exon per Million) 방법으로 표준화한 후 각각의 유전자별 발현량으로 표시하였다. 80% 이상의 임상조직에서 0 이상의 FPKM을 갖는 18,725개의 유전자 발현량을 기초로 전체 임상조직을 PCA (Principle Component Analysis)로 분석하여 이상치(outlier) 임상조직 3개(6쌍)를 제거하였다. PCA 분석을 통해 총 147개의 대장암 임상조직 및 47개의 정상 임상조직에서 얻어진 18,725개의 유전자가 이상발현유전자 분석의 대상이 되었다. 대장암 임상조직에서 특이적인 이상발현유전자는 쌍체 양측 t검정(pairwise 2-sided t-test) (p < 0.05) 및 Benjamini-Hochart 다중비교보정(FDR < 0.05)를 통해 검색하였다.
The overexpressed genes were analyzed using the GenePattern software of the Broad Institute of the United States. The next generation transcript was sequenced and TopHat aligned leads were assembled into a transcript using Cufflinks. The number of lined leads for each gene was standardized by FPKM (Fragments Per Kilobase of Exon per Million) method and expressed as the expression level of each gene. Based on the expression of 18,725 genes with over 0 FPKM in over 80% of clinical tissues, all clinical tissues were analyzed by PCA (Principle Component Analysis) to remove 3 outlier clinical tissues (6 pairs). Through the PCA analysis, 18,725 genes obtained from 147 colon cancer clinical specimens and 47 normal clinical specimens were subjected to abnormal expression gene analysis. Specific abnormal expression genes in colon cancer tissues were retrieved by pairwise 2-sided t-test (p <0.05) and Benjamini-Hochart multiple comparison correction (FDR <0.05).

실시예Example 9: 체세포 돌연변이( 9: somatic cell mutation ( nonnon -- synonymoussynonymous somaticsomatic mutationmutation ) 분석) analysis

단일염기변이(single nucleotide variances: SNVs)는 차세대 전사체 염기서열분석을 통해 생산된 리드를 GSNAP을 통해 인간참조전사체에 얼라인하여 검색하였다. 단일염기변이는 다음의 조건을 통해 검색하였다.Single nucleotide variances (SNVs) were searched by next-generation transcript sequencing analysis by aligning leads produced by GSNAP to human reference transcripts. Single base mutations were detected under the following conditions.

(1) 위치 특이적으로 얼라인되는 리드의 수가 2개 이상인 경우 수락한다.(1) Accept if the number of leads that are position-specifically aligned is two or more.

(2) 상기 위치의 평균 염기품질이 20 이상인 경우 수락한다.(2) Accept if the average base quality at this location is 20 or more.

(3) 상기 위치의 대립유전자(allele) 빈도가 3% 이상인 경우 수락한다.(3) Accept if the frequency of the allele in the location is 3% or more.

(4) 상기 위치에 얼라인된 리드의 수가 10개 이상인 경우 수락한다.(4) If the number of leads aligned at the above position is 10 or more, accept.

유전자의 어노테이션(annotation)은 RefSeq을 참고하고, 생식세포 돌연변이(germline variants)는 다음의 조건을 통하여 제거하였다.The annotation of the gene was referred to RefSeq and the germline variants were removed under the following conditions.

(1) dbSNP137와 비교하여 전체 임상조직 중 대립유전자형(minor allele)의 빈도가 1% 이상인 경우 제외한다.(1) Excludes cases in which the frequency of minor alleles in all clinical tissues is greater than 1%, as compared to dbSNP137.

(2) 59명의 한국인참조유전체와 비교하여 정상인에서 관찰된 생식세포 돌연변이는 제외한다.(2) Excludes germ cell mutations observed in normal individuals compared to 59 Korean reference genomes.

(3) 본 발명에서 사용된 47례의 정상 대장조직에서 관찰된 생식세포 돌연변이는 제외한다.(3) The germ cell mutations observed in 47 normal colon tissues used in the present invention are excluded.

정상 대장조직쌍이 없는 대장암 임상조직에서는 예상치보다 높은 체세포 돌연변이의 빈도가 관찰되는 점을 고려하여, 체세포 돌연변이는 음성선발(negative selection)을 기준으로 검색하였다.
Considering that the incidence of somatic mutations higher than expected in the colon cancer clinical tissue without normal colon tissue pairs was observed, somatic cell mutations were searched based on negative selection.

실시예Example 10: 역전사연쇄중합반응( 10: Reverse transcription polymerization reversereverse transcriptasetranscriptase -- polymerase중합체 chainchain reaction: reaction: RTRT -- PCRPCR )과 )and 생어시퀀싱Fishing Sequencing (( SangerSanger sequencingsequencing ))

NTRK1 융합전사체는 역전사연쇄중합반응 및 생어시퀀싱을 통해 확인하였다. 연쇄중합반응의 조건은 아래의 표 11에 정리하였다.
NTRK1 fusion transcripts were confirmed by reverse transcription polymerase chain reaction and seaweed sequencing. The conditions of the chain polymerization reaction are summarized in Table 11 below.

NTRK1 융합전사체 확인용 연쇄중합반응 조건Sequence polymerization conditions for confirmation of NTRK1 fusion transcript 공여유전자Donor gene 수납유전자Storage gene 프라이머primer 연쇄중합반응 조건Chain polymerization reaction conditions 연쇄중합반응 산물Chain polymerization product TPM3 (엑손 8)TPM3 (exon 8) NTRK1 (엑손 9 또는 엑손 11 또는 엑손 12)NTRK1 (exon 9 or exon 11 or exon 12) 서열번호 35 및 서열번호 36SEQ ID NO: 35 and SEQ ID NO: 36 변성(denaturation): 94도 30초
결합(annealing): 55.7도 30초
신장(extension)
72도 30초
총 30회Denaturation: 94 degrees 30 seconds
Annealing: 55.7 degrees 30 seconds
Extension
72 degrees 30 seconds
30 times in total 406bp
553bp
712bp406bp
553bp
712bp LMNA (엑손 6)LMNA (exon 6) NTRK1 (엑손 11)NTRK1 (exon 11) 서열번호 37 및 서열번호 38SEQ ID NO: 37 and SEQ ID NO: 38 변성: 94도 30초
결합: 54도 30초
신장: 72도 30초
총 30회Denatured: 94 degrees 30 seconds
Combination: 54 degrees 30 seconds
Height: 72 degrees 30 seconds
30 times in total 354bp354bp

연쇄중합반응의 산물은 TOPO TA 벡터(라이프 테크놀로지사, 미국)에 삽입한 다음 생어법으로 시퀀싱하였다.
The product of the chain polymerization was inserted into the TOPO TA vector (Life Technologies, USA), and sequenced by a bioluminescence method.

실시예Example 11: 형광직접접합( 11: Fluorescence direct bonding ( fluorescence여광체 inin situsitu hybridizationhybridization : : FISHFISH ))

NTRK1 융합유전자는 NTRK1 유전자의 5' 말단 및 3' 말단에 각각 결합하는 분리 형광직접접합 (split FISH) 프로브를 이용하여 확인하였다(아브노바사, 대만). 예컨대, 포르말린 고정 후 파라핀으로 포매 된 대장암 및 정상 임상조직의 절단면은 파라핀을 제거하고, 프로티아제를 처리한 다음 75도에서 변성하였다. 변성된 임상조직의 절단면은 표 8에 정리된 프로브와 함께 ThermoBrite (아보트사, 미국)에서 17시간 배양하였다. 배앙된 임상조직의 절단면은 세척 후 DAPI(아보트사)를 이용하여 대조염색하였고, TexRed와 FITC 신호의 거리를 관찰하였다.
The NTRK1 fusion gene was identified using a split fluorescent direct probe (Avnova, Taiwan) that binds to the 5 'and 3' ends of the NTRK1 gene, respectively. For example, cut sections of paraffin-embedded colon cancer and normal clinical tissue after formalin fixation were de-paraffinized and denatured at 75 degrees after treatment with protease. The sections of the denatured clinical tissue were incubated for 17 hours in ThermoBrite (Abbott, USA) with the probes listed in Table 8. The cut sections of the clinical tissues were cleaned and stained with DAPI (Abbott), and the distance between TexRed and FITC signals was observed.

실시예Example 12: 면역조직학분석( 12: Immunohistochemical analysis immunohistochemistryimmunohistochemistry ) 및 임상병리학적() And clinicopathological ( clinicopathologicalclinicopathological ) 분석) analysis

NTRK1 융합단백질은 NTRK1 유전자 융합에 의존적으로 발현한다는 점에 착안하여, NTRK1 융합단백질의 발현은 표 10에 정리된 NTRK1 단백질 특이적인 항체(오리진사, 미국)를 이용하여 면역조직학적으로 확인하였다. 뇌의 신경절 세포 및 임파구는 각각 NTRK1 융합단백질 발현의 면역조직학분석의 양성대조군 및 음성대조군으로 사용하였다. The expression of NTRK1 fusion protein was immunohistologically confirmed using the NTRK1 protein specific antibody (Orijin, USA) as shown in Table 10, in view of the fact that NTRK1 fusion protein expresses dependent on NTRK1 gene fusion. Brain ganglion cells and lymphocytes were used as positive and negative controls for immunohistochemical analysis of NTRK1 fusion protein expression, respectively.

NTRK1 융합단백질의 발현 빈도는 216명의 한국인 대장암 환자 코호트 및 472명의 중국 대장암 환자 코호트(바이오맥스사, 미국)에서 구축한 조직마이크로어레이(tissue microarray)를 이용하여 면역조직학적으로 분석하였다. NTRK1 융합단백질 발현의 면역조직학분석을 통해 한국인 대장암 환자는 NTRK1 융합단백질 양성환자와 음성환자로 계층화하였고, 계층화된 환자군은 대장암의 위치, 대장암 침윤도(invasion depth), 신경주위침윤(perineural invasion), 림프혈관색전(lymphovascular emboli), 임파절전이(lymphnode metastasis), 현미부수체 불안정성 및 10년 생존율을 비교하였다.
The expression frequency of the NTRK1 fusion protein was analyzed immunohistochemically using a tissue microarray constructed in 216 Korean colon cancer patient cohorts and 472 Chinese colon cancer patient cohorts (Biomax, USA). In immunohistochemical analysis of NTRK1 fusion protein expression, Korean patients with colorectal cancer were stratified into NTRK1 fusion protein positive patients and negative patients. In the stratified group, colon cancer location, invasion depth, perineural invasion ), Lymphovascular emboli, lymphnode metastasis, microscopic instability, and 10-year survival rates.

실시예Example 13: 13: NIH3T3NIH3T3 세포주에서 In the cell line NTRK1NTRK1 융합유전자의 발현 Expression of fusion gene

TPM3, LMNA 및 NTRK1 유전자의 cDNA는 오리진사로부터 구입하였다. LMNA (엑손 6)-NTRK1 (엑손 11) 및 TPM3 (엑손 8)-NTRK1 (엑손 9 또는 엑손 11 또는 엑손 12) 융합전사산물은 오버랩익스텐션(overlap extension) 연쇄중합반응을 통해 제작하였다. 제작된 융합전사산물은 Cold Fusion Cloning kit (시스템 바이오사이언스사, 미국)를 이용하여 IRES-GFP가 장착된 pcDNA3.1 (라이프 테크놀로지사)에 클로닝하였다. 클로닝된 융합유전자는 생어시퀀싱을 통해 확인한 후 TPM3-NTRK1 및 LMNA-NTRK1 융합전사산물 발현 벡터를 완성하였다. NIH3T3 세포주는 한국세포주은행에서 구입하였다. 융합전사산물 발현 벡터는 FUGENE 6 (로슈, 독일)을 이용하여 NIH3T3 세포주에 도입하였다. 형질전환 된 NIH3T3 세포주는 G418(라이크 테크놀로지사)를 이용하여 선발하였고, GFP 단백질 발현을 이용한 유세포 분석을 통해 완성하였다.
The cDNAs of the TPM3, LMNA and NTRK1 genes were purchased from Origin. The fusion products of LMNA (exon 6) -NTRK1 (exon 11) and TPM3 (exon 8) -NTRK1 (exon 9 or exon 11 or exon 12) were prepared by overlap extension chain polymerization. The resulting fusion transcription product was cloned into pcDNA3.1 (Life Technologies) equipped with IRES-GFP using a Cold Fusion Cloning kit (System Biosciences, USA). The cloned fusion gene was confirmed by bioassay sequencing and the fusion gene expression vector for TPM3-NTRK1 and LMNA-NTRK1 fusion was completed. The NIH3T3 cell line was purchased from the Korean Cell Line Bank. Fusion transcription product expression vector was introduced into NIH3T3 cell line using FUGENE 6 (Roche, Germany). The transformed NIH3T3 cell line was selected using G418 (RIKE TECHNOLOGY INC.) And completed by flow cytometry analysis using GFP protein expression.

실시예Example 14: 14: NTRK1NTRK1 융합유전자의 Of the fusion gene 종양성Bred 분석 analysis

TPM3-NTRK1 또는 LMNA-NTRK1 융합유전자의 종양성은 형질전환 된 NIH3T3 세포주의 체외 세포괴 형성능 및 체내 종양성 평가를 통해 확인하였다. 체외 세포괴 형성능은 총 1,200개의 형질전환 된 NIH3T3 세포를 0.3% 상단 아가로즈(agarose)에 도입한 다음 0.5% 하단 아가로즈 위에서 3주일간 배양하여 형성된 세포괴의 수와 크기를 관찰하였다. 융합유전자가 없는 공벡터가 도입된 NIH3T3 세포주와 TPM3-NTRK1 융합유전자가 있는 KM12 대장암 세포주는 각각 음성 및 양성대조군으로 사용되었다. 형성된 세포괴는 0.05% 크리스탈 바이올렛으로 염색한 다음 광학현미경으로 개수하였다. 체내 종양성은 총 백만 개의 형질전환 된 NIH3T3 세포 또는 KM12 세포를 면역능이 결핍된 누드마우스의 등쪽 피하에 접종한 후 관찰하였다. 실험군별로 5마리의 누드마우스를 사용하였다. 세포 접종 후 측정 가능한 종양이 형성되면 3일 간격으로 세포 접종 후 총 32일 동안 종양의 크기를 측정하였다.
Tumorigenicity of the TPM3-NTRK1 or LMNA-NTRK1 fusion gene was confirmed by the ability of the transformed NIH3T3 cell line to in vitro cell mass-forming ability and the tumorigenicity. In vitro cell mass formation ability was examined by counting 1,200 transfected NIH3T3 cells in 0.3% top agarose and culturing for 3 weeks on 0.5% lower agarose. The NIH3T3 cell line into which the empty vector without the fusion gene was introduced and the KM12 colon cancer cell line with the TPM3-NTRK1 fusion gene were used as negative and positive control, respectively. The formed cell mass was stained with 0.05% crystal violet and then reconstituted with an optical microscope. In vivo tumorigenicity was observed after inoculation of a total of one million transgenic NIH3T3 cells or KM12 cells in dorsal subcutaneously of nude mice lacking immunity. Five nude mice were used for each experimental group. After cell inoculation, the size of the tumor was measured for a total of 32 days after cell inoculation at 3 days intervals when a measurable tumor was formed.

실시예Example 15: 약물 스크리닝 15: Drug Screening

NTRK1 융합단백질의 기능 저해를 통한 대장암 치료 약물 스크리닝 방법은 KM12 세포주를 이용하여 확인하였다. 다중 티로신키나아제 저해제인 레스타우르티닙과 크리조티닙은 토크리스 바이오사이언스사 (영국)에서 구입하였고, NTRK1 단백질 특이적인 저해제인 ARRY-470은 LG생명과학에서 합성하였고, 한국과학기술원에서 제공받은 화합물을 상기 본 발명의 스크리닝 방법을 통하여 스크리닝한 결과 공개특허 제10-2013-0106186호에 개시된 화합물 8(본 발명에서 5f) 및 화합물 27(본 발명에서 6s)이 스크리닝되었다. 상기 스크리닝된 화합물 8(5f)은 하기 화학식 1로 표시되는 화합물이고 화합물 27(6s)는 하기 화학식 2로 표시되는 화합물이다.The screening method for the treatment of colorectal cancer by inhibiting the function of the NTRK1 fusion protein was confirmed using a KM12 cell line. The multi-tyrosine kinase inhibitors Lestaurintinib and Krystotinib were purchased from Torquis Biosciences (UK). ARRY-470, a specific inhibitor of NTRK1 protein, was synthesized by LG Life Sciences and the compounds provided by Korea Advanced Institute of Science and Technology As a result of screening through the screening method of the present invention, Compound 8 (5f in the present invention) and Compound 27 (6s in the present invention) disclosed in Japanese Patent Laid-Open No. 10-2013-0106186 were screened. The screened compound 8 (5f) is a compound represented by the following formula (1) and the compound 27 (6s) is a compound represented by the following formula (2).

[화학식 1][Chemical Formula 1]

[화학식 2](2)

상기 NTRK1 융합유전자 또는 이에 암호화된 NTRK1 융합단백질의 활성을 저해하여 대장암을 치료할 수 있는 약물을 스크리닝 한 결과, 다중 티로신키나아제 억제제 중 NTRK1 단백질에 활성을 갖는 레스타우르티닙은 약 10.7 nM의 농도에서 KM12 세포의 성장을 50% 억제하였고, NTRK1 단백질에 선택적인 저해제인 ARRY-470은 약 3.2 nM에서 KM12 세포의 성장을 50% 억제하였다. 반면, NTRK1 단백질에 낮은 활성을 갖는 크리조티닙은 약 184.8 nM의 농도에서 KM12 세포의 성장을 50% 억제하였다. NTRK1 단백질에 높은 선택성을 갖는 공개특허 제10-2013-0106186호에 개시된 화합물 8(본 발명에서 5f) 및 화합물 27(본 발명에서 6s)은 낮은 세포투과도 때문에 비교적 낮은 544.2 nM 및 929.1 nM에서 KM12 세포의 성장을 50% 억제하는 것을 확인하였다(도 13 참고).
As a result of screening for a drug capable of treating colon cancer by inhibiting the activity of the NTRK1 fusion gene or the NTRK1 fusion protein encoded by the NTRK1 fusion gene, lestauritinib, which has activity on the NTRK1 protein among the multiple tyrosine kinase inhibitors, Cell growth was inhibited by 50% and ARRY-470, a selective inhibitor of the NTRK1 protein, inhibited KM12 cell growth by 50% at about 3.2 nM. On the other hand, chrysotanib, which has low activity on the NTRK1 protein, inhibited the growth of KM12 cells by 50% at a concentration of about 184.8 nM. Compound 8 (5f in the present invention) and Compound 27 (6s in the present invention) disclosed in Patent Publication No. 10-2013-0106186 having high selectivity for the NTRK1 protein exhibited a relatively low KM12 cell at 544.2 nM and 929.1 nM due to low cell permeability (Fig. 13).

실시예Example 16: 한국인 대장암 임상조직에서 관찰된 16: observed in Korean colon cancer clinical tissue 인프레임Inflammation 융합유전자 Fusion gene

상기 차세대 전사체 염기서열분석을 통해 중간값 118.5 백만개의 얼라인된 리드는 생산하였고 100개의 염기 중 1개의 염기에서 오차가 발생할 수 있는 수준은 94.3%로 확인되었다 (도 14 참고). PCA 분석 결과 3개의 임상조직이 이상치로 확인되었고, 이상치 임상조직을 포함하는 대장암 및 정상 대장 임상조직 3쌍이 본 발명에서 제외되었다 (도 15 참고). 따라서 대장암 임상조직 147례 및 정상 대장 임상조직 47례가 본 발명에 사용되었다. GFP 또는 deFuse 또는 FusionMap 알고리즘을 이용한 융합유전자 분석 결과 9개의 인프레임 융합유전자가 확인되었다 (표 12 참고).
The next generation transcript sequencing analysis yielded a median 118.5 million aligned leads, and 94.3% of the 100 bases were found to be error-prone (see FIG. 14). PCA analysis showed that three clinical tissues were abnormal and three pairs of colon cancer and normal colon tissue containing abnormal tissue were excluded from the present invention (see FIG. 15). Thus, 147 cases of colon cancer and 47 cases of normal colon cancer were used in the present invention. Fusion genetic analysis using GFP or deFuse or FusionMap algorithms identified nine infiltrating fusion genes (see Table 12).

한국인 대장암 임상조직에서 관찰된 인프레임 융합유전자 Inflammation fusion gene observed in Korean colon cancer clinical tissue 시료IDSample ID StageStage Discordant paired-end readsDiscordant paired-end reads spanning readsspanning reads Distance
(bp)Distance
(bp) 공여 유전자
(Donor gene)Donor gene
(Donor gene) FPKMFPKM LocationLocation 수여 유전자
(Acceptor)Awarded gene
(Acceptor) FPKMFPKM LocationLocation LGP088TLGP088T ⅢⅢ 2727 153153 26209332620933 TPM3TPM3 5959 chr1:154142876chr1: 154142876 NTRK1NTRK1 3131 chr1:156844363chr1: 156844363 LGC012TLGC012T ⅢⅢ 2525 132132 26209332620933 TPM3TPM3 9494 chr1:154142876chr1: 154142876 NTRK1NTRK1 2020 chr1:156845312chr1: 156845312 TPM3TPM3 9494 chr1:154142876chr1: 154142876 NTRK1NTRK1 2020 chr1:156845872chr1: 156845872 LGC026TLGC026T ⅡⅡ 1111 1515 675664675664 LMNALMNA 145145 chr1:156105740chr1: 156105740 NTRK1NTRK1 3232 chr1:156845312chr1: 156845312 LGC007TLGC007T ⅢⅢ 2121 197197 771740771740 PTPRKPTPRK 1717 chr6:128841404chr6: 128841404 RSPO3RSPO3 3131 chr6:127469793chr6: 127469793 LGC054TLGC054T ⅢⅢ 3636 140140 771740771740 PTPRKPTPRK 1515 chr6:128505577chr6: 128505577 RSPO3RSPO3 3737 chr6:127469793chr6: 127469793 LGC030TLGC030T ⅢⅢ 5656 374374 26675102667510 NAGLUNAGLU 255255 chr17:40693224chr17: 40693224 IKZF3IKZF3 5555 chr17:37922746chr17: 37922746 LGP047TLGP047T ⅡⅡ 4545 206206 10201121020112 GTF3AGTF3A 641641 chr13:27999075chr13: 27999075 CDK8CDK8 9393 chr13:26923209chr13: 26923209 LGP024TLGP024T ⅡⅡ 2424 9898 5546355463 RAD51AP1RAD51AP1 77 chr12:4662255chr12: 4662255 AKAP3AKAP3 1818 chr12:4737971chr12: 4737971 LGP031TLGP031T ⅡⅡ 1717 5353 2882961528829615 RASA1RASA1 1111 chr5:86564807chr5: 86564807 LOC644100LOC644100 1919 chr5:115394422chr5: 115394422

확인된 융합유전자로는 타이로신 탈인산화효소, 수용체 타입, K(tyrosine phosphatase, receptor type, K: PTPRK) 유전자와 R 스폰딘 3(R-spondin 3: RSPO3) 유전자간의 융합유전자가 2례의 대장암 임상조직에서 확인되었고(1.4%), N-아세틸클루코사미니다제, 알파(N-acetylglucosaminidase, alpha: NAGLU) 유전자와 아키로스 페밀리 징크 핑거 3(IKAROS family zinc finger 3: IKZF3) 유전자간의 융합유전자가 1례의 대장암 임상조직에서 확인되었고(0.7%), 일반 전사인자 IIIA(general transcription factor IIIA: GTF3A) 유전자와 사이클린 의존적 키나아제 8(cyclin-dependent kinase 8: CDK8) 유전자간의 융합유전자가 1례의 대장암 임상조직에서 확인되었고(0.7%), RAD51 연관 단백질 1(RAD51 associated protein 1: RAD51AP1) 유전자와 A 키나아제 고정 단백질 3(A kinase anchor protein 3: AKAP3) 유전자간의 융합유전자가 1례의 대장암 임상조직에서 확인되었고(0.7%), RAS p21 활성 단백질 1(RAS p21 protein activator 1: RASA1) 유전자와 ADP 리보실레이션 유사인자 14 유사 단백질(ADP-ribosylation factor-like 14 effector protein-like: LOC644100) 유전자간의 융합유전자가 1례의 대장암 임상조직에서 확인되었고(0.7%), 라민 A/C(lamin A/C: LMNA) 또는 트로포미오신 3(tropomysosin 3: TPM3) 유전자와 신경영양성 타이로신 키나아제, 수용체, 타입 1(neurotrophic tyrosine kinase, receptor, type 1: NTRK1) 유전자간의 융합유전자가 3례의 대장암 임상조직에서 확인되었다(2.0%). 상기의 융합유전자 중 NTRK1 유전자는 세포막에 고정되어 있는 TrkA 단백질을 암호화하는 유전자로 대장암 및 갑상선 유두암에서 TPM3-NTRK1 융합유전자가 보고되었고, 폐선암에서는 미오신 탈인산화효소 Rho 상호작용 단백질(myosin phosphatase Rho interacting protein: MPRIP) 유전자 또는 CD74 유전자와 NTRK1 유전자간의 융합유전자가 보고되었고, 스피트조이드(Spitzoid) 흑색종에서는 TP53 유전자 또는 LMNA 유전자와 NTRK1 유전자간의 융합유전자가 보고되었고, 담낭암에서는 RAB 지티파아제 유사 활성 단백질 1(RAB GTPase activating protein 1-like: RABGAP1L) 유전자와 NTRK1 유전자간의 융합유전자가 보고되었고, 다형성고아종에서는 뉴로파신(neurofascin: NFASC) 유전자 또는 브레비칸(brevica: BCAN) 유전자와 NTRK1 유전자간의 융합유전자가 보고되었다. 그러나 대장암에서 NTRK1 융합유전자의 빈도 및 임상적 의의는 규명되지 않았다.
The fusion genes identified include tyrosine dephosphorylase, receptor type, K (tyrosine phosphatase, receptor type, K: PTPRK) gene and R-spondin 3 (RSPO3) (1.4%), and a fusion gene between the N-acetylglucosaminidase (alpha: NAGLU) gene and the IKAROS family zinc finger 3 (IKZF3) gene One case of colon cancer was confirmed in clinical tissue (0.7%), and the fusion gene between the general transcription factor IIIA (GTF3A) gene and the cyclin-dependent kinase 8 (CDK8) (0.7%), a fusion gene between the RAD51 associated protein 1 (RAD51AP1) gene and the A kinase anchor protein 3 (AKAP3) gene was found in one case of colorectal cancer In clinical tissue (RAS p21 protein activator 1: RASA1) and the ADP-ribosylation factor-like 14 effector protein-like (LOC644100) gene The gene was identified in one case of colorectal cancer (0.7%), and the gene for lamin A / C (LMNA) or tropomyosin 3 (TPM3) and neurotrophic tyrosine kinase, receptor type 1 (neurotrophic tyrosine kinase, receptor, type 1: NTRK1) gene was identified in three colon cancer clinical tissues (2.0%). Among the fusion genes, the NTRK1 gene encodes the TrkA protein immobilized on the cell membrane. TPM3-NTRK1 fusion gene has been reported in colorectal cancer and thyroid papillary cancer. Myosin phosphatase Rho interaction protein interacting protein (MPRIP) gene or a fusion gene between CD74 gene and NTRK1 gene has been reported. In Spitzoid melanoma, a fusion gene between TP53 gene or LMNA gene and NTRK1 gene has been reported. In gallbladder cancer, RAB A fusion gene between protein 1 (RAB GTPase activating protein 1-like: RABGAP1L) gene and NTRK1 gene has been reported. In the polymorphic orphan species, fusion of neurofascin (NFASC) gene or brevica Genes were reported. However, the frequency and clinical significance of NTRK1 fusion gene in colorectal cancer has not been clarified.

실시예Example 17: 17: NTRK1NTRK1 유전자, gene, NTRK1NTRK1 융합유전자 및 이의 단백질 분석 Fusion gene and its protein analysis

NTRK1 유전자 엑손의 발현은 NTRK1 융합유전자가 확인된 대장암 임상조직에서만 관찰되었다(도 1 참고). 역전사연쇄중합반응 및 생어시퀀싱을 이용하여 NTRK1 융합전사산물을 분석한 결과, TPM3-NTRK1 융합유전자는 TMP3 유전자 중 chr1: 154142876 위치에서 절단되어 서열번호 2를 보존하고, 구체적으로 서열번호 3을 5' 말단방향으로 보존하고, NTRK1 유전자 중 chr1:156844363 또는 chr1:156845312 또는 chr1:156845872 위치에서 절단되어 서열번호 8 또는 서열번호 10 또는 서열번호 12를 보존하고, 구체적으로 서열번호 9 또는 서열번호 11 또는 서열번호 13을 3' 말단방향으로 보존한 형태로 융합되었음이 확인되었다. LMNA-NTRK1 유전자는 LMNA 유전자 중 chr1:156105740 위치에서 절단되어 서얼변호 28을 보존하고, 구체적으로 서열번호 29를 5' 말단방향으로 보존하고, NTRK1 유전자 중 chr1:156845312 위치에서 절단되어 서열번호 10을 보존하고, 구체적으로 서열번호 11을 3' 말단방향으로 보존한 형태로 융합되었음이 확인되었다 (도 2 참고).
Expression of the NTRK1 gene exon was observed only in the colon cancer clinical tissue in which the NTRK1 fusion gene was identified (see FIG. 1). The fusion gene of NTRK1 was analyzed using reverse transcription polymerase chain reaction and seaweed sequencing. As a result, the TPM3-NTRK1 fusion gene was cleaved at the position of chr1: 154142876 among the TMP3 gene to preserve SEQ ID NO: 2, specifically SEQ ID NO: Terminus and conserved in SEQ ID NO: 8 or SEQ ID NO: 10 or SEQ ID NO: 12 in the NTRK1 gene at the position of chr1: 156844363 or chr1: 156845312 or chr1: 156845872 and specifically SEQ ID NO: 9 or SEQ ID NO: 11 or SEQ ID NO: It was confirmed that Fusion 13 was conserved in the 3 'terminal direction. The LMNA-NTRK1 gene was cleaved at the chr1: 156105740 position of the LMNA gene to preserve the meristematic 28, specifically, the sequence SEQ ID NO: 29 was conserved at the 5'-terminal direction, and the truncated at the chr1: 156845312 position of the NTRK1 gene, (SEQ ID NO: 11) was conserved in the 3 'terminal direction (see FIG. 2).

NTRK1 단백질에 대한 항체, 구체적으로 NTRK1에 의한 암호화되는 NTRK1 단백질, 구체적으로 TrkA 단백질의 C말단 부위에 결합하는 항체를 이용한 면역조직학 분석 결과, NTRK1 융합전사산물이 확인된 대장암 임상조직에서 TRKA 발현이 확인되었다(도 3 참고). NTRK1 융합유전자는 NTRK1 유전자에 대한 형광직접접합 분석으로 재확인하였다. NTRK1 유전자의 5' 말단은 TexRed가 표지된 프로브로 접합하였고, NTRK1 유전자의 3' 말단은 FITC가 표지된 프로브로 접합한 결과, NTRK1 융합전사산물 및 융합단백질이 확인된 대장암 임상조직에서 분리된 적색과 녹색의 프로브 신호가 확인되었고(도 4 참고), 정상 대장 임상조직에서는 합쳐진 적색과 녹색의 프로브 신호가 확인되었다(도 5 참고). NTRK1 융합유전자 및 융합단백질을 추정한 결과, NTRK1 융합단백질은 NTRK1 단백질 중 티로신키아나제 도메인이 잘 보존되어 있는 것이 확인되었다(도 6 참고).
Immunohistochemical analysis of antibody against NTRK1 protein, specifically NTRK1 protein encoded by NTRK1, specifically antibody binding to the C-terminal region of TrkA protein revealed that TRKA expression in clinical colon cancer tissues confirmed NTRK1 fusion transcription product (See FIG. 3). The NTRK1 fusion gene was reaffirmed by fluorescence direct conjugation assay for the NTRK1 gene. The 5 'end of the NTRK1 gene was conjugated with a TexRed-labeled probe, and the 3' end of the NTRK1 gene was ligated with a FITC-labeled probe. As a result, NTRK1 fusion gene product and fusion protein Red and green probe signals were identified (see FIG. 4), and red and green probe signals were combined in normal colon tissue (see FIG. 5). As a result of the NTRK1 fusion gene and the fusion protein, it was confirmed that the NTRK1 fusion protein has a well-conserved tyrosine kinase domain in the NTRK1 protein (see FIG. 6).

실시예Example 18: 18: NTRK1NTRK1 융합유전자의 Of the fusion gene 종양성Bred 확인 Confirm

NTRK1 융합유전자의 종양성은 갑상선 유두암 및 폐선암에서 보고되었다. 그러나 대장암에서 NTRK1 융합유전자의 종양성은 충분히 검증되지 않았다. TPM3-NTRK1 융합유전자 및 LMNA-NTRK1 융합유전자의 종양성을 확인하고자 인실리코(in silico) 및 체외 및 체내 평가가 실시되었다. 체세포 돌연변이 분석 결과를 기반으로 NTRK1 융합유전자의 종양성을 인실리코 방법으로 분석한 결과, NTRK1 융합유전자가 확인된 대장암 임상조직에서는 대표적인 대장암 유전자의 체세포 돌연변이가 관찰되지 않았다(도 7 참고). 이를 통하여 NTRK1 융합유전자가 대장암의 원인 유전자(driver oncogene)로서 작용하는 것을 확인하였다. NTRK1 융합유전자의 강제발현을 통해 형질전환된 NIH3T3 세포는 체외에서 KM12 대장암 세포주와 유사한 수준의 세포괴를 형성하였다. 이와 대조적으로 NTRK1 융합유전자가 없는 공벡터를 도입한 NIH3T3 세포주는 세포괴를 형성하지 않았다(도 8 참고). 형질전환된 NIH3T3 세포주를 면역능이 결핍된 누드마우스에 접종한, NTRK1 융합유전자가 도입된 NIH3T3 세포주는 세포 접종 후 18일째부터 측정이 가능한 종양이 형성되었고, 세포 접종 후 32일째에는 KM12 대장암 세포주와 유사한 수준의 종양으로 발달하였다(도 9 참고).
Tumorigenicity of the NTRK1 fusion gene has been reported in papillary thyroid carcinoma and lung cancer. However, the tumorigenicity of the NTRK1 fusion gene in colorectal cancer has not been adequately verified. In silico and in vitro and in vivo evaluations were carried out to confirm the heterogeneity of TPM3-NTRK1 fusion gene and LMNA-NTRK1 fusion gene. Based on the results of the somatic cell mutation analysis, the tumorigenicity of the NTRK1 fusion gene was analyzed by the in silico method. As a result, somatic mutation of the typical colon cancer gene was not observed in the colon cancer clinical tissue in which the NTRK1 fusion gene was identified (see FIG. 7). It was confirmed that the NTRK1 fusion gene acts as a driver oncogene in colon cancer. NIH3T3 cells transformed through forced expression of the NTRK1 fusion gene formed cell masses similar to those of the KM12 colon cancer cell line in vitro. In contrast, the NIH3T3 cell line transfected with a blank vector lacking the NTRK1 fusion gene did not form cell mass (see FIG. 8). The NIH3T3 cell line transfected with transformed NIH3T3 cells in nude mice lacking immunity was able to measure from day 18 after cell inoculation and the KM12 colon cancer cell line at day 32 after cell inoculation Developed to a similar level of tumor (see FIG. 9).

실시예Example 19: 19: NTRK1NTRK1 단백질 발현과 Protein expression and NTRK1NTRK1 융합유전자의 상관성 Correlation of fusion genes

NTRK1 단백질은 NTRK1 유전자 융합에 의존적으로 발현한다는 점에 착안하여, 216명의 한국인 대장암 환자 코호트 및 472명의 중국인 대장암 환자 코호트로부터 구축한 조직마이크로어레이와 NTRK1 단백질, 구체적으로 NTRK1 유전자에 의해 암호화된 TrkA 단백질의 C말단 부위에 결합하는 항체를 이용하여 NTRK1 융합유전자의 빈도를 관찰한 결과, 29.2%의 한국인 대장암 환자 유래의 대장암 임상조직에서 NTRK1 단백질이 발현하는 것이 확인되었고, 25.6%의 중국인 대장암 환자 유래의 대장암 임상조직에서 NTRK1 단백질이 발현하는 것이 확인되었다 (도 10 참고). NTRK1 단백질 발현과 NTRK1 융합유전자의 상관성을 확인하고자, 한국인 대장암 환자 코호트로부터 NTRK1 단백질 발현이 확인된 한국인 대장암 환자의 임상조직에 NTRK1 유전자에 대한 형광직접접합을 실시하였다. 분리된 NTRK1 형광직접접합 신호의 빈도는 NTRK1 단백질의 발현이 확인된 임상조직에서 관찰되었다 (p < 0.0192) (도 11 및 표 13 참고). 따라서, NTRK1 융합유전자는 NTRK1 단백질 발현의 일부, 구체적으로 42.9%의 원인 유전자임이 확인되었다.
In view of the fact that the NTRK1 protein expresses dependent on the NTRK1 gene fusion, a tissue microarray constructed from 216 Korean colon cancer patient cohorts and 472 Chinese colorectal cancer patient cohorts, and NTRK1 protein, specifically TrkA As a result of observing the frequency of the NTRK1 fusion gene using an antibody that binds to the C-terminal region of the protein, NTRK1 protein expression was confirmed in 29.2% of colon cancer tissue derived from Korean patients with colon cancer, and 25.6% It was confirmed that NTRK1 protein is expressed in colon cancer-derived clinical tissues derived from cancer patients (see FIG. 10). To confirm the correlation between NTRK1 protein expression and NTRK1 fusion gene, fluorescent direct conjugation of NTRK1 gene was performed on the clinical tissues of Korean patients with colorectal cancer whose NTRK1 protein expression was confirmed from a Korean colorectal cancer patient cohort. The frequency of isolated NTRK1 fluorescent direct junction signals was observed in clinical tissues in which the expression of NTRK1 protein was confirmed (p < 0.0192) (see Fig. 11 and Table 13). Thus, the NTRK1 fusion gene was confirmed to be a part of NTRK1 protein expression, specifically 42.9% of the causative genes.

NTRK1 단백질 발현과 NTRK1 융합유전자의 상관성Relationship between NTRK1 protein expression and NTRK1 fusion gene Sample IDSample ID Cytoplasmic TrkA expressionCytoplasmic TrkA expression NTRK1 fusionNTRK1 fusion P valueP value IHCIHC FISHFISH Colon_50_FISH01Colon_50_FISH01 StrongStrong Frequently detectedFrequently detected 0.01920.0192 Colon_50_FISH02Colon_50_FISH02 StrongStrong Not detectedNot detected Colon_50_FISH03Colon_50_FISH03 StrongStrong Not availableNot available Colon_50_FISH04Colon_50_FISH04 StrongStrong Frequently detectedFrequently detected Colon_50_FISH05Colon_50_FISH05 StrongStrong Not detectedNot detected Colon_50_FISH06Colon_50_FISH06 ModerateModerate Frequently detectedFrequently detected Colon_50_FISH07Colon_50_FISH07 ModerateModerate Not detectedNot detected Colon_50_FISH08Colon_50_FISH08 MildMild Not detectedNot detected Colon_50_FISH09Colon_50_FISH09 MildMild Not detectedNot detected Colon_50_FISH10Colon_50_FISH10 MildMild Not detectedNot detected Colon_50_FISH11Colon_50_FISH11 NegativeNegative Not detectedNot detected Colon_50_FISH12Colon_50_FISH12 NegativeNegative Not detectedNot detected Colon_50_FISH13Colon_50_FISH13 NegativeNegative Not detectedNot detected Colon_50_FISH14Colon_50_FISH14 NegativeNegative Not detectedNot detected Colon_50_FISH15Colon_50_FISH15 NegativeNegative Not detectedNot detected

실시예Example 20: 20: NTRK1NTRK1 융합단백질과Fusion protein 대장암 환자군 생존율 Survival rate of colorectal cancer patients

상기 임상 시료를 받은 대장암 환자 중에 추적이 가능한 216명의 한국인 대장암 환자에 대해 코호트 분석을 진행하였다. 216명의 한국인 대장암 환자에 대한 10년 생존율 추적 중 42명의 환자(24.1%)가 추적기간 동안 사망하였다. A cohort analysis of 216 colon cancer patients who were followed up among the colon cancer patients who received the clinical sample was conducted. Forty - two patients (24.1%) died during the follow - up period in the follow - up 10 year survival rate for 216 Korean patients with colorectal cancer.

먼저, 임상 시료를 분석한 결과에 따라 전체 216명의 한국인 대장암 환자는 NTRK1 단백질의 발현에 따라 NTRK1 융합단백질 양성 환자군과 NTRK1 융합단백질 음성 환자군으로 계층화하였다. 계층화된 대장암 환자군 간의 임상병리학적 분석 결과, NTRK1 융합유전자 또는 이에 암호화된 NTRK1 융합단백질이 확인된 대장암 환자군은 좌측 대장에 대장암이 위치하고(p = 0.0504), 대장암의 침윤도가 높으며(p = 0.0437), 양성의 신경주위침윤 또는 림프혈관색전이가 관찰되고(p = 0.0429), 임파절 전이가 높고(p = 0.015), 현미부수체가 안정한 것으로(p = 0.0024) 확인되었다 (표 14 참고).
First, according to the results of clinical sample analysis, 216 patients with colorectal cancer were stratified into NTRK1 fusion protein positive patients and NTRK1 fusion protein negative patients according to the expression of NTRK1 protein. Clinical pathologic analysis of patients with stratified colon cancer revealed that the NTRK1 fusion gene or the NTRK1 fusion protein encoded in the colon cancer patient group had colon cancer in the left colon (p = 0.0504), high colon cancer invasion (p (P = 0.0437), positive neural invasion or lymphovascular embolus was observed (p = 0.0429), lymph node metastasis was high (p = 0.015), and brownish anomalies were stable (p = 0.0024) .

대장암 환자(한국인) 코호트의 임상병리적 특징Clinical pathologic features of colorectal cancer patients (Korean) cohort NN Cytoplasmic TrkACytoplasmic TrkA P valueP value (-)(-) (+)(+) 나이 (years)Age (years) 216216 70.3570.35 70.2270.22 0.46750.4675 크기 (cm)Size (cm) 216216 5.1995.199 5.5485.548 0.29590.2959 성별gender 0.33510.3351 MaleMale 118118 8585 3333 FemaleFemale 9898 6868 3030 위치location 0.05040.0504 Right colonRight colon 8787 6767 2020 Left colonLeft colon 129129 8686 4343 Histological typeHistological type 0.07610.0761 wellwell 1616 1313 33 ModerateModerate 178178 120120 5858 PoorPoor 1010 1010 00 MucinousMucinous 1212 1010 22 Invasion depth
(T1-2 vs. T3)Invasion depth
(T1-2 vs. T3) 0.04370.0437 T1T1 1212 1010 22 T2T2 2323 1919 44 T3T3 181181 124124 5757 Perineural invation
(PNI)Perineural invasion
(PNI) 0.13150.1315 NegativeNegative 149149 109109 4040 PositivePositive 6767 4444 2323 Lymphovascular emboli (LVE)Lymphovascular emboli (LVE) 0.22290.2229 NegativeNegative 142142 103103 3939 PositivePositive 7474 5050 2424 PNI/LVEPNI / LVE 0.04290.0429 NegativeNegative 119119 9090 2929 PositivePositive 9797 6363 3434 Lymphnode metastasis
(N0 vs N1-2)Lymphnode metastasis
(N0 vs N1-2) 0.0150.015 N0N0 124124 9595 2929 N1aN1a 4040 2323 1717 N1bN1b 3232 2222 1010 N2aN2a 1313 99 44 N2bN2b 77 44 33 Microsatellite statusMicrosatellite status 0.00240.0024 MSSMSS 180180 119119 6161 MSI-LMSI-L 66 55 1One MSI-HMSI-H 3030 2929 1One

상기 대장암 계층군의 10년 생존율을 분석한 결과, NTRK1 융합단백질이 확인된 대장암 환자군은 NTRK1 융합단백질이 확인되지 않은 대장암 환자군보다 유의적으로 낮은 생존율이 관찰되었다 (p = 0.0411, 위험도 = 0.5346, 95% 신뢰구간 = 0.2548~0.9722)(도 12 참고).
The 10-year survival rate of the colon cancer group was significantly lower than that of the NTRK1 fusion protein-free colon cancer patients (p = 0.0411, risk = 0.5346, 95% confidence interval = 0.2548 to 0.9722) (see FIG. 12).

실시예Example 21: 결론 21: Conclusion

이상 설명한 바와 같이, 본 발명에 의하면, TPM3-NTRK1 융합유전자 또는 이에 암호화된 융합전사산물 및 융합단백질 또는 NTRK1 유전자의 융합에 의존적인 NTRK1 유전자의 전사산물 및 단백질의 검출에 의하여 대장암 환자의 예후를 진단하고, NTRK1 단백질 저해제에 의한 대장암 치료의 유효성을 예측하는 것이 가능해진다.
As described above, according to the present invention, the prognosis of patients with colorectal cancer can be evaluated by detecting transcription products and proteins of the NTRK1 gene, which are dependent on the fusion of the TPM3-NTRK1 fusion gene or the fusion fusion protein and the fusion protein or NTRK1 gene encoded thereby And to predict the efficacy of treatment of colon cancer with the NTRK1 protein inhibitor.

NTRK1 단백질에 대한 저해 효과를 가지는 저해제는 이미 여러 암종의 치료를 위한 임상연구에 도입되어 있다. 예를 들면 레스타우르티닙은 전립선암, 신경아세포종, 골수성 백혈병에 대한 치료효과를 연구 중에 있고, 다누세르팁은 전립선암, 비소세포폐암, 만성 골수성 백혈병에 대한 치료효과를 연구 중에 있고, 미르시클립은 갑상선암 및 기타 고형암에 대한 치료효과를 연구 중에 있고, RXDA-101은 기타의 고형암에, TSR-011은 비소세포폐암에, NMS-P626은 대장암에, RXDX-102는 기타의 암종에 대한 치료효과를 연구 중에 있다.
Inhibitors that have an inhibitory effect on the NTRK1 protein have already been introduced in clinical studies for the treatment of various carcinomas. For example, lestaurtinib is studying the therapeutic effects of prostate cancer, neuroblastoma, and myeloid leukemia, while danucer tips are studying the therapeutic effects of prostate cancer, non-small cell lung cancer and chronic myelogenous leukemia, RXDX-102 is a treatment for other carcinomas, while TSR-011 for non-small cell lung cancer, NMS-P626 for colorectal cancer, and RXDX-102 for other carcinomas The effects are under study.

이와 같이, NTRK1 유전자의 융합 또는 이에 의존적인 NTRK1 유전자의 전사산물 및 단백질의 발현은 복수의 암종에서 생겨 있다. NTRK1 융합유전자는, KRAS, NRAS, PIK3CA 등 대장암 유전자의 돌연변이로부터 괴리하고 있다. 따라서, NTRK1 융합유전자는, 기존의 NTRK1 단백질 저해제, 구체적으로 NTRK1 티로신키나아제에 대한 저해능을 갖는 티로신티나아제 저해제의 표적이 될수 있다. 또한, NTRK1 융합유전자의 존재는 한국인을 포함한 중국인 대장암 환자에 있어서도 인정된다. 따라서, 본 발명의 방법은, 한국인 및 중국인 대장암 환자에 있어서, 대장암 치료의 효율을 높이는 동시에, 매우 유용하다.
Thus, the fusion of the NTRK1 gene or the expression of the transcription product and protein of the NTRK1 gene dependent thereon occurs in multiple carcinomas. The NTRK1 fusion gene is diverging from the mutation of colon cancer genes such as KRAS, NRAS and PIK3CA. Thus, the NTRK1 fusion gene can be a target of a conventional NTRK1 protein inhibitor, specifically a tyrosine kinase inhibitor having an inhibitory effect on NTRK1 tyrosine kinase. The presence of the NTRK1 fusion gene is also recognized in Chinese colorectal cancer patients, including Koreans. Therefore, the method of the present invention is very useful in increasing the efficiency of colorectal cancer treatment in Korean and Chinese patients with colorectal cancer.

이상의 설명으로부터, 본 발명이 속하는 기술분야의 당업자는 본 발명이 그 기술적 사상이나 필수적 특징을 변경하지 않고서 다른 구체적인 형태로 실시될 수 있다는 것을 이해할 수 있을 것이다. 이와 관련하여, 이상에서 기술한 실시 예들은 모든 면에서 예시적인 것이며 한정적인 것이 아닌 것으로서 이해해야만 한다. 본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허 청구범위의 의미 및 범위 그리고 그 등가 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.From the above description, it will be understood by those skilled in the art that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. In this regard, it should be understood that the above-described embodiments are to be considered in all respects as illustrative and not restrictive. The scope of the present invention should be construed as being included in the scope of the present invention without departing from the scope of the present invention as defined by the appended claims.

<110> LG Life Sciences Ltd. <120> Novel NTRK1 fusion gene as colon cancer marker and the uses thereof <130> KPA140037-KR-P1 <150> KR 10-2014-0004491 <151> 2014-01-14 <160> 38 <170> KopatentIn 2.0 <210> 1 <211> 858 <212> DNA <213> TPM3 cds <400> 1 atgatggagg ccatcaagaa aaagatgcag atgctgaagt tagacaagga gaatgctctg 60 gatcgggcag agcaagctga agctgagcag aagcaggcag aagaaagaag taaacagctg 120 gaggatgagc tggcagccat gcagaagaag ctgaaaggga cagaggatga gctggacaag 180 tattctgaag ctttgaagga tgcccaggag aagctggaac tggcagagaa gaaggctgct 240 gatgctgagg ctgaggtggc ctccttgaac cgtaggatcc agctggttga agaagagctg 300 gaccgtgctc aggagcgcct ggccactgcc ctgcaaaagc tggaagaagc tgaaaaagct 360 gctgatgaga gtgagagagg tatgaaggtt attgaaaacc gggccttaaa agatgaagaa 420 aagatggaac tccaggaaat ccaactcaaa gaagctaagc acattgcaga agaggcagat 480 aggaagtatg aagaggtggc tcgtaagttg gtgatcattg aaggagactt ggaacgcaca 540 gaggaacgag ctgagctggc agagtctaag tgttctgagc tggaggagga gctgaagaat 600 gtcaccaaca acctcaagtc tcttgaggct caggcggaga agtactctca aaaagaagat 660 aaatatgagg aagaaatcaa gattcttact gataaactca aggaggcaga gacccgtgct 720 gagtttgctg agagatcggt agccaagctg gaaaagacaa ttgatgacct ggaagatgag 780 ctctatgccc agaaactgaa gtacaaggcc attagcgagg agctggacca cgccctcaat 840 gacatgacct ctatataa 858 <210> 2 <211> 775 <212> DNA <213> TPM3 fragment cDNA <400> 2 atgatggagg ccatcaagaa aaagatgcag atgctgaagt tagacaagga gaatgctctg 60 gatcgggcag agcaagctga agctgagcag aagcaggcag aagaaagaag taaacagctg 120 gaggatgagc tggcagccat gcagaagaag ctgaaaggga cagaggatga gctggacaag 180 tattctgaag ctttgaagga tgcccaggag aagctggaac tggcagagaa gaaggctgct 240 gatgctgagg ctgaggtggc ctccttgaac cgtaggatcc agctggttga agaagagctg 300 gaccgtgctc aggagcgcct ggccactgcc ctgcaaaagc tggaagaagc tgaaaaagct 360 gctgatgaga gtgagagagg tatgaaggtt attgaaaacc gggccttaaa agatgaagaa 420 aagatggaac tccaggaaat ccaactcaaa gaagctaagc acattgcaga agaggcagat 480 aggaagtatg aagaggtggc tcgtaagttg gtgatcattg aaggagactt ggaacgcaca 540 gaggaacgag ctgagctggc agagtctaag tgttctgagc tggaggagga gctgaagaat 600 gtcaccaaca acctcaagtc tcttgaggct caggcggaga agtactctca aaaagaagat 660 aaatatgagg aagaaatcaa gattcttact gataaactca aggaggcaga gacccgtgct 720 gagtttgctg agagatcggt agccaagctg gaaaagacaa ttgatgacct ggaag 775 <210> 3 <211> 16 <212> DNA <213> TPM3 5'terminal cleavage site seq <400> 3 attgatgacc tggaag 16 <210> 4 <211> 285 <212> PRT <213> TPM3 full length <400> 4 Met Met Glu Ala Ile Lys Lys Lys Met Gln Met Leu Lys Leu Asp Lys 1 5 10 15 Glu Asn Ala Leu Asp Arg Ala Glu Gln Ala Glu Ala Glu Gln Lys Gln 20 25 30 Ala Glu Glu Arg Ser Lys Gln Leu Glu Asp Glu Leu Ala Ala Met Gln 35 40 45 Lys Lys Leu Lys Gly Thr Glu Asp Glu Leu Asp Lys Tyr Ser Glu Ala 50 55 60 Leu Lys Asp Ala Gln Glu Lys Leu Glu Leu Ala Glu Lys Lys Ala Ala 65 70 75 80 Asp Ala Glu Ala Glu Val Ala Ser Leu Asn Arg Arg Ile Gln Leu Val 85 90 95 Glu Glu Glu Leu Asp Arg Ala Gln Glu Arg Leu Ala Thr Ala Leu Gln 100 105 110 Lys Leu Glu Glu Ala Glu Lys Ala Ala Asp Glu Ser Glu Arg Gly Met 115 120 125 Lys Val Ile Glu Asn Arg Ala Leu Lys Asp Glu Glu Lys Met Glu Leu 130 135 140 Gln Glu Ile Gln Leu Lys Glu Ala Lys His Ile Ala Glu Glu Ala Asp 145 150 155 160 Arg Lys Tyr Glu Glu Val Ala Arg Lys Leu Val Ile Ile Glu Gly Asp 165 170 175 Leu Glu Arg Thr Glu Glu Arg Ala Glu Leu Ala Glu Ser Lys Cys Ser 180 185 190 Glu Leu Glu Glu Glu Leu Lys Asn Val Thr Asn Asn Leu Lys Ser Leu 195 200 205 Glu Ala Gln Ala Glu Lys Tyr Ser Gln Lys Glu Asp Lys Tyr Glu Glu 210 215 220 Glu Ile Lys Ile Leu Thr Asp Lys Leu Lys Glu Ala Glu Thr Arg Ala 225 230 235 240 Glu Phe Ala Glu Arg Ser Val Ala Lys Leu Glu Lys Thr Ile Asp Asp 245 250 255 Leu Glu Asp Glu Leu Tyr Ala Gln Lys Leu Lys Tyr Lys Ala Ile Ser 260 265 270 Glu Glu Leu Asp His Ala Leu Asn Asp Met Thr Ser Ile 275 280 285 <210> 5 <211> 258 <212> PRT <213> TPM3 fragment <400> 5 Met Met Glu Ala Ile Lys Lys Lys Met Gln Met Leu Lys Leu Asp Lys 1 5 10 15 Glu Asn Ala Leu Asp Arg Ala Glu Gln Ala Glu Ala Glu Gln Lys Gln 20 25 30 Ala Glu Glu Arg Ser Lys Gln Leu Glu Asp Glu Leu Ala Ala Met Gln 35 40 45 Lys Lys Leu Lys Gly Thr Glu Asp Glu Leu Asp Lys Tyr Ser Glu Ala 50 55 60 Leu Lys Asp Ala Gln Glu Lys Leu Glu Leu Ala Glu Lys Lys Ala Ala 65 70 75 80 Asp Ala Glu Ala Glu Val Ala Ser Leu Asn Arg Arg Ile Gln Leu Val 85 90 95 Glu Glu Glu Leu Asp Arg Ala Gln Glu Arg Leu Ala Thr Ala Leu Gln 100 105 110 Lys Leu Glu Glu Ala Glu Lys Ala Ala Asp Glu Ser Glu Arg Gly Met 115 120 125 Lys Val Ile Glu Asn Arg Ala Leu Lys Asp Glu Glu Lys Met Glu Leu 130 135 140 Gln Glu Ile Gln Leu Lys Glu Ala Lys His Ile Ala Glu Glu Ala Asp 145 150 155 160 Arg Lys Tyr Glu Glu Val Ala Arg Lys Leu Val Ile Ile Glu Gly Asp 165 170 175 Leu Glu Arg Thr Glu Glu Arg Ala Glu Leu Ala Glu Ser Lys Cys Ser 180 185 190 Glu Leu Glu Glu Glu Leu Lys Asn Val Thr Asn Asn Leu Lys Ser Leu 195 200 205 Glu Ala Gln Ala Glu Lys Tyr Ser Gln Lys Glu Asp Lys Tyr Glu Glu 210 215 220 Glu Ile Lys Ile Leu Thr Asp Lys Leu Lys Glu Ala Glu Thr Arg Ala 225 230 235 240 Glu Phe Ala Glu Arg Ser Val Ala Lys Leu Glu Lys Thr Ile Asp Asp 245 250 255 Leu Glu <210> 6 <211> 5 <212> PRT <213> TPM3 N terminus cleavage site seq <400> 6 Ile Asp Asp Leu Glu 1 5 <210> 7 <211> 2391 <212> DNA <213> NTRK1 CDS <400> 7 atgctgcgag gcggacggcg cgggcagctt ggctggcaca gctgggctgc ggggccgggc 60 agcctgctgg cttggctgat actggcatct gcgggcgccg caccctgccc cgatgcctgc 120 tgcccccacg gctcctcggg actgcgatgc acccgggatg gggccctgga tagcctccac 180 cacctgcccg gcgcagagaa cctgactgag ctctacatcg agaaccagca gcatctgcag 240 catctggagc tccgtgatct gaggggcctg ggggagctga gaaacctcac catcgtgaag 300 agtggtctcc gtttcgtggc gccagatgcc ttccatttca ctcctcggct cagtcgcctg 360 aatctctcct tcaacgctct ggagtctctc tcctggaaaa ctgtgcaggg cctctcctta 420 caggaactgg tcctgtcggg gaaccctctg cactgttctt gtgccctgcg ctggctacag 480 cgctgggagg aggagggact gggcggagtg cctgaacaga agctgcagtg tcatgggcaa 540 gggcccctgg cccacatgcc caatgccagc tgtggtgtgc ccacgctgaa ggtccaggtg 600 cccaatgcct cggtggatgt gggggacgac gtgctgctgc ggtgccaggt ggaggggcgg 660 ggcctggagc aggccggctg gatcctcaca gagctggagc agtcagccac ggtgatgaaa 720 tctgggggtc tgccatccct ggggctgacc ctggccaatg tcaccagtga cctcaacagg 780 aagaacgtga cgtgctgggc agagaacgat gtgggccggg cagaggtctc tgttcaggtc 840 aacgtctcct tcccggccag tgtgcagctg cacacggcgg tggagatgca ccactggtgc 900 atccccttct ctgtggatgg gcagccggca ccgtctctgc gctggctctt caatggctcc 960 gtgctcaatg agaccagctt catcttcact gagttcctgg agccggcagc caatgagacc 1020 gtgcggcacg ggtgtctgcg cctcaaccag cccacccacg tcaacaacgg caactacacg 1080 ctgctggctg ccaacccctt cggccaggcc tccgcctcca tcatggctgc cttcatggac 1140 aaccctttcg agttcaaccc cgaggacccc atccctgtct ccttctcgcc ggtggacact 1200 aacagcacat ctggagaccc ggtggagaag aaggacgaaa caccttttgg ggtctcggtg 1260 gctgtgggcc tggccgtctt tgcctgcctc ttcctttcta cgctgctcct tgtgctcaac 1320 aaatgtggac ggagaaacaa gtttgggatc aaccgcccgg ctgtgctggc tccagaggat 1380 gggctggcca tgtccctgca tttcatgaca ttgggtggca gctccctgtc ccccaccgag 1440 ggcaaaggct ctgggctcca aggccacatc atcgagaacc cacaatactt cagtgatgcc 1500 tgtgttcacc acatcaagcg ccgggacatc gtgctcaagt gggagctggg ggagggcgcc 1560 tttgggaagg tcttccttgc tgagtgccac aacctcctgc ctgagcagga caagatgctg 1620 gtggctgtca aggcactgaa ggaggcgtcc gagagtgctc ggcaggactt ccagcgtgag 1680 gctgagctgc tcaccatgct gcagcaccag cacatcgtgc gcttcttcgg cgtctgcacc 1740 gagggccgcc ccctgctcat ggtctttgag tatatgcggc acggggacct caaccgcttc 1800 ctccgatccc atggacctga tgccaagctg ctggctggtg gggaggatgt ggctccaggc 1860 cccctgggtc tggggcagct gctggccgtg gctagccagg tcgctgcggg gatggtgtac 1920 ctggcgggtc tgcattttgt gcaccgggac ctggccacac gcaactgtct agtgggccag 1980 ggactggtgg tcaagattgg tgattttggc atgagcaggg atatctacag caccgactat 2040 taccgtgtgg gaggccgcac catgctgccc attcgctgga tgccgcccga gagcatcctg 2100 taccgtaagt tcaccaccga gagcgacgtg tggagcttcg gcgtggtgct ctgggagatc 2160 ttcacctacg gcaagcagcc ctggtaccag ctctccaaca cggaggcaat cgactgcatc 2220 acgcagggac gtgagttgga gcggccacgt gcctgcccac cagaggtcta cgccatcatg 2280 cggggctgct ggcagcggga gccccagcaa cgccacagca tcaaggatgt gcacgcccgg 2340 ctgcaagccc tggcccaggc acctcctgtc tacctggatg tcctgggcta g 2391 <210> 8 <211> 1196 <212> DNA <213> NTRK1 fragment cDNA <400> 8 acactaacag cacatctgga gacccggtgg agaagaagga cgaaacacct tttggggtct 60 cggtggctgt gggcctggcc gtctttgcct gcctcttcct ttctacgctg ctccttgtgc 120 tcaacaaatg tggacggaga aacaagtttg ggatcaaccg cccggctgtg ctggctccag 180 aggatgggct ggccatgtcc ctgcatttca tgacattggg tggcagctcc ctgtccccca 240 ccgagggcaa aggctctggg ctccaaggcc acatcatcga gaacccacaa tacttcagtg 300 atgcctgtgt tcaccacatc aagcgccggg acatcgtgct caagtgggag ctgggggagg 360 gcgcctttgg gaaggtcttc cttgctgagt gccacaacct cctgcctgag caggacaaga 420 tgctggtggc tgtcaaggca ctgaaggagg cgtccgagag tgctcggcag gacttccagc 480 gtgaggctga gctgctcacc atgctgcagc accagcacat cgtgcgcttc ttcggcgtct 540 gcaccgaggg ccgccccctg ctcatggtct ttgagtatat gcggcacggg gacctcaacc 600 gcttcctccg atcccatgga cctgatgcca agctgctggc tggtggggag gatgtggctc 660 caggccccct gggtctgggg cagctgctgg ccgtggctag ccaggtcgct gcggggatgg 720 tgtacctggc gggtctgcat tttgtgcacc gggacctggc cacacgcaac tgtctagtgg 780 gccagggact ggtggtcaag attggtgatt ttggcatgag cagggatatc tacagcaccg 840 actattaccg tgtgggaggc cgcaccatgc tgcccattcg ctggatgccg cccgagagca 900 tcctgtaccg taagttcacc accgagagcg acgtgtggag cttcggcgtg gtgctctggg 960 agatcttcac ctacggcaag cagccctggt accagctctc caacacggag gcaatcgact 1020 gcatcacgca gggacgtgag ttggagcggc cacgtgcctg cccaccagag gtctacgcca 1080 tcatgcgggg ctgctggcag cgggagcccc agcaacgcca cagcatcaag gatgtgcacg 1140 cccggctgca agccctggcc caggcacctc ctgtctacct ggatgtcctg ggctag 1196 <210> 9 <211> 17 <212> DNA <213> NTRK1 3' terminal cleavage site seq <400> 9 acactaacag cacatct 17 <210> 10 <211> 1037 <212> DNA <213> NTRK1 fragment cDNA <400> 10 gcccggctgt gctggctcca gaggatgggc tggccatgtc cctgcatttc atgacattgg 60 gtggcagctc cctgtccccc accgagggca aaggctctgg gctccaaggc cacatcatcg 120 agaacccaca atacttcagt gatgcctgtg ttcaccacat caagcgccgg gacatcgtgc 180 tcaagtggga gctgggggag ggcgcctttg ggaaggtctt ccttgctgag tgccacaacc 240 tcctgcctga gcaggacaag atgctggtgg ctgtcaaggc actgaaggag gcgtccgaga 300 gtgctcggca ggacttccag cgtgaggctg agctgctcac catgctgcag caccagcaca 360 tcgtgcgctt cttcggcgtc tgcaccgagg gccgccccct gctcatggtc tttgagtata 420 tgcggcacgg ggacctcaac cgcttcctcc gatcccatgg acctgatgcc aagctgctgg 480 ctggtgggga ggatgtggct ccaggccccc tgggtctggg gcagctgctg gccgtggcta 540 gccaggtcgc tgcggggatg gtgtacctgg cgggtctgca ttttgtgcac cgggacctgg 600 ccacacgcaa ctgtctagtg ggccagggac tggtggtcaa gattggtgat tttggcatga 660 gcagggatat ctacagcacc gactattacc gtgtgggagg ccgcaccatg ctgcccattc 720 gctggatgcc gcccgagagc atcctgtacc gtaagttcac caccgagagc gacgtgtgga 780 gcttcggcgt ggtgctctgg gagatcttca cctacggcaa gcagccctgg taccagctct 840 ccaacacgga ggcaatcgac tgcatcacgc agggacgtga gttggagcgg ccacgtgcct 900 gcccaccaga ggtctacgcc atcatgcggg gctgctggca gcgggagccc cagcaacgcc 960 acagcatcaa ggatgtgcac gcccggctgc aagccctggc ccaggcacct cctgtctacc 1020 tggatgtcct gggctag 1037 <210> 11 <211> 17 <212> DNA <213> NTRK1 3' terminal cleavage site seq <400> 11 gcccggctgt gctggct 17 <210> 12 <211> 890 <212> DNA <213> NTRK1 fragment cDNA <400> 12 gtgttcacca catcaagcgc cgggacatcg tgctcaagtg ggagctgggg gagggcgcct 60 ttgggaaggt cttccttgct gagtgccaca acctcctgcc tgagcaggac aagatgctgg 120 tggctgtcaa ggcactgaag gaggcgtccg agagtgctcg gcaggacttc cagcgtgagg 180 ctgagctgct caccatgctg cagcaccagc acatcgtgcg cttcttcggc gtctgcaccg 240 agggccgccc cctgctcatg gtctttgagt atatgcggca cggggacctc aaccgcttcc 300 tccgatccca tggacctgat gccaagctgc tggctggtgg ggaggatgtg gctccaggcc 360 ccctgggtct ggggcagctg ctggccgtgg ctagccaggt cgctgcgggg atggtgtacc 420 tggcgggtct gcattttgtg caccgggacc tggccacacg caactgtcta gtgggccagg 480 gactggtggt caagattggt gattttggca tgagcaggga tatctacagc accgactatt 540 accgtgtggg aggccgcacc atgctgccca ttcgctggat gccgcccgag agcatcctgt 600 accgtaagtt caccaccgag agcgacgtgt ggagcttcgg cgtggtgctc tgggagatct 660 tcacctacgg caagcagccc tggtaccagc tctccaacac ggaggcaatc gactgcatca 720 cgcagggacg tgagttggag cggccacgtg cctgcccacc agaggtctac gccatcatgc 780 ggggctgctg gcagcgggag ccccagcaac gccacagcat caaggatgtg cacgcccggc 840 tgcaagccct ggcccaggca cctcctgtct acctggatgt cctgggctag 890 <210> 13 <211> 17 <212> DNA <213> NTRK1 3' terminal cleavage site seq <400> 13 gtgttcacca catcaag 17 <210> 14 <211> 796 <212> PRT <213> NTRK1 full length <400> 14 Met Leu Arg Gly Gly Arg Arg Gly Gln Leu Gly Trp His Ser Trp Ala 1 5 10 15 Ala Gly Pro Gly Ser Leu Leu Ala Trp Leu Ile Leu Ala Ser Ala Gly 20 25 30 Ala Ala Pro Cys Pro Asp Ala Cys Cys Pro His Gly Ser Ser Gly Leu 35 40 45 Arg Cys Thr Arg Asp Gly Ala Leu Asp Ser Leu His His Leu Pro Gly 50 55 60 Ala Glu Asn Leu Thr Glu Leu Tyr Ile Glu Asn Gln Gln His Leu Gln 65 70 75 80 His Leu Glu Leu Arg Asp Leu Arg Gly Leu Gly Glu Leu Arg Asn Leu 85 90 95 Thr Ile Val Lys Ser Gly Leu Arg Phe Val Ala Pro Asp Ala Phe His 100 105 110 Phe Thr Pro Arg Leu Ser Arg Leu Asn Leu Ser Phe Asn Ala Leu Glu 115 120 125 Ser Leu Ser Trp Lys Thr Val Gln Gly Leu Ser Leu Gln Glu Leu Val 130 135 140 Leu Ser Gly Asn Pro Leu His Cys Ser Cys Ala Leu Arg Trp Leu Gln 145 150 155 160 Arg Trp Glu Glu Glu Gly Leu Gly Gly Val Pro Glu Gln Lys Leu Gln 165 170 175 Cys His Gly Gln Gly Pro Leu Ala His Met Pro Asn Ala Ser Cys Gly 180 185 190 Val Pro Thr Leu Lys Val Gln Val Pro Asn Ala Ser Val Asp Val Gly 195 200 205 Asp Asp Val Leu Leu Arg Cys Gln Val Glu Gly Arg Gly Leu Glu Gln 210 215 220 Ala Gly Trp Ile Leu Thr Glu Leu Glu Gln Ser Ala Thr Val Met Lys 225 230 235 240 Ser Gly Gly Leu Pro Ser Leu Gly Leu Thr Leu Ala Asn Val Thr Ser 245 250 255 Asp Leu Asn Arg Lys Asn Val Thr Cys Trp Ala Glu Asn Asp Val Gly 260 265 270 Arg Ala Glu Val Ser Val Gln Val Asn Val Ser Phe Pro Ala Ser Val 275 280 285 Gln Leu His Thr Ala Val Glu Met His His Trp Cys Ile Pro Phe Ser 290 295 300 Val Asp Gly Gln Pro Ala Pro Ser Leu Arg Trp Leu Phe Asn Gly Ser 305 310 315 320 Val Leu Asn Glu Thr Ser Phe Ile Phe Thr Glu Phe Leu Glu Pro Ala 325 330 335 Ala Asn Glu Thr Val Arg His Gly Cys Leu Arg Leu Asn Gln Pro Thr 340 345 350 His Val Asn Asn Gly Asn Tyr Thr Leu Leu Ala Ala Asn Pro Phe Gly 355 360 365 Gln Ala Ser Ala Ser Ile Met Ala Ala Phe Met Asp Asn Pro Phe Glu 370 375 380 Phe Asn Pro Glu Asp Pro Ile Pro Val Ser Phe Ser Pro Val Asp Thr 385 390 395 400 Asn Ser Thr Ser Gly Asp Pro Val Glu Lys Lys Asp Glu Thr Pro Phe 405 410 415 Gly Val Ser Val Ala Val Gly Leu Ala Val Phe Ala Cys Leu Phe Leu 420 425 430 Ser Thr Leu Leu Leu Val Leu Asn Lys Cys Gly Arg Arg Asn Lys Phe 435 440 445 Gly Ile Asn Arg Pro Ala Val Leu Ala Pro Glu Asp Gly Leu Ala Met 450 455 460 Ser Leu His Phe Met Thr Leu Gly Gly Ser Ser Leu Ser Pro Thr Glu 465 470 475 480 Gly Lys Gly Ser Gly Leu Gln Gly His Ile Ile Glu Asn Pro Gln Tyr 485 490 495 Phe Ser Asp Ala Cys Val His His Ile Lys Arg Arg Asp Ile Val Leu 500 505 510 Lys Trp Glu Leu Gly Glu Gly Ala Phe Gly Lys Val Phe Leu Ala Glu 515 520 525 Cys His Asn Leu Leu Pro Glu Gln Asp Lys Met Leu Val Ala Val Lys 530 535 540 Ala Leu Lys Glu Ala Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg Glu 545 550 555 560 Ala Glu Leu Leu Thr Met Leu Gln His Gln His Ile Val Arg Phe Phe 565 570 575 Gly Val Cys Thr Glu Gly Arg Pro Leu Leu Met Val Phe Glu Tyr Met 580 585 590 Arg His Gly Asp Leu Asn Arg Phe Leu Arg Ser His Gly Pro Asp Ala 595 600 605 Lys Leu Leu Ala Gly Gly Glu Asp Val Ala Pro Gly Pro Leu Gly Leu 610 615 620 Gly Gln Leu Leu Ala Val Ala Ser Gln Val Ala Ala Gly Met Val Tyr 625 630 635 640 Leu Ala Gly Leu His Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys 645 650 655 Leu Val Gly Gln Gly Leu Val Val Lys Ile Gly Asp Phe Gly Met Ser 660 665 670 Arg Asp Ile Tyr Ser Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr Met 675 680 685 Leu Pro Ile Arg Trp Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys Phe 690 695 700 Thr Thr Glu Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu Ile 705 710 715 720 Phe Thr Tyr Gly Lys Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu Ala 725 730 735 Ile Asp Cys Ile Thr Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala Cys 740 745 750 Pro Pro Glu Val Tyr Ala Ile Met Arg Gly Cys Trp Gln Arg Glu Pro 755 760 765 Gln Gln Arg His Ser Ile Lys Asp Val His Ala Arg Leu Gln Ala Leu 770 775 780 Ala Gln Ala Pro Pro Val Tyr Leu Asp Val Leu Gly 785 790 795 <210> 15 <211> 397 <212> PRT <213> NTRK1 fragment <400> 15 Thr Asn Ser Thr Ser Gly Asp Pro Val Glu Lys Lys Asp Glu Thr Pro 1 5 10 15 Phe Gly Val Ser Val Ala Val Gly Leu Ala Val Phe Ala Cys Leu Phe 20 25 30 Leu Ser Thr Leu Leu Leu Val Leu Asn Lys Cys Gly Arg Arg Asn Lys 35 40 45 Phe Gly Ile Asn Arg Pro Ala Val Leu Ala Pro Glu Asp Gly Leu Ala 50 55 60 Met Ser Leu His Phe Met Thr Leu Gly Gly Ser Ser Leu Ser Pro Thr 65 70 75 80 Glu Gly Lys Gly Ser Gly Leu Gln Gly His Ile Ile Glu Asn Pro Gln 85 90 95 Tyr Phe Ser Asp Ala Cys Val His His Ile Lys Arg Arg Asp Ile Val 100 105 110 Leu Lys Trp Glu Leu Gly Glu Gly Ala Phe Gly Lys Val Phe Leu Ala 115 120 125 Glu Cys His Asn Leu Leu Pro Glu Gln Asp Lys Met Leu Val Ala Val 130 135 140 Lys Ala Leu Lys Glu Ala Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg 145 150 155 160 Glu Ala Glu Leu Leu Thr Met Leu Gln His Gln His Ile Val Arg Phe 165 170 175 Phe Gly Val Cys Thr Glu Gly Arg Pro Leu Leu Met Val Phe Glu Tyr 180 185 190 Met Arg His Gly Asp Leu Asn Arg Phe Leu Arg Ser His Gly Pro Asp 195 200 205 Ala Lys Leu Leu Ala Gly Gly Glu Asp Val Ala Pro Gly Pro Leu Gly 210 215 220 Leu Gly Gln Leu Leu Ala Val Ala Ser Gln Val Ala Ala Gly Met Val 225 230 235 240 Tyr Leu Ala Gly Leu His Phe Val His Arg Asp Leu Ala Thr Arg Asn 245 250 255 Cys Leu Val Gly Gln Gly Leu Val Val Lys Ile Gly Asp Phe Gly Met 260 265 270 Ser Arg Asp Ile Tyr Ser Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr 275 280 285 Met Leu Pro Ile Arg Trp Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys 290 295 300 Phe Thr Thr Glu Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu 305 310 315 320 Ile Phe Thr Tyr Gly Lys Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu 325 330 335 Ala Ile Asp Cys Ile Thr Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala 340 345 350 Cys Pro Pro Glu Val Tyr Ala Ile Met Arg Gly Cys Trp Gln Arg Glu 355 360 365 Pro Gln Gln Arg His Ser Ile Lys Asp Val His Ala Arg Leu Gln Ala 370 375 380 Leu Ala Gln Ala Pro Pro Val Tyr Leu Asp Val Leu Gly 385 390 395 <210> 16 <211> 344 <212> PRT <213> NTRK1 fragment <400> 16 Pro Ala Val Leu Ala Pro Glu Asp Gly Leu Ala Met Ser Leu His Phe 1 5 10 15 Met Thr Leu Gly Gly Ser Ser Leu Ser Pro Thr Glu Gly Lys Gly Ser 20 25 30 Gly Leu Gln Gly His Ile Ile Glu Asn Pro Gln Tyr Phe Ser Asp Ala 35 40 45 Cys Val His His Ile Lys Arg Arg Asp Ile Val Leu Lys Trp Glu Leu 50 55 60 Gly Glu Gly Ala Phe Gly Lys Val Phe Leu Ala Glu Cys His Asn Leu 65 70 75 80 Leu Pro Glu Gln Asp Lys Met Leu Val Ala Val Lys Ala Leu Lys Glu 85 90 95 Ala Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg Glu Ala Glu Leu Leu 100 105 110 Thr Met Leu Gln His Gln His Ile Val Arg Phe Phe Gly Val Cys Thr 115 120 125 Glu Gly Arg Pro Leu Leu Met Val Phe Glu Tyr Met Arg His Gly Asp 130 135 140 Leu Asn Arg Phe Leu Arg Ser His Gly Pro Asp Ala Lys Leu Leu Ala 145 150 155 160 Gly Gly Glu Asp Val Ala Pro Gly Pro Leu Gly Leu Gly Gln Leu Leu 165 170 175 Ala Val Ala Ser Gln Val Ala Ala Gly Met Val Tyr Leu Ala Gly Leu 180 185 190 His Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Gln 195 200 205 Gly Leu Val Val Lys Ile Gly Asp Phe Gly Met Ser Arg Asp Ile Tyr 210 215 220 Ser Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr Met Leu Pro Ile Arg 225 230 235 240 Trp Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys Phe Thr Thr Glu Ser 245 250 255 Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu Ile Phe Thr Tyr Gly 260 265 270 Lys Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu Ala Ile Asp Cys Ile 275 280 285 Thr Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala Cys Pro Pro Glu Val 290 295 300 Tyr Ala Ile Met Arg Gly Cys Trp Gln Arg Glu Pro Gln Gln Arg His 305 310 315 320 Ser Ile Lys Asp Val His Ala Arg Leu Gln Ala Leu Ala Gln Ala Pro 325 330 335 Pro Val Tyr Leu Asp Val Leu Gly 340 <210> 17 <211> 5 <212> PRT <213> NTRK1 C terminus cleavage site aa seq <400> 17 Pro Ala Val Leu Ala 1 5 <210> 18 <211> 295 <212> PRT <213> NTRK1 fragment site <400> 18 Val His His Ile Lys Arg Arg Asp Ile Val Leu Lys Trp Glu Leu Gly 1 5 10 15 Glu Gly Ala Phe Gly Lys Val Phe Leu Ala Glu Cys His Asn Leu Leu 20 25 30 Pro Glu Gln Asp Lys Met Leu Val Ala Val Lys Ala Leu Lys Glu Ala 35 40 45 Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg Glu Ala Glu Leu Leu Thr 50 55 60 Met Leu Gln His Gln His Ile Val Arg Phe Phe Gly Val Cys Thr Glu 65 70 75 80 Gly Arg Pro Leu Leu Met Val Phe Glu Tyr Met Arg His Gly Asp Leu 85 90 95 Asn Arg Phe Leu Arg Ser His Gly Pro Asp Ala Lys Leu Leu Ala Gly 100 105 110 Gly Glu Asp Val Ala Pro Gly Pro Leu Gly Leu Gly Gln Leu Leu Ala 115 120 125 Val Ala Ser Gln Val Ala Ala Gly Met Val Tyr Leu Ala Gly Leu His 130 135 140 Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Gln Gly 145 150 155 160 Leu Val Val Lys Ile Gly Asp Phe Gly Met Ser Arg Asp Ile Tyr Ser 165 170 175 Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr Met Leu Pro Ile Arg Trp 180 185 190 Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys Phe Thr Thr Glu Ser Asp 195 200 205 Val Trp Ser Phe Gly Val Val Leu Trp Glu Ile Phe Thr Tyr Gly Lys 210 215 220 Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu Ala Ile Asp Cys Ile Thr 225 230 235 240 Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala Cys Pro Pro Glu Val Tyr 245 250 255 Ala Ile Met Arg Gly Cys Trp Gln Arg Glu Pro Gln Gln Arg His Ser 260 265 270 Ile Lys Asp Val His Ala Arg Leu Gln Ala Leu Ala Gln Ala Pro Pro 275 280 285 Val Tyr Leu Asp Val Leu Gly 290 295 <210> 19 <211> 5 <212> PRT <213> NTRK1 C terminus cleavage site seq <400> 19 Val His His Ile Lys 1 5 <210> 20 <211> 5 <212> PRT <213> NTRK1 C terminus cleavage site seq <400> 20 Val His His Ile Lys 1 5 <210> 21 <211> 33 <212> DNA <213> TPM3-NTRK1 fusion site seq <400> 21 attgatgacc tggaagacac taacagcaca tct 33 <210> 22 <211> 33 <212> DNA <213> TPM3-NTRK1 fusion site seq <400> 22 attgatgacc tggaaggccc ggctgtgctg gct 33 <210> 23 <211> 33 <212> DNA <213> TPM3-NTRK1 fusion site seq <400> 23 attgatgacc tggaaggtgt tcaccacatc aag 33 <210> 24 <211> 11 <212> PRT <213> TPM3-NTRK1 fusion site aa seq <400> 24 Ile Asp Asp Leu Glu Asp Thr Asn Ser Thr Ser 1 5 10 <210> 25 <211> 11 <212> PRT <213> TPM3-NTRK1 fusion site aa seq <400> 25 Ile Asp Asp Leu Glu Gly Pro Ala Val Leu Ala 1 5 10 <210> 26 <211> 11 <212> PRT <213> TPM3-NTRK1 fusion site aa seq <400> 26 Ile Asp Asp Leu Glu Gly Val His His Ile Lys 1 5 10 <210> 27 <211> 1995 <212> DNA <213> LMNA CDS <400> 27 atggagaccc cgtcccagcg gcgcgccacc cgcagcgggg cgcaggccag ctccactccg 60 ctgtcgccca cccgcatcac ccggctgcag gagaaggagg acctgcagga gctcaatgat 120 cgcttggcgg tctacatcga ccgtgtgcgc tcgctggaaa cggagaacgc agggctgcgc 180 cttcgcatca ccgagtctga agaggtggtc agccgcgagg tgtccggcat caaggccgcc 240 tacgaggccg agctcgggga tgcccgcaag acccttgact cagtagccaa ggagcgcgcc 300 cgcctgcagc tggagctgag caaagtgcgt gaggagttta aggagctgaa agcgcgcaat 360 accaagaagg agggtgacct gatagctgct caggctcggc tgaaggacct ggaggctctg 420 ctgaactcca aggaggccgc actgagcact gctctcagtg agaagcgcac gctggagggc 480 gagctgcatg atctgcgggg ccaggtggcc aagcttgagg cagccctagg tgaggccaag 540 aagcaacttc aggatgagat gctgcggcgg gtggatgctg agaacaggct gcagaccatg 600 aaggaggaac tggacttcca gaagaacatc tacagtgagg agctgcgtga gaccaagcgc 660 cgtcatgaga cccgactggt ggagattgac aatgggaagc agcgtgagtt tgagagccgg 720 ctggcggatg cgctgcagga actgcgggcc cagcatgagg accaggtgga gcagtataag 780 aaggagctgg agaagactta ttctgccaag ctggacaatg ccaggcagtc tgctgagagg 840 aacagcaacc tggtgggggc tgcccacgag gagctgcagc agtcgcgcat ccgcatcgac 900 agcctctctg cccagctcag ccagctccag aagcagctgg cagccaagga ggcgaagctt 960 cgagacctgg aggactcact ggcccgtgag cgggacacca gccggcggct gctggcggaa 1020 aaggagcggg agatggccga gatgcgggca aggatgcagc agcagctgga cgagtaccag 1080 gagcttctgg acatcaagct ggccctggac atggagatcc acgcctaccg caagctcttg 1140 gagggcgagg aggagaggct acgcctgtcc cccagcccta cctcgcagcg cagccgtggc 1200 cgtgcttcct ctcactcatc ccagacacag ggtgggggca gcgtcaccaa aaagcgcaaa 1260 ctggagtcca ctgagagccg cagcagcttc tcacagcacg cacgcactag cgggcgcgtg 1320 gccgtggagg aggtggatga ggagggcaag tttgtccggc tgcgcaacaa gtccaatgag 1380 gaccagtcca tgggcaattg gcagatcaag cgccagaatg gagatgatcc cttgctgact 1440 taccggttcc caccaaagtt caccctgaag gctgggcagg tggtgacgat ctgggctgca 1500 ggagctgggg ccacccacag cccccctacc gacctggtgt ggaaggcaca gaacacctgg 1560 ggctgcggga acagcctgcg tacggctctc atcaactcca ctggggaaga agtggccatg 1620 cgcaagctgg tgcgctcagt gactgtggtt gaggacgacg aggatgagga tggagatgac 1680 ctgctccatc accaccacgg ctcccactgc agcagctcgg gggaccccgc tgagtacaac 1740 ctgcgctcgc gcaccgtgct gtgcgggacc tgcgggcagc ctgccgacaa ggcatctgcc 1800 agcggctcag gagcccaggt gggcggaccc atctcctctg gctcttctgc ctccagtgtc 1860 acggtcactc gcagctaccg cagtgtgggg ggcagtgggg gtggcagctt cggggacaat 1920 ctggtcaccc gctcctacct cctgggcaac tccagccccc gaacccagag cccccagaac 1980 tgcagcatca tgtaa 1995 <210> 28 <211> 985 <212> DNA <213> LMNA fragment cDNA <400> 28 atggagaccc cgtcccagcg gcgcgccacc cgcagcgggg cgcaggccag ctccactccg 60 ctgtcgccca cccgcatcac ccggctgcag gagaaggagg acctgcagga gctcaatgat 120 cgcttggcgg tctacatcga ccgtgtgcgc tcgctggaaa cggagaacgc agggctgcgc 180 cttcgcatca ccgagtctga agaggtggtc agccgcgagg tgtccggcat caaggccgcc 240 tacgaggccg agctcgggga tgcccgcaag acccttgact cagtagccaa ggagcgcgcc 300 cgcctgcagc tggagctgag caaagtgcgt gaggagttta aggagctgaa agcgcgcaat 360 accaagaagg agggtgacct gatagctgct caggctcggc tgaaggacct ggaggctctg 420 ctgaactcca aggaggccgc actgagcact gctctcagtg agaagcgcac gctggagggc 480 gagctgcatg atctgcgggg ccaggtggcc aagcttgagg cagccctagg tgaggccaag 540 aagcaacttc aggatgagat gctgcggcgg gtggatgctg agaacaggct gcagaccatg 600 aaggaggaac tggacttcca gaagaacatc tacagtgagg agctgcgtga gaccaagcgc 660 cgtcatgaga cccgactggt ggagattgac aatgggaagc agcgtgagtt tgagagccgg 720 ctggcggatg cgctgcagga actgcgggcc cagcatgagg accaggtgga gcagtataag 780 aaggagctgg agaagactta ttctgccaag ctggacaatg ccaggcagtc tgctgagagg 840 aacagcaacc tggtgggggc tgcccacgag gagctgcagc agtcgcgcat ccgcatcgac 900 agcctctctg cccagctcag ccagctccag aagcagctgg cagccaagga ggcgaagctt 960 cgagacctgg aggactcact ggccc 985 <210> 29 <211> 16 <212> DNA <213> LMNA 5' terminal cleavage site seq <400> 29 gaggactcac tggccc 16 <210> 30 <211> 664 <212> PRT <213> LMNA full length <400> 30 Met Glu Thr Pro Ser Gln Arg Arg Ala Thr Arg Ser Gly Ala Gln Ala 1 5 10 15 Ser Ser Thr Pro Leu Ser Pro Thr Arg Ile Thr Arg Leu Gln Glu Lys 20 25 30 Glu Asp Leu Gln Glu Leu Asn Asp Arg Leu Ala Val Tyr Ile Asp Arg 35 40 45 Val Arg Ser Leu Glu Thr Glu Asn Ala Gly Leu Arg Leu Arg Ile Thr 50 55 60 Glu Ser Glu Glu Val Val Ser Arg Glu Val Ser Gly Ile Lys Ala Ala 65 70 75 80 Tyr Glu Ala Glu Leu Gly Asp Ala Arg Lys Thr Leu Asp Ser Val Ala 85 90 95 Lys Glu Arg Ala Arg Leu Gln Leu Glu Leu Ser Lys Val Arg Glu Glu 100 105 110 Phe Lys Glu Leu Lys Ala Arg Asn Thr Lys Lys Glu Gly Asp Leu Ile 115 120 125 Ala Ala Gln Ala Arg Leu Lys Asp Leu Glu Ala Leu Leu Asn Ser Lys 130 135 140 Glu Ala Ala Leu Ser Thr Ala Leu Ser Glu Lys Arg Thr Leu Glu Gly 145 150 155 160 Glu Leu His Asp Leu Arg Gly Gln Val Ala Lys Leu Glu Ala Ala Leu 165 170 175 Gly Glu Ala Lys Lys Gln Leu Gln Asp Glu Met Leu Arg Arg Val Asp 180 185 190 Ala Glu Asn Arg Leu Gln Thr Met Lys Glu Glu Leu Asp Phe Gln Lys 195 200 205 Asn Ile Tyr Ser Glu Glu Leu Arg Glu Thr Lys Arg Arg His Glu Thr 210 215 220 Arg Leu Val Glu Ile Asp Asn Gly Lys Gln Arg Glu Phe Glu Ser Arg 225 230 235 240 Leu Ala Asp Ala Leu Gln Glu Leu Arg Ala Gln His Glu Asp Gln Val 245 250 255 Glu Gln Tyr Lys Lys Glu Leu Glu Lys Thr Tyr Ser Ala Lys Leu Asp 260 265 270 Asn Ala Arg Gln Ser Ala Glu Arg Asn Ser Asn Leu Val Gly Ala Ala 275 280 285 His Glu Glu Leu Gln Gln Ser Arg Ile Arg Ile Asp Ser Leu Ser Ala 290 295 300 Gln Leu Ser Gln Leu Gln Lys Gln Leu Ala Ala Lys Glu Ala Lys Leu 305 310 315 320 Arg Asp Leu Glu Asp Ser Leu Ala Arg Glu Arg Asp Thr Ser Arg Arg 325 330 335 Leu Leu Ala Glu Lys Glu Arg Glu Met Ala Glu Met Arg Ala Arg Met 340 345 350 Gln Gln Gln Leu Asp Glu Tyr Gln Glu Leu Leu Asp Ile Lys Leu Ala 355 360 365 Leu Asp Met Glu Ile His Ala Tyr Arg Lys Leu Leu Glu Gly Glu Glu 370 375 380 Glu Arg Leu Arg Leu Ser Pro Ser Pro Thr Ser Gln Arg Ser Arg Gly 385 390 395 400 Arg Ala Ser Ser His Ser Ser Gln Thr Gln Gly Gly Gly Ser Val Thr 405 410 415 Lys Lys Arg Lys Leu Glu Ser Thr Glu Ser Arg Ser Ser Phe Ser Gln 420 425 430 His Ala Arg Thr Ser Gly Arg Val Ala Val Glu Glu Val Asp Glu Glu 435 440 445 Gly Lys Phe Val Arg Leu Arg Asn Lys Ser Asn Glu Asp Gln Ser Met 450 455 460 Gly Asn Trp Gln Ile Lys Arg Gln Asn Gly Asp Asp Pro Leu Leu Thr 465 470 475 480 Tyr Arg Phe Pro Pro Lys Phe Thr Leu Lys Ala Gly Gln Val Val Thr 485 490 495 Ile Trp Ala Ala Gly Ala Gly Ala Thr His Ser Pro Pro Thr Asp Leu 500 505 510 Val Trp Lys Ala Gln Asn Thr Trp Gly Cys Gly Asn Ser Leu Arg Thr 515 520 525 Ala Leu Ile Asn Ser Thr Gly Glu Glu Val Ala Met Arg Lys Leu Val 530 535 540 Arg Ser Val Thr Val Val Glu Asp Asp Glu Asp Glu Asp Gly Asp Asp 545 550 555 560 Leu Leu His His His His Gly Ser His Cys Ser Ser Ser Gly Asp Pro 565 570 575 Ala Glu Tyr Asn Leu Arg Ser Arg Thr Val Leu Cys Gly Thr Cys Gly 580 585 590 Gln Pro Ala Asp Lys Ala Ser Ala Ser Gly Ser Gly Ala Gln Val Gly 595 600 605 Gly Pro Ile Ser Ser Gly Ser Ser Ala Ser Ser Val Thr Val Thr Arg 610 615 620 Ser Tyr Arg Ser Val Gly Gly Ser Gly Gly Gly Ser Phe Gly Asp Asn 625 630 635 640 Leu Val Thr Arg Ser Tyr Leu Leu Gly Asn Ser Ser Pro Arg Thr Gln 645 650 655 Ser Pro Gln Asn Cys Ser Ile Met 660 <210> 31 <211> 328 <212> PRT <213> LMNA fragment <400> 31 Met Glu Thr Pro Ser Gln Arg Arg Ala Thr Arg Ser Gly Ala Gln Ala 1 5 10 15 Ser Ser Thr Pro Leu Ser Pro Thr Arg Ile Thr Arg Leu Gln Glu Lys 20 25 30 Glu Asp Leu Gln Glu Leu Asn Asp Arg Leu Ala Val Tyr Ile Asp Arg 35 40 45 Val Arg Ser Leu Glu Thr Glu Asn Ala Gly Leu Arg Leu Arg Ile Thr 50 55 60 Glu Ser Glu Glu Val Val Ser Arg Glu Val Ser Gly Ile Lys Ala Ala 65 70 75 80 Tyr Glu Ala Glu Leu Gly Asp Ala Arg Lys Thr Leu Asp Ser Val Ala 85 90 95 Lys Glu Arg Ala Arg Leu Gln Leu Glu Leu Ser Lys Val Arg Glu Glu 100 105 110 Phe Lys Glu Leu Lys Ala Arg Asn Thr Lys Lys Glu Gly Asp Leu Ile 115 120 125 Ala Ala Gln Ala Arg Leu Lys Asp Leu Glu Ala Leu Leu Asn Ser Lys 130 135 140 Glu Ala Ala Leu Ser Thr Ala Leu Ser Glu Lys Arg Thr Leu Glu Gly 145 150 155 160 Glu Leu His Asp Leu Arg Gly Gln Val Ala Lys Leu Glu Ala Ala Leu 165 170 175 Gly Glu Ala Lys Lys Gln Leu Gln Asp Glu Met Leu Arg Arg Val Asp 180 185 190 Ala Glu Asn Arg Leu Gln Thr Met Lys Glu Glu Leu Asp Phe Gln Lys 195 200 205 Asn Ile Tyr Ser Glu Glu Leu Arg Glu Thr Lys Arg Arg His Glu Thr 210 215 220 Arg Leu Val Glu Ile Asp Asn Gly Lys Gln Arg Glu Phe Glu Ser Arg 225 230 235 240 Leu Ala Asp Ala Leu Gln Glu Leu Arg Ala Gln His Glu Asp Gln Val 245 250 255 Glu Gln Tyr Lys Lys Glu Leu Glu Lys Thr Tyr Ser Ala Lys Leu Asp 260 265 270 Asn Ala Arg Gln Ser Ala Glu Arg Asn Ser Asn Leu Val Gly Ala Ala 275 280 285 His Glu Glu Leu Gln Gln Ser Arg Ile Arg Ile Asp Ser Leu Ser Ala 290 295 300 Gln Leu Ser Gln Leu Gln Lys Gln Leu Ala Ala Lys Glu Ala Lys Leu 305 310 315 320 Arg Asp Leu Glu Asp Ser Leu Ala 325 <210> 32 <211> 5 <212> PRT <213> LMNA N terminus cleavage site seq <400> 32 Glu Asp Ser Leu Ala 1 5 <210> 33 <211> 33 <212> DNA <213> LMNA-NTRK1 fusion site seq <400> 33 gaggactcac tggcccgccc ggctgtgctg gct 33 <210> 34 <211> 11 <212> PRT <213> LMNA-NTRK1 fusion site aa seq <400> 34 Glu Asp Ser Leu Ala Gly Pro Ala Val Leu Ala 1 5 10 <210> 35 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> TPM3-NTRK1 Forward Primer Sequence <400> 35 aagaagataa atatgagg 18 <210> 36 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> TPM3-NTRK1 Reverse Primer Sequence <400> 36 ccgtgccgca tatactcaaa 20 <210> 37 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> LMNA-NTRK1 Forward Primer Sequence <400> 37 ccaggtggag cagtataaga ag 22 <210> 38 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> LMNA-NTRK1 Reverse Primer Sequence <400> 38 tgtgggttct cgatgatgtg 20 <110> LG Life Sciences Ltd. <120> Novel NTRK1 fusion gene as colon cancer marker and the uses the <130> KPA140037-KR-P1 <150> KR 10-2014-0004491 <151> 2014-01-14 <160> 38 <170> Kopatentin 2.0 <210> 1 <211> 858 <212> DNA <213> TPM3 CDS <400> 1 atgatggagg ccatcaagaa aaagatgcag atgctgaagt tagacaagga gaatgctctg 60 gatcgggcag agcaagctga agctgagcag aagcaggcag aagaaagaag taaacagctg 120 gaggatgagc tggcagccat gcagaagaag ctgaaaggga cagaggatga gctggacaag 180 tattctgaag ctttgaagga tgcccaggag aagctggaac tggcagagaa gaaggctgct 240 gatgctgagg ctgaggtggc ctccttgaac cgtaggatcc agctggttga agaagagctg 300 gaccgtgctc aggagcgcct ggccactgcc ctgcaaaagc tggaagaagc tgaaaaagct 360 gctgatgaga gtgagagagg tatgaaggtt attgaaaacc gggccttaaa agatgaagaa 420 aagatggaac tccaggaaat ccaactcaaa gaagctaagc acattgcaga agaggcagat 480 aggaagtatg aagaggtggc tcgtaagttg gtgatcattg aaggagactt ggaacgcaca 540 gaggaacgag ctgagctggc agagtctaag tgttctgagc tggaggagga gctgaagaat 600 gtcaccaaca acctcaagtc tcttgaggct caggcggaga agtactctca aaaagaagat 660 aaatatgagg aagaaatcaa gattcttact gataaactca aggaggcaga gacccgtgct 720 gagtttgctg agagatcggt agccaagctg gaaaagacaa ttgatgacct ggaagatgag 780 ctctatgccc agaaactgaa gtacaaggcc attagcgagg agctggacca cgccctcaat 840 gacatgacct ctatataa 858 <210> 2 <211> 775 <212> DNA <213> TPM3 fragment cDNA <400> 2 atgatggagg ccatcaagaa aaagatgcag atgctgaagt tagacaagga gaatgctctg 60 gatcgggcag agcaagctga agctgagcag aagcaggcag aagaaagaag taaacagctg 120 gaggatgagc tggcagccat gcagaagaag ctgaaaggga cagaggatga gctggacaag 180 tattctgaag ctttgaagga tgcccaggag aagctggaac tggcagagaa gaaggctgct 240 gatgctgagg ctgaggtggc ctccttgaac cgtaggatcc agctggttga agaagagctg 300 gaccgtgctc aggagcgcct ggccactgcc ctgcaaaagc tggaagaagc tgaaaaagct 360 gctgatgaga gtgagagagg tatgaaggtt attgaaaacc gggccttaaa agatgaagaa 420 aagatggaac tccaggaaat ccaactcaaa gaagctaagc acattgcaga agaggcagat 480 aggaagtatg aagaggtggc tcgtaagttg gtgatcattg aaggagactt ggaacgcaca 540 gaggaacgag ctgagctggc agagtctaag tgttctgagc tggaggagga gctgaagaat 600 gtcaccaaca acctcaagtc tcttgaggct caggcggaga agtactctca aaaagaagat 660 aaatatgagg aagaaatcaa gattcttact gataaactca aggaggcaga gacccgtgct 720 gagtttgctg agagatcggt agccaagctg gaaaagacaa ttgatgacct ggaag 775 <210> 3 <211> 16 <212> DNA <213> TPM3 5'terminal cleavage site seq <400> 3 attgatgacc tggaag 16 <210> 4 <211> 285 <212> PRT <213> TPM3 full length <400> 4 Met Met Glu Ala Ile Lys Lys Lys Met Gln Met Leu Lys Leu Asp Lys 1 5 10 15 Glu Asn Ala Leu Asp Arg Ala Glu Gln Ala Glu Ala Glu Gln Lys Gln 20 25 30 Ala Glu Glu Arg Ser Lys Gln Leu Glu Asp Glu Leu Ala Ala Met Gln 35 40 45 Lys Lys Leu Lys Gly Thr Glu Asp Glu Leu Asp Lys Tyr Ser Glu Ala 50 55 60 Leu Lys Asp Ala Gln Glu Lys Leu Glu Leu Ala Glu Lys Lys Ala Ala 65 70 75 80 Asp Ala Glu Ala Glu Val Ala Ser Leu Asn Arg Arg Ile Gln Leu Val 85 90 95 Glu Glu Glu Leu Asp Arg Ala Gln Glu Arg Leu Ala Thr Ala Leu Gln 100 105 110 Lys Leu Glu Glu Ala Glu Lys Ala Ala Asp Glu Ser Glu Arg Gly Met 115 120 125 Lys Val Ile Glu Asn Arg Ala Leu Lys Asp Glu Glu Lys Met Glu Leu 130 135 140 Gln Glu Ile Gln Leu Lys Glu Ala Lys His Ile Ala Glu Glu Ala Asp 145 150 155 160 Arg Lys Tyr Glu Glu Val Ala Arg Lys Leu Val Ile Ile Glu Gly Asp 165 170 175 Leu Glu Arg Thr Glu Glu Arg Ala Glu Leu Ala Glu Ser Lys Cys Ser 180 185 190 Glu Leu Glu Glu Glu Leu Lys Asn Val Thr Asn Asn Leu Lys Ser Leu 195 200 205 Glu Ala Gln Ala Glu Lys Tyr Ser Gln Lys Glu Asp Lys Tyr Glu Glu 210 215 220 Glu Ile Lys Ile Leu Thr Asp Lys Leu Lys Glu Ala Glu Thr Arg Ala 225 230 235 240 Glu Phe Ala Glu Arg Ser Val Ala Lys Leu Glu Lys Thr Ile Asp Asp 245 250 255 Leu Glu Asp Glu Leu Tyr Ala Gln Lys Leu Lys Tyr Lys Ala Ile Ser 260 265 270 Glu Glu Leu Asp His Ala Leu Asn Asp Met Thr Ser Ile 275 280 285 <210> 5 <211> 258 <212> PRT <213> TPM3 fragment <400> 5 Met Met Glu Ala Ile Lys Lys Lys Met Gln Met Leu Lys Leu Asp Lys 1 5 10 15 Glu Asn Ala Leu Asp Arg Ala Glu Gln Ala Glu Ala Glu Gln Lys Gln 20 25 30 Ala Glu Glu Arg Ser Lys Gln Leu Glu Asp Glu Leu Ala Ala Met Gln 35 40 45 Lys Lys Leu Lys Gly Thr Glu Asp Glu Leu Asp Lys Tyr Ser Glu Ala 50 55 60 Leu Lys Asp Ala Gln Glu Lys Leu Glu Leu Ala Glu Lys Lys Ala Ala 65 70 75 80 Asp Ala Glu Ala Glu Val Ala Ser Leu Asn Arg Arg Ile Gln Leu Val 85 90 95 Glu Glu Glu Leu Asp Arg Ala Gln Glu Arg Leu Ala Thr Ala Leu Gln 100 105 110 Lys Leu Glu Glu Ala Glu Lys Ala Ala Asp Glu Ser Glu Arg Gly Met 115 120 125 Lys Val Ile Glu Asn Arg Ala Leu Lys Asp Glu Glu Lys Met Glu Leu 130 135 140 Gln Glu Ile Gln Leu Lys Glu Ala Lys His Ile Ala Glu Glu Ala Asp 145 150 155 160 Arg Lys Tyr Glu Glu Val Ala Arg Lys Leu Val Ile Ile Glu Gly Asp 165 170 175 Leu Glu Arg Thr Glu Glu Arg Ala Glu Leu Ala Glu Ser Lys Cys Ser 180 185 190 Glu Leu Glu Glu Glu Leu Lys Asn Val Thr Asn Asn Leu Lys Ser Leu 195 200 205 Glu Ala Gln Ala Glu Lys Tyr Ser Gln Lys Glu Asp Lys Tyr Glu Glu 210 215 220 Glu Ile Lys Ile Leu Thr Asp Lys Leu Lys Glu Ala Glu Thr Arg Ala 225 230 235 240 Glu Phe Ala Glu Arg Ser Val Ala Lys Leu Glu Lys Thr Ile Asp Asp 245 250 255 Leu Glu <210> 6 <211> 5 <212> PRT <213> TPM3 N terminus cleavage site seq <400> 6 Ile Asp Asp Leu Glu 1 5 <210> 7 <211> 2391 <212> DNA <213> NTRK1 CDS <400> 7 atgctgcgag gcggacggcg cgggcagctt ggctggcaca gctgggctgc ggggccgggc 60 agcctgctgg cttggctgat actggcatct gcgggcgccg caccctgccc cgatgcctgc 120 tgcccccacg gctcctcggg actgcgatgc acccgggatg gggccctgga tagcctccac 180 cacctgcccg gcgcagagaa cctgactgag ctctacatcg agaaccagca gcatctgcag 240 catctggagc tccgtgatct gaggggcctg ggggagctga gaaacctcac catcgtgaag 300 agtggtctcc gtttcgtggc gccagatgcc ttccatttca ctcctcggct cagtcgcctg 360 aatctctcct tcaacgctct ggagtctctc tcctggaaaa ctgtgcaggg cctctcctta 420 caggaactgg tcctgtcggg gaaccctctg cactgttctt gtgccctgcg ctggctacag 480 cgctgggagg aggagggact gggcggagtg cctgaacaga agctgcagtg tcatgggcaa 540 gggcccctgg cccacatgcc caatgccagc tgtggtgtgc ccacgctgaa ggtccaggtg 600 cccaatgcct cggtggatgt gggggacgac gtgctgctgc ggtgccaggt ggaggggcgg 660 ggcctggagc aggccggctg gatcctcaca gagctggagc agtcagccac ggtgatgaaa 720 tctgggggtc tgccatccct ggggctgacc ctggccaatg tcaccagtga cctcaacagg 780 aagaacgtga cgtgctgggc agagaacgat gtgggccggg cagaggtctc tgttcaggtc 840 aacgtctcct tcccggccag tgtgcagctg cacacggcgg tggagatgca ccactggtgc 900 atccccttct ctgtggatgg gcagccggca ccgtctctgc gctggctctt caatggctcc 960 gtgctcaatg agaccagctt catcttcact gagttcctgg agccggcagc caatgagacc 1020 gtgcggcacg ggtgtctgcg cctcaaccag cccacccacg tcaacaacgg caactacacg 1080 ctgctggctg ccaacccctt cggccaggcc tccgcctcca tcatggctgc cttcatggac 1140 aaccctttcg agttcaaccc cgaggacccc atccctgtct ccttctcgcc ggtggacact 1200 aacagcacat ctggagaccc ggtggagaag aaggacgaaa caccttttgg ggtctcggtg 1260 gctgtgggcc tggccgtctt tgcctgcctc ttcctttcta cgctgctcct tgtgctcaac 1320 aaatgtggac ggagaaacaa gtttgggatc aaccgcccgg ctgtgctggc tccagaggat 1380 gggctggcca tgtccctgca tttcatgaca ttgggtggca gctccctgtc ccccaccgag 1440 ggcaaaggct ctgggctcca aggccacatc atcgagaacc cacaatactt cagtgatgcc 1500 tgtgttcacc acatcaagcg ccgggacatc gtgctcaagt gggagctggg ggagggcgcc 1560 tttgggaagg tcttccttgc tgagtgccac aacctcctgc ctgagcagga caagatgctg 1620 gtggctgtca aggcactgaa ggaggcgtcc gagagtgctc ggcaggactt ccagcgtgag 1680 gctgagctgc tcaccatgct gcagcaccag cacatcgtgc gcttcttcgg cgtctgcacc 1740 gagggccgcc ccctgctcat ggtctttgag tatatgcggc acggggacct caaccgcttc 1800 ctccgatccc atggacctga tgccaagctg ctggctggtg gggaggatgt ggctccaggc 1860 cccctgggtc tggggcagct gctggccgtg gctagccagg tcgctgcggg gatggtgtac 1920 ctggcgggtc tgcattttgt gcaccgggac ctggccacac gcaactgtct agtgggccag 1980 ggactggtgg tcaagattgg tgattttggc atgagcaggg atatctacag caccgactat 2040 taccgtgtgg gaggccgcac catgctgccc attcgctgga tgccgcccga gagcatcctg 2100 taccgtaagt tcaccaccga gagcgacgtg tggagcttcg gcgtggtgct ctgggagatc 2160 ttcacctacg gcaagcagcc ctggtaccag ctctccaaca cggaggcaat cgactgcatc 2220 acgcagggac gtgagttgga gcggccacgt gcctgcccac cagaggtcta cgccatcatg 2280 cggggctgct ggcagcggga gccccagcaa cgccacagca tcaaggatgt gcacgcccgg 2340 ctgcaagccc tggcccaggc acctcctgtc tacctggatg tcctgggcta g 2391 <210> 8 <211> 1196 <212> DNA <213> NTRK1 fragment cDNA <400> 8 acactaacag cacatctgga gacccggtgg agaagaagga cgaaacacct tttggggtct 60 cggtggctgt gggcctggcc gtctttgcct gcctcttcct ttctacgctg ctccttgtgc 120 tcaacaaatg tggacggaga aacaagtttg ggatcaaccg cccggctgtg ctggctccag 180 aggatgggct ggccatgtcc ctgcatttca tgacattggg tggcagctcc ctgtccccca 240 ccgagggcaa aggctctggg ctccaaggcc acatcatcga gaacccacaa tacttcagtg 300 atgcctgtgt tcaccacatc aagcgccggg acatcgtgct caagtgggag ctgggggagg 360 gcgcctttgg gaaggtcttc cttgctgagt gccacaacct cctgcctgag caggacaaga 420 tgctggtggc tgtcaaggca ctgaaggagg cgtccgagag tgctcggcag gacttccagc 480 gtgaggctga gctgctcacc atgctgcagc accagcacat cgtgcgcttc ttcggcgtct 540 gcaccgaggg ccgccccctg ctcatggtct ttgagtatat gcggcacggg gacctcaacc 600 gcttcctccg atcccatgga cctgatgcca agctgctggc tggtggggag gatgtggctc 660 caggccccct gggtctgggg cagctgctgg ccgtggctag ccaggtcgct gcggggatgg 720 tgtacctggc gggtctgcat tttgtgcacc gggacctggc cacacgcaac tgtctagtgg 780 gccagggact ggtggtcaag attggtgatt ttggcatgag cagggatatc tacagcaccg 840 actattaccg tgtgggaggc cgcaccatgc tgcccattcg ctggatgccg cccgagagca 900 tcctgtaccg taagttcacc accgagagcg acgtgtggag cttcggcgtg gtgctctggg 960 agatcttcac ctacggcaag cagccctggt accagctctc caacacggag gcaatcgact 1020 gcatcacgca gggacgtgag ttggagcggc cacgtgcctg cccaccagag gtctacgcca 1080 tcatgcgggg ctgctggcag cgggagcccc agcaacgcca cagcatcaag gatgtgcacg 1140 cccggctgca agccctggcc caggcacctc ctgtctacct ggatgtcctg ggctag 1196 <210> 9 <211> 17 <212> DNA <213> NTRK1 3 'terminal cleavage site seq <400> 9 acactaacag cacatct 17 <210> 10 <211> 1037 <212> DNA <213> NTRK1 fragment cDNA <400> 10 gcccggctgt gctggctcca gaggatgggc tggccatgtc cctgcatttc atgacattgg 60 gtggcagctc cctgtccccc accgagggca aaggctctgg gctccaaggc cacatcatcg 120 agaacccaca atacttcagt gatgcctgtg ttcaccacat caagcgccgg gacatcgtgc 180 tcaagtggga gctgggggag ggcgcctttg ggaaggtctt ccttgctgag tgccacaacc 240 tcctgcctga gcaggacaag atgctggtgg ctgtcaaggc actgaaggag gcgtccgaga 300 gtgctcggca ggacttccag cgtgaggctg agctgctcac catgctgcag caccagcaca 360 tcgtgcgctt cttcggcgtc tgcaccgagg gccgccccct gctcatggtc tttgagtata 420 tgcggcacgg ggacctcaac cgcttcctcc gatcccatgg acctgatgcc aagctgctgg 480 ctggtgggga ggatgtggct ccaggccccc tgggtctggg gcagctgctg gccgtggcta 540 gccaggtcgc tgcggggatg gtgtacctgg cgggtctgca ttttgtgcac cgggacctgg 600 ccacacgcaa ctgtctagtg ggccagggac tggtggtcaa gattggtgat tttggcatga 660 gcagggatat ctacagcacc gactattacc gtgtgggagg ccgcaccatg ctgcccattc 720 gctggatgcc gcccgagagc atcctgtacc gtaagttcac caccgagagc gacgtgtgga 780 gcttcggcgt ggtgctctgg gagatcttca cctacggcaa gcagccctgg taccagctct 840 ccaacacgga ggcaatcgac tgcatcacgc agggacgtga gttggagcgg ccacgtgcct 900 gcccaccaga ggtctacgcc atcatgcggg gctgctggca gcgggagccc cagcaacgcc 960 acagcatcaa ggatgtgcac gcccggctgc aagccctggc ccaggcacct cctgtctacc 1020 tggatgtcct gggctag 1037 <210> 11 <211> 17 <212> DNA <213> NTRK1 3 'terminal cleavage site seq <400> 11 gcccggctgt gctggct 17 <210> 12 <211> 890 <212> DNA <213> NTRK1 fragment cDNA <400> 12 gtgttcacca catcaagcgc cgggacatcg tgctcaagtg ggagctgggg gagggcgcct 60 ttgggaaggt cttccttgct gagtgccaca acctcctgcc tgagcaggac aagatgctgg 120 tggctgtcaa ggcactgaag gaggcgtccg agagtgctcg gcaggacttc cagcgtgagg 180 ctgagctgct caccatgctg cagcaccagc acatcgtgcg cttcttcggc gtctgcaccg 240 agggccgccc cctgctcatg gtctttgagt atatgcggca cggggacctc aaccgcttcc 300 tccgatccca tggacctgat gccaagctgc tggctggtgg ggaggatgtg gctccaggcc 360 ccctgggtct ggggcagctg ctggccgtgg ctagccaggt cgctgcgggg atggtgtacc 420 tggcgggtct gcattttgtg caccgggacc tggccacacg caactgtcta gtgggccagg 480 gactggtggt caagattggt gattttggca tgagcaggga tatctacagc accgactatt 540 accgtgtggg aggccgcacc atgctgccca ttcgctggat gccgcccgag agcatcctgt 600 accgtaagtt caccaccgag agcgacgtgt ggagcttcgg cgtggtgctc tgggagatct 660 tcacctacgg caagcagccc tggtaccagc tctccaacac ggaggcaatc gactgcatca 720 cgcagggacg tgagttggag cggccacgtg cctgcccacc agaggtctac gccatcatgc 780 ggggctgctg gcagcgggag ccccagcaac gccacagcat caaggatgtg cacgcccggc 840 tgcaagccct ggcccaggca cctcctgtct acctggatgt cctgggctag 890 <210> 13 <211> 17 <212> DNA <213> NTRK1 3 'terminal cleavage site seq <400> 13 gtgttcacca catcaag 17 <210> 14 <211> 796 <212> PRT <213> NTRK1 full length <400> 14 Met Leu Arg Gly Gly Arg Arg Gly Gln Leu Gly Trp His Ser Trp Ala 1 5 10 15 Ala Gly Pro Gly Ser Leu Ala Trp Leu Ile Leu Ala Ser Ala Gly 20 25 30 Ala Ala Pro Cys Pro Asp Ala Cys Cys Pro His Gly Ser Ser Gly Leu 35 40 45 Arg Cys Thr Arg Asp Gly Ala Leu Asp Ser Leu His His Leu Pro Gly 50 55 60 Ala Glu Asn Leu Thr Glu Leu Tyr Ile Glu Asn Gln Gln His Leu Gln 65 70 75 80 His Leu Glu Leu Arg Asp Leu Arg Gly Leu Gly Glu Leu Arg Asn Leu 85 90 95 Thr Ile Val Lys Ser Gly Leu Arg Phe Val Ala Pro Asp Ala Phe His 100 105 110 Phe Thr Pro Arg Leu Ser Arg Leu Asn Leu Ser Phe Asn Ala Leu Glu 115 120 125 Ser Leu Ser Trp Lys Thr Val Gln Gly Leu Ser Leu Gln Glu Leu Val 130 135 140 Leu Ser Gly Asn Pro Leu His Cys Ser Cys Ala Leu Arg Trp Leu Gln 145 150 155 160 Arg Trp Glu Glu Glu Gly Leu Gly Gly Val Pro Glu Gln Lys Leu Gln 165 170 175 Cys His Gly Gln Gly Pro Leu Ala His Met Pro Asn Ala Ser Cys Gly 180 185 190 Val Pro Thr Leu Lys Val Gln Val Pro Asn Ala Ser Val Asp Val Gly 195 200 205 Asp Val Leu Leu Arg Cys Gln Val Glu Gly Arg Gly Leu Glu Gln 210 215 220 Ala Gly Trp Ile Leu Thr Glu Leu Glu Gln Ser Ala Thr Val Met Lys 225 230 235 240 Ser Gly Gly Leu Pro Ser Leu Gly Leu Thr Leu Ala Asn Val Thr Ser 245 250 255 Asp Leu Asn Arg Lys Asn Val Thr Cys Trp Ala Glu Asn Asp Val Gly 260 265 270 Arg Ala Glu Val Ser Val Glu Val Asn Val Ser Phe Pro Ala Ser Val 275 280 285 Gln Leu His Thr Ala Val Glu Met His His Trp Cys Ile Pro Phe Ser 290 295 300 Val Asp Gly Gln Pro Ala Pro Ser Leu Arg Trp Leu Phe Asn Gly Ser 305 310 315 320 Val Leu Asn Glu Thr Ser Phe Ile Phe Thr Glu Phe Leu Glu Pro Ala 325 330 335 Ala Asn Glu Thr Val Arg His Gly Cys Leu Arg Leu Asn Gln Pro Thr 340 345 350 His Val Asn Asn Gly Asn Tyr Thr Leu Leu Ala Ala Asn Pro Phe Gly 355 360 365 Gln Ala Ser Ala Ser Ale Ala Phe Met Asp Asn Pro Phe Glu 370 375 380 Phe Asn Pro Glu Asp Pro Ile Pro Val Ser Phe Ser Pro Val Asp Thr 385 390 395 400 Asn Ser Thr Ser Gly Asp Pro Val Glu Lys Lys Asp Glu Thr Pro Phe 405 410 415 Gly Val Val Ala Val Gly Leu Ala Val Phe Ala Cys Leu Phe Leu 420 425 430 Ser Thr Leu Leu Leu Val Leu Asn Lys Cys Gly Arg Arg Asn Lys Phe 435 440 445 Gly Ile Asn Arg Pro Ala Val Leu Ala Pro Glu Asp Gly Leu Ala Met 450 455 460 Ser Leu His Phe Met Thr Leu Gly Gly Ser Ser Leu Ser Pro Thr Glu 465 470 475 480 Gly Lys Gly Ser Gly Leu Gln Gly His Ile Ile Glu Asn Pro Gln Tyr 485 490 495 Phe Ser Asp Ala Cys Val His His Ile Lys Arg Arg Asp Ile Val Leu 500 505 510 Lys Trp Glu Leu Gly Glu Gly Ala Phe Gly Lys Val Phe Leu Ala Glu 515 520 525 Cys His Asn Leu Leu Pro Glu Gln Asp Lys Met Leu Val Ala Val Lys 530 535 540 Ala Leu Lys Glu Ala Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg Glu 545 550 555 560 Ala Glu Leu Leu Thr Met Leu Gln His Gln His Ile Val Arg Phe Phe 565 570 575 Gly Val Cys Thr Glu Gly Arg Pro Leu Leu Met Val Phe Glu Tyr Met 580 585 590 Arg His Gly Asp Leu Asn Arg Phe Leu Arg Ser His Gly Pro Asp Ala 595 600 605 Lys Leu Leu Ala Gly Gly Glu Asp Val Ala Pro Gly Pro Leu Gly Leu 610 615 620 Gly Gln Leu Leu Ala Val Ala Ser Gln Val Ala Gly Met Val Tyr 625 630 635 640 Leu Ala Gly Leu His Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys 645 650 655 Leu Val Gly Gln Gly Leu Val Val Lys Ile Gly Asp Phe Gly Met Ser 660 665 670 Arg Asp Ile Tyr Ser Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr Met 675 680 685 Leu Pro Ile Arg Trp Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys Phe 690 695 700 Thr Thr Glu Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu Ile 705 710 715 720 Phe Thr Tyr Gly Lys Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu Ala 725 730 735 Ile Asp Cys Ile Thr Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala Cys 740 745 750 Pro Pro Glu Val Tyr Ala Ile Met Arg Gly Cys Trp Gln Arg Glu Pro 755 760 765 Gln Gln Arg His Ser Ile Lys Asp Val His Ala Arg Leu Gln Ala Leu 770 775 780 Ala Gln Ala Pro Pro Val Tyr Leu Asp Val Leu Gly 785 790 795 <210> 15 <211> 397 <212> PRT <213> NTRK1 fragment <400> 15 Thr Asn Ser Thr Ser Gly Asp Pro Val Glu Lys Lys Asp Glu Thr Pro 1 5 10 15 Phe Gly Val Ser Val Ala Val Gly Leu Ala Val Phe Ala Cys Leu Phe 20 25 30 Leu Ser Thr Leu Leu Leu Val Leu Asn Lys Cys Gly Arg Arg Asn Lys 35 40 45 Phe Gly Ile Asn Arg Pro Ala Val Leu Ala Pro Glu Asp Gly Leu Ala 50 55 60 Met Ser Leu His Phe Met Thr Leu Gly Gly Ser Ser Leu Ser Pro Thr 65 70 75 80 Glu Gly Lys Gly Ser Gly Leu Gln Gly His Ile Ile Glu Asn Pro Gln 85 90 95 Tyr Phe Ser Asp Ala Cys Val His His Ile Lys Arg Arg Asp Ile Val 100 105 110 Leu Lys Trp Glu Leu Gly Glu Gly Ala Phe Gly Lys Val Phe Leu Ala 115 120 125 Glu Cys His Asn Leu Leu Pro Glu Gln Asp Lys Met Leu Val Ala Val 130 135 140 Lys Ala Leu Lys Glu Ala Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg 145 150 155 160 Glu Ala Glu Leu Leu Thr Met Leu Gln His Gln His Ile Val Arg Phe 165 170 175 Phe Gly Val Cys Thr Glu Gly Arg Pro Leu Leu Met Val Phe Glu Tyr 180 185 190 Met Arg His Gly Asp Leu Asn Arg Phe Leu Arg Ser His Gly Pro Asp 195 200 205 Ala Lys Leu Leu Ala Gly Gly Glu Asp Val Ala Pro Gly Pro Leu Gly 210 215 220 Leu Gly Gln Leu Leu Ala Val Ala Ser Gln Val Ala Ala Gly Met Val 225 230 235 240 Tyr Leu Ala Gly Leu His Phe Val His Arg Asp Leu Ala Thr Arg Asn 245 250 255 Cys Leu Val Gly Gln Gly Leu Val Val Lys Ile Gly Asp Phe Gly Met 260 265 270 Ser Arg Asp Ile Tyr Ser Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr 275 280 285 Met Leu Pro Ile Arg Trp Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys 290 295 300 Phe Thr Thr Glu Ser Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu 305 310 315 320 Ile Phe Thr Tyr Gly Lys Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu 325 330 335 Ala Ile Asp Cys Ile Thr Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala 340 345 350 Cys Pro Pro Glu Val Tyr Ala Ile Met Arg Gly Cys Trp Gln Arg Glu 355 360 365 Pro Gln Gln Arg His Ser Ile Lys Asp Val His Ala Arg Leu Gln Ala 370 375 380 Leu Ala Gln Ala Pro Pro Val Tyr Leu Asp Val Leu Gly 385 390 395 <210> 16 <211> 344 <212> PRT <213> NTRK1 fragment <400> 16 Pro Ala Val Leu Ala Pro Glu Asp Gly Leu Ala Met Ser Leu His Phe 1 5 10 15 Met Thr Leu Gly Gly Ser Ser Leu Ser Pro Thr Glu Gly Lys Gly Ser 20 25 30 Gly Leu Gln Gly His Ile Ile Glu Asn Pro Gln Tyr Phe Ser Asp Ala 35 40 45 Cys Val His His Ile Lys Arg Arg Asp Ile Val Leu Lys Trp Glu Leu 50 55 60 Gly Glu Gly Ala Phe Gly Lys Val Phe Leu Ala Glu Cys His Asn Leu 65 70 75 80 Leu Pro Glu Gln Asp Lys Met Leu Val Ala Val Lys Ala Leu Lys Glu 85 90 95 Ala Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg Glu Ala Glu Leu Leu 100 105 110 Thr Met Leu Gln His Gln His Ile Val Arg Phe Phe Gly Val Cys Thr 115 120 125 Glu Gly Arg Pro Leu Leu Met Val Phe Glu Tyr Met Arg His Gly Asp 130 135 140 Leu Asn Arg Phe Leu Arg Ser His Gly Pro Asp Ala Lys Leu Leu Ala 145 150 155 160 Gly Gly Glu Asp Val Ala Pro Gly Pro Leu Gly Leu Gly Gln Leu Leu 165 170 175 Ala Val Ala Ser Gln Val Ala Ala Gly Met Val Tyr Leu Ala Gly Leu 180 185 190 His Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Gln 195 200 205 Gly Leu Val Val Lys Ile Gly Asp Phe Gly Met Ser Ser Asp Ile Tyr 210 215 220 Ser Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr Met Leu Pro Ile Arg 225 230 235 240 Trp Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys Phe Thr Thr Glu Ser 245 250 255 Asp Val Trp Ser Phe Gly Val Val Leu Trp Glu Ile Phe Thr Tyr Gly 260 265 270 Lys Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu Ala Ile Asp Cys Ile 275 280 285 Thr Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala Cys Pro Pro Glu Val 290 295 300 Tyr Ala Ile Met Arg Gly Cys Trp Gln Arg Glu Pro Gln Gln Arg His 305 310 315 320 Ser Ile Lys Asp Val His Ala Arg Leu Gln Ala Leu Ala Gln Ala Pro 325 330 335 Pro Val Tyr Leu Asp Val Leu Gly 340 <210> 17 <211> 5 <212> PRT <213> NTRK1 C terminus cleavage site aa seq <400> 17 Pro Ala Val Leu Ala 1 5 <210> 18 <211> 295 <212> PRT <213> NTRK1 fragment site <400> 18 Val His His Ile Lys Arg Arg Asp Ile Val Leu Lys Trp Glu Leu Gly 1 5 10 15 Glu Gly Ala Phe Gly Lys Val Phe Leu Ala Glu Cys His Asn Leu Leu 20 25 30 Pro Glu Gln Asp Lys Met Leu Val Ala Val Lys Ala Leu Lys Glu Ala 35 40 45 Ser Glu Ser Ala Arg Gln Asp Phe Gln Arg Glu Ala Glu Leu Leu Thr 50 55 60 Met Leu Gln His Gln His Ile Val Arg Phe Phe Gly Val Cys Thr Glu 65 70 75 80 Gly Arg Pro Leu Leu Met Val Phe Glu Tyr Met Arg His Gly Asp Leu 85 90 95 Asn Arg Phe Leu Arg Ser His Gly Pro Asp Ala Lys Leu Leu Ala Gly 100 105 110 Gly Asp Val Ala Pro Gly Pro Leu Gly Leu Gly Gln Leu Leu Ala 115 120 125 Val Ala Ser Gln Val Ala Ala Gly Met Val Tyr Leu Ala Gly Leu His 130 135 140 Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Gln Gly 145 150 155 160 Leu Val Val Lys Ile Gly Asp Phe Gly Met Ser Ser Asp Ile Tyr Ser 165 170 175 Thr Asp Tyr Tyr Arg Val Gly Gly Arg Thr Met Leu Pro Ile Arg Trp 180 185 190 Met Pro Pro Glu Ser Ile Leu Tyr Arg Lys Phe Thr Thr Glu Ser Asp 195 200 205 Val Trp Ser Phe Gly Val Val Leu Trp Glu Ile Phe Thr Tyr Gly Lys 210 215 220 Gln Pro Trp Tyr Gln Leu Ser Asn Thr Glu Ala Ile Asp Cys Ile Thr 225 230 235 240 Gln Gly Arg Glu Leu Glu Arg Pro Arg Ala Cys Pro Pro Glu Val Tyr 245 250 255 Ala Ile Met Arg Gly Cys Trp Gln Arg Glu Pro Gln Gln Arg His Ser 260 265 270 Ile Lys Asp Val His Ala Arg Leu Gln Ala Leu Ala Gln Ala Pro Pro 275 280 285 Val Tyr Leu Asp Val Leu Gly 290 295 <210> 19 <211> 5 <212> PRT <213> NTRK1 C terminus cleavage site seq <400> 19 Val His His Ile Lys 1 5 <210> 20 <211> 5 <212> PRT <213> NTRK1 C terminus cleavage site seq <400> 20 Val His His Ile Lys 1 5 <210> 21 <211> 33 <212> DNA <213> TPM3-NTRK1 fusion site seq <400> 21 attgatgacc tggaagacac taacagcaca tct 33 <210> 22 <211> 33 <212> DNA <213> TPM3-NTRK1 fusion site seq <400> 22 attgatgacc tggaaggccc ggctgtgctg gct 33 <210> 23 <211> 33 <212> DNA <213> TPM3-NTRK1 fusion site seq <400> 23 attgatgacc tggaaggtgt tcaccacatc aag 33 <210> 24 <211> 11 <212> PRT <213> TPM3-NTRK1 fusion site aa seq <400> 24 Ile Asp Asp Leu Glu Asp Thr Asn Ser Thr Ser 1 5 10 <210> 25 <211> 11 <212> PRT <213> TPM3-NTRK1 fusion site aa seq <400> 25 Ile Asp Asp Leu Glu Gly Pro Ala Val Leu Ala 1 5 10 <210> 26 <211> 11 <212> PRT <213> TPM3-NTRK1 fusion site aa seq <400> 26 Ile Asp Asp Leu Glu Gly Val His His Ile Lys 1 5 10 <210> 27 <211> 1995 <212> DNA <213> LMNA CDS <400> 27 atggagaccc cgtcccagcg gcgcgccacc cgcagcgggg cgcaggccag ctccactccg 60 ctgtcgccca cccgcatcac ccggctgcag gagaaggagg acctgcagga gctcaatgat 120 cgcttggcgg tctacatcga ccgtgtgcgc tcgctggaaa cggagaacgc agggctgcgc 180 cttcgcatca ccgagtctga agaggtggtc agccgcgagg tgtccggcat caaggccgcc 240 tacgaggccg agctcgggga tgcccgcaag acccttgact cagtagccaa ggagcgcgcc 300 cgcctgcagc tggagctgag caaagtgcgt gaggagttta aggagctgaa agcgcgcaat 360 accaagaagg agggtgacct gatagctgct caggctcggc tgaaggacct ggaggctctg 420 ctgaactcca aggaggccgc actgagcact gctctcagtg agaagcgcac gctggagggc 480 gagctgcatg atctgcgggg ccaggtggcc aagcttgagg cagccctagg tgaggccaag 540 aagcaacttc aggatgagat gctgcggcgg gtggatgctg agaacaggct gcagaccatg 600 aaggaggaac tggacttcca gaagaacatc tacagtgagg agctgcgtga gaccaagcgc 660 cgtcatgaga cccgactggt ggagattgac aatgggaagc agcgtgagtt tgagagccgg 720 ctggcggatg cgctgcagga actgcgggcc cagcatgagg accaggtgga gcagtataag 780 aaggagctgg agaagactta ttctgccaag ctggacaatg ccaggcagtc tgctgagagg 840 aacagcaacc tggtgggggc tgcccacgag gagctgcagc agtcgcgcat ccgcatcgac 900 agcctctctg cccagctcag ccagctccag aagcagctgg cagccaagga ggcgaagctt 960 cgagacctgg aggactcact ggcccgtgag cgggacacca gccggcggct gctggcggaa 1020 aaggagcggg agatggccga gatgcgggca aggatgcagc agcagctgga cgagtaccag 1080 gagcttctgg acatcaagct ggccctggac atggagatcc acgcctaccg caagctcttg 1140 gagggcgagg aggagaggct acgcctgtcc cccagcccta cctcgcagcg cagccgtggc 1200 cgtgcttcct ctcactcatc ccagacacag ggtgggggca gcgtcaccaa aaagcgcaaa 1260 ctggagtcca ctgagagccg cagcagcttc tcacagcacg cacgcactag cgggcgcgtg 1320 gccgtggagg aggtggatga ggagggcaag tttgtccggc tgcgcaacaa gtccaatgag 1380 gaccagtcca tgggcaattg gcagatcaag cgccagaatg gagatgatcc cttgctgact 1440 taccggttcc caccaaagtt caccctgaag gctgggcagg tggtgacgat ctgggctgca 1500 ggagctgggg ccacccacag cccccctacc gacctggtgt ggaaggcaca gaacacctgg 1560 ggctgcggga acagcctgcg tacggctctc atcaactcca ctggggaaga agtggccatg 1620 cgcaagctgg tgcgctcagt gactgtggtt gaggacgacg aggatgagga tggagatgac 1680 ctgctccatc accaccacgg ctcccactgc agcagctcgg gggaccccgc tgagtacaac 1740 ctgcgctcgc gcaccgtgct gtgcgggacc tgcgggcagc ctgccgacaa ggcatctgcc 1800 agcggctcag gagcccaggt gggcggaccc atctcctctg gctcttctgc ctccagtgtc 1860 acggtcactc gcagctaccg cagtgtgggg ggcagtgggg gtggcagctt cggggacaat 1920 ctggtcaccc gctcctacct cctgggcaac tccagccccc gaacccagag cccccagaac 1980 tgcagcatca tgtaa 1995 <210> 28 <211> 985 <212> DNA <213> LMNA fragment cDNA <400> 28 atggagaccc cgtcccagcg gcgcgccacc cgcagcgggg cgcaggccag ctccactccg 60 ctgtcgccca cccgcatcac ccggctgcag gagaaggagg acctgcagga gctcaatgat 120 cgcttggcgg tctacatcga ccgtgtgcgc tcgctggaaa cggagaacgc agggctgcgc 180 cttcgcatca ccgagtctga agaggtggtc agccgcgagg tgtccggcat caaggccgcc 240 tacgaggccg agctcgggga tgcccgcaag acccttgact cagtagccaa ggagcgcgcc 300 cgcctgcagc tggagctgag caaagtgcgt gaggagttta aggagctgaa agcgcgcaat 360 accaagaagg agggtgacct gatagctgct caggctcggc tgaaggacct ggaggctctg 420 ctgaactcca aggaggccgc actgagcact gctctcagtg agaagcgcac gctggagggc 480 gagctgcatg atctgcgggg ccaggtggcc aagcttgagg cagccctagg tgaggccaag 540 aagcaacttc aggatgagat gctgcggcgg gtggatgctg agaacaggct gcagaccatg 600 aaggaggaac tggacttcca gaagaacatc tacagtgagg agctgcgtga gaccaagcgc 660 cgtcatgaga cccgactggt ggagattgac aatgggaagc agcgtgagtt tgagagccgg 720 ctggcggatg cgctgcagga actgcgggcc cagcatgagg accaggtgga gcagtataag 780 aaggagctgg agaagactta ttctgccaag ctggacaatg ccaggcagtc tgctgagagg 840 aacagcaacc tggtgggggc tgcccacgag gagctgcagc agtcgcgcat ccgcatcgac 900 agcctctctg cccagctcag ccagctccag aagcagctgg cagccaagga ggcgaagctt 960 cgagacctgg aggactcact ggccc 985 <210> 29 <211> 16 <212> DNA <213> LMNA 5 'terminal cleavage site seq <400> 29 gaggactcac tggccc 16 <210> 30 <211> 664 <212> PRT <213> LMNA full length <400> 30 Met Glu Thr Pro Ser Gln Arg Arg Ala Thr Arg Ser Gly Ala Gln Ala 1 5 10 15 Ser Ser Thr Pro Leu Ser Pro Thr Arg Ile Thr Arg Leu Gln Glu Lys 20 25 30 Glu Asp Leu Gln Glu Leu Asn Asp Arg Leu Ala Val Tyr Ile Asp Arg 35 40 45 Val Arg Ser Leu Glu Thr Glu Asn Ala Gly Leu Arg Leu Arg Ile Thr 50 55 60 Glu Ser Glu Glu Val Ser Ser Glu Val Ser Gly Ile Lys Ala Ala 65 70 75 80 Tyr Glu Ala Glu Leu Gly Asp Ala Arg Lys Thr Leu Asp Ser Val Ala 85 90 95 Lys Glu Arg Ala Arg Leu Gln Leu Glu Leu Ser Lys Val Arg Glu Glu 100 105 110 Phe Lys Glu Leu Lys Ala Arg Asn Thr Lys Lys Glu Gly Asp Leu Ile 115 120 125 Ala Ala Gln Ala Arg Leu Lys Asp Leu Glu Ala Leu Leu Asn Ser Lys 130 135 140 Glu Ala Ala Leu Ser Thr Ala Leu Ser Glu Lys Arg Thr Leu Glu Gly 145 150 155 160 Glu Leu His Asp Leu Arg Gly Gln Val Ala Lys Leu Glu Ala Ala Leu 165 170 175 Gly Glu Ala Lys Lys Gln Leu Gln Asp Glu Met Leu Arg Arg Val Asp 180 185 190 Ala Glu Asn Arg Leu Gln Thr Met Lys Glu Glu Leu Asp Phe Gln Lys 195 200 205 Asn Ile Tyr Ser Glu Glu Leu Arg Glu Thr Lys Arg Arg His Glu Thr 210 215 220 Arg Leu Val Glu Ile Asp Asn Gly Lys Gln Arg Glu Phe Glu Ser Arg 225 230 235 240 Leu Ala Asp Ala Leu Gln Glu Leu Arg Ala Gln His Glu Asp Gln Val 245 250 255 Glu Gln Tyr Lys Lys Glu Leu Glu Lys Thr Tyr Ser Ala Lys Leu Asp 260 265 270 Asn Ala Arg Gln Ser Ala Glu Arg Asn Ser Asn Leu Val Gly Ala Ala 275 280 285 His Glu Glu Leu Gln Gln Ser Arg Ile Arg Ile Asp Ser Leu Ser Ala 290 295 300 Gln Leu Ser Gln Leu Gln Lys Gln Leu Ala Ala Lys Glu Ala Lys Leu 305 310 315 320 Arg Asp Leu Glu Asp Ser Leu Ala Arg Glu Arg Asp Thr Ser Arg Arg 325 330 335 Leu Leu Ala Glu Lys Glu Arg Glu Met Ala Glu Met Arg Ala Arg Met 340 345 350 Gln Gln Gln Leu Asp Glu Tyr Gln Glu Leu Leu Asp Ile Lys Leu Ala 355 360 365 Leu Asp Met Glu Ile His Ala Tyr Arg Lys Leu Leu Glu Gly Glu Glu 370 375 380 Glu Arg Leu Arg Leu Ser Pro Ser Ser Thr Ser Gln Arg Ser Ser Arg Gly 385 390 395 400 Arg Ala Ser Ser Ser Ser Gln Thr Gln Gly Gly Gly Ser Val Thr 405 410 415 Lys Lys Arg Lys Leu Glu Ser Thr Glu Ser Arg Ser Ser Phe Ser Gln 420 425 430 His Ala Arg Thr Ser Gly Arg Val Ala Val Glu Glu Val Asp Glu Glu 435 440 445 Gly Lys Phe Val Arg Leu Arg Asn Lys Ser Asn Glu Asp Gln Ser Met 450 455 460 Gly Asn Trp Gln Ile Lys Arg Gln Asn Gly Asp Asp Pro Leu Leu Thr 465 470 475 480 Tyr Arg Phe Pro Pro Lys Phe Thr Leu Lys Ala Gly Gln Val Val Thr 485 490 495 Ile Trp Ala Ala Gly Ala Gly Ala Thr His Ser Pro Thr Asp Leu 500 505 510 Val Trp Lys Ala Gln Asn Thr Trp Gly Cys Gly Asn Ser Leu Arg Thr 515 520 525 Ala Leu Ile Asn Ser Thr Gly Glu Glu Val Ala Met Arg Lys Leu Val 530 535 540 Arg Ser Val Thr Val Val Glu Asp Asp Glu Asp Glu Asp Gly Asp Asp 545 550 555 560 Leu Leu His His His His Gly Ser His Cys Ser Ser Ser Gly Asp Pro 565 570 575 Ala Glu Tyr Asn Leu Arg Ser Ser Thr Val Leu Cys Gly Thr Cys Gly 580 585 590 Gln Pro Ala Asp Lys Ala Ser Ala Ser Gly Ser Gly Ala Gln Val Gly 595 600 605 Gly Pro Ile Ser Ser Gly Ser Ser Ala Ser Ser Val Thr Val Thr Arg 610 615 620 Ser Tyr Arg Ser Val Gly Gly Ser Gly Gly Gly Ser Phe Gly Asp Asn 625 630 635 640 Leu Val Thr Arg Ser Tyr Leu Leu Gly Asn Ser Ser Pro Arg Thr Gln 645 650 655 Ser Pro Gln Asn Cys Ser Ile Met 660 <210> 31 <211> 328 <212> PRT <213> LMNA fragment <400> 31 Met Glu Thr Pro Ser Gln Arg Arg Ala Thr Arg Ser Gly Ala Gln Ala 1 5 10 15 Ser Ser Thr Pro Leu Ser Pro Thr Arg Ile Thr Arg Leu Gln Glu Lys 20 25 30 Glu Asp Leu Gln Glu Leu Asn Asp Arg Leu Ala Val Tyr Ile Asp Arg 35 40 45 Val Arg Ser Leu Glu Thr Glu Asn Ala Gly Leu Arg Leu Arg Ile Thr 50 55 60 Glu Ser Glu Glu Val Ser Ser Glu Val Ser Gly Ile Lys Ala Ala 65 70 75 80 Tyr Glu Ala Glu Leu Gly Asp Ala Arg Lys Thr Leu Asp Ser Val Ala 85 90 95 Lys Glu Arg Ala Arg Leu Gln Leu Glu Leu Ser Lys Val Arg Glu Glu 100 105 110 Phe Lys Glu Leu Lys Ala Arg Asn Thr Lys Lys Glu Gly Asp Leu Ile 115 120 125 Ala Ala Gln Ala Arg Leu Lys Asp Leu Glu Ala Leu Leu Asn Ser Lys 130 135 140 Glu Ala Ala Leu Ser Thr Ala Leu Ser Glu Lys Arg Thr Leu Glu Gly 145 150 155 160 Glu Leu His Asp Leu Arg Gly Gln Val Ala Lys Leu Glu Ala Ala Leu 165 170 175 Gly Glu Ala Lys Lys Gln Leu Gln Asp Glu Met Leu Arg Arg Val Asp 180 185 190 Ala Glu Asn Arg Leu Gln Thr Met Lys Glu Glu Leu Asp Phe Gln Lys 195 200 205 Asn Ile Tyr Ser Glu Glu Leu Arg Glu Thr Lys Arg Arg His Glu Thr 210 215 220 Arg Leu Val Glu Ile Asp Asn Gly Lys Gln Arg Glu Phe Glu Ser Arg 225 230 235 240 Leu Ala Asp Ala Leu Gln Glu Leu Arg Ala Gln His Glu Asp Gln Val 245 250 255 Glu Gln Tyr Lys Lys Glu Leu Glu Lys Thr Tyr Ser Ala Lys Leu Asp 260 265 270 Asn Ala Arg Gln Ser Ala Glu Arg Asn Ser Asn Leu Val Gly Ala Ala 275 280 285 His Glu Glu Leu Gln Gln Ser Arg Ile Arg Ile Asp Ser Leu Ser Ala 290 295 300 Gln Leu Ser Gln Leu Gln Lys Gln Leu Ala Ala Lys Glu Ala Lys Leu 305 310 315 320 Arg Asp Leu Glu Asp Ser Leu Ala 325 <210> 32 <211> 5 <212> PRT <213> LMNA N terminus cleavage site seq <400> 32 Glu Asp Ser Leu Ala 1 5 <210> 33 <211> 33 <212> DNA <213> LMNA-NTRK1 fusion site seq <400> 33 gaggactcac tggcccgccc ggctgtgctg gct 33 <210> 34 <211> 11 <212> PRT <213> LMNA-NTRK1 fusion site aa seq <400> 34 Glu Asp Ser Leu Ala Gly Pro Ala Val Leu Ala 1 5 10 <210> 35 <211> 18 <212> DNA <213> Artificial Sequence <220> <223> TPM3-NTRK1 Forward Primer Sequence <400> 35 aagaagataa atatgagg 18 <210> 36 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> TPM3-NTRK1 Reverse Primer Sequence <400> 36 ccgtgccgca tatactcaaa 20 <210> 37 <211> 22 <212> DNA <213> Artificial Sequence <220> <223> LMNA-NTRK1 Forward Primer Sequence <400> 37 ccaggtggag cagtataaga ag 22 <210> 38 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> LMNA-NTRK1 Reverse Primer Sequence <400> 38 tgtgggttct cgatgatgtg 20

Claims

A fusion protein comprising the following structural formula:
A fragment containing the N-terminus of N- [LMNA (Lamin A)] - [a fragment containing the C-terminus of NTRK1 (Neurotrophic Tyrosine Kinase, Receptor, Type 1]
A fragment containing the N-terminus of N- [TPM3 (Tropomyosin 3)] - [a fragment containing the C-terminus of NTRK1] -C.

The fusion protein according to claim 1, wherein the LMNA, TPM3 or NTRK1 is derived from human.

3. The fusion protein according to claim 2, wherein the human-derived LMNA comprises the amino acid sequence of SEQ ID NO: 27.

2. The fusion protein according to claim 1, wherein the fragment comprising the N-terminus of the LMNA comprises a first to 328th amino acid from the N-terminus in the human-derived LMNA protein.

2. The fusion protein of claim 1, wherein the fragment comprising the N-terminus of the LMNA comprises the amino acid sequence of SEQ ID NO: 31.

3. The fusion protein according to claim 2, wherein the human-derived TPM3 comprises the amino acid sequence of SEQ ID NO: 4.

The fusion protein according to claim 1, wherein the fragment comprising the N-terminus of TPM3 comprises amino acids 1 to 258 from the N-terminus in the human-derived TPM3 protein.

The fusion protein of claim 1, wherein the fragment comprising the N-terminus of TPM3 is comprised of the amino acid sequence of SEQ ID NO: 5.

3. The fusion protein of claim 2, wherein the human-derived NTRK1 comprises the amino acid sequence of SEQ ID NO: 7.

The fragment of claim 1, wherein the fragment comprising the C-terminus of NTRK1 comprises the amino acid sequence from the N-terminus to the N-terminus of the human NTRK1 protein, from amino acid 453 to amino acid 796 or amino acid from position 502 to position 796 Phosphorylated fusion protein.

The fusion protein of claim 1, wherein the fragment comprising the C-terminus of NTRK1 is comprised of the amino acid sequence of SEQ ID NO: 15, SEQ ID NO: 17 or SEQ ID NO: 19.

A polynucleotide encoding the fusion protein of claim 1.

A fusion gene polynucleotide consisting of the following structural formula:
5 '- [fragment containing the 5' end of the LMNA gene] - [fragment containing the 3 'end of the NTRK1 gene] -3'
5 '- [Fragment containing the 5' end of TPM3] - [Fragment containing the 3 'end of the NTRK1 gene] -3'.

14. The TRK1 fusion gene polynucleotide according to claim 13, wherein the LMNA gene is a nucleic acid sequence of SEQ ID NO: 28.

14. The TRK1 fusion gene polynucleotide according to claim 13, wherein the TPM3 gene is composed of the nucleotide sequence of SEQ ID NO: 2.

14. The TRK1 fusion gene polynucleotide according to claim 13, wherein the NTRK1 gene is composed of the nucleotide sequence of SEQ ID NO: 8, SEQ ID NO: 10 or SEQ ID NO: 13.

13. A colon cancer diagnostic composition comprising the NTRK1 fusion protein according to claim 1, the fusion gene according to claim 13, or an agent for measuring the transcription product of said fusion gene.

18. The composition for diagnosing colorectal cancer according to claim 17, wherein the transcription product of the fusion gene is mRNA.

19. The composition for diagnosing colorectal cancer according to claim 18, wherein the agent for measuring the mRNA of the fusion gene comprises a primer that specifically binds to the mRNA of the fusion gene.

[Claim 21] The method according to claim 19, wherein the primer specifically binding to the mRNA of the fusion gene is a primer designed so that a region amplified by the primer includes a fusion point of two genes to be fused, A composition for cancer diagnosis.

21. The method according to claim 20, wherein when the two genes to be fused are the TPM3 gene and the NTRK1 gene, a primer specific to the nucleotide sequence encoding the N-terminal of the TPM3 protein and a primer specific to the nucleotide sequence encoding the C- Wherein the composition is a primer pair.

The composition for diagnosing colorectal cancer according to claim 21, wherein the primer specifically binding to the mRNA of the fusion gene is a primer pair consisting of the nucleotide sequence of SEQ ID NO: 35 and SEQ ID NO: 36.

The method according to claim 19, wherein when the two genes to be fused are the LMNA gene and the NTRK1 gene, a primer specific to the nucleotide sequence encoding the N-terminal of the LMNA protein and a primer specific to the nucleotide sequence encoding the C- Wherein the composition is a primer pair.

24. The composition for diagnosing colorectal cancer according to claim 23, wherein the primer that specifically binds to the mRNA of the fusion gene is a primer pair consisting of the nucleotide sequence of SEQ ID NO: 37 and SEQ ID NO: 38.

18. The composition for diagnosing colorectal cancer according to claim 17, wherein the agent for measuring the fusion gene comprises a probe or a primer that specifically binds to the fusion gene.

18. The composition for diagnosing colorectal cancer according to claim 17, wherein the agent for measuring the NTRK1 fusion protein comprises an antibody specific to the protein.

27. The composition for diagnosing colorectal cancer according to claim 26, wherein the antibody specific to the NTRK1 fusion protein binds to the C terminus of the NTRK1 protein.

17. A colon cancer diagnostic kit comprising the composition of claim 17.

29. The diagnostic kit for colon cancer according to claim 28, which is an RT-PCR kit, DNA chip kit or protein chip kit.

Measuring the level of the NTRK1 fusion gene, its transcription product or its protein from a separate sample of individuals suspected of having colon cancer; And comparing the measured expression level of the fusion gene, its transcription product or its protein with a fusion gene of a normal control sample, a transcription product thereof or a protein expression level thereof.

31. The method of claim 30, wherein the NTRK1 fusion gene comprises a polynucleotide that constitutes the 5 ' end of the LMNA gene or the TPM3 gene; And a polynucleotide that constitutes the 3 ' end of the NTRK1 gene.

31. The method of claim 30, wherein the level of the fusion gene, its transcription product, or protein expression thereof measured from a separate sample of a subject suspected of having colorectal cancer is higher than the level of the fusion gene, transcription product thereof or protein expression thereof of a normal control sample, A method for providing information necessary for diagnosis of colorectal cancer, which comprises diagnosing colon cancer.

Measuring the level of the NTRK1 fusion gene, its transcription product or its protein from a separate sample of colon cancer patients; And comparing the measured expression level of the fusion gene, its transcription product, or its protein with a fusion gene of a normal control sample, a transcription product thereof or a protein expression level thereof, to provide information necessary for predicting colorectal cancer treatment prognosis How to.

34. The method of claim 33, wherein the level of expression of the fusion gene, its transcription product, or protein thereof from the isolated sample of a colon cancer patient is higher than the fusion gene of the normal control sample, the transcription product thereof or the protein expression thereof, Wherein the predicted outcome is predicted to be worse than that when the same or lower is used.

34. The method of claim 33, wherein the level of expression of the fusion gene, its transcription product, or protein thereof from a separate sample of a patient with colorectal cancer is higher than the level of the fusion gene, transcription product thereof or protein expression thereof of a normal control sample, A method for providing information necessary for predicting colorectal cancer treatment prognosis, which is predicted to be highly effective in colorectal cancer treatment.

Treating a candidate cancer cell cancer candidate cell with a cell expressing the NTRK1 fusion gene; And measuring the expression level of the NTRK1 fusion gene.

37. The screening method for a colorectal cancer therapeutic agent according to claim 36, wherein when the expression level of the NTRK1 fusion gene is lowered by treatment with a candidate substance for treatment of colorectal cancer, it is judged to be a therapeutic agent for colon cancer.

A pharmaceutical composition for the prophylaxis or treatment of colorectal cancer comprising an NTRK1 fusion protein inhibitor as an active ingredient.

The pharmaceutical composition according to claim 38, wherein the NTRK1 fusion protein inhibitor is ARRY-470, a compound represented by the following general formula (1) or a compound represented by the general formula (2).
[Chemical Formula 1]

(2)

39. The pharmaceutical composition according to claim 38, wherein the NTRK1 fusion protein inhibitor is an NTRK1 fusion gene expression inhibitor.

41. The composition of claim 40, wherein the NTRK1 fusion gene expression inhibitor is selected from the group consisting of antisense oligonucleotides, siRNA, shRNA and microRNA of the NTRK1 fusion gene.

39. The composition of claim 38, wherein the NTRK1 fusion protein inhibitor is an antibody that specifically binds to the NTRK1 fusion protein.