[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

KR101626648B1 - Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer - Google Patents

Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer Download PDF

Info

Publication number
KR101626648B1
KR101626648B1 KR1020137028000A KR20137028000A KR101626648B1 KR 101626648 B1 KR101626648 B1 KR 101626648B1 KR 1020137028000 A KR1020137028000 A KR 1020137028000A KR 20137028000 A KR20137028000 A KR 20137028000A KR 101626648 B1 KR101626648 B1 KR 101626648B1
Authority
KR
South Korea
Prior art keywords
seq
gene shown
expression product
gene
specific
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
KR1020137028000A
Other languages
Korean (ko)
Other versions
KR20140016953A (en
Inventor
순 예
페이 우
킹후아 수
팡 리우
시아 멩
브루노 모긴
Original Assignee
비오메리으
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 비오메리으 filed Critical 비오메리으
Priority claimed from PCT/CN2012/072931 external-priority patent/WO2012130103A1/en
Publication of KR20140016953A publication Critical patent/KR20140016953A/en
Application granted granted Critical
Publication of KR101626648B1 publication Critical patent/KR101626648B1/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/574Immunoassay; Biospecific binding assay; Materials therefor for cancer
    • G01N33/57407Specifically defined cancers
    • G01N33/57419Specifically defined cancers of colon
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6876Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N33/00Investigating or analysing materials by specific methods not covered by groups G01N1/00 - G01N31/00
    • G01N33/48Biological material, e.g. blood, urine; Haemocytometers
    • G01N33/50Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
    • G01N33/53Immunoassay; Biospecific binding assay; Materials therefor
    • G01N33/5308Immunoassay; Biospecific binding assay; Materials therefor for analytes not provided for elsewhere, e.g. nucleic acids, uric acid, worms, mites
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q2600/00Oligonucleotides characterized by their use
    • C12Q2600/158Expression markers

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Immunology (AREA)
  • Molecular Biology (AREA)
  • Analytical Chemistry (AREA)
  • Urology & Nephrology (AREA)
  • Hematology (AREA)
  • Biomedical Technology (AREA)
  • Biotechnology (AREA)
  • Physics & Mathematics (AREA)
  • Organic Chemistry (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Microbiology (AREA)
  • Pathology (AREA)
  • Zoology (AREA)
  • General Physics & Mathematics (AREA)
  • Medicinal Chemistry (AREA)
  • Food Science & Technology (AREA)
  • Wood Science & Technology (AREA)
  • Cell Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Biophysics (AREA)
  • Oncology (AREA)
  • Hospice & Palliative Care (AREA)
  • Genetics & Genomics (AREA)
  • General Engineering & Computer Science (AREA)
  • Tropical Medicine & Parasitology (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)

Abstract

본 발명은 시험될 개체의 유전자의 핵산의 발현 생성물의 양을, 양성 대조군을 구성하는 CRC 환자 군으로부터 수득된 동일한 유전자의 핵산의 발현 생성물의 양 및 음성 대조군을 구성하는 CNC 개체 군으로부터 수득된 동일한 유전자의 핵산의 발현 생성물의 양과 비교함으로써 개체가 결장직장암을 앓을 확률을 시험관내에서 말초혈 샘플에서 측정하는 방법; 및 상기 발현 생성물에 대한 특이적 결합 파트너를 포함하는 키트를 제공한다.The present invention provides a method for quantifying the amount of the expression product of the nucleic acid of the gene of the individual to be tested by comparing the amount of the expression product of the nucleic acid of the same gene obtained from the CRC patient group constituting the positive control and Comparing the amount of expression product of the nucleic acid of the gene to determine the probability that the individual will suffer from colorectal cancer in a peripheral blood sample in vitro; And a specific binding partner for the expression product.

Description

개체가 결장직장암을 앓을 확률을 시험관내에서 측정하는 방법 및 키트{METHOD AND KIT FOR DETERMINING IN VITRO THE PROBABILITY FOR AN INDIVIDUAL TO SUFFER FROM COLORECTAL CANCER}METHOD AND KIT FOR DETERMINING IN VITRO THE PROBABILITY FOR INDIVIDUAL TO SUFFER FROM COLORECTAL CANCER BACKGROUND OF THE INVENTION 1. Field of the Invention < RTI ID = 0.0 >

본 발명은 결장직장암의 검출, 특히 이러한 암을 앓을 확률을 측정하는 방법 및 키트에 관한 것이다.The present invention relates to the detection of colorectal cancer, and in particular to a method and kit for measuring the probability of having such cancer.

결장암 또는 대장암으로도 지칭되는 결장직장암(CRC)은 미국에서 5번째로 가장 흔한 암의 형태이고, 중국에서 4번째로 흔한 암이고, 유럽에서 암 관련 사망의 3번째 주원인이다. CRC의 초기 검출은 성공적인 치료 및 환자 생존에 대한 핵심이고 주요 공중보건 도전과제를 대표한다. 실제로, CRC는 특히 초기 단계에 진단받았을 때 종종 치유가능하다. 여러 스크리닝 기법들이 다양한 국가에서 이미 실시되고 있다. 통상적인 CRC 스크리닝 시험은 대변잠혈 시험(FOBT), S상결장경검사, 결장경검사, 이중 조영 바륨 관장 또는 디지털 직장 검사를 포함한다. 이들 전부가 장점 및 한계점을 갖고 있지만, 순응도는 주로 환자의 실행계획 또는 불편함으로 인해 기대치보다 낮은 수준에 머물러 있다.Colorectal cancer (CRC), also referred to as colon cancer or colon cancer, is the fifth most common form of cancer in the United States, the fourth most common cancer in China, and the third leading cause of cancer-related deaths in Europe. The initial detection of CRC is a key to successful treatment and patient survival and represents a major public health challenge. In fact, CRC is often healable, especially when diagnosed at an early stage. Several screening techniques have already been implemented in various countries. Typical CRC screening tests include fecal occult blood test (FOBT), sigmoidoscopy, colonoscopy, dual contrast barium enema or digital rectal examinations. While all of these have their advantages and limitations, adherence is often lower than expected due to the patient's performance plan or discomfort.

CRC의 초기 검출을 목적으로 하는 말초혈 생체마커에 대한 검색은 특히 이의 편리함 때문에 수년전부터 주목받았다. 한편, 혈액-기초 시험 실행가능성은 혈액 중의 유전자 생체마커가 CRC 환자를 대조군으로부터 구별할 수 있다는 것을 보여주는 극소수의 연구에 의해 뒷받침되었다. 이들 연구들은 미시적 입자들, 예컨대, 세포들을 유체 스트림에 현탁시키고 전자 검출 장치를 이용하여 이들을 분석함으로써 상기 입자들을 카운팅하고 조사하는 기법인 유세포분석에 기초하였다.Retrieval of peripheral blood biomarkers for the initial detection of CRC has been noted for several years because of its convenience. On the other hand, the feasibility of a blood-based test was supported by only a few studies showing that genomic biomarkers in blood can distinguish CRC patients from controls. These studies were based on flow cytometry, a technique for suspending microscopic particles, such as cells, in a fluid stream and counting and examining them by analyzing them using an electronic detector.

본 발명자들은 상이하게 발현된 유전자들이 말초혈 샘플 중의 중요한 생체마커를 대표한다는 것을 발견하였다. 그들은 고전적인 유세포분석 기법을 이용하지 않았고 전혈로부터의 유전자의 상이한 발현의 확인을 이용하였다. 전혈 중의 전사체의 분석을 통해 유전자의 발현 수준을 측정하는 것은 통상적이지 않은데, 이는 특정 정보가 RNA(총 RNA)의 복합 혼합물 중에 희석되어 있는 경우 특정 정제 단계 없이 상기 정보를 검색하는 것이 매우 어렵다는 것이 당업자에 의해 통상적으로 인정되기 때문이다. 본 방법의 장점은 또한 이 RNA 정제 단계를 피한다는 것이다.The inventors have found that differentially expressed genes represent important biomarkers in peripheral blood samples. They did not use classical flow cytometry techniques and used confirmation of different expression of genes from whole blood. It is not uncommon to measure the level of expression of a gene through analysis of transcripts in whole blood because it is very difficult to retrieve the information without specific purification steps if certain information is diluted in a complex mixture of RNA (total RNA) Since they are commonly recognized by those skilled in the art. The advantage of this method is also to avoid this RNA purification step.

따라서, 본 발명은 하기 단계들을 포함하는, 개체가 결장직장암을 앓을 확률을 시험관내에서 말초혈 샘플에서 측정하는 방법에 관한 것이다:Accordingly, the present invention relates to a method for measuring the probability of an individual having a colorectal cancer in a peripheral blood sample in vitro, comprising the steps of:

a) 서열번호 1 내지 11에서 확인된 서열들로부터 선택된 1종 이상의 핵산 서열 내지 7종 이하의 핵산 서열로부터의 1종 이상의 발현 생성물의 양을 상기 말초혈 샘플에서 측정하는 단계;a) measuring in the peripheral blood sample the amount of at least one expression product from at least one nucleic acid sequence selected from the sequences identified in SEQ ID NOS: 1-11 and up to 7 nucleic acid sequences;

b) 단계 a)에서 측정된 상기 발현 생성물의 양을, 결장직장암 환자로서 미리 진단된 개체의 군에 대한 발현 생성물의 기준 양 및 비-결장직장암 개체로서 미리 검증된 개체의 군에 대한 발현 생성물의 기준 양과 비교하는 단계; 및b) comparing the amount of the expression product measured in step a) with a reference amount of expression product for a group of individuals previously diagnosed as a colorectal cancer patient and a reference amount of expression product for a group of individuals previously verified as non- Comparing with a reference quantity; And

c) 단계 b)의 결과의 분석을 수행하는 단계로서, c) performing an analysis of the result of step b)

- 시험된 개체에 대한 결과가 결장직장암 환자로서 미리 진단된 개체의 군으로부터 수득된 결과와 유사하거나 동등한 경우, 상기 시험된 개체가 결장직장암 환자로서 분류되고, - if the result on the tested individual is similar or equivalent to the results obtained from a group of individuals previously diagnosed as a colorectal cancer patient, the tested individual is classified as a colorectal cancer patient,

- 시험된 개체에 대한 결과가 비-결장직장암 개체로서 미리 검증된 개체의 군으로부터 수득된 결과와 유사하거나 동등한 경우, 상기 시험된 개체가 비-결장직장암 개체로서 분류되는, 단계.- the tested individual is classified as a non-colorectal cancer entity if the results on the tested individual are similar or equivalent to those obtained from a group of individuals previously validated as non-colon carcinoma individuals.

발현 생성물의 양은 그의 핵산 서열에 의해 정의된 유전자의 발현 수준과 직접적으로 연관되어 있다.The amount of expression product is directly related to the expression level of the gene defined by its nucleic acid sequence.

상기 핵산들 중 1종 이상의 핵산의 발현 수준은 개체가 CRC 환자인지 아닌지를 확인하는 데에 충분한 정보이다. 그러나, 본 발명의 바람직한 실시양태에서, 단계 a)에서 서열번호 1; 서열번호 2, 서열번호 3 또는 서열번호 4; 서열번호 5 또는 서열번호 6; 서열번호 7 또는 서열번호 8; 서열번호 9; 서열번호 10; 및 서열번호 11로 구성된 군으로부터 선택된 핵산 서열로부터의 발현 생성물의 양이 측정된다.The level of expression of one or more of the nucleic acids is sufficient information to determine whether the individual is a CRC patient or not. However, in a preferred embodiment of the present invention, in step a), SEQ ID NO: 1; SEQ ID NO: 2, SEQ ID NO: 3 or SEQ ID NO: 4; SEQ ID NO: 5 or SEQ ID NO: 6; SEQ ID NO: 7 or SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; And the amount of the expression product from the nucleic acid sequence selected from the group consisting of SEQ ID NO: 11 is measured.

핵산(들)으로부터의 발현 생성물(들)의 양은 상기 발현 생성물(들)을 각각의 발현 생성물에 대해 특이적인 1종 이상의 결합 파트너와 접촉시킴으로써 측정된다.The amount of expression product (s) from the nucleic acid (s) is determined by contacting the expression product (s) with one or more binding partners specific for each expression product.

발현 생성물(들)은 RNA 전사체(들) 또는 폴리펩티드(들)를 의미한다. 따라서, 본 발명의 방법에서 1종 이상의 RNA 전사체 또는 1종 이상의 폴리펩티드의 양이 측정된다.The expression product (s) means RNA transcript (s) or polypeptide (s). Thus, in the method of the present invention, the amount of one or more RNA transcripts or one or more polypeptides is measured.

용어 RNA 전사체는 총 RNA, 즉 말초혈 샘플로부터 직접적으로 수득되거나 세포 용해 후 혈액 샘플로부터 간접적으로 수득된 코딩 또는 비-코딩 RNA를 의미하기 위한 것이다. 특히, 총 RNA는 전달 RNA(tRNA), 메신저 RNA(mRNA), 예컨대, 표적 유전자로부터 전사된 mRNA뿐만 아니라 임의의 다른 유전자로부터 전사된 mRNA 및 리보좀 RNA도 포함한다.The term RNA transcript is intended to mean total RNA, either coding or non-coding RNA obtained directly from a peripheral blood sample or obtained indirectly from a blood sample after cell lysis. In particular, total RNA also includes mRNA and ribosomal RNA transcribed from transcribed RNA (tRNA), messenger RNA (mRNA), such as mRNA transcribed from a target gene, as well as from any other gene.

예를 들면, RNA가 세포내 RNA인 경우, 상기 RNA는 시험될 개체의 세포에 함유된 핵산을 방출시키기 위해 상기 세포의 용해 단계에 의해 혈액 샘플에 존재하는 세포로부터 추출될 수 있다. 예를 들면, 하기 특허출원 공보들에 기재된 용해 방법들이 이용될 수 있다: 혼합된 자기적 및 기계적 용해에 관한 국제 특허출원 공개 제WO 00/05338호, 전기적 용해에 관한 국제 특허출원 공개 제WO 99/53304호, 및 기계적 용해에 관한 국제 특허출원 공개 제WO 99/15321호. 당업자는 다른 잘 공지된 용해 방법, 예컨대, 열 또는 삼투압 충격(shock) 또는 무질서화제(chaotropic agent), 예컨대, 구아니디늄 염을 사용한 화학적 용해를 이용할 수 있다(미국 특허 제5,234,809호). 용해 단계에서 방출된 다른 세포 성분들로부터 핵산을 분리하는 추가 단계를 제공하는 것도 가능하다. 이것은 일반적으로 핵산을 농축하는 것을 가능하게 한다.For example, if the RNA is intracellular RNA, the RNA may be extracted from cells present in the blood sample by a lysis step of the cell to release nucleic acids contained in the cells of the individual to be tested. For example, the dissolution methods described in the following patent application publications may be used: International Patent Application Publication No. WO 00/05338 for mixed magnetic and mechanical dissolution, International Patent Application Publication No. WO 99 / 53304, and International Patent Application Publication No. WO 99/15321 on mechanical dissolution. Those skilled in the art can use other well-known dissolution methods, such as chemical dissolution using heat or osmotic shock or chaotropic agents, such as guanidinium salts (US Pat. No. 5,234,809). It is also possible to provide an additional step of separating nucleic acids from other cellular components released in the lysis step. This generally makes it possible to concentrate the nucleic acid.

본 발명의 방법에서, RNA 전사체는 혼성화, 증폭 또는 서열분석에 의해 검출되고 정량될 수 있다. 특히, 검출되고 정량될 RNA 전사체를, 1종 이상의 프로브 및/또는 1종 이상의 프라이머와 상기 RNA 전사체의 혼성화를 가능하게 하는 예정된 조건 하에 상기 1종 이상의 프로브 또는 상기 1종 이상의 프라이머와 접촉시킨다. 그러나, 본 발명의 또 다른 실시양태에서, RNA 전사체의 DNA 복사체를 제조하고, 상기 DNA 복사체를 1종 이상의 프로브 및/또는 1종 이상의 프라이머와 상기 DNA 복사체의 혼성화를 가능하게 하는 예정된 조건 하에 상기 1종 이상의 프로브 또는 상기 1종 이상의 프라이머와 접촉시킴으로써 상기 DNA 복사체를 측정한다.In the method of the present invention, the RNA transcript can be detected and quantified by hybridization, amplification or sequencing. In particular, the RNA transcript to be detected and quantified is contacted with said one or more probes or said one or more primers under predetermined conditions to enable hybridization of said RNA transcript with one or more probes and / or one or more probes . However, in another embodiment of the present invention, there is provided a method for preparing a DNA transcript of an RNA transcript, which comprises contacting the DNA transcript with one or more probes and / or one or more primers under predetermined conditions, The DNA copy is measured by contacting with at least one probe or at least one primer.

보다 정확하게는, 전술된 방법에서, RNA 전사체 또는 DNA 복사체를 1종 이상의 혼성화 프로브 및 1종 이상의 프라이머, 보다 구체적으로 1종 이상의 혼성화 프로브 및 2종의 프라이머와 접촉시킨다.More precisely, in the above-described method, an RNA transcript or a DNA copy is contacted with at least one hybridization probe and at least one primer, more particularly at least one hybridization probe and two primers.

용어 "혼성화"는 적절한 조건 하에 2종의 뉴클레오티드 단편들이 안정한 특이적 수소결합으로 결합하여 이중 가닥 결합체를 형성하는 과정을 의미하기 위한 것이다. 이들 수소결합은 상보적 아데닌(A)과 티민(T)(또는 우라실(U)) 염기들 사이에 형성되거나(이것은 A-T 결합으로 지칭됨), 상보적 구아닌(G)과 사이토신(C) 염기들 사이에 형성된다(이것은 G-C 결합으로 지칭됨). 2종의 뉴클레오티드 단편들의 혼성화는 완전할 수 있다(이 경우, 상보적 뉴클레오티드 단편 또는 서열로 지칭됨)(즉, 이 혼성화 동안 수득된 이중 가닥 결합체가 A-T 결합 및 C-G 결합만을 포함한다). 이 혼성화는 부분적일 수 있다(이 경우, 충분히 상보적인 뉴클레오티드 단편 또는 서열로 지칭됨)(즉, 수득된 이중 가닥 결합체가 이중 가닥 결합체를 형성하는 것을 가능하게 하는 A-T 결합 및 C-G 결합뿐만 아니라, 상보적 염기에 결합되지 않은 염기도 포함한다). 2종의 뉴클레오티드 단편들 사이의 혼성화는 이용된 작업 조건 및 특히 엄격도에 의해 좌우된다. 엄격도는 특히 2종의 뉴클레오티드 단편들의 염기 조성의 함수로서 정의될 뿐만 아니라 상기 2종의 뉴클레오티드 단편들 사이의 불일치도에 의해서도 정의된다. 엄격도는 반응 파라미터, 예컨대, 혼성화 용액에 존재하는 이온성 종의 농도 및 유형, 변성제의 성질 및 농도 및/또는 혼성화 온도에 의해서도 좌우될 수 있다. 이들 데이터 모두가 잘 공지되어 있고, 적절한 조건은 당업자에 의해 결정될 수 있다. 일반적으로, 혼성화시키고자 하는 뉴클레오티드 단편들의 길이에 따라, 혼성화 온도는 약 0.5 M 내지 1 M 농도의 식염수 용액에서 약 20℃ 내지 70℃, 특히 35℃ 내지 65℃이다. 서열, 뉴클레오티드 단편, 올리고뉴클레오티드 또는 폴리뉴클레오티드는 천연 핵산의 정보 서열을 특징으로 하고 뉴클레오티드 단편에 혼성화할 수 있는, 포스포에스터 결합에 의해 함께 조립된 뉴클레오티드 모티프들의 시리즈이고, 상기 시리즈는 상이한 구조를 갖는 단량체를 함유할 수 있고 천연 핵산 분자로부터 수득될 수 있고/있거나 유전적 재조합 및/또는 화학적 합성에 의해 수득될 수 있다. 모티프는 구성요소가 당, 포스페이트 기 및 질소성 염기인 핵산의 천연 뉴클레오티드일 수 있는 단량체의 유도체이고, DNA에서 당은 데옥시-2-리보스이고, RNA에서 당은 리보스이고, DNA인지 아니면 RNA인지에 따라 질소성 염기는 아데닌, 구아닌, 우라실, 사이토신 및 티민으로부터 선택되고, 대안적으로 상기 단량체는 상기 3종의 구성요소들 중 1종 이상의 구성요소에서 변경된 뉴클레오티드이고, 예를 들면, 상기 변경은 변경된 염기, 예컨대, 이노신, 메틸-5-데옥시사이티딘, 데옥시우리딘, 다이메틸아미노-5-데옥시우리딘, 다이아미노-2,6-푸린, 브로모-5-데옥시우리딘 또는 혼성화할 수 있는 임의의 다른 변경된 염기를 갖도록 염기 수준에서 일어날 수 있거나, 당 수준에서 일어날 수 있거나(예를 들면, 폴리아마이드에 의한 1개 이상의 데옥시리보스의 치환)(문헌[P. E. Nielsen et al, Science, 254, 1497-1500 (1991)]), 또는 포스페이트 기 수준에서 일어날 수 있다(예를 들면, 특히 다이포스페이트, 알킬포스포네이트, 아릴포스포네이트 및 포스포로티오에이트로부터 선택된 에스터에 의한 포스페이트 기의 치환).The term "hybridization" is intended to mean a process in which two nucleotide fragments under appropriate conditions are combined with a stable specific hydrogen bond to form a double-stranded complex. These hydrogen bonds are formed between the complementary adenine (A) and the thymine (T) (or uracil (U)) bases (this is referred to as AT binding) or complementary guanine (G) (This is referred to as GC coupling). The hybridization of the two nucleotide fragments can be complete (in this case, referred to as complementary nucleotide fragment or sequence) (i.e., the double stranded conjugate obtained during this hybridization contains only the A-T and C-G bonds). This hybridization can be partial (in this case, referred to as a sufficiently complementary nucleotide fragment or sequence) (i. E., AT binding and CG binding, which allows the resulting double stranded conjugate to form a double stranded conjugate, Including bases which are not bonded to an atomic base. The hybridization between the two nucleotide fragments depends on the working conditions used and in particular on the stringency. Strictness is defined not only as a function of the base composition of the two nucleotide fragments, but also by the degree of discrepancy between the two nucleotide fragments. The degree of stringency may also depend on the reaction parameters, such as the concentration and type of ionic species present in the hybridization solution, the nature and concentration of the denaturant, and / or the hybridization temperature. All of these data are well known, and suitable conditions can be determined by one of ordinary skill in the art. Generally, depending on the length of the nucleotide fragments to be hybridized, the hybridization temperature is about 20 ° C to 70 ° C, especially 35 ° C to 65 ° C in a saline solution at a concentration of about 0.5 M to 1 M. A sequence, nucleotide fragment, oligonucleotide or polynucleotide is a series of nucleotide motifs assembled together by a phosphoester linkage, characterized by the information sequence of the native nucleic acid and capable of hybridizing to the nucleotide fragment, May contain monomers and may be obtained from natural nucleic acid molecules and / or may be obtained by genetic recombination and / or chemical synthesis. The motif is a derivative of a monomer in which the constituent is a natural nucleotide of a nucleic acid which is a sugar, a phosphate group and a nitrogenous base. In the DNA, the sugar is deoxy-2-ribose. In the RNA, the sugar is ribose. , The nitrogenous base is selected from adenine, guanine, uracil, cytosine and thymine, alternatively the monomer is a nucleotide modified in at least one of the three components, for example, Can be prepared by reacting a modified base such as inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, diamino-2,6-purine, Can occur at the base level to have any other modified base capable of hybridizing, or can occur at the sugar level (e. G., By substitution of one or more deoxyriboses with polyamides (PE Nielsen et al., Science, 254, 1497-1500 (1991)), or at phosphate group levels (e. G., Especially in the form of diphosphates, alkylphosphonates, arylphosphonates, Substitution of phosphate group by ester selected from porothioate).

본 발명의 목적을 위해, 용어 "증폭 프라이머"는 효소 중합, 예를 들면, 효소 증폭 반응의 개시를 가능하게 하는 5개 내지 100개 뉴클레오티드, 바람직하게는 15개 내지 30개 뉴클레오티드를 포함하는 뉴클레오티드 단편을 의미하기 위한 것이다. 용어 "효소 증폭 반응"은 1종 이상의 효소의 작용을 통해 뉴클레오티드 단편의 다수의 복사체를 발생시키는 과정을 의미하기 위한 것이다. 이러한 증폭 반응은 당업자에게 잘 공지되어 있고 특히 하기 기법들이 언급될 수 있다: 미국 특허 제4,683,195호, 미국 특허 제4,683,202호 및 미국 특허 제4,800,159호에 기재된 PCR(중합효소 연쇄 반응); 예를 들면, 유럽 특허출원 제0 201 184호에 개시된 LCR(연결효소(ligase) 연쇄 반응); 국제 특허출원 공개 제WO 90/01069호에 기재된 RCR(복구 연쇄 반응); 국제 특허출원 공개 제WO 90/06995호에 기재된 3SR(자가 지속 서열 복제); 국제 특허출원 공개 제WO 91/02818호에 기재된 NASBA(핵산 서열-기초 증폭); 미국 특허 제5,399,491호에 기재된 TMA(전사-매개된 증폭); 및 RT-PCR.For purposes of the present invention, the term "amplification primer" refers to a nucleotide fragment comprising 5 to 100 nucleotides, preferably 15 to 30 nucleotides, which enables the initiation of enzymatic polymerization, . The term "enzyme amplification reaction " is intended to mean the process of generating multiple copies of a nucleotide fragment through the action of one or more enzymes. Such amplification reactions are well known to those skilled in the art, and in particular the following techniques may be mentioned: PCR (polymerase chain reaction) as described in U.S. Patent No. 4,683,195, U.S. Patent No. 4,683,202 and U.S. Patent No. 4,800,159; For example, the LCR (ligase chain reaction) disclosed in European Patent Application No. 0 201 184; The RCR (repair chain reaction) described in International Patent Application Publication No. WO 90/01069; 3SR (self-sustained sequence replication) as described in International Patent Application Publication No. WO 90/06995; NASBA (nucleic acid sequence-based amplification) described in International Patent Application Publication No. WO 91/02818; TMA (transcription-mediated amplification) described in U.S. Patent No. 5,399,491; And RT-PCR.

효소 증폭이 PCR인 경우, 표적 유전자에 대해 특이적인 증폭 물질을 가능하게 하는, 상기 표적 유전자에 대해 특이적인 2종 이상의 증폭 프라이머가 사용된다. 표적 유전자에 대해 특이적인 물질은 바람직하게는 상기 표적 유전자로부터 유도된 메신저 RNA의 역전사에 의해 수득된 상보적 DNA(표적 유전자 특이적 cDNA로 지칭됨), 또는 표적 유전자에 대해 특이적인 cDNA의 전사에 의해 수득된 상보적 RNA(표적 유전자 특이적 cRNA로 지칭됨)를 포함한다. 효소 증폭이 역전사 반응 후 수행된 PCR인 경우, 이 증폭은 RT-PCR로 지칭된다.When the enzyme amplification is PCR, two or more kinds of amplification primers specific to the target gene are used, which enable an amplification substance specific for the target gene. The substance specific to the target gene is preferably a complementary DNA (referred to as target gene-specific cDNA) obtained by reverse transcription of messenger RNA derived from the target gene, or a cDNA specific for the target gene (Referred to as target gene-specific cRNA). If the enzyme amplification is PCR performed after the reverse transcription reaction, this amplification is referred to as RT-PCR.

용어 "혼성화 프로브"는 표적 유전자에 대해 특이적인 물질과 혼성화 결합체를 형성하도록 소정의 조건 하에 혼성화 특이성을 갖는, 5개 이상의 뉴클레오티드, 예컨대, 5개 내지 100개의 뉴클레오티드, 특히 10개 내지 75개의 뉴클레오티드, 예컨대, 15개 내지 35개의 뉴클레오티드 및 60개 내지 70개의 뉴클레오티드를 포함하는 뉴클레오티드 단편을 의미하기 위한 것이다. 본 발명에서, 표적 유전자에 대해 특이적인 물질은 표적 유전자로부터 유도된 메신저 RNA에 포함된 뉴클레오티드 서열(표적 유전자 특이적 mRNA로 지칭됨), 상기 메신저 RNA의 역전사에 의해 수득된 상보적 DNA에 포함된 뉴클레오티드 서열(표적 유전자 특이적 cDNA로 지칭됨), 또는 전술된 상기 cDNA의 전사에 의해 수득된 상보적 RNA에 포함된 뉴클레오티드 서열(표적 유전자 특이적 cRNA로 지칭됨)일 수 있다. 혼성화 프로브는 그의 검출을 위해 표지를 포함할 수 있다. 용어 "검출"은 직접적인 검출, 예컨대, 카운팅 방법, 또는 표지를 사용한 검출 방법에 의한 간접적인 검출을 의미하기 위한 것이다. 많은 핵산 검출 방법들이 존재한다(예를 들면, 문헌[Kricka et al., Clinical Chemistry, 1999, no 45 (4), p. 453-458] 또는 문헌[Keller G. H. et al., DNA Probes, 2nd Ed., Stockton Press, 1993, sections 5 and 6, p. 173-249] 참조). 용어 "표지"는 검출될 수 있는 신호를 발생시킬 수 있는 트레이서(tracer)를 의미하기 위한 것이다. 이들 트레이서의 비-한정적 목록은 비색법, 형광 또는 발광, 예컨대, 호스라디쉬 퍼록시다제(horseradish peroxidase), 알칼리성 포스파타제(phosphatase), 베타-갈락토시다제(galactosidase) 또는 글루코스-6-포스페이트 데하이드로게나제(dehydrogenase)에 의해 검출될 수 있는 신호를 생성하는 효소; 발색단, 예컨대, 형광, 발광 또는 염료 화합물; 전자 현미경관찰에 의해, 그들의 전기적 성질, 예컨대, 전도성에 의해, 전류측정법 또는 전압측정법에 의해, 또는 임피던스 측정에 의해 검출될 수 있는 전자 조밀 기; 광학적 방법, 예컨대, 회절, 표면 플라스몬 공명 또는 접촉각 변화에 의해, 또는 물리적 방법, 예컨대, 원자력 분광법, 터널 효과 등에 의해 검출될 수 있는 기; 및 방사성 분자, 예컨대, 32P, 35S 또는 125I를 포함한다.The term "hybridization probe" refers to a nucleic acid molecule having 5 or more nucleotides, such as 5 to 100 nucleotides, in particular 10 to 75 nucleotides, having a hybridization specificity under predetermined conditions to form a hybridization complex with a substance specific for the target gene. For example, a nucleotide fragment comprising 15 to 35 nucleotides and 60 to 70 nucleotides. In the present invention, a substance specific to the target gene includes a nucleotide sequence (referred to as target gene-specific mRNA) contained in messenger RNA derived from the target gene, a complementary DNA obtained by complementary DNA obtained by reverse transcription of the messenger RNA (Referred to as the target gene specific cDNA) contained in the complementary RNA obtained by the nucleotide sequence (referred to as the target gene specific cDNA) or the above-described transcription of the cDNA. The hybridization probe may include a label for its detection. The term "detection" is intended to mean a direct detection, such as a counting method, or an indirect detection by a detection method using a label. Many nucleic acid detection methods exist (see, for example, Kricka et al., Clinical Chemistry, 1999, no 45 (4), p. 453-458) or Keller GH et al., DNA Probes, 2nd Ed ., Stockton Press, 1993, sections 5 and 6, p. 173-249). The term "cover" is intended to mean a tracer capable of generating a signal that can be detected. A non-limiting list of these tracers includes colorimetry, fluorescence or luminescence, such as horseradish peroxidase, alkaline phosphatase, beta-galactosidase or glucose-6-phosphate dehydrogenase An enzyme that generates a signal that can be detected by dehydrogenase; Chromophores, such as fluorescence, luminescent or dye compounds; An electron tightener which can be detected by electron microscopic observation, by their electrical properties, for example, by conductivity, by current measurement or voltage measurement, or by impedance measurement; A group which can be detected by optical methods such as diffraction, surface plasmon resonance or contact angle changes, or by physical methods such as nuclear spectroscopy, tunneling effects, and the like; And radioactive molecules, such as 32 P, 35 S or 125 I.

본 발명의 목적을 위해, 혼성화 프로브는 "검출" 프로브일 수 있다. 이 경우, "검출" 프로브는 표지에 의해 표지된다. 검출 프로브는 특히 문헌[Tyagi & Kramer, Nature biotech, 1996, 14:303-308]에 기재된 "분자 비이콘(beacon)" 검출 프로브일 수 있다. 이들 "분자 비이콘"은 혼성화 동안 형광을 나타내게 된다. 이들은 줄기-루프 유형 구조를 갖고 형광단 및 "소광제" 기를 함유한다. 특정 루프 서열과 그의 상보적 표적 핵산 서열의 결합은 줄기가 풀어지게 하고 적절한 파장에서 여기 동안 형광 신호의 방사를 야기한다. 검출 프로브는 특히 나노스트링(NanoString: 상표명) 기술에 따른 "색채-코딩된 바코드"를 포함하는 "레포터 프로브"일 수 있다.For purposes of the present invention, the hybridization probe may be a "detection" probe. In this case, the "detection" probe is labeled by the label. The detection probe may be a "molecular beacon" detection probe, particularly as described in Tyagi & Kramer, Nature biotech, 1996, 14: 303-308. These "molecular beacons" will fluoresce during hybridization. They have a stem-loop type structure and contain fluorescent moieties and "quencher" groups. The combination of a particular loop sequence with its complementary target nucleic acid sequence causes the stem to loosen and cause emission of the fluorescent signal during excitation at the appropriate wavelength. The detection probe may be a " reporter probe ", especially comprising a "color-coded barcode" according to the NanoString (TM) technology.

혼성화 반응의 검출을 위해, 직접적으로(특히 표적 서열 내로의 표지의 도입에 의해) 또는 간접적으로(특히 상기 정의된 검출 프로브의 사용에 의해) 표지된 표적 서열을 사용할 수 있다. 특히, 예를 들면, 효소 증폭 반응 동안 표지된 데옥시리보뉴클레오티드 트라이포스페이트를 사용하여 표적 서열의 표지 및/또는 절단으로 구성된 단계를 혼성화 단계 전에 수행할 수 있다. 상기 절단은 특히 이미다졸 또는 염화망간의 작용에 의해 수행될 수 있다. 표적 서열은 예를 들면, 국제 특허출원 공개 제WO 91/19812호에 기재된 샌드위치 혼성화 기법에 따라 검출 프로브를 혼성화함으로써 증폭 단계 후에 표지될 수도 있다. 핵산을 표지하는 또 다른 특정 바람직한 방법은 프랑스 특허출원 제2780059호에 기재되어 있다.For detection of the hybridization reaction, the target sequence labeled directly (particularly by introduction of the label into the target sequence) or indirectly (particularly by use of the detection probe defined above) may be used. In particular, for example, a step consisting of labeling and / or cleavage of a target sequence using a deoxyribonucleotide triphosphate labeled during an enzyme amplification reaction may be performed prior to the hybridization step. The cleavage can be carried out in particular by the action of imidazole or manganese chloride. The target sequence may be labeled after the amplification step, for example, by hybridizing the detection probe according to the sandwich hybridization technique described in International Patent Application Publication No. WO 91/19812. Another particular preferred method of labeling nucleic acids is described in French patent application no. 2780059.

본 발명의 바람직한 실시양태에 따르면, 검출 프로브는 형광단 및 소광제를 포함한다. 본 발명의 훨씬 더 바람직한 실시양태에 따르면, 혼성화 프로브는 그의 5' 말단에서 FAM(6-카복시-플루오레세인) 또는 ROX(6-카복시-X-로다민) 형광단을 포함하고 그의 3' 말단에서 소광제(답실(Dabsyl))를 포함한다.According to a preferred embodiment of the present invention, the detection probe comprises a fluorescent dye and a quencher. According to a still more preferred embodiment of the present invention, the hybridization probe comprises a FAM (6-carboxy-fluororesin) or ROX (6-carboxy-X-rhodamine) fluorophore at its 5 ' (Dabsyl). ≪ / RTI >

혼성화 프로브는 "포획" 프로브일 수도 있다. 이 경우, "포획" 프로브는 고체 기판 상에 고정되어 있거나 임의의 적합한 수단에 의해, 즉 직접적으로 또는 간접적으로, 예를 들면, 공유 또는 흡착에 의해 고정될 수 있다. 임의적으로 화학적으로 변경된 합성 물질 또는 천연 물질, 특히 폴리사카라이드, 예컨대, 셀룰로스-기제 물질, 예를 들면, 종이, 셀룰로스 유도체, 예컨대, 셀룰로스 아세테이트 및 니트로셀룰로스 또는 덱스트란, 중합체, 공중합체, 특히 스티렌 유형의 단량체-기제 공중합체, 천연 섬유, 예컨대, 면포, 및 합성 섬유, 예컨대, 나일론; 무기 물질, 예컨대, 실리카, 석영, 유리 또는 세라믹; 라텍스; 자성 입자; 금속 유도체; 겔 등을 고체 기판으로서 사용할 수 있다. 고체 기판은 마이크로적정 플레이트, 국제 특허출원 공개 제WO 94/12670호에 기재된 막, 또는 입자의 형태로 존재할 수 있다. 표적 유전자에 대해 각각 특이적인 여러 상이한 포획 프로브들을 기판 상에 고정시키는 것도 가능하다. 특히, 많은 수의 프로브들이 고정될 수 있는 바이오칩을 기판으로서 사용할 수 있다. 용어 "바이오칩"은 많은 수의 포획 프로브들이 예정된 위치에 부착되어 있는 작은 크기의 고체 기판을 의미하기 위한 것이다. 바이오칩 또는 DNA 칩의 개념은 1990년대 초반부터 등장하였다. 상기 칩은 마이크로전자공학, 핵산 화학, 영상 분석 및 정보 기술을 통합한 여러 학문 분야에 걸친 기술에 기초한다. 작동 원리는 분자생물학의 토대인 혼성화 현상, 즉 2종의 DNA 및/또는 RNA 서열들의 염기들의 상보성에 의한 페어링에 기초한다. 바이오칩 방법은 고체 기판에 부착된 포획 프로브의 사용에 기초하는데, 이때 상기 프로브는 형광색소로 직접적으로 또는 간접적으로 표지된 표적 뉴클레오티드 단편의 샘플에 작용하도록 만들어진다. 포획 프로브는 기판 또는 칩 상에 특이적으로 위치하고, 각각의 혼성화는 표적 뉴클레오티드 단편과 관련된 특정 정보를 제공한다. 수득된 정보는 누적되고, 예를 들면, 1종 이상의 표적 유전자의 발현 수준을 정량하는 것을 가능하게 한다. 표적 유전자의 발현을 분석하기 위해, mRNA로 전사되는 표적 유전자의 전부 또는 일부에 상응하는 다수의 프로브들을 포함하는 기판을 제조할 수 있다. 본 발명의 목적을 위해, 용어 "저밀도 기판"은 50개 미만의 프로브들을 포함하는 기판을 의미하기 위한 것이다. 본 발명의 목적을 위해, 용어 "중밀도 기판"은 50개 내지 10,000개의 프로브들을 포함하는 기판을 의미하기 위한 것이다. 본 발명의 목적을 위해, 용어 "고밀도 기판"은 10,000개 초과의 프로브들을 포함하는 기판을 의미하기 위한 것이다.The hybridization probe may be a "capture" probe. In this case, the "capture" probe may be immobilized on a solid substrate or fixed by any suitable means, i.e., directly or indirectly, e.g. by covalent or adsorption. Optionally chemically modified synthetic or natural materials, in particular polysaccharides such as cellulose-based materials such as paper, cellulose derivatives such as cellulose acetate and nitrocellulose or dextran, polymers, copolymers, especially styrene Monomer-based copolymers of the type, natural fibers such as cotton, and synthetic fibers such as nylon; Inorganic materials such as silica, quartz, glass or ceramics; Latex; Magnetic particles; Metal derivatives; Gel or the like can be used as a solid substrate. The solid substrate may be in the form of a microtiter plate, a film as described in International Patent Application Publication No. WO 94/12670, or in the form of particles. It is also possible to immobilize various different capture probes, each specific for the target gene, on the substrate. In particular, a biochip where a large number of probes can be fixed can be used as a substrate. The term "biochip" is intended to mean a small size solid substrate having a large number of capture probes attached at predetermined locations. The concept of biochip or DNA chip has appeared since the early 1990s. The chip is based on a variety of disciplines that integrate microelectronics, nucleic acid chemistry, image analysis and information technology. The working principle is based on the hybridization phenomenon that is the basis of molecular biology, that is, the pairing by the complementarity of the bases of two DNA and / or RNA sequences. The biochip method is based on the use of a capture probe attached to a solid substrate, wherein the probe is made to act on a sample of a target nucleotide fragment labeled directly or indirectly with a fluorescent dye. The capture probe is specifically located on a substrate or chip, and each hybridization provides specific information related to the target nucleotide fragment. The information obtained is cumulative and makes it possible, for example, to quantify the expression level of one or more target genes. In order to analyze the expression of the target gene, a substrate containing a plurality of probes corresponding to all or a part of the target gene to be transcribed into mRNA can be produced. For purposes of the present invention, the term "low density substrate" is intended to mean a substrate comprising less than 50 probes. For purposes of the present invention, the term "medium density substrate" is intended to mean a substrate comprising 50 to 10,000 probes. For the purposes of the present invention, the term "high density substrate" is intended to mean a substrate comprising more than 10,000 probes.

분석하고자 하는 표적 유전자의 핵산에 대해 특이적인 cRNA 또는 cDNA를 예를 들면, 특이적 포획 프로브에 혼성화시킨다. 혼성화 후, 기판 또는 칩을 세척하고, 표지된 cDNA 또는 cRNA/포획 프로브 결합체는 예를 들면, 형광색소 유형의 표지에 결합된 고친화성 리간드에 의해 표시된다. 형광을 예를 들면, 스캐너로 판독하고, 형광의 분석을 정보 기술로 프로세싱한다. 예를 들면, 분자 진단을 위해 아피메트릭스(Affymetrix)에 의해 개발된 DNA 칩(문헌["Accessing Genetic Information with High-Density DNA arrays", M. Chee et al., Science, 1996, 274, 610-614], ["Light-generated oligonucleotide arrays for rapid DNA sequence analysis", A. Caviani Pease et al., Proc. Natl. Acad. Sci. USA, 1994, 91, 5022-5026])이 언급될 수 있다. 이 기술에서, 포획 프로브는 일반적으로 크기에 있어서 작다(약 25개 뉴클레오티드). 바이오칩의 다른 예는 문헌[G. Ramsay, Nature Biotechnology, 1998, No. 16, p. 40-44]; 문헌[F. Ginot, Human Mutation, 1997, No. 10, p. 1-10]; 문헌[J. Cheng et al, Molecular diagnosis, 1996, No. 1 (3), p. 183-200]; 문헌[T. Livache et al, Nucleic Acids Research, 1994, No. 22 (15), p. 2915-2921]; 문헌[J. Cheng et al, Nature Biotechnology, 1998, No. 16, p. 541-546], 미국 특허 제4,981,783호, 미국 특허 제5,700,637호, 미국 특허 제5,445,934호, 미국 특허 제5,744,305호 및 미국 특허 제5,807,522호에 제시되어 있다. 고체 기판의 주요 특징은 표적 뉴클레오티드 단편에 대한 포획 프로브의 혼성화 특징을 보존하면서 동시에 검출 방법을 위해 최소한의 배경 노이즈(noise)를 발생시키는 것이어야 한다. 프로브를 기판 상에 고정시키는 3종의 주요 제작법이 구별될 수 있다. CRNA or cDNA specific for the nucleic acid of the target gene to be analyzed is hybridized to, for example, a specific capture probe. After hybridization, the substrate or chip is washed and the labeled cDNA or cRNA / capture probe conjugate is labeled, for example, by a high affinity ligand bound to the label of the fluorescent dye type. Fluorescence is read by a scanner, for example, and the analysis of fluorescence is processed by information technology. For example, DNA chips developed by Affymetrix for molecular diagnostics (see "Accessing Genetic Information with High-Density DNA arrays ", M. Chee et al., Science, 1996, 274, 610-614 ], ["Light-generated oligonucleotide arrays for rapid DNA sequence analysis", A. Caviani Pease et al., Proc Natl Acad Sci USA, 1994, 91, 5022-5026). In this technique, capture probes are generally small in size (about 25 nucleotides). Other examples of biochips are described in G. Ramsay, Nature Biotechnology, 1998, No. 16, p. 40-44]; F. Ginot, Human Mutation, 1997, No. 10, p. 1-10]; J. Cheng et al., Molecular diagnosis, 1996, no. 1 (3), p. 183-200]; T. Livache et al., Nucleic Acids Research, 1994, No. 22 (15), p. 2915-2921; J. Cheng et al., Nature Biotechnology, 1998, no. 16, p. 541-546, US 4,981,783, US 5,700,637, US 5,445,934, US 5,744,305, and US 5,807,522. The main feature of the solid substrate should be to generate minimal background noise for the detection method while at the same time preserving the hybridization characteristics of the capture probe to the target nucleotide fragment. Three major fabrication methods for immobilizing probes on a substrate can be distinguished.

무엇보다도, 미리 합성된 프로브를 침착시키는 단계로 구성된 제1 기법이 존재한다. 프로브의 부착은 직접적인 전달, 마이크로피펫 또는 마이크로도트, 또는 잉크젯 디바이스에 의해 수행된다. 이 기법은 몇 개의 염기(5개 내지 10개)부터 상대적으로 큰 크기인 60개의 염기(프린팅) 또는 수백 개의 염기(마이크로침착)까지 이르는 크기를 갖는 프로브의 부착을 가능하게 한다.Above all, there is a first technique consisting of depositing a pre-synthesized probe. Attachment of the probe is performed by direct delivery, micropipette or microdot, or inkjet device. This technique allows the attachment of probes of sizes ranging from a few bases (5 to 10) to a relatively large size of 60 bases (printing) or hundreds of bases (micro-deposition).

프린팅은 잉크젯 프린터에 의해 이용되는 방법을 개조한 방법이다. 프린팅은 4000개 소적/초에 도달할 수 있는 속도에서 매우 작은 유체 구(부피 <1 nl)의 추진에 기초한다. 프린팅은 상기 유체를 방출하는 시스템과 이것이 침착되는 표면 사이에 어떠한 접촉도 수반하지 않는다.Printing is a modification of the method used by inkjet printers. Printing is based on propulsion of very small fluid sumps (volume < 1 nl) at a rate that can reach 4000 droplets / s. Printing does not involve any contact between the system that emits the fluid and the surface on which it is deposited.

마이크로침착은 수십 개의 염기 내지 수백 개의 염기로 구성된 긴 프로브들을 유리 슬라이드의 표면에 부착시키는 단계로 구성된다. 이들 프로브들은 일반적으로 데이터베이스로부터 추출되고, 증폭되고 정제된 생성물의 형태로 존재한다. 이 기법은 4 cm.sup.2 미만의 작은 표면적 상에 약 10,000개의 DNA 스폿(spot)(인식 대역으로 지칭됨)을 보유하는 마이크로어레이로 지칭되는 칩을 제조할 수 있게 한다. 그러나, 일반적으로 PCR에 의해 증폭된 생성물을 보유하고 직경이 0.5 mm 내지 1 mm이고 최대 밀도가 25개 스폿/cm.sup.2인, "마크로어레이"로 지칭되는 나일론 막의 사용도 잊어서는 안 된다. 이 매우 유연한 기법은 많은 실험실에 의해 이용된다. 본 발명에서 후자 기법은 바이오칩에 포함되는 것으로 간주된다. 그러나, 일정한 부피의 샘플이 국제 특허출원 공개 제WO 00/71750호 및 프랑스 특허출원 제00/14896호의 경우에서와 같이 마이크로적정 플레이트의 각각의 웰의 바닥에 침착될 수 있거나, 서로 분리되어 있는 일정한 수의 소적이 또 다른 프랑스 특허출원 제00/14691호에 따라 하나의 동일한 페트리 디쉬의 바닥에 침착될 수 있다.Micro-deposition consists of attaching long probes consisting of tens to hundreds of bases to the surface of the glass slide. These probes are generally extracted from the database, and are present in the form of amplified and purified products. This technique makes it possible to produce chips called microarrays having about 10,000 DNA spots (referred to as recognition bands) on a small surface area of less than 4 cm.sup.2. However, the use of nylon membranes, commonly referred to as "macroarrays, " which have products amplified by PCR and have diameters of 0.5 mm to 1 mm and a maximum density of 25 spots / cm.sup.2 . This highly flexible technique is used by many laboratories. In the present invention, the latter technique is considered to be included in the biochip. However, a certain volume of sample may be deposited on the bottom of each well of the microtiter plate, as in the case of WO 00/71750 and in French Patent Application No. 00/14896, Water droplets can be deposited on the bottom of one and the same petri dish in accordance with another French patent application No. 00/14691.

프로브를 기판 또는 칩에 부착시키는 제2 기법은 제자리(in situ) 합성으로 지칭된다. 이 기법은 짧은 프로브들이 칩의 표면에서 직접적으로 생성되게 한다. 상기 기법은 제자리 올리고뉴클레오티드 합성(특히, 국제 특허출원 공개 제WO 89/10977호 및 국제 특허출원 공개 제WO 90/03382호 참조) 및 올리고뉴클레오티드 합성기 과정에 기초한다. 상기 기법은 올리고뉴클레오티드 연장 반응이 일어나는 반응 챔버를 유리 표면을 따라 이동시키는 단계로 구성된다.The second technique of attaching probes to a substrate or chip is referred to as in situ synthesis. This technique allows short probes to be generated directly on the surface of the chip. This technique is based on in situ oligonucleotide synthesis (in particular, International Patent Application Publication No. WO 89/10977 and International Patent Application Publication No. WO 90/03382) and oligonucleotide synthesizer processes. The technique consists of moving the reaction chamber along which the oligonucleotide extension reaction takes place along the glass surface.

최종적으로, 제3 기법은 아피메트릭스에 의해 개발된 바이오칩의 기초가 된 공정인 사진석판술(photolithography)로 지칭된다. 이 기법도 제자리 합성이다. 사진석판술은 마이크로프로세서 기법으로부터 유도된다. 칩의 표면은 광 활성화될 수 있는 광불안정성 화학적 기들의 부착에 의해 변경된다. 일단 조명되면, 이들 기들은 올리고뉴클레오티드의 3' 말단과 반응할 수 있다. 이 표면을 정의된 형태의 마스크로 보호함으로써, 4종의 뉴클레오티드들 중 어느 하나를 부착시키고자 하는 칩의 영역을 선택적으로 조명하여 활성화시킬 수 있다. 상이한 마스크의 성공적인 사용은 보호/반응의 주기를 교대로 수행하여 약 수십 입방 마이크로미터(㎛2)의 스폿 상에서 올리고뉴클레오티드 프로브를 생성할 수 있게 한다. 이 방법은 몇 입방 센티미터(cm2)의 표면적 상에서 최대 수십만 개의 스폿을 생성하는 것을 가능하게 한다. 사진석판술은 단지 4회 N 주기에서 N-머의 칩을 동시에 대량으로 생성하는 것을 가능하게 한다는 장점을 갖는다. 모든 이들 기법들이 본 발명과 함께 이용될 수 있다. 본 발명의 바람직한 실시양태에 따르면, 상기 정의된 단계 b)의 1종 이상의 특이적 시약은 바람직하게는 기판 상에 고정된 1종 이상의 혼성화 프로브를 포함한다. 이 기판은 바람직하게는 상기 정의된 저밀도 기판, 고밀도 기판 또는 중밀도 기판이다.Finally, the third technique is referred to as photolithography, which is the basis of the biochip developed by Apimetrix. This technique is also in situ synthesis. Photolithography is derived from microprocessor techniques. The surface of the chip is altered by the attachment of photo-labile chemical groups that can be optically activated. Once illuminated, these groups can react with the 3 ' end of the oligonucleotide. By protecting the surface with a mask of defined type, the area of the chip to which one of the four kinds of nucleotides is to be attached can be selectively illuminated and activated. Successful use of different masks allows for the generation of oligonucleotide probes on spots of approximately several tens of cubic microns (탆 2 ) by alternating cycles of protection / reaction. This method makes it possible to generate up to several hundred thousand spots on a surface area of several cubic centimeters (cm &lt; 2 &gt;). Photolithography has the advantage that it makes it possible to produce large quantities of N-MER chips simultaneously in only four N cycles. All of these techniques can be used with the present invention. According to a preferred embodiment of the invention, the at least one specific reagent of step b), defined above, preferably comprises at least one hybridization probe immobilized on a substrate. The substrate is preferably a low density substrate, a high density substrate or a medium density substrate defined above.

표적 유전 물질의 양을 증가시키기 위해 다수의 프로브들을 포함하는 기판 상에서의 이들 혼성화 단계 후 상기 정의된 바와 같은 효소 증폭 반응 단계를 수행할 수 있다.After these hybridization steps on a substrate comprising a plurality of probes to increase the amount of target dielectric material, an enzyme amplification reaction step as defined above may be performed.

표적 유전자의 발현 수준의 측정은 당업자에게 공지된 임의의 프로토콜에 의해 수행될 수 있다. 일반적으로, 표적 유전자의 발현은 주어진 순간에 표적 유전자로부터 전사된 mRNA(메신저 RNA)를 검출함으로써 분석될 수 있다.Measurement of the expression level of the target gene can be performed by any protocol known to those skilled in the art. In general, the expression of a target gene can be analyzed by detecting mRNA (messenger RNA) transcribed from the target gene at a given moment.

본 발명은 바람직하게는 당업자에게 잘 공지된 임의의 프로토콜에 따라 표적 유전자로부터 유도된 mRNA를 검출함으로써 이 표적 유전자의 발현 수준을 측정하는 것에 관한 것이다. 본 발명의 특정 실시양태에 따르면, 여러 표적 유전자들의 발현 수준이 여러 상이한 mRNA들(각각의 mRNA는 표적 유전자로부터 유도됨)의 검출에 의해 동시에 측정된다.The present invention is preferably directed to determining the level of expression of this target gene by detecting mRNA derived from the target gene according to any protocol well known to those skilled in the art. According to certain embodiments of the present invention, the expression levels of different target genes are simultaneously measured by detection of several different mRNAs (each mRNA is derived from the target gene).

증폭을 통해 다음과 같이 표적 유전자의 발현 수준을 측정할 수 있다: 1) 전혈로부터 (전달 RNA(tRNA), 리보좀 RNA(rRNA) 및 메신저 RNA(mRNA)를 포함하는) 총 RNA를 추출한 후, 상기 mRNA의 상보적 DNA(또는 cDNA)를 수득하기 위해 역전사 단계를 수행한다. 예를 들면, 이 역전사 반응은 RNA 단편으로부터 상보적 DNA 단편을 수득하는 것을 가능하게 하는 역전사효소를 사용함으로써 수행될 수 있다. 특히, AMV(조류 미요블라스토시스(Myoblastosis) 바이러스) 또는 MMLV(몰로니 뮤린(Moloney Murine) 백혈병 바이러스)로부터의 역전사효소를 사용할 수 있다. 보다 구체적으로, mRNA의 cDNA만을 수득하고자 하는 경우, mRNA의 폴리A 서열에 상보적으로 혼성화하는 티민 염기만을 포함하는 뉴클레오티드 단편(폴리T)의 존재 하에 이 역전사 단계를 수행하여 역전사효소에 의해 수행되는 역전사 반응을 위한 출발점으로서 작용하는 폴리T-폴리A 결합체를 형성한다. 표적 유전자로부터 유도된 mRNA에 상보적인 cDNA(표적 유전자 특이적 cDNA) 및 표적 유전자 이외의 유전자로부터 유도된 mRNA에 상보적인 cDNA(표적 유전자에 대해 특이적이지 않은 cDNA)를 수득한다. 2) 표적 유전자에 대해 특이적인 증폭 프라이머(들)를 표적 유전자 특이적 cDNA 및 표적 유전자에 대해 특이적이지 않은 cDNA와 접촉시킨다. 표적 유전자에 대해 특이적인 증폭 프라이머(들)는 표적 유전자 특이적 cDNA와 혼성화하고, 표적 유전자로부터 유도된 mRNA로부터 유래된 cDNA의 공지된 길이의 예정된 영역이 특이적으로 증폭된다. 표적 유전자에 대해 특이적이지 않은 cDNA는 증폭되지 않는 반면, 다량의 표적 유전자 특이적 cDNA는 수득된다. 본 발명의 목적을 위해, "표적 유전자 특이적 cDNA" 또는 "표적 유전자로부터 유도된 mRNA로부터 유래된 cDNA"가 구별 없이 언급된다. 이 단계는 특히 PCR 유형의 증폭 반응에 의해 또는 상기 정의된 임의의 다른 증폭 기법에 의해 수행될 수 있다. 여러 쌍의 상이한 증폭 프라이머(이들 각각은 표적 유전자에 대해 특이적임)를 사용하여 PCR로 여러 상이한 cDNA들(이들 각각은 상이한 표적 유전자에 대해 특이적임)을 동시에 증폭할 수도 있다(다중 증폭으로 지칭됨). 3) 표적 유전자의 발현은 상기 단계 2)에서 수득된 표적 유전자 특이적 cDNA를 검출하고 정량함으로써 측정된다. 이 검출은 그들의 크기에 따른 표적 유전자 특이적 cDNA들의 전기영동 이동 후 수행될 수 있다. 상기 이동을 위한 겔 및 매질은 주어진 이동 시간 후 겔이 UV-선(자외선) 광 테이블 상에 놓일 때 광 신호의 방사를 통해 표적 유전자 특이적 cDNA들의 직접적인 검출을 허용하도록 에티듐 브로마이드를 포함할 수 있다. 표적 유전자 특이적 cDNA의 양이 많을수록 이 광 신호가 더욱 밝다. 이들 전기영동 기법들은 당업자에게 잘 공지되어 있다. 표적 유전자 특이적 cDNA는 포화될 때까지 수행된 증폭 반응에 의해 수득된 정량 범위를 이용함으로써 검출되고 정량될 수도 있다. 다양한 단계(역전사, PCR 등) 동안 관찰될 수 있는 효소 효율에서의 가변성을 고려하기 위해, 다양한 환자 군들에서 유사한 발현을 갖는 "하우스킵핑" 유전자의 발현을 동시에 측정함으로써 다양한 환자 군들의 표적 유전자의 발현을 표준화할 수 있다. 표적 유전자의 발현 대 하우스킵핑 유전자의 발현의 비를 수득함으로써, 즉 표적 유전자 특이적 cDNA의 양 대 하우스킵핑 유전자 특이적 cDNA의 양의 비를 수득함으로써, 다양한 실험들 사이의 임의의 가변성을 보정한다. 당업자는 특히 하기 문헌을 참조할 수 있다: 문헌[Bustin S A, J Mol Endocrinol, 2002, 29: 23-39] 및 문헌[Giulietti A Methods, 2001, 25: 386-401].Amplification can be used to determine the expression level of the target gene as follows: 1) Total RNA (including transfer RNA (tRNA), ribosomal RNA (rRNA) and messenger RNA (mRNA)) is extracted from whole blood, A reverse transcription step is performed to obtain the complementary DNA (or cDNA) of the mRNA. For example, the reverse transcription reaction can be performed by using a reverse transcriptase that makes it possible to obtain a complementary DNA fragment from the RNA fragment. In particular, reverse transcriptase from AMV (Myoblastosis virus) or MMLV (Moloney Murine leukemia virus) can be used. More specifically, when it is desired to obtain only cDNA of mRNA, this reverse transcription step is carried out in the presence of a nucleotide fragment (poly T) containing only a thymine base complementarily hybridizing to the poly A sequence of the mRNA, To form a poly T-poly A conjugate that serves as a starting point for the reverse transcription reaction. CDNA (target gene specific cDNA) complementary to the mRNA derived from the target gene and cDNA complementary to the mRNA derived from the gene other than the target gene (cDNA not specific to the target gene) are obtained. 2) The amplification primer (s) specific for the target gene is brought into contact with the target gene-specific cDNA and the cDNA that is not specific for the target gene. The amplification primer (s) specific for the target gene hybridizes with the target gene specific cDNA and the predetermined region of the known length of the cDNA derived from the mRNA derived from the target gene is specifically amplified. CDNA that is not specific to the target gene is not amplified while a large amount of target gene-specific cDNA is obtained. For the purpose of the present invention, "target gene specific cDNA" or "cDNA derived from mRNA derived from a target gene" is mentioned without any distinction. This step may be carried out, in particular, by an amplification reaction of the PCR type or by any other amplification technique as defined above. It is also possible to simultaneously amplify several different cDNAs (each of which is specific for a different target gene) by PCR using multiple pairs of different amplification primers (each of which is specific for the target gene) (referred to as multiple amplifications) ). 3) Expression of the target gene is measured by detecting and quantifying the target gene-specific cDNA obtained in the step 2). This detection can be performed after electrophoretic migration of target gene specific cDNAs according to their size. The gel and medium for the movement can contain ethidium bromide to allow direct detection of target gene specific cDNAs through the emission of a light signal when the gel is placed on a UV-ray (ultraviolet) light table after a given migration time have. The greater the amount of target gene specific cDNA, the brighter the optical signal. These electrophoresis techniques are well known to those skilled in the art. The target gene-specific cDNA may be detected and quantified by using the quantification range obtained by the amplification reaction performed until saturation. To simultaneously account for variability in enzyme efficiency that can be observed during various steps (reverse transcription, PCR, etc.), the expression of the "howls skipping" gene, which has similar expression in various patient groups, Expression can be standardized. By obtaining the ratio of the expression of the target gene to the expression of the Haas Skipping gene, that is, by obtaining the ratio of the amount of the target gene specific cDNA to the amount of the Haas Skipping gene specific cDNA, an arbitrary variability . Those skilled in the art can in particular refer to the following references: Bustin SA, J Mol Endocrinol, 2002, 29: 23-39 and Giulietti A Methods, 2001, 25: 386-401.

혼성화를 통해 표적 유전자의 발현을 다음과 같이 측정할 수 있다: 1) 전혈로부터 총 RNA를 추출한 후, 역전사 단계를 전술된 바와 같이 수행하여 표적 유전자로부터 유도된 mRNA에 상보적인 cDNA(표적 유전자 특이적 cDNA) 및 표적 유전자 이외의 유전자로부터 유도된 mRNA에 상보적인 cDNA(표적 유전자에 대해 특이적이지 않은 cDNA)를 수득한다. 2) 모든 cDNA들을 표적 유전자(이의 발현을 분석하고자 함)에 대해 특이적인 포획 프로브들이 고정되어 있는 기판과 접촉시켜, 표적 유전자 특이적 cDNA와 포획 프로브 사이의 혼성화 반응을 수행하는데, 이때 표적 유전자에 대해 특이적이지 않은 cDNA는 상기 포획 프로브에 혼성화하지 않는다. 상기 표시된 모든 물질들을 포함하는 고체 기판 상에서 혼성화 반응을 수행할 수 있다. 바람직한 실시양태에 따르면, 혼성화 프로브는 기판 상에 고정된다. 바람직하게는, 기판은 상기 정의된 저밀도, 고밀도 또는 중밀도 기판이다. 혼성화 반응 후, 전술된 바와 같이 표적 유전자 특이적 cDNA의 효소 증폭으로 구성된 단계를 수행하여, 다량의 표적 유전자 특이적 cDNA들을 수득할 수 있고 표적 유전자 특이적 cDNA가 표적 유전자에 대해 특이적인 포획 프로브에 혼성화하는 확률을 증가시킬 수 있다. 또한, 혼성화 반응 후, 예를 들면, 표지된 데옥시리보뉴클레오티드 트라이포스페이트를 증폭 반응에 사용하여 전술된 바와 같이 표적 유전자 특이적 cDNA의 표지 및/또는 절단으로 구성된 단계를 수행할 수 있다. 상기 절단은 특히 이미다졸 및 염화망간의 작용에 의해 수행될 수 있다. 예를 들면, 표지된 프로브를 국제 특허출원 공개 제WO 91/19812호에 기재된 샌드위치 혼성화 기법에 따라 혼성화함으로써 증폭 단계 후 표적 유전자 특이적 cDNA를 표지할 수도 있다. 다른 바람직한 핵산 표지 및/또는 절단 방법은 국제 특허출원 공개 제WO 99/65926호, 국제 특허출원 공개 제WO 01/44507호, 국제 특허출원 공개 제WO 01/44506호, 국제 특허출원 공개 제WO 02/090584호 및 국제 특허출원 공개 제WO 02/090319호에 기재되어 있다. 3) 혼성화 반응의 검출로 구성된 단계를 후속적으로 수행한다. 검출은 표적 유전자에 대해 특이적인 포획 프로브가 표적 유전자 특이적 cDNA와 혼성화되어 있는 기판을 표지로 표지된 "검출" 프로브와 접촉시키고 상기 표지에 의해 방사된 신호를 검출함으로써 수행될 수 있다. 표적 유전자 특이적 cDNA가 표지로 미리 표지되어 있는 경우, 상기 표지에 의해 방사된 신호는 직접적으로 검출된다.The expression of the target gene can be measured by hybridization as follows: 1) Total RNA is extracted from whole blood, and then the reverse transcription step is performed as described above to amplify the cDNA complementary to the mRNA derived from the target gene cDNA) and cDNA complementary to the mRNA derived from the gene other than the target gene (cDNA not specific to the target gene). 2) The hybridization reaction between the target gene-specific cDNA and the capture probe is carried out by contacting all the cDNAs with a substrate on which capture probes specific for the target gene (to analyze its expression) are immobilized, Lt; / RTI &gt; does not hybridize to the capture probe. A hybridization reaction may be performed on a solid substrate including all the above-indicated materials. According to a preferred embodiment, the hybridization probe is immobilized on a substrate. Preferably, the substrate is a low density, high density or medium density substrate as defined above. After the hybridization reaction, a step composed of enzyme amplification of the target gene-specific cDNA as described above is performed to obtain a large amount of target gene-specific cDNAs, and the target gene-specific cDNA is introduced into a capture probe specific for the target gene The probability of hybridization can be increased. In addition, after the hybridization reaction, for example, a labeled deoxyribonucleotide triphosphate may be used in an amplification reaction to perform a step consisting of labeling and / or cleavage of the target gene specific cDNA as described above. The cleavage can be carried out in particular by the action of imidazole and manganese chloride. For example, a labeled probe may be hybridized according to the sandwich hybridization technique described in WO 91/19812 to label the target gene-specific cDNA after the amplification step. Other preferred nucleic acid labeling and / or cleavage methods are disclosed in International Patent Application Publication No. WO 99/65926, International Patent Application Publication No. WO 01/44507, International Patent Application Publication No. WO 01/44506, International Patent Application Publication WO 02 / 090584 and International Patent Application Publication No. WO 02/090319. 3) The subsequent step is performed by detecting the hybridization reaction. Detection can be performed by contacting a substrate with a capture probe specific for the target gene hybridized with the target gene specific cDNA with a labeled "detection" probe and detecting the signal emitted by the label. When the target gene-specific cDNA is pre-labeled with a label, the signal emitted by the label is directly detected.

표적 유전자의 발현은 하기 방식으로 측정될 수도 있다: 1) 전혈로부터 총 RNA를 추출한 후, 역전사 단계를 전술된 바와 같이 수행하여 생물학적 물질의 mRNA의 cDNA를 수득한다. 그 다음, 프로모터의 조절 하에 작용하고 DNA 주형으로부터 상보적 RNA를 수득하는 것을 가능하게 하는 T7 중합효소를 사용하여 상기 cDNA의 상보적 RNA의 중합을 수행한다. 그 다음, 표적 유전자에 대해 특이적인 mRNA의 cDNA의 cRNA(표적 유전자 특이적 cRNA로 지칭됨) 및 표적 유전자에 대해 특이적이지 않은 mRNA의 cDNA의 cRNA를 수득한다. 2) 모든 cRNA들을 표적 유전자(이의 발현을 분석하고자 함)에 대해 특이적인 포획 프로브들이 고정되어 있는 기판과 접촉시켜, 표적 유전자 특이적 cRNA들와 상기 포획 프로브들 사이의 혼성화 반응을 수행하는데, 이때 표적 유전자에 대해 특이적이지 않은 cRNA는 포획 프로브에 혼성화되지 않는다. 여러 표적 유전자들의 발현을 동시에 분석하고자 하는 경우, 여러 상이한 포획 프로브들(이들 각각은 표적 유전자에 대해 특이적임)을 기판 상에 고정시킬 수 있다. 또한, 혼성화 반응 후, 전술된 바와 같이 표적 유전자 특이적 cRNA의 표지 및/또는 절단으로 구성된 단계를 수행할 수 있다. 3) 혼성화 반응의 검출로 구성된 단계를 후속적으로 수행한다. 검출은 표적 유전자에 대해 특이적인 포획 프로브가 표적 유전자 특이적 cRNA와 혼성화되어 있는 기판을 표지로 표지된 "검출" 프로브와 접촉시키고 상기 표지에 의해 방사된 신호를 검출함으로써 수행될 수 있다. 표적 유전자 특이적 cRNA가 표지로 미리 표지되어 있는 경우, 상기 표지에 의해 방사된 신호는 직접적으로 검출된다. cRNA의 사용은 많은 수의 프로브들이 혼성화되어 있는 바이오칩 유형의 기판이 사용되는 경우 특히 유리하다.Expression of the target gene may be measured in the following manner: 1) Total RNA is extracted from whole blood, and then the reverse transcription step is performed as described above to obtain cDNA of the mRNA of the biological material. The polymerisation of the complementary RNA of the cDNA is then carried out using a T7 polymerase which is capable of acting under the control of the promoter and obtaining complementary RNA from the DNA template. Next, cRNA (referred to as target gene-specific cRNA) of cDNA of mRNA specific to the target gene and cRNA of cDNA of mRNA not specific to the target gene are obtained. 2) All cRNAs are contacted with a substrate onto which capture probes specific for the target gene (to analyze its expression) are immobilized to perform hybridization reactions between the target gene-specific cRNAs and the capture probes, The cRNA that is not specific for the gene is not hybridized to the capture probe. When simultaneous expression of multiple target genes is desired, multiple different capture probes (each of which is specific for the target gene) can be immobilized on the substrate. In addition, after the hybridization reaction, a step consisting of labeling and / or cleavage of the target gene-specific cRNA may be performed as described above. 3) The subsequent step is performed by detecting the hybridization reaction. Detection may be performed by contacting a substrate with a capture probe specific for the target gene hybridized with the target gene-specific cRNA, with a labeled "detection" probe and detecting the signal emitted by the label. When the target gene-specific cRNA is pre-labeled with a label, the signal emitted by the label is directly detected. The use of cRNA is particularly advantageous when a substrate of the biochip type is used in which a large number of probes are hybridized.

발현 생성물이 폴리펩티드인 경우, 상기 폴리펩티드는 이를 1종 이상의 특정 리간드, 예컨대, 하기 정의된 리간드와 접촉시킴으로써 검출될 수 있다. 바람직한 실시양태에서, 발현된 폴리펩티드를 2종 이상의 특정 리간드, 예컨대, 하기 정의된 리간드와 접촉시킨다. 특정 리간드는 예를 들면, 항체 또는 "나노피틴(Nanofitin: 상표명)"으로 명명된 친화성 단백질을 의미한다. If the expression product is a polypeptide, the polypeptide may be detected by contacting it with one or more specific ligands, such as a ligand as defined below. In a preferred embodiment, the expressed polypeptide is contacted with two or more specific ligands, such as the ligands defined below. Certain ligands refer to, for example, antibodies or affinity proteins named "Nanofitin ".

나노피틴은 경쟁 특징을 갖는 친화성 단백질이다. 나노피틴은 항체와 유사한 경쟁 친화성을 나타낸다.Nanophytes are affinity proteins with competitive properties. Nanophytes exhibit competitive affinities similar to antibodies.

용어 "항체 또는 항체들"은 다중클론 항체, 단일클론 항체, 인간화된 항체 및 재조합 항체를 포괄한다. 이의 제조 방법은 당업자에게 잘 공지되어 있다.The term "antibody or antibodies" encompasses a polyclonal antibody, a monoclonal antibody, a humanized antibody, and a recombinant antibody. Methods for their preparation are well known to those skilled in the art.

본 발명은 1종 이상의 핵산 서열에 대해 특이적인 1종 이상의 결합 파트너 내지 7종의 핵산 서열의 7종의 발현 생성물에 대해 특이적인 7종 이하의 결합 파트너를 포함하는, 개체가 결장직장암을 앓을 확률을 시험관내에서 측정하는 키트도 포함하는데, 이때 상기 1종 이상의 결합 파트너는 서열번호 1 내지 11에 기재된 서열들로 구성된 군으로부터 선택된 1종 이상의 핵산 서열의 1종 이상의 발현 생성물에 대해 특이적이다.The present invention relates to a method for screening for the presence or absence of one or more binding partners specific for one or more nucleic acid sequences, Wherein the one or more binding partners are specific for one or more expression products of one or more nucleic acid sequences selected from the group consisting of the sequences set forth in SEQ ID NOS: 1-11.

특히, 상기 키트는 서열번호 1; 서열번호 2, 서열번호 3 또는 서열번호 4; 서열번호 5 또는 서열번호 6; 서열번호 7 또는 서열번호 8; 서열번호 9; 서열번호 10; 및 서열번호 11에 기재된 서열을 갖는 7종의 핵산 서열의 발현 생성물에 대해 특이적인 7종의 결합 파트너의 조합물을 포함한다.In particular, the kit comprises SEQ ID NO: 1; SEQ ID NO: 2, SEQ ID NO: 3 or SEQ ID NO: 4; SEQ ID NO: 5 or SEQ ID NO: 6; SEQ ID NO: 7 or SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; And a combination of seven binding partners specific for the expression products of the seven nucleic acid sequences having the sequences set forth in SEQ ID NO: 11.

상기 키트에서, 상기 특이적 결합 파트너는 1종 이상의 혼성화 프로브, 1종 이상의 혼성화 프로브 및 1종 이상의 프라이머, 1종 이상의 혼성화 프로브 및 2종의 프라이머, 또는 1종 이상의 특이적 리간드 또는 2종 이상의 특이적 리간드, 예컨대, 항체 및/또는 친화성 단백질을 포함한다.In the kit, the specific binding partner may comprise at least one hybridization probe, at least one hybridization probe and at least one primer, at least one hybridization probe and at least two primers, or at least one specific ligand or at least two specific Such as antibodies and / or affinity proteins.

마지막으로, 본 발명은 개체가 결장직장암을 앓을 확률을 시험관내에서 측정하기 위한 조성물의 제조에 있어서 1종 이상의 핵산 서열의 1종 이상의 발현 생성물에 대한 1종 이상의 특이적 결합 파트너 내지 7종의 핵산 서열의 7종의 발현 생성물에 대한 7종 이하의 특이적 결합 파트너의 용도에 관한 것으로서, 이때 상기 1종 이상의 핵산 서열은 서열번호 1 내지 11에 기재된 핵산 서열들로 구성된 군으로부터 선택된 서열을 갖는다.Finally, the present invention relates to the use of one or more specific binding partners for at least one expression product of one or more nucleic acid sequences to seven nucleic acid sequences for the production of a composition for in vitro measurement of the probability that an individual will suffer from colorectal cancer Wherein the one or more nucleic acid sequences have a sequence selected from the group consisting of the nucleic acid sequences set forth in SEQ ID NOS: 1 to 11, and the use of no more than seven specific binding partners for the seven expression products of the sequence.

특히, 본 발명은 서열번호 1; 서열번호 2, 서열번호 3 또는 서열번호 4; 서열번호 5 또는 서열번호 6; 서열번호 7 또는 서열번호 8; 서열번호 9; 서열번호 10; 및 서열번호 11에 기재된 서열을 갖는 7종의 핵산 서열의 7종의 발현 생성물에 대해 특이적인 7종의 특이적 결합 파트너의 조합물의 용도에 관한 것이다.In particular, SEQ ID NO: 2, SEQ ID NO: 3 or SEQ ID NO: 4; SEQ ID NO: 5 or SEQ ID NO: 6; SEQ ID NO: 7 or SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; And the use of a combination of seven specific binding partners specific for the seven expression products of the seven nucleic acid sequences having the sequences set forth in SEQ ID NO: 11.

특이적 결합 파트너는 1종 이상의 혼성화 프로브, 1종 이상의 혼성화 프로브 및 1종 이상의 프라이머, 1종 이상의 혼성화 프로브 및 2종의 프라이머, 또는 1종 이상의 특이적 리간드 또는 2종 이상의 특이적 리간드, 예컨대, 항체 및/또는 친화성 단백질을 포함한다.The specific binding partner may comprise one or more hybridization probes, one or more hybridization probes and one or more primers, one or more hybridization probes and two primers, or one or more specific ligands or two or more specific ligands, Antibodies and / or affinity proteins.

실시예Example

I) 재료 및 방법I) Materials and Methods

1. 환자 및 샘플 수집1. Patient and sample collection

161명의 결장직장암 환자들(CRC) 및 148명의 결장경검사 음성 대조군 환자들(CNC)로부터 말초혈 샘플을 2006년과 2010년 사이에 수집하였다. CRC 환자들은 중국에 소재하는 FDUSCC의 결장직장 수술실에서 동원되었다. 종양들을 국제암연맹(UICC) 및 권장된 종양-결절 전이(TNM) 시스템에 따라 분류하였다. 수술 전 방사선치료 또는 화학치료를 받은 환자는 없었다. 유전성 결장직장암 또는 염증성 장 질환(크론병 또는 궤양성 결장염)을 앓는 환자들은 이 연구로부터 배제되었다. 결장경검사에 의해 확인된, 용종 또는 결장직장암의 임의의 증상을 갖지 않는 CNC는 상하이 지역의 지역병원 및 FDUSCC로부터 등록되었다. 각각의 환자에 대해, 2.5 ml의 말초혈을 팩스진(PAXgene: 상표명) 혈액 RNA 튜브(프리어날리틱스 게엠베하(PreAnalytiX GmbH), 스위스 홈브레흐티콘 소재) 내로 수집하고 제조자의 지침서에 따라 프로세싱하였다.Peripheral blood samples from 161 colorectal cancer patients (CRC) and 148 colonoscopy negative control patients (CNC) were collected between 2006 and 2010. CRC patients were mobilized in the colon rectus operation room of FDUSCC in China. Tumors were grouped according to the International Cancer Federation (UICC) and the recommended tumor-node metastasis (TNM) system. No patients received preoperative radiation or chemotherapy. Patients with hereditary colorectal cancer or inflammatory bowel disease (Crohn's disease or ulcerative colitis) were excluded from this study. CNCs without any symptoms of colon or colorectal cancer confirmed by colonoscopy were registered from local hospitals and FDUSCC in the Shanghai area. For each patient, 2.5 ml of peripheral blood was collected into a PAXgene (TM) blood RNA tube (PreAnalytiX GmbH, Hombruck, Switzerland) and processed according to the manufacturer's instructions Respectively.

본 연구는 2개의 분리된 참여자 집단을 포함한다. 집단 1은 100명의 CRC 환자들 및 100명의 CNC 환자들로 구성된다. CRC 환자의 경우, 수술 전 결장경검사로부터 1주 이상 후 수술 전에 FDUSCC에서 혈액 샘플을 수집하였다. CNC 환자의 경우, 결장경검사 1주 전에 상하이 지역의 지역병원에서 혈액 샘플을 수집하였다. 이들 샘플들로부터의 유전자 발현 프로파일을 훈련 세트로서 분석하여 CRC와 관련된 유의한 유전자를 검색하고 분자 시그너처(signature)를 확인하였다. 집단 2는 61명의 CRC 환자들 및 48명의 CNC 환자들을 포함한다. 샘플을 집단 1과 동일한 방식으로 수집하였다. 집단 2는 집단 1에서 관찰된 시그너처 성능을 검증하기 위한 독립적인 시험 세트로서 사용되었다.This study includes two separate groups of participants. Group 1 consisted of 100 CRC patients and 100 CNC patients. For CRC patients, blood samples were collected from FDUSCC prior to surgery after 1 week or more from preoperative colonoscopy. For CNC patients, blood samples were collected at local hospitals in Shanghai one week prior to colonoscopy. The gene expression profiles from these samples were analyzed as a training set to search for significant genes associated with CRC and to confirm the molecular signature. Group 2 included 61 CRC patients and 48 CNC patients. Samples were collected in the same manner as group 1. Group 2 was used as an independent set of tests to verify the signature performance observed in Group 1.

2. 2. RNARNA 추출 및  Extraction and 마이크로어레이Microarray 실험 Experiment

팩스진(상표명) 혈액 RNA 시스템(프리어날리틱스)을 제조자의 설명서에 따라 사용하여 총 RNA를 추출하였다. 총 RNA의 양을 광학 밀도 260 nm에서 분광광도계로 측정하였고, 바이오어날라이저 아질런트(BioAnalyzer Agilent) 2100(아질런트 테크놀로지스(Agilent Technologies), 미국 캘리포니아주 팔로 알토 소재) 상에서 RNA 6000 나노 랩칩(LabChip: 등록상표명) 키트를 사용하여 상기 RNA의 질을 평가하였다. RNA 온전성(Integrity) 지수가 7 내지 10인 샘플들만을 분석하였다. 그 다음, WT-오베이션(Ovation: 상표명) RNA 증폭 시스템(누진 테크놀로지스 인코포레이티드(NuGEN Technologies Inc.), 미국 캘리포니아주 산 칼로스 소재)을 제조자의 표준 프로토콜에 따라 사용하는 리보-스피아(Ribo-SPIA: 상표명) 기술을 이용하여 50 ng의 총 RNA를 단일 가닥 cDNA로 역전사하고 비례적으로 증폭하였고, 생성물을 퀴아퀵(QIAquick: 상표명) PCR 정제 키트(퀴아젠 게엠베하(QIAGEN GmbH), 독일 힐덴 소재)로 정제하였다. 그 후, 2 ㎍의 증폭되고 정제된 cDNA를 RQ1 RNase 무함유 DNase(프로메가 코포레이션(Promega Corp.), 미국 위스콘신주 피츠버그 소재)로 단편화하고 말단 트랜스퍼라제(로슈 다이아그노스틱스 코포레이션(Roche Diagnostics Corp.), 미국 인디애나주 인디애나폴리스 소재) 및 진칩(GeneChip: 등록상표명) DNA 표지 시약(아피메트릭스 인코포레이티드(Affymetrix Inc.), 미국 캘리포니아주 산타 클라라 소재)을 사용하여 바이오티닐화된 데옥시뉴클레오사이드 트라이포스페이트로 표지하였다. 표지된 cDNA를 50℃에서 분 당 60회 회전의 혼성화 오븐 640(아질런트 테크놀로지스)에서 진칩 HG U133 플러스 2.0 어레이(아피메트릭스) 상에 18시간 동안 혼성화하였다. HG U133 플러스 2.0 어레이는 약 39,000종의 가장 잘 특징규명된 인간 유전자들을 대표하는 54,675개의 프로브 세트를 함유한다. 혼성화 후, 상기 어레이를 세척하였고 진칩(등록상표명) 플루이딕스 스테이션(Fluidics Station) 450(아피메트릭스)을 이용하여 아피메트릭스 프로토콜 EukGE-WS2v4에 따라 염색하였다. 상기 어레이를 진칩(등록상표명) 스캐너 3000(아피메트릭스)으로 스캐닝하였다.Total RNA was extracted using the Fax Jean (TM) blood RNA system (free antigenic) according to the manufacturer's instructions. The amount of total RNA was measured spectrophotometrically at an optical density of 260 nm and analyzed on an RNA 6000 LabChip ™ instrument on a BioAnalyzer Agilent 2100 (Agilent Technologies, Palo Alto, CA) Lt; RTI ID = 0.0 &gt; RNA &lt; / RTI &gt; name) kit. Only samples with an RNA Integrity Index of 7 to 10 were analyzed. Then, a WT-Ovation RNA amplification system (NuGEN Technologies Inc., San Carlos, Calif., USA) was used to construct a ribo- 50 ng of total RNA was reverse transcribed and amplified proportionally using single-stranded cDNA using SPIA: trademark technology, and the product was amplified using QIAquick (TM) PCR purification kit (QIAGEN GmbH, Material). Then 2 μg of amplified and purified cDNA was fragmented with RQ1 RNase-free DNase (Promega Corp., Pittsburgh, Wis., USA) and ligated with end-transferase (Roche Diagnostics Corp.). ) And Biotinylated Deoxyne (GeneChip®) DNA Marking Reagent (Affymetrix Inc., Santa Clara, Calif., USA) and GeneChip ™ DNA Marking Reagent (Affymetrix Inc., Indianapolis, Ind. Lt; / RTI &gt; triphosphate. The labeled cDNA was hybridized for 18 hours on a GeneChip HG U133 plus 2.0 array (ApiMetrix) at 60 ° C in a hybridization oven 640 (Agilent Technologies) rotating at 60 ° C per minute. The HG U133 plus 2.0 array contains 54,675 probe sets representing approximately 39,000 of the best characterized human genes. After hybridization, the arrays were washed and stained according to the Affymetrix protocol EukGE-WS2v4 using Jinchip (R) Fluidics Station 450 (Affymetrix). The array was scanned with a GinChip (registered trademark) scanner 3000 (ApiMetrix).

3. 통계학적 분석3. Statistical analysis

표준 아피메트릭스 질 조절 파라미터의 권고에 따라 마이크로어레이 데이터 질 조절을 수행하였다. 배경 보정, 변위치 표준화 및 중위수 분해(median polish) 요약과 함께 로버스트(Robust) 다중칩 평균 방법(RMA)(문헌[Irizarry RA et al., Biostatistics 20 3;4:249-64])을 이용하여 아피메트릭스 발현 어레이를 전반적으로 예비프로세싱하였다.Microarray data quality control was performed according to the recommendations of the standard apical quality control parameters. (Irizarry RA et al., Biostatistics 20 3; 4: 249-64) with background correction, side-position normalization, and median polish summarization Lt; RTI ID = 0.0 &gt; apiMetrix &lt; / RTI &gt; expression array.

집단 1 데이터의 경우, 과도한 신호 강도(2E14보다 log2(50) 이상 더 낮은)를 갖는 프로브 세트를 걸러내었다. 그 다음, 엔트레즈 유전자 데이터베이스(Entrez Gene Database)의 정보(문헌[Maglot D et al., Nucleic Acids Research 2007;35:D26-31])를 이용하여 생물학적 지식-기초 걸러내기를 수행하였다. 엔트레즈 유전자 ID 주석을 갖지 않는 프로브 세트를 제거하였다. 동일한 엔트레즈 유전자 ID에 맵핑되는 다수의 프로브 세트들의 경우, 상호 변위치 범위의 가장 큰 값을 갖는 프로브 세트만이 보유되었고, 나머지는 제거되었다. 2-단계 걸러내기 후, 다운스트림 분석을 위해 9,859개의 프로브 세트들을 보관하였다. 배치(batch) 효과의 가능성을 감소시키기 위해, 컴배트(Combat) 방법을 걸러내진 발현 데이터에 적용하였다(문헌[Johnson WE et al., Biostatistics 2007;8:118-27]). 상이하게 발현된 유전자(DEG) 분석을 마이크로어레이의 유의성 분석(SAM) 방법으로 수행하였다(거짓 발견율 = 0.05; 유형 = "쌍을 이루지 않은 2개의 클래스"; 검정 통계 = "t-통계"; 순열의 수 = 1,000)(문헌[Tusher VG et al., PNAS USA 2001,98:5116-21]). RFE-SVM 방법과 함께 5배 교차 검증 과정을 이용하여 유의한 유전자 선택 및 예측 모델 구축을 수행하였다. 훈련 세트의 200개 샘플들 중에서 160개의 샘플들을 무작위적으로 선택하여 학습 세트를 형성하였고, RFE-SVM에 의해 점수 매겨진 1종 내지 100종의 유전자에 이르는 상이한 크기를 갖는 예측 모델을 생성하였고, 나머지 40개의 샘플들을 사용하여 모델 성능을 평가하였다. 이 과정을 1,000회 반복하였다. 본 발명자들의 결과는 100종의 유전자-기초 SVM 예측 모델을 이용하여 최대 97% 정확성을 달성할 수 있다는 것을 암시하였다. 시그너처 크기 최적화는 예측 성능, 시그너처 복잡성 및 경제를 고려하였다. 최종적으로, 본 발명자들은 본 발명자들의 표적 성능을 충족시키는 7종의 핵심 유전자들을 전체 90% 정확성으로 확인하였다. 상기 7종의 유전자들은 t-검정 P 값, 배수 변화, 생물학적 기능 및 연령 또는 성별 인자와의 비-관련성에 의해 선택되었다.For group 1 data, a set of probes having excessive signal intensity (lower than 2E14 log2 (50) lower) was filtered out. Then, biological knowledge-based filtering was performed using information from the Entrez Gene Database (Maglot D et al., Nucleic Acids Research 2007; 35: D26-31). A set of probes without an Entreze gene ID annotation was removed. For a plurality of probe sets mapped to the same entrease gene ID, only the probe set with the largest value of the mutual position range was retained and the rest was removed. After two-step filtering, 9,859 probe sets were stored for downstream analysis. To reduce the possibility of a batch effect, the Combat method was applied to the filtered expression data (Johnson WE et al., Biostatistics 2007; 8: 118-27). Analysis of differentially expressed genes (DEG) was performed by the significance analysis (SAM) method of microarray (false detection rate = 0.05; type = "two unpaired classes"; test statistic = "t- statistic" Number = 1,000) (Tusher VG et al., PNAS USA 2001,98: 5116-21). Significant gene selection and prediction models were constructed using the 5-fold cross-validation procedure with the RFE-SVM method. Of the 200 samples in the training set, 160 samples were randomly selected to form a learning set and a prediction model with different sizes ranging from 1 to 100 genes scored by RFE-SVM was generated, Forty samples were used to evaluate the model performance. This process was repeated 1,000 times. Our results suggest that up to 97% accuracy can be achieved using 100 gene-based SVM prediction models. Signature size optimization takes into account prediction performance, signature complexity and economy. Finally, the present inventors confirmed seven core genes that meet the target performance of the present inventors with a total accuracy of 90%. The seven genes were selected by t-test P- value, multiples of variation, biological function and non-relevance to age or gender factors.

II) 결과II) Results

1. 결장직장암 환자 집단 및 대조군 환자 집단의 특징규명1. Characterization of colorectal cancer patients and control patients

두 집단의 309명의 참여자들 중에서 161명의 CRC 환자들 및 148명의 CNC 환자들이 있었다. 상기 환자들의 인구통계학적 특징 및 임상적 특징은 표 1에 요약되어 있다.Of the 309 participants in the two groups, 161 had CRC and 148 had CNC. The demographic and clinical characteristics of the patients are summarized in Table 1.

Figure 112013096114582-pct00001
Figure 112013096114582-pct00001

2. 7종의 유전자 CRC 생체마커 패널: 확인 및 검증2. 7 genetic CRC biomarker panels: identification and validation

훈련 세트: 본 발명자들은 5배 교차 검증 과정에 기초하여 유의한 유전자 선택 및 예측 모델 구축을 수행하였다. 상기 과정을 1,000회 반복하여 실시하였다. 각각의 반복실시 내에서, 본 발명자들은 독특한 상위 7종의 유전자 세트, 및 내부 시험 배수(fold)에 의해 평가된 이의 상응하는 예측 모델 성능을 기록하였다. 최종적으로, 상기 내부 시험 배수에서 1,000개의 예측 모델의 평균 성능을 취하여 전체 성능을 추정하였다. 결과는 예측 모델을 이용하여 90.0%의 전체 정확성 성능을 달성할 수 있다는 것을 보여준다. 본 발명자들은 가장 우수한 7종의 유전자 예측 모델(이에 대한 90.0% 정확성, 89.0% 민감성 및 훈련 세트에 대한 91.0% 특이성)을 선택하였다.Training set: We performed significant gene selection and prediction model construction based on a 5x cross validation procedure. The above procedure was repeated 1000 times. Within each iteration, we recorded a unique set of seven top genes and their corresponding predictive model performance evaluated by internal test folds. Finally, the overall performance was estimated by taking the average performance of the 1,000 prediction models in the internal test run. The results show that a full accuracy performance of 90.0% can be achieved using the predictive model. We chose the seven best gene prediction models (90.0% accuracy, 89.0% sensitivity, and 91.0% specificity for the training set).

시험 세트: 본 발명자들은 109개의 샘플들(61개의 CRC 샘플들 및 48개의 CNC 샘플들)을 포함하는 독립적인 집단(시험 세트)의 훈련 세트에서 확인된 상기 예측 모델의 시그너처의 성능을 검증하였다. 이 시그너처의 전체 성능은 83.0%(CI%: 73.9, 88.9) 정확성, 84.0%(CI%: 71.5, 91.4) 민감성 및 81.0%(CI%: 66.9, 86.6) 특이성이다. Test set: We verified the performance of the signature of the predictive model identified in a training set of independent groups (test sets) containing 109 samples (61 CRC samples and 48 CNC samples). The overall performance of this signature is 83.0% (CI%: 73.9, 88.9) accuracy, 84.0% (CI%: 71.5, 91.4) sensitivity and 81.0% (CI%: 66.9, 86.6) specificity.

3. 훈련 세트로부터 관찰된 시그너처로부터의 개별 유전자의 차별적 성능의 분석3. Analysis of differential performance of individual genes from signatures observed from training set

하기 표 2는 상기 7종의 유전자의 개별 성능을 요약한다. 각각의 유전자에 대해 개별 특징, 예컨대, 프로브 세트_id(아피메트릭스 프로브 세트 식별), 100개의 CNC 샘플들과 100개의 CRC 샘플들 사이에 관찰된 T_검정 P 값, 및 100개의 CNC 샘플들과 100개의 CRC 샘플들 사이에 관찰된 배수 변화가 제공되어 있다.Table 2 summarizes the individual performance of the seven genes. For each gene, individual characteristics such as probe set_id (identifying the aprimetric probe set), T_check P value observed between 100 CNC samples and 100 CRC samples, and 100 CNC samples Multiple observed variations between 100 CRC samples are provided.

Figure 112013096114582-pct00002
Figure 112013096114582-pct00002

상향은 CRC 그룹에 대한 평균 신호가 CNC 그룹에서보다 높음을 의미한다.Upward means that the average signal for the CRC group is higher than for the CNC group.

하향은 CRC 그룹에 대한 평균 신호가 CNC 그룹에서보다 낮음을 의미한다.Downward means that the average signal for the CRC group is lower than for the CNC group.

*은 2010년의 아피메트릭스 주석 버전에 따른 프로브 세트_id를 의미한다(https://www.affymetrix.com/analysis/netaffx/xmlquery.affx?netaffx=netaffx4_annot&_requested=403680).* Means the probe set_id according to the 2010 apimetrix annotation version (https://www.affymetrix.com/analysis/netaffx/xmlquery.affx?netaffx=netaffx4_annot&_requested=403680).

**은 동일한 유전자와 이의 변수 또는 상기 유전자 또는 변수에 대한 관련된 서열을 의미한다.** means the same gene and its variable or related sequence for the gene or variable.

***은 100개의 CRC 및 100개의 CNC 어레이 실험에서 관찰된 평균 신호를 의미한다.
*** means the average signal observed in 100 CRC and 100 CNC array experiments.

SEQUENCE LISTING <110> Biomerieux <120> Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer <130> P2012813C <140> PCT/CN2012/072931 <141> 2012-03-23 <150> PCT/CN2011/072155 <151> 2011-3-25 <160> 11 <170> PatentIn version 3.3 <210> 1 <211> 22743 <212> DNA <213> Homo sapiens <400> 1 ggagttagcg acagggaggg atgcgcgcct gggtgtagtt gtgggggagg aagtggctag 60 ctcagggctt caggggacag acagggagag atgactgagt tagatgagac gagggggcgg 120 gctgggggtg cgagaaggaa gcttggcaag gagactaggt ctagggggac cacagtgggg 180 caggctgcat ggaaaatatc cgcagggtcc cccaggcaga acagccacgc tccaggccag 240 gctgtcccta ctgcctggtg gagggggaac ttgacctctg ggagggcgcc gctcttgcat 300 agctgagcga gcccgggtgc gctggtctgt gtggaaggag gaaggcaggg agaggtagaa 360 ggggtggagg agtcaggagg aataggccgc agcagccctg gaaatgatca ggaaggcagg 420 cagtgggtgc agggctgcag gagggccggg agggctaatc ttcaacttgt ccatgccagc 480 agcccctttt tttccagacc aagggctgtg aacccgcctg gggatgaggc ctggtcttgt 540 ggaactgaac ttagctcgac ggggctgacc gctctggccc agggtggtat gtaattttcg 600 ctcggcctgg gacggggccc aggccgggcc cagcctggtg gagcgtccag gtctgggtgc 660 gaagccaggc ccctgggcgg aggtgagggg tggtctgagg agtgatgtgg agttaaggcg 720 ccatcctcac cggtgactgg tgcggcacct agcatgtttg acaggcgggg actgcgaggc 780 acgctgctcg ggtgttgggg acaacattga ccaacgcttt attttccagg tggcagtgct 840 ccttttggac ttttctctag gtttggcgct aaactcttct tgtgagctca ctccacccct 900 tcttcctccc tttaacttat ccattcactt aaaacattac ctggtcatct ggtaagcccg 960 ggacagtaag ccgagtggct gttggagtcg gtattgttgg taatggtgga ggaagagagg 1020 ccttcccgct gaggctgggg tggggcggat cggtgttgct tgcctgcaga gagggtgggg 1080 agtgaatgtg cacccttggg tgggcctgca gccatccagc tgaaagttac aaaaatgctt 1140 catggaccgt ggtttgttac tatagtgttc ctcatggcga gcagatggaa ccgggagaca 1200 tggagtccct ggccagtgtg agtcctagca ttgcaggagg ggagaccctg gaggagagag 1260 cccgcctcaa ttgatgcctg cagattgaat ttccagaggc ttaggaggag gaagttctcc 1320 aatgttctgt ttccaggcct tgctcaggaa gccctgtatt caggaggcta ccatttaaag 1380 tttgcagatg agcttatggg gggcaatctt aaaaagtcca cagcagatgc atccggctcg 1440 aggggccatc agctttgaat aaatgcttgt tccagagccc atgaatgcca gcaggcaccc 1500 ctcctttcct ggggtaaagg ttttcagatg ctgcatcttc taaattgagc ctccggtcat 1560 actagttttg tgcttggaac cttgcttcaa gaagatccct aagctgtaga acattttaac 1620 gttgatgcca caacgcagat tgatgccttg tagatggagc ttgcagatgg agccccgtga 1680 cctctcacct acccacctgt ttgcctgcct tcttgtgcgt ttctcggaga agttcttagc 1740 ctgatgaaat aacttggggc gttgaagagc tgtttaattt taaatgcctt agactgggga 1800 tatattagag gaagcagatt gtcaaattaa gggtgtcatt gtgttgtgct aaacgctggg 1860 agggtacaag ttggtcattc ctaaatctgt gtgtgagaaa tggcaggtct agtttgggca 1920 ttgtgattgc attgcagatt actaggagaa gggaatggtg ggtacaccgg tagtgctctt 1980 ttgttcttgc ttcgtttttt taaacttgaa ctttacttcg ttagatttca taatactttc 2040 ttggcattct agtaagagga ccctgaggtg ggagttgtgg gggacgggga gaaggggaca 2100 gcttggcacc ggtcccgtgg gcgttgcagt gtgggggatg ggggtatgca gcttggcact 2160 ggtactggga gggatgaggg tgaagaaggg gagagggttg gttagagata cagtgtgggt 2220 ggtgggggtg gtaggaaatg caggttgaag ggaattctct ggggctttgg ggaatttagt 2280 gcgtgggtga gccaagaaaa tactaattaa taatagtaag ttgttagtgt tggttaagtt 2340 gttgcttgga agtgagaagt tgcttagaaa ctttccaaag tgcttagaac tttaagtgca 2400 aacagacaaa ctaacaaaca aaaattgttt tgctttgcta caaggtgggg aagactgaag 2460 aagtgttaac tgaaaacagg tgacacagag tcaccagttt tccgagaacc aaagggaggg 2520 gtgtgtgatg ccatctcaca ggcaggggaa atgtctttac cagcttcctc ctggtggcca 2580 agacagcctg tttcagaggg ttgttttgtt tggggtgtgg gtgttatcaa gtgaattagt 2640 cacttgaaag atgggcgtca gacttgcata cgcagcagat cagcatcctt cgctgcccct 2700 tagcaactta ggtggttgat ttgaaactgt gaaggtgtga ttttttcagg agctggaagt 2760 cttagaaaag ccttgtaaat gcctatattg tgggctttta acgtatttaa gggaccactt 2820 aagacgagat tagatgggct cttctggatt tgttcctcat ttgtcacagg tgtcttgtga 2880 ttgaaaatca tgagcgaagt gaaattgcat tgaatttcaa gggaatttag tatgtaaatc 2940 gtgccttaga aacacatctg ttgtcttttc tgtgtttggt cgatattaat aatggcaaaa 3000 tttttgccta tctagtatct tcaaattgta gtctttgtaa caaccaaata accttttgtg 3060 gtcactgtaa aattaatatt tggtagacag aatccatgta cctttgctaa ggttagaatg 3120 aataatttat tgtattttta atttgaatgt ttgtgctttt taaatgagcc aagactagag 3180 gggaaactat cacctaaaat cagtttggaa aacaagacct aaaaagggaa ggggatgggg 3240 attgtgggga gagagtgggc gaggtgcctt tactacatgt gtgatctgaa aaccctgctt 3300 ggttctgagc tgcgtctatt gaattggtaa agtaatacca atggcttttt atcatttcct 3360 tcttcccttt aagtttcact tgaaatttta aaaatcatgg ttatttttat cgttgggatc 3420 tttctgtctt ctgggttcca ttttttaaat gtttaaaaat atgttgacat ggtagttcag 3480 ttcttaacca atgacttggg gatgatgcaa acaattactg tcgttgggat ttagagtgta 3540 ttagtcacgc atgtatgggg aagtagtctc gggtatgctg ttgtgaaatt gaaactgtaa 3600 aagtagatgg ttgaaagtac tggtatgttg ctctgtatgg taagaactaa ttctgttacg 3660 tcatgtacat aattactaat cacttttctt cccctttaca gcacaaataa agtttgagtt 3720 ctaaactcat tagaattgtt gtattgctat gttacatttc tcgaccccta tcacattgcc 3780 ttcataacga ctttggatgt atcttcatat tgtagattta ggtctagatt tgctagctcc 3840 aagtaattaa ggccatgtag gagagcatgg taaccacaga tagaactggt attatcccaa 3900 gtggtctgca gactgctgag tggggatggg atctgctctc tgttgagagt tggtaatcat 3960 tggtttgaaa tgtgatgaaa ccactcaagc caatgaaggt gggtgtgtag gtggggagta 4020 ctttgccata atattttaaa acattacctg gttagagttc taagtggtac ttatttttgt 4080 ttggttaggg gaaagcctga ataaaaacag aaatggacac ataatatgca tattccatag 4140 tctttgggag gctggaatgt gcctgggatt tgggtctaag tgtatgcgta attcttacct 4200 cactaaagaa tttgccttgt ttttttcctt ttggtgagtg actaaaacgt ctgggcttcc 4260 ctgtgtgcgt gctacagtaa gcaagcagag gctgtgcaaa ggtgtgagca ggatcacgtg 4320 gaatctggag gatacatctt ggcttgcaaa ctgcctctgt ctcctgggtg ggactgttct 4380 gtccttgcac tgctgttctg tgttacctct tggggtgtaa ggttttgctt acaggagaca 4440 aactttgggc gtagaatgga agccactgcc agcctctgtg ctgagaagga aggtgcttgt 4500 ttcaaaggga gcagcaaggg aggcttgttc tactcacctg ggcctgtttg cctgagaagg 4560 ggagataagg gctgaactgg gactagccag ggggaccaac acaaatggtg ggggatcatg 4620 acctgaagga ttctttcctt cccatgagct gcagggctgg ttgccgtcct tgcaactgtg 4680 tcttatttgc ctgtgccgtt atatcttggt gacccctcca cgtgtacact actgacaaac 4740 gggtggagtg ctggggagaa gtcactgtgc cgcccaccta gtaaaccttc tgtctgtgct 4800 catggcatct ccaagatggg gcactgctgt gtgcagaatc cagggtcctc tttctgcttg 4860 caactccttt ccctggatgc cccagaaaca atccaggcct cctttcctat cttacccctt 4920 tgctttgctt tttaccccag cacctctata accgccttct cttcttttca gaactccttg 4980 tttctcgtcc tgttttttat gattacaaaa ctcttgcttc caccctggaa gataactgct 5040 atagatgcct gtatgtaaat ggtgctgtct ccagcaactg gcatgctgaa gaagaattga 5100 ttcacggggt ataaatgttg gggattggaa gtggggatga aatggcactt gttgatacag 5160 gagcagagag gtgaggccga ctgctgaaga cagctcgcca ccctccttgc ctccactcca 5220 atccaggggc tggggccaca ttctttgcct tcatttatcc tcagatcagg tgagatcgac 5280 aggaggtgtt gatggcagtg ccagcaatta ttgctaatcc gtttgcatcc ttatgcatag 5340 atctgaattc agactttgtg aatttccaga ggtgtgggta atataataga attcagtgag 5400 tgggcatggc tgatcttgtg caaattaaaa gttatggggc ataagaatag caaaagttga 5460 acttctttta aaaaggaaag taccctgaga gccagtattg gttgaggctc ttcagtatgc 5520 ccaggttggc agcactgaga accgcaggaa cggcctgttg ttacaaaaag gagattgact 5580 cagctgccct tggtgcatct gactgactat gactgctgag agattccaag gacccttaat 5640 gccagggcta acctctccat gtgcagtgag acctctggag gaagtgtcat cctctggctt 5700 tgtgtggtac tcattatggt gcagtgcggg catgaaatga agacacccaa ataggcttac 5760 agatacgata tgttttaaat gttcgtattt aacaaaaaca tactgacact gtttggaaat 5820 ggcaacagga agatagcaaa atgaatacta acattacgaa aagatgaaca ggtacatgtt 5880 ccaaggcagg tggctgtgaa cttcctctga gtgaaggcat cccctccagc acctttcagc 5940 ctgctagtta ggacgacccg ccgccaccct ccaggacctc cagccctgca ctgcctttcc 6000 tctcttttaa ataattcttc attgagttct aatatgtaaa aaaaaaaagt ttactgtaaa 6060 gtttgcaaat aaggaaattt tttttaaaag tcctcagtaa tcttaccagt aacaattgtt 6120 atgggcacat ttgcttttgg aagatttctt ttgtatgcat gggataagta catttttaaa 6180 caaaaatggg attatgccat aaattctatt ttgtgacttt aatatatagt gaacaccttt 6240 tttaatgatg acaggatgtt cccttgcatg gctgtatcaa tttaaacaat cttgtttcaa 6300 tgggcataca gggtattttc tagttttttt ttcctcttag aaaataatac ttgcgatgac 6360 tttccttgta gctcagactt tttcacgtct gttgttatct ctttgggaat gctgaataca 6420 tacatttcga gaaggaaatg actgttaaac tcttaagact tcaggttcat attgctaaac 6480 tgcccagcag ggagggattt tttcaattag tgttctcact ggtgaggcaa acctgatgcc 6540 ttcccctctt cctcagaacc ggctttatca cattgaaaac ctttgctcct ccgacggatc 6600 gagtctgctt tccctctgga tgtgagcatt gctttgtctg ctggtgactg aacatctcta 6660 ccttgtgtca attggccatt tgtggtgtgt gtgtgtgtgc gtgtgtgtgt gtgtgtgtgt 6720 gtatgatttt ctaattccta gtcatttttc tattgattgt tttgcaaaag ccatttacat 6780 cttaaggata ttgataatct tttgttatat ttgatgcaaa tatttttttc cagtttatag 6840 gttgcctttt aattttgtgt ttcaggtaga taaaagttaa acgattttct taggttagtt 6900 tatcactgtg gtttctgaac ttgttatgtg tagatctttt ccaccccaag agtacataaa 6960 tattaatcca tactttctta tggaacttgt atggtttcgt tttttacatt taaaccttct 7020 tccccgtggt gtgtgttgtg gaatctgtgt ttgtgtgagg aggggcatgg tgctctcaga 7080 acccacctcc tgtggccaga gagccctgtc ctgtgagggt ggttgtcaca gtggcagggt 7140 tcaattcaga agaccttgag ggcaggctga tgtttcctga atgggcccct ggttgttgct 7200 tgtccctgac tctccatttc cccatctgag tggatttgga cctaataggg cactggagct 7260 ggttcgaatc ctgactggac tacttggcaa ctttatgtct gggagcaagt tacttaacct 7320 ccccaagcct gtgtctgtga aatgcgggta aatgaatgta gatgtttggc agcagctact 7380 ccttgttgag ctctcacagt gaactctcct gcctctgccc tccttccccg cctcccctgg 7440 tgcctagcgt caggtctagc cacttcctcc tgggcccctc tcccttttct gtggctggct 7500 gcctgcccgc ctggcgctgg acctttcatg taacgggaat cagcatgtat attctggtct 7560 ggtctgtttc tacacttaat tttgtttcca gtagtatttc cctgtaccgg cagagttcac 7620 aaacacattt gaagaggctt tttctcagga ttcttaacct tcccaaagga agtcccatgg 7680 atgggtttct agaagtctat aaatgctctg aaattgtatt tttctgtgga aagcataact 7740 ttcatctgct tgttcgtgct caaaaaagat catgaatgaa tgattgcatg attttatgcc 7800 attgtgctta tactaaagga tatgtagccc atctcttgag ctgttaaact gttttgacta 7860 ctttaaatcg tgcagctgtg agcatctctg taaatttagt gtacacatgt atcccctgga 7920 gtggcattgc ctcggcagtg agcacttatg gttttataac tctcttcaca gactcaaatg 7980 actccagaaa gctacacttc ctgttgtgag tatatgatat ccatttccct acatagccac 8040 taacatcagg tttttacaat tttatttatt tcttgctact ttaagaaatt tttgtggtga 8100 aatacatata atagaagttg actatctgaa tcatttttaa gtatacattc agtagtgtta 8160 agtatgtcgc cattgttgta caaccaatct ccagaacttt ttcatcttgc aaaacaaact 8220 ctgtacccat taaataacat taaacattcc attccctcca gcctcagcaa ccccattcta 8280 ctttctgttt ctgtgagttt gactattcca agcacttcat atcagttaaa tcatgaagta 8340 tttgtctgtc tgtgactggc ttatttctct gagcacagtg tcctcgagat gcgtctatgt 8400 tgtagcatat gtcagaattt ccttcctttt taaaagatcc aaataatatt cttattttat 8460 atcttttttt tatccattca tccattagtg gacacttggg ttgcttttgg ctattgtaaa 8520 taatggtgct atgtacaaat atctatatta ttgtatttac aagtataatg ctgtaatgta 8580 cacacatctt tttgagatcc taccttcagt tcttttgagt atatagccag aagtggtatt 8640 actaaatctt acgatatttc tatttttaat ttattgagga accactgtag tttttcatag 8700 caactgcacc attttacgtt ctcaccaaga gtgcacaagg gttccgaggt tcccacatcc 8760 tccccaacac ttgttatttt ctgctttttt tagattgcag ccatcatagt gggtgtgagg 8820 tgacatttca ttgtggtttt gatttgcatt tccctaatga ggagtgatgc tgagcatctt 8880 ttcatatgct tactggtcat ttgtatgttg tctttggaaa aatgtctatt caagtccttt 8940 gactatttta aaaattgggt tattagagtt atcgttgttg ttgacttgta ggagtttctt 9000 tctatattct ggatattaat cccctatcag atatatgatt tgcaaatatc ttctcttatt 9060 ccataaggtt actttttcac tttgttgatt gtgttctttg atgtatagaa gtttttagtt 9120 ttgaaatagt ctaatttatc tgtttttact tttgtggtct gtgcttttgg tgtcatatcc 9180 aagaaatcct tgccaaatcc aacgttataa ggtactttta aggtatttta gttgtcttag 9240 tctatatttc tgtactcacc tttctttatc cactcatcag ttgatgggca tgtaggttgg 9300 ttccatatct ttgcaattct gaattgtgct atgatcaggt gtctttttag tataatgatt 9360 tactctcctt tgggtagata cccagtagtg ggattgctgg atcgaatggt ttttataatt 9420 ttctatttta ccacagtttc tctctgcatt tttcctcttt gaccactaac catgtgaaat 9480 tctcatattg acctttataa tgatcatgaa ctcttagtat cattgggaag gccacatttg 9540 ccacttatga ttgtaaacct tatcctccat ttttcctgtt attgttggtg caaaaagcac 9600 ctattatacc aggactttaa aaatcagtct gataagtctt tgataagtct aataataata 9660 actgataagt ccattgaatt tgcttctgat tactttttct ttagtagcta aacatgtatg 9720 tactcctatg attacaatga acactcctct ccatttaaat taattattta cattgatgaa 9780 atagcaaaat gttaatgact aaatactgtc ttggtttttt cgttccaggt cagtcaatat 9840 taacttctta taattttctt ttttttcttt atgtgtgtgt gtgtgtgtat tttttttttt 9900 ttaatttcaa tggcttttgg ggtacaaatg gcttttggtc atatagatga attctacagt 9960 agtgaagtct gagattttac tgcaccggtc acctgagtag tgtacattgt acccaatatg 10020 tggtttttta taccttgccc ccctcttacc ctccccactt tgagtctcta gtgtccatta 10080 tgtcactctg tatacctttt tgtacccata agttagctct cacttataag tgagaacaca 10140 cagtatttgg ttttccattc ctgagttgct tcacttagaa taatatcctc cagctccatc 10200 caaaattgct gcaaaaaaaa aaaaaaccac aaacattatt ttgttctttt ttattgctaa 10260 gtcatattcc atggtgtaga gataccacat tttatttatc cactcactgg ttgatgggtt 10320 ggttccacat ctttgcaatt gtgacttgta ctgccatcaa gtgtctttct ggtataatga 10380 cttcttttcc tttgggtaga tacccaggag tgggattgct agatcaaatg gttcttaaca 10440 ttttctctct ggatctattt ctggaaattt taggctccag tttttgttgt tgttgttaat 10500 aaaatgcaat ggaatgtaat gatcatcact tttcattatg ctttaaaatc tggtaaatgg 10560 aggctagaac actcctgtaa ggcaagaata ttctctctgt tggaactcaa atacacagaa 10620 ctgggtaaat ctcaatctta atctttgatt caggacacaa catggctctc ttttacttgc 10680 tttctttaat tgttttttaa taatgtggta agcatttctg aatctcctat ccaatacaaa 10740 aactaggaca atacagacag taactcctat ggttacaatg aacactcctc tccacttaaa 10800 ttaattattt acactgatga aattgaaata gcaaaatttt aatgactaaa tactgtcttt 10860 gattttttgt tccaggtctg tcaatattaa cttcttataa ttttcttttt ttttctttat 10920 gtgtgtgtgt gtgtgtgtat atatatatat ttaatttcaa tggcttttgg ggtacaaatg 10980 gcttttggtc atatatatga gttctacagt agtgaagtct gagattttac tacaccttcc 11040 acttatgtgg tcccacacca cccgcctccc ctgccgcctc ctgccacccc ctaggccaag 11100 gtaataatca tcctgaatcc tgggtttatc tctcacttgc tttcttttca tataattttg 11160 caaaagaatc tgatctaaat gtgtttttca gagtatatat ttatatttta gctgttctta 11220 gagaaaattt attattttgc atgtaatctt atggaacatt ctcatttaat accatggtaa 11280 gattcagccc ttgcccaggg gatagttcat ttagtttgtt tactggatag agctcatcat 11340 gtgactatac ctcagttagt ttatcagttc tcccatccat ggtgactagg ttgcctctca 11400 gcctctcaac aacactgttt ctcagtgtcc ttgtagaagt gatatgtggg tgttttctcc 11460 ttacacagag ttgaaaggtg acgacaacaa cgttggcact accaatcccc caccctccag 11520 aggggtaacc agtgttacca gtttgctgtg tttcctgcta cacctcgcct tattcacttc 11580 catttgtatc tgaaaaacgt gttgcatggt ttcttttcta tagaagtggt aaaatgctat 11640 tgtgtcctgt acattattga ttactttttt tcatttaaca gtagggagat gcctgggagt 11700 acacagagaa ctgccctcat tgttttcaac ttctgcactg tatgtctgtg agtttagcca 11760 ttctgctgtt aatggaaatt tacagtattc taatcttttg atattacaaa cagttctgtg 11820 cgatcatcgt catacacaac cccttgtgca caatgcatga gtgtttctca gggtaggtac 11880 caagaagtga aattcctggg tcatagggcg tgagtccgac atttttctcc attctgccct 11940 gttgccctcc agagtgggtg tccagctttg catacctaag tatgagagta tctgttgttc 12000 atatcctcta cgacgctcca tatatgaaac ttaagtttct gctagttgcc atctttgatc 12060 tatcatgtat gcagtgacct actaagactg taattggtac agtagattct tgtcatctgt 12120 gtgtgaattt agcattcatg ggcttaatgc tgacaaggcc cccagggtcc aagacatata 12180 atcatgtata attttgtcaa ggtataattt tttaaattgc ttttgtcatg tgtctgctgg 12240 tgatgcccaa cccagtgctc tgcacccagg tcacactgtg gctttgtcct ctgcttatgc 12300 ctgcattgca gcaactgtcc tgaagagacc aaaattatgc agatttaggt aagtccatgg 12360 ctaatgttat tatattatgt gctattgtaa tggatggggc tgtggagtgt atgaatttat 12420 aaatcactgg tcttgtaatt aaaattcaaa cactatagaa aaaggccatg tagaagataa 12480 aagttcctct ataatcccgg acccctaaga taactactaa tgacaacttc atttatattc 12540 cttcagacat tttctggctg tggatgtact aaaatgtatc ctattattct ctgccctaaa 12600 atggaatcat acaaggtgta ctgttatttt tatggctcta taacatgtca tattgtacgt 12660 gttggtatgg tcattttaac catttttcta gtgatggctt tgaggttatt tgcagtttcc 12720 tagccatctc aaagtgtgct gcggggatct cttttgcatc cctctgggtg cagagctgag 12780 gcacccagag gcagtgtcca gaggaggcag catctgtagg tgtcttcacc tgctctggct 12840 cttggcacat ctggttggtg acactgtttt gtgagatggg ttgaaagcac gtgctgccaa 12900 aatagaataa tgttggtcct ctcctcatgt gccgtggaac tggggtaaaa ctgcgtagtg 12960 gctgcagctg cctgtccata ccggaatcga gtataacacg gtgcctggct tagcacaaaa 13020 cagtagtggg tcctgcaggc cccagagtct aattcctggt attctttccc ctacacagat 13080 taaataaacc aaaaacaaac tattctagga aagcgtctgt gacatttgta aaaagtggta 13140 tttaatgatc ttttattcac ttgtctgttt agtttgttga aatcttaagt ggcatcctgg 13200 tctgggaagg agtgctgtct gcgcctgccc tccgctgggc acagcgtggc tgcttcaggg 13260 gctaagcaca cactttctgt cttctaaagg gccgccacat gccaggagct caggtgtgag 13320 cccggctctg gctcttacct catagggtca ctcatagggg cacagggagc agaacattgt 13380 acacagcgag gcaccacccg gcttggcatc tgcctcggtg gacttactac ctctagaagg 13440 aaatacctga gttcctctgg cctcagctcc tagagtgact ggtgtgctgt ccctgttact 13500 cttctgtcaa ggtgacaact gtgtgaccca tcatctgtgt gtcaaagcaa ggccctgcct 13560 gggcctctgc tcctgtgctg accccaaagg caaatgcttt gctagtttcc ttccagttaa 13620 tttcacctat gaatagatgt gtgaaaactg ttcaaagcca tacctgcaca tgtttgaact 13680 tcaaaccctg tgggtgattc agtggcatct ttctctaacc cccagcctcc cttcccacag 13740 aggccaccgt catggccagt tgctgcagtt tctttccaga gaacctgtgt atgtgtaaag 13800 ctgtacaggc gtgggtacac cacacagcct gtcttgcact gtggactgtt gagttactag 13860 tacatctagg taagcaccgc atatctgtat tcatgtctgc cttggtcttt tcaacatctg 13920 tgtggtagcc gtgtttgaat tacccattcc ctttttgggg aaccattaag ttgtttcagc 13980 aatttttact gtagataagg ctataccgca tatctgtgta catgggtttt tatgtacatg 14040 ggcaagtata tctgtgagag aaaagtttcc tcaggaggaa ttctgggcac agcatgtgta 14100 aatttctaaa tatgatggac acccccagct tccacctcaa ggaggttggt cccattgaca 14160 tttccccaca ccttcaccca ggctgtgccc ttaaacttgg ttatttgtca atgtgagaag 14220 tggaaaatag tatttaattg tagtttggat ttgtatttct attgggttgt atacttactg 14280 attaataata agagctcttt acatattaag gaaattaacc cttttcaaat acattcctat 14340 ttctcactaa tctttaagtt ttattgtaat attttgctct ttagtttata tatatatgta 14400 tatatatata tatgtatata tatatatata catatatata tacatatata tatactaatt 14460 ttcttttatg gttcctggat tttgtgagta gtttgaaaag gctaatccag ctgaagattt 14520 tgttgttgtt gttaaacccc atgttttctc ctaactcttt ttatttttat tttggaggac 14580 tctatctaga cttaatttta gcataacaag tgacagggtt agttagcctg ttgtccttac 14640 accattttct ggctaataca gctattaact attgatctgt ctattcacgt gccagttcct 14700 aatggtttta catagtgtaa tctgcacttc aaaatagcga agggaagccc tacctcatta 14760 ttctactttt ccagaattct cctggctatt ccaggctgca tgtttacctt aaccttccct 14820 gtgatgtctt catgccgttg tcttcttatg caagaataag gtacgtcttt ccatccactc 14880 acgtctattt aatttgactt tgcattacac agaaagctgg tcttggtctg tctacctcgg 14940 catctagttg tcctcactgc cccctagccg accccacccc atctgactga ctaccccatc 15000 acagagtact tttatttacg ttttgctctg cctaatggtt acttgatact gtcacgccga 15060 cagtgtccag ttcagtggtc tttgcagttg aaatgctccc gtacacactg tcttgttaaa 15120 aatgccagta agttcataca aacccagctt gcacccaagg tcacattcag agagcgtagg 15180 gctgggatgg gttgttttcc aagcttctgc cactgtgtgg ctagctcttc ccactgggaa 15240 gttctgtgta cccggaatgt cggagtggag tcctgttcta gtgtccagca cctgaccctg 15300 tgcccaaccc ctcaacagcc tattcctgct gtccacagcc tgctggaact ttttacaaaa 15360 tatgttgcca tgctggaccc tgggcactgg acataagccc cctggcagcc tttttcatgt 15420 cacccaaagg ggtaattgtc ctactggtgg tctgtaagat gagttagggt gacttgctaa 15480 tagacattgt aaatcttaat atttatgtat gtattttatt attaccggtt ttccatttat 15540 gatggtaata ttgtttcttc taagaatatt tatttttcct tctaaatatt gagataaaat 15600 tcatgctttt gaaatgttct attcagtggc ttttagtata tttgctatgt tgtgcaacca 15660 tcgacactat ccatttctag aactttttcg tcatcccaaa cagacgctct gtattcataa 15720 aaaaataact tcctacctgt ctctccccct agtctttggt aacctttgtt atactggtaa 15780 actttgttgt gctctctgtc tgtgtgaatt tgcctattct aggggcctca tataagtgta 15840 atcatacagt atttgtcttt ttgggtctgt ctgatttcac ttagcgggtt ttcagggttc 15900 attcatgttg cagcatataa cagtactgcg ttcctttttc tggctgaata atattccact 15960 gtatggatag accccatttt gtttattcac acatcatttg gacatttgga ttatttctgg 16020 tttttggcta ttatgaacaa tggtgctatg aacagttgcg tacaagtttt tgtgtgaaca 16080 tatgttttca attctctcat tatataccta ggagtagaat tactgggtca tatggtaact 16140 gtatattttt gaggaactgc caaactattt tcccacgtcc atgcaccatt tcacattccc 16200 accagtaagt aagagggttc caatttctgc gcattcttgc caacactagt tattatctga 16260 ctttctggtt ataatcattc taatgagtgt gaagtagcct ctggtgtcat ttggatttgc 16320 atttctctga tgagtgatgc tatcaagcac ctttgctggt gctgttggcc atatgtgtat 16380 gttccctgga gaagtgtctg tgctgagcct tggcccactt tttaattagg cgtttgtctt 16440 tttattactg agttgtaaga gttctttata tattctggat tctagaccct tatcagatac 16500 atggtttgca aatattttct cccattctgt gggttgtgtt ttcactttat cgataatgtc 16560 cttagacata taataaattt gtattttaaa agtgacttga tttggctgtg caaggtggct 16620 cacgcttgta atcccagcac tttgggagac tgaggtgggt ggatcatatg aggaggctag 16680 gagttcgagg tcagcctggc cagcatagcg aaaacttgtc tctactaaaa atacaaaaat 16740 tagtcaggca tggtggtgca cgtctgtaat accagcttct caggaggctg aggcacgagg 16800 atcacttgaa cccaggagga ggaggttgca gtgagctgag atcatgccag ggcaacagaa 16860 tgagactttg tttaaaaaaa aaaaaaagtg acttgattta agggaaaaaa tgactggcta 16920 tattcagtca gatatggcaa aaagtctcaa ggtgttaatg tgaatgatta aggtcttggg 16980 gggggtgtcc cctatcagac tacaggtgtt tagaggcaca gaaaaaggtg cagttgggtt 17040 cttaatgtga aatgatgaga agcacaactc cagtgtgtct ctttgtgtag aatgtcagca 17100 gacaccccct gctagatgtg ctggatcatg ggaaagcatt tccatttgtt actagattgt 17160 tcagaagttt taatttatga tgggtgtggt ggctcatgcc tgtagtccca gcactgtggg 17220 aggctgaggc aggaggatca tctgaggcca agagttcaag atcagcctgg gcaacatagt 17280 gataccctat ctcttaaaaa agaagaagtt tttaaatttg aaataataat aggtactgga 17340 tttatgcaaa tgtcttttct gcgtcttttg agatgagtat caggtttttt tttttccttt 17400 tatcatctga tgatgaactt aatgtttcca tttgtattaa tggaatacta agtccctctg 17460 tgatttctga accaagctat tcctaggcct gagttttatt ttgttgacac agaaataaat 17520 tagaaggcca agcgtggtgg catgtgcctg tagtcctagt tgctgaggta agaggattgc 17580 ttgagcccag gagttcaagg ctgcagcaag ctttgattgc gccactgcac tccagccttg 17640 gcgacagact aagacgctgt ctcaaaaaaa aacaaaaacg acaaaaaaaa aacaaaacag 17700 aaaaaataaa ctaaggcaat gacagtccct ggcaaatgct gggagggagg cagcagtggt 17760 cagggaaggt aaccctgaag caggacttgt aaagcaaata agattgggag gccaaggtgg 17820 gtggatcacg aggtcaggag ttcgagacca gcctggccaa catagtgaaa ccccgtcttt 17880 actaaaaata caaaaaaatt agccaggtgt ggtggtgggt gcctgtagtc ccagctactt 17940 gggaggctga ggcaggagaa tctcgaaccc aggaggcgga ggttacagtc agctgagacc 18000 gcaccattgc actccagcct gggtgacaga gcaagattcc gtctcaaaaa aaaaaaaaaa 18060 aaaaaaacca agaagaaaag gaatgaatta gaacttcttc tgcttggact taagggcatc 18120 atcaggcagg ttttgggtag gatagcaggg gaggcagaga catagtcggg gtcagtggtc 18180 atgagtgtgg ctttgagccc aaaaacttgg tttctgttcc ctactttgcc actcagtagt 18240 gcatgacttt ggccaaattt cttaaattca tgaagcaagt ttccgggtga atgaaatggg 18300 gataaaaata gtgttcaaac ctatccgttg gtttgtgtga aactgaaatg aatagtatcg 18360 tgcaggtact tgtgagcaag gggagctgct gtttcctgtc cctttatgat gggaaatatc 18420 tagacaagtt cccaaccctc tgcactgcag gctgcatggc acggagggtc ttgtaacacc 18480 agctggggct ggccttcttt taggagcttc agtggttctg aaaactttta tttgtttgtt 18540 tgttttagta gatgtggggt ctttctgtgt tgcccggact ggtctcaaac ttctggactc 18600 aagtgatcct cccccgctca acctcccaaa gtgttgggat tacaggtgtg agccactgtg 18660 cccagccttg aaaacttttt caggttcttc cagggttact gggctattaa atatttctat 18720 ttcattataa gtcagttttt caaagttata ttatcttaat tacctttttt atatgtatta 18780 gtgtagagta gcattttata ttttgatatc ctccttatgc atagtttttc actttttatt 18840 cctagttttt cgtttttaat aagactttca agaaatttat tttattggcc ttttgaaaaa 18900 agcagcttta gataaagtaa gcagttctgc tttcatttta taatttattt ctacttttgt 18960 ttcattaatc ttttcctccg gcatgccttg gattttgttg tgttactctt tttctagagg 19020 ctcgcattgt gtgtctggtt cacttatgat cacgcttgcc tacttttaag aatggaagag 19080 gggaggtgga gggtggctgc acagtcgagg gtgtgaggca gtcttgctct agccccacca 19140 tgccctcagc ccgctgtggc cacgctggtt cctcaattgc tggggcgtgc agtgtctgta 19200 agggaggcta ctgatgccat ccgaggaaga tgtaaggttt cgtgtgggca gcgagagcct 19260 agcaggcatg tggggtgccc agcaaagggt aacagtggac agttgttgcc tcattccaca 19320 gagttttgat tttttttttt tttttaatgg tcactccatc aacatccccc atggccagag 19380 cctgagctgg tccccagaga cacaggcatt cagctgacag cctcgccttc acgctgctgc 19440 tgttctcatg ggggacaggc ctcaggtggc aatgcacaaa tcattagtta agggcagttg 19500 tgacagttac caaggagtgt agtcccccgc cccccgccca gtgaaaacag ccctaaccag 19560 gggtggggac ctttgggctc tgacccgaag ggtaggagaa gctggaagga cagcattcct 19620 gtctgcgaag gcaggagcaa agctgccagg ctatgaagga aatggctgga gcctgaagtc 19680 atgcaagctg gggctggcag ggacagggcc aacttccagg cctgggggcc accatgagga 19740 ttcaggacgt gacccccagg gcacatgaag gccttccatc tgtatttaag aaaagacttt 19800 atcagacgag tatggtggct cacgcctgaa tcttagcact ttgggaggct gaggcaggtg 19860 gatcacgagg tcaggagttc aataccagcc tggccaatat ggtaaaaccc catctctact 19920 aaaactacaa aaattagcca ggcatggtgg cgcacgcctg tagtcccagc tactcgggag 19980 gctgaggcag aagaatcact tgaacccggg aggtggaggt tacagtgagc caagatcgcg 20040 ccactacact ccagcctggg tgacagagtg agactccgtc tcaaaaaaac caaaagactt 20100 tatcttattt cctatatgtt tgtggtttca gtcctgatgt ataatttgac cctagttaga 20160 atggttatct gaggaagtgg cctgtacgat ttctgctttt ttaaatgtgt ggctcccttt 20220 cttcattgat taacgtatga ttatttttat aaatgttcca tggcagtggg aagggattct 20280 ctgtcacatt ccacatctgg atcagttcct ccccattttg ttggtcaaat ccgatctgcc 20340 atatcctgtg taatgacaag tgagttgcat tctcaccgtc actcctgggg tctctccgct 20400 tcccctgagc tggctcagca gtctgctcca tgtgttttga tgcagggtga cccattggta 20460 ttcccgacac taacgccccc gtctgtggac tgcttgctgc ttgggcttca ctgtgtctgg 20520 tgttgacagt gcagacctaa aggtgtgcac acatgtgcac acacactccg ctgtcttctt 20580 gtttgcactg gacttaaata tctatgaggg ttattttcaa ctgctgaatt tggaatgatt 20640 tttatatctt ttctgctttc tgcccatgta catgtgttta ttttacactg ttgtgattgg 20700 tagttactat gtggggacac aattacttgg gctgaaataa tccacctgtt gtggttgggg 20760 tcctctgggg cattccaggg tgagaggttg tcactgccac ctgggccatg tgggccggca 20820 ccagcatttt gtggttacga attctacagt cacaaatatc tttgggcaaa tccccttcta 20880 tacctcaagg cagcttttgg tttgcaaccc cactggccag agggaagggc cagtcacttg 20940 gctctctcac tgccctgcgc cccagatggt tctagggctg ctgttttccc ttggccctgc 21000 caacaccact gtttttactt ctgctcattg gctgagtgca gtggttcctg gaagccagtg 21060 gcacgtttcc ccgcgtagct cgcttatccc acagcacaca cccaagggtt ctgttgctaa 21120 cacgctgaat taattctttg ctcatcttac agagtgtgtt ttgactgccc ccatttctga 21180 ggccttgtaa ggccagagct ttgttgcttc atcggcaggt tgggacttag atggccgtga 21240 atgtttcctc tctgctgctg cagtaagtaa gtgcccgcac catagtgtgt ttggaggctg 21300 aagttgaagc gaggctgtga ggggagatgg acgtgtgagg agggatgatg gggcttgagc 21360 aaagtggggg agggggcaaa ggcagttggc ccaacacatt ccccacccct ttgagaggtc 21420 tgaggcctgc agacctggct cggagcccac ctggtagtcc tcagactgtg tgtgtgtgtg 21480 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtaaaag agagaagttg 21540 tggagaaatg gggggctgat tctgctcaga ttcatcagga tgagtagaag gcacccagct 21600 ctcaccctgg cctgacatgt gtgtccctga gcaggttaca gtcctctctg agcctctgct 21660 tcccatctgg accctgctgg gcagggcttc tgagctcctt agcactagca ggaggggctc 21720 caggggccct ccctccatgg cagccaggac aggactctca aatgaggaca gcagagctcg 21780 tggggggctc ccacggaccc gccgtgggcc caggggaggc agagcctgag ccaacagcag 21840 tggtgctgtg gaccgtggat cctgagggtg gcctggggca agtaccggct gagggtccag 21900 gtgggctttg tgtacctttg ggtcctgggg ccctggtgac ttggactcca ggttagagtc 21960 aagtgacagg agaaaggctg gtggggccct gtgcttccga cttcatttcg agtgatggca 22020 gttcccagga aggaatccac agctgacggt ggctgacaga tcagagaatg gaaggcgagg 22080 caggcgggcg tctgcgtgac ctcaggtgct tggggcccag cagacccaga gaaccatttc 22140 cactaggcca gggtgccgga agtgtccaca ggtcttagat tccctgttca gatgaaaaga 22200 tttgtgcctt taatgataaa agtgatctgc atagagtcaa aaattcaagc catgggtata 22260 aaatgcaagt aaaatccctg ccctcaccta tcccacccta ctacacagag atgtcctctc 22320 gagtttccta gactcactct ggaaatttct gtatacacac agaagcttgt gcctctgctc 22380 gtgaaggcag agggagggag agctgaaggg ccagcacctt ctcacctgtg ggccccctca 22440 gtgctcggtc ccagagcatg caggactgtg cctcgtgttc agtttgctgg tctgacttca 22500 tgctccttgg gcaggatatg catgtgccat gctaggagac atgtggatgt gaagctgggg 22560 gacaatgtcc cctggctatg cctttacaag ggaagtaagg aaggtaggag gtgagcctgg 22620 gagggaggga gggaggcgcg gagccgccgc aggtgtttct tttactgagt gcagcccatg 22680 gccgcactca ggttttgctt ttcaccttcc catctgtgaa agagtgagca ggaaaaagca 22740 aaa 22743 <210> 2 <211> 4833 <212> DNA <213> Homo sapiens <400> 2 gtgaactgtt gcaccgtgca attgcacact ataaatgtct ttccttatct gtgtgtactc 60 ttatctcact gttctatttt ttctcctcat ttatattaac tctttcttac ctttttttct 120 gaacttctag gccttctctt tccagaactg gtggaagaca aatgaaacgg ccaagatggt 180 aagaaacaag ccgcatttct ccttggggag actgataatt taaaaggttt gttgtgtcag 240 aaacattccc agcttcatca ccaacccttt ccttccacct ctgcccactg gagaccactt 300 atatcccgaa gcggacgcgg cagctgaagt caggaaacca tgcatcacat tagcaggagc 360 caactgcaga ctttaaactc cgttcaacat gtggatgcgg cagagaaatg acctgtccag 420 acaagccggg gcagctcata aactggttca tctgctccct gtgcgtcccg cgggtgcgta 480 agctctggag cagccggcgt ccaaggaccc ggagaaacct tctgctgggc actgcgtgtg 540 ccatctactt gggcttcctg gtgagccagg tggggagggc ctctctccag catggacagg 600 cggctgagaa ggggccacat cgcagccgcg acaccgccga gccatccttc cctgagatac 660 ccctggatgg taccctggcc cctccagagt cccagggcaa tgggtccact ctgcagccca 720 atgtggtgta cattacccta cgctccaagc gcagcaagcc ggccaatatc cgtggcaccg 780 tgaagcccaa gcgcaggaaa aagcatgcag tggcatcggc tgccccaggg caggaggctt 840 tggtcggacc atcccttcag ccgcaggaag cggcaaggga agctgatgct gtagcacctg 900 ggtacgctca gggagcaaac ctggttaaga ttggagagcg accctggagg ttggtgcggg 960 gtccgggagt gcgagccggg ggcccagact tcctgcagcc cagctccagg gagagcaaca 1020 ttaggatcta cagcgagagc gccccctcct ggctgagcaa agatgacatc cgaagaatgc 1080 gactcttggc ggacagcgca gtggcagggc tccggcctgt gtcctctagg agcggagccc 1140 gtttgctggt gctggagggg ggcgcacctg gcgctgtgct ccgctgtggc cctagcccct 1200 gtgggcttct caagcagccc ttggacatga gtgaggtgtt tgccttccac ctagacagga 1260 tcctggggct caacaggacc ctgccgtctg tgagcaggaa agcagagttc atccaagcag 1320 cagcagcagc gtgtctttcc atgcgcttgg cattctttat tttcccagcc tgggaggata 1380 tgagagttcc agggaaatgc tgtattggac atgcaagact cacctgggga acttatcagc 1440 agttgctgaa acagaaatgc tggcagaatg gccgagtacc caagcctgaa tcgggttgta 1500 ctgaaataca tcatcatgag tggtccaaga tggcactctt tgattttttg ttacagattt 1560 ataatcgctt agatacaaat tgctgtggat tcagacctcg caaggaagat gcctgtgtac 1620 agaatggatt gaggccaaaa tgtgatgacc aaggttctgc ggctctagca cacattatcc 1680 agcgaaagca tgacccaagg catttggttt ttatagacaa caagggtttc tttgacagga 1740 gtgaagataa cttaaacttc aaattgttag aaggcatcaa agagtttcca gcttctgcag 1800 tttctgtttt gaagagccag cacttacggc agaaacttct tcagtctctg tttcttgata 1860 aagtgtattg ggaaagtcaa ggaggtagac aaggaattga aaagcttatc gatgtaatag 1920 aacacagagc caaaattctt atcacctata tcaatgcaca cggggtcaaa gtattaccta 1980 tgaatgaatg acaaaagaat cttctggcta gggtgttaga tatatttatg catttttggt 2040 tttgttttta aatcaagcac atcaacctca agcccgttta gcaatgaggc agtgtagatg 2100 aatacgtaaa ataaatgact ttaaccaagt agctataatg ggacttagca ctgtatgcat 2160 acttaaaaag gttttgaaaa acaaactact tgagaaatat ttgtttatat ttttctctaa 2220 catcatgcta tgtgtcagtc tgaacatctg acaacagaaa tttcagttat tattctagct 2280 aagttttgaa aacatttgtc atgctgttta atagaaaact gcaaaccaga gacactgact 2340 ccattaataa accatatttt gtgccgtttt gactgttctg accaaatact aatgggaaca 2400 attcttgacg tttttctgtt gctgattgtt aacatagagc agtctctaca ctaccctgag 2460 gcaactctac attggaacac tgaggcttac agcctgcaag agcatcagag ctgaccatac 2520 atttaaacag aaatgctggt ttatttgcaa aatcaccagt atattttcta ttgtgtctat 2580 aaaaaatcag tcatttaagt acaagaatca tattttccat tcctttttag aaatttattt 2640 tgttgtccct atggaaatca ttcacatctg acaatttata tgttaaagag ttttactctc 2700 tctattttgg tccaatttgt atctagtggc tgagaaatta aataattcta aagtatgaag 2760 ttacctatct gaaaatgtac ttacagagta tcattttaaa atggatgtct ctttaaaaat 2820 tttgttactt ttaccaacaa tgtaatataa tttatgtata ttttattaat aatagtgaat 2880 tccttaaaat ttgttctatg tacttatatt taatttgatt taatggttac tgcccagata 2940 ttgagaattg gttcaaatat tgagtgtgtt tcaatatatt atctggctta tttcaacatg 3000 agtaatatga gcaaaataag ttaaaacctg cgtctgatca attttcctca tgactagaac 3060 taaaacagta aatttggaca atattaagcc tcaaataatc atctccaaac tccttctaac 3120 actttttaaa tcagattgga agacatggac aaatcaggtt catgtgttgc atctttatgt 3180 cctttgccaa tatccaagat catcacatat ggtagatatt cacatggagt ttcaaattca 3240 gaatagatta ccattacctt cctgccctta cacatcctac tccttattta aaagttctat 3300 ttgtgacttt tcatttcctg aaagtttaaa aatacaattt gagaatgttt ataatacatt 3360 ctctcctgtc ttttcacggt tacgtctgtt attgctgaaa tacaccacat tttctttgtt 3420 ctggtcaagg ttaactcaat atctgtgtga aagagaacta ctaacaacgt tacaatagag 3480 gctagatttg aaaaaaaaaa tctatagatc taattgatac aattgtagaa caaaatgtca 3540 aaataatgtt ttaagtataa gagaagatgg accaaggaga gagagatcat ttgaaaatct 3600 aattgtagct tttctaggct cacattcatg tactactttt agcaccctta tgggctgtgc 3660 tcgccccctg gacagttgag ctttggatta tcttcctctt caattttccc tctattgacc 3720 cgagtgtctc cctctgcttc tacagattta tagtactcct tggctctttt gagtctccac 3780 ttttactcac tgtctctggg atttttaaga tccttttctt ctcttataaa tcatcctctt 3840 aatgaaaatt agcctaacaa aagtttggag actggaatcc tactttgagc cactgacttg 3900 aaataactct tttggcaagt tgcctgacat cctgtcttac caaggtggca tatttgcatt 3960 tttactgctt aaaacatttt ttttttttta ccatctttat ccaaatttat catattgatg 4020 gtaggactaa caggcttttt agaagctggc tttaactttg agtctcaagc tacaatgctg 4080 ttgggcagcc tggtcttccc acgtgagggt ttaactttgt ttatttgcct ccagttattc 4140 caaaatgctt attaaatgaa agtcccagga acatgtttat tttagtcacc tttgcttttt 4200 aacaattttg ttttgtaatc aatgagtaat tcatgatgaa ttatttttga ctaatggata 4260 gccgaaggcc aggcttttaa ttctaatagg taatgttctt cttttgtctt attgaaacaa 4320 tgagaatact ctgtgcattt caaatgcact ccgattatgc tgtggtttta ttcacataag 4380 cacaatatgt gttttattta taacttcata acaaacttat aatataataa tttaccttag 4440 cagacatgca aaagcttatt cttgtgtgac ttactttctt taagctaata atataaaaat 4500 aaatatgtat cttaaaaatc tataataaaa cattagaaat taaagatatg tgctttttat 4560 tttgcagatg agttcatttg cttttgtaga tgtgttttca gagctaggta cagaggaatg 4620 tttgctacct ttagcggtga aaaaagaaag agagtcaaga attttgttgg attgtgtttg 4680 tgtgtgcata tatttgatat catcattata tttgtaatct ttggacttgt aatcatagcc 4740 tgtttattct actgtgccat taaatatact ttaccttata cataacgaat aaaataccta 4800 gaagtagatt tatttacaaa aaaaaaaaaa aaa 4833 <210> 3 <211> 4854 <212> DNA <213> Homo sapiens <400> 3 gtttttaaaa gctttgtatc tcttaaaacc atgcagcagt cagtttccaa gttttgcttt 60 gcaatcagta gttttcaagg gagcttttaa agctgaactg aaatgtttga aatgtggaac 120 actcttgacc atgaaatatg ttctacttac atgcctcagc ctttaaaagt tctttgcatt 180 agagtcaagg attacattct tcctggagcc aagcatgggg ccagctgtaa acaagccgca 240 tttctccttg gggagactga taatttaaaa ggtttgttgt gtcagaaaca ttcccagctt 300 catcaccaac cctttccttc cacctctgcc cactggagac cacttatatc ccgaagcgga 360 cgcggcagct gaagtcagga aaccatgcat cacattagca ggagccaact gcagacttta 420 aactccgttc aacatgtgga tgcggcagag aaatgacctg tccagacaag ccggggcagc 480 tcataaactg gttcatctgc tccctgtgcg tcccgcgggt gcgtaagctc tggagcagcc 540 ggcgtccaag gacccggaga aaccttctgc tgggcactgc gtgtgccatc tacttgggct 600 tcctggtgag ccaggtgggg agggcctctc tccagcatgg acaggcggct gagaaggggc 660 cacatcgcag ccgcgacacc gccgagccat ccttccctga gatacccctg gatggtaccc 720 tggcccctcc agagtcccag ggcaatgggt ccactctgca gcccaatgtg gtgtacatta 780 ccctacgctc caagcgcagc aagccggcca atatccgtgg caccgtgaag cccaagcgca 840 ggaaaaagca tgcagtggca tcggctgccc cagggcagga ggctttggtc ggaccatccc 900 ttcagccgca ggaagcggca agggaagctg atgctgtagc acctgggtac gctcagggag 960 caaacctggt taagattgga gagcgaccct ggaggttggt gcggggtccg ggagtgcgag 1020 ccgggggccc agacttcctg cagcccagct ccagggagag caacattagg atctacagcg 1080 agagcgcccc ctcctggctg agcaaagatg acatccgaag aatgcgactc ttggcggaca 1140 gcgcagtggc agggctccgg cctgtgtcct ctaggagcgg agcccgtttg ctggtgctgg 1200 aggggggcgc acctggcgct gtgctccgct gtggccctag cccctgtggg cttctcaagc 1260 agcccttgga catgagtgag gtgtttgcct tccacctaga caggatcctg gggctcaaca 1320 ggaccctgcc gtctgtgagc aggaaagcag agttcatcca agatggccgc ccatgcccca 1380 tcattctttg ggatgcatct ttatcttcag caagtaatga cacccattct tctgttaagc 1440 tcacctgggg aacttatcag cagttgctga aacagaaatg ctggcagaat ggccgagtac 1500 ccaagcctga atcgggttgt actgaaatac atcatcatga gtggtccaag atggcactct 1560 ttgatttttt gttacagatt tataatcgct tagatacaaa ttgctgtgga ttcagacctc 1620 gcaaggaaga tgcctgtgta cagaatggat tgaggccaaa atgtgatgac caaggttctg 1680 cggctctagc acacattatc cagcgaaagc atgacccaag gcatttggtt tttatagaca 1740 acaagggttt ctttgacagg agtgaagata acttaaactt caaattgtta gaaggcatca 1800 aagagtttcc agcttctgca gtttctgttt tgaagagcca gcacttacgg cagaaacttc 1860 ttcagtctct gtttcttgat aaagtgtatt gggaaagtca aggaggtaga caaggaattg 1920 aaaagcttat cgatgtaata gaacacagag ccaaaattct tatcacctat atcaatgcac 1980 acggggtcaa agtattacct atgaatgaat gacaaaagaa tcttctggct agggtgttag 2040 atatatttat gcatttttgg ttttgttttt aaatcaagca catcaacctc aagcccgttt 2100 agcaatgagg cagtgtagat gaatacgtaa aataaatgac tttaaccaag tagctataat 2160 gggacttagc actgtatgca tacttaaaaa ggttttgaaa aacaaactac ttgagaaata 2220 tttgtttata tttttctcta acatcatgct atgtgtcagt ctgaacatct gacaacagaa 2280 atttcagtta ttattctagc taagttttga aaacatttgt catgctgttt aatagaaaac 2340 tgcaaaccag agacactgac tccattaata aaccatattt tgtgccgttt tgactgttct 2400 gaccaaatac taatgggaac aattcttgac gtttttctgt tgctgattgt taacatagag 2460 cagtctctac actaccctga ggcaactcta cattggaaca ctgaggctta cagcctgcaa 2520 gagcatcaga gctgaccata catttaaaca gaaatgctgg tttatttgca aaatcaccag 2580 tatattttct attgtgtcta taaaaaatca gtcatttaag tacaagaatc atattttcca 2640 ttccttttta gaaatttatt ttgttgtccc tatggaaatc attcacatct gacaatttat 2700 atgttaaaga gttttactct ctctattttg gtccaatttg tatctagtgg ctgagaaatt 2760 aaataattct aaagtatgaa gttacctatc tgaaaatgta cttacagagt atcattttaa 2820 aatggatgtc tctttaaaaa ttttgttact tttaccaaca atgtaatata atttatgtat 2880 attttattaa taatagtgaa ttccttaaaa tttgttctat gtacttatat ttaatttgat 2940 ttaatggtta ctgcccagat attgagaatt ggttcaaata ttgagtgtgt ttcaatatat 3000 tatctggctt atttcaacat gagtaatatg agcaaaataa gttaaaacct gcgtctgatc 3060 aattttcctc atgactagaa ctaaaacagt aaatttggac aatattaagc ctcaaataat 3120 catctccaaa ctccttctaa cactttttaa atcagattgg aagacatgga caaatcaggt 3180 tcatgtgttg catctttatg tcctttgcca atatccaaga tcatcacata tggtagatat 3240 tcacatggag tttcaaattc agaatagatt accattacct tcctgccctt acacatccta 3300 ctccttattt aaaagttcta tttgtgactt ttcatttcct gaaagtttaa aaatacaatt 3360 tgagaatgtt tataatacat tctctcctgt cttttcacgg ttacgtctgt tattgctgaa 3420 atacaccaca ttttctttgt tctggtcaag gttaactcaa tatctgtgtg aaagagaact 3480 actaacaacg ttacaataga ggctagattt gaaaaaaaaa atctatagat ctaattgata 3540 caattgtaga acaaaatgtc aaaataatgt tttaagtata agagaagatg gaccaaggag 3600 agagagatca tttgaaaatc taattgtagc ttttctaggc tcacattcat gtactacttt 3660 tagcaccctt atgggctgtg ctcgccccct ggacagttga gctttggatt atcttcctct 3720 tcaattttcc ctctattgac ccgagtgtct ccctctgctt ctacagattt atagtactcc 3780 ttggctcttt tgagtctcca cttttactca ctgtctctgg gatttttaag atccttttct 3840 tctcttataa atcatcctct taatgaaaat tagcctaaca aaagtttgga gactggaatc 3900 ctactttgag ccactgactt gaaataactc ttttggcaag ttgcctgaca tcctgtctta 3960 ccaaggtggc atatttgcat ttttactgct taaaacattt tttttttttt accatcttta 4020 tccaaattta tcatattgat ggtaggacta acaggctttt tagaagctgg ctttaacttt 4080 gagtctcaag ctacaatgct gttgggcagc ctggtcttcc cacgtgaggg tttaactttg 4140 tttatttgcc tccagttatt ccaaaatgct tattaaatga aagtcccagg aacatgttta 4200 ttttagtcac ctttgctttt taacaatttt gttttgtaat caatgagtaa ttcatgatga 4260 attatttttg actaatggat agccgaaggc caggctttta attctaatag gtaatgttct 4320 tcttttgtct tattgaaaca atgagaatac tctgtgcatt tcaaatgcac tccgattatg 4380 ctgtggtttt attcacataa gcacaatatg tgttttattt ataacttcat aacaaactta 4440 taatataata atttacctta gcagacatgc aaaagcttat tcttgtgtga cttactttct 4500 ttaagctaat aatataaaaa taaatatgta tcttaaaaat ctataataaa acattagaaa 4560 ttaaagatat gtgcttttta ttttgcagat gagttcattt gcttttgtag atgtgttttc 4620 agagctaggt acagaggaat gtttgctacc tttagcggtg aaaaaagaaa gagagtcaag 4680 aattttgttg gattgtgttt gtgtgtgcat atatttgata tcatcattat atttgtaatc 4740 tttggacttg taatcatagc ctgtttattc tactgtgcca ttaaatatac tttaccttat 4800 acataacgaa taaaatacct agaagtagat ttatttacaa aaaaaaaaaa aaaa 4854 <210> 4 <211> 4809 <212> DNA <213> Homo sapiens <400> 4 gtgaactgtt gcaccgtgca attgcacact ataaatgtct ttccttatct gtgtgtactc 60 ttatctcact gttctatttt ttctcctcat ttatattaac tctttcttac ctttttttct 120 gaacttctag gccttctctt tccagaactg gtggaagaca aatgaaacgg ccaagatggt 180 aagaaacaag ccgcatttct ccttggggag actgataatt taaaaggttt gttgtgtcag 240 aaacattccc agcttcatca ccaacccttt ccttccacct ctgcccactg gagaccactt 300 atatcccgaa gcggacgcgg cagctgaagt caggaaacca tgcatcacat tagcaggagc 360 caactgcaga ctttaaactc cgttcaacat gtggatgcgg cagagaaatg acctgtccag 420 acaagccggg gcagctcata aactggttca tctgctccct gtgcgtcccg cgggtgcgta 480 agctctggag cagccggcgt ccaaggaccc ggagaaacct tctgctgggc actgcgtgtg 540 ccatctactt gggcttcctg gtgagccagg tggggagggc ctctctccag catggacagg 600 cggctgagaa ggggccacat cgcagccgcg acaccgccga gccatccttc cctgagatac 660 ccctggatgg taccctggcc cctccagagt cccagggcaa tgggtccact ctgcagccca 720 atgtggtgta cattacccta cgctccaagc gcagcaagcc ggccaatatc cgtggcaccg 780 tgaagcccaa gcgcaggaaa aagcatgcag tggcatcggc tgccccaggg caggaggctt 840 tggtcggacc atcccttcag ccgcaggaag cggcaaggga agctgatgct gtagcacctg 900 ggtacgctca gggagcaaac ctggttaaga ttggagagcg accctggagg ttggtgcggg 960 gtccgggagt gcgagccggg ggcccagact tcctgcagcc cagctccagg gagagcaaca 1020 ttaggatcta cagcgagagc gccccctcct ggctgagcaa agatgacatc cgaagaatgc 1080 gactcttggc ggacagcgca gtggcagggc tccggcctgt gtcctctagg agcggagccc 1140 gtttgctggt gctggagggg ggcgcacctg gcgctgtgct ccgctgtggc cctagcccct 1200 gtgggcttct caagcagccc ttggacatga gtgaggtgtt tgccttccac ctagacagga 1260 tcctggggct caacaggacc ctgccgtctg tgagcaggaa agcagagttc atccaagatg 1320 gccgcccatg ccccatcatt ctttgggatg catctttatc ttcagcaagt aatgacaccc 1380 attcttctgt taagctcacc tggggaactt atcagcagtt gctgaaacag aaatgctggc 1440 agaatggccg agtacccaag cctgaatcgg gttgtactga aatacatcat catgagtggt 1500 ccaagatggc actctttgat tttttgttac agatttataa tcgcttagat acaaattgct 1560 gtggattcag acctcgcaag gaagatgcct gtgtacagaa tggattgagg ccaaaatgtg 1620 atgaccaagg ttctgcggct ctagcacaca ttatccagcg aaagcatgac ccaaggcatt 1680 tggtttttat agacaacaag ggtttctttg acaggagtga agataactta aacttcaaat 1740 tgttagaagg catcaaagag tttccagctt ctgcagtttc tgttttgaag agccagcact 1800 tacggcagaa acttcttcag tctctgtttc ttgataaagt gtattgggaa agtcaaggag 1860 gtagacaagg aattgaaaag cttatcgatg taatagaaca cagagccaaa attcttatca 1920 cctatatcaa tgcacacggg gtcaaagtat tacctatgaa tgaatgacaa aagaatcttc 1980 tggctagggt gttagatata tttatgcatt tttggttttg tttttaaatc aagcacatca 2040 acctcaagcc cgtttagcaa tgaggcagtg tagatgaata cgtaaaataa atgactttaa 2100 ccaagtagct ataatgggac ttagcactgt atgcatactt aaaaaggttt tgaaaaacaa 2160 actacttgag aaatatttgt ttatattttt ctctaacatc atgctatgtg tcagtctgaa 2220 catctgacaa cagaaatttc agttattatt ctagctaagt tttgaaaaca tttgtcatgc 2280 tgtttaatag aaaactgcaa accagagaca ctgactccat taataaacca tattttgtgc 2340 cgttttgact gttctgacca aatactaatg ggaacaattc ttgacgtttt tctgttgctg 2400 attgttaaca tagagcagtc tctacactac cctgaggcaa ctctacattg gaacactgag 2460 gcttacagcc tgcaagagca tcagagctga ccatacattt aaacagaaat gctggtttat 2520 ttgcaaaatc accagtatat tttctattgt gtctataaaa aatcagtcat ttaagtacaa 2580 gaatcatatt ttccattcct ttttagaaat ttattttgtt gtccctatgg aaatcattca 2640 catctgacaa tttatatgtt aaagagtttt actctctcta ttttggtcca atttgtatct 2700 agtggctgag aaattaaata attctaaagt atgaagttac ctatctgaaa atgtacttac 2760 agagtatcat tttaaaatgg atgtctcttt aaaaattttg ttacttttac caacaatgta 2820 atataattta tgtatatttt attaataata gtgaattcct taaaatttgt tctatgtact 2880 tatatttaat ttgatttaat ggttactgcc cagatattga gaattggttc aaatattgag 2940 tgtgtttcaa tatattatct ggcttatttc aacatgagta atatgagcaa aataagttaa 3000 aacctgcgtc tgatcaattt tcctcatgac tagaactaaa acagtaaatt tggacaatat 3060 taagcctcaa ataatcatct ccaaactcct tctaacactt tttaaatcag attggaagac 3120 atggacaaat caggttcatg tgttgcatct ttatgtcctt tgccaatatc caagatcatc 3180 acatatggta gatattcaca tggagtttca aattcagaat agattaccat taccttcctg 3240 cccttacaca tcctactcct tatttaaaag ttctatttgt gacttttcat ttcctgaaag 3300 tttaaaaata caatttgaga atgtttataa tacattctct cctgtctttt cacggttacg 3360 tctgttattg ctgaaataca ccacattttc tttgttctgg tcaaggttaa ctcaatatct 3420 gtgtgaaaga gaactactaa caacgttaca atagaggcta gatttgaaaa aaaaaatcta 3480 tagatctaat tgatacaatt gtagaacaaa atgtcaaaat aatgttttaa gtataagaga 3540 agatggacca aggagagaga gatcatttga aaatctaatt gtagcttttc taggctcaca 3600 ttcatgtact acttttagca cccttatggg ctgtgctcgc cccctggaca gttgagcttt 3660 ggattatctt cctcttcaat tttccctcta ttgacccgag tgtctccctc tgcttctaca 3720 gatttatagt actccttggc tcttttgagt ctccactttt actcactgtc tctgggattt 3780 ttaagatcct tttcttctct tataaatcat cctcttaatg aaaattagcc taacaaaagt 3840 ttggagactg gaatcctact ttgagccact gacttgaaat aactcttttg gcaagttgcc 3900 tgacatcctg tcttaccaag gtggcatatt tgcattttta ctgcttaaaa catttttttt 3960 tttttaccat ctttatccaa atttatcata ttgatggtag gactaacagg ctttttagaa 4020 gctggcttta actttgagtc tcaagctaca atgctgttgg gcagcctggt cttcccacgt 4080 gagggtttaa ctttgtttat ttgcctccag ttattccaaa atgcttatta aatgaaagtc 4140 ccaggaacat gtttatttta gtcacctttg ctttttaaca attttgtttt gtaatcaatg 4200 agtaattcat gatgaattat ttttgactaa tggatagccg aaggccaggc ttttaattct 4260 aataggtaat gttcttcttt tgtcttattg aaacaatgag aatactctgt gcatttcaaa 4320 tgcactccga ttatgctgtg gttttattca cataagcaca atatgtgttt tatttataac 4380 ttcataacaa acttataata taataattta ccttagcaga catgcaaaag cttattcttg 4440 tgtgacttac tttctttaag ctaataatat aaaaataaat atgtatctta aaaatctata 4500 ataaaacatt agaaattaaa gatatgtgct ttttattttg cagatgagtt catttgcttt 4560 tgtagatgtg ttttcagagc taggtacaga ggaatgtttg ctacctttag cggtgaaaaa 4620 agaaagagag tcaagaattt tgttggattg tgtttgtgtg tgcatatatt tgatatcatc 4680 attatatttg taatctttgg acttgtaatc atagcctgtt tattctactg tgccattaaa 4740 tatactttac cttatacata acgaataaaa tacctagaag tagatttatt tacaaaaaaa 4800 aaaaaaaaa 4809 <210> 5 <211> 4745 <212> DNA <213> Homo sapiens <400> 5 ttttctgccc ttctttgctt tggtggcttc cttgtggttc ctcagtggtg cctgcaaccc 60 ctggttcacc tccttccagg ttctggctcc ttccagccat ggctctcaga gtccttctgt 120 taacagcctt gaccttatgt catgggttca acttggacac tgaaaacgca atgaccttcc 180 aagagaacgc aaggggcttc gggcagagcg tggtccagct tcagggatcc agggtggtgg 240 ttggagcccc ccaggagata gtggctgcca accaaagggg cagcctctac cagtgcgact 300 acagcacagg ctcatgcgag cccatccgcc tgcaggtccc cgtggaggcc gtgaacatgt 360 ccctgggcct gtccctggca gccaccacca gcccccctca gctgctggcc tgtggtccca 420 ccgtgcacca gacttgcagt gagaacacgt atgtgaaagg gctctgcttc ctgtttggat 480 ccaacctacg gcagcagccc cagaagttcc cagaggccct ccgagggtgt cctcaagagg 540 atagtgacat tgccttcttg attgatggct ctggtagcat catcccacat gactttcggc 600 ggatgaagga gtttgtctca actgtgatgg agcaattaaa aaagtccaaa accttgttct 660 ctttgatgca gtactctgaa gaattccgga ttcactttac cttcaaagag ttccagaaca 720 accctaaccc aagatcactg gtgaagccaa taacgcagct gcttgggcgg acacacacgg 780 ccacgggcat ccgcaaagtg gtacgagagc tgtttaacat caccaacgga gcccgaaaga 840 atgcctttaa gatcctagtt gtcatcacgg atggagaaaa gtttggcgat cccttgggat 900 atgaggatgt catccctgag gcagacagag agggagtcat tcgctacgtc attggggtgg 960 gagatgcctt ccgcagtgag aaatcccgcc aagagcttaa taccatcgca tccaagccgc 1020 ctcgtgatca cgtgttccag gtgaataact ttgaggctct gaagaccatt cagaaccagc 1080 ttcgggagaa gatctttgcg atcgagggta ctcagacagg aagtagcagc tcctttgagc 1140 atgagatgtc tcaggaaggc ttcagcgctg ccatcacctc taatggcccc ttgctgagca 1200 ctgtggggag ctatgactgg gctggtggag tctttctata tacatcaaag gagaaaagca 1260 ccttcatcaa catgaccaga gtggattcag acatgaatga tgcttacttg ggttatgctg 1320 ccgccatcat cttacggaac cgggtgcaaa gcctggttct gggggcacct cgatatcagc 1380 acatcggcct ggtagcgatg ttcaggcaga acactggcat gtgggagtcc aacgctaatg 1440 tcaagggcac ccagatcggc gcctacttcg gggcctccct ctgctccgtg gacgtggaca 1500 gcaacggcag caccgacctg gtcctcatcg gggcccccca ttactacgag cagacccgag 1560 ggggccaggt gtccgtgtgc cccttgccca gggggcagag ggctcggtgg cagtgtgatg 1620 ctgttctcta cggggagcag ggccaaccct ggggccgctt tggggcagcc ctaacagtgc 1680 tgggggacgt aaatggggac aagctgacgg acgtggccat tggggcccca ggagaggagg 1740 acaaccgggg tgctgtttac ctgtttcacg gaacctcagg atctggcatc agcccctccc 1800 atagccagcg gatagcaggc tccaagctct ctcccaggct ccagtatttt ggtcagtcac 1860 tgagtggggg ccaggacctc acaatggatg gactggtaga cctgactgta ggagcccagg 1920 ggcacgtgct gctgctcagg tcccagccag tactgagagt caaggcaatc atggagttca 1980 atcccaggga agtggcaagg aatgtatttg agtgtaatga tcaggtggtg aaaggcaagg 2040 aagccggaga ggtcagagtc tgcctccatg tccagaagag cacacgggat cggctaagag 2100 aaggacagat ccagagtgtt gtgacttatg acctggctct ggactccggc cgcccacatt 2160 cccgcgccgt cttcaatgag acaaagaaca gcacacgcag acagacacag gtcttggggc 2220 tgacccagac ttgtgagacc ctgaaactac agttgccgaa ttgcatcgag gacccagtga 2280 gccccattgt gctgcgcctg aacttctctc tggtgggaac gccattgtct gctttcggga 2340 acctccggcc agtgctggcg gaggatgctc agagactctt cacagccttg tttccctttg 2400 agaagaattg tggcaatgac aacatctgcc aggatgacct cagcatcacc ttcagtttca 2460 tgagcctgga ctgcctcgtg gtgggtgggc cccgggagtt caacgtgaca gtgactgtga 2520 gaaatgatgg tgaggactcc tacaggacac aggtcacctt cttcttcccg cttgacctgt 2580 cctaccggaa ggtgtccacg ctccagaacc agcgctcaca gcgatcctgg cgcctggcct 2640 gtgagtctgc ctcctccacc gaagtgtctg gggccttgaa gagcaccagc tgcagcataa 2700 accaccccat cttcccggaa aactcagagg tcacctttaa tatcacgttt gatgtagact 2760 ctaaggcttc ccttggaaac aaactgctcc tcaaggccaa tgtgaccagt gagaacaaca 2820 tgcccagaac caacaaaacc gaattccaac tggagctgcc ggtgaaatat gctgtctaca 2880 tggtggtcac cagccatggg gtctccacta aatatctcaa cttcacggcc tcagagaata 2940 ccagtcgggt catgcagcat caatatcagg tcagcaacct ggggcagagg agcctcccca 3000 tcagcctggt gttcttggtg cccgtccggc tgaaccagac tgtcatatgg gaccgccccc 3060 aggtcacctt ctccgagaac ctctcgagta cgtgccacac caaggagcgc ttgccctctc 3120 actccgactt tctggctgag cttcggaagg cccccgtggt gaactgctcc atcgctgtct 3180 gccagagaat ccagtgtgac atcccgttct ttggcatcca ggaagaattc aatgctaccc 3240 tcaaaggcaa cctctcgttt gactggtaca tcaagacctc gcataaccac ctcctgatcg 3300 tgagcacagc tgagatcttg tttaacgatt ccgtgttcac cctgctgccg ggacaggggg 3360 cgtttgtgag gtcccagacg gagaccaaag tggagccgtt cgaggtcccc aaccccctgc 3420 cgctcatcgt gggcagctct gtcgggggac tgctgctcct ggccctcatc accgccgcgc 3480 tgtacaagct cggcttcttc aagcggcaat acaaggacat gatgagtgaa gggggtcccc 3540 cgggggccga accccagtag cggctccttc ccgacagagc tgcctctcgg tggccagcag 3600 gactctgccc agaccacacg tagcccccag gctgctggac acgtcggaca gcgaagtatc 3660 cccgacagga cgggcttggg cttccatttg tgtgtgtgca agtgtgtatg tgcgtgtgtg 3720 caagtgtctg tgtgcaagtg tgtgcacatg tgtgcgtgtg cgtgcatgtg cacttgcacg 3780 cccatgtgtg agtgtgtgca agtatgtgag tgtgtccaag tgtgtgtgcg tgtgtccatg 3840 tgtgtgcaag tgtgtgcatg tgtgcgagtg tgtgcatgtg tgtgctcagg ggcgtgtggc 3900 tcacgtgtgt gactcagatg tctctggcgt gtgggtaggt gacggcagcg tagcctctcc 3960 ggcagaaggg aactgcctgg gctcccttgt gcgtgggtga agccgctgct gggttttcct 4020 ccgggagagg ggacggtcaa tcctgtgggt gaagacagag ggaaacacag cagcttctct 4080 ccactgaaag aagtgggact tcccgtcgcc tgcgagcctg cggcctgctg gagcctgcgc 4140 agcttggatg gagactccat gagaagccgt gggtggaacc aggaacctcc tccacaccag 4200 cgctgatgcc caataaagat gcccactgag gaatgatgaa gcttcctttc tggattcatt 4260 tattatttca atgtgacttt aattttttgg atggataagc ttgtctatgg tacaaaaatc 4320 acaaggcatt caagtgtaca gtgaaaagtc tccctttcca gatattcaag tcacctcctt 4380 aaaggtagtc aagattgtgt tttgaggttt ccttcagaca gattccaggc gatgtgcaag 4440 tgtatgcacg tgtgcacaca caccacacat acacacacac aagctttttt acacaaatgg 4500 tagcatactt tatattggtc tgtatcttgc tttttttcac caatatttct cagacatcgg 4560 ttcatattaa gacataaatt actttttcat tcttttatac cgctgcatag tattccattg 4620 tgtgagtgta ccataatgta tttaaccagt cttcttttga tatactattt tcattctctt 4680 gttattgcat caatgctgag ttaataaatc aaatatatgt catttttgca tatatgtaag 4740 gataa 4745 <210> 6 <211> 4742 <212> DNA <213> Homo sapiens <400> 6 ttttctgccc ttctttgctt tggtggcttc cttgtggttc ctcagtggtg cctgcaaccc 60 ctggttcacc tccttccagg ttctggctcc ttccagccat ggctctcaga gtccttctgt 120 taacagcctt gaccttatgt catgggttca acttggacac tgaaaacgca atgaccttcc 180 aagagaacgc aaggggcttc gggcagagcg tggtccagct tcagggatcc agggtggtgg 240 ttggagcccc ccaggagata gtggctgcca accaaagggg cagcctctac cagtgcgact 300 acagcacagg ctcatgcgag cccatccgcc tgcaggtccc cgtggaggcc gtgaacatgt 360 ccctgggcct gtccctggca gccaccacca gcccccctca gctgctggcc tgtggtccca 420 ccgtgcacca gacttgcagt gagaacacgt atgtgaaagg gctctgcttc ctgtttggat 480 ccaacctacg gcagcagccc cagaagttcc cagaggccct ccgagggtgt cctcaagagg 540 atagtgacat tgccttcttg attgatggct ctggtagcat catcccacat gactttcggc 600 ggatgaagga gtttgtctca actgtgatgg agcaattaaa aaagtccaaa accttgttct 660 ctttgatgca gtactctgaa gaattccgga ttcactttac cttcaaagag ttccagaaca 720 accctaaccc aagatcactg gtgaagccaa taacgcagct gcttgggcgg acacacacgg 780 ccacgggcat ccgcaaagtg gtacgagagc tgtttaacat caccaacgga gcccgaaaga 840 atgcctttaa gatcctagtt gtcatcacgg atggagaaaa gtttggcgat cccttgggat 900 atgaggatgt catccctgag gcagacagag agggagtcat tcgctacgtc attggggtgg 960 gagatgcctt ccgcagtgag aaatcccgcc aagagcttaa taccatcgca tccaagccgc 1020 ctcgtgatca cgtgttccag gtgaataact ttgaggctct gaagaccatt cagaaccagc 1080 ttcgggagaa gatctttgcg atcgagggta ctcagacagg aagtagcagc tcctttgagc 1140 atgagatgtc tcaggaaggc ttcagcgctg ccatcacctc taatggcccc ttgctgagca 1200 ctgtggggag ctatgactgg gctggtggag tctttctata tacatcaaag gagaaaagca 1260 ccttcatcaa catgaccaga gtggattcag acatgaatga tgcttacttg ggttatgctg 1320 ccgccatcat cttacggaac cgggtgcaaa gcctggttct gggggcacct cgatatcagc 1380 acatcggcct ggtagcgatg ttcaggcaga acactggcat gtgggagtcc aacgctaatg 1440 tcaagggcac ccagatcggc gcctacttcg gggcctccct ctgctccgtg gacgtggaca 1500 gcaacggcag caccgacctg gtcctcatcg gggcccccca ttactacgag cagacccgag 1560 ggggccaggt gtccgtgtgc cccttgccca gggggagggc tcggtggcag tgtgatgctg 1620 ttctctacgg ggagcagggc caaccctggg gccgctttgg ggcagcccta acagtgctgg 1680 gggacgtaaa tggggacaag ctgacggacg tggccattgg ggccccagga gaggaggaca 1740 accggggtgc tgtttacctg tttcacggaa cctcaggatc tggcatcagc ccctcccata 1800 gccagcggat agcaggctcc aagctctctc ccaggctcca gtattttggt cagtcactga 1860 gtgggggcca ggacctcaca atggatggac tggtagacct gactgtagga gcccaggggc 1920 acgtgctgct gctcaggtcc cagccagtac tgagagtcaa ggcaatcatg gagttcaatc 1980 ccagggaagt ggcaaggaat gtatttgagt gtaatgatca ggtggtgaaa ggcaaggaag 2040 ccggagaggt cagagtctgc ctccatgtcc agaagagcac acgggatcgg ctaagagaag 2100 gacagatcca gagtgttgtg acttatgacc tggctctgga ctccggccgc ccacattccc 2160 gcgccgtctt caatgagaca aagaacagca cacgcagaca gacacaggtc ttggggctga 2220 cccagacttg tgagaccctg aaactacagt tgccgaattg catcgaggac ccagtgagcc 2280 ccattgtgct gcgcctgaac ttctctctgg tgggaacgcc attgtctgct ttcgggaacc 2340 tccggccagt gctggcggag gatgctcaga gactcttcac agccttgttt ccctttgaga 2400 agaattgtgg caatgacaac atctgccagg atgacctcag catcaccttc agtttcatga 2460 gcctggactg cctcgtggtg ggtgggcccc gggagttcaa cgtgacagtg actgtgagaa 2520 atgatggtga ggactcctac aggacacagg tcaccttctt cttcccgctt gacctgtcct 2580 accggaaggt gtccacgctc cagaaccagc gctcacagcg atcctggcgc ctggcctgtg 2640 agtctgcctc ctccaccgaa gtgtctgggg ccttgaagag caccagctgc agcataaacc 2700 accccatctt cccggaaaac tcagaggtca cctttaatat cacgtttgat gtagactcta 2760 aggcttccct tggaaacaaa ctgctcctca aggccaatgt gaccagtgag aacaacatgc 2820 ccagaaccaa caaaaccgaa ttccaactgg agctgccggt gaaatatgct gtctacatgg 2880 tggtcaccag ccatggggtc tccactaaat atctcaactt cacggcctca gagaatacca 2940 gtcgggtcat gcagcatcaa tatcaggtca gcaacctggg gcagaggagc ctccccatca 3000 gcctggtgtt cttggtgccc gtccggctga accagactgt catatgggac cgcccccagg 3060 tcaccttctc cgagaacctc tcgagtacgt gccacaccaa ggagcgcttg ccctctcact 3120 ccgactttct ggctgagctt cggaaggccc ccgtggtgaa ctgctccatc gctgtctgcc 3180 agagaatcca gtgtgacatc ccgttctttg gcatccagga agaattcaat gctaccctca 3240 aaggcaacct ctcgtttgac tggtacatca agacctcgca taaccacctc ctgatcgtga 3300 gcacagctga gatcttgttt aacgattccg tgttcaccct gctgccggga cagggggcgt 3360 ttgtgaggtc ccagacggag accaaagtgg agccgttcga ggtccccaac cccctgccgc 3420 tcatcgtggg cagctctgtc gggggactgc tgctcctggc cctcatcacc gccgcgctgt 3480 acaagctcgg cttcttcaag cggcaataca aggacatgat gagtgaaggg ggtcccccgg 3540 gggccgaacc ccagtagcgg ctccttcccg acagagctgc ctctcggtgg ccagcaggac 3600 tctgcccaga ccacacgtag cccccaggct gctggacacg tcggacagcg aagtatcccc 3660 gacaggacgg gcttgggctt ccatttgtgt gtgtgcaagt gtgtatgtgc gtgtgtgcaa 3720 gtgtctgtgt gcaagtgtgt gcacatgtgt gcgtgtgcgt gcatgtgcac ttgcacgccc 3780 atgtgtgagt gtgtgcaagt atgtgagtgt gtccaagtgt gtgtgcgtgt gtccatgtgt 3840 gtgcaagtgt gtgcatgtgt gcgagtgtgt gcatgtgtgt gctcaggggc gtgtggctca 3900 cgtgtgtgac tcagatgtct ctggcgtgtg ggtaggtgac ggcagcgtag cctctccggc 3960 agaagggaac tgcctgggct cccttgtgcg tgggtgaagc cgctgctggg ttttcctccg 4020 ggagagggga cggtcaatcc tgtgggtgaa gacagaggga aacacagcag cttctctcca 4080 ctgaaagaag tgggacttcc cgtcgcctgc gagcctgcgg cctgctggag cctgcgcagc 4140 ttggatggag actccatgag aagccgtggg tggaaccagg aacctcctcc acaccagcgc 4200 tgatgcccaa taaagatgcc cactgaggaa tgatgaagct tcctttctgg attcatttat 4260 tatttcaatg tgactttaat tttttggatg gataagcttg tctatggtac aaaaatcaca 4320 aggcattcaa gtgtacagtg aaaagtctcc ctttccagat attcaagtca cctccttaaa 4380 ggtagtcaag attgtgtttt gaggtttcct tcagacagat tccaggcgat gtgcaagtgt 4440 atgcacgtgt gcacacacac cacacataca cacacacaag cttttttaca caaatggtag 4500 catactttat attggtctgt atcttgcttt ttttcaccaa tatttctcag acatcggttc 4560 atattaagac ataaattact ttttcattct tttataccgc tgcatagtat tccattgtgt 4620 gagtgtacca taatgtattt aaccagtctt cttttgatat actattttca ttctcttgtt 4680 attgcatcaa tgctgagtta ataaatcaaa tatatgtcat ttttgcatat atgtaaggat 4740 aa 4742 <210> 7 <211> 5188 <212> DNA <213> Homo sapiens <400> 7 aaaaccctgc aggagactgc gagccctgca gaactgctag ctgcggggga gagggcaggg 60 gtcgggcgcc tgtggcggag ccgggctggg gccagggcag ggaggctgac aagcggcggg 120 agaagccggc ggagggcggg atcgcgcctc ctgacatgtt gggggtatcc ctggccgggc 180 cgggccgggg ctaagagcgg cgctgcgggc cggggtcggg gtcgggtcgc ggtccgcccc 240 cgctgtccct ccgtcctgcc ctgtcgagga cgtgcgttcc gcactcggcc gcctccagag 300 ggagcgaggg aagcggctag aggatcgggg agaaggagca ttcgccggag gctggaggag 360 gctgacccgc gtccccgccc agcctgctcc tatgcggtac ttgaaggatg gcgaagaggt 420 cgcgcagtga ggatgaggat gatgaccttc agtatgccga tcatgattat gaagtaccac 480 aacaaaaagg actgaagaaa ctctggaaca gagtaaaatg gacaagggac gaggatgata 540 aattaaagaa gttggttgaa caacatggaa ctgatgattg gactctaatt gctagtcatc 600 ttcaaaatcg ctctgatttt cagtgccagc atcgatggca gaaagtttta aatcctgaat 660 tgataaaggg tccttggact aaagaagaag atcagagggt tattgaatta gttcagaaat 720 atgggccaaa aagatggtct ttaattgcaa aacatttaaa aggaagaata ggcaagcagt 780 gtagagaaag atggcataat catctgaatc ctgaggtaaa gaaatcttcc tggacagaag 840 aggaggacag gatcatctat gaagcacata agcggttggg aaatcgttgg gcagaaattg 900 ccaaactact tccaggaagg actgataatt ctatcaaaaa tcattggaat tctactatgc 960 gaagaaaagt ggaacaggag ggctatttac aagatggaat aaaatcagaa cgatcttcat 1020 ctaaacttca acacaaacct tgtgcagcta tggatcatat gcaaacccag aatcagtttt 1080 acatacctgt tcagatccct gggtatcagt atgtgtcacc tgaaggcaat tgtatagaac 1140 atgttcagcc tacttctgcc tttattcagc aacccttcat tgatgaagat cctgataagg 1200 aaaagaaaat aaaggaactt gagatgcttc ttatgtcagc tgagaatgaa gttagaagaa 1260 agcgaattcc atcacagcct ggaagttttt ctagctggtc tggtagtttc ctcatggatg 1320 ataacatgtc taatactcta aatagccttg acgagcacac tagtgagttt tacagtatgg 1380 atgaaaatca gcctgtgtct gctcagcaga attcacccac aaagttcctg gccgtggagg 1440 caaacgctgt gttatcctct ttgcagacca tcccagaatt tgcagagact ctagaactta 1500 ttgaatctga tcctgtagca tggagtgacg ttaccagttt tgatatttct gatgctgctg 1560 cttctcctat caaatccacc ccagttaaat taatgagaat tcagcacaat gaaggagcca 1620 tggaatgcca atttaacgtc agtcttgtac ttgaagggaa aaaaaacact tgtaatggtg 1680 gcaacagtga agctgttcct ttaacatccc caaatatagc caagtttagc actccaccag 1740 ccatcctcag aaagaagaga aaaatgcgag tgggtcattc cccaggcagc gaacttaggg 1800 atggctcatt gaacgatggt ggtaatatgg cgctaaaaca tacaccactg aaaacactac 1860 cattttctcc ttcacagttt ttcaacacat gtcctggtaa tgaacaactt aatatagaaa 1920 atccttcatt tacatcaacc cctatttgtg ggcagaaagc tctcattaca actcctcttc 1980 ataaggaaac aactcccaaa gatcaaaagg aaaatgtagg gtttagaaca cctactatta 2040 gaagatctat actgggtacc acaccaagaa ctcctactcc ttttaagaat gcgcttgctg 2100 ctcaggagaa aaaatatgga cctcttaaaa ttgtgtccca gccacttgct ttcttggaag 2160 aagatattcg ggaagtttta aaagaagaaa ctggaacaga cctattcctc aaagaggaag 2220 atgaacctgc ttacaaaagc tgcaaacaag agaataccgc ttctgggaag aaagtcagaa 2280 aatcactagt cttagataat tgggaaaaag aagaatcagg cactcaactg ttgactgaag 2340 acatttcaga catgcagtca gaaaatagat ttactacatc cttattaatg ataccattat 2400 tggaaataca tgacaatagg tgcaacttga ttcctgaaaa acaagatata aattcaacca 2460 acaaaacata tacacttact aaaaagaaac caaaccctaa cacttccaaa gttgtcaaat 2520 tggaaaagaa tcttcagtca aattgtgaat gggaaacagt ggtttatggg aagacagaag 2580 accaacttat tatgactgaa caagcaagaa gatatctgag tacttacaca gctaccagta 2640 gtacttcaag agctctcata ctgtaattgt tattaaaatt gatgaaatgc cccactccct 2700 tactgcagtc tctactaaat taggttgcag tgaaattttt ctcaattagt tgtttttaaa 2760 gttgtaagat agccctttta atacagcatc ttttttctat tctatatagt aggcagaaag 2820 ctagtaagtc acttaagggg tagatagttt catagtttat tttttaagag atgagatttt 2880 taaaaattgt ttttaaagaa caagatggga aaataataga atgttcatgg atttctaaaa 2940 gtaaattctc atatattttc ttcacaagat atatgttgct actctcttga tgctgcagtt 3000 ttgttataga taggtgtatg agtatatatg atttctgaaa ttagtctatg tatggaaagc 3060 acacatgatt ttatgaagta cttttgccca tgtgctgatt tacttaggct accatttaca 3120 aagaaacaca ttgaaaagga atttaaagga aggatagaaa gttgcactac taattttttg 3180 tttttttttt cagaagcagt aaaattaact acagtgttaa atgtatttat ttgagcatag 3240 tactgaaaac aaaaagcatt caaaaaagag ttttttcttt attagtaaat agtattttct 3300 taatctcaga ggagctgaga gttttgttga atgtattgta cagtatgtag gagcaggaga 3360 actttgtaaa ttggaaagaa gtctgttttt ataatttatt tttattttta aagcttaaat 3420 gtagatattt atacgtatac agggtgccta gaagccaatg ttgtttcctg ttattacagc 3480 taacacagta aagaataatt ttgactttaa gtatgaaaca gtagtaagtt atagctgcaa 3540 agaatacaat atctatactg tatgtcacat ctacctaaat gttgcactat gccctttaaa 3600 tcatgctggt tataaagtag ttctaaaaat gtactaaata ataatttaat attttctttt 3660 taaattatat cgggggtggt catatacatt aatctggtga tttgtatatg tgtttgaaat 3720 ttttgcattt tgtttaaaaa ataatatggt accttggtcc ctaaaaacag tctgcactta 3780 gaagtttata tttactcagt gtttcagaag tggagaacat tatcttttat ttataaaaat 3840 attttgtcct tttttaaatg ttttgtgttt ctctacaggt tacaacagtt gcttcagttg 3900 cctgttttag gtgtttgcac ttattttatt tcttcttgaa agaattttta tttgcttttg 3960 tggtagagat tatatgtaat tttttttcag tcatataatg gtgtgctgtc aacttaaaca 4020 ctgacaggta aatagaattg tacactgtag tttgaattat ttataattga cacactctct 4080 ccctctccac tcctgaagta tgctgctata gaaaatagca gaatcggctt gctgctacga 4140 gagaaggaaa gagcgaccac cacttgcact gtgtgaaaag ataaaaaaca aatgatggca 4200 agttctcaag ttaactaaat ggaatcaacc attaccaggc aaattcttgc aaataccaaa 4260 atactactat gccttataaa acaaaatgaa agcaggttaa gattttctgc tctgtttgta 4320 tgttaataga aatggaaata ctaagtattt taatgcttag ctcttgaaca gtagacctaa 4380 aagggtttta agctatttaa atctacttgc tagtttttgc atattttata tatatatata 4440 tttatatata tatatagtga gaagtgaaga aaatgtatgg tactaagatt atgccttatt 4500 gataaataga taaaccaatt tgaatcctct tagcatgttt aagtatgttg attgctttct 4560 aattaatgaa cttctcacag aaatttcact tagtgaaacc aatgattgta gcaaactcat 4620 actggatcat ttcagttacc ttgaactaat agcacataat ggttttttgt tgttgttgtt 4680 tttaatgtag cccttacctg gatatacata gtctgcaatc accaaagtat aatatcttgt 4740 aaggctatat tttttaaagc atattttttc ttgagcatta aattatccta aatggtaata 4800 tattgtggat aagtctgggc ttattggaca taatacatat ttgggttggt actggttgaa 4860 tccttcagtt aactgctttg ttgctttttg caagattttt tatcttaaac atgtcaggca 4920 tcttaagtca cctttatact gttttgttcc tctgagtttc tttcagtatg ttatacaaat 4980 gccagacata acatgtagca gccatacttg catggaaact gactacacat acataatact 5040 gcattttatt gtaaggtttt cacattaata cagcaattac cctgactaaa ttgagttttg 5100 tgatatatgg aaaacttcat tgtaagagaa tcttgcatac aatgttgaca tattaacatc 5160 caaaataaag catctgtgta caagctga 5188 <210> 8 <211> 5008 <212> DNA <213> Homo sapiens <400> 8 aaaaccctgc aggagactgc gagccctgca gaactgctag ctgcggggga gagggcaggg 60 gtcgggcgcc tgtggcggag ccgggctggg gccagggcag ggaggctgac aagcggcggg 120 agaagccggc ggagggcggg atcgcgcctc ctgacatgtt gggggtatcc ctggccgggc 180 cgggccgggg ctaagagcgg cgctgcgggc cggggtcggg gtcgggtcgc ggtccgcccc 240 cgctgtccct ccgtcctgcc ctgtcgagga cgtgcgttcc gcactcggcc gcctccagag 300 ggagcgaggg aagcggctag aggatcgggg agaaggagca ttcgccggag gctggaggag 360 gctgacccgc gtccccgccc agcctgctcc tatgcggtac ttgaaggatg gcgaagaggt 420 cgcgcagtga ggatgaggat gatgaccttc agtatgccga tcatgattat gaagtaccac 480 aacaaaaagg actgaagaaa ctctggaaca gagtaaaatg gacaagggac gaggatgata 540 aattaaagaa gttggttgaa caacatggaa ctgatgattg gactctaatt gctagtcatc 600 ttcaaaatcg ctctgatttt cagtgccagc atcgatggca gaaagtttta aatcctgaat 660 tgataaaggg tccttggact aaagaagaag atcagagggt tattgaatta gttcagaaat 720 atgggccaaa aagatggtct ttaattgcaa aacatttaaa aggaagaata ggcaagcagt 780 gtagagaaag atggcataat catctgaatc ctgaggtaaa gaaatcttcc tggacagaag 840 aggaggacag gatcatctat gaagcacata agcggttggg aaatcgttgg gcagaaattg 900 ccaaactact tccaggaagg actgataatt ctatcaaaaa tcattggaat tctactatgc 960 gaagaaaagt ggaacaggag ggctatttac aagatggaat aaaatcagaa cgatcttcat 1020 ctaaacttca acacaaacct tgtgcagcta tggatcatat gcaaacccag aatcagtttt 1080 acatacctgt tcagatccct gggtatcagt atgtgtcacc tgaaggcaat tgtatagaac 1140 atgttcagcc tacttctgcc tttattcagc aacccttcat tgatgaagat cctgataagg 1200 aaaagaaaat aaaggaactt gagatgcttc ttatgtcagc tgagaatgaa gttagaagaa 1260 agcgaattcc atcacagcct ggaagttttt ctagctggtc tggtagtttc ctcatggatg 1320 ataacatgtc taatactcta aatagccttg acgagcacac tagtgagttt tacagtatgg 1380 atgaaaatca gcctgtgtct gctcagcaga attcacccac aaagttcctg gccgtggagg 1440 caaacgctgt gttatcctct ttgcagacca tcccagaatt tgcagagact ctagaactta 1500 ttgaatctga tcctgtagca tggagtgacg ttaccagttt tgatatttct gatgctgctg 1560 cttctcctat caaatccacc ccagttaaat taatgagaat tcagcacaat gaaggagcca 1620 tggaatgcca atttaacgtc agtcttgtac ttgaagggaa aaaaaacact tgtaatggtg 1680 gcaacagtga agctgttcct ttaacatccc caaatatagc caagtttagc actccaccag 1740 ccatcctcag aaagaagaga aaaatgcgag tgggtcattc cccaggcagc gaacttaggg 1800 atggctcatt gaacgatggt ggtaatatgg cgctaaaaca tacaccactg aaaacactac 1860 cattttctcc ttcacagttt ttcaacacat gtcctggtaa tgaacaactt aatatagaaa 1920 atccttcatt tacatcaacc cctatttgtg ggcagaaagc tctcattaca actcctcttc 1980 ataaggaaac aactcccaaa gatcaaaagg aaaatgtagg gtttagaaca cctactatta 2040 gaagatctat actgggtacc acaccaagaa ctcctactcc ttttaagaat gcgcttgctg 2100 ctcaggagaa aaaatatgga cctcttaaaa ttgtgtccca gccacttgct ttcttggaag 2160 aagatattcg ggaagtttta aaagaagaaa ctggaacaga cctattcctc aaagaggaag 2220 atgaacctgc ttacaaaagc tgcaaacaag agaataccgc ttctgggaag aaagtcagaa 2280 aatcactagt cttagataat tgggaaaaag aagaatcagg cactcaactg ttgactgaag 2340 acatttcaga catgcagtca aattgtgaat gggaaacagt ggtttatggg aagacagaag 2400 accaacttat tatgactgaa caagcaagaa gatatctgag tacttacaca gctaccagta 2460 gtacttcaag agctctcata ctgtaattgt tattaaaatt gatgaaatgc cccactccct 2520 tactgcagtc tctactaaat taggttgcag tgaaattttt ctcaattagt tgtttttaaa 2580 gttgtaagat agccctttta atacagcatc ttttttctat tctatatagt aggcagaaag 2640 ctagtaagtc acttaagggg tagatagttt catagtttat tttttaagag atgagatttt 2700 taaaaattgt ttttaaagaa caagatggga aaataataga atgttcatgg atttctaaaa 2760 gtaaattctc atatattttc ttcacaagat atatgttgct actctcttga tgctgcagtt 2820 ttgttataga taggtgtatg agtatatatg atttctgaaa ttagtctatg tatggaaagc 2880 acacatgatt ttatgaagta cttttgccca tgtgctgatt tacttaggct accatttaca 2940 aagaaacaca ttgaaaagga atttaaagga aggatagaaa gttgcactac taattttttg 3000 tttttttttt cagaagcagt aaaattaact acagtgttaa atgtatttat ttgagcatag 3060 tactgaaaac aaaaagcatt caaaaaagag ttttttcttt attagtaaat agtattttct 3120 taatctcaga ggagctgaga gttttgttga atgtattgta cagtatgtag gagcaggaga 3180 actttgtaaa ttggaaagaa gtctgttttt ataatttatt tttattttta aagcttaaat 3240 gtagatattt atacgtatac agggtgccta gaagccaatg ttgtttcctg ttattacagc 3300 taacacagta aagaataatt ttgactttaa gtatgaaaca gtagtaagtt atagctgcaa 3360 agaatacaat atctatactg tatgtcacat ctacctaaat gttgcactat gccctttaaa 3420 tcatgctggt tataaagtag ttctaaaaat gtactaaata ataatttaat attttctttt 3480 taaattatat cgggggtggt catatacatt aatctggtga tttgtatatg tgtttgaaat 3540 ttttgcattt tgtttaaaaa ataatatggt accttggtcc ctaaaaacag tctgcactta 3600 gaagtttata tttactcagt gtttcagaag tggagaacat tatcttttat ttataaaaat 3660 attttgtcct tttttaaatg ttttgtgttt ctctacaggt tacaacagtt gcttcagttg 3720 cctgttttag gtgtttgcac ttattttatt tcttcttgaa agaattttta tttgcttttg 3780 tggtagagat tatatgtaat tttttttcag tcatataatg gtgtgctgtc aacttaaaca 3840 ctgacaggta aatagaattg tacactgtag tttgaattat ttataattga cacactctct 3900 ccctctccac tcctgaagta tgctgctata gaaaatagca gaatcggctt gctgctacga 3960 gagaaggaaa gagcgaccac cacttgcact gtgtgaaaag ataaaaaaca aatgatggca 4020 agttctcaag ttaactaaat ggaatcaacc attaccaggc aaattcttgc aaataccaaa 4080 atactactat gccttataaa acaaaatgaa agcaggttaa gattttctgc tctgtttgta 4140 tgttaataga aatggaaata ctaagtattt taatgcttag ctcttgaaca gtagacctaa 4200 aagggtttta agctatttaa atctacttgc tagtttttgc atattttata tatatatata 4260 tttatatata tatatagtga gaagtgaaga aaatgtatgg tactaagatt atgccttatt 4320 gataaataga taaaccaatt tgaatcctct tagcatgttt aagtatgttg attgctttct 4380 aattaatgaa cttctcacag aaatttcact tagtgaaacc aatgattgta gcaaactcat 4440 actggatcat ttcagttacc ttgaactaat agcacataat ggttttttgt tgttgttgtt 4500 tttaatgtag cccttacctg gatatacata gtctgcaatc accaaagtat aatatcttgt 4560 aaggctatat tttttaaagc atattttttc ttgagcatta aattatccta aatggtaata 4620 tattgtggat aagtctgggc ttattggaca taatacatat ttgggttggt actggttgaa 4680 tccttcagtt aactgctttg ttgctttttg caagattttt tatcttaaac atgtcaggca 4740 tcttaagtca cctttatact gttttgttcc tctgagtttc tttcagtatg ttatacaaat 4800 gccagacata acatgtagca gccatacttg catggaaact gactacacat acataatact 4860 gcattttatt gtaaggtttt cacattaata cagcaattac cctgactaaa ttgagttttg 4920 tgatatatgg aaaacttcat tgtaagagaa tcttgcatac aatgttgaca tattaacatc 4980 caaaataaag catctgtgta caagctga 5008 <210> 9 <211> 2632 <212> DNA <213> Homo sapiens <400> 9 agctgaggga cgcgtcagcc aggcaccccg gggtgtggcc agaggacttc ggcgacgctt 60 ccccgagagt agcccccctc ctcaacccag aaaagacaac cccgcggggc tgcagcgagc 120 caggcatgct cactggcgca ggcccggccc gcagcccgag caggaagcgc cggcgctagg 180 cggccccctg cgctgccagc tggagccggg cggagccagc gccccggcgc agggtggctc 240 tgccagtccc cgcgcgcctg ggcggccgca cacgtgtcca ggcgtcacgt ccgcgcgcgc 300 ccccggggct tgcgtcagcg gctgttccag aagcgggtgg gccagggctc tgcgcaccgc 360 tggggttcgg ggcccgggac gccgccggga ggagggcacc gcgcggggtc cgacgcggag 420 gcgtgctcgg aacgccgggg gctgcggagt gcatcagcgc ggtccagccc tccgcctgcc 480 gggcgccgag cgtctccgcc gcccggacct gggctgggcg ccgtggcgtt gcctcggagc 540 tcgctgcccg cggggcgcgc accgccttga cccgggcggc cccgcggcag gcaggcgccc 600 gcagttccat ggttggttcg gagcgcgatg agccgcccgt cctccaccgg ccccagcgct 660 aataaaccct gcagcaagca gccgccgccg cagccccagc acactccgtc cccggctgcg 720 cccccggccg ccgccaccat ctcggctgcg ggccccggct cgtccgcggt gcccgccgcg 780 gcggcggtga tctcgggccc cggcggcggc ggcggggccg gcccggtgtc cccgcagcac 840 cacgagctga cctcgctctt cgagtgtccg gtctgctttg actatgtcct gcctcctatt 900 ctgcagtgcc aggccgggca cctggtgtgt aaccaatgcc gccagaagtt gagctgctgc 960 ccgacgtgca ggggcgccct gacgcccagc atcaggaacc tggctatgga gaaggtggcc 1020 tcggcagtcc tgtttccctg taagtatgcc accacgggct gttccctgac cctgcaccat 1080 acggagaaac cagaacatga agacatatgt gaataccgtc cctactcctg cccatgtcct 1140 ggtgcttcct gcaagtggca ggggtccctg gaagctgtga tgtcccatct catgcacgcc 1200 cacaagagca ttaccaccct tcagggagaa gacatcgtct ttctagctac agacattaac 1260 ttgccagggg ctgtcgactg ggtgatgatg cagtcatgtt ttggccatca cttcatgctg 1320 gtgctggaga aacaagagaa gtacgaaggc caccagcagt tttttgccat cgtcctgctc 1380 attggcaccc gcaagcaagc cgagaacttt gcctacagac tggagttgaa tgggaaccgg 1440 cggagattga cctgggaggc cacgccccgt tcgattcatg acggtgtggc tgcggccatc 1500 atgaacagcg actgccttgt tttcgacaca gccatagcac atctttttgc agataatggg 1560 aaccttggaa tcaatgttac tatttctaca tgttgtccat gatgtgactt tcgtaaacct 1620 tcaaaattat ttgggcatag tgctctatgt ttaataaagg tttttataga tgttttattc 1680 catatgtctt cacaagtcag gacccacaat tacccgtgtt ttgtttgaac agcagtgtcc 1740 catctggctt cgacccaaca aagttcatta acctgggatg aatggggttg gcctgttggt 1800 gatttggatg ctgttctgtg atctaaaaca actcttattg aattgtattt actccctaaa 1860 caacacttga caggctgttg cacagggctt ctatagatca gtgtgttagg aatgggaggc 1920 cccttcctgc ctgccttccc atattggtcc cttgacattg acaaaagcac agtgactgtc 1980 agcagattcc tttacttttg tttgtgggag gtaggaattg ttttaatgca ttttaaacag 2040 tgtttctgaa attggatggc tggctaatag acactgaatc acccggagtg cttatcttaa 2100 aattgcagat ttagggagcc tgccaattta acagtctcat caggtgattc ttttcaacag 2160 taatgtttga gaattactgg gttaaattgt gggaaagggt ccagatttta aaggtgcttt 2220 aaggttgccc tctgccgata ctgtttgtct ttctactgtt tcatccccta acttccccca 2280 accctcaaat taaaactaga actatagatc cacatgaacg cacgcctgag atttggccac 2340 tcacctatgt tttgggtgga ttgcctagga aagcaagtca tatggccatt gatagttctc 2400 atgtaattag ttttgctcac cactagtaca gatgacccgt ttacacgtgg cttccctcgg 2460 aagccctcct caacagtagc tggtgtgaaa gactaaatca gtagagttgg aaaagcttta 2520 taaccggtgt gtcatatgct tgctatttaa agctgtgtgt tggttttgtt tttctgccac 2580 attcactagt tttttaataa atattttcca aaaatggata aaaaaaaaaa aa 2632 <210> 10 <211> 894 <212> DNA <213> Homo sapiens <400> 10 gcccgtcttc gtgtctcctc cctccctcgc cttcctcctt cctagctcct ctcctccagg 60 gccagactga gcccaggttg atttcaggcg gacaccaata gactccacag cagctccagg 120 agcccagaca ccggcggcca gaagcaaggc taggagctgc tgcagccatg tcggccctca 180 gcctcctcat tctgggcctg ctcacggcag tgccacctgc cagctgtcag caaggcctgg 240 ggaaccttca gccctggatg cagggcctta tcgcggtggc cgtgttcctg gtcctcgttg 300 caatcgcctt tgcagtcaac cacttctggt gccaggagga gccggagcct gcacacatga 360 tcctgaccgt cggaaacaag gcagatggag tcctggtggg aacagatgga aggtactctt 420 cgatggcggc cagtttcagg tccagtgagc atgagaatgc ctatgagaat gtgcccgagg 480 aggaaggcaa ggtccgcagc accccgatgt aaccttctct gtggctccaa ccccaagact 540 cccaggcaca tgggatggat gtccagtgct accacccaag ccccctcctt ctttgtgtgg 600 aatctgcaat agtgggctga ctccctccag ccccatgccg gccctacccg cccttgaagt 660 atagccagcc aaggttggag ctcagaccgt gtctaggttg gggctcggct gtggccctgg 720 ggtctcctgc tcagctcaga agagccttct ggagaggaca gtcagctgag cacctcccat 780 cctgctcaca cgtccttccc cataactatg gaaatggccc taatttctgt gaaataaaga 840 ctttttgtat ttctggggct gaggctcagc aacagcccct caggcttcca gtga 894 <210> 11 <211> 5003 <212> DNA <213> Homo sapiens <400> 11 agttgggctc ccgcctggct gggaggcggg agggatcccg ctcctgttgt tttccgccgg 60 caggagtagg ctggcgggcg cagggggcgg ggtgcgccct ccctccccgg ccagggcgct 120 cgggagcggg gacccgagcc tgcagccgag ctccgctgcc ggccctggac actcggctca 180 gccaagcatc cttcctgggg gccgaggaag tggggccact ctgccgttcc gaggacctgg 240 gaggagccct cggtaccccg ggccccgggg ccctggggca cacacgtcca gcccagcccg 300 agcctgcgtt tcctgagccg ggatctgggg cgagatggcc gcaggcggca gtgcgcccga 360 gccccgcgtc ctcgtctgcc tcggggcgct cctggccggc tgggtcgccg taggattgga 420 ggctgttgtc attggagaag ttcatgagaa tgttactctg cactgtggca acatctcggg 480 actgaggggc caggtgacct ggtaccggaa caactcggag cctgtcttcc ttctctcgtc 540 caactctagc ctccggccag ctgagcctcg cttctctcta gtggatgcca cctccctgca 600 cattgaatcg ctgagcctgg gagatgaggg aatctacacc tgccaggaga tcctgaatgt 660 gactcagtgg ttccaagtgt ggctgcaggt ggccagcggc ccctatcaga ttgaggtcca 720 catcgtggcc accggcacac tccccaacgg caccctctac gcagccaggg gctcccaggt 780 ggacttcagc tgcaacagca gctccaggcc accacccgtg gttgaatggt ggttccaggc 840 cctgaattcc agcagcgagt cctttggcca caacctgaca gtcaactttt tctcactgtt 900 actgatatcg ccaaacctcc aagggaacta cacctgttta gccttgaatc agctcagcaa 960 gagacatcga aaggtgacca ccgagctcct ggtctactat ccccctccat cagctcccca 1020 gtgctgggca cagatggcat caggatcgtt catgttgcag cttacctgtc gctgggatgg 1080 gggataccct gaccctgact tcctgtggat agaagagcca ggaggtgtaa tcgtggggaa 1140 gtcaaagctg ggggtggaaa tgctgagcga gtcccagctg tcggatggca agaagttcaa 1200 gtgtgttaca agccacatag ttgggccaga gtcgggcgcc agctgcatgg tgcagatcag 1260 gggtccctcc cttctctctg agcccatgaa gacttgcttc actgggggca atgtgacgct 1320 tacatgccag gtgtctgggg cctacccccc tgccaagatc ctgtggctga ggaaccttac 1380 ccagcccgag gtgatcatcc agcctagcag ccgccatctc attacccagg atggccagaa 1440 ctccaccctc actatccaca actgctccca ggacctggat gagggctact acatctgccg 1500 agctgacagc cctgtagggg tgagggagat ggaaatctgg ctgagtgtga aagaaccttt 1560 aaatatcggg gggattgtgg gaaccattgt gagcctcctt ctgctgggac tggccattat 1620 ctcagggctt ctgttgcatt atagccctgt gttctgctgg aaagtaggaa acacttccag 1680 gggacaaaac atggatgatg tcatggtttt ggtggattca gaagaggaag aggaggagga 1740 ggaggaggag gaggaagatg ctgcagtagg ggaacaggag ggagcacgtg agagagagga 1800 gttgccaaaa gaaataccta agcaggacca cattcacaga gtgaccgcct tggtgaatgg 1860 gaacatagaa cagatgggaa atggattcca ggatcttcaa gatgacagca gtgaggagca 1920 aagtgacatt gttcaagaag aagacaggcc agtctgaaga agaggatggt ccatggttgt 1980 cttgctctga aagcttggag agctacattg aagacgagct cttcattcag ctttgactcc 2040 acctgcaccc ctggcggggg cttgcactaa caatgtttgg gtctcagcaa aaaacaaaac 2100 caagcacaca catctttcct tccatgtatt gaaaaacatt ggtttgattt gctctaagtt 2160 ttcccaatga tgtttaaaag ctttgagaag gaaagctgct ttggtgtctg aggtgccact 2220 tctgctgtga atcctggctt tatccaggtt gatctactgt gatagatgct gatttagagg 2280 gaacagaggt cagggaagca ctgggtcttg gtgccttttg ccgctttttt tttttttttt 2340 tttttttttt gagacggagt ctccctctat tgcccaggct ggagtgcaat ggcacgatct 2400 tggctcacca caacttctgc ctcccaggtt caagcgattc tcctgcctca gccacggcac 2460 cttgcaaata tcagctcctt ggaacaggtg aagttccagg taccaatgcc aatcagagga 2520 aggcagtttg gttcaggctt tggagttaga aacacctgaa gttgaatctg ggctctgttg 2580 cttccttctt tcatgggcta gagcacgact ctttacctct ctcttggcct caatttcctc 2640 acctgtaaaa tagatgagga agctgctcac ttattattgt ctcgttctga aagcttggaa 2700 agctacatcg aagatgagct cttcattcag ctttgatttg acctgtaccc ctggtggggg 2760 attgcactgg caacatttgg gtctcagcaa aaaaaccaag cacacacatc tttccttcca 2820 tttattgaaa aacatctttg taagatccat tcattgaaaa acataatcca tttattgaaa 2880 aatatctttg taagatcacc tgctaaatat gaaaatctga cttgaatttg tactctttaa 2940 agttgcgtat ctgctctagt gggcaggacc tagggcttaa aggggaactt cctttctcca 3000 tttctaagaa ctgggactct aaaatgagaa gctggttgtc tgaagtaacc ctgcaggtgt 3060 ggttggggaa ggtctgtttt cttggatgaa ggaactaaac taagcatatc agagcactgt 3120 cttaaccagt tttatttccc tggagataga attcttttaa aaagagttag ggagctggta 3180 ataggaagtg cctttcatta taactacatt ttgcagagct tcatatttat atacaagcct 3240 cctaggtgat acactgttag cttgcagact ttcctatgct tcatttctcc tgttgctttc 3300 aaagaaggca ggagacacgt ttaataacgg agtatctggt gataagaatt gcttgggcaa 3360 accagctcat ctggactctt tctcagtctt ggaagtggga agaggaaaac ttgtttcctt 3420 cctgcttctt aaggatattc tgagggtaca ctgatcaata acactaaatt tggaatgaaa 3480 ataccatgtg atgagtttag cctgctgatg cttccagtag atccttgtat agtttcaaga 3540 tttaagtttt ccgatttcat ataaatttct taaagtcgag gaccttataa gggtgcaatg 3600 gatgtttgct aaatatgaaa aactgacttg aatttgcact ctttaatgtt gcgtatctgc 3660 tctagtgggc aggacctagg gcttaaaggg gaacttcctt tctccaattc taggaactgg 3720 gactctaaaa tgagaagctg gttgtctgaa gtaaccctgc aggtgtggtt ggggaaggtc 3780 tgttttcttg gatgaaggga ttaaactgag caaatcacta gaagtatgcc ctgtcccctg 3840 ctcagaacac tggggagctc aagagtgggc tgcaatgtgc acccctcagg aatagctgtg 3900 aattgcaggt ctactggctt tttgcttttt gtcttttgct gcaaggtacc ccacgtactt 3960 aaccattctc aacagtgtaa atcagtgtca ttttagaatg agatactcag cttgcttcta 4020 aagtcactga attactgagt gagtctctcc tttagagtct tcggcaacca aattccagaa 4080 ttgaagagtc tactactcag aggcaacaag attaaaaaaa gaaaacacaa aaactgttga 4140 ggtgaaaaaa aaaaaaaacc ctagctagga acacagagaa tgttttgtag gatcactggg 4200 atattttcca caacttcctc ttctctagca cacacatctg ttgataggaa atatttgagg 4260 gtttttccac taccaaatgg gagcttcatg gtcctggtgt caaacactat aaacctttga 4320 ccagctgagc tgtgactgct gtcacatatc tgagtcctgt gtgcacagta atatcctggg 4380 tcaggtaaaa tccaggtctt caagttttaa ggattttttg aagaattcgg gcttctttaa 4440 gacgatccat gcccaaatcc acaagcttgt tgacagtgga ttacagtttg tgtggcaaag 4500 tccaagttgt tacactgtgc tttaaaaaaa atcttatctg catgtattgt taacttagag 4560 accatgagat ctatttatca ggaccaggaa gatacacact tcaggtccat tgcaactgac 4620 ttttttcttg tttttctcaa accctggtgg agcctgggaa gggggcctcc acaattctgt 4680 ggctttgata ttagccccaa ttctcacaag cacatacaag ccccataatt gccgcaggaa 4740 aacacaagat ggaaaattgc aataacccat gcactgagac ttagaaaatc atccttacta 4800 ggcaaaatgt attatgatgc aataagtgcc aactgatatt tctcacgttg ggactggcca 4860 ggaactgctg caaagaaaaa taagcagctc cttctccatt atttacattt taagatgtgg 4920 tggggggagg ttgggagaaa ttagttctga ggttatcata tgcctttttt aaaagataat 4980 ggaataaagc tatttttaag taa 5003                          SEQUENCE LISTING <110> Biomerieux   <120> Method and kit for determining in vitro the probability for an        individual to suffer from colorectal cancer <130> P2012813C <140> PCT / CN2012 / 072931 <141> 2012-03-23 <150> PCT / CN2011 / 072155 <151> 2011-3-25 <160> 11 <170> PatentIn version 3.3 <210> 1 <211> 22743 <212> DNA <213> Homo sapiens <400> 1 ggagttagcg acagggaggg atgcgcgcct gggtgtagtt gtgggggagg aagtggctag 60 ctcagggctt caggggacag acagggagag atgactgagt tagatgagac gagggggcgg 120 gctgggggtg cgagaaggaa gcttggcaag gagactaggt ctagggggac cacagtgggg 180 caggctgcat ggaaaatatc cgcagggtcc cccaggcaga acagccacgc tccaggccag 240 gctgtcccta ctgcctggtg gagggggaac ttgacctctg ggagggcgcc gctcttgcat 300 agctgagcga gcccgggtgc gctggtctgt gtggaaggag gaaggcaggg agaggtagaa 360 ggggtggagg agtcaggagg aataggccgc agcagccctg gaaatgatca ggaaggcagg 420 cagtgggtgc agggctgcag gagggccggg agggctaatc ttcaacttgt ccatgccagc 480 agcccctttt tttccagacc aagggctgtg aacccgcctg gggatgaggc ctggtcttgt 540 ggaactgaac ttagctcgac ggggctgacc gctctggccc agggtggtat gtaattttcg 600 ctcggcctgg gacggggccc aggccgggcc cagcctggtg gagcgtccag gtctgggtgc 660 gaagccaggc ccctgggcgg aggtgagggg tggtctgagg agtgatgtgg agttaaggcg 720 ccatcctcac cggtgactgg tgcggcacct agcatgtttg acaggcgggg actgcgaggc 780 acgctgctcg ggtgttgggg acaacattga ccaacgcttt attttccagg tggcagtgct 840 ccttttggac ttttctctag gtttggcgct aaactcttct tgtgagctca ctccacccct 900 tcttcctccc tttaacttat ccattcactt aaaacattac ctggtcatct ggtaagcccg 960 ggacagtaag ccgagtggct gttggagtcg gtattgttgg taatggtgga ggaagagagg 1020 ccttcccgct gaggctgggg tggggcggat cggtgttgct tgcctgcaga gagggtgggg 1080 agtgaatgtg cacccttggg tgggcctgca gccatccagc tgaaagttac aaaaatgctt 1140 catggaccgt ggtttgttac tatagtgttc ctcatggcga gcagatggaa ccgggagaca 1200 tggagtccct ggccagtgtg agtcctagca ttgcaggagg ggagaccctg gaggagag 1260 cccgcctcaa ttgatgcctg cagattgaat ttccagaggc ttaggaggag gaagttctcc 1320 aatgttctgt ttccaggcct tgctcaggaa gccctgtatt caggaggcta ccatttaaag 1380 tttgcagatg agcttatggg gggcaatctt aaaaagtcca cagcagatgc atccggctcg 1440 aggggccatc agctttgaat aaatgcttgt tccagagccc atgaatgcca gcaggcaccc 1500 ctcctttcct ggggtaaagg ttttcagatg ctgcatcttc taaattgagc ctccggtcat 1560 actagttttg tgcttggaac cttgcttcaa gaagatccct aagctgtaga acattttaac 1620 gttgatgcca caacgcagat tgatgccttg tagatggagc ttgcagatgg agccccgtga 1680 cctctcacct acccacctgt ttgcctgcct tcttgtgcgt ttctcggaga agttcttagc 1740 ctgatgaaat aacttggggc gttgaagagc tgtttaattt taaatgcctt agactgggga 1800 tatattagag gaagcagatt gtcaaattaa gggtgtcatt gtgttgtgct aaacgctggg 1860 agggtacaag ttggtcattc ctaaatctgt gtgtgagaaa tggcaggtct agtttgggca 1920 ttgtgattgc attgcagatt actaggagaa gggaatggtg ggtacaccgg tagtgctctt 1980 ttgttcttgc ttcgtttttt taaacttgaa ctttacttcg ttagatttca taatactttc 2040 ttggcattct agtaagagga ccctgaggtg ggagttgtgg gggacgggga gaaggggaca 2100 gcttggcacc ggtcccgtgg gcgttgcagt gtgggggatg ggggtatgca gcttggcact 2160 ggtactggga gggatgaggg tgaagaaggg gagagggttg gttagagata cagtgtgggt 2220 ggtgggggtg gtaggaaatg caggttgaag ggaattctct ggggctttgg ggaatttagt 2280 gcgtgggtga gccaagaaaa tactaattaa taatagtaag ttgttagtgt tggttaagtt 2340 gttgcttgga agtgagaagt tgcttagaaa ctttccaaag tgcttagaac tttaagtgca 2400 aacagacaaa ctaacaaaca aaaattgttt tgctttgcta caaggtgggg aagactgaag 2460 aagtgttaac tgaaaacagg tgacacagag tcaccagttt tccgagaacc aaagggaggg 2520 gtgtgtgatg ccatctcaca ggcaggggaa atgtctttac cagcttcctc ctggtggcca 2580 agacagcctg tttcagaggg ttgttttgtt tggggtgtgg gtgttatcaa gtgaattagt 2640 cacttgaaag atgggcgtca gacttgcata cgcagcagat cagcatcctt cgctgcccct 2700 tagcaactta ggtggttgat ttgaaactgt gaaggtgtga ttttttcagg agctggaagt 2760 cttagaaaag ccttgtaaat gcctatattg tgggctttta acgtatttaa gggaccactt 2820 aagacgagat tagatgggct cttctggatt tgttcctcat ttgtcacagg tgtcttgtga 2880 ttgaaaatca tgagcgaagt gaaattgcat tgaatttcaa gggaatttag tatgtaaatc 2940 gtgccttaga aacacatctg ttgtcttttc tgtgtttggt cgatattaat aatggcaaaa 3000 tttttgccta tctagtatct tcaaattgta gtctttgtaa caaccaaata accttttgtg 3060 gtcactgtaa aattaatatt tggtagacag aatccatgta cctttgctaa ggttagaatg 3120 aataatttat tgtattttta atttgaatgt ttgtgctttt taaatgagcc aagactagag 3180 gggaaactat cacctaaaat cagtttggaa aacaagacct aaaaagggaa ggggatgggg 3240 attgtgggga gagagtgggc gaggtgcctt tactacatgt gtgatctgaa aaccctgctt 3300 ggttctgagc tgcgtctatt gaattggtaa agtaatacca atggcttttt atcatttcct 3360 tcttcccttt aagtttcact tgaaatttta aaaatcatgg ttatttttat cgttgggatc 3420 tttctgtctt ctgggttcca ttttttaaat gtttaaaaat atgttgacat ggtagttcag 3480 ttcttaacca atgacttggg gatgatgcaa acaattactg tcgttgggat ttagagtgta 3540 ttagtcacgc atgtatgggg aagtagtctc gggtatgctg ttgtgaaatt gaaactgtaa 3600 aagtagatgg ttgaaagtac tggtatgttg ctctgtatgg taagaactaa ttctgttacg 3660 tcatgtacat aattactaat cacttttctt cccctttaca gcacaaataa agtttgagtt 3720 ctaaactcat tagaattgtt gtattgctat gttacatttc tcgaccccta tcacattgcc 3780 ttcataacga ctttggatgt atcttcatat tgtagattta ggtctagatt tgctagctcc 3840 aagtaattaa ggccatgtag gagagcatgg taaccacaga tagaactggt attatcccaa 3900 gtggtctgca gactgctgag tggggatggg atctgctctc tgttgagagt tggtaatcat 3960 tggtttgaaa tgtgatgaaa ccactcaagc caatgaaggt gggtgtgtag gtggggagta 4020 ctttgccata atattttaaa acattacctg gttagagttc taagtggtac ttatttttgt 4080 ttggttaggg gaaagcctga ataaaaacag aaatggacac ataatatgca tattccatag 4140 tctttgggag gctggaatgt gcctgggatt tgggtctaag tgtatgcgta attcttacct 4200 cactaaagaa tttgccttgt ttttttcctt ttggtgagtg actaaaacgt ctgggcttcc 4260 ctgtgtgcgt gctacagtaa gcaagcagag gctgtgcaaa ggtgtgagca ggatcacgtg 4320 gaatctggag gatacatctt ggcttgcaaa ctgcctctgt ctcctgggtg ggactgttct 4380 gtccttgcac tgctgttctg tgttacctct tggggtgtaa ggttttgctt acaggagaca 4440 aactttgggc gtagaatgga agccactgcc agcctctgtg ctgagaagga aggtgcttgt 4500 ttcaaaggga gcagcaaggg aggcttgttc tactcacctg ggcctgtttg cctgagaagg 4560 ggagataagg gctgaactgg gactagccag ggggaccaac acaaatggtg ggggatcatg 4620 acctgaagga ttctttcctt cccatgagct gcagggctgg ttgccgtcct tgcaactgtg 4680 tcttatttgc ctgtgccgtt atatcttggt gacccctcca cgtgtacact actgacaaac 4740 gggtggagtg ctggggagaa gtcactgtgc cgcccaccta gtaaaccttc tgtctgtgct 4800 catggcatct ccaagatggg gcactgctgt gtgcagaatc cagggtcctc tttctgcttg 4860 caactccttt ccctggatgc cccagaaaca atccaggcct cctttcctat cttacccctt 4920 tgctttgctt tttaccccag cacctctata accgccttct cttcttttca gaactccttg 4980 tttctcgtcc tgttttttat gattacaaaa ctcttgcttc caccctggaa gataactgct 5040 atagatgcct gtatgtaaat ggtgctgtct ccagcaactg gcatgctgaa gaagaattga 5100 ttcacggggt ataaatgttg gggattggaa gtggggatga aatggcactt gttgatacag 5160 gagcagagag gtgaggccga ctgctgaaga cagctcgcca ccctccttgc ctccactcca 5220 atccaggggc tggggccaca ttctttgcct tcatttatcc tcagatcagg tgagatcgac 5280 aggaggtgtt gatggcagtg ccagcaatta ttgctaatcc gtttgcatcc ttatgcatag 5340 atctgaattc agactttgtg aatttccaga ggtgtgggta atataataga attcagtgag 5400 tgggcatggc tgatcttgtg caaattaaaa gttatggggc ataagaatag caaaagttga 5460 acttctttta aaaaggaaag taccctgaga gccagtattg gttgaggctc ttcagtatgc 5520 ccaggttggc agcactgaga accgcaggaa cggcctgttg ttacaaaaag gagattgact 5580 cagctgccct tggtgcatct gactgactat gactgctgag agattccaag gacccttaat 5640 gccagggcta acctctccat gtgcagtgag acctctggag gaagtgtcat cctctggctt 5700 tgtgtggtac tcattatggt gcagtgcggg catgaaatga agacacccaa ataggcttac 5760 agatacgata tgttttaaat gttcgtattt aacaaaaaca tactgacact gtttggaaat 5820 ggcaacagga agatagcaaa atgaatacta acattacgaa aagatgaaca ggtacatgtt 5880 ccaaggcagg tggctgtgaa cttcctctga gtgaaggcat cccctccagc acctttcagc 5940 ctgctagtta ggacgacccg ccgccaccct ccaggacctc cagccctgca ctgcctttcc 6000 tctcttttaa ataattcttc attgagttct aatatgtaaa aaaaaaaagt ttactgtaaa 6060 gtttgcaaat aaggaaattt tttttaaaag tcctcagtaa tcttaccagt aacaattgtt 6120 atgggcacat ttgcttttgg aagatttctt ttgtatgcat gggataagta catttttaaa 6180 caaaaatggg attatgccat aaattctatt ttgtgacttt aatatatagt gaacaccttt 6240 tttaatgatg acaggatgtt cccttgcatg gctgtatcaa tttaaacaat cttgtttcaa 6300 tgggcataca gggtattttc tagttttttt ttcctcttag aaaataatac ttgcgatgac 6360 tttccttgta gctcagactt tttcacgtct gttgttatct ctttgggaat gctgaataca 6420 tacatttcga gaaggaaatg actgttaaac tcttaagact tcaggttcat attgctaaac 6480 tgcccagcag ggagggattt tttcaattag tgttctcact ggtgaggcaa acctgatgcc 6540 ttcccctctt cctcagaacc ggctttatca cattgaaaac ctttgctcct ccgacggatc 6600 gagtctgctt tccctctgga tgtgagcatt gctttgtctg ctggtgactg aacatctcta 6660 ccttgtgtca attggccatt tgtggtgtgt gtgtgtgtgc gtgtgtgtgt gtgtgtgtgt 6720 gtatgatttt ctaattccta gtcatttttc tattgattgt tttgcaaaag ccatttacat 6780 cttaaggata ttgataatct tttgttatat ttgatgcaaa tatttttttc cagtttatag 6840 gttgcctttt aattttgtgt ttcaggtaga taaaagttaa acgattttct taggttagtt 6900 tatcactgtg gtttctgaac ttgttatgtg tagatctttt ccaccccaag agtacataaa 6960 tattaatcca tactttctta tggaacttgt atggtttcgt tttttacatt taaaccttct 7020 tccccgtggt gtgtgttgtg gaatctgtgt ttgtgtgagg aggggcatgg tgctctcaga 7080 acccacctcc tgtggccaga gagccctgtc ctgtgagggt ggttgtcaca gtggcagggt 7140 tcaattcaga agaccttgag ggcaggctga tgtttcctga atgggcccct ggttgttgct 7200 tgtccctgac tctccatttc cccatctgag tggatttgga cctaataggg cactggagct 7260 ggttcgaatc ctgactggac tacttggcaa ctttatgtct gggagcaagt tacttaacct 7320 ccccaagcct gtgtctgtga aatgcgggta aatgaatgta gatgtttggc agcagctact 7380 ccttgttgag ctctcacagt gaactctcct gcctctgccc tccttccccg cctcccctgg 7440 tgcctagcgt caggtctagc cacttcctcc tgggcccctc tcccttttct gtggctggct 7500 gcctgcccgc ctggcgctgg acctttcatg taacgggaat cagcatgtat attctggtct 7560 ggtctgtttc tacacttaat tttgtttcca gtagtatttc cctgtaccgg cagagttcac 7620 aaacacattt gaagaggctt tttctcagga ttcttaacct tcccaaagga agtcccatgg 7680 atgggtttct agaagtctat aaatgctctg aaattgtatt tttctgtgga aagcataact 7740 ttcatctgct tgttcgtgct caaaaaagat catgaatgaa tgattgcatg attttatgcc 7800 attgtgctta tactaaagga tatgtagccc atctcttgag ctgttaaact gttttgacta 7860 ctttaaatcg tgcagctgtg agcatctctg taaatttagt gtacacatgt atcccctgga 7920 gtggcattgc ctcggcagtg agcacttatg gttttataac tctcttcaca gactcaaatg 7980 actccagaaa gctacacttc ctgttgtgag tatatgatat ccatttccct acatagccac 8040 taacatcagg tttttacaat tttatttatt tcttgctact ttaagaaatt tttgtggtga 8100 aatacatata atagaagttg actatctgaa tcatttttaa gtatacattc agtagtgtta 8160 agtatgtcgc cattgttgta caaccaatct ccagaacttt ttcatcttgc aaaacaaact 8220 ctgtacccat taaataacat taaacattcc attccctcca gcctcagcaa ccccattcta 8280 ctttctgttt ctgtgagttt gactattcca agcacttcat atcagttaaa tcatgaagta 8340 tttgtctgtc tgtgactggc ttatttctct gagcacagtg tcctcgagat gcgtctatgt 8400 tgtagcatat gtcagaattt ccttcctttt taaaagatcc aaataatatt cttattttat 8460 atcttttttt tatccattca tccattagtg gacacttggg ttgcttttgg ctattgtaaa 8520 taatggtgct atgtacaaat atctatatta ttgtatttac aagtataatg ctgtaatgta 8580 cacacatctt tttgagatcc taccttcagt tcttttgagt atatagccag aagtggtatt 8640 actaaatctt acgatatttc tatttttaat ttattgagga accactgtag tttttcatag 8700 caactgcacc attttacgtt ctcaccaaga gtgcacaagg gttccgaggt tcccacatcc 8760 tccccaacac ttgttatttt ctgctttttt tagattgcag ccatcatagt gggtgtgagg 8820 tgacatttca ttgtggtttt gatttgcatt tccctaatga ggagtgatgc tgagcatctt 8880 ttcatatgct tactggtcat ttgtatgttg tctttggaaa aatgtctatt caagtccttt 8940 gactatttta aaaattgggt tattagagtt atcgttgttg ttgacttgta ggagtttctt 9000 tctatattct ggatattaat cccctatcag atatatgatt tgcaaatatc ttctcttatt 9060 ccataaggtt actttttcac tttgttgatt gtgttctttg atgtatagaa gtttttagtt 9120 ttgaaatagt ctaatttatc tgtttttact tttgtggtct gtgcttttgg tgtcatatcc 9180 aagaaatcct tgccaaatcc aacgttataa ggtactttta aggtatttta gttgtcttag 9240 tctatatttc tgtactcacc tttctttatc cactcatcag ttgatgggca tgtaggttgg 9300 ttccatatct ttgcaattct gaattgtgct atgatcaggt gtctttttag tataatgatt 9360 tactctcctt tgggtagata cccagtagtg ggattgctgg atcgaatggt ttttataatt 9420 ttctatttta ccacagtttc tctctgcatt tttcctcttt gaccactaac catgtgaaat 9480 tctcatattg acctttataa tgatcatgaa ctcttagtat cattgggaag gccacatttg 9540 ccacttatga ttgtaaacct tatcctccat ttttcctgtt attgttggtg caaaaagcac 9600 ctattatacc aggactttaa aaatcagtct gataagtctt tgataagtct aataataata 9660 actgataagt ccattgaatt tgcttctgat tactttttct ttagtagcta aacatgtatg 9720 tactcctatg attacaatga acactcctct ccatttaaat taattattta cattgatgaa 9780 atagcaaaat gttaatgact aaatactgtc ttggtttttt cgttccaggt cagtcaatat 9840 taacttctta taattttctt ttttttcttt atgtgtgtgt gtgtgtgtat tttttttttt 9900 ttaatttcaa tggcttttgg ggtacaaatg gcttttggtc atatagatga attctacagt 9960 agtgaagtct gagattttac tgcaccggtc acctgagtag tgtacattgt acccaatatg 10020 tggtttttta taccttgccc ccctcttacc ctccccactt tgagtctcta gtgtccatta 10080 tgtcactctg tatacctttt tgtacccata agttagctct cacttataag tgagaacaca 10140 cagtatttgg ttttccattc ctgagttgct tcacttagaa taatatcctc cagctccatc 10200 caaaattgct gcaaaaaaaa aaaaaaccac aaacattatt ttgttctttt ttattgctaa 10260 gtcatattcc atggtgtaga gataccacat tttatttatc cactcactgg ttgatgggtt 10320 ggttccacat ctttgcaatt gtgacttgta ctgccatcaa gtgtctttct ggtataatga 10380 cttcttttcc tttgggtaga tacccaggag tgggattgct agatcaaatg gttcttaaca 10440 ttttctctct ggatctattt ctggaaattt taggctccag tttttgttgt tgttgttaat 10500 aaaatgcaat ggaatgtaat gatcatcact tttcattatg ctttaaaatc tggtaaatgg 10560 aggctagaac actcctgtaa ggcaagaata ttctctctgt tggaactcaa atacacagaa 10620 ctgggtaaat ctcaatctta atctttgatt caggacacaa catggctctc ttttacttgc 10680 tttctttaat tgttttttaa taatgtggta agcatttctg aatctcctat ccaatacaaa 10740 aactaggaca atacaca taactcctat ggttacaatg aacactcctc tccacttaaa 10800 ttaattattt acactgatga aattgaaata gcaaaatttt aatgactaaa tactgtcttt 10860 gattttttgt tccaggtctg tcaatattaa cttcttataa ttttcttttt ttttctttat 10920 gtgtgtgtgt gtgtgtgtat atatatatat ttaatttcaa tggcttttgg ggtacaaatg 10980 gcttttggtc atatatatga gttctacagt agtgaagtct gagattttac tacaccttcc 11040 acttatgtgg tcccacacca cccgcctccc ctgccgcctc ctgccacccc ctaggccaag 11100 gtaataatca tcctgaatcc tgggtttatc tctcacttgc tttcttttca tataattttg 11160 caaaagaatc tgatctaaat gtgtttttca gagtatatat ttatatttta gctgttctta 11220 gagaaaattt attattttgc atgtaatctt atggaacatt ctcatttaat accatggtaa 11280 gattcagccc ttgcccaggg gatagttcat ttagtttgtt tactggatag agctcatcat 11340 gtgactatac ctcagttagt ttatcagttc tcccatccat ggtgactagg ttgcctctca 11400 gcctctcaac aacactgttt ctcagtgtcc ttgtagaagt gatatgtggg tgttttctcc 11460 ttacacagag ttgaaaggtg acgacaacaa cgttggcact accaatcccc caccctccag 11520 aggggtaacc agtgttacca gtttgctgtg tttcctgcta cacctcgcct tattcacttc 11580 catttgtatc tgaaaaacgt gttgcatggt ttcttttcta tagaagtggt aaaatgctat 11640 tgtgtcctgt acattattga ttactttttt tcatttaaca gtagggagat gcctgggagt 11700 acacagagaa ctgccctcat tgttttcaac ttctgcactg tatgtctgtg agtttagcca 11760 ttctgctgtt aatggaaatt tacagtattc taatcttttg atattacaaa cagttctgtg 11820 cgatcatcgt catacacaac cccttgtgca caatgcatga gtgtttctca gggtaggtac 11880 caagaagtga aattcctggg tcatagggcg tgagtccgac atttttctcc attctgccct 11940 gttgccctcc agagtgggtg tccagctttg catacctaag tatgagagta tctgttgttc 12000 atatcctcta cgacgctcca tatatgaaac ttaagtttct gctagttgcc atctttgatc 12060 tatcatgtat gcagtgacct actaagactg taattggtac agtagattct tgtcatctgt 12120 gtgtgaattt agcattcatg ggcttaatgc tgacaaggcc cccagggtcc aagacatata 12180 atcatgtata attttgtcaa ggtataattt tttaaattgc ttttgtcatg tgtctgctgg 12240 tgatgcccaa cccagtgctc tgcacccagg tcacactgtg gctttgtcct ctgcttatgc 12300 ctgcattgca gcaactgtcc tgaagagacc aaaattatgc agatttaggt aagtccatgg 12360 ctaatgttat tatattatgt gctattgtaa tggatggggc tgtggagtgt atgaatttat 12420 aaatcactgg tcttgtaatt aaaattcaaa cactatagaa aaaggccatg tagaagataa 12480 aagttcctct ataatcccgg acccctaaga taactactaa tgacaacttc atttatattc 12540 cttcagacat tttctggctg tggatgtact aaaatgtatc ctattattct ctgccctaaa 12600 atggaatcat acaaggtgta ctgttatttt tatggctcta taacatgtca tattgtacgt 12660 gttggtatgg tcattttaac catttttcta gtgatggctt tgaggttatt tgcagtttcc 12720 tagccatctc aaagtgtgct gcggggatct cttttgcatc cctctgggtg cagagctgag 12780 gcacccagag gcagtgtcca gaggaggcag catctgtagg tgtcttcacc tgctctggct 12840 cttggcacat ctggttggtg acactgtttt gtgagatggg ttgaaagcac gtgctgccaa 12900 aatagaataa tgttggtcct ctcctcatgt gccgtggaac tggggtaaaa ctgcgtagtg 12960 gctgcagctg cctgtccata ccggaatcga gtataacacg gtgcctggct tagcacaaaa 13020 cagtagtggg tcctgcaggc cccagagtct aattcctggt attctttccc ctacacagat 13080 taaataaacc aaaaacaaac tattctagga aagcgtctgt gacatttgta aaaagtggta 13140 tttaatgatc ttttattcac ttgtctgttt agtttgttga aatcttaagt ggcatcctgg 13200 tctgggaagg agtgctgtct gcgcctgccc tccgctgggc acagcgtggc tgcttcaggg 13260 gctaagcaca cactttctgt cttctaaagg gccgccacat gccaggagct caggtgtgag 13320 cccggctctg gctcttacct catagggtca ctcatagggg cacagggagc agaacattgt 13380 acacagcgag gcaccacccg gcttggcatc tgcctcggtg gacttactac ctctagaagg 13440 aaatacctga gttcctctgg cctcagctcc tagagtgact ggtgtgctgt ccctgttact 13500 cttctgtcaa ggtgacaact gtgtgaccca tcatctgtgt gtcaaagcaa ggccctgcct 13560 gggcctctgc tcctgtgctg accccaaagg caaatgcttt gctagtttcc ttccagttaa 13620 tttcacctat gaatagatgt gtgaaaactg ttcaaagcca tacctgcaca tgtttgaact 13680 tcaaaccctg tgggtgattc agtggcatct ttctctaacc cccagcctcc cttcccacag 13740 aggccaccgt catggccagt tgctgcagtt tctttccaga gaacctgtgt atgtgtaaag 13800 ctgtacaggc gtgggtacac cacacagcct gtcttgcact gtggactgtt gagttactag 13860 tacatctagg taagcaccgc atatctgtat tcatgtctgc cttggtcttt tcaacatctg 13920 tgtggtagcc gtgtttgaat tacccattcc ctttttgggg aaccattaag ttgtttcagc 13980 aatttttact gtagataagg ctataccgca tatctgtgta catgggtttt tatgtacatg 14040 ggcaagtata tctgtgagag aaaagtttcc tcaggaggaa ttctgggcac agcatgtgta 14100 aatttctaaa tatgatggac acccccagct tccacctcaa ggaggttggt cccattgaca 14160 tttccccaca ccttcaccca ggctgtgccc ttaaacttgg ttatttgtca atgtgagaag 14220 tggaaaatag tatttaattg tagtttggat ttgtatttct attgggttgt atacttactg 14280 attaataata agagctcttt acatattaag gaaattaacc cttttcaaat acattcctat 14340 ttctcactaa tctttaagtt ttattgtaat attttgctct ttagtttata tatatatgta 14400 tatatatata tatgatatata tatatatata catatatata tacatatata tatactaatt 14460 ttcttttatg gttcctggat tttgtgagta gtttgaaaag gctaatccag ctgaagattt 14520 tgttgttgtt gttaaacccc atgttttctc ctaactcttt ttatttttat tttggaggac 14580 tctatctaga cttaatttta gcataacaag tgacagggtt agttagcctg ttgtccttac 14640 accattttct ggctaataca gctattaact attgatctgt ctattcacgt gccagttcct 14700 aatggtttta catagtgtaa tctgcacttc aaaatagcga agggaagccc tacctcatta 14760 ttctactttt ccagaattct cctggctatt ccaggctgca tgtttacctt aaccttccct 14820 gtgatgtctt catgccgttg tcttcttatg caagaataag gtacgtcttt ccatccactc 14880 acgtctattt aatttgactt tgcattacac agaaagctgg tcttggtctg tctacctcgg 14940 catctagttg tcctcactgc cccctagccg accccacccc atctgactga ctaccccatc 15000 acagagtact tttatttacg ttttgctctg cctaatggtt acttgatact gtcacgccga 15060 cagtgtccag ttcagtggtc tttgcagttg aaatgctccc gtacacactg tcttgttaaa 15120 aatgccagta agttcataca aacccagctt gcacccaagg tcacattcag agagcgtagg 15180 gctgggatgg gttgttttcc aagcttctgc cactgtgtgg ctagctcttc ccactgggaa 15240 gttctgtgta cccggaatgt cggagtggag tcctgttcta gtgtccagca cctgaccctg 15300 tgcccaaccc ctcaacagcc tattcctgct gtccacagcc tgctggaact ttttacaaaa 15360 tatgttgcca tgctggaccc tgggcactgg acataagccc cctggcagcc tttttcatgt 15420 cacccaaagg ggtaattgtc ctactggtgg tctgtaagat gagttagggt gacttgctaa 15480 tagacattgt aaatcttaat atttatgtat gtattttatt attaccggtt ttccatttat 15540 gatggtaata ttgtttcttc taagaatatt tatttttcct tctaaatatt gagataaaat 15600 tcatgctttt gaaatgttct attcagtggc ttttagtata tttgctatgt tgtgcaacca 15660 tcgacactat ccatttctag aactttttcg tcatcccaaa cagacgctct gtattcataa 15720 aaaaataact tcctacctgt ctctccccct agtctttggt aacctttgtt atactggtaa 15780 actttgttgt gctctctgtc tgtgtgaatt tgcctattct aggggcctca tataagtgta 15840 atcatacagt atttgtcttt ttgggtctgt ctgatttcac ttagcgggtt ttcagggttc 15900 attcatgttg cagcatataa cagtactgcg ttcctttttc tggctgaata atattccact 15960 gtatggatag accccatttt gtttattcac acatcatttg gacatttgga ttatttctgg 16020 tttttggcta ttatgaacaa tggtgctatg aacagttgcg tacaagtttt tgtgtgaaca 16080 tatgttttca attctctcat tatataccta ggagtagaat tactgggtca tatggtaact 16140 gtatattttt gaggaactgc caaactattt tcccacgtcc atgcaccatt tcacattccc 16200 accagtaagt aagagggttc caatttctgc gcattcttgc caacactagt tattatctga 16260 ctttctggtt ataatcattc taatgagtgt gaagtagcct ctggtgtcat ttggatttgc 16320 atttctctga tgagtgatgc tatcaagcac ctttgctggt gctgttggcc atatgtgtat 16380 gttccctgga gaagtgtctg tgctgagcct tggcccactt tttaattagg cgtttgtctt 16440 tttattactg agttgtaaga gttctttata tattctggat tctagaccct tatcagatac 16500 atggtttgca aatattttct cccattctgt gggttgtgtt ttcactttat cgataatgtc 16560 cttagacata taataaattt gtattttaaa agtgacttga tttggctgtg caaggtggct 16620 cacgcttgta atcccagcac tttgggagac tgaggtgggt ggatcatatg aggaggctag 16680 gagttcgagg tcagcctggc cagcatagcg aaaacttgtc tctactaaaa atacaaaaat 16740 tagtcaggca tggtggtgca cgtctgtaat accagcttct caggaggctg aggcacgagg 16800 atcacttgaa cccaggagga ggaggttgca gtgagctgag atcatgccag ggcaacagaa 16860 tgagactttg tttaaaaaaa aaaaaaagtg acttgattta agggaaaaaa tgactggcta 16920 tattcagtca gatatggcaa aaagtctcaa ggtgttaatg tgaatgatta aggtcttggg 16980 gggggtgtcc cctatcagac tacaggtgtt tagaggcaca gaaaaaggtg cagttgggtt 17040 cttaatgtga aatgatgaga agcacaactc cagtgtgtct ctttgtgtag aatgtcagca 17100 gacaccccct gctagatgtg ctggatcatg ggaaagcatt tccatttgtt actagattgt 17160 tcagaagttt taatttatga tgggtgtggt ggctcatgcc tgtagtccca gcactgtggg 17220 aggctgaggc aggaggatca tctgaggcca agagttcaag atcagcctgg gcaacatagt 17280 gataccctat ctcttaaaaa agaagaagtt tttaaatttg aaataataat aggtactgga 17340 tttatgcaaa tgtcttttct gcgtcttttg agatgagtat caggtttttt tttttccttt 17400 tatcatctga tgatgaactt aatgtttcca tttgtattaa tggaatacta agtccctctg 17460 tgatttctga accaagctat tcctaggcct gagttttatt ttgttgacac agaaataaat 17520 tagaaggcca agcgtggtgg catgtgcctg tagtcctagt tgctgaggta agaggattgc 17580 ttgagcccag gagttcaagg ctgcagcaag ctttgattgc gccactgcac tccagccttg 17640 gcgacagact aagacgctgt ctcaaaaaaa aacaaaaacg acaaaaaaaa aacaaaacag 17700 aaaaaataaa ctaaggcaat gacagtccct ggcaaatgct gggagggagg cagcagtggt 17760 cagggaaggt aaccctgaag caggacttgt aaagcaaata agattgggag gccaaggtgg 17820 gtggatcacg aggtcaggag ttcgagacca gcctggccaa catagtgaaa ccccgtcttt 17880 actaaaaata caaaaaaatt agccaggtgt ggtggtgggt gcctgtagtc ccagctactt 17940 gggaggctga ggcaggagaa tctcgaaccc aggaggcgga ggttacagtc agctgagacc 18000 gcaccattgc actccagcct gggtgacaga gcaagattcc gtctcaaaaa aaaaaaaaaa 18060 aaaaaaacca agaagaaaag gaatgaatta gaacttcttc tgcttggact taagggcatc 18120 atcaggcagg ttttgggtag gatagcaggg gaggcagaga catagtcggg gtcagtggtc 18180 atgagtgtgg ctttgagccc aaaaacttgg tttctgttcc ctactttgcc actcagtagt 18240 gcatgacttt ggccaaattt cttaaattca tgaagcaagt ttccgggtga atgaaatggg 18300 gataaaaata gtgttcaaac ctatccgttg gtttgtgtga aactgaaatg aatagtatcg 18360 tgcaggtact tgtgagcaag gggagctgct gtttcctgtc cctttatgat gggaaatatc 18420 tagacaagtt cccaaccctc tgcactgcag gctgcatggc acggagggtc ttgtaacacc 18480 agctggggct ggccttcttt taggagcttc agtggttctg aaaactttta tttgtttgtt 18540 tgttttagta gatgtggggt ctttctgtgt tgcccggact ggtctcaaac ttctggactc 18600 aagtgatcct cccccgctca acctcccaaa gtgttgggat tacaggtgtg agccactgtg 18660 cccagccttg aaaacttttt caggttcttc cagggttact gggctattaa atatttctat 18720 ttcattataa gtcagttttt caaagttata ttatcttaat tacctttttt atatgtatta 18780 gtgtagagta gcattttata ttttgatatc ctccttatgc atagtttttc actttttatt 18840 cctagttttt cgtttttaat aagactttca agaaatttat tttattggcc ttttgaaaaa 18900 agcagcttta gataaagtaa gcagttctgc tttcatttta taatttattt ctacttttgt 18960 ttcattaatc ttttcctccg gcatgccttg gattttgttg tgttactctt tttctagagg 19020 ctcgcattgt gtgtctggtt cacttatgat cacgcttgcc tacttttaag aatggaagag 19080 gggaggtgga gggtggctgc acagtcgagg gtgtgaggca gtcttgctct agccccacca 19140 tgccctcagc ccgctgtggc cacgctggtt cctcaattgc tggggcgtgc agtgtctgta 19200 agggaggcta ctgatgccat ccgaggaaga tgtaaggttt cgtgtgggca gcgagagcct 19260 agcaggcatg tggggtgccc agcaaagggt aacagtggac agttgttgcc tcattccaca 19320 gagttttgat tttttttttt tttttaatgg tcactccatc aacatccccc atggccagag 19380 cctgagctgg tccccagaga cacaggcatt cagctgacag cctcgccttc acgctgctgc 19440 tgttctcatg ggggacaggc ctcaggtggc aatgcacaaa tcattagtta agggcagttg 19500 tgacagttac caaggagtgt agtcccccgc cccccgccca gtgaaaacag ccctaaccag 19560 gggtggggac ctttgggctc tgacccgaag ggtaggagaa gctggaagga cagcattcct 19620 gtctgcgaag gcaggagcaa agctgccagg ctatgaagga aatggctgga gcctgaagtc 19680 atgcaagctg gggctggcag ggacagggcc aacttccagg cctgggggcc accatgagga 19740 ttcaggacgt gacccccagg gcacatgaag gccttccatc tgtatttaag aaaagacttt 19800 atcagacgag tatggtggct cacgcctgaa tcttagcact ttgggaggct gaggcaggtg 19860 gatcacgagg tcaggagttc aataccagcc tggccaatat ggtaaaaccc catctctact 19920 aaaactacaa aaattagcca ggcatggtgg cgcacgcctg tagtcccagc tactcgggag 19980 gctgaggcag aagaatcact tgaacccggg aggtggaggt tacagtgagc caagatcgcg 20040 ccactacact ccagcctggg tgacagagtg agactccgtc tcaaaaaaac caaaagactt 20100 tatcttattt cctatatgtt tgtggtttca gtcctgatgt ataatttgac cctagttaga 20160 atggttatct gaggaagtgg cctgtacgat ttctgctttt ttaaatgtgt ggctcccttt 20220 cttcattgat taacgtatga ttatttttat aaatgttcca tggcagtggg aagggattct 20280 ctgtcacatt ccacatctgg atcagttcct ccccattttg ttggtcaaat ccgatctgcc 20340 atatcctgtg taatgacaag tgagttgcat tctcaccgtc actcctgggg tctctccgct 20400 tcccctgagc tggctcagca gtctgctcca tgtgttttga tgcagggtga cccattggta 20460 ttcccgacac taacgccccc gtctgtggac tgcttgctgc ttgggcttca ctgtgtctgg 20520 tgttgacagt gcagacctaa aggtgtgcac acatgtgcac acacactccg ctgtcttctt 20580 gtttgcactg gacttaaata tctatgaggg ttattttcaa ctgctgaatt tggaatgatt 20640 tttatatctt ttctgctttc tgcccatgta catgtgttta ttttacactg ttgtgattgg 20700 tagttactat gtggggacac aattacttgg gctgaaataa tccacctgtt gtggttgggg 20760 tcctctgggg cattccaggg tgagaggttg tcactgccac ctgggccatg tgggccggca 20820 ccagcatttt gtggttacga attctacagt cacaaatatc tttgggcaaa tccccttcta 20880 tacctcaagg cagcttttgg tttgcaaccc cactggccag agggaagggc cagtcacttg 20940 gctctctcac tgccctgcgc cccagatggt tctagggctg ctgttttccc ttggccctgc 21000 caacaccact gtttttactt ctgctcattg gctgagtgca gtggttcctg gaagccagtg 21060 gcacgtttcc ccgcgtagct cgcttatccc acagcacaca cccaagggtt ctgttgctaa 21120 cacgctgaat taattctttg ctcatcttac agagtgtgtt ttgactgccc ccatttctga 21180 ggccttgtaa ggccagagct ttgttgcttc atcggcaggt tgggacttag atggccgtga 21240 atgtttcctc tctgctgctg cagtaagtaa gtgcccgcac catagtgtgt ttggaggctg 21300 aagttgaagc gaggctgtga ggggagatgg acgtgtgagg agggatgatg gggcttgagc 21360 aaagtggggg agggggcaaa ggcagttggc ccaacacatt ccccacccct ttgagaggtc 21420 tgaggcctgc agacctggct cggagcccac ctggtagtcc tcagactgtg tgtgtgtgtg 21480 tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtaaaag agagaagttg 21540 tggagaaatg gggggctgat tctgctcaga ttcatcagga tgagtagaag gcacccagct 21600 ctcaccctgg cctgacatgt gtgtccctga gcaggttaca gtcctctctg agcctctgct 21660 tcccatctgg accctgctgg gcagggcttc tgagctcctt agcactagca ggaggggctc 21720 caggggccct ccctccatgg cagccaggac aggactctca aatgaggaca gcagagctcg 21780 tggggggctc ccacggaccc gccgtgggcc caggggaggc agagcctgag ccaacagcag 21840 tggtgctgtg gaccgtggat cctgagggtg gcctggggca agtaccggct gagggtccag 21900 gtgggctttg tgtacctttg ggtcctgggg ccctggtgac ttggactcca ggttagagtc 21960 aagtgacagg agaaaggctg gtggggccct gtgcttccga cttcatttcg agtgatggca 22020 gttcccagga aggaatccac agctgacggt ggctgacaga tcagagaatg gaaggcgagg 22080 caggcgggcg tctgcgtgac ctcaggtgct tggggcccag cagacccaga gaaccatttc 22140 cactaggcca gggtgccgga agtgtccaca ggtcttagat tccctgttca gatgaaaaga 22200 tttgtgcctt taatgataaa agtgatctgc atagagtcaa aaattcaagc catgggtata 22260 aaatgcaagt aaaatccctg ccctcaccta tcccacccta ctacacagag atgtcctctc 22320 gagtttccta gactcactct ggaaatttct gtatacacac agaagcttgt gcctctgctc 22380 gtgaaggcag agggagggag agctgaaggg ccagcacctt ctcacctgtg ggccccctca 22440 gtgctcggtc ccagagcatg caggactgtg cctcgtgttc agtttgctgg tctgacttca 22500 tgctccttgg gcaggatatg catgtgccat gctaggagac atgtggatgt gaagctgggg 22560 gacaatgtcc cctggctatg cctttacaag ggaagtaagg aaggtaggag gtgagcctgg 22620 gagggaggga gggaggcgcg gagccgccgc aggtgtttct tttactgagt gcagcccatg 22680 gccgcactca ggttttgctt ttcaccttcc catctgtgaa agagtgagca ggaaaaagca 22740 aaa 22743 <210> 2 <211> 4833 <212> DNA <213> Homo sapiens <400> 2 gtgaactgtt gcaccgtgca attgcacact ataaatgtct ttccttatct gtgtgtactc 60 ttatctcact gttctatttt ttctcctcat ttatattaac tctttcttac ctttttttct 120 gaacttctag gccttctctt tccagaactg gtggaagaca aatgaaacgg ccaagatggt 180 aagaaacaag ccgcatttct ccttggggag actgataatt taaaaggttt gttgtgtcag 240 aaacattccc agcttcatca ccaacccttt ccttccacct ctgcccactg gagaccactt 300 atatcccgaa gcggacgcgg cagctgaagt caggaaacca tgcatcacat tagcaggagc 360 caactgcaga ctttaaactc cgttcaacat gtggatgcgg cagagaaatg acctgtccag 420 acaagccggg gcagctcata aactggttca tctgctccct gtgcgtcccg cgggtgcgta 480 agctctggag cagccggcgt ccaaggaccc ggagaaacct tctgctgggc actgcgtgtg 540 ccatctactt gggcttcctg gtgagccagg tggggagggc ctctctccag catggacagg 600 cggctgagaa ggggccacat cgcagccgcg acaccgccga gccatccttc cctgagatac 660 ccctggatgg taccctggcc cctccagagt cccagggcaa tgggtccact ctgcagccca 720 atgtggtgta cattacccta cgctccaagc gcagcaagcc ggccaatatc cgtggcaccg 780 tgaagcccaa gcgcaggaaa aagcatgcag tggcatcggc tgccccaggg caggaggctt 840 tggtcggacc atcccttcag ccgcaggaag cggcaaggga agctgatgct gtagcacctg 900 ggtacgctca gggagcaaac ctggttaaga ttggagagcg accctggagg ttggtgcggg 960 gtccgggagt gcgagccggg ggcccagact tcctgcagcc cagctccagg gagagcaaca 1020 ttaggatcta cagcgagagc gccccctcct ggctgagcaa agatgacatc cgaagaatgc 1080 gactcttggc ggacagcgca gtggcagggc tccggcctgt gtcctctagg agcggagccc 1140 gtttgctggt gctggagggg ggcgcacctg gcgctgtgct ccgctgtggc cctagcccct 1200 gtgggcttct caagcagccc ttggacatga gtgaggtgtt tgccttccac ctagacagga 1260 tcctggggct caacaggacc ctgccgtctg tgagcaggaa agcagagttc atccaagcag 1320 cagcagcagc gtgtctttcc atgcgcttgg cattctttat tttcccagcc tgggaggata 1380 tgagagttcc agggaaatgc tgtattggac atgcaagact cacctgggga acttatcagc 1440 agttgctgaa acagaaatgc tggcagaatg gccgagtacc caagcctgaa tcgggttgta 1500 ctgaaataca tcatcatgag tggtccaaga tggcactctt tgattttttg ttacagattt 1560 ataatcgctt agatacaaat tgctgtggat tcagacctcg caaggaagat gcctgtgtac 1620 agaatggatt gaggccaaaa tgtgatgacc aaggttctgc ggctctagca cacattatcc 1680 agcgaaagca tgacccaagg catttggttt ttatagacaa caagggtttc tttgacagga 1740 gtgaagataa cttaaacttc aaattgttag aaggcatcaa agagtttcca gcttctgcag 1800 tttctgtttt gaagagccag cacttacggc agaaacttct tcagtctctg tttcttgata 1860 aagtgtattg ggaaagtcaa ggaggtagac aaggaattga aaagcttatc gatgtaatag 1920 aacacagiqc caaaattctt atcacctata tcaatgcaca cggggtcaaa gtattaccta 1980 tgaatgaatg acaaaagaat cttctggcta gggtgttaga tatatttatg catttttggt 2040 tttgttttta aatcaagcac atcaacctca agcccgttta gcaatgaggc agtgtagatg 2100 aatacgtaaa ataaatgact ttaaccaagt agctataatg ggacttagca ctgtatgcat 2160 acttaaaaag gttttgaaaa acaaactact tgagaaatat ttgtttatat ttttctctaa 2220 catcatgcta tgtgtcagtc tgaacatctg acaacagaaa tttcagttat tattctagct 2280 aagttttgaa aacatttgtc atgctgttta atagaaaact gcaaaccaga gacactgact 2340 ccattaataa accatatttt gtgccgtttt gactgttctg accaaatact aatgggaaca 2400 attcttgacg tttttctgtt gctgattgtt aacatagagc agtctctaca ctaccctgag 2460 gcaactctac attggaacac tgaggcttac agcctgcaag agcatcagag ctgaccatac 2520 atttaaacag aaatgctggt ttatttgcaa aatcaccagt atattttcta ttgtgtctat 2580 aaaaaatcag tcatttaagt acaagaatca tattttccat tcctttttag aaatttattt 2640 tgttgtccct atggaaatca ttcacatctg acaatttata tgttaaagag ttttactctc 2700 tctattttgg tccaatttgt atctagtggc tgagaaatta aataattcta aagtatgaag 2760 ttacctatct gaaaatgtac ttacagagta tcattttaaa atggatgtct ctttaaaaat 2820 tttgttactt ttaccaacaa tgtaatataa tttatgtata ttttattaat aatagtgaat 2880 tccttaaaat ttgttctatg tacttatatt taatttgatt taatggttac tgcccagata 2940 ttgagaattg gttcaaatat tgagtgtgtt tcaatatatt atctggctta tttcaacatg 3000 agtaatatga gcaaaataag ttaaaacctg cgtctgatca attttcctca tgactagaac 3060 taaaacagta aatttggaca atattaagcc tcaaataatc atctccaaac tccttctaac 3120 actttttaaa tcagattgga agacatggac aaatcaggtt catgtgttgc atctttatgt 3180 cctttgccaa tatccaagat catcacatat ggtagatatt cacatggagt ttcaaattca 3240 gaatagatta ccattacctt cctgccctta cacatcctac tccttattta aaagttctat 3300 ttgtgacttt tcatttcctg aaagtttaaa aatacaattt gagaatgttt ataatacatt 3360 ctctcctgtc ttttcacggt tacgtctgtt attgctgaaa tacaccacat tttctttgtt 3420 ctggtcaagg ttaactcaat atctgtgtga aagagaacta ctaacaacgt tacaatagag 3480 gctagatttg aaaaaaaaaa tctatagatc taattgatac aattgtagaa caaaatgtca 3540 aaataatgtt ttaagtataa gagaagatgg accaaggaga gagagatcat ttgaaaatct 3600 aattgtagct tttctaggct cacattcatg tactactttt agcaccctta tgggctgtgc 3660 tcgccccctg gacagttgag ctttggatta tcttcctctt caattttccc tctattgacc 3720 cgagtgtctc cctctgcttc tacagattta tagtactcct tggctctttt gagtctccac 3780 ttttactcac tgtctctggg atttttaaga tccttttctt ctcttataaa tcatcctctt 3840 aatgaaaatt agcctaacaa aagtttggag actggaatcc tactttgagc cactgacttg 3900 aaataactct tttggcaagt tgcctgacat cctgtcttac caaggtggca tatttgcatt 3960 tttactgctt aaaacatttt ttttttttta ccatctttat ccaaatttat catattgatg 4020 gtaggactaa caggcttttt agaagctggc tttaactttg agtctcaagc tacaatgctg 4080 ttgggcagcc tggtcttccc acgtgagggt ttaactttgt ttatttgcct ccagttattc 4140 caaaatgctt attaaatgaa agtcccagga acatgtttat tttagtcacc tttgcttttt 4200 aacaattttg ttttgtaatc aatgagtaat tcatgatgaa ttatttttga ctaatggata 4260 gccgaaggcc aggcttttaa ttctaatagg taatgttctt cttttgtctt attgaaacaa 4320 tgagaatact ctgtgcattt caaatgcact ccgattatgc tgtggtttta ttcacataag 4380 cacaatatgt gttttattta taacttcata acaaacttat aatataataa tttaccttag 4440 cagacatgca aaagcttatt cttgtgtgac ttactttctt taagctaata atataaaaat 4500 aaatatgtat cttaaaaatc tataataaaa cattagaaat taaagatatg tgctttttat 4560 tttgcagatg agttcatttg cttttgtaga tgtgttttca gagctaggta cagaggaatg 4620 tttgctacct ttagcggtga aaaaagaaag agagtcaaga attttgttgg attgtgtttg 4680 tgtgtgcata tatttgatat catcattata tttgtaatct ttggacttgt aatcatagcc 4740 tgtttattct actgtgccat taaatatact ttaccttata cataacgaat aaaataccta 4800 gaagtagatt tatttacaaa aaaaaaaaaa aaa 4833 <210> 3 <211> 4854 <212> DNA <213> Homo sapiens <400> 3 gtttttaaaa gctttgtatc tcttaaaacc atgcagcagt cagtttccaa gttttgcttt 60 gcaatcagta gttttcaagg gagcttttaa agctgaactg aaatgtttga aatgtggaac 120 actcttgacc atgaaatatg ttctacttac atgcctcagc ctttaaaagt tctttgcatt 180 agagtcaagg attacattct tcctggagcc aagcatgggg ccagctgtaa acaagccgca 240 tttctccttg gggagactga taatttaaaa ggtttgttgt gtcagaaaca ttcccagctt 300 catcaccaac cctttccttc cacctctgcc cactggagac cacttatatc ccgaagcgga 360 cgcggcagct gaagtcagga aaccatgcat cacattagca ggagccaact gcagacttta 420 aactccgttc aacatgtgga tgcggcagag aaatgacctg tccagacaag ccggggcagc 480 tcataaactg gttcatctgc tccctgtgcg tcccgcgggt gcgtaagctc tggagcagcc 540 ggcgtccaag gacccggaga aaccttctgc tgggcactgc gtgtgccatc tacttgggct 600 tcctggtgag ccaggtgggg agggcctctc tccagcatgg acaggcggct gagaaggggc 660 cacatcgcag ccgcgacacc gccgagccat ccttccctga gatacccctg gatggtaccc 720 tggcccctcc agagtcccag ggcaatgggt ccactctgca gcccaatgtg gtgtacatta 780 ccctacgctc caagcgcagc aagccggcca atatccgtgg caccgtgaag cccaagcgca 840 ggaaaaagca tgcagtggca tcggctgccc cagggcagga ggctttggtc ggaccatccc 900 ttcagccgca ggaagcggca agggaagctg atgctgtagc acctgggtac gctcagggag 960 caaacctggt taagattgga gagcgaccct ggaggttggt gcggggtccg ggagtgcgag 1020 ccgggggccc agacttcctg cagcccagct ccagggagag caacattagg atctacagcg 1080 agagcgcccc ctcctggctg agcaaagatg acatccgaag aatgcgactc ttggcggaca 1140 gcgcagtggc agggctccgg cctgtgtcct ctaggagcgg agcccgtttg ctggtgctgg 1200 aggggggcgc acctggcgct gtgctccgct gtggccctag cccctgtggg cttctcaagc 1260 agcccttgga catgagtgag gtgtttgcct tccacctaga caggatcctg gggctcaaca 1320 ggaccctgcc gtctgtgagc aggaaagcag agttcatcca agatggccgc ccatgcccca 1380 tcattctttg ggatgcatct ttatcttcag caagtaatga cacccattct tctgttaagc 1440 tcacctgggg aacttatcag cagttgctga aacagaaatg ctggcagaat ggccgagtac 1500 ccaagcctga atcgggttgt actgaaatac atcatcatga gtggtccaag atggcactct 1560 ttgatttttt gttacagatt tataatcgct tagatacaaa ttgctgtgga ttcagacctc 1620 gcaaggaaga tgcctgtgta cagaatggat tgaggccaaa atgtgatgac caaggttctg 1680 cggctctagc acacattatc cagcgaaagc atgacccaag gcatttggtt tttatagaca 1740 acaagggttt ctttgacagg agtgaagata acttaaactt caaattgtta gaaggcatca 1800 aagagtttcc agcttctgca gtttctgttt tgaagagcca gcacttacgg cagaaacttc 1860 ttcagtctct gtttcttgat aaagtgtatt gggaaagtca aggaggtaga caaggaattg 1920 aaaagcttat cgatgtaata gaacacagag ccaaaattct tatcacctat atcaatgcac 1980 acggggtcaa agtattacct atgaatgaat gacaaaagaa tcttctggct agggtgttag 2040 atatatttat gcatttttgg ttttgttttt aaatcaagca catcaacctc aagcccgttt 2100 agcaatgagg cagtgtagat gaatacgtaa aataaatgac tttaaccaag tagctataat 2160 gggacttagc actgtatgca tacttaaaaa ggttttgaaa aacaaactac ttgagaaata 2220 tttgtttata tttttctcta acatcatgct atgtgtcagt ctgaacatct gacaacagaa 2280 atttcagtta ttattctagc taagttttga aaacatttgt catgctgttt aatagaaaac 2340 tgcaaaccag agacactgac tccattaata aaccatattt tgtgccgttt tgactgttct 2400 gaccaaatac taatgggaac aattcttgac gtttttctgt tgctgattgt taacatagag 2460 cagtctctac actaccctga ggcaactcta cattggaaca ctgaggctta cagcctgcaa 2520 gagcatcaga gctgaccata catttaaaca gaaatgctgg tttatttgca aaatcaccag 2580 tatattttct attgtgtcta taaaaaatca gtcatttaag tacaagaatc atattttcca 2640 ttccttttta gaaatttatt ttgttgtccc tatggaaatc attcacatct gacaatttat 2700 atgttaaaga gttttactct ctctattttg gtccaatttg tatctagtgg ctgagaaatt 2760 aaataattct aaagtatgaa gttacctatc tgaaaatgta cttacagagt atcattttaa 2820 aatggatgtc tctttaaaaa ttttgttact tttaccaaca atgtaatata atttatgtat 2880 attttattaa taatagtgaa ttccttaaaa tttgttctat gtacttatat ttaatttgat 2940 ttaatggtta ctgcccagat attgagaatt ggttcaaata ttgagtgtgt ttcaatatat 3000 tatctggctt atttcaacat gagtaatatg agcaaaataa gttaaaacct gcgtctgatc 3060 aattttcctc atgactagaa ctaaaacagt aaatttggac aatattaagc ctcaaataat 3120 catctccaaa ctccttctaa cactttttaa atcagattgg aagacatgga caaatcaggt 3180 tcatgtgttg catctttatg tcctttgcca atatccaaga tcatcacata tggtagatat 3240 tcacatggag tttcaaattc agaatagatt accattacct tcctgccctt acacatccta 3300 ctccttattt aaaagttcta tttgtgactt ttcatttcct gaaagtttaa aaatacaatt 3360 tgagaatgtt tataatacat tctctcctgt cttttcacgg ttacgtctgt tattgctgaa 3420 atacaccaca ttttctttgt tctggtcaag gttaactcaa tatctgtgtg aaagagaact 3480 actaacaacg ttacaataga ggctagattt gaaaaaaaaa atctatagat ctaattgata 3540 caattgtaga acaaaatgtc aaaataatgt tttaagtata agagaagatg gaccaaggag 3600 agagagatca tttgaaaatc taattgtagc ttttctaggc tcacattcat gtactacttt 3660 tagcaccctt atgggctgtg ctcgccccct ggacagttga gctttggatt atcttcctct 3720 tcaattttcc ctctattgac ccgagtgtct ccctctgctt ctacagattt atagtactcc 3780 ttggctcttt tgagtctcca cttttactca ctgtctctgg gatttttaag atccttttct 3840 tctcttataa atcatcctct taatgaaaat tagcctaaca aaagtttgga gactggaatc 3900 ctactttgag ccactgactt gaaataactc ttttggcaag ttgcctgaca tcctgtctta 3960 ccaaggtggc atatttgcat ttttactgct taaaacattt tttttttttt accatcttta 4020 tccaaattta tcatattgat ggtaggacta acaggctttt tagaagctgg ctttaacttt 4080 gagtctcaag ctacaatgct gttgggcagc ctggtcttcc cacgtgaggg tttaactttg 4140 tttatttgcc tccagttatt ccaaaatgct tattaaatga aagtcccagg aacatgttta 4200 ttttagtcac ctttgctttt taacaatttt gttttgtaat caatgagtaa ttcatgatga 4260 attatttttg actaatggat agccgaaggc caggctttta attctaatag gtaatgttct 4320 tcttttgtct tattgaaaca atgagaatac tctgtgcatt tcaaatgcac tccgattatg 4380 ctgtggtttt attcacataa gcacaatatg tgttttattt ataacttcat aacaaactta 4440 taatataata atttacctta gcagacatgc aaaagcttat tcttgtgtga cttactttct 4500 ttaagctaat aatataaaaa taaatatgta tcttaaaaat ctataataaa acattagaaa 4560 ttaaagatat gtgcttttta ttttgcagat gagttcattt gcttttgtag atgtgttttc 4620 agagctaggt acagaggaat gtttgctacc tttagcggtg aaaaaagaaa gagagtcaag 4680 aattttgttg gattgtgttt gtgtgtgcat atatttgata tcatcattat atttgtaatc 4740 tttggacttg taatcatagc ctgtttattc tactgtgcca ttaaatatac tttaccttat 4800 acataacgaa taaaatacct agaagtagat ttatttacaa aaaaaaaaaa aaaa 4854 <210> 4 <211> 4809 <212> DNA <213> Homo sapiens <400> 4 gtgaactgtt gcaccgtgca attgcacact ataaatgtct ttccttatct gtgtgtactc 60 ttatctcact gttctatttt ttctcctcat ttatattaac tctttcttac ctttttttct 120 gaacttctag gccttctctt tccagaactg gtggaagaca aatgaaacgg ccaagatggt 180 aagaaacaag ccgcatttct ccttggggag actgataatt taaaaggttt gttgtgtcag 240 aaacattccc agcttcatca ccaacccttt ccttccacct ctgcccactg gagaccactt 300 atatcccgaa gcggacgcgg cagctgaagt caggaaacca tgcatcacat tagcaggagc 360 caactgcaga ctttaaactc cgttcaacat gtggatgcgg cagagaaatg acctgtccag 420 acaagccggg gcagctcata aactggttca tctgctccct gtgcgtcccg cgggtgcgta 480 agctctggag cagccggcgt ccaaggaccc ggagaaacct tctgctgggc actgcgtgtg 540 ccatctactt gggcttcctg gtgagccagg tggggagggc ctctctccag catggacagg 600 cggctgagaa ggggccacat cgcagccgcg acaccgccga gccatccttc cctgagatac 660 ccctggatgg taccctggcc cctccagagt cccagggcaa tgggtccact ctgcagccca 720 atgtggtgta cattacccta cgctccaagc gcagcaagcc ggccaatatc cgtggcaccg 780 tgaagcccaa gcgcaggaaa aagcatgcag tggcatcggc tgccccaggg caggaggctt 840 tggtcggacc atcccttcag ccgcaggaag cggcaaggga agctgatgct gtagcacctg 900 ggtacgctca gggagcaaac ctggttaaga ttggagagcg accctggagg ttggtgcggg 960 gtccgggagt gcgagccggg ggcccagact tcctgcagcc cagctccagg gagagcaaca 1020 ttaggatcta cagcgagagc gccccctcct ggctgagcaa agatgacatc cgaagaatgc 1080 gactcttggc ggacagcgca gtggcagggc tccggcctgt gtcctctagg agcggagccc 1140 gtttgctggt gctggagggg ggcgcacctg gcgctgtgct ccgctgtggc cctagcccct 1200 gtgggcttct caagcagccc ttggacatga gtgaggtgtt tgccttccac ctagacagga 1260 tcctggggct caacaggacc ctgccgtctg tgagcaggaa agcagagttc atccaagatg 1320 gccgcccatg ccccatcatt ctttgggatg catctttatc ttcagcaagt aatgacaccc 1380 attcttctgt taagctcacc tggggaactt atcagcagtt gctgaaacag aaatgctggc 1440 agaatggccg agtacccaag cctgaatcgg gttgtactga aatacatcat catgagtggt 1500 ccaagatggc actctttgat tttttgttac agatttataa tcgcttagat acaaattgct 1560 gtggattcag acctcgcaag gaagatgcct gtgtacagaa tggattgagg ccaaaatgtg 1620 atgaccaagg ttctgcggct ctagcacaca ttatccagcg aaagcatgac ccaaggcatt 1680 tggtttttat agacaacaag ggtttctttg acaggagtga agataactta aacttcaaat 1740 tgttagaagg catcaaagag tttccagctt ctgcagtttc tgttttgaag agccagcact 1800 tacggcagaa acttcttcag tctctgtttc ttgataaagt gtattgggaa agtcaaggag 1860 gtagacaagg aattgaaaag cttatcgatg taatagaaca cagagccaaa attcttatca 1920 cctatatcaa tgcacacggg gtcaaagtat tacctatgaa tgaatgacaa aagaatcttc 1980 tggctagggt gttagatata tttatgcatt tttggttttg tttttaaatc aagcacatca 2040 acctcaagcc cgtttagcaa tgaggcagtg tagatgaata cgtaaaataa atgactttaa 2100 ccaagtagct ataatgggac ttagcactgt atgcatactt aaaaaggttt tgaaaaacaa 2160 actacttgag aaatatttgt ttatattttt ctctaacatc atgctatgtg tcagtctgaa 2220 catctgacaa cagaaatttc agttattatt ctagctaagt tttgaaaaca tttgtcatgc 2280 tgtttaatag aaaactgcaa accagagaca ctgactccat taataaacca tattttgtgc 2340 cgttttgact gttctgacca aatactaatg ggaacaattc ttgacgtttt tctgttgctg 2400 attgttaaca tagagcagtc tctacactac cctgaggcaa ctctacattg gaacactgag 2460 gcttacagcc tgcaagagca tcagagctga ccatacattt aaacagaaat gctggtttat 2520 ttgcaaaatc accagtatat tttctattgt gtctataaaa aatcagtcat ttaagtacaa 2580 gaatcatatt ttccattcct ttttagaaat ttattttgtt gtccctatgg aaatcattca 2640 catctgacaa tttatatgtt aaagagtttt actctctcta ttttggtcca atttgtatct 2700 agtggctgag aaattaaata attctaaagt atgaagttac ctatctgaaa atgtacttac 2760 agagtatcat tttaaaatgg atgtctcttt aaaaattttg ttacttttac caacaatgta 2820 atataattta tgtatatttt attaataata gtgaattcct taaaatttgt tctatgtact 2880 tatatttaat ttgatttaat ggttactgcc cagatattga gaattggttc aaatattgag 2940 tgtgtttcaa tatattatct ggcttatttc aacatgagta atatgagcaa aataagttaa 3000 aacctgcgtc tgatcaattt tcctcatgac tagaactaaa acagtaaatt tggacaatat 3060 taagcctcaa ataatcatct ccaaactcct tctaacactt tttaaatcag attggaagac 3120 atggacaaat caggttcatg tgttgcatct ttatgtcctt tgccaatatc caagatcatc 3180 acatatggta gatattcaca tggagtttca aattcagaat agattaccat taccttcctg 3240 cccttacaca tcctactcct tatttaaaag ttctatttgt gacttttcat ttcctgaaag 3300 tttaaaaata caatttgaga atgtttataa tacattctct cctgtctttt cacggttacg 3360 tctgttattg ctgaaataca ccacattttc tttgttctgg tcaaggttaa ctcaatatct 3420 gtgtgaaaga gaactactaa caacgttaca atagaggcta gatttgaaaa aaaaaatcta 3480 tagatctaat tgatacaatt gtagaacaaa atgtcaaaat aatgttttaa gtataagaga 3540 agatggacca aggagagaga gatcatttga aaatctaatt gtagcttttc taggctcaca 3600 ttcatgtact acttttagca cccttatggg ctgtgctcgc cccctggaca gttgagcttt 3660 ggattatctt cctcttcaat tttccctcta ttgacccgag tgtctccctc tgcttctaca 3720 gatttatagt actccttggc tcttttgagt ctccactttt actcactgtc tctgggattt 3780 ttaagatcct tttcttctct tataaatcat cctcttaatg aaaattagcc taacaaaagt 3840 ttggagactg gaatcctact ttgagccact gacttgaaat aactcttttg gcaagttgcc 3900 tgacatcctg tcttaccaag gtggcatatt tgcattttta ctgcttaaaa catttttttt 3960 tttttaccat ctttatccaa atttatcata ttgatggtag gactaacagg ctttttagaa 4020 gctggcttta actttgagtc tcaagctaca atgctgttgg gcagcctggt cttcccacgt 4080 gagggtttaa ctttgtttat ttgcctccag ttattccaaa atgcttatta aatgaaagtc 4140 ccaggaacat gtttatttta gtcacctttg ctttttaaca attttgtttt gtaatcaatg 4200 agtaattcat gatgaattat ttttgactaa tggatagccg aaggccaggc ttttaattct 4260 aataggtaat gttcttcttt tgtcttattg aaacaatgag aatactctgt gcatttcaaa 4320 tgcactccga ttatgctgtg gttttattca cataagcaca atatgtgttt tatttataac 4380 ttcataacaa acttataata taataattta ccttagcaga catgcaaaag cttattcttg 4440 tgtgacttac tttctttaag ctaataatat aaaaataaat atgtatctta aaaatctata 4500 ataaaacatt agaaattaaa gatatgtgct ttttattttg cagatgagtt catttgcttt 4560 tgtagatgtg ttttcagagc taggtacaga ggaatgtttg ctacctttag cggtgaaaaa 4620 agaaagagag tcaagaattt tgttggattg tgtttgtgtg tgcatatatt tgatatcatc 4680 attatatttg taatctttgg acttgtaatc atagcctgtt tattctactg tgccattaaa 4740 tatactttac cttatacata acgaataaaa tacctagaag tagatttatt tacaaaaaaa 4800 aaaaaaaaa 4809 <210> 5 <211> 4745 <212> DNA <213> Homo sapiens <400> 5 ttttctgccc ttctttgctt tggtggcttc cttgtggttc ctcagtggtg cctgcaaccc 60 ctggttcacc tccttccagg ttctggctcc ttccagccat ggctctcaga gtccttctgt 120 taacagcctt gaccttatgt catgggttca acttggacac tgaaaacgca atgaccttcc 180 aagagaacgc aaggggcttc gggcagagcg tggtccagct tcagggatcc agggtggtgg 240 ttggagcccc ccaggagata gtggctgcca accaaagggg cagcctctac cagtgcgact 300 acagcacagg ctcatgcgag cccatccgcc tgcaggtccc cgtggaggcc gtgaacatgt 360 ccctgggcct gtccctggca gccaccacca gcccccctca gctgctggcc tgtggtccca 420 ccgtgcacca gacttgcagt gagaacacgt atgtgaaagg gctctgcttc ctgtttggat 480 ccaacctacg gcagcagccc cagaagttcc cagaggccct ccgagggtgt cctcaagagg 540 atagtgacat tgccttcttg attgatggct ctggtagcat catcccacat gactttcggc 600 ggatgaagga gtttgtctca actgtgatgg agcaattaaa aaagtccaaa accttgttct 660 ctttgatgca gtactctgaa gaattccgga ttcactttac cttcaaagag ttccagaaca 720 accctaaccc aagatcactg gtgaagccaa taacgcagct gcttgggcgg acacacacgg 780 ccacgggcat ccgcaaagtg gtacgagagc tgtttaacat caccaacgga gcccgaaaga 840 atgcctttaa gatcctagtt gtcatcacgg atggagaaaa gtttggcgat cccttgggat 900 atgaggatgt catccctgag gcagacagag agggagtcat tcgctacgtc attggggtgg 960 gagatgcctt ccgcagtgag aaatcccgcc aagagcttaa taccatcgca tccaagccgc 1020 ctcgtgatca cgtgttccag gtgaataact ttgaggctct gaagaccatt cagaaccagc 1080 ttcgggagaa gatctttgcg atcgagggta ctcagacagg aagtagcagc tcctttgagc 1140 atgagatgtc tcaggaaggc ttcagcgctg ccatcacctc taatggcccc ttgctgagca 1200 ctgtggggag ctatgactgg gctggtggag tctttctata tacatcaaag gagaaaagca 1260 ccttcatcaa catgaccaga gtggattcag acatgaatga tgcttacttg ggttatgctg 1320 ccgccatcat cttacggaac cgggtgcaaa gcctggttct gggggcacct cgatatcagc 1380 acatcggcct ggtagcgatg ttcaggcaga acactggcat gtgggagtcc aacgctaatg 1440 tcaagggcac ccagatcggc gcctacttcg gggcctccct ctgctccgtg gacgtggaca 1500 gcaacggcag caccgacctg gtcctcatcg gggcccccca ttactacgag cagacccgag 1560 ggggccaggt gtccgtgtgc cccttgccca gggggcagag ggctcggtgg cagtgtgatg 1620 ctgttctcta cggggagcag ggccaaccct ggggccgctt tggggcagcc ctaacagtgc 1680 tgggggacgt aaatggggac aagctgacgg acgtggccat tggggcccca ggagaggagg 1740 acaaccgggg tgctgtttac ctgtttcacg gaacctcagg atctggcatc agcccctccc 1800 atagccagcg gatagcaggc tccaagctct ctcccaggct ccagtatttt ggtcagtcac 1860 tgagtggggg ccaggacctc acaatggatg gactggtaga cctgactgta ggagcccagg 1920 ggcacgtgct gctgctcagg tcccagccag tactgagagt caaggcaatc atggagttca 1980 atcccaggga agtggcaagg aatgtatttg agtgtaatga tcaggtggtg aaaggcaagg 2040 aagccggaga ggtcagagtc tgcctccatg tccagaagag cacacgggat cggctaagag 2100 aaggacagat ccagagtgtt gtgacttatg acctggctct ggactccggc cgcccacatt 2160 cccgcgccgt cttcaatgag acaaagaaca gcacacgcag acagacacag gtcttggggc 2220 tgacccagac ttgtgagacc ctgaaactac agttgccgaa ttgcatcgag gacccagtga 2280 gccccattgt gctgcgcctg aacttctctc tggtgggaac gccattgtct gctttcggga 2340 acctccggcc agtgctggcg gaggatgctc agagactctt cacagccttg tttccctttg 2400 agaagaattg tggcaatgac aacatctgcc aggatgacct cagcatcacc ttcagtttca 2460 tgagcctgga ctgcctcgtg gtgggtgggc cccgggagtt caacgtgaca gtgactgtga 2520 gaaatgatgg tgaggactcc tacaggacac aggtcacctt cttcttcccg cttgacctgt 2580 cctaccggaa ggtgtccacg ctccagaacc agcgctcaca gcgatcctgg cgcctggcct 2640 gtgagtctgc ctcctccacc gaagtgtctg gggccttgaa gagcaccagc tgcagcataa 2700 accaccccat cttcccggaa aactcagagg tcacctttaa tatcacgttt gatgtagact 2760 ctaaggcttc ccttggaaac aaactgctcc tcaaggccaa tgtgaccagt gagaacaaca 2820 tgcccagaac caacaaaacc gaattccaac tggagctgcc ggtgaaatat gctgtctaca 2880 tggtggtcac cagccatggg gtctccacta aatatctcaa cttcacggcc tcagagaata 2940 ccagtcgggt catgcagcat caatatcagg tcagcaacct ggggcagagg agcctcccca 3000 tcagcctggt gttcttggtg cccgtccggc tgaaccagac tgtcatatgg gaccgccccc 3060 aggtcacctt ctccgagaac ctctcgagta cgtgccacac caaggagcgc ttgccctctc 3120 actccgactt tctggctgag cttcggaagg cccccgtggt gaactgctcc atcgctgtct 3180 gccagagaat ccagtgtgac atcccgttct ttggcatcca ggaagaattc aatgctaccc 3240 tcaaaggcaa cctctcgttt gactggtaca tcaagacctc gcataaccac ctcctgatcg 3300 tgagcacagc tgagatcttg tttaacgatt ccgtgttcac cctgctgccg ggacaggggg 3360 cgtttgtgag gtcccagacg gagaccaaag tggagccgtt cgaggtcccc aaccccctgc 3420 cgctcatcgt gggcagctct gtcgggggac tgctgctcct ggccctcatc accgccgcgc 3480 tgtacaagct cggcttcttc aagcggcaat acaaggacat gatgagtgaa gggggtcccc 3540 cgggggccga accccagtag cggctccttc ccgacagagc tgcctctcgg tggccagcag 3600 gactctgccc agaccacacg tagcccccag gctgctggac acgtcggaca gcgaagtatc 3660 cccgacagga cgggcttggg cttccatttg tgtgtgtgca agtgtgtatg tgcgtgtgtg 3720 caagtgtctg tgtgcaagtg tgtgcacatg tgtgcgtgtg cgtgcatgtg cacttgcacg 3780 cccatgtgtg agtgtgtgca agtatgtgag tgtgtccaag tgtgtgtgcg tgtgtccatg 3840 tgtgtgcaag tgtgtgcatg tgtgcgagtg tgtgcatgtg tgtgctcagg ggcgtgtggc 3900 tcacgtgtgt gactcagatg tctctggcgt gtgggtaggt gacggcagcg tagcctctcc 3960 ggcagaaggg aactgcctgg gctcccttgt gcgtgggtga agccgctgct gggttttcct 4020 ccgggagagg ggacggtcaa tcctgtgggt gaagacagag ggaaacacag cagcttctct 4080 ccactgaaag aagtgggact tcccgtcgcc tgcgagcctg cggcctgctg gagcctgcgc 4140 agcttggatg gagactccat gagaagccgt gggtggaacc aggaacctcc tccacaccag 4200 cgctgatgcc caataaagat gcccactgag gaatgatgaa gcttcctttc tggattcatt 4260 tattatttca atgtgacttt aattttttgg atggataagc ttgtctatgg tacaaaaatc 4320 acaaggcatt caagtgtaca gtgaaaagtc tccctttcca gatattcaag tcacctcctt 4380 aaaggtagtc aagattgtgt tttgaggttt ccttcagaca gattccaggc gatgtgcaag 4440 tgtatgcacg tgtgcacaca caccacacat acacacacac aagctttttt acacaaatgg 4500 tagcatactt tatattggtc tgtatcttgc tttttttcac caatatttct cagacatcgg 4560 ttcatattaa gacataaatt actttttcat tcttttatac cgctgcatag tattccattg 4620 tgtgagtgta ccataatgta tttaaccagt cttcttttga tatactattt tcattctctt 4680 gttattgcat caatgctgag ttaataaatc aaatatatgt catttttgca tatatgtaag 4740 gataa 4745 <210> 6 <211> 4742 <212> DNA <213> Homo sapiens <400> 6 ttttctgccc ttctttgctt tggtggcttc cttgtggttc ctcagtggtg cctgcaaccc 60 ctggttcacc tccttccagg ttctggctcc ttccagccat ggctctcaga gtccttctgt 120 taacagcctt gaccttatgt catgggttca acttggacac tgaaaacgca atgaccttcc 180 aagagaacgc aaggggcttc gggcagagcg tggtccagct tcagggatcc agggtggtgg 240 ttggagcccc ccaggagata gtggctgcca accaaagggg cagcctctac cagtgcgact 300 acagcacagg ctcatgcgag cccatccgcc tgcaggtccc cgtggaggcc gtgaacatgt 360 ccctgggcct gtccctggca gccaccacca gcccccctca gctgctggcc tgtggtccca 420 ccgtgcacca gacttgcagt gagaacacgt atgtgaaagg gctctgcttc ctgtttggat 480 ccaacctacg gcagcagccc cagaagttcc cagaggccct ccgagggtgt cctcaagagg 540 atagtgacat tgccttcttg attgatggct ctggtagcat catcccacat gactttcggc 600 ggatgaagga gtttgtctca actgtgatgg agcaattaaa aaagtccaaa accttgttct 660 ctttgatgca gtactctgaa gaattccgga ttcactttac cttcaaagag ttccagaaca 720 accctaaccc aagatcactg gtgaagccaa taacgcagct gcttgggcgg acacacacgg 780 ccacgggcat ccgcaaagtg gtacgagagc tgtttaacat caccaacgga gcccgaaaga 840 atgcctttaa gatcctagtt gtcatcacgg atggagaaaa gtttggcgat cccttgggat 900 atgaggatgt catccctgag gcagacagag agggagtcat tcgctacgtc attggggtgg 960 gagatgcctt ccgcagtgag aaatcccgcc aagagcttaa taccatcgca tccaagccgc 1020 ctcgtgatca cgtgttccag gtgaataact ttgaggctct gaagaccatt cagaaccagc 1080 ttcgggagaa gatctttgcg atcgagggta ctcagacagg aagtagcagc tcctttgagc 1140 atgagatgtc tcaggaaggc ttcagcgctg ccatcacctc taatggcccc ttgctgagca 1200 ctgtggggag ctatgactgg gctggtggag tctttctata tacatcaaag gagaaaagca 1260 ccttcatcaa catgaccaga gtggattcag acatgaatga tgcttacttg ggttatgctg 1320 ccgccatcat cttacggaac cgggtgcaaa gcctggttct gggggcacct cgatatcagc 1380 acatcggcct ggtagcgatg ttcaggcaga acactggcat gtgggagtcc aacgctaatg 1440 tcaagggcac ccagatcggc gcctacttcg gggcctccct ctgctccgtg gacgtggaca 1500 gcaacggcag caccgacctg gtcctcatcg gggcccccca ttactacgag cagacccgag 1560 ggggccaggt gtccgtgtgc cccttgccca gggggagggc tcggtggcag tgtgatgctg 1620 ttctctacgg ggagcagggc caaccctggg gccgctttgg ggcagcccta acagtgctgg 1680 gggacgtaaa tggggacaag ctgacggacg tggccattgg ggccccagga gaggaggaca 1740 accggggtgc tgtttacctg tttcacggaa cctcaggatc tggcatcagc ccctcccata 1800 gccagcggat agcaggctcc aagctctctc ccaggctcca gtattttggt cagtcactga 1860 gtgggggcca ggacctcaca atggatggac tggtagacct gactgtagga gcccaggggc 1920 acgtgctgct gctcaggtcc cagccagtac tgagagtcaa ggcaatcatg gagttcaatc 1980 ccagggaagt ggcaaggaat gtatttgagt gtaatgatca ggtggtgaaa ggcaaggaag 2040 ccggagaggt cagagtctgc ctccatgtcc agaagagcac acgggatcgg ctaagagaag 2100 gacagatcca gagtgttgtg acttatgacc tggctctgga ctccggccgc ccacattccc 2160 gcgccgtctt caatgagaca aagaacagca cacgcagaca gacacaggtc ttggggctga 2220 cccagacttg tgagaccctg aaactacagt tgccgaattg catcgaggac ccagtgagcc 2280 ccattgtgct gcgcctgaac ttctctctgg tgggaacgcc attgtctgct ttcgggaacc 2340 tccggccagt gctggcggag gatgctcaga gactcttcac agccttgttt ccctttgaga 2400 agaattgtgg caatgacaac atctgccagg atgacctcag catcaccttc agtttcatga 2460 gcctggactg cctcgtggtg ggtgggcccc gggagttcaa cgtgacagtg actgtgagaa 2520 atgatggtga ggactcctac aggacacagg tcaccttctt cttcccgctt gacctgtcct 2580 accggaaggt gtccacgctc cagaaccagc gctcacagcg atcctggcgc ctggcctgtg 2640 agtctgcctc ctccaccgaa gtgtctgggg ccttgaagag caccagctgc agcataaacc 2700 accccatctt cccggaaaac tcagaggtca cctttaatat cacgtttgat gtagactcta 2760 aggcttccct tggaaacaaa ctgctcctca aggccaatgt gaccagtgag aacaacatgc 2820 ccagaaccaa caaaaccgaa ttccaactgg agctgccggt gaaatatgct gtctacatgg 2880 tggtcaccag ccatggggtc tccactaaat atctcaactt cacggcctca gagaatacca 2940 gtcgggtcat gcagcatcaa tatcaggtca gcaacctggg gcagaggagc ctccccatca 3000 gcctggtgtt cttggtgccc gtccggctga accagactgt catatgggac cgcccccagg 3060 tcaccttctc cgagaacctc tcgagtacgt gccacaccaa ggagcgcttg ccctctcact 3120 ccgactttct ggctgagctt cggaaggccc ccgtggtgaa ctgctccatc gctgtctgcc 3180 agagaatcca gtgtgacatc ccgttctttg gcatccagga agaattcaat gctaccctca 3240 aaggcaacct ctcgtttgac tggtacatca agacctcgca taaccacctc ctgatcgtga 3300 gcacagctga gatcttgttt aacgattccg tgttcaccct gctgccggga cagggggcgt 3360 ttgtgaggtc ccagacggag accaaagtgg agccgttcga ggtccccaac cccctgccgc 3420 tcatcgtggg cagctctgtc gggggactgc tgctcctggc cctcatcacc gccgcgctgt 3480 acaagctcgg cttcttcaag cggcaataca aggacatgat gagtgaaggg ggtcccccgg 3540 gggccgaacc ccagtagcgg ctccttcccg acagagctgc ctctcggtgg ccagcaggac 3600 tctgcccaga ccacacgtag cccccaggct gctggacacg tcggacagcg aagtatcccc 3660 gacaggacgg gcttgggctt ccatttgtgt gtgtgcaagt gtgtatgtgc gtgtgtgcaa 3720 gtgtctgtgt gcaagtgtgt gcacatgtgt gcgtgtgcgt gcatgtgcac ttgcacgccc 3780 atgtgtgagt gtgtgcaagt atgtgagtgt gtccaagtgt gtgtgcgtgt gtccatgtgt 3840 gtgcaagtgt gtgcatgtgt gcgagtgtgt gcatgtgtgt gctcaggggc gtgtggctca 3900 cgtgtgtgac tcagatgtct ctggcgtgtg ggtaggtgac ggcagcgtag cctctccggc 3960 agaagggaac tgcctgggct cccttgtgcg tgggtgaagc cgctgctggg ttttcctccg 4020 ggagagggga cggtcaatcc tgtgggtgaa gacagaggga aacacagcag cttctctcca 4080 ctgaaagaag tgggacttcc cgtcgcctgc gagcctgcgg cctgctggag cctgcgcagc 4140 ttggatggag actccatgag aagccgtggg tggaaccagg aacctcctcc acaccagcgc 4200 tgatgcccaa taaagatgcc cactgaggaa tgatgaagct tcctttctgg attcatttat 4260 tatttcaatg tgactttaat tttttggatg gataagcttg tctatggtac aaaaatcaca 4320 aggcattcaa gtgtacagtg aaaagtctcc ctttccagat attcaagtca cctccttaaa 4380 ggtagtcaag attgtgtttt gaggtttcct tcagacagat tccaggcgat gtgcaagtgt 4440 atgcacgtgt gcacacacac cacacataca cacacacaag cttttttaca caaatggtag 4500 catactttat attggtctgt atcttgcttt ttttcaccaa tatttctcag acatcggttc 4560 atattaagac ataaattact ttttcattct tttataccgc tgcatagtat tccattgtgt 4620 gagtgtacca taatgtattt aaccagtctt cttttgatat actattttca ttctcttgtt 4680 attgcatcaa tgctgagtta ataaatcaaa tatatgtcat ttttgcatat atgtaaggat 4740 aa 4742 <210> 7 <211> 5188 <212> DNA <213> Homo sapiens <400> 7 aaaaccctgc aggagactgc gagccctgca gaactgctag ctgcggggga gagggcaggg 60 gtcgggcgcc tgtggcggag ccgggctggg gccagggcag ggaggctgac aagcggcggg 120 agaagccggc ggagggcggg atcgcgcctc ctgacatgtt gggggtatcc ctggccgggc 180 cgggccgggg ctaagagcgg cgctgcgggc cggggtcggg gtcgggtcgc ggtccgcccc 240 cgctgtccct ccgtcctgcc ctgtcgagga cgtgcgttcc gcactcggcc gcctccagag 300 ggagcgaggg aagcggctag aggatcgggg agaaggagca ttcgccggag gctggaggag 360 gctgacccgc gtccccgccc agcctgctcc tatgcggtac ttgaaggatg gcgaagaggt 420 cgcgcagtga ggatgaggat gatgaccttc agtatgccga tcatgattat gaagtaccac 480 aacaaaaagg actgaagaaa ctctggaaca gagtaaaatg gacaagggac gaggatgata 540 aattaaagaa gttggttgaa caacatggaa ctgatgattg gactctaatt gctagtcatc 600 ttcaaaatcg ctctgatttt cagtgccagc atcgatggca gaaagtttta aatcctgaat 660 tgataaaggg tccttggact aaagaagaag atcagagggt tattgaatta gttcagaaat 720 atgggccaaa aagatggtct ttaattgcaa aacatttaaa aggaagaata ggcaagcagt 780 gtagagaaag atggcataat catctgaatc ctgaggtaaa gaaatcttcc tggacagaag 840 aggaggacag gatcatctat gaagcacata agcggttggg aaatcgttgg gcagaaattg 900 ccaaactact tccaggaagg actgataatt ctatcaaaaa tcattggaat tctactatgc 960 gaagaaaagt ggaacaggag ggctatttac aagatggaat aaaatcagaa cgatcttcat 1020 ctaaacttca acacaaacct tgtgcagcta tggatcatat gcaaacccag aatcagtttt 1080 acatacctgt tcagatccct gggtatcagt atgtgtcacc tgaaggcaat tgtatagaac 1140 atgttcagcc tacttctgcc tttattcagc aacccttcat tgatgaagat cctgataagg 1200 aaaagaaaat aaaggaactt gagatgcttc ttatgtcagc tgagaatgaa gttagaagaa 1260 agcgaattcc atcacagcct ggaagttttt ctagctggtc tggtagtttc ctcatggatg 1320 ataacatgtc taatactcta aatagccttg acgagcacac tagtgagttt tacagtatgg 1380 atgaaaatca gcctgtgtct gctcagcaga attcacccac aaagttcctg gccgtggagg 1440 caaacgctgt gttatcctct ttgcagacca tcccagaatt tgcagagact ctagaactta 1500 ttgaatctga tcctgtagca tggagtgacg ttaccagttt tgatatttct gatgctgctg 1560 cttctcctat caaatccacc ccagttaaat taatgagaat tcagcacaat gaaggagcca 1620 tggaatgcca atttaacgtc agtcttgtac ttgaagggaa aaaaaacact tgtaatggtg 1680 gcaacagtga agctgttcct ttaacatccc caaatatagc caagtttagc actccaccag 1740 ccatcctcag aaagaagaga aaaatgcgag tgggtcattc cccaggcagc gaacttaggg 1800 atggctcatt gaacgatggt ggtaatatgg cgctaaaaca tacaccactg aaaacactac 1860 cattttctcc ttcacagttt ttcaacacat gtcctggtaa tgaacaactt aatatagaaa 1920 atccttcatt tacatcaacc cctatttgtg ggcagaaagc tctcattaca actcctcttc 1980 ataaggaaac aactcccaaa gatcaaaagg aaaatgtagg gtttagaaca cctactatta 2040 gaagatctat actgggtacc acaccaagaa ctcctactcc ttttaagaat gcgcttgctg 2100 ctcaggagaa aaaatatgga cctcttaaaa ttgtgtccca gccacttgct ttcttggaag 2160 aagatattcg ggaagtttta aaagaagaaa ctggaacaga cctattcctc aaagaggaag 2220 atgaacctgc ttacaaaagc tgcaaacaag agaataccgc ttctgggaag aaagtcagaa 2280 aatcactagt cttagataat tgggaaaaag aagaatcagg cactcaactg ttgactgaag 2340 acatttcaga catgcagtca gaaaatagat ttactacatc cttattaatg ataccattat 2400 tggaaataca tgacaatagg tgcaacttga ttcctgaaaa acaagatata aattcaacca 2460 acaaaacata tacacttact aaaaagaaac caaaccctaa cacttccaaa gttgtcaaat 2520 tggaaaagaa tgtcagtca aattgtgaat gggaaacagt ggtttatggg aagacagaag 2580 accaacttat tatgactgaa caagcaagaa gatatctgag tacttacaca gctaccagta 2640 gtacttcaag agctctcata ctgtaattgt tattaaaatt gatgaaatgc cccactccct 2700 tactgcagtc tctactaaat taggttgcag tgaaattttt ctcaattagt tgtttttaaa 2760 gttgtaagat agccctttta atacagcatc ttttttctat tctatatagt aggcagaaag 2820 ctagtaagtc acttaagggg tagatagttt catagtttat tttttaagag atgagatttt 2880 taaaaattgt ttttaaagaa caagatggga aaataataga atgttcatgg atttctaaaa 2940 gtaaattctc atatattttc ttcacaagat atatgttgct actctcttga tgctgcagtt 3000 ttgttataga taggtgtatg agtatatatg atttctgaaa ttagtctatg tatggaaagc 3060 acacatgatt ttatgaagta cttttgccca tgtgctgatt tacttaggct accatttaca 3120 aagaaacaca ttgaaaagga atttaaagga aggatagaaa gttgcactac taattttttg 3180 tttttttttt cagaagcagt aaaattaact acagtgttaa atgtatttat ttgagcatag 3240 tactgaaaac aaaaagcatt caaaaaagag ttttttcttt attagtaaat agtattttct 3300 taatctcaga ggagctgaga gttttgttga atgtattgta cagtatgtag gagcaggaga 3360 actttgtaaa ttggaaagaa gtctgttttt ataatttatt tttattttta aagcttaaat 3420 gtagatattt atacgtatac agggtgccta gaagccaatg ttgtttcctg ttattacagc 3480 taacacagta aagaataatt ttgactttaa gtatgaaaca gtagtaagtt atagctgcaa 3540 agaatacaat atctatactg tatgtcacat ctacctaaat gttgcactat gccctttaaa 3600 tcatgctggt tataaagtag ttctaaaaat gtactaaata ataatttaat attttctttt 3660 taaattatat cgggggtggt catatacatt aatctggtga tttgtatatg tgtttgaaat 3720 ttttgcattt tgtttaaaaa ataatatggt accttggtcc ctaaaaacag tctgcactta 3780 gt; attttgtcct tttttaaatg ttttgtgttt ctctacaggt tacaacagtt gcttcagttg 3900 cctgttttag gtgtttgcac ttattttatt tcttcttgaa agaattttta tttgcttttg 3960 tggtagagat tatatgtaat tttttttcag tcatataatg gtgtgctgtc aacttaaaca 4020 ctgacaggta aatagaattg tacactgtag tttgaattat ttataattga cacactctct 4080 ccctctccac tcctgaagta tgctgctata gaaaatagca gaatcggctt gctgctacga 4140 gagaaggaaa gagcgaccac cacttgcact gtgtgaaaag ataaaaaaca aatgatggca 4200 agttctcaag ttaactaaat ggaatcaacc attaccaggc aaattcttgc aaataccaaa 4260 atactactat gccttataaa acaaaatgaa agcaggttaa gattttctgc tctgtttgta 4320 tgttaataga aatggaaata ctaagtattt taatgcttag ctcttgaaca gtagacctaa 4380 aagggtttta agctatttaa atctacttgc tagtttttgc atattttata tatatatata 4440 tttatatata tatatagtga gaagtgaaga aaatgtatgg tactaagatt atgccttatt 4500 gataaataga taaaccaatt tgaatcctct tagcatgttt aagtatgttg attgctttct 4560 aattaatgaa cttctcacag aaatttcact tagtgaaacc aatgattgta gcaaactcat 4620 actggatcat ttcagttacc ttgaactaat agcacataat ggttttttgt tgttgttgtt 4680 tttaatgtag cccttacctg gatatacata gtctgcaatc accaaagtat aatatcttgt 4740 aaggctatat tttttaaagc atattttttc ttgagcatta aattatccta aatggtaata 4800 tattgtggat aagtctgggc ttattggaca taatacatat ttgggttggt actggttgaa 4860 tccttcagtt aactgctttg ttgctttttg caagattttt tatcttaaac atgtcaggca 4920 tcttaagtca cctttatact gttttgttcc tctgagtttc tttcagtatg ttatacaaat 4980 gccagacata acatgtagca gccatacttg catggaaact gactacacat acataatact 5040 gcattttatt gtaaggtttt cacattaata cagcaattac cctgactaaa ttgagttttg 5100 tgatatatgg aaaacttcat tgtaagagaa tcttgcatac aatgttgaca tattaacatc 5160 caaaataaag catctgtgta caagctga 5188 <210> 8 <211> 5008 <212> DNA <213> Homo sapiens <400> 8 aaaaccctgc aggagactgc gagccctgca gaactgctag ctgcggggga gagggcaggg 60 gtcgggcgcc tgtggcggag ccgggctggg gccagggcag ggaggctgac aagcggcggg 120 agaagccggc ggagggcggg atcgcgcctc ctgacatgtt gggggtatcc ctggccgggc 180 cgggccgggg ctaagagcgg cgctgcgggc cggggtcggg gtcgggtcgc ggtccgcccc 240 cgctgtccct ccgtcctgcc ctgtcgagga cgtgcgttcc gcactcggcc gcctccagag 300 ggagcgaggg aagcggctag aggatcgggg agaaggagca ttcgccggag gctggaggag 360 gctgacccgc gtccccgccc agcctgctcc tatgcggtac ttgaaggatg gcgaagaggt 420 cgcgcagtga ggatgaggat gatgaccttc agtatgccga tcatgattat gaagtaccac 480 aacaaaaagg actgaagaaa ctctggaaca gagtaaaatg gacaagggac gaggatgata 540 aattaaagaa gttggttgaa caacatggaa ctgatgattg gactctaatt gctagtcatc 600 ttcaaaatcg ctctgatttt cagtgccagc atcgatggca gaaagtttta aatcctgaat 660 tgataaaggg tccttggact aaagaagaag atcagagggt tattgaatta gttcagaaat 720 atgggccaaa aagatggtct ttaattgcaa aacatttaaa aggaagaata ggcaagcagt 780 gtagagaaag atggcataat catctgaatc ctgaggtaaa gaaatcttcc tggacagaag 840 aggaggacag gatcatctat gaagcacata agcggttggg aaatcgttgg gcagaaattg 900 ccaaactact tccaggaagg actgataatt ctatcaaaaa tcattggaat tctactatgc 960 gaagaaaagt ggaacaggag ggctatttac aagatggaat aaaatcagaa cgatcttcat 1020 ctaaacttca acacaaacct tgtgcagcta tggatcatat gcaaacccag aatcagtttt 1080 acatacctgt tcagatccct gggtatcagt atgtgtcacc tgaaggcaat tgtatagaac 1140 atgttcagcc tacttctgcc tttattcagc aacccttcat tgatgaagat cctgataagg 1200 aaaagaaaat aaaggaactt gagatgcttc ttatgtcagc tgagaatgaa gttagaagaa 1260 agcgaattcc atcacagcct ggaagttttt ctagctggtc tggtagtttc ctcatggatg 1320 ataacatgtc taatactcta aatagccttg acgagcacac tagtgagttt tacagtatgg 1380 atgaaaatca gcctgtgtct gctcagcaga attcacccac aaagttcctg gccgtggagg 1440 caaacgctgt gttatcctct ttgcagacca tcccagaatt tgcagagact ctagaactta 1500 ttgaatctga tcctgtagca tggagtgacg ttaccagttt tgatatttct gatgctgctg 1560 cttctcctat caaatccacc ccagttaaat taatgagaat tcagcacaat gaaggagcca 1620 tggaatgcca atttaacgtc agtcttgtac ttgaagggaa aaaaaacact tgtaatggtg 1680 gcaacagtga agctgttcct ttaacatccc caaatatagc caagtttagc actccaccag 1740 ccatcctcag aaagaagaga aaaatgcgag tgggtcattc cccaggcagc gaacttaggg 1800 atggctcatt gaacgatggt ggtaatatgg cgctaaaaca tacaccactg aaaacactac 1860 cattttctcc ttcacagttt ttcaacacat gtcctggtaa tgaacaactt aatatagaaa 1920 atccttcatt tacatcaacc cctatttgtg ggcagaaagc tctcattaca actcctcttc 1980 ataaggaaac aactcccaaa gatcaaaagg aaaatgtagg gtttagaaca cctactatta 2040 gaagatctat actgggtacc acaccaagaa ctcctactcc ttttaagaat gcgcttgctg 2100 ctcaggagaa aaaatatgga cctcttaaaa ttgtgtccca gccacttgct ttcttggaag 2160 aagatattcg ggaagtttta aaagaagaaa ctggaacaga cctattcctc aaagaggaag 2220 atgaacctgc ttacaaaagc tgcaaacaag agaataccgc ttctgggaag aaagtcagaa 2280 aatcactagt cttagataat tgggaaaaag aagaatcagg cactcaactg ttgactgaag 2340 acatttcaga catgcagtca aattgtgaat gggaaacagt ggtttatggg aagacagaag 2400 accaacttat tatgactgaa caagcaagaa gatatctgag tacttacaca gctaccagta 2460 gtacttcaag agctctcata ctgtaattgt tattaaaatt gatgaaatgc cccactccct 2520 tactgcagtc tctactaaat taggttgcag tgaaattttt ctcaattagt tgtttttaaa 2580 gttgtaagat agccctttta atacagcatc ttttttctat tctatatagt aggcagaaag 2640 ctagtaagtc acttaagggg tagatagttt catagtttat tttttaagag atgagatttt 2700 taaaaattgt ttttaaagaa caagatggga aaataataga atgttcatgg atttctaaaa 2760 gtaaattctc atatattttc ttcacaagat atatgttgct actctcttga tgctgcagtt 2820 ttgttataga taggtgtatg agtatatatg atttctgaaa ttagtctatg tatggaaagc 2880 acacatgatt ttatgaagta cttttgccca tgtgctgatt tacttaggct accatttaca 2940 aagaaacaca ttgaaaagga atttaaagga aggatagaaa gttgcactac taattttttg 3000 tttttttttt cagaagcagt aaaattaact acagtgttaa atgtatttat ttgagcatag 3060 tactgaaaac aaaaagcatt caaaaaagag ttttttcttt attagtaaat agtattttct 3120 taatctcaga ggagctgaga gttttgttga atgtattgta cagtatgtag gagcaggaga 3180 actttgtaaa ttggaaagaa gtctgttttt ataatttatt tttattttta aagcttaaat 3240 gtagatattt atacgtatac agggtgccta gaagccaatg ttgtttcctg ttattacagc 3300 taacacagta aagaataatt ttgactttaa gtatgaaaca gtagtaagtt atagctgcaa 3360 agaatacaat atctatactg tatgtcacat ctacctaaat gttgcactat gccctttaaa 3420 tcatgctggt tataaagtag ttctaaaaat gtactaaata ataatttaat attttctttt 3480 taaattatat cgggggtggt catatacatt aatctggtga tttgtatatg tgtttgaaat 3540 ttttgcattt tgtttaaaaa ataatatggt accttggtcc ctaaaaacag tctgcactta 3600 gaagtttata tttactcagt gtttcagaag tggagaacat tatcttttat ttataaaaat 3660 attttgtcct tttttaaatg ttttgtgttt ctctacaggt tacaacagtt gcttcagttg 3720 cctgttttag gtgtttgcac ttattttatt tcttcttgaa agaattttta tttgcttttg 3780 tggtagagat tatatgtaat tttttttcag tcatataatg gtgtgctgtc aacttaaaca 3840 ctgacaggta aatagaattg tacactgtag tttgaattat ttataattga cacactctct 3900 ccctctccac tcctgaagta tgctgctata gaaaatagca gaatcggctt gctgctacga 3960 gagaaggaaa gagcgaccac cacttgcact gtgtgaaaag ataaaaaaca aatgatggca 4020 agttctcaag ttaactaaat ggaatcaacc attaccaggc aaattcttgc aaataccaaa 4080 atactactat gccttataaa acaaaatgaa agcaggttaa gattttctgc tctgtttgta 4140 tgttaataga aatggaaata ctaagtattt taatgcttag ctcttgaaca gtagacctaa 4200 aagggtttta agctatttaa atctacttgc tagtttttgc atattttata tatatatata 4260 tttatatata tatatagtga gaagtgaaga aaatgtatgg tactaagatt atgccttatt 4320 gataaataga taaaccaatt tgaatcctct tagcatgttt aagtatgttg attgctttct 4380 aattaatgaa cttctcacag aaatttcact tagtgaaacc aatgattgta gcaaactcat 4440 actggatcat ttcagttacc ttgaactaat agcacataat ggttttttgt tgttgttgtt 4500 tttaatgtag cccttacctg gatatacata gtctgcaatc accaaagtat aatatcttgt 4560 aaggctatat tttttaaagc atattttttc ttgagcatta aattatccta aatggtaata 4620 tattgtggat aagtctgggc ttattggaca taatacatat ttgggttggt actggttgaa 4680 tccttcagtt aactgctttg ttgctttttg caagattttt tatcttaaac atgtcaggca 4740 tcttaagtca cctttatact gttttgttcc tctgagtttc tttcagtatg ttatacaaat 4800 gccagacata acatgtagca gccatacttg catggaaact gactacacat acataatact 4860 gcattttatt gtaaggtttt cacattaata cagcaattac cctgactaaa ttgagttttg 4920 tgatatatgg aaaacttcat tgtaagagaa tcttgcatac aatgttgaca tattaacatc 4980 caaaataaag catctgtgta caagctga 5008 <210> 9 <211> 2632 <212> DNA <213> Homo sapiens <400> 9 agctgaggga cgcgtcagcc aggcaccccg gggtgtggcc agaggacttc ggcgacgctt 60 ccccgagagt agcccccctc ctcaacccag aaaagacaac cccgcggggc tgcagcgagc 120 caggcatgct cactggcgca ggcccggccc gcagcccgag caggaagcgc cggcgctagg 180 cggccccctg cgctgccagc tggagccggg cggagccagc gccccggcgc agggtggctc 240 tgccagtccc cgcgcgcctg ggcggccgca cacgtgtcca ggcgtcacgt ccgcgcgcgc 300 ccccggggct tgcgtcagcg gctgttccag aagcgggtgg gccagggctc tgcgcaccgc 360 tggggttcgg ggcccgggac gccgccggga ggagggcacc gcgcggggtc cgacgcggag 420 gcgtgctcgg aacgccgggg gctgcggagt gcatcagcgc ggtccagccc tccgcctgcc 480 gggcgccgag cgtctccgcc gcccggacct gggctgggcg ccgtggcgtt gcctcggagc 540 tcgctgcccg cggggcgcgc accgccttga cccgggcggc cccgcggcag gcaggcgccc 600 gcagttccat ggttggttcg gagcgcgatg agccgcccgt cctccaccgg ccccagcgct 660 aataaaccct gcagcaagca gccgccgccg cagccccagc acactccgtc cccggctgcg 720 cccccggccg ccgccaccat ctcggctgcg ggccccggct cgtccgcggt gcccgccgcg 780 gcggcggtga tctcgggccc cggcggcggc ggcggggccg gcccggtgtc cccgcagcac 840 cacgagctga cctcgctctt cgagtgtccg gtctgctttg actatgtcct gcctcctatt 900 ctgcagtgcc aggccgggca cctggtgtgt aaccaatgcc gccagaagtt gagctgctgc 960 ccgacgtgca ggggcgccct gacgcccagc atcaggaacc tggctatgga gaaggtggcc 1020 tcggcagtcc tgtttccctg taagtatgcc accacgggct gttccctgac cctgcaccat 1080 acggagaaac cagaacatga agacatatgt gaataccgtc cctactcctg cccatgtcct 1140 ggtgcttcct gcaagtggca ggggtccctg gaagctgtga tgtcccatct catgcacgcc 1200 cacaagagca ttaccaccct tcagggagaa gacatcgtct ttctagctac agacattaac 1260 ttgccagggg ctgtcgactg ggtgatgatg cagtcatgtt ttggccatca cttcatgctg 1320 gtgctggaga aacaagagaa gtacgaaggc caccagcagt tttttgccat cgtcctgctc 1380 attggcaccc gcaagcaagc cgagaacttt gcctacagac tggagttgaa tgggaaccgg 1440 cggagattga cctgggaggc cacgccccgt tcgattcatg acggtgtggc tgcggccatc 1500 atgaacagcg actgccttgt tttcgacaca gccatagcac atctttttgc agataatggg 1560 aaccttggaa tcaatgttac tatttctaca tgttgtccat gatgtgactt tcgtaaacct 1620 tcaaaattat ttgggcatag tgctctatgt ttaataaagg tttttataga tgttttattc 1680 catatgtctt cacaagtcag gacccacaat tacccgtgtt ttgtttgaac agcagtgtcc 1740 catctggctt cgacccaaca aagttcatta acctgggatg aatggggttg gcctgttggt 1800 gatttggatg ctgttctgtg atctaaaaca actcttattg aattgtattt actccctaaa 1860 caacacttga caggctgttg cacagggctt ctatagatca gtgtgttagg aatgggaggc 1920 cccttcctgc ctgccttccc atattggtcc cttgacattg acaaaagcac agtgactgtc 1980 agcagattcc tttacttttg tttgtgggag gtaggaattg ttttaatgca ttttaaacag 2040 tgtttctgaa attggatggc tggctaatag acactgaatc acccggagtg cttatcttaa 2100 aattgcagat ttagggagcc tgccaattta acagtctcat caggtgattc ttttcaacag 2160 taatgtttga gaattactgg gttaaattgt gggaaagggt ccagatttta aaggtgcttt 2220 aaggttgccc tctgccgata ctgtttgtct ttctactgtt tcatccccta acttccccca 2280 accctcaaat taaaactaga actatagatc cacatgaacg cacgcctgag atttggccac 2340 tcacctatgt tttgggtgga ttgcctagga aagcaagtca tatggccatt gatagttctc 2400 atgtaattag ttttgctcac cactagtaca gatgacccgt ttacacgtgg cttccctcgg 2460 aagccctcct caacagtagc tggtgtgaaa gactaaatca gtagagttgg aaaagcttta 2520 gt; attcactagt tttttaataa atattttcca aaaatggata aaaaaaaaaa aa 2632 <210> 10 <211> 894 <212> DNA <213> Homo sapiens <400> 10 gcccgtcttc gtgtctcctc cctccctcgc cttcctcctt cctagctcct ctcctccagg 60 gccagactga gcccaggttg atttcaggcg gacaccaata gactccacag cagctccagg 120 agcccagaca ccggcggcca gaagcaaggc taggagctgc tgcagccatg tcggccctca 180 gcctcctcat tctgggcctg ctcacggcag tgccacctgc cagctgtcag caaggcctgg 240 ggaaccttca gccctggatg cagggcctta tcgcggtggc cgtgttcctg gtcctcgttg 300 caatcgcctt tgcagtcaac cacttctggt gccaggagga gccggagcct gcacacatga 360 tcctgaccgt cggaaacaag gcagatggag tcctggtggg aacagatgga aggtactctt 420 cgatggcggc cagtttcagg tccagtgagc atgagaatgc ctatgagaat gtgcccgagg 480 aggaaggcaa ggtccgcagc accccgatgt aaccttctct gtggctccaa ccccaagact 540 cccaggcaca tgggatggat gtccagtgct accacccaag ccccctcctt ctttgtgtgg 600 aatctgcaat agtgggctga ctccctccag ccccatgccg gccctacccg cccttgaagt 660 atagccagcc aaggttggag ctcagaccgt gtctaggttg gggctcggct gtggccctgg 720 ggtctcctgc tcagctcaga agagccttct ggagaggaca gtcagctgag cacctcccat 780 cctgctcaca cgtccttccc cataactatg gaaatggccc taatttctgt gaaataaaga 840 ctttttgtat ttctggggct gaggctcagc aacagcccct caggcttcca gtga 894 <210> 11 <211> 5003 <212> DNA <213> Homo sapiens <400> 11 agttgggctc ccgcctggct gggaggcggg agggatcccg ctcctgttgt tttccgccgg 60 caggagtagg ctggcgggcg cagggggcgg ggtgcgccct ccctccccgg ccagggcgct 120 cgggagcggg gacccgagcc tgcagccgag ctccgctgcc ggccctggac actcggctca 180 gccaagcatc cttcctgggg gccgaggaag tggggccact ctgccgttcc gaggacctgg 240 gaggagccct cggtaccccg ggccccgggg ccctggggca cacacgtcca gcccagcccg 300 agcctgcgtt tcctgagccg ggatctgggg cgagatggcc gcaggcggca gtgcgcccga 360 gccccgcgtc ctcgtctgcc tcggggcgct cctggccggc tgggtcgccg taggattgga 420 ggctgttgtc attggagaag ttcatgagaa tgttactctg cactgtggca acatctcggg 480 actgaggggc caggtgacct ggtaccggaa caactcggag cctgtcttcc ttctctcgtc 540 caactctagc ctccggccag ctgagcctcg cttctctcta gtggatgcca cctccctgca 600 cattgaatcg ctgagcctgg gagatgaggg aatctacacc tgccaggaga tcctgaatgt 660 gactcagtgg ttccaagtgt ggctgcaggt ggccagcggc ccctatcaga ttgaggtcca 720 catcgtggcc accggcacac tccccaacgg caccctctac gcagccaggg gctcccaggt 780 ggacttcagc tgcaacagca gctccaggcc accacccgtg gttgaatggt ggttccaggc 840 cctgaattcc agcagcgagt cctttggcca caacctgaca gtcaactttt tctcactgtt 900 actgatatcg ccaaacctcc aagggaacta cacctgttta gccttgaatc agctcagcaa 960 gagacatcga aaggtgacca ccgagctcct ggtctactat ccccctccat cagctcccca 1020 gtgctgggca cagatggcat caggatcgtt catgttgcag cttacctgtc gctgggatgg 1080 gggataccct gaccctgact tcctgtggat agaagagcca ggaggtgtaa tcgtggggaa 1140 gtcaaagctg ggggtggaaa tgctgagcga gtcccagctg tcggatggca agaagttcaa 1200 gtgtgttaca agccacatag ttgggccaga gtcgggcgcc agctgcatgg tgcagatcag 1260 gggtccctcc cttctctctg agcccatgaa gacttgcttc actgggggca atgtgacgct 1320 tacatgccag gtgtctgggg cctacccccc tgccaagatc ctgtggctga ggaaccttac 1380 ccagcccgag gtgatcatcc agcctagcag ccgccatctc attacccagg atggccagaa 1440 ctccaccctc actatccaca actgctccca ggacctggat gagggctact acatctgccg 1500 agctgacagc cctgtagggg tgagggagat ggaaatctgg ctgagtgtga aagaaccttt 1560 aaatatcggg gggattgtgg gaaccattgt gagcctcctt ctgctgggac tggccattat 1620 ctcagggctt ctgttgcatt atagccctgt gttctgctgg aaagtaggaa acacttccag 1680 gggacaaaac atggatgatg tcatggtttt ggtggattca gaagaggaag aggaggagga 1740 ggaggaggag gaggaagatg ctgcagtagg ggaacaggag ggagcacgtg agagagagga 1800 gttgccaaaa gaaataccta agcaggacca cattcacaga gtgaccgcct tggtgaatgg 1860 gaacatagaa cagatgggaa atggattcca ggatcttcaa gatgacagca gtgaggagca 1920 aagtgacatt gttcaagaag aagacaggcc agtctgaaga agaggatggt ccatggttgt 1980 cttgctctga aagcttggag agctacattg aagacgagct cttcattcag ctttgactcc 2040 acctgcaccc ctggcggggg cttgcactaa caatgtttgg gtctcagcaa aaaacaaaac 2100 caagcacaca catctttcct tccatgtatt gaaaaacatt ggtttgattt gctctaagtt 2160 ttcccaatga tgtttaaaag ctttgagaag gaaagctgct ttggtgtctg aggtgccact 2220 tctgctgtga atcctggctt tatccaggtt gatctactgt gatagatgct gatttagagg 2280 gaacagaggt cagggaagca ctgggtcttg gtgccttttg ccgctttttt tttttttttt 2340 tttttttttt gagacggagt ctccctctat tgcccaggct ggagtgcaat ggcacgatct 2400 tggctcacca caacttctgc ctcccaggtt caagcgattc tcctgcctca gccacggcac 2460 cttgcaaata tcagctcctt ggaacaggtg aagttccagg taccaatgcc aatcagagga 2520 aggcagtttg gttcaggctt tggagttaga aacacctgaa gttgaatctg ggctctgttg 2580 cttccttctt tcatgggcta gagcacgact ctttacctct ctcttggcct caatttcctc 2640 acctgtaaaa tagatgagga agctgctcac ttattattgt ctcgttctga aagcttggaa 2700 agctacatcg aagatgagct cttcattcag ctttgatttg acctgtaccc ctggtggggg 2760 attgcactgg caacatttgg gtctcagcaa aaaaaccaag cacacacatc tttccttcca 2820 tttattgaaa aacatctttg taagatccat tcattgaaaa acataatcca tttattgaaa 2880 aatatctttg taagatcacc tgctaaatat gaaaatctga cttgaatttg tactctttaa 2940 agttgcgtat ctgctctagt gggcaggacc tagggcttaa aggggaactt cctttctcca 3000 tttctaagaa ctgggactct aaaatgagaa gctggttgtc tgaagtaacc ctgcaggtgt 3060 ggttggggaa ggtctgtttt cttggatgaa ggaactaaac taagcatatc agagcactgt 3120 cttaaccagt tttatttccc tggagataga attcttttaa aaagagttag ggagctggta 3180 ataggaagtg cctttcatta taactacatt ttgcagagct tcatatttat atacaagcct 3240 cctaggtgat acactgttag cttgcagact ttcctatgct tcatttctcc tgttgctttc 3300 aaagaaggca ggagacacgt ttaataacgg agtatctggt gataagaatt gcttgggcaa 3360 accagctcat ctggactctt tctcagtctt ggaagtggga agaggaaaac ttgtttcctt 3420 cctgcttctt aaggatattc tgagggtaca ctgatcaata acactaaatt tggaatgaaa 3480 ataccatgtg atgagtttag cctgctgatg cttccagtag atccttgtat agtttcaaga 3540 tttaagtttt ccgatttcat ataaatttct taaagtcgag gaccttataa gggtgcaatg 3600 gatgtttgct aaatatgaaa aactgacttg aatttgcact ctttaatgtt gcgtatctgc 3660 tctagtgggc aggacctagg gcttaaaggg gaacttcctt tctccaattc taggaactgg 3720 gactctaaaa tgagaagctg gttgtctgaa gtaaccctgc aggtgtggtt ggggaaggtc 3780 tgttttcttg gatgaaggga ttaaactgag caaatcacta gaagtatgcc ctgtcccctg 3840 ctcagaacac tggggagctc aagagtgggc tgcaatgtgc acccctcagg aatagctgtg 3900 aattgcaggt ctactggctt tttgcttttt gtcttttgct gcaaggtacc ccacgtactt 3960 aaccattctc aacagtgtaa atcagtgtca ttttagaatg agatactcag cttgcttcta 4020 aagtcactga attactgagt gagtctctcc tttagagtct tcggcaacca aattccagaa 4080 ttgaagagtc tactactcag aggcaacaag attaaaaaaa gaaaacacaa aaactgttga 4140 ggtgaaaaaa aaaaaaaacc ctagctagga acacagagaa tgttttgtag gatcactggg 4200 atattttcca caacttcctc ttctctagca cacacatctg ttgataggaa atatttgagg 4260 gtttttccac taccaaatgg gagcttcatg gtcctggtgt caaacactat aaacctttga 4320 ccagctgagc tgtgactgct gtcacatatc tgagtcctgt gtgcacagta atatcctggg 4380 tcaggtaaaa tccaggtctt caagttttaa ggattttttg aagaattcgg gcttctttaa 4440 gacgatccat gcccaaatcc acaagcttgt tgacagtggga ttacagtttg tgtggcaaag 4500 tccaagttgt tacactgtgc tttaaaaaaa atcttatctg catgtattgt taacttagag 4560 accatgagat ctatttatca ggaccaggaa gatacacact tcaggtccat tgcaactgac 4620 ttttttcttg tttttctcaa accctggtgg agcctgggaa gggggcctcc acaattctgt 4680 ggctttgata ttagccccaa ttctcacaag cacatacaag ccccataatt gccgcaggaa 4740 aacacaagat ggaaaattgc aataacccat gcactgagac ttagaaaatc atccttacta 4800 ggcaaaatgt attatgatgc aataagtgcc aactgatatt tctcacgttg ggactggcca 4860 ggaactgctg caaagaaaaa taagcagctc cttctccatt atttacattt taagatgtgg 4920 tggggggagg ttgggagaaa ttagttctga ggttatcata tgcctttttt aaaagataat 4980 ggaataaagc tatttttaag taa 5003

Claims (25)

하기 단계들을 포함하는, 개체가 결장직장암을 앓을 확률을 말초혈 샘플에서 시험관내 측정하는 방법:
a) 말초혈 샘플에서, 서열번호 5 또는 6에 도시된 ITGAM 유전자 및 서열번호 11에 도시된 VSIG10 유전자의 발현 생성물의 양을 측정하는 단계;
b) 단계 a)에서 측정된 상기 발현 생성물의 양을, 결장직장암 환자로서 미리 진단된 개체의 군에 대한 발현 생성물의 기준 양 및 비-결장직장암 개체로서 미리 검증된 개체의 군에 대한 발현 생성물의 기준 양과 비교하는 단계; 및
c) 단계 b)의 결과의 분석을 수행하는 단계로서,
- 시험된 개체에 대한 결과가 결장직장암 환자로서 미리 진단된 개체의 군으로부터 수득된 결과와 유사하거나 동등한 경우, 상기 시험된 개체가 결장직장암 환자로서 분류되고,
- 시험된 개체에 대한 결과가 비-결장직장암 개체로서 미리 검증된 개체의 군으로부터 수득된 결과와 유사하거나 동등한 경우, 상기 시험된 개체가 비-결장직장암 개체로서 분류되는, 단계.
CLAIMS What is claimed is: 1. A method for in vitro measurement of the probability of an individual having a colorectal cancer in a peripheral blood sample, comprising the steps of:
a) measuring the amount of the expression product of the ITGAM gene shown in SEQ ID NO: 5 or 6 and the VSIG10 gene shown in SEQ ID NO: 11 in the peripheral blood sample;
b) comparing the amount of the expression product measured in step a) with a reference amount of expression product for a group of individuals previously diagnosed as a colorectal cancer patient and a reference amount of expression product for a group of individuals previously verified as non- Comparing with a reference quantity; And
c) performing an analysis of the result of step b)
- if the result on the tested individual is similar or equivalent to the results obtained from a group of individuals previously diagnosed as a colorectal cancer patient, the tested individual is classified as a colorectal cancer patient,
- the tested individual is classified as a non-colorectal cancer entity if the results on the tested individual are similar or equivalent to those obtained from a group of individuals previously validated as non-colon carcinoma individuals.
제1항에 있어서,
상기 단계 a)가, 말초혈 샘플에서, 서열번호 1에 도시된 NEAT1 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 서열번호 7 또는 8에 도시된 MYBL1 유전자, 서열번호 9에 도시된 SIAH2 유전자, 및 서열번호 10에 도시된 PDZK1IP1 유전자로부터 선택된 하나 이상의 유전자의 발현 생성물의 양을 측정하는 것을 추가로 포함하는, 방법.
The method according to claim 1,
The step a) comprises, in the peripheral blood sample, the NEAT1 gene shown in SEQ ID NO: 1, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, the MYBL1 gene shown in SEQ ID NO: 7 or 8, The SIAH2 gene, and the PDZK1IP1 gene shown in SEQ ID NO: 10.
제1항 또는 제2항에 있어서,
단계 a)에서 발현 생성물의 양이 상기 발현 생성물을 이 발현 생성물에 대해 특이적인 1종 이상의 결합 파트너와 접촉시킴으로써 측정되는, 방법.
3. The method according to claim 1 or 2,
Wherein the amount of expression product in step a) is determined by contacting said expression product with one or more binding partners specific for this expression product.
제2항에 있어서,
단계 a)에서, 말초혈 샘플로부터, 서열번호 5 또는 6에 도시된 ITGAM 유전자, 서열번호 11에 도시된 VSIG10 유전자, 및 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자의 발현 생성물의 양이 측정되는, 방법.
3. The method of claim 2,
In step a), from the peripheral blood sample, the amount of the expression product of the ITGAM gene shown in SEQ ID NO: 5 or 6, the VSIG10 gene shown in SEQ ID NO: 11, and the FAM198B gene shown in SEQ ID NO: 2, How.
제2항에 있어서,
단계 a)에서, 말초혈 샘플로부터, 서열번호 5 또는 6에 도시된 ITGAM 유전자, 서열번호 11에 도시된 VSIG10 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 및 서열번호 1에 도시된 NEAT1 유전자의 발현 생성물의 양이 측정되는, 방법.
3. The method of claim 2,
In step a), from the peripheral blood sample, the ITGAM gene shown in SEQ ID NO: 5 or 6, the VSIG10 gene shown in SEQ ID NO: 11, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, Wherein the amount of the expression product of the NEAT1 gene is measured.
제2항에 있어서,
단계 a)에서, 말초혈 샘플로부터, 서열번호 5 또는 6에 도시된 ITGAM 유전자, 서열번호 11에 도시된 VSIG10 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 서열번호 1에 도시된 NEAT1 유전자, 및 서열번호 7 또는 8에 도시된 MYBL1 유전자의 발현 생성물의 양이 측정되는, 방법.
3. The method of claim 2,
In step a), from the peripheral blood sample, the ITGAM gene shown in SEQ ID NO: 5 or 6, the VSIG10 gene shown in SEQ ID NO: 11, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, NEAT1 shown in SEQ ID NO: And the amount of the expression product of the MYBL1 gene shown in SEQ ID NO: 7 or 8 is measured.
제2항에 있어서,
단계 a)에서, 말초혈 샘플로부터, 서열번호 5 또는 6에 도시된 ITGAM 유전자, 서열번호 11에 도시된 VSIG10 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 서열번호 1에 도시된 NEAT1 유전자, 서열번호 7 또는 8에 도시된 MYBL1 유전자, 및 서열번호 9에 도시된 SIAH2 유전자의 발현 생성물의 양이 측정되는, 방법.
3. The method of claim 2,
In step a), from the peripheral blood sample, the ITGAM gene shown in SEQ ID NO: 5 or 6, the VSIG10 gene shown in SEQ ID NO: 11, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, NEAT1 shown in SEQ ID NO: The amount of the expression product of the gene, the MYBL1 gene shown in SEQ ID NO: 7 or 8, and the SIAH2 gene shown in SEQ ID NO: 9 is measured.
제2항에 있어서,
단계 a)에서, 말초혈 샘플로부터, 서열번호 5 또는 6에 도시된 ITGAM 유전자, 서열번호 11에 도시된 VSIG10 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 서열번호 1에 도시된 NEAT1 유전자, 서열번호 7 또는 8에 도시된 MYBL1 유전자, 서열번호 9에 도시된 SIAH2 유전자, 및 서열번호 10에 도시된 PDZK1IP1 유전자의 발현 생성물의 양이 측정되는, 방법.
3. The method of claim 2,
In step a), from the peripheral blood sample, the ITGAM gene shown in SEQ ID NO: 5 or 6, the VSIG10 gene shown in SEQ ID NO: 11, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, NEAT1 shown in SEQ ID NO: The amount of the expression product of the gene, the MYBL1 gene shown in SEQ ID NO: 7 or 8, the SIAH2 gene shown in SEQ ID NO: 9, and the PDZK1IP1 gene shown in SEQ ID NO: 10 is measured.
제1항 또는 제2항에 있어서,
발현 생성물이 1종 이상의 RNA 전사체 또는 1종 이상의 폴리펩티드인, 방법.
3. The method according to claim 1 or 2,
Wherein the expression product is at least one RNA transcript or at least one polypeptide.
제9항에 있어서,
발현 생성물이 1종 이상의 mRNA인, 방법.
10. The method of claim 9,
Wherein the expression product is at least one mRNA.
제10항에 있어서,
RNA 전사체가 혼성화, 증폭 또는 서열분석에 의해 검출되고 정량되는, 방법.
11. The method of claim 10,
Wherein the RNA transcript is detected and quantified by hybridization, amplification or sequencing.
제9항에 있어서,
RNA 전사체가 1종 이상의 프로브 또는 1종 이상의 프라이머와 상기 RNA 전사체의 혼성화를 가능하게 하는 예정된 조건 하에서 상기 1종 이상의 프로브 또는 1종 이상의 프라이머와 접촉되는, 방법.
10. The method of claim 9,
Wherein the RNA transcript is contacted with said one or more probes or one or more primers under predetermined conditions to enable hybridization of said RNA transcript with at least one probe or at least one primer.
제9항에 있어서,
RNA 전사체가 1종 이상의 프로브 및 1종 이상의 프라이머와 상기 RNA 전사체의 혼성화를 가능하게 하는 예정된 조건 하에서 상기 1종 이상의 프로브 및 1종 이상의 프라이머와 접촉되는, 방법.
10. The method of claim 9,
Wherein the RNA transcript is contacted with said one or more probes and one or more primers under predetermined conditions to enable hybridization of said RNA transcript with at least one probe and at least one primer.
제11항에 있어서,
RNA 전사체의 DNA 복사체가 제조되고, 상기 DNA 복사체가 1종 이상의 프로브 또는 1종 이상의 프라이머와 상기 DNA 복사체의 혼성화를 가능하게 하는 예정된 조건 하에서 상기 1종 이상의 프로브 또는 1종 이상의 프라이머와 접촉되는, 방법.
12. The method of claim 11,
Wherein a DNA copy of an RNA transcript is produced and said DNA copy is contacted with said one or more probes or one or more primers under predetermined conditions to enable hybridization of said DNA copy with one or more probes or one or more primers, Way.
제11항에 있어서,
RNA 전사체의 DNA 복사체가 제조되고, 상기 DNA 복사체가 1종 이상의 프로브 및 1종 이상의 프라이머와 상기 DNA 복사체의 혼성화를 가능하게 하는 예정된 조건 하에서 상기 1종 이상의 프로브 및 1종 이상의 프라이머와 접촉되는, 방법.
12. The method of claim 11,
Wherein a DNA copy of an RNA transcript is produced and said DNA copy is contacted with said one or more probes and one or more primers under predetermined conditions to enable hybridization of said DNA copy with one or more probes and one or more primers, Way.
제9항에 있어서,
발현된 폴리펩티드가 항체 및 친화성 단백질로 이루어진 군으로부터 선택된 1종 이상의 특이적 리간드와의 접촉에 의해 검출되는, 방법.
10. The method of claim 9,
Wherein the expressed polypeptide is detected by contact with at least one specific ligand selected from the group consisting of an antibody and an affinity protein.
제16항에 있어서,
발현된 폴리펩티드가 2종의 항체, 2종의 친화성 단백질, 또는 1종의 항체 및 1종의 친화성 단백질로 이루어진 군으로부터 선택된 2종 이상의 특이적 리간드와 접촉되는, 방법.
17. The method of claim 16,
Wherein the expressed polypeptide is contacted with two or more specific ligands selected from the group consisting of two antibodies, two affinity proteins, or one antibody and one affinity protein.
서열번호 5 또는 6에 도시된 ITGAM 유전자 및 서열번호 11에 도시된 VSIG10 유전자의 발현 생성물에 특이적인 결합 파트너를 포함하는, 개체가 결장직장암을 앓을 확률을 시험관내 측정하기 위한 키트.A kit for in vitro measurement of the probability of an individual having colorectal cancer, comprising an ITGAM gene shown in SEQ ID NO: 5 or 6 and a binding partner specific for the expression product of the VSIG10 gene shown in SEQ ID NO: 11. 제18항에 있어서,
서열번호 1에 도시된 NEAT1 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 서열번호 7 또는 8에 도시된 MYBL1 유전자, 서열번호 9에 도시된 SIAH2 유전자, 및 서열번호 10에 도시된 PDZK1IP1 유전자로부터 선택된 하나 이상의 유전자의 발현 생성물에 대해 특이적인 결합 파트너를 추가로 포함하는 키트.
19. The method of claim 18,
The NEAT1 gene shown in SEQ ID NO: 1, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, the MYBL1 gene shown in SEQ ID NO: 7 or 8, the SIAH2 gene shown in SEQ ID NO: 9, and PDZK1IP1 shown in SEQ ID NO: Further comprising a binding partner specific for an expression product of one or more genes selected from the genes.
제19항에 있어서,
서열번호 1에 도시된 NEAT1 유전자, 서열번호 2, 3 또는 4에 도시된 FAM198B 유전자, 서열번호 7 또는 8에 도시된 MYBL1 유전자, 서열번호 9에 도시된 SIAH2 유전자, 서열번호 10에 도시된 PDZK1IP1 유전자, 및 서열번호 11에 도시된 VSIG10 유전자의 발현 생성물에 특이적인 7종의 결합 파트너의 조합물을 포함하는 키트.
20. The method of claim 19,
The NEAT1 gene shown in SEQ ID NO: 1, the FAM198B gene shown in SEQ ID NO: 2, 3 or 4, the MYBL1 gene shown in SEQ ID NO: 7 or 8, the SIAH2 gene shown in SEQ ID NO: 9, the PDZK1IP1 gene shown in SEQ ID NO: , And a combination of seven binding partners specific for the expression product of the VSIG10 gene shown in SEQ ID NO: 11.
제18항 또는 제19항에 있어서,
1종 이상의 특이적 결합 파트너가 1종 이상의 혼성화 프로브를 포함하는, 키트.
20. The method according to claim 18 or 19,
Wherein the at least one specific binding partner comprises at least one hybridization probe.
제21항에 있어서,
특이적 결합 파트너가 1종 이상의 혼성화 프로브 및 1종 이상의 프라이머를 포함하는, 키트.
22. The method of claim 21,
Wherein the specific binding partner comprises at least one hybridization probe and at least one primer.
제21항에 있어서,
특이적 결합 파트너가 1종 이상의 혼성화 프로브 및 2종의 프라이머를 포함하는, 키트.
22. The method of claim 21,
Wherein the specific binding partner comprises at least one hybridization probe and two primers.
제18항 또는 제19항에 있어서,
1종 이상의 특이적 결합 파트너가 1종의 항체 및 1종의 친화성 단백질로 이루어진 군으로부터 선택된 1종 이상의 특이적 리간드를 포함하는, 키트.
20. The method according to claim 18 or 19,
Wherein the at least one specific binding partner comprises at least one specific ligand selected from the group consisting of one antibody and one affinity protein.
제24항에 있어서,
특이적 결합 파트너가 2종의 항체 또는 2종의 친화성 단백질, 또는 1종의 항체 및 1종의 친화성 단백질로 이루어진 군으로부터 선택된 2종 이상의 특이적 리간드를 포함하는, 키트.
25. The method of claim 24,
Wherein the specific binding partner comprises two or more specific ligands selected from the group consisting of two antibodies or two affinity proteins, or one kind of antibody and one kind of affinity protein.
KR1020137028000A 2011-03-25 2012-03-23 Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer Expired - Fee Related KR101626648B1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201172155 2011-03-25
CNPCT/CN2011/072155 2011-03-25
PCT/CN2012/072931 WO2012130103A1 (en) 2011-03-25 2012-03-23 Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer

Related Child Applications (1)

Application Number Title Priority Date Filing Date
KR1020167013748A Division KR101820616B1 (en) 2011-03-25 2012-03-23 Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer

Publications (2)

Publication Number Publication Date
KR20140016953A KR20140016953A (en) 2014-02-10
KR101626648B1 true KR101626648B1 (en) 2016-06-01

Family

ID=50265726

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020137028000A Expired - Fee Related KR101626648B1 (en) 2011-03-25 2012-03-23 Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer

Country Status (1)

Country Link
KR (1) KR101626648B1 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008063414A2 (en) * 2006-11-13 2008-05-29 Source Precision Medicine, Inc. Gene expression profiling for identification, monitoring, and treatment of colorectal cancer

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2169078A1 (en) * 2008-09-26 2010-03-31 Fundacion Gaiker Methods and kits for the diagnosis and the staging of colorectal cancer

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008063414A2 (en) * 2006-11-13 2008-05-29 Source Precision Medicine, Inc. Gene expression profiling for identification, monitoring, and treatment of colorectal cancer
US20100196889A1 (en) 2006-11-13 2010-08-05 Bankaitis-Davis Danute M Gene Expression Profiling for Identification, Monitoring and Treatment of Colorectal Cancer

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Cancer Biology & Therapy,제11권,2호,188-195면(2011)*

Also Published As

Publication number Publication date
KR20140016953A (en) 2014-02-10

Similar Documents

Publication Publication Date Title
RU2721916C2 (en) Methods for prostate cancer prediction
CN109790583B (en) Methods for typing lung adenocarcinoma subtypes
DK2644713T3 (en) A Method for Diagnosing Neoplasms II
AU2013277971B2 (en) Molecular malignancy in melanocytic lesions
KR20140044341A (en) Molecular diagnostic test for cancer
KR101421326B1 (en) Composition for predicting prognosis of breast cancer and kit comprising the same
KR20140006898A (en) Colon cancer gene expression signatures and methods of use
KR100964193B1 (en) Markers for liver cancer prognosis
KR20150090246A (en) Molecular diagnostic test for cancer
KR20160052729A (en) Molecular diagnostic test for lung cancer
CN101573453A (en) Methods of predicting distant metastasis of lymph node-negative primary breast cancer using biological pathway gene expression analysis
CN101258249A (en) Methods and reagents for detecting melanoma
AU2008203226A1 (en) Colorectal cancer prognostics
CN114127314A (en) Genetic genomes, methods and kits for identifying or classifying subtypes (subtypes) of breast cancer
KR101820616B1 (en) Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer
KR102480128B1 (en) Single nucleotide polymorphisms associated with immunity of African indicine breeds and their application
US20020137077A1 (en) Genes regulated in activated T cells
CN101111768A (en) lung cancer prognosis
AU2004205270B2 (en) Colorectal cancer prognostics
KR101653131B1 (en) Composition or Kit and Method for predicting prognosis of liver cancer
US20030175761A1 (en) Identification of genes whose expression patterns distinguish benign lymphoid tissue and mantle cell, follicular, and small lymphocytic lymphoma
EP2121971A2 (en) Methods and kits for diagnosis of multiple sclerosis in probable multiple sclerosis subjects
KR101626648B1 (en) Method and kit for determining in vitro the probability for an individual to suffer from colorectal cancer
KR20100115283A (en) Markers for liver cancer prognosis
KR101634047B1 (en) Biomarkers for detection of ADHD

Legal Events

Date Code Title Description
PA0105 International application

Patent event date: 20131024

Patent event code: PA01051R01D

Comment text: International Patent Application

PG1501 Laying open of application
A201 Request for examination
PA0201 Request for examination

Patent event code: PA02012R01D

Patent event date: 20140424

Comment text: Request for Examination of Application

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

Comment text: Notification of reason for refusal

Patent event date: 20150611

Patent event code: PE09021S01D

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

Patent event code: PE07011S01D

Comment text: Decision to Grant Registration

Patent event date: 20160226

A107 Divisional application of patent
PA0104 Divisional application for international application

Comment text: Divisional Application for International Patent

Patent event code: PA01041R01D

Patent event date: 20160524

GRNT Written decision to grant
PR0701 Registration of establishment

Comment text: Registration of Establishment

Patent event date: 20160526

Patent event code: PR07011E01D

PR1002 Payment of registration fee

Payment date: 20160526

End annual number: 3

Start annual number: 1

PG1601 Publication of registration
FPAY Annual fee payment

Payment date: 20190517

Year of fee payment: 4

PR1001 Payment of annual fee

Payment date: 20190517

Start annual number: 4

End annual number: 4

PR1001 Payment of annual fee

Payment date: 20230509

Start annual number: 8

End annual number: 8

PC1903 Unpaid annual fee

Termination category: Default of registration fee

Termination date: 20250306