
JP2001169780A - Gene derived from docosahexaenoic acid-producing bacterium - Google Patents

Gene derived from docosahexaenoic acid-producing bacterium

Info

Publication number
JP2001169780A
Authority
JP
Japan
Prior art keywords
ala
leu
val
ser
gly
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP35661499A
Other languages
Japanese (ja)
Inventor
Naoki Morita
直樹 森田
Mika Tanaka
美加 田中
Hidetoshi Okuyama
英登志 奥山
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Institute of Advanced Industrial Science and Technology AIST
Original Assignee
National Institute of Advanced Industrial Science and Technology AIST
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Institute of Advanced Industrial Science and Technology AIST
Priority to JP35661499A
Publication of JP2001169780A
Legal status: Pending

Classifications

    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02P: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P20/00: Technologies relating to chemical industry
    • Y02P20/50: Improvements relating to the production of bulk chemicals
    • Y02P20/52: Improvements relating to the production of bulk chemicals using catalysts, e.g. selective catalysts

Landscapes

  • Enzymes And Modification Thereof (AREA)
  • Preparation Of Compounds By Using Micro-Organisms (AREA)
  • Peptides Or Proteins (AREA)

Abstract

PROBLEM TO BE SOLVED: Wild-type microorganisms that produce a useful substance such as DHA generally have a low capacity to produce it; the aim is to make such microorganisms industrially usable by means of genes capable of enhancing DHA productivity. SOLUTION: Provided is DNA (e.g., SEQ ID NO: 1; see the specification) that is derived from a bacterium capable of producing docosahexaenoic acid (e.g., Moritella) and that encodes proteins similar to the eicosapentaenoic acid biosynthetic enzymes.

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

TECHNICAL FIELD The present invention relates to a group of genes encoding novel polyunsaturated fatty acid synthases. More specifically, it relates to novel genes, derived from a bacterium that produces docosahexaenoic acid (hereinafter "DHA"), that are markedly similar to the genes of the eicosapentaenoic acid (hereinafter "EPA") synthase group.

[0002]

BACKGROUND ART DHA is a polyunsaturated fatty acid for which a variety of physiological effects have been reported in recent years, including cholesterol-lowering, anticoagulant, and learning-enhancing effects. In view of these effects, consumption of blue-backed fish rich in DHA, such as sardines, mackerel, and saury, is recommended in Japan as well. DHA is a straight-chain polyunsaturated fatty acid with 22 carbon atoms and six double bonds and, as its chemical structure makes clear, is extremely difficult to synthesize chemically. Most of the DHA sold today as a health food is a fraction of fish oil obtained by the boiling method. When fish oil is the raw material, it is difficult to recover high-purity DHA ester in high yield whichever purification method is used.

[0003] Meanwhile, extraction from fish oil has drawbacks, such as a residual fishy odor when purification and concentration are incomplete. To improve on it, studies have recently been carried out on having fungi, microalgae, and the like produce DHA selectively. However, even when DHA-containing microorganisms or algae are used as the raw material, no report has been found of high-purity DHA ester being recovered from them in high yield.

[0004] It has long been reported that certain marine bacteria produce polyunsaturated fatty acids such as DHA. The present inventors began their research with the aim of finding a method for producing DHA-containing lipids using bacteria, which have short culture times, are easy to control in culture, and from which genes are readily obtained. Genes of the EPA biosynthetic enzyme group have been described in JP-A-6-46864 and JP-A-8-242867.

[0005] However, the biosynthetic enzymes involved in DHA synthesis and the genes encoding them have not yet been isolated or analyzed, and elucidation of these genes has been awaited in the art.

[0006]

PROBLEM TO BE SOLVED BY THE INVENTION An object of the present invention is to isolate a novel polyunsaturated fatty acid synthase gene derived from a DHA-producing bacterium, in particular a gene involved in DHA synthesis, and to provide the DNA of that gene.

[0007]

MEANS FOR SOLVING THE PROBLEM As a result of intensive studies toward the above object, the present inventors isolated part of the chromosomal DNA of a DHA-producing bacterium and searched the multiple amino acid sequences encoded by its nucleotide sequence for those markedly similar to the amino acid sequences of the EPA synthase group. They thereby found previously unknown enzyme proteins involved in DHA synthesis, together with the DNA of their genes, and completed the present invention.

[0008] That is, the present invention provides DNA that is derived from a bacterium capable of producing docosahexaenoic acid and that encodes a group of proteins similar to the icosapentaenoic acid biosynthetic enzyme group. The bacterium preferably belongs to the genus Moritella. The DNA preferably comprises the following nucleotide sequences:
(i) a nucleotide sequence encoding a protein that comprises the amino acid sequence of SEQ ID NO: 3, in which one or more amino acids may be substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group;
(ii) a nucleotide sequence encoding a protein that comprises the amino acid sequence of SEQ ID NO: 5, in which one or more amino acids may be substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group;
(iii) a nucleotide sequence encoding a protein that comprises the amino acid sequence of SEQ ID NO: 7, in which one or more amino acids may be substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group; and
(iv) a nucleotide sequence encoding a protein that comprises the amino acid sequence of SEQ ID NO: 9, in which one or more amino acids may be substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
An example is DNA comprising the nucleotide sequence of SEQ ID NO: 1.

[0009] The present invention further provides a protein shown in (1) or (2) below.
(1) A protein comprising the amino acid sequence of SEQ ID NO: 3.
(2) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 3 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.

[0010] The present invention further provides DNA encoding a protein shown in (1) or (2) below.
(1) A protein comprising the amino acid sequence of SEQ ID NO: 3.
(2) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 3 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
An example of such DNA is DNA comprising the nucleotide sequence of SEQ ID NO: 2.

[0011] The present invention further provides a protein shown in (3) or (4) below.
(3) A protein comprising the amino acid sequence of SEQ ID NO: 5.
(4) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 5 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.

[0012] The present invention further provides DNA encoding a protein shown in (3) or (4) below.
(3) A protein comprising the amino acid sequence of SEQ ID NO: 5.
(4) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 5 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
An example of such DNA is DNA comprising the nucleotide sequence of SEQ ID NO: 4.

[0013] The present invention further provides a protein shown in (5) or (6) below.
(5) A protein comprising the amino acid sequence of SEQ ID NO: 7.
(6) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 7 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.

[0014] The present invention further provides DNA encoding a protein shown in (5) or (6) below.
(5) A protein comprising the amino acid sequence of SEQ ID NO: 7.
(6) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 7 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
An example of such DNA is DNA comprising the nucleotide sequence of SEQ ID NO: 6.

[0015] The present invention further provides a protein shown in (7) or (8) below.
(7) A protein comprising the amino acid sequence of SEQ ID NO: 9.
(8) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 9 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.

[0016] The present invention further provides DNA encoding a protein shown in (7) or (8) below.
(7) A protein comprising the amino acid sequence of SEQ ID NO: 9.
(8) A protein that comprises an amino acid sequence in which one or more amino acids of the amino acid sequence of SEQ ID NO: 9 have been substituted, deleted, added, or inserted, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
An example of such DNA is DNA comprising the nucleotide sequence of SEQ ID NO: 8.

[0017]

MODE FOR CARRYING OUT THE INVENTION The present invention is described in detail below.

1. Gene source
The organism usable as a gene source in the present invention is not limited to any particular genus, species, or strain; any bacterium capable of producing DHA can be used. Such microorganisms are readily available from public microorganism depositories. Examples include bacteria belonging to the genus Moritella, such as Moritella marina strain MP-1 (ATCC 15381).

[0018] 2. Cloning of the DHA synthase gene group
The case where Moritella marina strain MP-1 (ATCC 15381) is used as the gene source is described specifically here as an example. However, as noted above, various DHA-producing bacteria can be used as the gene source in the same way.

[0019] To isolate the DHA synthase gene group from a DHA-producing bacterium, the known nucleotide sequences of fatty acid synthase genes, polyketide synthase genes, and EPA synthase group genes are used. For example, the fifth open reading frame (hereinafter "ORF") of the EPA biosynthetic enzyme group genes from Shewanella strain SCRC-2738 contains a region in which the domains of β-ketoacyl-[acyl carrier protein (hereinafter "ACP")] synthase (hereinafter "KAS") and malonyl coenzyme A (hereinafter "CoA")-ACP transacylase (hereinafter "MCT") lie adjacent to each other. Oligonucleotides are therefore prepared from an amino acid sequence well conserved in the KAS of Escherichia coli and Mycobacterium tuberculosis, and from an amino acid sequence well conserved in the MCT of Escherichia coli and Streptomyces coelicolor. The corresponding gene region of Moritella marina strain MP-1 is amplified by genomic PCR and cloned into a plasmid vector, and its nucleotide sequence is determined by a conventional method. Comparison with the known EPA synthase group genes then confirms that part of the target gene has been isolated from Moritella marina strain MP-1.
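Deriving oligonucleotides from conserved amino acid sequences, as described above, amounts to back-translating a short peptide into a degenerate oligonucleotide. The following is a minimal sketch of that idea, assuming the standard genetic code and IUPAC ambiguity codes. The function names and the example peptide are hypothetical and are not the primers used in the patent.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# IUPAC nucleotide ambiguity codes, keyed by the set of bases each code covers.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C", frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M", frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V", frozenset("ACGT"): "N",
}


def degenerate_codon(aa):
    """Merge all codons for one amino acid position by position.

    For six-codon amino acids (Leu, Ser, Arg) this over-covers, e.g. Leu
    becomes YTN, which also matches Phe codons. Real primer design usually
    avoids or splits such positions.
    """
    codons = [codon for codon, residue in CODON_TABLE.items() if residue == aa]
    if not codons or aa == "*":
        raise ValueError(f"not an amino acid letter: {aa!r}")
    return "".join(IUPAC[frozenset(c[i] for c in codons)] for i in range(3))


def back_translate(peptide):
    """Back-translate a peptide into a degenerate oligonucleotide (5' to 3')."""
    return "".join(degenerate_codon(aa) for aa in peptide.upper())


def degeneracy(oligo):
    """Number of distinct sequences represented by a degenerate oligonucleotide."""
    sizes = {code: len(bases) for bases, code in IUPAC.items()}
    count = 1
    for symbol in oligo:
        count *= sizes[symbol]
    return count


if __name__ == "__main__":
    oligo = back_translate("MDK")  # hypothetical conserved tripeptide
    print(oligo, degeneracy(oligo))
```

An antisense primer would be the reverse complement of the back-translated sequence; that step is omitted here to keep the sketch short.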

[0020] Next, because the target DHA synthase gene group is expected to consist of multiple genes, a cosmid vector is used to construct the library. Genomic DNA of Moritella marina strain MP-1 is partially digested with a restriction enzyme, size-fractionated, and inserted into the cosmid vector to construct a cosmid library. The DNA of the part of the target enzyme gene obtained by PCR is labeled with [α-32P]dCTP and used as a probe for colony hybridization to obtain positive clones. The genomic DNA carried by a positive clone is mapped with various restriction enzymes, and its entire nucleotide sequence is then determined to confirm whether it contains the whole of the target enzyme genes.
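Once a clone has been sequenced, the restriction mapping mentioned above can be cross-checked in silico by locating recognition sites in the sequence. The sketch below assumes three common enzymes (EcoRI, BamHI, HindIII) purely for illustration, since the text does not name the enzymes used, and the test sequence is invented.

```python
# Recognition sites of three common restriction enzymes (illustrative choice;
# the enzymes actually used for mapping are not named in the text).
ENZYMES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
}


def find_sites(seq, enzymes=ENZYMES):
    """Return 1-based start positions of each recognition site in seq.

    The sites above are palindromic, so scanning one strand is sufficient.
    """
    seq = seq.upper()
    hits = {}
    for name, site in enzymes.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)  # convert to 1-based coordinates
            start = seq.find(site, start + 1)
        hits[name] = positions
    return hits


if __name__ == "__main__":
    demo = "AAGAATTCGGGGATCCTTGAATTC"  # invented sequence
    for enzyme, positions in find_sites(demo).items():
        print(enzyme, positions)
```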

[0021] The above procedure allows the DHA synthase gene group to be cloned and sequenced. An example of the nucleotide sequence of the DHA synthase gene group is the nucleotide sequence of SEQ ID NO: 1, which is contained in the genome of Moritella marina strain MP-1 (ATCC 15381).

[0022] Part of this gene group can also be used to alter the fatty acid composition of a host organism. A downsized gene group, obtained by removing unnecessary portions of the genes by known methods, is also included in the present invention.

[0023] 3. Analysis of the nucleotide sequence
As shown in FIG. 3, the nucleotide sequence of SEQ ID NO: 1 contains 22 ORFs (designated ORF1 to ORF22 from the 5' side to the 3' side). Of these, the deduced amino acid sequences of ORF8 to ORF11 (SEQ ID NOs: 3, 5, 7, and 9, respectively) are similar, in both number of amino acid residues and sequence, to the amino acid sequences encoded by ORF5 to ORF8, respectively, of the known EPA synthase group genes (DDBJ/EMBL/GenBank accession number U73935).
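Identifying ORFs in a determined nucleotide sequence can be illustrated with a simple scan for ATG-to-stop stretches in each reading frame. This is a minimal sketch that assumes ATG start codons, scans the forward strand only, and uses an arbitrary minimum length. It is not the analysis software used in the patent, and the demo sequence is invented.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=3):
    """Find ORFs (ATG ... stop) in the three forward reading frames.

    Returns a sorted list of (start, end, frame) tuples with 1-based,
    inclusive coordinates; `end` is the last base of the stop codon.
    `min_codons` counts codons from the ATG up to, but not including,
    the stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame ATG opens the ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start + 1, i + 3, frame + 1))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)


if __name__ == "__main__":
    print(find_orfs("CCATGAAATTTGGGTAACC"))  # invented sequence
```

For a real gene cluster the reverse strand would be scanned as well (by applying the same function to the reverse complement), and a much larger `min_codons` would be used.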

[0024] Furthermore, ORF8 contains domains with marked similarity to β-ketoacyl-ACP synthase II, malonyl-CoA-ACP transacylase, acyl carrier protein (ACP), and β-ketoacyl-ACP reductase. ORF9 contains domains with marked similarity to β-ketoacyl-ACP synthase II and malonyl-CoA-ACP transacylase. ORF10 contains domains with marked similarity to β-ketoacyl-ACP synthase II and 3-hydroxydecanoyl-ACP dehydrase. Such domain structures closely resemble those of multifunctional enzymes involved in the biosynthesis of long-chain unsaturated compounds, such as polyketide synthase (hereinafter "PKS"), type I fatty acid synthase (hereinafter "FAS I"), and heterocyst glycolipid synthase (hereinafter "Hgl"). From these results, the proteins encoded by ORF8 to ORF11 are considered to be members of the DHA synthase group.

[0025] 4. DNA and protein of the present invention
The DNA of the present invention is DNA that is derived from a bacterium capable of producing DHA and that encodes a group of proteins similar to the icosapentaenoic acid biosynthetic enzyme group. Such DNA preferably comprises four nucleotide sequences, each encoding a protein that comprises one of the amino acid sequences encoded by ORF8 to ORF11 (SEQ ID NOs: 3, 5, 7, and 9, respectively), in which one or more amino acids may be substituted, deleted, added, or inserted, and that can function as a member of the DHA synthase group. An example is DNA comprising the nucleotide sequence of SEQ ID NO: 1. However, the DNA of the present invention is not limited to this, and DNA comprising other nucleotide sequences can also be obtained by the methods described in "1. Gene source" and "2. Cloning of the DHA synthase gene group" above.

[0026] The proteins of the present invention are proteins comprising the amino acid sequences encoded by ORF8 to ORF11 contained in the nucleotide sequence of SEQ ID NO: 1. These amino acid sequences are shown in SEQ ID NO: 3 (ORF8), SEQ ID NO: 5 (ORF9), SEQ ID NO: 7 (ORF10), and SEQ ID NO: 9 (ORF11). Each of these proteins can function as a member of the DHA synthase group.

[0027] However, the amino acid sequences of the proteins of the present invention are not limited to those above. As long as the protein functions as a member of the DHA synthase group, each amino acid sequence may carry mutations such as substitution, deletion, addition, or insertion of one or more amino acids. The number of amino acid residues that may be mutated is not particularly limited, but is preferably one to several. The proteins of the present invention also include proteins that comprise an amino acid sequence encoded by DNA that hybridizes under stringent conditions with DNA having the nucleotide sequence of any of ORF8 to ORF11, and that can function as a member of the DHA synthase group. Here, an example of "stringent conditions" is hybridization at 65 °C in a hybridization buffer containing 5× SSC, 10× Denhardt's solution, 0.1% SDS, and 100 μg/ml salmon sperm DNA. It is also possible, for example in order to increase the stability or activity of an enzyme, to change the translated amino acid sequence by altering part of the nucleotide sequence of the gene using known techniques.
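As a worked example of the buffer composition above, the volume of each stock solution follows from the dilution relation C1·V1 = C2·V2. The stock concentrations below (20× SSC, 50× Denhardt's solution, 10% SDS, 10 mg/ml salmon sperm DNA) are assumptions based on commonly prepared stocks and are not specified in the text; only the final concentrations come from the description.

```python
def stock_volumes(final_ml, recipe):
    """Compute the volume (ml) of each stock needed for `final_ml` of buffer.

    `recipe` maps a component name to (stock concentration, final
    concentration), both in the same unit. Water makes up the remainder.
    """
    volumes = {}
    for name, (stock, final) in recipe.items():
        if final > stock:
            raise ValueError(f"{name}: final concentration exceeds stock")
        volumes[name] = final_ml * final / stock  # V1 = V2 * C2 / C1
    water = final_ml - sum(volumes.values())
    if water < 0:
        raise ValueError("stock volumes exceed the final volume")
    volumes["water"] = water
    return volumes


# Final concentrations are from the description; stock concentrations are assumed.
HYBRIDIZATION_BUFFER = {
    "SSC (x)": (20.0, 5.0),
    "Denhardt's solution (x)": (50.0, 10.0),
    "SDS (%)": (10.0, 0.1),
    "salmon sperm DNA (ug/ml)": (10000.0, 100.0),
}

if __name__ == "__main__":
    for component, ml in stock_volumes(100.0, HYBRIDIZATION_BUFFER).items():
        print(f"{component}: {ml:.2f} ml")
```

With different stock concentrations only the `HYBRIDIZATION_BUFFER` table needs to change.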

[0028] The DNA of the present invention also includes ORF8 to ORF11, which are parts of the DNA comprising the nucleotide sequence of SEQ ID NO: 1. Examples of their nucleotide sequences include SEQ ID NO: 2 (ORF8), SEQ ID NO: 4 (ORF9), SEQ ID NO: 6 (ORF10), and SEQ ID NO: 8 (ORF11). They are not limited to these, however; any DNA encoding a protein of the present invention as described above is included in the DNA of the present invention.

[0029] Mutations can be introduced into DNA by known techniques such as the Kunkel method or the gapped duplex method, or by methods based on them. For example, mutations can be introduced using a mutagenesis kit based on site-directed mutagenesis, such as Mutant-K or Mutant-G (TaKaRa), or an LA PCR in vitro Mutagenesis series kit (TaKaRa).

[0030] Once the nucleotide sequence of the DNA of the present invention has been determined, DNA comprising that sequence can thereafter be prepared by chemical methods such as chemical synthesis. The DNA of the present invention can also be obtained from the genomic DNA or the like of a bacterium capable of producing DHA by biochemical methods, such as hybridization using a DNA fragment having all or part of the nucleotide sequence as a probe, or PCR using DNA fragments having part of the nucleotide sequence as primers. Such chemical synthesis, hybridization, and PCR can be carried out by methods known to those skilled in the art.

[0031] Whether the DNA of the present invention obtained as described above functions as a DHA synthase group gene, that is, whether each protein of the present invention functions as a member of the DHA synthase group, can be confirmed by the following techniques.

[0032] Specifically, the DNA is inserted into a known expression vector, the resulting vector is introduced into a suitable host to produce a transformant, the transformant is cultured to express the protein, and the function of the resulting protein is analyzed. As the expression vector, pBluescript II (Stratagene) or the like can be used, for example. Escherichia coli strain TOP10F' (Invitrogen) is a suitable host, in which case the culture temperature is preferably 20 °C or lower. The function of the protein can be analyzed by determining the total fatty acid composition of the cultured transformant by gas chromatography and confirming the presence of DHA.
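The gas chromatography readout described above can be summarized as a percentage composition and checked for a DHA (22:6) peak. The sketch below treats each peak area as proportional to the amount of that fatty acid, which is a simplification. All peak areas and the 0.5% detection threshold are hypothetical values chosen for illustration, not data from the patent.

```python
def fatty_acid_composition(peak_areas):
    """Convert GC peak areas into percent of total fatty acids."""
    total = sum(peak_areas.values())
    if total <= 0:
        raise ValueError("total peak area must be positive")
    return {name: 100.0 * area / total for name, area in peak_areas.items()}


def dha_detected(composition, min_percent=0.5):
    """Report whether DHA (22:6) reaches an assumed detection threshold."""
    return composition.get("22:6", 0.0) >= min_percent


# Hypothetical peak areas (arbitrary units) for a transformant and a vector-only control.
TRANSFORMANT = {"16:0": 420.0, "16:1": 380.0, "18:1": 150.0, "22:6": 50.0}
CONTROL = {"16:0": 450.0, "16:1": 400.0, "18:1": 150.0}

if __name__ == "__main__":
    for label, peaks in (("transformant", TRANSFORMANT), ("control", CONTROL)):
        composition = fatty_acid_composition(peaks)
        print(label, "DHA detected:", dha_detected(composition))
```

Comparing against a vector-only control, as here, helps attribute a 22:6 peak to the introduced DNA rather than to the host.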

[0033] Alternatively, a foreign gene is inserted into all or part of the DNA of the present invention, the construct is incorporated into a vector capable of propagating (replicating) in the DHA-producing bacterium used as the gene source, and the resulting vector is introduced into that bacterium. Homologous recombination replaces the gene on the genome with the portion carrying the foreign gene, yielding an insertion mutant for each ORF, and the function of the ORF is analyzed in that mutant. An antibiotic resistance gene can be used as the foreign gene. Gene function can be analyzed by analyzing the total fatty acids of the insertion mutant by gas chromatography. That is, if the insertion mutation is confirmed to abolish DHA, the ORF is confirmed to be essential for DHA synthesis. Furthermore, if a new fatty acid is detected, the function of each enzyme protein can be inferred.

[0034] As described above, the genes provided by the present invention are closely similar to known EPA synthase group genes. They can be used to analyze the DHA synthase gene group on the basis of information on those similar genes, and to study the functions of the genes so analyzed. They can also be used in research aimed at improving DHA productivity and in applied research on conferring DHA biosynthetic ability on organisms that lack it. Furthermore, the genes provided by the present invention make it possible to provide primers and/or probes useful for detecting genes involved in polyunsaturated fatty acid synthesis in bacteria that synthesize polyunsaturated fatty acids.

【0035】[0035]

【実施例】以下の実施例により、本発明を理解するため
にさらに詳細に説明する。これらの実施例は本発明の技
術範囲を限定するものではない。 〔実施例1〕供試菌種、培養条件とDNAの単離方法 モリテラ・マリナMP−1株(Moritella marina strai
n MP-1、ATCC15381)は、American Type Cult
ure Collectionより購入した。モリテラ・マリナMP−
1株は、Difco社製マリンブロス培地(タイプ22
16)50mlを500mlのフラスコ内で200rp
mの速度で旋回して、10℃で培養した。培養液を遠心
処理(15000×g, 20分)し、沈殿した細菌を回
収した。モリテラ・マリナMP‐1株のゲノムDNA
は、ニッポンジーン社製イソプラントDNA抽出キット
を用いて、該キットに付帯のプロトコールに従って抽出
した。
[Examples] The following examples describe the present invention in further detail to aid understanding. These examples do not limit the technical scope of the present invention.
[Example 1] Test strain, culture conditions, and DNA isolation method
Moritella marina strain MP-1 (ATCC 15381) was purchased from the American Type Culture Collection. The strain was cultured at 10 °C in 50 ml of Difco Marine Broth (type 2216) in a 500 ml flask with rotary shaking at 200 rpm. The culture was centrifuged (15,000 × g, 20 minutes) and the pelleted cells were collected. Genomic DNA of Moritella marina strain MP-1 was extracted with the ISOPLANT DNA extraction kit (Nippon Gene) according to the protocol supplied with the kit.

【0036】〔実施例2〕PCR及びPCR産物のサブ
クローニング 矢澤の報告(DDBJ/EMBL/GenBank塩基配列データベース
登録番号;U73935)によると、シェワネラ(Shew
anella)SCRC-2738株由来のEPA生合成酵素
群遺伝子中5番目のORFにはKASとMCTのドメイ
ンが隣り合って存在する部分がある。そこで、上記KA
Sドメインと、大腸菌(Escherichia coli)及びマイコ
バクテリウム・チュバキュロシス(Mycobacterium tube
rculosis)のKASドメインとの間でよく保存されてい
るアミノ酸配列からセンスプライマーを、また、上記M
CTドメインと、大腸菌及びストレプトマイセス・コリ
カラ(Streptomyces coelicolor)のMCTドメインと
の間でよく保存されているアミノ酸配列からアンチセン
スプライマーを作製し(図1、矢印はプライマーの位置
を示す)、ゲノムPCRを行った。このとき、ゲノムP
CRに用いたプライマーは、下記表1の通りである。
[0036] [Example 2] PCR and subcloning of the PCR product
According to Yazawa's report (DDBJ/EMBL/GenBank nucleotide sequence database accession number U73935), the fifth ORF of the EPA biosynthetic enzyme group genes from Shewanella strain SCRC-2738 contains a region in which the KAS and MCT domains lie adjacent to each other. A sense primer was therefore designed from an amino acid sequence that is well conserved among this KAS domain and the KAS domains of Escherichia coli and Mycobacterium tuberculosis. An antisense primer was designed from an amino acid sequence that is well conserved among this MCT domain and the MCT domains of Escherichia coli and Streptomyces coelicolor (FIG. 1; arrows indicate the primer positions). Genomic PCR was then performed. The primers used for the genomic PCR are shown in Table 1 below.

【0037】[0037]

【表1】 [Table 1]
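
Because the primers in Example 2 are designed from conserved amino acid sequences rather than from a known nucleotide sequence, each primer in practice corresponds to a mixture of possible nucleotide sequences. The following is a minimal sketch of how that degeneracy can be estimated, assuming the standard genetic code. The peptide `DEMO_MOTIF` is hypothetical and is not taken from Table 1; the function name `degeneracy` is likewise an illustrative choice.

```python
from math import prod

# Number of codons encoding each amino acid in the standard genetic code.
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide: str) -> int:
    """Count the distinct nucleotide sequences that encode `peptide`."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide.upper())


# Hypothetical conserved motif, for illustration only (not from the patent).
DEMO_MOTIF = "GMDWF"

if __name__ == "__main__":
    print(DEMO_MOTIF, degeneracy(DEMO_MOTIF))
```

Motifs rich in Met and Trp (one codon each) give low degeneracy, whereas Leu, Arg, and Ser (six codons each) raise it quickly, which is one reason a primer designer might prefer some conserved stretches over others.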

【0038】次いで、実施例1で得られたモリテラ・マ
リナMP−1株由来全DNAを鋳型とし、上記プライマ
ーを用いてPCR(ポリメラーゼ連鎖反応)を行った。
DNAポリメラーゼとしては、Taqポリメラーゼ(パー
キンエルマー社製)を用いた。すなわち、以下の表2に
示す組成を有するPCR溶液を調製し、サイクル反応を
行った。
Next, PCR (polymerase chain reaction) was performed with the above primers, using the total DNA of Moritella marina strain MP-1 obtained in Example 1 as the template. Taq polymerase (Perkin-Elmer) was used as the DNA polymerase. Specifically, a PCR solution with the composition shown in Table 2 below was prepared and subjected to thermal cycling.

【0039】[0039]

【表2】 [Table 2]

【0040】上記サイクル反応の温度条件は、94℃4
分を1回、94℃1分、45℃2分、72℃3分の繰り
返しを30回、最後に72℃10分を1回とした。上記
PCRの結果、図3に示したような710bpのKAS
/MCT断片が増幅された。この断片の塩基配列を常法
に従って決定した。シーケンスデータのデータベース検
索をNational Center of Biochemical Informationのオ
ンラインBLASTを用いて行ったところ、予想される
アミノ酸配列でシェワネラSCRC-2738株由来の
EPA合成酵素群遺伝子の該当する部分と39%の同一
性を示した(図2)。
The thermal cycling conditions were one step of 94 °C for 4 minutes; 30 cycles of 94 °C for 1 minute, 45 °C for 2 minutes, and 72 °C for 3 minutes; and a final step of 72 °C for 10 minutes. The PCR amplified a 710 bp KAS/MCT fragment, as shown in FIG. 3. The nucleotide sequence of this fragment was determined by a conventional method. A database search of the sequence data using the online BLAST service of the National Center for Biotechnology Information showed that the predicted amino acid sequence was 39% identical to the corresponding portion of the EPA synthase group genes from Shewanella strain SCRC-2738 (FIG. 2).
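
The cycling conditions described above can be written out as a small data structure, which makes the total programmed hold time easy to check. This is only a sketch: the variable and function names are illustrative, and ramp times between temperatures, which depend on the thermal cycler, are not included.

```python
# Thermal profile as stated in the text: (temperature in deg C, minutes).
INITIAL_DENATURATION = (94, 4)
CYCLE_STEPS = [(94, 1), (45, 2), (72, 3)]  # denature, anneal, extend
CYCLES = 30
FINAL_EXTENSION = (72, 10)


def total_hold_minutes(initial, steps, cycles, final):
    """Sum the programmed hold times; instrument ramp times are ignored."""
    per_cycle = sum(minutes for _, minutes in steps)
    return initial[1] + cycles * per_cycle + final[1]


if __name__ == "__main__":
    total = total_hold_minutes(
        INITIAL_DENATURATION, CYCLE_STEPS, CYCLES, FINAL_EXTENSION
    )
    print(f"{total} min programmed hold time")  # 4 + 30 * 6 + 10 = 194
```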

【0041】〔実施例3〕コスミドライブラリーの構築
とスクリーニング モリテラ・マリナMP−1株から抽出したゲノムDNA
を制限酵素Sau3AIで部分消化した後、サイズ分画
し、制限酵素BamHIで消化して開環したニッポンジ
ーン社製コスミドベクター、ロリスト6につないでコス
ミドライブラリーを構築した。その手順を以下に示す。
[Example 3] Construction and screening of a cosmid library
Genomic DNA extracted from Moritella marina strain MP-1 was partially digested with the restriction enzyme Sau3AI, size-fractionated, and ligated into the cosmid vector Lorist 6 (Nippon Gene), which had been linearized by digestion with the restriction enzyme BamHI, to construct a cosmid library. The procedure is described below.

【0042】100μgのゲノムDNAを制限酵素Sa
u3AIで部分消化し、ショ糖密度勾配遠心によってサ
イズ分画した。およそ50kbpの断片が濃縮された分
画のDNA断片とロリスト6クローニングベクターを混
合し、T4DNAリガーゼにより連結した。この連結さ
れたDNA鎖を常法に従ってλファージにパッケージン
グし、大腸菌DH5α株に感染させた。得られたカナマ
イシン耐性コロニーを、カナマイシン(25μg/m
l)を含むLB寒天培地の上に置いたナイロンフィルタ
ー(Hybond N+、Amersham Pharmacia Biotech社製)に
移し、30℃で一晩培養した。常法に従い、フィルター
上に増殖したコロニーをアルカリ溶解させ、遊離したD
NAをフィルター上に固定した。このフィルターを、5
×SSC、10×Denhaldt's溶液、0.1%SDS、100μg
/mlサケ精子DNAを含むハイブリダイゼーションバッ
ファー中に浸し、65℃で一晩プレハイブリダイゼーショ
ンを行った。その後、放射線標識したプローブを加え
て、65℃で一晩ハイブリダイゼーションを行った。プロ
ーブとしては、ランダムプライマーDNAラベリングキ
ット(TaKaRa社製)と[α−32P]dCTP(Amersham P
harmacia Biotech社製)で標識した710bpのKAS
/MCT断片(図2)を用いた。ハイブリダイゼーショ
ンの後、該フィルターについて2×SSC、0.1%SD
S中、室温で10分間の洗浄を行い、さらに、1×SS
C、0.1%SDS中、65℃で30分間の洗浄を2回行っ
た。オートラジオグラフィーにより確認したところ、2
45個のコロニーのうち、2個の陽性コロニーが得られ
た。このうち1個は、およそ40kbpの挿入断片をも
っていた(このコスミドをp3D5と称する)。
100 μg of genomic DNA was partially digested with the restriction enzyme Sau3AI and size-fractionated by sucrose density gradient centrifugation. DNA fragments from the fraction enriched in fragments of approximately 50 kbp were mixed with the Lorist 6 cloning vector and ligated with T4 DNA ligase. The ligated DNA was packaged into λ phage by a conventional method and used to infect Escherichia coli strain DH5α. The resulting kanamycin-resistant colonies were transferred to nylon filters (Hybond N+, Amersham Pharmacia Biotech) placed on LB agar medium containing kanamycin (25 μg/ml) and cultured overnight at 30 °C. By a conventional method, the colonies grown on the filters were lysed with alkali and the released DNA was fixed to the filters. The filters were immersed in hybridization buffer containing 5× SSC, 10× Denhardt's solution, 0.1% SDS, and 100 μg/ml salmon sperm DNA, and prehybridized overnight at 65 °C. A radiolabeled probe was then added and hybridization was carried out overnight at 65 °C. The probe was the 710 bp KAS/MCT fragment (FIG. 2) labeled with a random primer DNA labeling kit (TaKaRa) and [α-32P]dCTP (Amersham Pharmacia Biotech). After hybridization, the filters were washed in 2× SSC, 0.1% SDS at room temperature for 10 minutes, and then twice in 1× SSC, 0.1% SDS at 65 °C for 30 minutes each. Autoradiography identified 2 positive colonies among 245 colonies. One of these carried an insert of approximately 40 kbp (this cosmid is designated p3D5).

【0043】〔実施例4〕コスミドp3D5の塩基配列
の解析 コスミドp3D5中のゲノムDNA挿入部を含むSau3AI
-Sau3AI断片の全塩基配列を配列番号1に示す。この塩
基配列中には、図3に示されるように、22個のORF
があると推定できた。図3において、予想されるORF
の大きさと方向は、矢印で示す。斜線が施されている矢
印は、シェワネラSCRC2738株のEPA合成酵素
群遺伝子と相同性が見られたことを示す。全塩基配列
(配列番号1)とORF8〜11(それぞれ配列番号
2、4、6及び8)の関係は表3の通りである。
[Example 4] Analysis of the nucleotide sequence of cosmid p3D5
The complete nucleotide sequence of the Sau3AI-Sau3AI fragment containing the genomic DNA insert of cosmid p3D5 is shown in SEQ ID NO: 1. As shown in FIG. 3, this nucleotide sequence was estimated to contain 22 ORFs. In FIG. 3, arrows indicate the size and direction of each predicted ORF. Hatched arrows indicate ORFs that showed homology to the EPA synthase group genes of Shewanella strain SCRC-2738. The relationship between the complete nucleotide sequence (SEQ ID NO: 1) and ORFs 8 to 11 (SEQ ID NOs: 2, 4, 6, and 8, respectively) is shown in Table 3.

【0044】[0044]

【表3】 [Table 3]
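
The text does not state how the 22 ORFs in SEQ ID NO: 1 were predicted. As a rough illustration of what an ORF scan over both strands involves, the sketch below searches all six reading frames for stretches running from an ATG to the next in-frame stop codon. It is a simplification under stated assumptions: only ATG is treated as a start codon, although bacteria also use alternative starts, and the function names, the `min_codons` threshold, and the toy sequence are all hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 3):
    """Scan six frames for ATG...stop ORFs.

    Returns tuples (strand, start, end, n_codons). Coordinates are 0-based,
    half-open, and relative to the scanned strand; `end` includes the stop
    codon, while `n_codons` does not count it.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i  # first in-frame start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3
                    if n_codons >= min_codons:
                        orfs.append((strand, start, i + 3, n_codons))
                    start = None  # resume the search after this stop codon
    return orfs


if __name__ == "__main__":
    # Toy sequence for illustration only (not from SEQ ID NO: 1).
    print(find_orfs("CCATGAAACCCGGGTAACC"))
```

For a real 41.6 kbp insert a much larger `min_codons` would be appropriate, and candidate ORFs would normally be checked against sequence databases, as the text goes on to do for ORFs 8 to 11.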

【0045】さらに、EPA合成酵素群遺伝子(DDBJ/E
MBL/GenBank登録番号;U73935)の全塩基配列と
これに含まれるORF5〜8との関係を、下記の表4に
示す。
Further, Table 4 below shows the relationship between the complete nucleotide sequence of the EPA synthase group genes (DDBJ/EMBL/GenBank accession number U73935) and ORFs 5 to 8 contained in it.

【0046】[0046]

【表4】 [Table 4]

【0047】上記の表3と表4を比較したところ、OR
F5,6,7及び8の長さは、p3D5のORF8,
9,10及び11に対応するそれぞれのORFの長さと
ほぼ同じであった。p3D5のORF8、9、10及び
11の推定アミノ酸配列を、EPA合成酵素群遺伝子の
ORF5、6、7及び8によってコードされるアミノ酸
配列と比較した結果を、下記の表5に示す。
A comparison of Tables 3 and 4 above showed that the lengths of ORFs 5, 6, 7, and 8 were almost the same as those of the corresponding ORFs 8, 9, 10, and 11 of p3D5, respectively. Table 5 below shows the results of comparing the deduced amino acid sequences of ORFs 8, 9, 10, and 11 of p3D5 with the amino acid sequences encoded by ORFs 5, 6, 7, and 8 of the EPA synthase group genes.

【0048】[0048]

【表5】 [Table 5]

【0049】表5からわかるように、p3D5のORF
8、9、10及び11の推定アミノ酸配列は、EPA合
成酵素群遺伝子のORF5、6、7及び8によりコード
されるアミノ酸配列とそれぞれよく類似していた。OR
F8、9、10の推定アミノ酸配列をデータベースと照
らし合わせると、脂肪酸合成の様々な反応にかかわる酵
素やタンパク質に類似した領域がいくつかみられた。そ
の結果を次の表6に示す。
As can be seen from Table 5, the deduced amino acid sequences of ORFs 8, 9, 10, and 11 of p3D5 were each closely similar to the amino acid sequences encoded by ORFs 5, 6, 7, and 8 of the EPA synthase group genes, respectively. When the deduced amino acid sequences of ORFs 8, 9, and 10 were compared against a database, several regions were found that resemble enzymes and proteins involved in various reactions of fatty acid synthesis. The results are shown in Table 6 below.

【0050】[0050]

【表6】 [Table 6]

【0051】ORF8にはKAS、MCT、ACP、β
-ケトアシル-ACPリダクターゼ(以下「KAR」とい
う。)によく似た領域があった。このうちACP類似領
域は5回繰り返していた。ORF9にはKASとMCT
に類似した領域があった。ORF10には2ヶ所のKA
S領域と2ヶ所のヒドロキシデカノイル-ACPデヒド
ラーゼ(HDD)と類似した領域があった。
ORF8 contained regions closely resembling KAS, MCT, ACP, and β-ketoacyl-ACP reductase (hereinafter "KAR"). Of these, the ACP-like region was repeated five times. ORF9 contained regions similar to KAS and MCT. ORF10 contained two KAS regions and two regions similar to hydroxydecanoyl-ACP dehydrase (HDD).

【0052】これらの領域はどれも、部分的にそれぞれ
の酵素、タンパク質の配列と似ているというより、酵
素、タンパク質の配列全体を含んでおり、それぞれのO
RFがいくつかの触媒部位をもった多機能タンパク質を
コードしていることを示唆するものである。このような
ORFのドメイン構造は、PKSやFAS Iと類似し
ている(Hopwood, D.A. (1997) Chem. Rev. 97: 2465-2
497.)。
None of these regions merely resembles part of the sequence of the corresponding enzyme or protein; each contains the entire sequence of that enzyme or protein. This suggests that each ORF encodes a multifunctional protein with several catalytic sites. Such an ORF domain structure resembles that of PKS and FAS I (Hopwood, D.A. (1997) Chem. Rev. 97: 2465-2497).

【0053】さらに、ORF8のKAS、MCT及びK
ARの各ドメイン、ORF9のKAS及びMCTの各ド
メイン、並びにORF10のKASドメインは、種々微
生物PKSのそれぞれのドメインとよく似ている(相同
性の最高値はそれぞれ、35%、26%、26%、23
%、21%、32%)。ORF8のKAS、MCT及び
KARの各ドメイン、並びにORF10のKASドメイ
ンは、脊椎動物のFAS Iのそれぞれのドメインとよ
く似ている(相同性はそれぞれ、27%、25%、28
%、25%)。ORF8のKAS−MCTドメイン、O
RF9のKAS−MCTドメイン、及びORF10のK
ASドメインは、ノストック・パンクチフォルメ(Nost
oc punctiforme)(Campbell, E.L., et al. (1998) A
rch. Microbiol. 167: 251-258.)とアナベナ(Anabaen
a sp.(GenBank登録番号U13677))のHglのそれぞれ
のドメインとよく似ている(前者との相同性はそれぞ
れ、51%、21%、36%であり、後者との相同性は
それぞれ、30%、29%、47%である)。ORF1
1も枯草菌(Bacillus subtilis)のポリケタイド合成
に関与していると報告されているpksE(Kunst, F.,
et al. (1997) Nature. 390: 249-256.)とよく似てい
る(相同性46%)。
Furthermore, the KAS, MCT, and KAR domains of ORF8, the KAS and MCT domains of ORF9, and the KAS domain of ORF10 closely resemble the corresponding domains of PKSs from various microorganisms (the highest homology values are 35%, 26%, 26%, 23%, 21%, and 32%, respectively). The KAS, MCT, and KAR domains of ORF8 and the KAS domain of ORF10 closely resemble the corresponding domains of vertebrate FAS I (homologies of 27%, 25%, 28%, and 25%, respectively). The KAS-MCT domain of ORF8, the KAS-MCT domain of ORF9, and the KAS domain of ORF10 closely resemble the corresponding domains of Hgl from Nostoc punctiforme (Campbell, E.L., et al. (1998) Arch. Microbiol. 167: 251-258) and from Anabaena sp. (GenBank accession number U13677); the homologies are 51%, 21%, and 36%, respectively, with the former and 30%, 29%, and 47%, respectively, with the latter. ORF11 also closely resembles pksE, which is reported to be involved in polyketide synthesis in Bacillus subtilis (Kunst, F., et al. (1997) Nature 390: 249-256) (46% homology).
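
The text reports these homology values without stating how they were calculated. One common convention, and the one assumed in the sketch below, is percent identity over the columns of a pairwise alignment, with gapped columns counted in the denominator. The function name and the two toy aligned peptides are hypothetical and serve only to show the arithmetic.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Columns with a gap in either row count toward the alignment length but
    never count as identical. Other conventions use a different denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )
    return 100.0 * identical / len(aligned_a)


if __name__ == "__main__":
    # Toy alignment for illustration only (not from the patent).
    row_a = "MKT-AYL"
    row_b = "MRTQAYI"
    print(f"{percent_identity(row_a, row_b):.1f}% identity")
```

Because the denominator differs between tools and conventions, identity values from different sources are only roughly comparable.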

【0054】このように、これらのORFにコードされ
るタンパク質は、PKS、FASI、Hglのように多
機能酵素として機能していると推測される。これらのP
KS、FAS I、Hglといった酵素は、どれも脂肪酸
やポリケタイドのような長鎖不飽和化合物の生合成に関
わっており、ORF8、9、10及び11にコードされ
るタンパク質がDHAの合成に関与している可能性は高
いと考えられる。
Thus, the proteins encoded by these ORFs are presumed to function as multifunctional enzymes, like PKS, FAS I, and Hgl. Enzymes such as PKS, FAS I, and Hgl are all involved in the biosynthesis of long-chain unsaturated compounds such as fatty acids and polyketides, so the proteins encoded by ORFs 8, 9, 10, and 11 are considered highly likely to be involved in DHA synthesis.

【0055】このようなドメイン構造は、EPA合成酵
素群遺伝子にもみられ、本発明で得た遺伝子は、それと
非常によく似た構造をとっている。その結果を図4に示
す。図4において、矢印は、ORFの大きさと方向を示
す。DHA合成酵素遺伝子群(A)とEPA合成酵素群
遺伝子(B)とのドメイン構造の大きな違いとして、A
CPドメインの繰り返し数があげられる。EPA合成酵
素群遺伝子のORF5では6回であるのに対して、DH
A合成酵素遺伝子群のORF8では5回しかない。ま
た、EPA合成酵素群遺伝子のORF6にはみられなか
ったKASドメインが、DHA合成酵素遺伝子群のOR
F9には存在する。さらに、EPA合成酵素群遺伝子の
ORF7にはKASドメインは1ヶ所しかないが、DH
A合成酵素遺伝子群のORF10には2ヶ所存在する。
このようなEPA合成酵素群遺伝子とDHA合成酵素遺
伝子群のドメイン構造の違いが、最終産物の違いをもた
らしている可能性が高いと考えられる。
Such a domain structure is also found in the EPA synthase group genes, and the genes obtained in the present invention have a very similar structure. The comparison is shown in FIG. 4, where arrows indicate the size and direction of each ORF. One major difference in domain structure between the DHA synthase gene group (A) and the EPA synthase group genes (B) is the number of repeats of the ACP domain: six in ORF5 of the EPA synthase group genes but only five in ORF8 of the DHA synthase gene group. In addition, a KAS domain that is absent from ORF6 of the EPA synthase group genes is present in ORF9 of the DHA synthase gene group. Furthermore, ORF7 of the EPA synthase group genes has only one KAS domain, whereas ORF10 of the DHA synthase gene group has two. These differences in domain structure between the EPA synthase group genes and the DHA synthase gene group are considered likely to account for the difference in final product.
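
The domain differences described above can be tabulated programmatically. The sketch below records only the domain counts that the text states explicitly for the three ORF pairs that differ, not the full domain complement of each ORF. The dictionary layout and function name are illustrative choices.

```python
# Domain counts stated in the text for the ORF pairs that differ.
DOMAIN_COUNTS = {
    "EPA": {"ORF5": {"ACP": 6}, "ORF6": {"KAS": 0}, "ORF7": {"KAS": 1}},
    "DHA": {"ORF8": {"ACP": 5}, "ORF9": {"KAS": 1}, "ORF10": {"KAS": 2}},
}

# Corresponding ORFs: (EPA synthase group gene, DHA synthase gene group).
ORF_PAIRS = [("ORF5", "ORF8"), ("ORF6", "ORF9"), ("ORF7", "ORF10")]


def domain_differences(counts, pairs):
    """List (domain, epa_orf, epa_count, dha_orf, dha_count) where counts differ."""
    diffs = []
    for epa_orf, dha_orf in pairs:
        epa = counts["EPA"][epa_orf]
        dha = counts["DHA"][dha_orf]
        for domain in sorted(set(epa) | set(dha)):
            e, d = epa.get(domain, 0), dha.get(domain, 0)
            if e != d:
                diffs.append((domain, epa_orf, e, dha_orf, d))
    return diffs


if __name__ == "__main__":
    for domain, epa_orf, e, dha_orf, d in domain_differences(
        DOMAIN_COUNTS, ORF_PAIRS
    ):
        print(f"{domain}: EPA {epa_orf} x{e} vs DHA {dha_orf} x{d}")
```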

【0056】[0056]

【発明の効果】本発明によれば、DHA生産細菌に由来
する、配列番号3、5、7及び9に示すアミノ酸配列を
コードする新規のイコサペンタエン酸合成酵素群類似遺
伝子及びそれから翻訳されるアミノ酸配列を有するポリ
ペプチドが提供される。本発明によれば、確立された遺
伝子発現システムによって該遺伝子を発現させることに
より、DHA生産性の向上を目指す研究や、DHA生合
成能を持たない生物へのDHA生合成能を賦与するこ
と、更にはDHAの有利な製造法を確立することができ
るようになる。更に、本発明から得られる遺伝子情報
は、高度不飽和脂肪酸生産細菌から高度不飽和脂肪酸合
成に関与する遺伝子の検出に有用なプライマー及び/又
はプローブの設計に利用することができる。
[Effects of the Invention] According to the present invention, there are provided novel genes, derived from a DHA-producing bacterium, that resemble the icosapentaenoic acid synthase group genes and encode the amino acid sequences shown in SEQ ID NOs: 3, 5, 7, and 9, together with polypeptides having the amino acid sequences translated from them. By expressing these genes in an established gene expression system, the present invention enables research aimed at improving DHA productivity, the conferral of DHA biosynthesis ability on organisms that lack it, and ultimately the establishment of an advantageous method for producing DHA. Furthermore, the genetic information obtained from the present invention can be used to design primers and/or probes useful for detecting genes involved in polyunsaturated fatty acid synthesis in polyunsaturated fatty acid-producing bacteria.

【0057】一般に、有用物質(例えば、DHA)を生
産する野生株の有用物質の生産性は低いことが多いが、
本発明の遺伝子を利用して、微生物におけるDHAの生
産性の向上を図ることによって、それらの微生物を工業
的に利用することが可能となる。
In general, wild-type strains that produce a useful substance (for example, DHA) often produce it with low productivity. By using the genes of the present invention to improve DHA productivity in microorganisms, those microorganisms can be put to industrial use.

【0058】[0058]

【配列表】 SEQUENCE LISTING <110> Director-General of Agency of Industrial Science and Technology <120> Gene from Docosahexaenoic Acid Producing Bacteria <130> P99-0665 <160> 11 <170> PatentIn Ver. 2.0 <210> 1 <211> 41587 <212> DNA <213> Moritella marina <400> 1 gatcactctg ctgcatggcg agagctgttt aattacaggt tgaaaaaaac gatgtaatgc 60 acttaattgc ttgctgttct taatgcctga ggcgtcgaag ataataccgt tgaagcgatc 120 tgttttagcg atagcattaa ggctaatagg tgtcgcgact aaagacgttt gattaaattc 180 aatattaaga tcggctaacg ctgacgtgtt attaggataa gaaatcgtga cttcagcatc 240 tttaaatgtg ttaagaatgg gtttaattaa tttgctgttg ctggctgcgc cgatgagtaa 300 gttgccagag atgagatcgg ttccctgatc gtagcgtgtt aacgtaaccg gtcgtggcag 360 attaagcgct ttaaataaac ctgatgtcca cttgccatta gcgagttttg cgtatgtatc 420 cgtcattttc taatccttgt tatagtgaac agtttgaatc tcgaagatgt acatgtgtta 480 aaaattatct gatagctatg acttatctgc cactacgtaa taataaatag accagttcat 540 tacatcgtta atcgatatag tataactaaa tactaagtaa attataatga taagactgtt 600 atcgtactcg gatcaaactc tgatcagcaa ataatcaaat tagagttttt attttaaact 660 tgtatcaaca atgttacatt aatgtatctt acgtctaatg tgctacgggc atatttaagt 720 cactaaatta aaggaataaa ccatgacagg tcaaacaata agaagagtag caattatcgg 780 cggtaaccgt atcccgtttg cacgttcaaa tacagcgtat tcaaaactaa gtaaccaaga 840 tatgctgacg gaaactatcc gtggcttggt ggttaaatat aacctacgtg gtgaacaact 900 gggggaagtt gttgctggtg cggtaattaa gcattctcgt gattttaact taacacgtga 960 agccgtgcta agtgcaggtc ttgcacctga aacgccttgt tatgacattc aacaagcttg 1020 tggtactggt ctagctgcag ctatccaagt agcaaacaaa attgcgcttg gtcaaataga 1080 agcgggtatt gctggtggtt ctgatacgac atcagatgca ccgattgcag tcagtgaagg 1140 catgcgtagt gtattacttg agcttaatcg agctaaaacg ggtaagcaac gtttgaaagc 1200 actatctcgt ctacgtctaa aacactttgc gccactaacg cctgcaaata aagagccgcg 1260 taccaaaatg gcgatgggcg atcattgtca agtaacagcg aaagagtgga atatctcacg 1320 tgaagcacaa gatgcattgg cctgcgcaag tcatcaaaaa ttagctgcag catatgaaga 1380 aggtttcttt gatacgttag tttcacctat ggccggctta acgaaagata acgtattacg 1440 cgcagataca acagttgaga aactggctaa attgaaacct tgttttgata 
aagtaaacgg 1500 cactatgacg gcgggtaaca gtactaacct taccgatgga gcatcagctg tattacttgc 1560 aagtgaagaa tgggcagcgg cacataactt accagtacaa gcttatctaa catttggtga 1620 aacggccgct atcgacttcg ttgataagaa agaaggtctg ttaatggcgc ctgcatacgc 1680 agtgccaaaa atgttgaagc gtgctggcct tacattacaa gacttcgatt actatgaaat 1740 acatgaagca tttgctgcgc agttattagc aacgctagca gcttgggaag acgaaaaatt 1800 ctgtaaagaa aaactgggtc tagatgctgc gcttggttca attgatatga ccaagttaaa 1860 cgtgaaaggg agtagcttag ccacgggtca cccatttgcc gcaactggtg gtcgtgttgt 1920 cgctacgcta gcgcaattac ttgatcagaa aggttcaggt cgtggtttga tctcgatttg 1980 tgctgctggt ggtcaaggta tcacggcaat tttagagaaa taaacgcact gtttattatc 2040 tattgattaa gctgtcctga gatactggat atttttaaat aaaacgccaa tactgcagag 2100 tattggcgtt tttttgtaat accaattcct atataacggt gcattttaaa cacttaattt 2160 ccggcattgg tatcataaaa aagcagcacc gaagtgctgc ttgattgtag attaacctat 2220 taaaatagag aggctagaat tagtcttcgt atgcttcatt atgtacgcca gctgcacgac 2280 ccgatggatc agcattgttt tggaaacttt catcccaagc taatgcttct acagttgaac 2340 aagcaacgga tttaccaaac ggtacgcatt tcgctgctga atcacctggg aagtgatctt 2400 caaagatggc acgatagtag taaccttctt tcgtatctgg tgtgttaatt gggaacttaa 2460 atgctgcact tgctaacatt tgatcagtta ccgcttcttc aacgtgtact ttaagttggt 2520 caatccaaga ataaccaaca ccatcagaga attgttcttt ttgacgccat acaatttctt 2580 caggtagtaa atcttcaaat gcttctcgaa tgatgttttt ctcaatgcgg tcgcccgtga 2640 tcatttttag ttcagggttt agacgcattg acgcatcaac aaattcttta tctaagaaag 2700 gaacacgtgc ttcgatgccc caagctgcca tagatttgtt tgcacgtaag caatcaaaca 2760 tatgtaattt atttacttta cgtaccgtct cttcatggaa ttctttcgca tttggcgctt 2820 tgtggaagta caagtaacca ccgaacagtt catcagcacc ttcaccagaa agcaccatct 2880 taatccccat ggctttaatt ttacgtgcca ttaggtacat aggggttgat gcacgaattg 2940 ttgttacatc gtaggtttca atgtggtaaa tcacgtcgcg taaagcgtcg ataccttctt 3000 gcacagtaaa ttcaattgaa tgatggatag tacctaagtg atctgccact ttttgtgcag 3060 cggctaaatc tggagaacca tttaggccta cagagaaaga gtgtagttgt ggccaccatg 3120 cttcggtttt accaccgtct tcaatacgac gttttgcata ctgttgggtg attgctgaaa 
3180 taacagatga atctaacccg cctgataata atacgccgta aggtacatca cacattaatt 3240 gacgtttaac tgcatcttcc aaaccttgct taacaacgct tttatcacca ccattttgtg 3300 caacgttatc aaaatctttc caatcacgtt gataataagg cgtgactaca ccatccttac 3360 tccacaggta atgacctgct gggaattctt caatttgagt acaaattggc actagtgctt 3420 tcatttcaga ggcaacataa aagttaccgt gttcatcata gcccgtataa agagggatga 3480 taccgatatg gtcacggcca atcaggtaag cgtcctctgt ttcgtcatat aaagcgaaag 3540 caaaaatacc atttagatca tctaaaaatt gtgtgccttt ttctttatat agcgcaagta 3600 tcacttcgca atctgattct gtttggaatt caaagtctac gttcagcgtt ttctttaaat 3660 ctttgtggtt ataaatttca ccattaacag caagtacgtg tgtcttttct tcattatata 3720 gcggctgtgc accattattt acatcgacaa tagcaagacg ttcatgaact aaaatagcat 3780 tgtcacttgt atagatacct gaccaatctg ggccgcggtg acgtagtaac tttgatagtt 3840 ctagtgcttg ttcgcgaaga ggtttaatgt ctgatttgat gtctagaatt ccgaatattg 3900 agcacataac taattccttc tggggctgcg tctgcagcta actttctaaa tagtgtgtct 3960 aatttgccac attgtagatt taatgcaaac attaatgata aaacatttat aaaaaatgta 4020 attcaatgtg gaatcgataa tttaatggct taaaagtgaa gatccattaa ttgtgatggc 4080 gaggtgatag accaatgtag accttaatga ataaagcagg cacgattgaa tccattcaac 4140 gcaaagtggt actaactatt gttttaaacg ttataaatag tgttttaaag gttataagta 4200 aataatttaa aaacaataat aatccacatg cattaaattt atcatgataa accgctatat 4260 ctcaatggca atttgggata agtgtaaaat atatgtaaaa tgaatgagtt gacttgcttt 4320 ttttacacta agtgatgaaa ttaaagctag atgtcgttgt tagcattgat taataacgta 4380 ctaaaatacg acatctagta tagaaattta aaaaacagtt ggttttgata gcataactgc 4440 ataaactaat cagcttattg tctgtaatat ttttgtaatt taaataggtt taataaaatt 4500 atatgtctga taaatataaa ccgtacgacc tttcctttaa aaagacgttt ttgctgccta 4560 agttttggcc tgtgtggttc ggggtgtttg caatatactt attagctttt atgccagtaa 4620 agccgcgtga taaatttgct cgattcatag cgaagaaatt gtttagtcta aaaatgatgg 4680 caaagcgtaa aaaggtagca aagatcaatt tatctatgtg cttccctgaa atggatgata 4740 cggaacaaga ccgtataatc atggtcaatc tagttacttt ttgtcaaact atcttaagtt 4800 atgcagagcc aagtgcgcgt agtcgtgctt ataaccgtga ccgtatgata gtgcatggtg 4860 
gcgagaattt atttccgcta cttgaacaag gtaaggcttg tatcttatta gtgccgcata 4920 gcttcgctat tgattttgca ggtttacaca ttgcttctta tggcgcgcca ttttgtacta 4980 tgtttaacaa ttctgagaat gagttgttcg attggctgat gacacgtcaa cgcgctatgt 5040 ttggaggcac tgtttatcac cgcaaggcag ggctaggggc tctagttaaa tcacttaaga 5100 gcggtgaaag ctgttattac ttacctgatg aagaccatgg acctaagcgt agtgtatttg 5160 cgcctttatt tgcgactcaa aaagcaactt tacctgtaat gggcaagcta gcagaaaaaa 5220 caaatgcact cgttgttcct gtttatgcgg catataatga atcactaggt aaatttgaaa 5280 cctttattcg accagcaatg caaaactttc catcagaaag cccagaacaa gatgcagtga 5340 tgatgaataa agagattgaa gccttgattg aatgtggtgt tgatcaatat atgtggacac 5400 ttagattatt gagaacacgt ccggacggta aaaaaatcta ctaataaagt ttaataaaca 5460 ccataatctt cgttgaatat ggtgtttacc cccctgaata ccctctaaat taataacaaa 5520 aaaagccatt tacgtaacat ctaatgatga tttagcctgc acttgctttg tttttagtct 5580 taagagccta ataaacttga tctaggtata gattctgtct ttctttacgt aacgcgatct 5640 atttttttta accgatagtt gttataatta gtttcatatg aaagagatat cgtttcagta 5700 aaagctattt cgtttcaata gataatttat ttatagtcat attttctgta atgacaatca 5760 ttttctcatc tagactatag ataagaatac gaattaagta agaacattaa ttttacaaga 5820 atataaaata tcccatcgga gctataagaa tgaaaaagac taaaattgtt tgtacaattg 5880 gtccaaaaac tgaatcagta gagaaactaa cagagcttgt taatgcaggc atgaacgtta 5940 tgcgtttaaa tttctctcat ggtaactttg ctgaacattc agtgcgtatt caaaatatcc 6000 gtcaagtaag tgaaaacctg aataagaaaa ttgctgtttt actggatact aaaggtccag 6060 aaatccgtac gattaaacta gaaaacggtg acgatgtaat gttgaccgct ggtcagtcat 6120 tcacgtttac aacagacatt aacgtggtag gtaataaaga ctgtgttgct gtaacatatg 6180 ctggttttgc taaagacctt aatcctggtg caatcatcct tgttgatgat ggtttaattg 6240 aaatggaagt tgttgcaaca actgacactg aagttaaatg tacagtatta aatactggtg 6300 cacttggtga aaataaaggc gttaacttac ctaacatcag tgtaggtcta cctgcattgt 6360 cagaaaaaga taaagctgat ttagcgtttg gttgtgagca agaagttgat tttgttgctg 6420 catcatttat tcgtaaggct gatgatgtaa gagaaattcg tgaaatccta tttaataatg 6480 gtggcgaaaa cattcagatt atctcgaaaa ttgaaaacca agaaggtgta gacaatttcg 6540 atgaaatctt 
agctgaatca gacggtatca tggttgctcg tggcgatctc ggtgttgaga 6600 tcccagttga agaagtgatc atggcacaga agatgatgat caaaaaatgt aataaagcag 6660 gtaaagttgt aattactgca acacaaatgc ttgattcaat gatcagtaac ccacgtccaa 6720 cacgtgcaga agcgggcgat gttgccaatg ctgtgcttga cggtaccgac gcggtaatgc 6780 tttctggtga aactgcgaaa ggtaaatacc cagttgaagc tgtgtctatc atggcaaaca 6840 tctgtgaacg tactgataac tcaatgtctt cggatttagg tgcgaacatt gttgctaaaa 6900 gcatgcgcat tacagaagct gtgtgtaaag gtgcggtaga aacaacagaa aaattgtgtg 6960 ctccacttat tgttgttgca actcgtggcg gtaaatcagc aaaatctgtt cgtaaatact 7020 tcccgaaagc aaatattctt gctatcacaa caaatgaaaa agcagcgcaa cagttatgcc 7080 taactaaagg cgtaagcagc tgcatcgttg agcagattga tagcactgat gagttctacc 7140 gtaaaggtaa agagcttgca ttagcaactg gtttagctaa agaaggcgat atcgttgtta 7200 tggtatcagg tgcgttagta ccatcaggta caacgaatac ggcatctgtt caccaacttt 7260 aagttgccat attgatatta taaaaaagag agcgtatgct ctcttttttt atatctgtag 7320 tttatatgtc tgtacaaaaa aatgataaag agtacataaa ctattaatat agcgtaatat 7380 ataatgatta acggtgatga aagggttaaa taaatggata gtgctaaaca taaaattggc 7440 ttagtccttt ctggcggtgg tgcgaaaggt attgctcatc ttggtgtatt aaaatacctg 7500 ttagagcaag atataagacc gaatgtaatt gcgggtacaa gtgctggctc tatggttggt 7560 gcactttatt gctcaggact tgagattgat gacattttac aattcttcat cgatgtaaaa 7620 cctttttctt ggaagtttac ccgtgcccgt gctggcttta tagacccggc aaaattatat 7680 cctgaagtgc taaaatatat ccccgaggat agctttgagt accttcaacc tgaattgcgc 7740 attgttgcca ccaacatgtt actcggtaaa gagcatatat ttaaagatgg ctccgtgatt 7800 aatgccttat tagcatcagc cagctaccct ttagtttttt ctccgatgat cattgacgat 7860 caagtgtatt cagatggcgg tattgttaat catttccccg tgagtgtcat tgaagatgat 7920 tgcgataaaa taatcggcgt atacgtgtcg cccattcgtc aggtcgaagc tgacgaactc 7980 tcgagtataa aagacgtggt attacgtgcg ttcacgctgc agggtagtgg tgctgaatta 8040 gataaactat cgcaatgtga tgtgcaaatt tatccagaag cgctattgaa ttacaatacg 8100 tttgcaaccg atgaaaaatc attacgggag atctaccaga ttggttatga tgctgcaaaa 8160 gatcaacatg acaaccttat ggcattgaaa gaaagtatca ccaccagcga ggttaaaaag 8220 aacgtcttta gcaaatggtt 
tggtgataaa cttgctagca acagcggcaa atagcggccc 8280 acacggattt atacactagg ataatgggcg ttaatagcct cactgtcgtt gtgtggtctc 8340 taattttagc taaatcttgt gttatactga cttcctatta atcataaacg atttatcacg 8400 gtaaacatga ctcaaataaa taacccgctt cacggcatga cactcgaaaa agtaattaac 8460 agtctcgttg aacaatatgg ctgggatggt cttggatact acatcaacat tcgttgcttt 8520 actgaaaatc caagtgttaa gtctagtctt aaatttttac gtaaaacccc ttgggcacgt 8580 gataaagtag aagcgctata tatcaaaatg gtgactgaag gctaactgtc tccacgctag 8640 cgaaccgctg tttatagtta atataagtac tataagcagg gctcgttaat tcagtatgta 8700 attaatcctg aataccttcc gcttatttca acattgtact ctctagataa cactctcaac 8760 attacacctt caacatcaca gcctccacat aacatccgat gacatagccc tgttattttt 8820 cacatttatc tatatgctat atattttagc catttgatca attgagttaa tttctgcaat 8880 gacaaagata taccatcatc cagtacaaat ttattatgaa gataccgacc attctggtgt 8940 tgtttaccac cctaactttt taaaatactt tgaacgtgca cgtgagcatg tgataaatag 9000 tgacttacta gcaacattgt ggaatgaacg cggtttaggt tttgcggtgt ataaagccaa 9060 tatgactttt caggatgggg tcgaatttgc tgaagtgtgt gatattcgca cttcttttgt 9120 cctagacggt aagtacaaaa cgatctggcg ccaagaagta tggcgtccga atgcgactag 9180 ggctgccgtt atcggtgata ttgaaatggt gtgcttagac aaacaaaaac gtttacagcc 9240 catccctgat gatgtgttag ctgcaatggt tagtgaataa atggttcatg cataaatagt 9300 taatacatga ttctggcccg tcacgtttac agataagagg catccgatgc ctccttccta 9360 ttaccaatac tactgcttat ccctttctaa ctatctttag cgtccataac acactgagca 9420 tttattctat taatcagtga ttgtgattta attatcttct atatatgtaa tttaatgtaa 9480 ttttcaattt atttttagct acattaaggc ttacgaatgt acgctaaaat gagatgtcag 9540 actaatttta gcttattaat ctgttagccg tttatatttt ataaagatgg gatttaactt 9600 aaatgcaatt aattatggcg taaatagagt gaaaacatgg ctaatattca ctaagtcctg 9660 aattttatat aaagtttaat ctgttatttt agcgtttacc tggtcttatc agtgaggttt 9720 atagccatta ttagtgggat tgaagtgatt tttaaagcta tgtatattat tgcaaatata 9780 aattgtaaca attaagactt tggacacttg agttcaattt cgaattgatt ggcataaaat 9840 ttaaaacagc taaatctacc tcaatcattt tagcaaatgt atgcaggtag atttttttcg 9900 ccatttaaga gtacacttgt acgctaggtt 
tttgtttagt gtgcaaatga acgttttgat 9960 gagcattgtt tttagagcac aaaatagatc cttacaggag caataacgca atggctaaaa 10020 agaacaccac atcgattaag cacgccaagg atgtgttaag tagtgatgat caacagttaa 10080 attctcgctt gcaagaatgt ccgattgcca tcattggtat ggcatcggtt tttgcagatg 10140 ctaaaaactt ggatcaattc tgggataaca tcgttgactc tgtggacgct attattgatg 10200 tgcctagcga tcgctggaac attgacgacc attactcggc tgataaaaaa gcagctgaca 10260 agacatactg caaacgcggt ggtttcattc cagagcttga ttttgatccg atggagtttg 10320 gtttaccgcc aaatatcctc gagttaactg acatcgctca attgttgtca ttaattgttg 10380 ctcgtgatgt attaagtgat gctggcattg gtagtgatta tgaccatgat aaaattggta 10440 tcacgctggg tgtcggtggt ggtcagaaac aaatttcgcc attaacgtcg cgcctacaag 10500 gcccggtatt agaaaaagta ttaaaagcct caggcattga tgaagatgat cgcgctatga 10560 tcatcgacaa atttaaaaaa gcctacatcg gctgggaaga gaactcattc ccaggcatgc 10620 taggtaacgt tattgctggt cgtatcgcca atcgttttga ttttggtggt actaactgtg 10680 tggttgatgc ggcatgcgct ggctcccttg cagctgttaa aatggcgatc tcagacttac 10740 ttgaatatcg ttcagaagtc atgatatcgg gtggtgtatg ttgtgataac tcgccattca 10800 tgtatatgtc attctcgaaa acaccagcat ttaccaccaa tgatgatatc cgtccgtttg 10860 atgacgattc aaaaggcatg ctggttggtg aaggtattgg catgatggcg tttaaacgtc 10920 ttgaagatgc tgaacgtgac ggcgacaaaa tttattctgt actgaaaggt atcggtacat 10980 cttcagatgg tcgtttcaaa tctatttacg ctccacgccc agatggccaa gcaaaagcgc 11040 taaaacgtgc ttatgaagat gccggttttg cccctgaaac atgtggtcta attgaaggcc 11100 atggtacggg taccaaagcg ggtgatgccg cagaatttgc tggcttgacc aaacactttg 11160 gcgccgccag tgatgaaaag caatatatcg ccttaggctt agttaaatcg caaattggtc 11220 atactaaatc tgcggctggc tctgcgggta tgattaaggc ggcattagcg ctgcatcata 11280 aaatcttacc tgcaacgatc catatcgata aaccaagtga agccttggat atcaaaaaca 11340 gcccgttata cctaaacagc gaaacgcgtc cttggatgcc acgtgaagat ggtattccac 11400 gtcgtgcagg tatcagctca tttggttttg gcggcaccaa cttccatatt attttagaag 11460 agtatcgccc aggtcacgat agcgcatatc gcttaaactc agtgagccaa actgtgttga 11520 tctcggcaaa cgaccaacaa ggtattgttg ctgagttaaa taactggcgt actaaactgg 11580 ctgtcgatgc 
tgatcatcaa gggtttgtat ttaatgagtt agtgacaacg tggccattaa 11640 aaaccccatc cgttaaccaa gctcgtttag gttttgttgc gcgtaatgca aatgaagcga 11700 tcgcgatgat tgatacggca ttgaaacaat tcaatgcgaa cgcagataaa atgacatggt 11760 cagtacctac cggggtttac tatcgtcaag ccggtattga tgcaacaggt aaagtggttg 11820 cgctattctc agggcaaggt tcgcaatacg tgaacatggg tcgtgaatta acctgtaact 11880 tcccaagcat gatgcacagt gctgcggcga tggataaaga gttcagtgcc gctggtttag 11940 gccagttatc tgcagttact ttccctatcc ctgtttatac ggatgccgag cgtaagctac 12000 aagaagagca attacgttta acgcaacatg cgcaaccagc gattggtagt ttgagtgttg 12060 gtctgttcaa aacgtttaag caagcaggtt ttaaagctga ttttgctgcc ggtcatagtt 12120 tcggtgagtt aaccgcatta tgggctgccg atgtattgag cgaaagcgat tacatgatgt 12180 tagcgcgtag tcgtggtcaa gcaatggctg cgccagagca acaagatttt gatgcaggta 12240 agatggccgc tgttgttggt gatccaaagc aagtcgctgt gatcattgat acccttgatg 12300 atgtctctat tgctaacttc aactcgaata accaagttgt tattgctggt actacggagc 12360 aggttgctgt agcggttaca accttaggta atgctggttt caaagttgtg ccactgccgg 12420 tatctgctgc gttccataca cctttagttc gtcacgcgca aaaaccattt gctaaagcgg 12480 ttgatagcgc taaatttaaa gcgccaagca ttccagtgtt tgctaatggc acaggcttgg 12540 tgcattcaag caaaccgaat gacattaaga aaaacctgaa aaaccacatg ctggaatctg 12600 ttcatttcaa tcaagaaatt gacaacatct atgctgatgg tggccgcgta tttatcgaat 12660 ttggtccaaa gaatgtatta actaaattgg ttgaaaacat tctcactgaa aaatctgatg 12720 tgactgctat cgcggttaat gctaatccta aacaacctgc ggacgtacaa atgcgccaag 12780 ctgcgctgca aatggcagtg cttggtgtcg cattagacaa tattgacccg tacgacgccg 12840 ttaagcgtcc acttgttgcg ccgaaagcat caccaatgtt gatgaagtta tctgcagcgt 12900 cttatgttag tccgaaaacg aagaaagcgt ttgctgatgc attgactgat ggctggactg 12960 ttaagcaagc gaaagctgta cctgctgttg tgtcacaacc acaagtgatt gaaaagatcg 13020 ttgaagttga aaagatagtt gaacgcattg tcgaagtaga gcgtattgtc gaagtagaaa 13080 aaatcgtcta cgttaatgct gacggttcgc ttatatcgca aaataatcaa gacgttaaca 13140 gcgctgttgt tagcaacgtg actaatagct cagtgactca tagcagtgat gctgaccttg 13200 ttgcctctat tgaacgcagt gttggtcaat ttgttgcaca ccaacagcaa ttattaaatg 
13260 tacatgaaca gtttatgcaa ggtccacaag actacgcgaa aacagtgcag aacgtacttg 13320 ctgcgcagac gagcaatgaa ttaccggaaa gtttagaccg tacattgtct atgtataacg 13380 agttccaatc agaaacgcta cgtgtacatg aaacgtacct gaacaatcag acgagcaaca 13440 tgaacaccat gcttactggt gctgaagctg atgtgctagc aaccccaata actcaggtag 13500 tgaatacagc cgttgccact agtcacaagg tagttgctcc agttattgct aatacagtga 13560 cgaatgttgt atctagtgtc agtaataacg cggcggttgc agtgcaaact gtggcattag 13620 cgcctacgca agaaatcgct ccaacagtcg ctactacgcc agcacccgca ttggttgcta 13680 tcgtggctga acctgtgatt gttgcgcatg ttgctacaga agttgcacca attacaccat 13740 cagttacacc agttgtcgca actcaagcgg ctatcgatgt agcaactatt aacaaagtaa 13800 tgttagaagt tgttgctgat aaaaccggtt atccaacgga tatgctggaa ctgagcatgg 13860 acatggaagc tgacttaggt atcgactcaa tcaaacgtgt tgagatatta ggcgcagtac 13920 aggaattgat ccctgactta cctgaactta atcctgaaga tcttgctgag ctacgcacgc 13980 ttggtgagat tgtcgattac atgaattcaa aagcccaggc tgtagctcct acaacagtac 14040 ctgtaacaag tgcacctgtt tcgcctgcat ctgctggtat tgatttagcc cacatccaaa 14100 acgtaatgtt agaagtggtt gcagacaaaa ccggttaccc aacagacatg ctagaactga 14160 gcatggatat ggaagctgac ttaggtattg attcaatcaa gcgtgtggaa atcttaggtg 14220 cagtacagga gatcataact gatttacctg agctaaaccc tgaagatctt gttgaattac 14280 gcaccctagg tgaaatcgtt agttacatgc aaagcaaagc gccagtcgct gaaagtgcgc 14340 cagtggcgac ggctcctgta gcaacaagct cagcaccgtc tatcgatttg aaccacattc 14400 aaacagtgat gatggatgta gttgcagata agactggtta tccaactgac atgctagaac 14460 ttggcatgga catggaagct gatttaggta tcgattcaat caaacgtgtg gaaatattag 14520 gcgcagtgca ggagatcatc actgatttac ctgagctaaa cccagaagac ctcgctgaat 14580 tacgcacgct aggtgaaatc gttagttaca tgcaaagcaa agcgccagtc gctgagagtg 14640 cgccagtagc gacggcttct gtagcaacaa gctctgcacc gtctatcgat ttaaaccata 14700 tccaaacagt gatgatggaa gtggttgcag acaaaaccgg ttatccagta gacatgttag 14760 aacttgctat ggacatggaa gctgacctag gtatcgattc aatcaagcgt gtagaaattt 14820 taggtgcggt acaggaaatc attactgact tacctgagct taaccctgaa gatcttgctg 14880 aactacgtac attaggtgaa atcgttagtt acatgcaaag 
caaagcgccc gtagctgaag 14940 cgcctgcagt acctgttgca gtagaaagtg cacctactag tgtaacaagc tcagcaccgt 15000 ctatcgattt agaccacatc caaaatgtaa tgatggatgt tgttgctgat aagactggtt 15060 atcctgccaa tatgcttgaa ttagcaatgg acatggaagc cgaccttggt attgattcaa 15120 tcaagcgtgt tgaaattcta ggcgcggtac aggagatcat tactgattta cctgaactaa 15180 acccagaaga cttagctgaa ctacgtacgt tagaagaaat tgtaacctac atgcaaagca 15240 aggcgagtgg tgttactgta aatgtagtgg ctagccctga aaataatgct gtatcagatg 15300 catttatgca aagcaatgtg gcgactatca cagcggccgc agaacataag gcggaattta 15360 aaccggcgcc gagcgcaacc gttgctatct ctcgtctaag ctctatcagt aaaataagcc 15420 aagattgtaa aggtgctaac gccttaatcg tagctgatgg cactgataat gctgtgttac 15480 ttgcagacca cctattgcaa actggctgga atgtaactgc attgcaacca acttgggtag 15540 ctgtaacaac gacgaaagca tttaataagt cagtgaacct ggtgacttta aatggcgttg 15600 atgaaactga aatcaacaac attattactg ctaacgcaca attggatgca gttatctatc 15660 tgcacgcaag tagcgaaatt aatgctatcg aatacccaca agcatctaag caaggcctga 15720 tgttagcctt cttattagcg aaattgagta aagtaactca agccgctaaa gtgcgtggcg 15780 cctttatgat tgttactcag cagggtggtt cattaggttt tgatgatatc gattctgcta 15840 caagtcatga tgtgaaaaca gacctagtac aaagcggctt aaacggttta gttaagacac 15900 tgtctcacga gtgggataac gtattctgtc gtgcggttga tattgcttcg tcattaacgg 15960 ctgaacaagt tgcaagcctt gttagtgatg aactacttga tgctaacact gtattaacag 16020 aagtgggtta tcaacaagct ggtaaaggcc ttgaacgtat cacgttaact ggtgtggcta 16080 ctgacagcta tgcattaaca gctggcaata acatcgatgc taactcggta tttttagtga 16140 gtggtggcgc aaaaggtgta actgcacatt gtgttgctcg tatagctaaa gaatatcagt 16200 ctaagttcat cttattggga cgttcaacgt tctcaagtga cgaaccgagc tgggcaagtg 16260 gtattactga tgaagcggcg ttaaagaaag cagcgatgca gtctttgatt acagcaggtg 16320 ataaaccaac acccgttaag atcgtacagc taatcaaacc aatccaagct aatcgtgaaa 16380 ttgcgcaaac cttgtctgca attaccgctg ctggtggcca agctgaatat gtttctgcag 16440 atgtaactaa tgcagcaagc gtacaaatgg cagtcgctcc agctatcgct aagttcggtg 16500 caatcactgg catcattcat ggcgcgggtg tgttagctga ccaattcatt gagcaaaaaa 16560 cactgagtga ttttgagtct 
gtttacagca ctaaaattga cggtttgtta tcgctactat 16620 cagtcactga agcaagcaac atcaagcaat tggtattgtt ctcgtcagcg gctggtttct 16680 acggtaaccc cggccagtct gattactcga ttgccaatga gatcttaaat aaaaccgcat 16740 accgctttaa atcattgcac ccacaagctc aagtattgag ctttaactgg ggtccttggg 16800 acggtggcat ggtaacgcct gagcttaaac gtatgtttga ccaacgtggt gtttacatta 16860 ttccacttga tgcaggtgca cagttattgc tgaatgaact agccgctaat gataaccgtt 16920 gtccacaaat cctcgtgggt aatgacttat ctaaagatgc tagctctgat caaaagtctg 16980 atgaaaagag tactgctgta aaaaagccac aagttagtcg tttatcagat gctttagtaa 17040 ctaaaagtat caaagcgact aacagtagct ctttatcaaa caagactagt gctttatcag 17100 acagtagtgc ttttcaggtt aacgaaaacc actttttagc tgaccacatg atcaaaggca 17160 atcaggtatt accaacggta tgcgcgattg cttggatgag tgatgcagca aaagcgactt 17220 atagtaaccg agactgtgca ttgaagtatg tcggtttcga agactataaa ttgtttaaag 17280 gtgtggtttt tgatggcaat gaggcggcgg attaccaaat ccaattgtcg cctgtgacaa 17340 gggcgtcaga acaggattct gaagtccgta ttgccgcaaa gatctttagc ctgaaaagtg 17400 acggtaaacc tgtgtttcat tatgcagcga caatattgtt agcaactcag ccacttaatg 17460 ctgtgaaggt agaacttccg acattgacag aaagtgttga tagcaacaat aaagtaactg 17520 atgaagcaca agcgttatac agcaatggca ccttgttcca cggtgaaagt ctgcagggca 17580 ttaagcagat attaagttgt gacgacaagg gcctgctatt ggcttgtcag ataaccgatg 17640 ttgcaacagc taagcaggga tccttcccgt tagctgacaa caatatcttt gccaatgatt 17700 tggtttatca ggctatgttg gtctgggtgc gcaaacaatt tggtttaggt agcttacctt 17760 cggtgacaac ggcttggact gtgtatcgtg aagtggttgt agatgaagta ttttatctgc 17820 aacttaatgt tgttgagcat gatctattgg gttcacgcgg cagtaaagcc cgttgtgata 17880 ttcaattgat tgctgctgat atgcaattac ttgccgaagt gaaatcagcg caagtcagtg 17940 tcagtgacat tttgaacgat atgtcatgat cgagtaaata ataacgatag gcgtcatggt 18000 gagcatggcg tctgctttct tcatttttta acattaacaa tattaatagc taaacgcggt 18060 tgctttaaac caagtaaaca agtgctttta gctattacta ttccaaacag gatattaaag 18120 agaatatgac ggaattagct gttattggta tggatgctaa atttagcgga caagacaata 18180 ttgaccgtgt ggaacgcgct ttctatgaag gtgcttatgt aggtaatgtt agccgcgtta 18240 
gtaccgaatc taatgttatt agcaatggcg aagaacaagt tattactgcc atgacagttc 18300 ttaactctgt cagtctacta gcgcaaacga atcagttaaa tatagctgat atcgcggtgt 18360 tgctgattgc tgatgtaaaa agtgctgatg atcagcttgt agtccaaatt gcatcagcaa 18420 ttgaaaaaca gtgtgcgagt tgtgttgtta ttgctgattt aggccaagca ttaaatcaag 18480 tagctgattt agttaataac caagactgtc ctgtggctgt aattggcatg aataactcgg 18540 ttaatttatc tcgtcatgat cttgaatctg taactgcaac aatcagcttt gatgaaacct 18600 tcaatggtta taacaatgta gctgggttcg cgagtttact tatcgcttca actgcgtttg 18660 ccaatgctaa gcaatgttat atatacgcca acattaaggg cttcgctcaa tcgggcgtaa 18720 atgctcaatt taacgttgga aacattagcg atactgcaaa gaccgcattg cagcaagcta 18780 gcataactgc agagcaggtt ggtttgttag aagtgtcagc agtcgctgat tcggcaatcg 18840 cattgtctga aagccaaggt ttaatgtctg cttatcatca tacgcaaact ttgcatactg 18900 cattaagcag tgcccgtagt gtgactggtg aaggcgggtg tttttcacag gtcgcaggtt 18960 tattgaaatg tgtaattggt ttacatcaac gttatattcc ggcgattaaa gattggcaac 19020 aaccgagtga caatcaaatg tcacggtggc ggaattcacc attctatatg cctgtagatg 19080 ctcgaccttg gttcccacat gctgatggct ctgcacacat tgccgcttat agttgtgtga 19140 ctgctgacag ctattgtcat attcttttac aagaaaacgt cttacaagaa cttgttttga 19200 aagaaacagt cttgcaagat aatgacttaa ctgaaagcaa gcttcagact cttgaacaaa 19260 acaatccagt agctgatctg cgcactaatg gttactttgc atcgagcgag ttagcattaa 19320 tcatagtaca aggtaatgac gaagcacaat tacgctgtga attagaaact attacagggc 19380 agttaagtac tactggcata agtactatca gtattaaaca gatcgcagca gactgttatg 19440 cccgtaatga tactaacaaa gcctatagcg cagtgcttat tgccgagact gctgaagagt 19500 taagcaaaga aataaccttg gcgtttgctg gtatcgctag cgtgtttaat gaagatgcta 19560 aagaatggaa aaccccgaag ggcagttatt ttaccgcgca gcctgcaaat aaacaggctg 19620 ctaacagcac acagaatggt gtcaccttca tgtacccagg tattggtgct acatatgttg 19680 gtttagggcg tgatctattt catctattcc cacagattta tcagcctgta gcggctttag 19740 ccgatgacat tggcgaaagt ctaaaagata ctttacttaa tccacgcagt attagtcgtc 19800 atagctttaa agaactcaag cagttggatc tggacctgcg cggtaactta gccaatatcg 19860 ctgaagccgg tgtgggtttt gcttgtgtgt ttaccaaggt atttgaagaa 
gtctttgccg 19920 ttaaagctga ctttgctaca ggttatagca tgggtgaagt aagcatgtat gcagcactag 19980 gctgctggca gcaaccggga ttgatgagtg ctcgccttgc acaatcgaat acctttaatc 20040 atcaactttg cggcgagtta agaacactac gtcagcattg gggcatggat gatgtagcta 20100 acggtacgtt cgagcagatc tgggaaacct ataccattaa ggcaacgatt gaacaggtcg 20160 aaattgcctc tgcagatgaa gatcgtgtgt attgcaccat tatcaataca cctgatagct 20220 tgttgttagc cggttatcca gaagcctgtc agcgagtcat taagaattta ggtgtgcgtg 20280 caatggcatt gaatatggcg aacgcaattc acagcgcgcc agcttatgcc gaatacgatc 20340 atatggttga gctataccat atggatgtta ctccacgtat taataccaag atgtattcaa 20400 gctcatgtta tttaccgatt ccacaacgca gcaaagcgat ttcccacagt attgctaaat 20460 gtttgtgtga tgtggtggat ttcccacgtt tggttaatac cttacatgac aaaggtgcgc 20520 gggtattcat tgaaatgggt ccaggtcgtt cgttatgtag ctgggtagat aagatcttag 20580 ttaatggcga tggcgataat aaaaagcaaa gccaacatgt atctgttcct gtgaatgcca 20640 aaggcaccag tgatgaactt acttatattc gtgcgattgc taagttaatt agtcatggcg 20700 tgaatttgaa tttagatagc tagtttaacg ggtcaatcct ggttaaagca ggccatatag 20760 caaacacgaa caaatagtca acatcgatat ctagcgctgg tgagttatac ctcattagtt 20820 gaaatatgga tttaaagaga gtaattatgg aaaatattgc agtagtaggt attgctaatt 20880 tgttcccggg ctcacaagca ccggatcaat tttggcagca attgcttgaa caacaagatt 20940 gccgcagtaa ggcgaccgct gttcaaatgg gcgttgatcc tgctaaatat accgccaaca 21000 aaggtgacac agataaattt tactgtgtgc acggcggtta catcagtgat ttcaattttg 21060 atgcttcagg ttatcaactc gataatgatt atttagccgg tttagatgac cttaatcaat 21120 gggggcttta tgttacgaaa caagccctta ccgatgcggg ttattggggc agtactgcac 21180 tagaaaactg tggtgtgatt ttaggtaatt tgtcattccc aactaaatca tctaatcagc 21240 tgtttatgcc tttgtatcat caagttgttg ataatgcctt aaaggcggta ttacatcctg 21300 attttcaatt aacgcattac acagcaccga aaaaaacaca tgctgacaat gcattagtag 21360 caggttatcc agctgcattg atcgcgcaag cggcgggtct tggtggttca cattttgcac 21420 tggatgcggc ttgtgcttca tcttgttata gcgttaagtt agcgtgtgat tacctgcata 21480 cgggtaaagc caacatgatg cttgctggtg cggtatctgc agcagatcct atgttcgtaa 21540 atatgggttt ctcgatattc caagcttacc 
cagctaacaa tgtacatgcc ccgtttgacc 21600 aaaattcaca aggtctattt gccggtgaag gcgcgggcat gatggtattg aaacgtcaaa 21660 gtgatgcagt acgtgatggt gatcatattt acgccattat taaaggcggc gcattatcga 21720 atgacggtaa aggcgagttt gtattaagcc cgaacaccaa gggccaagta ttagtatatg 21780 aacgtgctta tgccgatgca gatgttgacc cgagtacagt tgactatatt gaatgtcatg 21840 caacgggcac acctaagggt gacaatgttg aattgcgttc gatggaaacc tttttcagtc 21900 gcgtaaataa caaaccatta ctgggctcgg ttaaatctaa ccttggtcat ttgttaactg 21960 ccgctggtat gcctggcatg accaaagcta tgttagcgct aggtaaaggt cttattcctg 22020 caacgattaa cttaaagcaa ccactgcaat ctaaaaacgg ttactttact ggcgagcaaa 22080 tgccaacgac gactgtgtct tggccaacaa ctccgggtgc caaggcagat aaaccgcgta 22140 ccgcaggtgt gagcgtattt ggttttggtg gcagcaacgc ccatttggta ttacaacagc 22200 caacgcaaac actcgagact aattttagtg ttgctaaacc acgtgagcct ttggctatta 22260 ttggtatgga cagccatttt ggtagtgcca gtaatttagc gcagttcaaa accttattaa 22320 ataataatca aaataccttc cgtgaattac cagaacaacg ctggaaaggc atggaaagta 22380 acgctaacgt catgcagtcg ttacaattac gcaaagcgcc taaaggcagt tacgttgaac 22440 agctagatat tgatttcttg cgttttaaag taccgcctaa tgaaaaagat tgcttgatcc 22500 cgcaacagtt aatgatgatg caagtggcag acaatgctgc gaaagacgga ggtctagttg 22560 aaggtcgtaa tgttgcggta ttagtagcga tgggcatgga actggaatta catcagtatc 22620 gtggtcgcgt taatctaacc acccaaattg aagacagctt attacagcaa ggtattaacc 22680 tgactgttga gcaacgtgaa gaactgacca atattgctaa agacggtgtt gcctcggctg 22740 cacagctaaa tcagtatacg agtttcattg gtaatattat ggcgtcacgt atttcggcgt 22800 tatgggattt ttctggtcct gctattaccg tatcggctga agaaaactct gtttatcgtt 22860 gtgttgaatt agctgaaaat ctatttcaaa ccagtgatgt tgaagccgtt attattgctg 22920 ctgttgattt gtctggttca attgaaaaca ttactttacg tcagcactac ggtccagtta 22980 atgaaaaggg atctgtaagt gaatgtggtc cggttaatga aagcagttca gtaaccaaca 23040 atattcttga tcagcaacaa tggctggtgg gtgaaggcgc agcggctatt gtcgttaaac 23100 cgtcatcgca agtcactgct gaacaagttt atgcgcgtat tgatgcggtg agttttgccc 23160 ctggtagcaa tgcgaaagca attacgattg cagcggataa agcattaaca cttgctggta 23220 tcagtgctgc 
tgatgtagct agtgttgaag cacatgcaag tggttttagt gccgaaaata 23280 atgctgaaaa aaccgcgtta ccgactttat acccaagcgc aagtatcagt tcggtgaaag 23340 ccaatattgg tcatacgttt aatgcctcgg gtatggcgag tattattaaa acggcgctgc 23400 tgttagatca gaatacgagt caagatcaga aaagcaaaca tattgctatt aacggtctag 23460 gtcgtgataa cagctgcgcg catcttatct tatcgagttc agcgcaagcg catcaagttg 23520 caccagcgcc tgtatctggt atggccaagc aacgcccaca gttagttaaa accatcaaac 23580 tcggtggtca gttaattagc aacgcgattg ttaacagtgc gagttcatct ttacacgcta 23640 ttaaagcgca gtttgccggt aagcacttaa acaaagttaa ccagccagtg atgatggata 23700 acctgaagcc ccaaggtatt agcgctcatg caaccaatga gtatgtggtg actggagctg 23760 ctaacactca agcttctaac attcaagcat ctcatgttca agcgtcaagt catgcacaag 23820 agatagcacc aaaccaagtt caaaatatgc aagctacagc agccgctgta agttcacccc 23880 tttctcaaca tcaacacaca gcgcagcccg tagcggcacc gagcgttgtt ggagtgactg 23940 tgaaacataa agcaagtaac caaattcatc agcaagcgtc tacgcataaa gcatttttag 24000 aaagtcgttt agctgcacag aaaaacctat cgcaacttgt tgaattgcaa accaagctgt 24060 caatccaaac tggtagtgac aatacatcta acaatactgc gtcaacaagc aatacagtgc 24120 taacaaatcc tgtatcagca acgccattaa cacttgtgta taatgcgcct gtagtagcga 24180 caaacctaac cagtacagaa gcaaaagcgc aagcagctgc tacacaagct ggttttcaga 24240 taaaaggacc tgttggttac aactatccac cgctgcagtt aattgaacgt tataataaac 24300 cagaaaacgt gatttacgat caagctgatt tggttgaatt cgctgaaggt gatattggta 24360 aggtatttgg tgctgaatac aatattattg atggctattc gcgtcgtgta cgtctgccaa 24420 cctcagatta cttgttagta acacgtgtta ctgaacttga tgccaaggtg catgaataca 24480 agaaatcata catgtgtact gaatatgatg tgcctgttga tgcaccgttc ttaattgatg 24540 gtcagatccc ttggtctgtt gccgtcgaat caggccagtg tgatttgatg ttgatttcat 24600 atatcggtat tgatttccaa gcgaaaggcg aacgtgttta ccgtttactt gattgtgaat 24660 taactttcct tgaagagatg gcttttggtg gcgatacttt acgttacgag atccacattg 24720 attcgtatgc acgtaacggc gagcaattat tattcttctt ccattacgat tgttacgtag 24780 gggataagaa ggtacttatc atgcgtaatg gttgtgctgg tttctttact gacgaagaac 24840 tttctgatgg taaaggcgtt attcataacg acaaagacaa agctgagttt agcaatgctg 
24900 ttaaatcatc attcacgccg ttattacaac ataaccgtgg tcaatacgat tataacgaca 24960 tgatgaagtt ggttaatggt gatgttgcca gttgttttgg tccgcaatat gatcaaggtg 25020 gccgtaatcc atcattgaaa ttctcgtctg agaagttctt gatgattgaa cgtattacca 25080 agatagaccc aaccggtggt cattggggac taggcctgtt agaaggtcag aaagatttag 25140 accctgagca ttggtatttc ccttgtcact ttaaaggtga tcaagtaatg gctggttcgt 25200 tgatgtcgga aggttgtggc caaatggcga tgttcttcat gctgtctctt ggtatgcata 25260 ccaatgtgaa caacgctcgt ttccaaccac taccaggtga atcacaaacg gtacgttgtc 25320 gtgggcaagt actgccacag cgcaatacct taacttaccg tatggaagtt actgcgatgg 25380 gtatgcatcc acagccattc atgaaagcta atattgatat tttgcttgac ggtaaagtgg 25440 ttgttgattt caaaaacttg agcgtgatga tcagcgaaca agatgagcat tcagattacc 25500 ctgtaacact gccgagtaat gtggcgctta aagcgattac tgcacctgtt gcgtcagtag 25560 caccagcatc ttcacccgct aacagcgcgg atctagacga acgtggtgtt gaaccgttta 25620 agtttcctga acgtccgtta atgcgtgttg agtcagactt gtctgcaccg aaaagcaaag 25680 gtgtgacacc gattaagcat tttgaagcgc ctgctgttgc tggtcatcat agagtgccta 25740 accaagcacc gtttacacct tggcatatgt ttgagtttgc gacgggtaat atttctaact 25800 gtttcggtcc tgattttgat gtttatgaag gtcgtattcc acctcgtaca ccttgtggcg 25860 atttacaagt tgttactcag gttgtagaag tgcagggcga acgtcttgat cttaaaaatc 25920 catcaagctg tgtagctgaa tactatgtac cggaagacgc ttggtacttt actaaaaaca 25980 gccatgaaaa ctggatgcct tattcattaa tcatggaaat tgcattgcaa ccaaatggct 26040 ttatttctgg ttacatgggc acgacgctta aataccctga aaaagatctg ttcttccgta 26100 accttgatgg tagcggcacg ttattaaagc agattgattt acgcggcaag accattgtga 26160 ataaatcagt cttggttagt acggctattg ctggtggcgc gattattcaa agtttcacgt 26220 ttgatatgtc tgtagatggc gagctatttt atactggtaa agctgtattt ggttacttta 26280 gtggtgaatc actgactaac caactgggca ttgataacgg taaaacgact aatgcgtggt 26340 ttgttgataa caataccccc gcagcgaata ttgatgtgtt tgatttaact aatcagtcat 26400 tggctctgta taaagcgcct gtggataaac cgcattataa attggctggt ggtcagatga 26460 actttatcga tacagtgtca gtggttgaag gcggtggtaa agcgggcgtg gcttatgttt 26520 atggcgaacg tacgattgat gctgatgatt ggttcttccg 
ttatcacttc caccaagatc 26580 cggtgatgcc aggttcatta ggtgttgaag ctattattga gttgatgcag acctatgcgc 26640 ttaaaaatga tttgggtggc aagtttgcta acccacgttt cattgcgccg atgacgcaag 26700 ttgattggaa ataccgtggg caaattacgc cgctgaataa acagatgtca ctggacgtgc 26760 atatcactga gatcgtgaat gacgctggtg aagtgcgaat cgttggtgat gcgaatctgt 26820 ctaaagatgg tctgcgtatt tatgaagtta aaaacatcgt tttaagtatt gttgaagcgt 26880 aaagggtcaa gtgtaacgtg cttaagcgcc gcattggtta aagacgcttt gcacgccgtg 26940 aatccgtcca tggaggcttg gggttggcat ccatgccaac aacagcaagc ttactttaat 27000 caatacggct tggtgtccat ttagacgcct cgaacttagt agttaataga caaaataatt 27060 tagctgtgga atgaatatag taagtaatca ttcggcagct acaaaaaagg aattaagaat 27120 gtcgagttta ggttttaaca ataacaacgc aattaactgg gcttggaaag tagatccagc 27180 gtcagttcat acacaagatg cagaaattaa agcagcttta atggatctaa ctaaacctct 27240 ctatgtggcg aataattcag gcgtaactgg tatagctaat catacgtcag tagcaggtgc 27300 gatcagcaat aacatcgatg ttgatgtatt ggcgtttgcg caaaagttaa acccagaaga 27360 tctgggtgat gatgcttaca agaaacagca cggcgttaaa tatgcttatc atggcggtgc 27420 gatggcaaat ggtattgcct cggttgaatt ggttgttgcg ttaggtaaag cagggctgtt 27480 atgttcattt ggtgctgcag gtctagtgcc tgatgcggtt gaagatgcaa ttcgtcgtat 27540 tcaagctgaa ttaccaaatg gcccttatgc ggttaacttg atccatgcac cagcagaaga 27600 agcattagag cgtggcgcgg ttgaacgttt cctaaaactt ggcgtcaaga cggtagaggc 27660 ttcagcttac cttggtttaa ctgaacacat tgtttggtat cgtgctgctg gtctaactaa 27720 aaacgcagat ggcagtgtta atatcggtaa caaggttatc gctaaagtat cgcgtaccga 27780 agttggtcgc cgctttatgg aacctgcacc gcaaaaatta ctggataagt tattagaaca 27840 aaataagatc acccctgaac aagctgcttt agcgttgctt gtacctatgg ctgatgatat 27900 tactggggaa gcggattctg gtggtcatac agataaccgt ccgtttttaa cattattacc 27960 gacgattatt ggtctgcgtg atgaagtgca agcgaagtat aacttctctc ctgcattacg 28020 tgttggtgct ggtggtggta tcggaacgcc tgaagcagca ctcgctgcat ttaacatggg 28080 cgcggcttat atcgttctgg gttctgtgaa tcaggcgtgt gttgaagcgg gtgcatctga 28140 atatactcgt aaactgttat cgacagttga aatggctgat gtgactatgg cacctgctgc 28200 agatatgttt gaaatgggtg 
tgaagctgca agtattaaaa cgcggttcta tgttcgcgat 28260 gcgtgcgaag aaactgtatg acttgtatgt ggcttatgac tcgattgaag atatcccagc 28320 tgctgaacgt gagaagattg aaaaacaaat cttccgtgca aacctagacg agatttggga 28380 tggcactatc gctttcttta ctgaacgcga tccagaaatg ctagcccgtg caacgagtag 28440 tcctaaacgt aaaatggcac ttatcttccg ttggtatctt ggcctttctt cacgctggtc 28500 aaacacaggc gagaagggac gtgaaatgga ttatcagatt tgggcaggcc caagtttagg 28560 tgcattcaac agctgggtga aaggttctta ccttgaagac tatacccgcc gtggcgctgt 28620 agatgttgct ttgcatatgc ttaaaggtgc tgcgtattta caacgtgtaa accagttgaa 28680 attgcaaggt gttagcttaa gtacagaatt ggcaagttat cgtacgagtg attaatgtta 28740 cttgatgata tgtgaattaa ttaaagcgcc tgagggcgct ttttttggtt tttaactcag 28800 gtgttgtaac tcgaaattgc ccctttcaag ttagatcgat tactcactca caatatgttg 28860 atatcgcact tgccatatac ttgctcatcc aaagccctat attgataatg gtgttaatag 28920 tctttaatat ccgagtcttt cttcagcata atactaatat agagactcga ccaatgttaa 28980 acacaacaaa gaatatattc ttgtgtactg ccttattatt aacgagtgcg agtacgacag 29040 ctactacgct aaacaattcg atatcagcaa ttgaacaacg tatttctggt cgtatcggtg 29100 tggctgtttt agatacgcaa aataaacaaa cgtgggctta caatggtgat gcacattttc 29160 cgatgatgag tacattcaaa accctcgctt gcgcgaaaat gctaagtgaa tcgacaaatg 29220 gtaatctgga tcccagtact agctcattga taaaggctga agaattaatc ccttggtcac 29280 cagtcactaa aacgtttgtg aataacacta ttacagtggc gaaagcgtgt gaagcaacaa 29340 tgctgaccag tgataatacc gcggctaata ttgttttaca gtatatcgga ggccctcaag 29400 gcgttactgc attcttgcga gaaattggtg atgaagagag tcagttagat cgtatagaac 29460 ctgaattgaa tgaagctaag gtcggagact tgcgtgatac cacgacaccg aaagccatag 29520 ttaccacgct caacaaacta ctacttggtg atgttctact tgatttggat aaaaaccaac 29580 ttaaaacatg gatgcaaaat aataaagtgt cagatccttt actgcgttct atattaccgc 29640 aaggctggtt tattgccgac cgctcaggtg cgggtggtaa tggttctcga ggtataactg 29700 ctatgctttg gcactccgag cgtcaaccgc taatcatcag tatttattta accgaaactg 29760 agttagcaat ggcaatgcgc aatgagatta ttgttgagat cggtaagctg atattcaaag 29820 aatacgcggt gaaataataa gttatttttt gataatactt taacgagcgt agctatcgaa 29880 
gtgagggcgt caattagaca cctttgcttc ccctacaaaa tctaatgtgt attacctcgg 29940 ctagtacaat tgccctaagt tatttctgtc cagctttggc ttagtgcaat tgcgttagcc 30000 aatgtgaaca ccaagggact ttgtcgtacc ataactacca agcgactttg tcgtttttat 30060 cttttcttag acaaacagag gttaaatgag tgacgccttc caaatcacag gaatgaatcc 30120 gcatttcaat aaaatctaac ccgtaccaac tccgtacaag ttgatcttta gttgtttaaa 30180 atctataata aattcaatta cggaattaat ccgtacaact ggaggtttta tggctactgc 30240 aagacttgat atccgtttgg atgaagaaat caaagctaag gctgagaaag catcagcttt 30300 actcggctta aaaagtttaa ccgaatacgt tgttcgctta atggacgaag attcaactaa 30360 agtagtttct gagcatgaga gtattaccgt tgaagcgaat gtattcgacc aatttatggc 30420 tgcttgtgat gaagcgaaag ccccaaataa agcattactt gaagccgctg tatttactca 30480 gaatggtgag tttaagtgag ttattccaaa cgtttcaaag aactggataa atcaaaacat 30540 gacagagcat catttgactg tggcgaaaaa gagctaaatg attttatcca aactcaagca 30600 gccaaacata tgcaagcagg tattagccgc actctggttt tacctgcttc tgcgccgtta 30660 ccaaacaaaa aatatccaat ttgctcattt tatagtatcg cgccaagctc aattagccgc 30720 gatacgttac cacaagcaat ggctaaaaag ttaccacgtt atcctatccc tgtttttctt 30780 ttggctcaac ttgccgtcca taaagagttt catgggagtg ggttaggcaa agttagctta 30840 attaaagcgt tagagtacct ttgggaaatt aactctcaca tgagagctta cgccatcgtt 30900 gttgattgtt taactgaaca agctgagtca ttctacgcta aatatggttt cgacgttctc 30960 tgcgaaataa atggtcgagt aagaatgttc atatcaatga aaacagtcaa tcagttattc 31020 acttaacagt aagagttagt ataacagttg tatgaattaa atttattata ttcggtaatc 31080 tcattgcgat cacgctagaa gtgcgagcgg gtcagaccga ggccacaata gcagccgtta 31140 cgtttagggg atgacttaaa aagataacta ctacgtcagt ggcgatccta gaggattaaa 31200 ggtttatgat tcacaacatt tatttattgt gcttaatttt ttctatccaa tatgcgcaag 31260 ctgtaaatat cactgaagta gacttttatg tcagtgatga tatccctaaa gatgttgcca 31320 aattaaagat aggtgaatcc ataacgaact ccagccttat tctaagtaac tcatctattc 31380 cactctcgcg ggagacgggt aacatatatt actcttcatc aattgctaac ttgaactatg 31440 actcgataga atttgttatg gctcaattga tggccgaaga ttccagcctt tacaagatgc 31500 tggtaaatag cgataggttg tccgtgctag taatgacatc ttcccagtcc 
acagtctcta 31560 tggctcgact tactcggctt attttcctaa tgttgcggtc atcgatttga attgtgactc 31620 gctaacttta gaacatgagc tcggccatct atacggagct gaacatgaag aaatatatga 31680 cgactatgtc ttctatgctg cgatatgtgg agactatacg actatcatga actctatgca 31740 gcctgaaatg aaagaaaaac aaatgataaa ggcatattca ttccctgaat taaaagtgga 31800 tggcttgcag tgcggaaatg aaaatacgaa taacaaaaag gttattttag acaatattgg 31860 tcggtttaga taggattggg atattattct cattcggctc tacttagtgc tgttattatg 31920 agtgccagtg cttctatcta cgatattggt cttaacaagt atttatctat agacgctaag 31980 gtgttatgta tttaagggat gttcaagatg aaactaggtg taaacgatgt atagttgtat 32040 aacatttttt caacggttgg aacgttcgat tctatcgggt aacaagaccg cgacgatccg 32100 cgataagtcc gatagtcatt acttagttgg tcagatgtta gatgcttgta ctcacgaaga 32160 taatcggaaa atgtgtcaaa tagaaatact gagcattgaa tatgtgacgt ttagtgaatt 32220 aaaccgtgcg cacgccaatg ctgaaggttt accgtttttg tttatgctta agtggatagt 32280 tcgaaagatt tatccgactt caaatgattt atttttcata agtttcagag ttgtaactat 32340 cgatatctta taagtcttag tgcacaaaac agaactattt atagcgctca agaaggcgat 32400 aatttgataa tgaattatcg ccttgttact attaagagac tttaaatgac tgagatataa 32460 gatatgacac ggaagaacat attgatcaca ggcgcaagtt cagggttggg ccgaggtatg 32520 gccatcgaat ttgcaaaatc aggtcataac ttagcacttt gtgcacgtag acttgataat 32580 ttagttgcac tgaaagcaga actcttagcc ctcaatcctc acatccaaat cgaaataaaa 32640 cctcttgatg tcaatgaaca tgaacaagtc ttcactgttt tccatgaatt caaagctgaa 32700 tttggtacgc ttgatcgtat tattgttaat gctggattag gcaagggtgg atccgtcggt 32760 acaggttttt tcaaagctaa tctgcaaact gcacaaacta attttattgc ggcgctcgca 32820 caatgtgaag cggcgctcga aatctttagg gcgcaaaatg ctgggcacct agtgacgatt 32880 tcttctatca gcgctgtacg aggattccgc cgtgcgttaa ctgtgtatgc agctactaaa 32940 tcggcactaa catcattaac tgaaggtatc aggattgacg tgatggatac gccaatcaaa 33000 gtgagttgta ttcatcctgg atttattcgc accgagatga atgaaaaagt aaaaacagca 33060 cctttcatga tagatgctga agcgggttgt aaagcgatag tgaaagcaat taataaagaa 33120 aaagcgaata gttatgtacc tagttaccct tgggctatta tgcacttatt actacgtgtg 33180 gcgccaacgc gtttgatccg cagaatgagt 
taatatcaca gacgcatcaa taaaatttta 33240 aggttctaga aatgatgaag tctcatgttt ggttcaaggc cggtgtagtc atcatatatg 33300 gctcatctat agatgcctct cctcatcgtc atcatgcaat tcaattagcg gcggtgttac 33360 ccaatcccaa gcgaatgtct gcagcaaccc cttcttctta tgtgctcagc cgtgcggcac 33420 aaatttaaga ctcggtgcga tcattaggcg gatctgttta cctgaaaaac ttataacaaa 33480 agctatcgac tgttgaattt atcctgaatg ctttaataga gtgggctggt ggcattacat 33540 gattggaaag ctgaaagaca agtcgttata tttgcaggca gtaaaattaa cactggtatg 33600 gatacttttg attctgtaaa gttcagagta tcagcccctt aacgagcttt ggtataaaca 33660 aatatgaata atcgacagcc taagaaaacc tcttcgacta tatcgacgct caacgaatta 33720 gcgacgttag caaactattc actcatggac acgctaaact gtgatcctga tgcgacagaa 33780 aacggcgacg atcacgcgcc gagacaagtc ctttacgggt cattatgttc ccgtaaaacc 33840 gactccaatc aaagaccctg aatatgtagc gcatagcaaa aatttatttt ctgaacttgg 33900 ctttgccgac agtatggctg agtccgctga ttttgtccgg atgttctctg gtgatatgtc 33960 aggggttcca gtaccaatgc gccaggtagg ttgggcgagt ggctatgcac tttccattta 34020 tggcaccgag tacacccaac agtgcccgtt ccaaactggt aacggatatg gagacggacg 34080 tgcaatttca gtgcttgaga ccctcatcaa gggtcaacgc tgggaaatgc agctgaaagg 34140 cggtggtcgt acaccatatt gccgcggcgc agacggtcgc gctgttttac ggtctagtat 34200 tcgcgagttc ttggctcaag atcacatgca tgcgctcggg gtacctacat cacggtcttt 34260 aagtctgtac gtttcaaaaa cggagacagt taagcgacct tggtactcac agggctcgcg 34320 ttcagagaat cccgacatgc ttatatctga agctgtcgct atctcgacgc gtgttgcacc 34380 gtcgttcatc cgtgttggtc aactcgaact tttcgcgcgc cgcagccgta gtaatgaaca 34440 cccgaaagcg atggaagaac tcgagaagat tgtgctgcac ttgatcgatc gtgaatacgc 34500 tgacgttatc gatacgcagc tagccactcc agaaaaaatc gtgttgctgg ctcgcgagtt 34560 tcgtggccgc cttacctcaa tggttgcgaa ttggatccgt gttggatttt gccaaggtaa 34620 ctttaacagt gataactgcg cagccggtgg ttttacactt gattatggtc cctttggttt 34680 ttgtgatgtg tttaatccgt attatcaacc ttggacgggg gggggtaatc acttctcgtt 34740 catgaaccaa ccaaatgcag cacaacgaaa tttcgatatg ttttgttcgg cgttacggcc 34800 gttactggta tctcatcagc aggatttgct cgcgtttgac gagatccaaa gtgaattttt 34860 agcagtaatg 
gatacgaaaa tgaaggcgat gtgggctact aaattgggtc ttattaattt 34920 gaagactgag tctgataaag cactgtgtaa cgtactcatc aaagagctac aaacactcat 34980 gatgcaagca cctgttgatt acactatttt cttccgcgaa ctatcctcaa ttcctgacga 35040 tattggccca ctgaagaaaa gtttttacag taatctatac aatgatgcag cggatgatcc 35100 agatacctta gcgttagaaa aatactggat tgagtggctc gaaaaatggc aaatgctcct 35160 taacagtact tgtgacgcga aaggtatctc gtcccgagcc agtgaggaca tcgctatgca 35220 gatgaaactc gtcaacccta aatacgtttt gcgagaatgg ttcgtgatgc cggcttatca 35280 gcaagccact gcgggtgatt attctctcat tcaagagctg caggccgtaa tgacacagcc 35340 atatgcagag cagtcgaagg agctagagga taaatactat cgattgaaac cgcttgagtt 35400 ctttgaggta ggtggattgt cccatcttag ttgctcgtcg tgaacgataa cgcgtcggta 35460 catgtgtatc gacgtatggg cgcttaattt ttattaatat tagaaacaaa aatcgccagc 35520 aaatgctggc gttttaaaga ttaatgtcaa ttattacatc atgcctatat cacgtaggag 35580 atgtggcgat aagcctttta attgaatatc taaagatttt tcttttttat cactaaataa 35640 aatgtcttta gtgtgtttaa tcagtccttt gatagaaaca gcataagctt ttgtatctaa 35700 agcttgtggg atcatattga tgtgcgctgc gtgtgccatt ttagcctcta tctgaattta 35760 ataatttatg ttttaaccag gtgatgtatt gctcatctgg tgaacatagt agcgcattaa 35820 ataaccatgc aataatgata aaaaataaca ctaagcatta gttttgataa tgcattcggc 35880 gctgtgtgac actgtttact gttttataga tattcattca ctttaattgc atataaattg 35940 aattgtttac tccaaatgta gttaaaataa gcacttgtta catcaatgca acaattatac 36000 gctgttaaaa tagccttgat ataccaatga taaataattc tgagtcttta atatttaaaa 36060 tagatgaatt taattcatta gatatactat tacgttgaat tgcgatttac atgcgcattt 36120 agtgtgtttt ttattaaatg aaaattattt tgacgatttt attaacatat ataagaaata 36180 tgtgacttag atctaagtaa acgttaattt atcgccgata aagcagtagt aagcatgttg 36240 catatcaaac cctctctata gatctcaact agcctcaatt atcatcaagt taactgtggt 36300 tttatttatt gctcgtgcgt tcagttatgc ttaaccatga gttaacttca ttctaatatt 36360 tttaacttac agtgaggggt atactctcgg ctcttagaaa tagagagcca aaacatgttt 36420 gaattcgtta ctaattcctc attgaaaaca cacctattgc ttatcaataa tggctatcaa 36480 tagtggttta ttgtttctta cgccacggct tatttttctg aaaatgtact aaatagataa 
36540 attatcaata aaaacacaca tcacattaac cgatgtaaac agggaacatc cccatgtatg 36600 aaaatgaaga aaaactaacg aaagcatttg ttattgccgc cataatttgg ggcgttatag 36660 gcatgtgcat gggtttaatg gcagctctgc agctatatct accgcaattg aattttgcta 36720 atgagtatat aaatttcggg aaaataagac ccttgcatac taacgccatc atttttgggt 36780 tggtttgtaa ctttattatc ggtctgtcgt tatacatagt ggcaaaaaca tcagtcgtga 36840 atctagtatc caaaggttta tcgtggttct tgttctgggg ttggcagata acattggtaa 36900 tcggccttat ctcaatcgct ttagggtata catcaaccaa agaatacgct gaatttgagt 36960 ggccaattga tatcgctatt gtggttctct ggttaacgtt tggatatatc ttttttggaa 37020 cgctagcgaa aagaaaaaca aagcatatat ttgtttcaaa ctggttcagt ggcggtgtca 37080 ttattgttat cggcttaatt tacttgataa acaatttagc cattcccgtg tatgcattta 37140 aaggttattc aatattttct ggtgcgagtg atgcgcttgt acagtggtgg tggggacata 37200 atgcagttgg cttcttattg acagctggct ttgtaggtac caactactat ttcattccca 37260 agttagttaa tagacccatt tattcatatc gactgtcttt aattactttt tggggtctaa 37320 tcggctttta tacttgggct ggtacacacc atttactctt tacatccgtt ccatcttgga 37380 ttcaaaatat tggcgtagtg atgtctattt tattatggat cccgtcatgg gctggcgcat 37440 ttaacgcttg gatgacgtgt acttccaata aagaagaatt gaaaacaaat cccgttgtct 37500 ggtttttctt atcgtcaatt gcctattacg cattagcaac gtttgaaggg cctcttatgg 37560 ctatcagatg gttcaatatg atagctcaca ataccagttg ggttatcgga cacgttcact 37620 ctggggcgtt aggttgggtt ggcatgacgt gtatagcaac cttctactat ttcattccta 37680 agctatacaa aaaagaactc tactcatatg gcttagttaa ggtgcatttt gtactcgctc 37740 acataggcgt actgttctac atagtctccc tgtggatagg gggtataggt caaggtgtta 37800 aatcgttaag cctcactgag tctggttctc tgacttattc gtttgttgat attttacgat 37860 ttatggaacc ttatatgctc ggacgtgcaa ttggcggggc gctgtttatc ttgggtatgt 37920 tagtgatggt atataacctc atcatgacgg tgaacaaacc acaaaaagta gttattgaag 37980 gagcatatta atggaagagt caatatccaa gtcagtaatg gcttttatca ctatcacgac 38040 agtcgtggtg ttattttcat tctttgtgtg ggttttccca gggttcttct tcaccaacga 38100 tcttaaagaa ataacgacag ctaaaccata cacagcctta gagttagctg gacgggatgt 38160 gtatatggct gaaggttgcg tggcatgcca tacccagatg 
gttagaaact tggaaccgga 38220 aagaaaaaga tacggtcgtc ctaataaaat ggaagatgat gtttatgagt ttaacttttt 38280 gtggagctca caaagaactg gccctgattt aacgaatatt ggtttgaagt acacacaagg 38340 ctggcacaaa cagcatctca tcaatcctca ggcagttgtt ccagcctcaa tcatgccaca 38400 atatccgtgg ctgtttgaaa agcaacttaa cgttggtcat gttattgctt caatgaaagc 38460 gatgaaaaaa ctaggtgtgc cgtatacaga cacgcaaatt gaaaattcat caagcaaagt 38520 ggaaggtaaa acaaaaggtg atgcgcttgt tgcttacttg atgagtcttg gcgtagatac 38580 gcgtgaaaaa ggtggggatt taaattaatg ggatccatga acatattatc aagcgtacta 38640 tcgattatct tcttttttat catggttgcc gttatttatt cacagttccg taagaccaaa 38700 actgcagaca gtaataaaac agtagagcaa tttgatggaa tagatgaaaa agatgcacca 38760 attcctaagg ttttctttgt tgcgtatctt attgcgttta taggcgcaat tgtttacgtc 38820 cttctatacc caagtttagc ttcttggaaa gggtttatcg gttggaccga gaacgatgac 38880 gcgtatgtag ctaaatcaat tgatataaac aataacatta acgcaataat caacgcgaat 38940 accgatgaac aagtctttac gctgttacaa aaagatccgc ttgttttgca gagtggtaaa 39000 tcgttatttg gtgataattg ttctgcttgt catggtcagg atgctaaggg gcaatataac 39060 tacccgagtt tagttgataa agattggtta tacggcggct cacctcaaga tgtctatacg 39120 accatacata atggacgtaa gggtaaaatg ccagcttgga aaggtgtact gagcggtaaa 39180 gacatagatg agcttaccca gtatgtgtct gagctaaata aaggaccatt taaaagcaat 39240 gcgcttttcg atgctaattg ttcatcatgt cacggtaaag aggctcaagg ttcacatagc 39300 gtaggagccc ctaacttaac gaatgatatc tggcttcatg gttcaaccaa tgctgatatc 39360 aaacgtaata ttgagaatgg catgtataac gaaatgcctg attttggtca acgccttagc 39420 agaaatcaaa tattgtcttt aacctcttat attgtgtccc tacagagtga accacaagat 39480 aatatcgata ttatgcaagc gaacacttat atcttctctc gaaacgaaca gcaattgccg 39540 gcagtgctaa cgacttgtgt ggcctgtcat ggcgcagatg gtcttggtac tttacctgga 39600 gcgcctaagt tagcaggatt aaagcaagcg tatatctata accaattaca cttgtttgta 39660 tctggtttaa gaaaaaatgc aacgatgcaa aatatagttg ccgacttaga tgtgaaagac 39720 aagttacttg ctgctagcta tttcagttca ctcgattcac cggcgataag taaaattacc 39780 ccagagaaat cagctgacgg tatcatcaaa gatcctactg agcgcctgat atttcaaggt 39840 gattggcaac gcgctattcc 
tgcttgttct acttgtcatg gtcaagaaac gcaaggtagc 39900 ccatcatttc caagattggc aggtcaatca tctgactatt tagagaaaca attatttgac 39960 tggcgaacag gcgatagaac cggtgatcaa ggtcatatga tgcaaaacgt cgttaacaag 40020 ctacaagatg atgaaattaa atccctgtcg aaatatttat caaaaatgaa ataacctgtg 40080 agccagttaa aggccaatag atcgaaggtt aacagctcaa agattaatag gatactgtaa 40140 ttatgaaaat gaataagtta agaagggaaa tcattaaagc tggtggctat gtcgctttag 40200 ctgctgcacc attaacggct ttctctaaag agtttatgaa atacggcaaa atgtattcag 40260 atggtgaggg agttagctat gccgatggcc ctaagcctgt attaagcaat tttccgcaaa 40320 aagataatgt tgtgatcgta catactcgac cacctcatct tgaaacgcct tttaatgtat 40380 tcaatgaagg gctaataaca ccaaacaacc gtttctttgt tcgttatcat ctagctgacg 40440 tccccgttgc catagacact gataagtaca ctattactat ttcaggggct gttaatgagg 40500 aagtgacatt aagcttggct gaattaaagt cgattgaagg ccaacaagaa attgtcgcgg 40560 tacaacagtg tactggtaat agtcgaggtt attcatctcc acgtgttttt ggtgcgcaat 40620 taagtaatgg cgctatgggg aatgcgaagt tcaaaggcgt gccacttaaa aatgtgttag 40680 ctaaagcggg aatttctagt gctgcgacaa gtgtcattat cgatggtttg gataagccgg 40740 ttcgagatac cacaccagac tttcaaaaat cattacctat tgatcatatt atgacgggcg 40800 aacctatgct tgtttgggaa atgaatggtg aacctttacc atttttaaat ggctttccag 40860 tgaaattaat cgttccgggt tggtatgcaa catattgggt taaacatgta tcgcacctta 40920 aagttataga gggtgagttt gataactttg atgcgttctt tatgacaact gcataccgtc 40980 tacctgataa cgattccaag agtgaattac caactgccag agcgaaaaag acgttacctg 41040 taaatcgttt cccaataaga agttttgtta ctagcttaga aaatggtgat gaagttaatg 41100 ctgcaactag tattgaaatt aaagggatag cttttgatag tggtagtggt atcaaaaaag 41160 ttgaagtttc agtcgatggt ggcaataagt ggatgcaagc agcgcttggt gaaaatcttg 41220 gtcgtttttc ctttcgaggt tggaagttaa gccataattt taatgaaaaa ggcagaacgc 41280 ttgtgatggt aagagctaca ggtaagagtg gagagacaca acctcttaat gcctcttgga 41340 atcatggcgg ttataaccga aacgcgattg aacgaacaag tattaaggtg gtttaaatgc 41400 ggtttttact tattatatta gcgctatgtt cattgactgt taaagctgag atcgtatcaa 41460 ttaccttacc tatggataat accaagctta agccgtcgac attaccagga tatggcctcg 41520 
cgcaatctaa atgtcacctt tgtcattcag tcgattacgt tatgtatcaa ccaccagaaa 41580 tggatcc 41587 <210> 2 <211> 7959 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(7956) <400> 2 atg gct aaa aag aac acc aca tcg att aag cac gcc aag gat gtg tta 48 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 agt agt gat gat caa cag tta aat tct cgc ttg caa gaa tgt ccg att 96 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 gcc atc att ggt atg gca tcg gtt ttt gca gat gct aaa aac ttg gat 144 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 caa ttc tgg gat aac atc gtt gac tct gtg gac gct att att gat gtg 192 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 cct agc gat cgc tgg aac att gac gac cat tac tcg gct gat aaa aaa 240 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 gca gct gac aag aca tac tgc aaa cgc ggt ggt ttc att cca gag ctt 288 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 gat ttt gat ccg atg gag ttt ggt tta ccg cca aat atc ctc gag tta 336 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 act gac atc gct caa ttg ttg tca tta att gtt gct cgt gat gta tta 384 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 agt gat gct ggc att ggt agt gat tat gac cat gat aaa att ggt atc 432 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 acg ctg ggt gtc ggt ggt ggt cag aaa caa att tcg cca tta acg tcg 480 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 cgc cta caa ggc ccg gta tta gaa aaa gta tta aaa gcc tca ggc att 528 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 gat gaa gat gat cgc gct atg atc atc gac aaa ttt aaa aaa gcc tac 576 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 atc ggc tgg gaa gag aac tca ttc cca ggc atg cta ggt aac gtt att 624 Ile Gly Trp Glu Glu Asn Ser Phe Pro 
Gly Met Leu Gly Asn Val Ile 195 200 205 gct ggt cgt atc gcc aat cgt ttt gat ttt ggt ggt act aac tgt gtg 672 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 gtt gat gcg gca tgc gct ggc tcc ctt gca gct gtt aaa atg gcg atc 720 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 tca gac tta ctt gaa tat cgt tca gaa gtc atg ata tcg ggt ggt gta 768 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 tgt tgt gat aac tcg cca ttc atg tat atg tca ttc tcg aaa aca cca 816 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 gca ttt acc acc aat gat gat atc cgt ccg ttt gat gac gat tca aaa 864 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 ggc atg ctg gtt ggt gaa ggt att ggc atg atg gcg ttt aaa cgt ctt 912 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 gaa gat gct gaa cgt gac ggc gac aaa att tat tct gta ctg aaa ggt 960 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 atc ggt aca tct tca gat ggt cgt ttc aaa tct att tac gct cca cgc 1008 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Ile Tyr Ala Pro Arg 325 330 335 cca gat ggc caa gca aaa gcg cta aaa cgt gct tat gaa gat gcc ggt 1056 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 ttt gcc cct gaa aca tgt ggt cta att gaa ggc cat ggt acg ggt acc 1104 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 aaa gcg ggt gat gcc gca gaa ttt gct ggc ttg acc aaa cac ttt ggc 1152 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 gcc gcc agt gat gaa aag caa tat atc gcc tta ggc tta gtt aaa tcg 1200 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 caa att ggt cat act aaa tct gcg gct ggc tct gcg ggt atg att aag 1248 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 gcg gca tta gcg ctg cat cat aaa atc tta cct gca acg atc cat atc 1296 
Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 gat aaa cca agt gaa gcc ttg gat atc aaa aac agc ccg tta tac cta 1344 Asp Lys Pro Ser Glu Ala Leu Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 aac agc gaa acg cgt cct tgg atg cca cgt gaa gat ggt att cca cgt 1392 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 cgt gca ggt atc agc tca ttt ggt ttt ggc ggc acc aac ttc cat att 1440 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 480 att tta gaa gag tat cgc cca ggt cac gat agc gca tat cgc tta aac 1488 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 tca gtg agc caa act gtg ttg atc tcg gca aac gac caa caa ggt att 1536 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 gtt gct gag tta aat aac tgg cgt act aaa ctg gct gtc gat gct gat 1584 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 cat caa ggg ttt gta ttt aat gag tta gtg aca acg tgg cca tta aaa 1632 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 acc cca tcc gtt aac caa gct cgt tta ggt ttt gtt gcg cgt aat gca 1680 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 aat gaa gcg atc gcg atg att gat acg gca ttg aaa caa ttc aat gcg 1728 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys Gln Phe Asn Ala 565 570 575 aac gca gat aaa atg aca tgg tca gta cct acc ggg gtt tac tat cgt 1776 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 caa gcc ggt att gat gca aca ggt aaa gtg gtt gcg cta ttc tca ggg 1824 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 caa ggt tcg caa tac gtg aac atg ggt cgt gaa tta acc tgt aac ttc 1872 Gln Gly Ser Gln Tyr Val Asn Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 cca agc atg atg cac agt gct gcg gcg atg gat aaa gag ttc agt gcc 1920 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 gct ggt tta ggc cag tta 
tct gca gtt act ttc cct atc cct gtt tat 1968 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 acg gat gcc gag cgt aag cta caa gaa gag caa tta cgt tta acg caa 2016 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 cat gcg caa cca gcg att ggt agt ttg agt gtt ggt ctg ttc aaa acg 2064 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 ttt aag caa gca ggt ttt aaa gct gat ttt gct gcc ggt cat agt ttc 2112 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 ggt gag tta acc gca tta tgg gct gcc gat gta ttg agc gaa agc gat 2160 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 tac atg atg tta gcg cgt agt cgt ggt caa gca atg gct gcg cca gag 2208 Tyr Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 caa caa gat ttt gat gca ggt aag atg gcc gct gtt gtt ggt gat cca 2256 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 aag caa gtc gct gtg atc att gat acc ctt gat gat gtc tct att gct 2304 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 aac ttc aac tcg aat aac caa gtt gtt att gct ggt act acg gag cag 2352 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 gtt gct gta gcg gtt aca acc tta ggt aat gct ggt ttc aaa gtt gtg 2400 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 cca ctg ccg gta tct gct gcg ttc cat aca cct tta gtt cgt cac gcg 2448 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 caa aaa cca ttt gct aaa gcg gtt gat agc gct aaa ttt aaa gcg cca 2496 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 agc att cca gtg ttt gct aat ggc aca ggc ttg gtg cat tca agc aaa 2544 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 ccg aat gac att aag aaa aac ctg aaa aac cac atg ctg gaa tct gtt 2592 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser 
Val 850 855 860 cat ttc aat caa gaa att gac aac atc tat gct gat ggt ggc cgc gta 2640 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 ttt atc gaa ttt ggt cca aag aat gta tta act aaa ttg gtt gaa aac 2688 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 att ctc act gaa aaa tct gat gtg act gct atc gcg gtt aat gct aat 2736 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 cct aaa caa cct gcg gac gta caa atg cgc caa gct gcg ctg caa atg 2784 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 gca gtg ctt ggt gtc gca tta gac aat att gac ccg tac gac gcc gtt 2832 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 aag cgt cca ctt gtt gcg ccg aaa gca tca cca atg ttg atg aag tta 2880 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 tct gca gcg tct tat gtt agt ccg aaa acg aag aaa gcg ttt gct gat 2928 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 gca ttg act gat ggc tgg act gtt aag caa gcg aaa gct gta cct gct 2976 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 980 985 990 gtt gtg tca caa cca caa gtg att gaa aag atc gtt gaa gtt gaa aag 3024 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 ata gtt gaa cgc att gtc gaa gta gag cgt att gtc gaa gta gaa aaa 3072 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 atc gtc tac gtt aat gct gac ggt tcg ctt ata tcg caa aat aat caa 3120 Ile Val Tyr Val Asn Ala Asp Gly Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 gac gtt aac agc gct gtt gtt agc aac gtg act aat agc tca gtg act 3168 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 cat agc agt gat gct gac ctt gtt gcc tct att gaa cgc agt gtt ggt 3216 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 caa ttt gtt gca cac caa cag caa tta tta aat gta cat gaa cag ttt 3264 
Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 atg caa ggt cca caa gac tac gcg aaa aca gtg cag aac gta ctt gct 3312 Met Gln Gly Pro Gln Asp Tyr Ala Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 gcg cag acg agc aat gaa tta ccg gaa agt tta gac cgt aca ttg tct 3360 Ala Gln Thr Ser Asn Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 atg tat aac gag ttc caa tca gaa acg cta cgt gta cat gaa acg tac 3408 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 ctg aac aat cag acg agc aac atg aac acc atg ctt act ggt gct gaa 3456 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 gct gat gtg cta gca acc cca ata act cag gta gtg aat aca gcc gtt 3504 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 gcc act agt cac aag gta gtt gct cca gtt att gct aat aca gtg acg 3552 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr 1170 1175 1180 aat gtt gta tct agt gtc agt aat aac gcg gcg gtt gca gtg caa act 3600 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 gtg gca tta gcg cct acg caa gaa atc gct cca aca gtc gct act acg 3648 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 cca gca ccc gca ttg gtt gct atc gtg gct gaa cct gtg att gtt gcg 3696 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 cat gtt gct aca gaa gtt gca cca att aca cca tca gtt aca cca gtt 3744 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 gtc gca act caa gcg gct atc gat gta gca act att aac aaa gta atg 3792 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 tta gaa gtt gtt gct gat aaa acc ggt tat cca acg gat atg ctg gaa 3840 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 ctg agc atg gac atg gaa gct gac tta ggt atc gac tca atc aaa cgt 3888 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys 
Arg 1285 1290 1295 gtt gag ata tta ggc gca gta cag gaa ttg atc cct gac tta cct gaa 3936 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 ctt aat cct gaa gat ctt gct gag cta cgc acg ctt ggt gag att gtc 3984 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 gat tac atg aat tca aaa gcc cag gct gta gct cct aca aca gta cct 4032 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 gta aca agt gca cct gtt tcg cct gca tct gct ggt att gat tta gcc 4080 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 cac atc caa aac gta atg tta gaa gtg gtt gca gac aaa acc ggt tac 4128 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 cca aca gac atg cta gaa ctg agc atg gat atg gaa gct gac tta ggt 4176 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 att gat tca atc aag cgt gtg gaa atc tta ggt gca gta cag gag atc 4224 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 ata act gat tta cct gag cta aac cct gaa gat ctt gtt gaa tta cgc 4272 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 acc cta ggt gaa atc gtt agt tac atg caa agc aaa gcg cca gtc gct 4320 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 gaa agt gcg cca gtg gcg acg gct cct gta gca aca agc tca gca ccg 4368 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 tct atc gat ttg aac cac att caa aca gtg atg atg gat gta gtt gca 4416 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 gat aag act ggt tat cca act gac atg cta gaa ctt ggc atg gac atg 4464 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 gaa gct gat tta ggt atc gat tca atc aaa cgt gtg gaa ata tta ggc 4512 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 gca gtg cag gag atc atc act gat tta cct gag 
cta aac cca gaa gac 4560 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 ctc gct gaa tta cgc acg cta ggt gaa atc gtt agt tac atg caa agc 4608 Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 aaa gcg cca gtc gct gag agt gcg cca gta gcg acg gct tct gta gca 4656 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 aca agc tct gca ccg tct atc gat tta aac cat atc caa aca gtg atg 4704 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 atg gaa gtg gtt gca gac aaa acc ggt tat cca gta gac atg tta gaa 4752 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 ctt gct atg gac atg gaa gct gac cta ggt atc gat tca atc aag cgt 4800 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 gta gaa att tta ggt gcg gta cag gaa atc att act gac tta cct gag 4848 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 ctt aac cct gaa gat ctt gct gaa cta cgt aca tta ggt gaa atc gtt 4896 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 agt tac atg caa agc aaa gcg ccc gta gct gaa gcg cct gca gta cct 4944 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 gtt gca gta gaa agt gca cct act agt gta aca agc tca gca ccg tct 4992 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 atc gat tta gac cac atc caa aat gta atg atg gat gtt gtt gct gat 5040 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 aag act ggt tat cct gcc aat atg ctt gaa tta gca atg gac atg gaa 5088 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 gcc gac ctt ggt att gat tca atc aag cgt gtt gaa att cta ggc gcg 5136 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 gta cag gag atc att act gat tta cct gaa cta aac cca gaa gac tta 5184 Val Gln Glu Ile Ile Thr Asp Leu Pro 
Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 gct gaa cta cgt acg tta gaa gaa att gta acc tac atg caa agc aag 5232 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 1730 1735 1740 gcg agt ggt gtt act gta aat gta gtg gct agc cct gaa aat aat gct 5280 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 gta tca gat gca ttt atg caa agc aat gtg gcg act atc aca gcg gcc 5328 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 gca gaa cat aag gcg gaa ttt aaa ccg gcg ccg agc gca acc gtt gct 5376 Ala Glu His Lys Ala Glu Phe Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 atc tct cgt cta agc tct atc agt aaa ata agc caa gat tgt aaa ggt 5424 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 gct aac gcc tta atc gta gct gat ggc act gat aat gct gtg tta ctt 5472 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 gca gac cac cta ttg caa act ggc tgg aat gta act gca ttg caa cca 5520 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 act tgg gta gct gta aca acg acg aaa gca ttt aat aag tca gtg aac 5568 Thr Trp Val Ala Val Thr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 ctg gtg act tta aat ggc gtt gat gaa act gaa atc aac aac att att 5616 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 act gct aac gca caa ttg gat gca gtt atc tat ctg cac gca agt agc 5664 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 gaa att aat gct atc gaa tac cca caa gca tct aag caa ggc ctg atg 5712 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 tta gcc ttc tta tta gcg aaa ttg agt aaa gta act caa gcc gct aaa 5760 Leu Ala Phe Leu Leu Ala Lys Leu Ser Lys Val Thr Gln Ala Ala Lys 1905 1910 1915 1920 gtg cgt ggc gcc ttt atg att gtt act cag cag ggt ggt tca tta ggt 5808 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 ttt gat gat atc 
gat tct gct aca agt cat gat gtg aaa aca gac cta 5856 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 gta caa agc ggc tta aac ggt tta gtt aag aca ctg tct cac gag tgg 5904 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 gat aac gta ttc tgt cgt gcg gtt gat att gct tcg tca tta acg gct 5952 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 gaa caa gtt gca agc ctt gtt agt gat gaa cta ctt gat gct aac act 6000 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 gta tta aca gaa gtg ggt tat caa caa gct ggt aaa ggc ctt gaa cgt 6048 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 atc acg tta act ggt gtg gct act gac agc tat gca tta aca gct ggc 6096 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 aat aac atc gat gct aac tcg gta ttt tta gtg agt ggt ggc gca aaa 6144 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 ggt gta act gca cat tgt gtt gct cgt ata gct aaa gaa tat cag tct 6192 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 aag ttc atc tta ttg gga cgt tca acg ttc tca agt gac gaa ccg agc 6240 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 tgg gca agt ggt att act gat gaa gcg gcg tta aag aaa gca gcg atg 6288 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 cag tct ttg att aca gca ggt gat aaa cca aca ccc gtt aag atc gta 6336 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 cag cta atc aaa cca atc caa gct aat cgt gaa att gcg caa acc ttg 6384 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 tct gca att acc gct gct ggt ggc caa gct gaa tat gtt tct gca gat 6432 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 gta act aat gca gca agc gta caa atg gca gtc gct cca gct atc gct 6480 Val Thr Asn 
Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 aag ttc ggt gca atc act ggc atc att cat ggc gcg ggt gtg tta gct 6528 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 gac caa ttc att gag caa aaa aca ctg agt gat ttt gag tct gtt tac 6576 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 agc act aaa att gac ggt ttg tta tcg cta cta tca gtc act gaa gca 6624 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 agc aac atc aag caa ttg gta ttg ttc tcg tca gcg gct ggt ttc tac 6672 Ser Asn Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 ggt aac ccc ggc cag tct gat tac tcg att gcc aat gag atc tta aat 6720 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 aaa acc gca tac cgc ttt aaa tca ttg cac cca caa gct caa gta ttg 6768 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 agc ttt aac tgg ggt cct tgg gac ggt ggc atg gta acg cct gag ctt 6816 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 aaa cgt atg ttt gac caa cgt ggt gtt tac att att cca ctt gat gca 6864 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 ggt gca cag tta ttg ctg aat gaa cta gcc gct aat gat aac cgt tgt 6912 Gly Ala Gln Leu Leu Leu Asn Glu Leu Ala Ala Asn Asp Asn Arg Cys 2290 2295 2300 cca caa atc ctc gtg ggt aat gac tta tct aaa gat gct agc tct gat 6960 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 caa aag tct gat gaa aag agt act gct gta aaa aag cca caa gtt agt 7008 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 cgt tta tca gat gct tta gta act aaa agt atc aaa gcg act aac agt 7056 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 agc tct tta tca aac aag act agt gct tta tca gac agt agt gct ttt 7104 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 
2360 2365 cag gtt aac gaa aac cac ttt tta gct gac cac atg atc aaa ggc aat 7152 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 cag gta tta cca acg gta tgc gcg att gct tgg atg agt gat gca gca 7200 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 aaa gcg act tat agt aac cga gac tgt gca ttg aag tat gtc ggt ttc 7248 Lys Ala Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 gaa gac tat aaa ttg ttt aaa ggt gtg gtt ttt gat ggc aat gag gcg 7296 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu Ala 2420 2425 2430 gcg gat tac caa atc caa ttg tcg cct gtg aca agg gcg tca gaa cag 7344 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 gat tct gaa gtc cgt att gcc gca aag atc ttt agc ctg aaa agt gac 7392 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 ggt aaa cct gtg ttt cat tat gca gcg aca ata ttg tta gca act cag 7440 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 cca ctt aat gct gtg aag gta gaa ctt ccg aca ttg aca gaa agt gtt 7488 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Ser Val 2485 2490 2495 gat agc aac aat aaa gta act gat gaa gca caa gcg tta tac agc aat 7536 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 ggc acc ttg ttc cac ggt gaa agt ctg cag ggc att aag cag ata tta 7584 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 agt tgt gac gac aag ggc ctg cta ttg gct tgt cag ata acc gat gtt 7632 Ser Cys Asp Asp Lys Gly Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 gca aca gct aag cag gga tcc ttc ccg tta gct gac aac aat atc ttt 7680 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 gcc aat gat ttg gtt tat cag gct atg ttg gtc tgg gtg cgc aaa caa 7728 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 ttt ggt tta ggt agc tta cct tcg gtg aca acg gct 
tgg act gtg tat 7776 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 cgt gaa gtg gtt gta gat gaa gta ttt tat ctg caa ctt aat gtt gtt 7824 Arg Glu Val Val Val Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 gag cat gat cta ttg ggt tca cgc ggc agt aaa gcc cgt tgt gat att 7872 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 caa ttg att gct gct gat atg caa tta ctt gcc gaa gtg aaa tca gcg 7920 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 caa gtc agt gtc agt gac att ttg aac gat atg tca tga 7959 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 3 <211> 2652 <212> PRT <213> Moritella marina <400> 3 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 Ile Gly Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 
260 265 270 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Ile Tyr Ala Pro Arg 325 330 335 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 Asp Lys Pro Ser Glu Ala Leu Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 480 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys Gln Phe Asn Ala 565 570 575 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 Gln Gly Ser Gln Tyr Val Asn Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 
680 685 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 Tyr Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 980 985 990 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 Ile Val Tyr Val Asn Ala Asp Gly Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 Met Gln Gly Pro Gln Asp Tyr Ala Lys Thr Val Gln Asn 
Val Leu Ala 1090 1095 1100 Ala Gln Thr Ser Asn Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr 1170 1175 1180 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu 
Ile Leu Gly 1490 1495 1500 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 1730 1735 1740 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 Ala Glu His Lys Ala Glu Phe Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 Thr Trp Val Ala Val Thr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln 
Gly Leu Met 1890 1895 1900 Leu Ala Phe Leu Leu Ala Lys Leu Ser Lys Val Thr Gln Ala Ala Lys 1905 1910 1915 1920 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 Ser Asn Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 Gly Ala Gln Leu Leu Leu Asn Glu Leu Ala Ala Asn Asp 
Asn Arg Cys 2290 2295 2300 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 Lys Ala Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu Ala 2420 2425 2430 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Ser Val 2485 2490 2495 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 Ser Cys Asp Asp Lys Gly Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 Arg Glu Val Val Val Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 4 <211> 2598 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(2595) <400> 4 atg acg gaa tta gct gtt att ggt atg gat gct aaa ttt agc gga caa 48 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 
10 15 gac aat att gac cgt gtg gaa cgc gct ttc tat gaa ggt gct tat gta 96 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Gly Ala Tyr Val 20 25 30 ggt aat gtt agc cgc gtt agt acc gaa tct aat gtt att agc aat ggc 144 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 gaa gaa caa gtt att act gcc atg aca gtt ctt aac tct gtc agt cta 192 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 cta gcg caa acg aat cag tta aat ata gct gat atc gcg gtg ttg ctg 240 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 att gct gat gta aaa agt gct gat gat cag ctt gta gtc caa att gca 288 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 tca gca att gaa aaa cag tgt gcg agt tgt gtt gtt att gct gat tta 336 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 ggc caa gca tta aat caa gta gct gat tta gtt aat aac caa gac tgt 384 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 cct gtg gct gta att ggc atg aat aac tcg gtt aat tta tct cgt cat 432 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 gat ctt gaa tct gta act gca aca atc agc ttt gat gaa acc ttc aat 480 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 ggt tat aac aat gta gct ggg ttc gcg agt tta ctt atc gct tca act 528 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 gcg ttt gcc aat gct aag caa tgt tat ata tac gcc aac att aag ggc 576 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 ttc gct caa tcg ggc gta aat gct caa ttt aac gtt gga aac att agc 624 Phe Ala Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 gat act gca aag acc gca ttg cag caa gct agc ata act gca gag cag 672 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln 210 215 220 gtt ggt ttg tta gaa gtg tca gca gtc gct gat tcg gca atc gca ttg 720 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala 
Leu 225 230 235 240 tct gaa agc caa ggt tta atg tct gct tat cat cat acg caa act ttg 768 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 cat act gca tta agc agt gcc cgt agt gtg act ggt gaa ggc ggg tgt 816 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 ttt tca cag gtc gca ggt tta ttg aaa tgt gta att ggt tta cat caa 864 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 cgt tat att ccg gcg att aaa gat tgg caa caa ccg agt gac aat caa 912 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 atg tca cgg tgg cgg aat tca cca ttc tat atg cct gta gat gct cga 960 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 cct tgg ttc cca cat gct gat ggc tct gca cac att gcc gct tat agt 1008 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Ile Ala Ala Tyr Ser 325 330 335 tgt gtg act gct gac agc tat tgt cat att ctt tta caa gaa aac gtc 1056 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 tta caa gaa ctt gtt ttg aaa gaa aca gtc ttg caa gat aat gac tta 1104 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 act gaa agc aag ctt cag act ctt gaa caa aac aat cca gta gct gat 1152 Thr Glu Ser Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 ctg cgc act aat ggt tac ttt gca tcg agc gag tta gca tta atc ata 1200 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 gta caa ggt aat gac gaa gca caa tta cgc tgt gaa tta gaa act att 1248 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 aca ggg cag tta agt act act ggc ata agt act atc agt att aaa cag 1296 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 atc gca gca gac tgt tat gcc cgt aat gat act aac aaa gcc tat agc 1344 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445 gca gtg ctt att gcc gag act gct gaa gag tta agc aaa gaa ata acc 1392 Ala Val Leu Ile Ala 
Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 ttg gcg ttt gct ggt atc gct agc gtg ttt aat gaa gat gct aaa gaa 1440 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 480 tgg aaa acc ccg aag ggc agt tat ttt acc gcg cag cct gca aat aaa 1488 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 cag gct gct aac agc aca cag aat ggt gtc acc ttc atg tac cca ggt 1536 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 att ggt gct aca tat gtt ggt tta ggg cgt gat cta ttt cat cta ttc 1584 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 cca cag att tat cag cct gta gcg gct tta gcc gat gac att ggc gaa 1632 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 agt cta aaa gat act tta ctt aat cca cgc agt att agt cgt cat agc 1680 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 ttt aaa gaa ctc aag cag ttg gat ctg gac ctg cgc ggt aac tta gcc 1728 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Leu Ala 565 570 575 aat atc gct gaa gcc ggt gtg ggt ttt gct tgt gtg ttt acc aag gta 1776 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 ttt gaa gaa gtc ttt gcc gtt aaa gct gac ttt gct aca ggt tat agc 1824 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 atg ggt gaa gta agc atg tat gca gca cta ggc tgc tgg cag caa ccg 1872 Met Gly Glu Val Ser Met Tyr Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 gga ttg atg agt gct cgc ctt gca caa tcg aat acc ttt aat cat caa 1920 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 635 640 ctt tgc ggc gag tta aga aca cta cgt cag cat tgg ggc atg gat gat 1968 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 gta gct aac ggt acg ttc gag cag atc tgg gaa acc tat acc att aag 2016 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 gca acg att gaa cag gtc gaa att gcc tct gca 
gat gaa gat cgt gtg 2064 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 tat tgc acc att atc aat aca cct gat agc ttg ttg tta gcc ggt tat 2112 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 cca gaa gcc tgt cag cga gtc att aag aat tta ggt gtg cgt gca atg 2160 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 gca ttg aat atg gcg aac gca att cac agc gcg cca gct tat gcc gaa 2208 Ala Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 tac gat cat atg gtt gag cta tac cat atg gat gtt act cca cgt att 2256 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 aat acc aag atg tat tca agc tca tgt tat tta ccg att cca caa cgc 2304 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 agc aaa gcg att tcc cac agt att gct aaa tgt ttg tgt gat gtg gtg 2352 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 gat ttc cca cgt ttg gtt aat acc tta cat gac aaa ggt gcg cgg gta 2400 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 ttc att gaa atg ggt cca ggt cgt tcg tta tgt agc tgg gta gat aag 2448 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 atc tta gtt aat ggc gat ggc gat aat aaa aag caa agc caa cat gta 2496 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 tct gtt cct gtg aat gcc aaa ggc acc agt gat gaa ctt act tat att 2544 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 cgt gcg att gct aag tta att agt cat ggc gtg aat ttg aat tta gat 2592 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 agc tag 2598 Ser 865 <210> 5 <211> 865 <212> PRT <213> Moritella marina <400> 5 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Gly Ala Tyr Val 20 25 30 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 
40 45 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 Phe Ala Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln 210 215 220 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Ile Ala Ala Tyr Ser 325 330 335 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 Thr Glu Ser Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445 Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 Leu Ala 
Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 480 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Leu Ala 565 570 575 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 Met Gly Glu Val Ser Met Tyr Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 635 640 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 Ala Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 Ser 865 <210> 6 <211> 6036 <212> DNA <213> Moritella marina <220> <221> CDS <222> 
(1)..(6033) <400> 6 atg gaa aat att gca gta gta ggt att gct aat ttg ttc ccg ggc tca 48 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 caa gca ccg gat caa ttt tgg cag caa ttg ctt gaa caa caa gat tgc 96 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 cgc agt aag gcg acc gct gtt caa atg ggc gtt gat cct gct aaa tat 144 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 acc gcc aac aaa ggt gac aca gat aaa ttt tac tgt gtg cac ggc ggt 192 Thr Ala Asn Lys Gly Asp Thr Asp Lys Phe Tyr Cys Val His Gly Gly 50 55 60 tac atc agt gat ttc aat ttt gat gct tca ggt tat caa ctc gat aat 240 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 gat tat tta gcc ggt tta gat gac ctt aat caa tgg ggg ctt tat gtt 288 Asp Tyr Leu Ala Gly Leu Asp Asp Leu Asn Gln Trp Gly Leu Tyr Val 85 90 95 acg aaa caa gcc ctt acc gat gcg ggt tat tgg ggc agt act gca cta 336 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 gaa aac tgt ggt gtg att tta ggt aat ttg tca ttc cca act aaa tca 384 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 tct aat cag ctg ttt atg cct ttg tat cat caa gtt gtt gat aat gcc 432 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 tta aag gcg gta tta cat cct gat ttt caa tta acg cat tac aca gca 480 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 ccg aaa aaa aca cat gct gac aat gca tta gta gca ggt tat cca gct 528 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 gca ttg atc gcg caa gcg gcg ggt ctt ggt ggt tca cat ttt gca ctg 576 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 gat gcg gct tgt gct tca tct tgt tat agc gtt aag tta gcg tgt gat 624 Asp Ala Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 tac ctg cat acg ggt aaa gcc aac atg atg ctt gct ggt gcg gta tct 672 Tyr Leu His Thr Gly Lys Ala Asn Met Met Leu Ala Gly 
Ala Val Ser 210 215 220 gca gca gat cct atg ttc gta aat atg ggt ttc tcg ata ttc caa gct 720 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 tac cca gct aac aat gta cat gcc ccg ttt gac caa aat tca caa ggt 768 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 cta ttt gcc ggt gaa ggc gcg ggc atg atg gta ttg aaa cgt caa agt 816 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 gat gca gta cgt gat ggt gat cat att tac gcc att att aaa ggc ggc 864 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 gca tta tcg aat gac ggt aaa ggc gag ttt gta tta agc ccg aac acc 912 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 aag ggc caa gta tta gta tat gaa cgt gct tat gcc gat gca gat gtt 960 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 gac ccg agt aca gtt gac tat att gaa tgt cat gca acg ggc aca cct 1008 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His Ala Thr Gly Thr Pro 325 330 335 aag ggt gac aat gtt gaa ttg cgt tcg atg gaa acc ttt ttc agt cgc 1056 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 gta aat aac aaa cca tta ctg ggc tcg gtt aaa tct aac ctt ggt cat 1104 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 ttg tta act gcc gct ggt atg cct ggc atg acc aaa gct atg tta gcg 1152 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 cta ggt aaa ggt ctt att cct gca acg att aac tta aag caa cca ctg 1200 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 caa tct aaa aac ggt tac ttt act ggc gag caa atg cca acg acg act 1248 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 gtg tct tgg cca aca act ccg ggt gcc aag gca gat aaa ccg cgt acc 1296 Val Ser Trp Pro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 gca ggt gtg agc gta ttt ggt ttt ggt ggc agc aac gcc cat ttg gta 1344 Ala Gly Val Ser 
Val Phe Gly Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 tta caa cag cca acg caa aca ctc gag act aat ttt agt gtt gct aaa 1392 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 cca cgt gag cct ttg gct att att ggt atg gac agc cat ttt ggt agt 1440 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 480 gcc agt aat tta gcg cag ttc aaa acc tta tta aat aat aat caa aat 1488 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asn Asn Asn Gln Asn 485 490 495 acc ttc cgt gaa tta cca gaa caa cgc tgg aaa ggc atg gaa agt aac 1536 Thr Phe Arg Glu Leu Pro Glu Gln Arg Trp Lys Gly Met Glu Ser Asn 500 505 510 gct aac gtc atg cag tcg tta caa tta cgc aaa gcg cct aaa ggc agt 1584 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 tac gtt gaa cag cta gat att gat ttc ttg cgt ttt aaa gta ccg cct 1632 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 aat gaa aaa gat tgc ttg atc ccg caa cag tta atg atg atg caa gtg 1680 Asn Glu Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 560 gca gac aat gct gcg aaa gac gga ggt cta gtt gaa ggt cgt aat gtt 1728 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Arg Asn Val 565 570 575 gcg gta tta gta gcg atg ggc atg gaa ctg gaa tta cat cag tat cgt 1776 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 ggt cgc gtt aat cta acc acc caa att gaa gac agc tta tta cag caa 1824 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 ggt att aac ctg act gtt gag caa cgt gaa gaa ctg acc aat att gct 1872 Gly Ile Asn Leu Thr Val Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 aaa gac ggt gtt gcc tcg gct gca cag cta aat cag tat acg agt ttc 1920 Lys Asp Gly Val Ala Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 635 640 att ggt aat att atg gcg tca cgt att tcg gcg tta tgg gat ttt tct 1968 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 ggt cct gct att acc gta tcg gct gaa gaa 
aac tct gtt tat cgt tgt 2016 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 gtt gaa tta gct gaa aat cta ttt caa acc agt gat gtt gaa gcc gtt 2064 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 att att gct gct gtt gat ttg tct ggt tca att gaa aac att act tta 2112 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 cgt cag cac tac ggt cca gtt aat gaa aag gga tct gta agt gaa tgt 2160 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 ggt ccg gtt aat gaa agc agt tca gta acc aac aat att ctt gat cag 2208 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 caa caa tgg ctg gtg ggt gaa ggc gca gcg gct att gtc gtt aaa ccg 2256 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Val Lys Pro 740 745 750 tca tcg caa gtc act gct gaa caa gtt tat gcg cgt att gat gcg gtg 2304 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 agt ttt gcc cct ggt agc aat gcg aaa gca att acg att gca gcg gat 2352 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 aaa gca tta aca ctt gct ggt atc agt gct gct gat gta gct agt gtt 2400 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Asp Val Ala Ser Val 785 790 795 800 gaa gca cat gca agt ggt ttt agt gcc gaa aat aat gct gaa aaa acc 2448 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 gcg tta ccg act tta tac cca agc gca agt atc agt tcg gtg aaa gcc 2496 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 aat att ggt cat acg ttt aat gcc tcg ggt atg gcg agt att att aaa 2544 Asn Ile Gly His Thr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 acg gcg ctg ctg tta gat cag aat acg agt caa gat cag aaa agc aaa 2592 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 cat att gct att aac ggt cta ggt cgt gat aac agc tgc gcg cat ctt 2640 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 
880 atc tta tcg agt tca gcg caa gcg cat caa gtt gca cca gcg cct gta 2688 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 tct ggt atg gcc aag caa cgc cca cag tta gtt aaa acc atc aaa ctc 2736 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 ggt ggt cag tta att agc aac gcg att gtt aac agt gcg agt tca tct 2784 Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 tta cac gct att aaa gcg cag ttt gcc ggt aag cac tta aac aaa gtt 2832 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 aac cag cca gtg atg atg gat aac ctg aag ccc caa ggt att agc gct 2880 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 cat gca acc aat gag tat gtg gtg act gga gct gct aac act caa gct 2928 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 tct aac att caa gca tct cat gtt caa gcg tca agt cat gca caa gag 2976 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 ata gca cca aac caa gtt caa aat atg caa gct aca gca gcc gct gta 3024 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Val 995 1000 1005 agt tca ccc ctt tct caa cat caa cac aca gcg cag ccc gta gcg gca 3072 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 ccg agc gtt gtt gga gtg act gtg aaa cat aaa gca agt aac caa att 3120 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 cat cag caa gcg tct acg cat aaa gca ttt tta gaa agt cgt tta gct 3168 His Gln Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 gca cag aaa aac cta tcg caa ctt gtt gaa ttg caa acc aag ctg tca 3216 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 atc caa act ggt agt gac aat aca tct aac aat act gcg tca aca agc 3264 Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 aat aca gtg cta aca aat cct gta tca gca acg cca tta aca ctt gtg 3312 Asn Thr Val Leu 
Thr Asn Pro Val Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 tat aat gcg cct gta gta gcg aca aac cta acc agt aca gaa gca aaa 3360 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 gcg caa gca gct gct aca caa gct ggt ttt cag ata aaa gga cct gtt 3408 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 ggt tac aac tat cca ccg ctg cag tta att gaa cgt tat aat aaa cca 3456 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 gaa aac gtg att tac gat caa gct gat ttg gtt gaa ttc gct gaa ggt 3504 Glu Asn Val Ile Tyr Asp Gln Ala Asp Leu Val Glu Phe Ala Glu Gly 1155 1160 1165 gat att ggt aag gta ttt ggt gct gaa tac aat att att gat ggc tat 3552 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 tcg cgt cgt gta cgt ctg cca acc tca gat tac ttg tta gta aca cgt 3600 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 gtt act gaa ctt gat gcc aag gtg cat gaa tac aag aaa tca tac atg 3648 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Ser Tyr Met 1205 1210 1215 tgt act gaa tat gat gtg cct gtt gat gca ccg ttc tta att gat ggt 3696 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 cag atc cct tgg tct gtt gcc gtc gaa tca ggc cag tgt gat ttg atg 3744 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 ttg att tca tat atc ggt att gat ttc caa gcg aaa ggc gaa cgt gtt 3792 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 tac cgt tta ctt gat tgt gaa tta act ttc ctt gaa gag atg gct ttt 3840 Tyr Arg Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 ggt ggc gat act tta cgt tac gag atc cac att gat tcg tat gca cgt 3888 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 aac ggc gag caa tta tta ttc ttc ttc cat tac gat tgt tac gta ggg 3936 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 
1310 gat aag aag gta ctt atc atg cgt aat ggt tgt gct ggt ttc ttt act 3984 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 gac gaa gaa ctt tct gat ggt aaa ggc gtt att cat aac gac aaa gac 4032 Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 aaa gct gag ttt agc aat gct gtt aaa tca tca ttc acg ccg tta tta 4080 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 caa cat aac cgt ggt caa tac gat tat aac gac atg atg aag ttg gtt 4128 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 aat ggt gat gtt gcc agt tgt ttt ggt ccg caa tat gat caa ggt ggc 4176 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 cgt aat cca tca ttg aaa ttc tcg tct gag aag ttc ttg atg att gaa 4224 Arg Asn Pro Ser Leu Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 cgt att acc aag ata gac cca acc ggt ggt cat tgg gga cta ggc ctg 4272 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 tta gaa ggt cag aaa gat tta gac cct gag cat tgg tat ttc cct tgt 4320 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 cac ttt aaa ggt gat caa gta atg gct ggt tcg ttg atg tcg gaa ggt 4368 His Phe Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 tgt ggc caa atg gcg atg ttc ttc atg ctg tct ctt ggt atg cat acc 4416 Cys Gly Gln Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 aat gtg aac aac gct cgt ttc caa cca cta cca ggt gaa tca caa acg 4464 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 gta cgt tgt cgt ggg caa gta ctg cca cag cgc aat acc tta act tac 4512 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 cgt atg gaa gtt act gcg atg ggt atg cat cca cag cca ttc atg aaa 4560 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 gct aat att gat att ttg ctt gac ggt aaa gtg gtt gtt gat 
ttc aaa 4608 Ala Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 aac ttg agc gtg atg atc agc gaa caa gat gag cat tca gat tac cct 4656 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 gta aca ctg ccg agt aat gtg gcg ctt aaa gcg att act gca cct gtt 4704 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 gcg tca gta gca cca gca tct tca ccc gct aac agc gcg gat cta gac 4752 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 gaa cgt ggt gtt gaa ccg ttt aag ttt cct gaa cgt ccg tta atg cgt 4800 Glu Arg Gly Val Glu Pro Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 gtt gag tca gac ttg tct gca ccg aaa agc aaa ggt gtg aca ccg att 4848 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 aag cat ttt gaa gcg cct gct gtt gct ggt cat cat aga gtg cct aac 4896 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 caa gca ccg ttt aca cct tgg cat atg ttt gag ttt gcg acg ggt aat 4944 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 att tct aac tgt ttc ggt cct gat ttt gat gtt tat gaa ggt cgt att 4992 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 cca cct cgt aca cct tgt ggc gat tta caa gtt gtt act cag gtt gta 5040 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 gaa gtg cag ggc gaa cgt ctt gat ctt aaa aat cca tca agc tgt gta 5088 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 gct gaa tac tat gta ccg gaa gac gct tgg tac ttt act aaa aac agc 5136 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 cat gaa aac tgg atg cct tat tca tta atc atg gaa att gca ttg caa 5184 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 cca aat ggc ttt att tct ggt tac atg ggc acg acg ctt aaa tac cct 5232 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu 
Lys Tyr Pro 1730 1735 1740 gaa aaa gat ctg ttc ttc cgt aac ctt gat ggt agc ggc acg tta tta 5280 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 aag cag att gat tta cgc ggc aag acc att gtg aat aaa tca gtc ttg 5328 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 gtt agt acg gct att gct ggt ggc gcg att att caa agt ttc acg ttt 5376 Val Ser Thr Ala Ile Ala Gly Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 gat atg tct gta gat ggc gag cta ttt tat act ggt aaa gct gta ttt 5424 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 ggt tac ttt agt ggt gaa tca ctg act aac caa ctg ggc att gat aac 5472 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 ggt aaa acg act aat gcg tgg ttt gtt gat aac aat acc ccc gca gcg 5520 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 aat att gat gtg ttt gat tta act aat cag tca ttg gct ctg tat aaa 5568 Asn Ile Asp Val Phe Asp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 gcg cct gtg gat aaa ccg cat tat aaa ttg gct ggt ggt cag atg aac 5616 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gln Met Asn 1860 1865 1870 ttt atc gat aca gtg tca gtg gtt gaa ggc ggt ggt aaa gcg ggc gtg 5664 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 gct tat gtt tat ggc gaa cgt acg att gat gct gat gat tgg ttc ttc 5712 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 cgt tat cac ttc cac caa gat ccg gtg atg cca ggt tca tta ggt gtt 5760 Arg Tyr His Phe His Gln Asp Pro Val Met Pro Gly Ser Leu Gly Val 1905 1910 1915 1920 gaa gct att att gag ttg atg cag acc tat gcg ctt aaa aat gat ttg 5808 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 ggt ggc aag ttt gct aac cca cgt ttc att gcg ccg atg acg caa gtt 5856 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 gat tgg aaa tac cgt ggg caa att 
acg ccg ctg aat aaa cag atg tca 5904 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 ctg gac gtg cat atc act gag atc gtg aat gac gct ggt gaa gtg cga 5952 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 atc gtt ggt gat gcg aat ctg tct aaa gat ggt ctg cgt att tat gaa 6000 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 gtt aaa aac atc gtt tta agt att gtt gaa gcg taa 6036 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 7 <211> 2011 <212> PRT <213> Moritella marina <400> 7 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 Thr Ala Asn Lys Gly Asp Thr Asp Lys Phe Tyr Cys Val His Gly Gly 50 55 60 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 Asp Tyr Leu Ala Gly Leu Asp Asp Leu Asn Gln Trp Gly Leu Tyr Val 85 90 95 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 Asp Ala Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 Tyr Leu His Thr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser 
Pro Asn Thr 290 295 300 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His Ala Thr Gly Thr Pro 325 330 335 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 Val Ser Trp Pro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 Ala Gly Val Ser Val Phe Gly Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 480 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asn Asn Asn Gln Asn 485 490 495 Thr Phe Arg Glu Leu Pro Glu Gln Arg Trp Lys Gly Met Glu Ser Asn 500 505 510 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 Asn Glu Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 560 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Arg Asn Val 565 570 575 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 Gly Ile Asn Leu Thr Val Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 Lys Asp Gly Val Ala Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 635 640 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser 
Glu Cys 705 710 715 720 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Val Lys Pro 740 745 750 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Asp Val Ala Ser Val 785 790 795 800 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 Asn Ile Gly His Thr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 880 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Val 995 1000 1005 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 His Gln Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 Asn Thr Val Leu Thr Asn Pro Val Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 Ala Gln Ala Ala Ala Thr Gln 
Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 Glu Asn Val Ile Tyr Asp Gln Ala Asp Leu Val Glu Phe Ala Glu Gly 1155 1160 1165 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Ser Tyr Met 1205 1210 1215 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 Tyr Arg Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 1310 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 Arg Asn Pro Ser Leu Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 His Phe Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 Cys Gly Gln Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 Ala Asn Ile Asp Ile Leu Leu 
Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 Glu Arg Gly Val Glu Pro Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 1730 1735 1740 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 Val Ser Thr Ala Ile Ala Gly Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 Asn Ile Asp Val Phe Asp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gln Met Asn 1860 1865 1870 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 Arg Tyr His Phe His Gln Asp Pro Val Met Pro Gly Ser Leu Gly Val 1905 1910 1915 1920 Glu Ala Ile Ile Glu Leu Met 
Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 8 <211> 1617 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1)..(1614) <400> 8 atg tcg agt tta ggt ttt aac aat aac aac gca att aac tgg gct tgg 48 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 aaa gta gat cca gcg tca gtt cat aca caa gat gca gaa att aaa gca 96 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 gct tta atg gat cta act aaa cct ctc tat gtg gcg aat aat tca ggc 144 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 gta act ggt ata gct aat cat acg tca gta gca ggt gcg atc agc aat 192 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 aac atc gat gtt gat gta ttg gcg ttt gcg caa aag tta aac cca gaa 240 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 gat ctg ggt gat gat gct tac aag aaa cag cac ggc gtt aaa tat gct 288 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 tat cat ggc ggt gcg atg gca aat ggt att gcc tcg gtt gaa ttg gtt 336 Tyr His Gly Gly Ala Met Ala Asn Gly Ile Ala Ser Val Glu Leu Val 100 105 110 gtt gcg tta ggt aaa gca ggg ctg tta tgt tca ttt ggt gct gca ggt 384 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 cta gtg cct gat gcg gtt gaa gat gca att cgt cgt att caa gct gaa 432 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 tta cca aat ggc cct tat gcg gtt aac ttg atc cat gca cca gca gaa 480 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 gaa gca tta gag cgt ggc gcg gtt gaa cgt ttc cta aaa 
ctt ggc gtc 528 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 aag acg gta gag gct tca gct tac ctt ggt tta act gaa cac att gtt 576 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 tgg tat cgt gct gct ggt cta act aaa aac gca gat ggc agt gtt aat 624 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 atc ggt aac aag gtt atc gct aaa gta tcg cgt acc gaa gtt ggt cgc 672 Ile Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg 210 215 220 cgc ttt atg gaa cct gca ccg caa aaa tta ctg gat aag tta tta gaa 720 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 caa aat aag atc acc cct gaa caa gct gct tta gcg ttg ctt gta cct 768 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 atg gct gat gat att act ggg gaa gcg gat tct ggt ggt cat aca gat 816 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 aac cgt ccg ttt tta aca tta tta ccg acg att att ggt ctg cgt gat 864 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thr Ile Ile Gly Leu Arg Asp 275 280 285 gaa gtg caa gcg aag tat aac ttc tct cct gca tta cgt gtt ggt gct 912 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 ggt ggt ggt atc gga acg cct gaa gca gca ctc gct gca ttt aac atg 960 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 ggc gcg gct tat atc gtt ctg ggt tct gtg aat cag gcg tgt gtt gaa 1008 Gly Ala Ala Tyr Ile Val Leu Gly Ser Val Asn Gln Ala Cys Val Glu 325 330 335 gcg ggt gca tct gaa tat act cgt aaa ctg tta tcg aca gtt gaa atg 1056 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 gct gat gtg act atg gca cct gct gca gat atg ttt gaa atg ggt gtg 1104 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 aag ctg caa gta tta aaa cgc ggt tct atg ttc gcg atg cgt gcg aag 1152 Lys Leu Gln Val Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 aaa ctg tat gac ttg tat 
gtg gct tat gac tcg att gaa gat atc cca 1200 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 390 395 400 gct gct gaa cgt gag aag att gaa aaa caa atc ttc cgt gca aac cta 1248 Ala Ala Glu Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 gac gag att tgg gat ggc act atc gct ttc ttt act gaa cgc gat cca 1296 Asp Glu Ile Trp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 gaa atg cta gcc cgt gca acg agt agt cct aaa cgt aaa atg gca ctt 1344 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 atc ttc cgt tgg tat ctt ggc ctt tct tca cgc tgg tca aac aca ggc 1392 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 gag aag gga cgt gaa atg gat tat cag att tgg gca ggc cca agt tta 1440 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 ggt gca ttc aac agc tgg gtg aaa ggt tct tac ctt gaa gac tat acc 1488 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 cgc cgt ggc gct gta gat gtt gct ttg cat atg ctt aaa ggt gct gcg 1536 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 tat tta caa cgt gta aac cag ttg aaa ttg caa ggt gtt agc tta agt 1584 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 aca gaa ttg gca agt tat cgt acg agt gat taa 1617 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 9 <211> 538 <212> PRT <213> Moritella marina <400> 9 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 Tyr His Gly Gly Ala Met Ala Asn Gly Ile Ala Ser Val Glu Leu Val 100 105 110 Val Ala Leu Gly Lys Ala Gly Leu Leu 
Cys Ser Phe Gly Ala Ala Gly 115 120 125 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 Ile Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg 210 215 220 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thr Ile Ile Gly Leu Arg Asp 275 280 285 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 Gly Ala Ala Tyr Ile Val Leu Gly Ser Val Asn Gln Ala Cys Val Glu 325 330 335 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 Lys Leu Gln Val Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 390 395 400 Ala Ala Glu Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 Asp Glu Ile Trp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 
530 535
<210> 10 <211> 20 <212> DNA <213> Artificial Sequence
<220> <223> Description of Artificial Sequence:Primer
<220> <221> Degenerate <222> (6) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (12) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (15) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (18) <223> "n" is a, t, c or g
<400> 10 ttyggnttyg gnggnacnaa 20
<210> 11 <211> 21 <212> DNA <213> Artificial Sequence
<220> <223> Description of Artificial Sequence:Primer
<220> <221> Degenerate <222> (4) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (7) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (10) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (16) <223> "n" is a, t, c or g
<220> <221> Degenerate <222> (19) <223> "n" is a, t, c or g
<400> 11 ytcnccnarn swrtgnccng c 21
[Sequence List]
SEQUENCE LISTING
<110> Director-General of Agency of Industrial Science and Technology
<120> Gene from Docosahexaenoic Acid Producing Bacteria
<130> P99-0665 <160> 11
<170> PatentIn Ver.
2.0 <210> 1 <211> 41587 <212> DNA <213> Moritella marina <400>   1   gatcactctg   ctgcatggcg   agagctgttt   aattacaggt   tgaaaaaaac   gatgtaatgc   60   acttaattgc   ttgctgttct   taatgcctga   ggcgtcgaag   ataataccgt   tgaagcgatc   120   tgttttagcg   atagcattaa   ggctaatagg   tgtcgcgact   aaagacgttt   gattaaattc   180   aatattaaga   tcggctaacg   ctgacgtgtt   attaggataa   gaaatcgtga   cttcagcatc   240   tttaaatgtg   ttaagaatgg   gtttaattaa   tttgctgttg   ctggctgcgc   cgatgagtaa   300   gttgccagag   atgagatcgg   ttccctgatc   gtagcgtgtt   aacgtaaccg   gtcgtggcag   360   attaagcgct   ttaaataaac   ctgatgtcca   cttgccatta   gcgagttttg   cgtatgtatc   420   cgtcattttc   taatccttgt   tatagtgaac   agtttgaatc   tcgaagatgt   acatgtgtta   480   aaaattatct   gatagctatg   acttatctgc   cactacgtaa   taataaatag   accagttcat   540   tacatcgtta   atcgatatag   tataactaaa   tactaagtaa   attataatga   taagactgtt   600   atcgtactcg   gatcaaactc   tgatcagcaa   ataatcaaat   tagagttttt   attttaaact   660   tgtatcaaca   atgttacatt   aatgtatctt   acgtctaatg   tgctacgggc   atatttaagt   720   cactaaatta   aaggaataaa   ccatgacagg   tcaaacaata   agaagagtag   caattatcgg   780   cggtaaccgt   atcccgtttg   cacgttcaaa   tacagcgtat   tcaaaactaa   gtaaccaaga   840   tatgctgacg   gaaactatcc   gtggcttggt   ggttaaatat   aacctacgtg   gtgaacaact   900   gggggaagtt   gttgctggtg   cggtaattaa   gcattctcgt   gattttaact   taacacgtga   960   agccgtgcta   agtgcaggtc   ttgcacctga   aacgccttgt   tatgacattc   aacaagcttg   1020   tggtactggt   ctagctgcag   ctatccaagt   agcaaacaaa   attgcgcttg   gtcaaataga   1080   agcgggtatt   gctggtggtt   ctgatacgac   atcagatgca   ccgattgcag   tcagtgaagg   1140   catgcgtagt   gtattacttg   agcttaatcg   agctaaaacg   ggtaagcaac   gtttgaaagc   1200   actatctcgt   ctacgtctaa   aacactttgc   gccactaacg   cctgcaaata   aagagccgcg   1260   taccaaaatg   gcgatgggcg   atcattgtca   agtaacagcg   aaagagtgga   atatctcacg   1320   tgaagcacaa   gatgcattgg   cctgcgcaag   tcatcaaaaa   ttagctgcag   catatgaaga   
1380   aggtttcttt   gatacgttag   tttcacctat   ggccggctta   acgaaagata   acgtattacg   1440     cgcagataca   acagttgaga   aactggctaa   attgaaacct   tgttttgata   aagtaaacgg   1500   cactatgacg   gcgggtaaca   gtactaacct   taccgatgga   gcatcagctg   tattacttgc   1560   aagtgaagaa   tgggcagcgg   cacataactt   accagtacaa   gcttatctaa   catttggtga   1620   aacggccgct   atcgacttcg   ttgataagaa   agaaggtctg   ttaatggcgc   ctgcatacgc   1680   agtgccaaaa   atgttgaagc   gtgctggcct   tacattacaa   gacttcgatt   actatgaaat   1740   acatgaagca   tttgctgcgc   agttattagc   aacgctagca   gcttgggaag   acgaaaaatt   1800   ctgtaaagaa   aaactgggtc   tagatgctgc   gcttggttca   attgatatga   ccaagttaaa   1860   cgtgaaaggg   agtagcttag   ccacgggtca   cccatttgcc   gcaactggtg   gtcgtgttgt   1920   cgctacgcta   gcgcaattac   ttgatcagaa   aggttcaggt   cgtggtttga   tctcgatttg   1980   tgctgctggt   ggtcaaggta   tcacggcaat   tttagagaaa   taaacgcact   gtttattatc   2040   tattgattaa   gctgtcctga   gatactggat   atttttaaat   aaaacgccaa   tactgcagag   2100   tattggcgtt   tttttgtaat   accaattcct   atataacggt   gcattttaaa   cacttaattt   2160   ccggcattgg   tatcataaaa   aagcagcacc   gaagtgctgc   ttgattgtag   attaacctat   2220   taaaatagag   aggctagaat   tagtcttcgt   atgcttcatt   atgtacgcca   gctgcacgac   2280   ccgatggatc   agcattgttt   tggaaacttt   catcccaagc   taatgcttct   acagttgaac   2340   aagcaacgga   tttaccaaac   ggtacgcatt   tcgctgctga   atcacctggg   aagtgatctt   2400   caaagatggc   acgatagtag   taaccttctt   tcgtatctgg   tgtgttaatt   gggaacttaa   2460   atgctgcact   tgctaacatt   tgatcagtta   ccgcttcttc   aacgtgtact   ttaagttggt   2520   caatccaaga   ataaccaaca   ccatcagaga   attgttcttt   ttgacgccat   acaatttctt   2580   caggtagtaa   atcttcaaat   gcttctcgaa   tgatgttttt   ctcaatgcgg   tcgcccgtga   2640   tcatttttag   ttcagggttt   agacgcattg   acgcatcaac   aaattcttta   tctaagaaag   2700   gaacacgtgc   ttcgatgccc   caagctgcca   tagatttgtt   tgcacgtaag   caatcaaaca   2760   tatgtaattt   atttacttta   
cgtaccgtct   cttcatggaa   ttctttcgca   tttggcgctt   2820   tgtggaagta   caagtaacca   ccgaacagtt   catcagcacc   ttcaccagaa   agcaccatct   2880   taatccccat   ggctttaatt   ttacgtgcca   ttaggtacat   aggggttgat   gcacgaattg   2940   ttgttacatc   gtaggtttca   atgtggtaaa   tcacgtcgcg   taaagcgtcg   ataccttctt   3000   gcacagtaaa   ttcaattgaa   tgatggatag   tacctaagtg   atctgccact   ttttgtgcag   3060   cggctaaatc   tggagaacca   tttaggccta   cagagaaaga   gtgtagttgt   ggccaccatg   3120   cttcggtttt   accaccgtct   tcaatacgac   gttttgcata   ctgttgggtg   attgctgaaa   3180   taacagatga   atctaacccg   cctgataata   atacgccgta   aggtacatca   cacattaatt   3240   gacgtttaac   tgcatcttcc   aaaccttgct   taacaacgct   tttatcacca   ccattttgtg   3300   caacgttatc   aaaatctttc   caatcacgtt   gataataagg   cgtgactaca   ccatccttac   3360   tccacaggta   atgacctgct   gggaattctt   caatttgagt   acaaattggc   actagtgctt   3420   tcatttcaga   ggcaacataa   aagttaccgt   gttcatcata   gcccgtataa   agagggatga   3480   taccgatatg   gtcacggcca   atcaggtaag   cgtcctctgt   ttcgtcatat   aaagcgaaag   3540   caaaaatacc   atttagatca   tctaaaaatt   gtgtgccttt   ttctttatat   agcgcaagta   3600   tcacttcgca   atctgattct   gtttggaatt   caaagtctac   gttcagcgtt   ttctttaaat   3660   ctttgtggtt   ataaatttca   ccattaacag   caagtacgtg   tgtcttttct   tcattatata   3720   gcggctgtgc   accattattt   acatcgacaa   tagcaagacg   ttcatgaact   aaaatagcat   3780   tgtcacttgt   atagatacct   gaccaatctg   ggccgcggtg   acgtagtaac   tttgatagtt   3840   ctagtgcttg   ttcgcgaaga   ggtttaatgt   ctgatttgat   gtctagaatt   ccgaatattg   3900   agcacataac   taattccttc   tggggctgcg   tctgcagcta   actttctaaa   tagtgtgtct   3960   aatttgccac   attgtagatt   taatgcaaac   attaatgata   aaacatttat   aaaaaatgta   4020   attcaatgtg   gaatcgataa   tttaatggct   taaaagtgaa   gatccattaa   ttgtgatggc   4080   gaggtgatag   accaatgtag   accttaatga   ataaagcagg   cacgattgaa   tccattcaac   4140   gcaaagtggt   actaactatt   gttttaaacg   ttataaatag   tgttttaaag   
gttataagta   4200   aataatttaa   aaacaataat   aatccacatg   cattaaattt   atcatgataa   accgctatat   4260   ctcaatggca   atttgggata   agtgtaaaat   atatgtaaaa   tgaatgagtt   gacttgcttt   4320   ttttacacta   agtgatgaaa   ttaaagctag   atgtcgttgt   tagcattgat   taataacgta   4380   ctaaaatacg   acatctagta   tagaaattta   aaaaacagtt   ggttttgata   gcataactgc   4440     ataaactaat   cagcttattg   tctgtaatat   ttttgtaatt   taaataggtt   taataaaatt   4500   atatgtctga   taaatataaa   ccgtacgacc   tttcctttaa   aaagacgttt   ttgctgccta   4560   agttttggcc   tgtgtggttc   ggggtgtttg   caatatactt   attagctttt   atgccagtaa   4620   agccgcgtga   taaatttgct   cgattcatag   cgaagaaatt   gtttagtcta   aaaatgatgg   4680   caaagcgtaa   aaaggtagca   aagatcaatt   tatctatgtg   cttccctgaa   atggatgata   4740   cggaacaaga   ccgtataatc   atggtcaatc   tagttacttt   ttgtcaaact   atcttaagtt   4800   atgcagagcc   aagtgcgcgt   agtcgtgctt   ataaccgtga   ccgtatgata   gtgcatggtg   4860   gcgagaattt   atttccgcta   cttgaacaag   gtaaggcttg   tatcttatta   gtgccgcata   4920   gcttcgctat   tgattttgca   ggtttacaca   ttgcttctta   tggcgcgcca   ttttgtacta   4980   tgtttaacaa   ttctgagaat   gagttgttcg   attggctgat   gacacgtcaa   cgcgctatgt   5040   ttggaggcac   tgtttatcac   cgcaaggcag   ggctaggggc   tctagttaaa   tcacttaaga   5100   gcggtgaaag   ctgttattac   ttacctgatg   aagaccatgg   acctaagcgt   agtgtatttg   5160   cgcctttatt   tgcgactcaa   aaagcaactt   tacctgtaat   gggcaagcta   gcagaaaaaa   5220   caaatgcact   cgttgttcct   gtttatgcgg   catataatga   atcactaggt   aaatttgaaa   5280   cctttattcg   accagcaatg   caaaactttc   catcagaaag   cccagaacaa   gatgcagtga   5340   tgatgaataa   agagattgaa   gccttgattg   aatgtggtgt   tgatcaatat   atgtggacac   5400   ttagattatt   gagaacacgt   ccggacggta   aaaaaatcta   ctaataaagt   ttaataaaca   5460   ccataatctt   cgttgaatat   ggtgtttacc   cccctgaata   ccctctaaat   taataacaaa   5520   aaaagccatt   tacgtaacat   ctaatgatga   tttagcctgc   acttgctttg   tttttagtct   5580   taagagccta   
ataaacttga   tctaggtata   gattctgtct   ttctttacgt   aacgcgatct   5640   atttttttta   accgatagtt   gttataatta   gtttcatatg   aaagagatat   cgtttcagta   5700   aaagctattt   cgtttcaata   gataatttat   ttatagtcat   attttctgta   atgacaatca   5760   ttttctcatc   tagactatag   ataagaatac   gaattaagta   agaacattaa   ttttacaaga   5820   atataaaata   tcccatcgga   gctataagaa   tgaaaaagac   taaaattgtt   tgtacaattg   5880   gtccaaaaac   tgaatcagta   gagaaactaa   cagagcttgt   taatgcaggc   atgaacgtta   5940   tgcgtttaaa   tttctctcat   ggtaactttg   ctgaacattc   agtgcgtatt   caaaatatcc   6000   gtcaagtaag   tgaaaacctg   aataagaaaa   ttgctgtttt   actggatact   aaaggtccag   6060   aaatccgtac   gattaaacta   gaaaacggtg   acgatgtaat   gttgaccgct   ggtcagtcat   6120   tcacgtttac   aacagacatt   aacgtggtag   gtaataaaga   ctgtgttgct   gtaacatatg   6180   ctggttttgc   taaagacctt   aatcctggtg   caatcatcct   tgttgatgat   ggtttaattg   6240   aaatggaagt   tgttgcaaca   actgacactg   aagttaaatg   tacagtatta   aatactggtg   6300   cacttggtga   aaataaaggc   gttaacttac   ctaacatcag   tgtaggtcta   cctgcattgt   6360   cagaaaaaga   taaagctgat   ttagcgtttg   gttgtgagca   agaagttgat   tttgttgctg   6420   catcatttat   tcgtaaggct   gatgatgtaa   gagaaattcg   tgaaatccta   tttaataatg   6480   gtggcgaaaa   cattcagatt   atctcgaaaa   ttgaaaacca   agaaggtgta   gacaatttcg   6540   atgaaatctt   agctgaatca   gacggtatca   tggttgctcg   tggcgatctc   ggtgttgaga   6600   tcccagttga   agaagtgatc   atggcacaga   agatgatgat   caaaaaatgt   aataaagcag   6660   gtaaagttgt   aattactgca   acacaaatgc   ttgattcaat   gatcagtaac   ccacgtccaa   6720   cacgtgcaga   agcgggcgat   gttgccaatg   ctgtgcttga   cggtaccgac   gcggtaatgc   6780   tttctggtga   aactgcgaaa   ggtaaatacc   cagttgaagc   tgtgtctatc   atggcaaaca   6840   tctgtgaacg   tactgataac   tcaatgtctt   cggatttagg   tgcgaacatt   gttgctaaaa   6900   gcatgcgcat   tacagaagct   gtgtgtaaag   gtgcggtaga   aacaacagaa   aaattgtgtg   6960   ctccacttat   tgttgttgca   actcgtggcg   gtaaatcagc   
aaaatctgtt   cgtaaatact   7020   tcccgaaagc   aaatattctt   gctatcacaa   caaatgaaaa   agcagcgcaa   cagttatgcc   7080   taactaaagg   cgtaagcagc   tgcatcgttg   agcagattga   tagcactgat   gagttctacc   7140   gtaaaggtaa   agagcttgca   ttagcaactg   gtttagctaa   agaaggcgat   atcgttgtta   7200   tggtatcagg   tgcgttagta   ccatcaggta   caacgaatac   ggcatctgtt   caccaacttt   7260   aagttgccat   attgatatta   taaaaaagag   agcgtatgct   ctcttttttt   atatctgtag   7320   tttatatgtc   tgtacaaaaa   aatgataaag   agtacataaa   ctattaatat   agcgtaatat   7380   ataatgatta   acggtgatga   aagggttaaa   taaatggata   gtgctaaaca   taaaattggc   7440     ttagtccttt   ctggcggtgg   tgcgaaaggt   attgctcatc   ttggtgtatt   aaaatacctg   7500   ttagagcaag   atataagacc   gaatgtaatt   gcgggtacaa   gtgctggctc   tatggttggt   7560   gcactttatt   gctcaggact   tgagattgat   gacattttac   aattcttcat   cgatgtaaaa   7620   cctttttctt   ggaagtttac   ccgtgcccgt   gctggcttta   tagacccggc   aaaattatat   7680   cctgaagtgc   taaaatatat   ccccgaggat   agctttgagt   accttcaacc   tgaattgcgc   7740   attgttgcca   ccaacatgtt   actcggtaaa   gagcatatat   ttaaagatgg   ctccgtgatt   7800   aatgccttat   tagcatcagc   cagctaccct   ttagtttttt   ctccgatgat   cattgacgat   7860   caagtgtatt   cagatggcgg   tattgttaat   catttccccg   tgagtgtcat   tgaagatgat   7920   tgcgataaaa   taatcggcgt   atacgtgtcg   cccattcgtc   aggtcgaagc   tgacgaactc   7980   tcgagtataa   aagacgtggt   attacgtgcg   ttcacgctgc   agggtagtgg   tgctgaatta   8040   gataaactat   cgcaatgtga   tgtgcaaatt   tatccagaag   cgctattgaa   ttacaatacg   8100   tttgcaaccg   atgaaaaatc   attacgggag   atctaccaga   ttggttatga   tgctgcaaaa   8160   gatcaacatg   acaaccttat   ggcattgaaa   gaaagtatca   ccaccagcga   ggttaaaaag   8220   aacgtcttta   gcaaatggtt   tggtgataaa   cttgctagca   acagcggcaa   atagcggccc   8280   acacggattt   atacactagg   ataatgggcg   ttaatagcct   cactgtcgtt   gtgtggtctc   8340   taattttagc   taaatcttgt   gttatactga   cttcctatta   atcataaacg   atttatcacg   8400   
gtaaacatga   ctcaaataaa   taacccgctt   cacggcatga   cactcgaaaa   agtaattaac   8460   agtctcgttg   aacaatatgg   ctgggatggt   cttggatact   acatcaacat   tcgttgcttt   8520   actgaaaatc   caagtgttaa   gtctagtctt   aaatttttac   gtaaaacccc   ttgggcacgt   8580   gataaagtag   aagcgctata   tatcaaaatg   gtgactgaag   gctaactgtc   tccacgctag   8640   cgaaccgctg   tttatagtta   atataagtac   tataagcagg   gctcgttaat   tcagtatgta   8700   attaatcctg   aataccttcc   gcttatttca   acattgtact   ctctagataa   cactctcaac   8760   attacacctt   caacatcaca   gcctccacat   aacatccgat   gacatagccc   tgttattttt   8820   cacatttatc   tatatgctat   atattttagc   catttgatca   attgagttaa   tttctgcaat   8880   gacaaagata   taccatcatc   cagtacaaat   ttattatgaa   gataccgacc   attctggtgt   8940   tgtttaccac   cctaactttt   taaaatactt   tgaacgtgca   cgtgagcatg   tgataaatag   9000   tgacttacta   gcaacattgt   ggaatgaacg   cggtttaggt   tttgcggtgt   ataaagccaa   9060   tatgactttt   caggatgggg   tcgaatttgc   tgaagtgtgt   gatattcgca   cttcttttgt   9120   cctagacggt   aagtacaaaa   cgatctggcg   ccaagaagta   tggcgtccga   atgcgactag   9180   ggctgccgtt   atcggtgata   ttgaaatggt   gtgcttagac   aaacaaaaac   gtttacagcc   9240   catccctgat   gatgtgttag   ctgcaatggt   tagtgaataa   atggttcatg   cataaatagt   9300   taatacatga   ttctggcccg   tcacgtttac   agataagagg   catccgatgc   ctccttccta   9360   ttaccaatac   tactgcttat   ccctttctaa   ctatctttag   cgtccataac   acactgagca   9420   tttattctat   taatcagtga   ttgtgattta   attatcttct   atatatgtaa   tttaatgtaa   9480   ttttcaattt   atttttagct   acattaaggc   ttacgaatgt   acgctaaaat   gagatgtcag   9540   actaatttta   gcttattaat   ctgttagccg   tttatatttt   ataaagatgg   gatttaactt   9600   aaatgcaatt   aattatggcg   taaatagagt   gaaaacatgg   ctaatattca   ctaagtcctg   9660   aattttatat   aaagtttaat   ctgttatttt   agcgtttacc   tggtcttatc   agtgaggttt   9720   atagccatta   ttagtgggat   tgaagtgatt   tttaaagcta   tgtatattat   tgcaaatata   9780   aattgtaaca   attaagactt   tggacacttg   
agttcaattt   cgaattgatt   ggcataaaat   9840   ttaaaacagc   taaatctacc   tcaatcattt   tagcaaatgt   atgcaggtag   atttttttcg   9900   ccatttaaga   gtacacttgt   acgctaggtt   tttgtttagt   gtgcaaatga   acgttttgat   9960   gagcattgtt   tttagagcac   aaaatagatc   cttacaggag   caataacgca   atggctaaaa   10020   agaacaccac   atcgattaag   cacgccaagg   atgtgttaag   tagtgatgat   caacagttaa   10080   attctcgctt   gcaagaatgt   ccgattgcca   tcattggtat   ggcatcggtt   tttgcagatg   10140   ctaaaaactt   ggatcaattc   tgggataaca   tcgttgactc   tgtggacgct   attattgatg   10200   tgcctagcga   tcgctggaac   attgacgacc   attactcggc   tgataaaaaa   gcagctgaca   10260   agacatactg   caaacgcggt   ggtttcattc   cagagcttga   ttttgatccg   atggagtttg   10320   gtttaccgcc   aaatatcctc   gagttaactg   acatcgctca   attgttgtca   ttaattgttg   10380   ctcgtgatgt   attaagtgat   gctggcattg   gtagtgatta   tgaccatgat   aaaattggta   10440     tcacgctggg   tgtcggtggt   ggtcagaaac   aaatttcgcc   attaacgtcg   cgcctacaag   10500   gcccggtatt   agaaaaagta   ttaaaagcct   caggcattga   tgaagatgat   cgcgctatga   10560   tcatcgacaa   atttaaaaaa   gcctacatcg   gctgggaaga   gaactcattc   ccaggcatgc   10620   taggtaacgt   tattgctggt   cgtatcgcca   atcgttttga   ttttggtggt   actaactgtg   10680   tggttgatgc   ggcatgcgct   ggctcccttg   cagctgttaa   aatggcgatc   tcagacttac   10740   ttgaatatcg   ttcagaagtc   atgatatcgg   gtggtgtatg   ttgtgataac   tcgccattca   10800   tgtatatgtc   attctcgaaa   acaccagcat   ttaccaccaa   tgatgatatc   cgtccgtttg   10860   atgacgattc   aaaaggcatg   ctggttggtg   aaggtattgg   catgatggcg   tttaaacgtc   10920   ttgaagatgc   tgaacgtgac   ggcgacaaaa   tttattctgt   actgaaaggt   atcggtacat   10980   cttcagatgg   tcgtttcaaa   tctatttacg   ctccacgccc   agatggccaa   gcaaaagcgc   11040   taaaacgtgc   ttatgaagat   gccggttttg   cccctgaaac   atgtggtcta   attgaaggcc   11100   atggtacggg   taccaaagcg   ggtgatgccg   cagaatttgc   tggcttgacc   aaacactttg   11160   gcgccgccag   tgatgaaaag   caatatatcg   ccttaggctt   
agttaaatcg   caaattggtc   11220   atactaaatc   tgcggctggc   tctgcgggta   tgattaaggc   ggcattagcg   ctgcatcata   11280   aaatcttacc   tgcaacgatc   catatcgata   aaccaagtga   agccttggat   atcaaaaaca   11340   gcccgttata   cctaaacagc   gaaacgcgtc   cttggatgcc   acgtgaagat   ggtattccac   11400   gtcgtgcagg   tatcagctca   tttggttttg   gcggcaccaa   cttccatatt   attttagaag   11460   agtatcgccc   aggtcacgat   agcgcatatc   gcttaaactc   agtgagccaa   actgtgttga   11520   tctcggcaaa   cgaccaacaa   ggtattgttg   ctgagttaaa   taactggcgt   actaaactgg   11580   ctgtcgatgc   tgatcatcaa   gggtttgtat   ttaatgagtt   agtgacaacg   tggccattaa   11640   aaaccccatc   cgttaaccaa   gctcgtttag   gttttgttgc   gcgtaatgca   aatgaagcga   11700   tcgcgatgat   tgatacggca   ttgaaacaat   tcaatgcgaa   cgcagataaa   atgacatggt   11760   cagtacctac   cggggtttac   tatcgtcaag   ccggtattga   tgcaacaggt   aaagtggttg   11820   cgctattctc   agggcaaggt   tcgcaatacg   tgaacatggg   tcgtgaatta   acctgtaact   11880   tcccaagcat   gatgcacagt   gctgcggcga   tggataaaga   gttcagtgcc   gctggtttag   11940   gccagttatc   tgcagttact   ttccctatcc   ctgtttatac   ggatgccgag   cgtaagctac   12000   aagaagagca   attacgttta   acgcaacatg   cgcaaccagc   gattggtagt   ttgagtgttg   12060   gtctgttcaa   aacgtttaag   caagcaggtt   ttaaagctga   ttttgctgcc   ggtcatagtt   12120   tcggtgagtt   aaccgcatta   tgggctgccg   atgtattgag   cgaaagcgat   tacatgatgt   12180   tagcgcgtag   tcgtggtcaa   gcaatggctg   cgccagagca   acaagatttt   gatgcaggta   12240   agatggccgc   tgttgttggt   gatccaaagc   aagtcgctgt   gatcattgat   acccttgatg   12300   atgtctctat   tgctaacttc   aactcgaata   accaagttgt   tattgctggt   actacggagc   12360   aggttgctgt   agcggttaca   accttaggta   atgctggttt   caaagttgtg   ccactgccgg   12420   tatctgctgc   gttccataca   cctttagttc   gtcacgcgca   aaaaccattt   gctaaagcgg   12480   ttgatagcgc   taaatttaaa   gcgccaagca   ttccagtgtt   tgctaatggc   acaggcttgg   12540   tgcattcaag   caaaccgaat   gacattaaga   aaaacctgaa   aaaccacatg   
ctggaatctg   12600   ttcatttcaa   tcaagaaatt   gacaacatct   atgctgatgg   tggccgcgta   tttatcgaat   12660   ttggtccaaa   gaatgtatta   actaaattgg   ttgaaaacat   tctcactgaa   aaatctgatg   12720   tgactgctat   cgcggttaat   gctaatccta   aacaacctgc   ggacgtacaa   atgcgccaag   12780   ctgcgctgca   aatggcagtg   cttggtgtcg   cattagacaa   tattgacccg   tacgacgccg   12840   ttaagcgtcc   acttgttgcg   ccgaaagcat   caccaatgtt   gatgaagtta   tctgcagcgt   12900   cttatgttag   tccgaaaacg   aagaaagcgt   ttgctgatgc   attgactgat   ggctggactg   12960   ttaagcaagc   gaaagctgta   cctgctgttg   tgtcacaacc   acaagtgatt   gaaaagatcg   13020   ttgaagttga   aaagatagtt   gaacgcattg   tcgaagtaga   gcgtattgtc   gaagtagaaa   13080   aaatcgtcta   cgttaatgct   gacggttcgc   ttatatcgca   aaataatcaa   gacgttaaca   13140   gcgctgttgt   tagcaacgtg   actaatagct   cagtgactca   tagcagtgat   gctgaccttg   13200   ttgcctctat   tgaacgcagt   gttggtcaat   ttgttgcaca   ccaacagcaa   ttattaaatg   13260   tacatgaaca   gtttatgcaa   ggtccacaag   actacgcgaa   aacagtgcag   aacgtacttg   13320   ctgcgcagac   gagcaatgaa   ttaccggaaa   gtttagaccg   tacattgtct   atgtataacg   13380   agttccaatc   agaaacgcta   cgtgtacatg   aaacgtacct   gaacaatcag   acgagcaaca   13440     tgaacaccat   gcttactggt   gctgaagctg   atgtgctagc   aaccccaata   actcaggtag   13500   tgaatacagc   cgttgccact   agtcacaagg   tagttgctcc   agttattgct   aatacagtga   13560   cgaatgttgt   atctagtgtc   agtaataacg   cggcggttgc   agtgcaaact   gtggcattag   13620   cgcctacgca   agaaatcgct   ccaacagtcg   ctactacgcc   agcacccgca   ttggttgcta   13680   tcgtggctga   acctgtgatt   gttgcgcatg   ttgctacaga   agttgcacca   attacaccat   13740   cagttacacc   agttgtcgca   actcaagcgg   ctatcgatgt   agcaactatt   aacaaagtaa   13800   tgttagaagt   tgttgctgat   aaaaccggtt   atccaacgga   tatgctggaa   ctgagcatgg   13860   acatggaagc   tgacttaggt   atcgactcaa   tcaaacgtgt   tgagatatta   ggcgcagtac   13920   aggaattgat   ccctgactta   cctgaactta   atcctgaaga   tcttgctgag   ctacgcacgc   13980  
 ttggtgagat   tgtcgattac   atgaattcaa   aagcccaggc   tgtagctcct   acaacagtac   14040   ctgtaacaag   tgcacctgtt   tcgcctgcat   ctgctggtat   tgatttagcc   cacatccaaa   14100   acgtaatgtt   agaagtggtt   gcagacaaaa   ccggttaccc   aacagacatg   ctagaactga   14160   gcatggatat   ggaagctgac   ttaggtattg   attcaatcaa   gcgtgtggaa   atcttaggtg   14220   cagtacagga   gatcataact   gatttacctg   agctaaaccc   tgaagatctt   gttgaattac   14280   gcaccctagg   tgaaatcgtt   agttacatgc   aaagcaaagc   gccagtcgct   gaaagtgcgc   14340   cagtggcgac   ggctcctgta   gcaacaagct   cagcaccgtc   tatcgatttg   aaccacattc   14400   aaacagtgat   gatggatgta   gttgcagata   agactggtta   tccaactgac   atgctagaac   14460   ttggcatgga   catggaagct   gatttaggta   tcgattcaat   caaacgtgtg   gaaatattag   14520   gcgcagtgca   ggagatcatc   actgatttac   ctgagctaaa   cccagaagac   ctcgctgaat   14580   tacgcacgct   aggtgaaatc   gttagttaca   tgcaaagcaa   agcgccagtc   gctgagagtg   14640   cgccagtagc   gacggcttct   gtagcaacaa   gctctgcacc   gtctatcgat   ttaaaccata   14700   tccaaacagt   gatgatggaa   gtggttgcag   acaaaaccgg   ttatccagta   gacatgttag   14760   aacttgctat   ggacatggaa   gctgacctag   gtatcgattc   aatcaagcgt   gtagaaattt   14820   taggtgcggt   acaggaaatc   attactgact   tacctgagct   taaccctgaa   gatcttgctg   14880   aactacgtac   attaggtgaa   atcgttagtt   acatgcaaag   caaagcgccc   gtagctgaag   14940   cgcctgcagt   acctgttgca   gtagaaagtg   cacctactag   tgtaacaagc   tcagcaccgt   15000   ctatcgattt   agaccacatc   caaaatgtaa   tgatggatgt   tgttgctgat   aagactggtt   15060   atcctgccaa   tatgcttgaa   ttagcaatgg   acatggaagc   cgaccttggt   attgattcaa   15120   tcaagcgtgt   tgaaattcta   ggcgcggtac   aggagatcat   tactgattta   cctgaactaa   15180   acccagaaga   cttagctgaa   ctacgtacgt   tagaagaaat   tgtaacctac   atgcaaagca   15240   aggcgagtgg   tgttactgta   aatgtagtgg   ctagccctga   aaataatgct   gtatcagatg   15300   catttatgca   aagcaatgtg   gcgactatca   cagcggccgc   agaacataag   gcggaattta   15360   aaccggcgcc   
gagcgcaacc   gttgctatct   ctcgtctaag   ctctatcagt   aaaataagcc   15420   aagattgtaa   aggtgctaac   gccttaatcg   tagctgatgg   cactgataat   gctgtgttac   15480   ttgcagacca   cctattgcaa   actggctgga   atgtaactgc   attgcaacca   acttgggtag   15540   ctgtaacaac   gacgaaagca   tttaataagt   cagtgaacct   ggtgacttta   aatggcgttg   15600   atgaaactga   aatcaacaac   attattactg   ctaacgcaca   attggatgca   gttatctatc   15660   tgcacgcaag   tagcgaaatt   aatgctatcg   aatacccaca   agcatctaag   caaggcctga   15720   tgttagcctt   cttattagcg   aaattgagta   aagtaactca   agccgctaaa   gtgcgtggcg   15780   cctttatgat   tgttactcag   cagggtggtt   cattaggttt   tgatgatatc   gattctgcta   15840   caagtcatga   tgtgaaaaca   gacctagtac   aaagcggctt   aaacggttta   gttaagacac   15900   tgtctcacga   gtgggataac   gtattctgtc   gtgcggttga   tattgcttcg   tcattaacgg   15960   ctgaacaagt   tgcaagcctt   gttagtgatg   aactacttga   tgctaacact   gtattaacag   16020   aagtgggtta   tcaacaagct   ggtaaaggcc   ttgaacgtat   cacgttaact   ggtgtggcta   16080   ctgacagcta   tgcattaaca   gctggcaata   acatcgatgc   taactcggta   tttttagtga   16140   gtggtggcgc   aaaaggtgta   actgcacatt   gtgttgctcg   tatagctaaa   gaatatcagt   16200   ctaagttcat   cttattggga   cgttcaacgt   tctcaagtga   cgaaccgagc   tgggcaagtg   16260   gtattactga   tgaagcggcg   ttaaagaaag   cagcgatgca   gtctttgatt   acagcaggtg   16320   ataaaccaac   acccgttaag   atcgtacagc   taatcaaacc   aatccaagct   aatcgtgaaa   16380   ttgcgcaaac   cttgtctgca   attaccgctg   ctggtggcca   agctgaatat   gtttctgcag   16440     atgtaactaa   tgcagcaagc   gtacaaatgg   cagtcgctcc   agctatcgct   aagttcggtg   16500   caatcactgg   catcattcat   ggcgcgggtg   tgttagctga   ccaattcatt   gagcaaaaaa   16560   cactgagtga   ttttgagtct   gtttacagca   ctaaaattga   cggtttgtta   tcgctactat   16620   cagtcactga   agcaagcaac   atcaagcaat   tggtattgtt   ctcgtcagcg   gctggtttct   16680   acggtaaccc   cggccagtct   gattactcga   ttgccaatga   gatcttaaat   aaaaccgcat   16740   accgctttaa   atcattgcac   
ccacaagctc   aagtattgag   ctttaactgg   ggtccttggg   16800   acggtggcat   ggtaacgcct   gagcttaaac   gtatgtttga   ccaacgtggt   gtttacatta   16860   ttccacttga   tgcaggtgca   cagttattgc   tgaatgaact   agccgctaat   gataaccgtt   16920   gtccacaaat   cctcgtgggt   aatgacttat   ctaaagatgc   tagctctgat   caaaagtctg   16980   atgaaaagag   tactgctgta   aaaaagccac   aagttagtcg   tttatcagat   gctttagtaa   17040   ctaaaagtat   caaagcgact   aacagtagct   ctttatcaaa   caagactagt   gctttatcag   17100   acagtagtgc   ttttcaggtt   aacgaaaacc   actttttagc   tgaccacatg   atcaaaggca   17160   atcaggtatt   accaacggta   tgcgcgattg   cttggatgag   tgatgcagca   aaagcgactt   17220   atagtaaccg   agactgtgca   ttgaagtatg   tcggtttcga   agactataaa   ttgtttaaag   17280   gtgtggtttt   tgatggcaat   gaggcggcgg   attaccaaat   ccaattgtcg   cctgtgacaa   17340   gggcgtcaga   acaggattct   gaagtccgta   ttgccgcaaa   gatctttagc   ctgaaaagtg   17400   acggtaaacc   tgtgtttcat   tatgcagcga   caatattgtt   agcaactcag   ccacttaatg   17460   ctgtgaaggt   agaacttccg   acattgacag   aaagtgttga   tagcaacaat   aaagtaactg   17520   atgaagcaca   agcgttatac   agcaatggca   ccttgttcca   cggtgaaagt   ctgcagggca   17580   ttaagcagat   attaagttgt   gacgacaagg   gcctgctatt   ggcttgtcag   ataaccgatg   17640   ttgcaacagc   taagcaggga   tccttcccgt   tagctgacaa   caatatcttt   gccaatgatt   17700   tggtttatca   ggctatgttg   gtctgggtgc   gcaaacaatt   tggtttaggt   agcttacctt   17760   cggtgacaac   ggcttggact   gtgtatcgtg   aagtggttgt   agatgaagta   ttttatctgc   17820   aacttaatgt   tgttgagcat   gatctattgg   gttcacgcgg   cagtaaagcc   cgttgtgata   17880   ttcaattgat   tgctgctgat   atgcaattac   ttgccgaagt   gaaatcagcg   caagtcagtg   17940   tcagtgacat   tttgaacgat   atgtcatgat   cgagtaaata   ataacgatag   gcgtcatggt   18000   gagcatggcg   tctgctttct   tcatttttta   acattaacaa   tattaatagc   taaacgcggt   18060   tgctttaaac   caagtaaaca   agtgctttta   gctattacta   ttccaaacag   gatattaaag   18120   agaatatgac   ggaattagct   gttattggta   
tggatgctaa   atttagcgga   caagacaata   18180   ttgaccgtgt   ggaacgcgct   ttctatgaag   gtgcttatgt   aggtaatgtt   agccgcgtta   18240   gtaccgaatc   taatgttatt   agcaatggcg   aagaacaagt   tattactgcc   atgacagttc   18300   ttaactctgt   cagtctacta   gcgcaaacga   atcagttaaa   tatagctgat   atcgcggtgt   18360   tgctgattgc   tgatgtaaaa   agtgctgatg   atcagcttgt   agtccaaatt   gcatcagcaa   18420   ttgaaaaaca   gtgtgcgagt   tgtgttgtta   ttgctgattt   aggccaagca   ttaaatcaag   18480   tagctgattt   agttaataac   caagactgtc   ctgtggctgt   aattggcatg   aataactcgg   18540   ttaatttatc   tcgtcatgat   cttgaatctg   taactgcaac   aatcagcttt   gatgaaacct   18600   tcaatggtta   taacaatgta   gctgggttcg   cgagtttact   tatcgcttca   actgcgtttg   18660   ccaatgctaa   gcaatgttat   atatacgcca   acattaaggg   cttcgctcaa   tcgggcgtaa   18720   atgctcaatt   taacgttgga   aacattagcg   atactgcaaa   gaccgcattg   cagcaagcta   18780   gcataactgc   agagcaggtt   ggtttgttag   aagtgtcagc   agtcgctgat   tcggcaatcg   18840   cattgtctga   aagccaaggt   ttaatgtctg   cttatcatca   tacgcaaact   ttgcatactg   18900   cattaagcag   tgcccgtagt   gtgactggtg   aaggcgggtg   tttttcacag   gtcgcaggtt   18960   tattgaaatg   tgtaattggt   ttacatcaac   gttatattcc   ggcgattaaa   gattggcaac   19020   aaccgagtga   caatcaaatg   tcacggtggc   ggaattcacc   attctatatg   cctgtagatg   19080   ctcgaccttg   gttcccacat   gctgatggct   ctgcacacat   tgccgcttat   agttgtgtga   19140   ctgctgacag   ctattgtcat   attcttttac   aagaaaacgt   cttacaagaa   cttgttttga   19200   aagaaacagt   cttgcaagat   aatgacttaa   ctgaaagcaa   gcttcagact   cttgaacaaa   19260   acaatccagt   agctgatctg   cgcactaatg   gttactttgc   atcgagcgag   ttagcattaa   19320   tcatagtaca   aggtaatgac   gaagcacaat   tacgctgtga   attagaaact   attacagggc   19380   agttaagtac   tactggcata   agtactatca   gtattaaaca   gatcgcagca   gactgttatg   19440     cccgtaatga   tactaacaaa   gcctatagcg   cagtgcttat   tgccgagact   gctgaagagt   19500   taagcaaaga   aataaccttg   gcgtttgctg   gtatcgctag   
cgtgtttaat   gaagatgcta   19560   aagaatggaa   aaccccgaag   ggcagttatt   ttaccgcgca   gcctgcaaat   aaacaggctg   19620   ctaacagcac   acagaatggt   gtcaccttca   tgtacccagg   tattggtgct   acatatgttg   19680   gtttagggcg   tgatctattt   catctattcc   cacagattta   tcagcctgta   gcggctttag   19740   ccgatgacat   tggcgaaagt   ctaaaagata   ctttacttaa   tccacgcagt   attagtcgtc   19800   atagctttaa   agaactcaag   cagttggatc   tggacctgcg   cggtaactta   gccaatatcg   19860   ctgaagccgg   tgtgggtttt   gcttgtgtgt   ttaccaaggt   atttgaagaa   gtctttgccg   19920   ttaaagctga   ctttgctaca   ggttatagca   tgggtgaagt   aagcatgtat   gcagcactag   19980   gctgctggca   gcaaccggga   ttgatgagtg   ctcgccttgc   acaatcgaat   acctttaatc   20040   atcaactttg   cggcgagtta   agaacactac   gtcagcattg   gggcatggat   gatgtagcta   20100   acggtacgtt   cgagcagatc   tgggaaacct   ataccattaa   ggcaacgatt   gaacaggtcg   20160   aaattgcctc   tgcagatgaa   gatcgtgtgt   attgcaccat   tatcaataca   cctgatagct   20220   tgttgttagc   cggttatcca   gaagcctgtc   agcgagtcat   taagaattta   ggtgtgcgtg   20280   caatggcatt   gaatatggcg   aacgcaattc   acagcgcgcc   agcttatgcc   gaatacgatc   20340   atatggttga   gctataccat   atggatgtta   ctccacgtat   taataccaag   atgtattcaa   20400   gctcatgtta   tttaccgatt   ccacaacgca   gcaaagcgat   ttcccacagt   attgctaaat   20460   gtttgtgtga   tgtggtggat   ttcccacgtt   tggttaatac   cttacatgac   aaaggtgcgc   20520   gggtattcat   tgaaatgggt   ccaggtcgtt   cgttatgtag   ctgggtagat   aagatcttag   20580   ttaatggcga   tggcgataat   aaaaagcaaa   gccaacatgt   atctgttcct   gtgaatgcca   20640   aaggcaccag   tgatgaactt   acttatattc   gtgcgattgc   taagttaatt   agtcatggcg   20700   tgaatttgaa   tttagatagc   tagtttaacg   ggtcaatcct   ggttaaagca   ggccatatag   20760   caaacacgaa   caaatagtca   acatcgatat   ctagcgctgg   tgagttatac   ctcattagtt   20820   gaaatatgga   tttaaagaga   gtaattatgg   aaaatattgc   agtagtaggt   attgctaatt   20880   tgttcccggg   ctcacaagca   ccggatcaat   tttggcagca   attgcttgaa   
caacaagatt   20940   gccgcagtaa   ggcgaccgct   gttcaaatgg   gcgttgatcc   tgctaaatat   accgccaaca   21000   aaggtgacac   agataaattt   tactgtgtgc   acggcggtta   catcagtgat   ttcaattttg   21060   atgcttcagg   ttatcaactc   gataatgatt   atttagccgg   tttagatgac   cttaatcaat   21120   gggggcttta   tgttacgaaa   caagccctta   ccgatgcggg   ttattggggc   agtactgcac   21180   tagaaaactg   tggtgtgatt   ttaggtaatt   tgtcattccc   aactaaatca   tctaatcagc   21240   tgtttatgcc   tttgtatcat   caagttgttg   ataatgcctt   aaaggcggta   ttacatcctg   21300   attttcaatt   aacgcattac   acagcaccga   aaaaaacaca   tgctgacaat   gcattagtag   21360   caggttatcc   agctgcattg   atcgcgcaag   cggcgggtct   tggtggttca   cattttgcac   21420   tggatgcggc   ttgtgcttca   tcttgttata   gcgttaagtt   agcgtgtgat   tacctgcata   21480   cgggtaaagc   caacatgatg   cttgctggtg   cggtatctgc   agcagatcct   atgttcgtaa   21540   atatgggttt   ctcgatattc   caagcttacc   cagctaacaa   tgtacatgcc   ccgtttgacc   21600   aaaattcaca   aggtctattt   gccggtgaag   gcgcgggcat   gatggtattg   aaacgtcaaa   21660   gtgatgcagt   acgtgatggt   gatcatattt   acgccattat   taaaggcggc   gcattatcga   21720   atgacggtaa   aggcgagttt   gtattaagcc   cgaacaccaa   gggccaagta   ttagtatatg   21780   aacgtgctta   tgccgatgca   gatgttgacc   cgagtacagt   tgactatatt   gaatgtcatg   21840   caacgggcac   acctaagggt   gacaatgttg   aattgcgttc   gatggaaacc   tttttcagtc   21900   gcgtaaataa   caaaccatta   ctgggctcgg   ttaaatctaa   ccttggtcat   ttgttaactg   21960   ccgctggtat   gcctggcatg   accaaagcta   tgttagcgct   aggtaaaggt   cttattcctg   22020   caacgattaa   cttaaagcaa   ccactgcaat   ctaaaaacgg   ttactttact   ggcgagcaaa   22080   tgccaacgac   gactgtgtct   tggccaacaa   ctccgggtgc   caaggcagat   aaaccgcgta   22140   ccgcaggtgt   gagcgtattt   ggttttggtg   gcagcaacgc   ccatttggta   ttacaacagc   22200   caacgcaaac   actcgagact   aattttagtg   ttgctaaacc   acgtgagcct   ttggctatta   22260   ttggtatgga   cagccatttt   ggtagtgcca   gtaatttagc   gcagttcaaa   accttattaa   22320   
ataataatca   aaataccttc   cgtgaattac   cagaacaacg   ctggaaaggc   atggaaagta   22380   acgctaacgt   catgcagtcg   ttacaattac   gcaaagcgcc   taaaggcagt   tacgttgaac   22440     agctagatat   tgatttcttg   cgttttaaag   taccgcctaa   tgaaaaagat   tgcttgatcc   22500   cgcaacagtt   aatgatgatg   caagtggcag   acaatgctgc   gaaagacgga   ggtctagttg   22560   aaggtcgtaa   tgttgcggta   ttagtagcga   tgggcatgga   actggaatta   catcagtatc   22620   gtggtcgcgt   taatctaacc   acccaaattg   aagacagctt   attacagcaa   ggtattaacc   22680   tgactgttga   gcaacgtgaa   gaactgacca   atattgctaa   agacggtgtt   gcctcggctg   22740   cacagctaaa   tcagtatacg   agtttcattg   gtaatattat   ggcgtcacgt   atttcggcgt   22800   tatgggattt   ttctggtcct   gctattaccg   tatcggctga   agaaaactct   gtttatcgtt   22860   gtgttgaatt   agctgaaaat   ctatttcaaa   ccagtgatgt   tgaagccgtt   attattgctg   22920   ctgttgattt   gtctggttca   attgaaaaca   ttactttacg   tcagcactac   ggtccagtta   22980   atgaaaaggg   atctgtaagt   gaatgtggtc   cggttaatga   aagcagttca   gtaaccaaca   23040   atattcttga   tcagcaacaa   tggctggtgg   gtgaaggcgc   agcggctatt   gtcgttaaac   23100   cgtcatcgca   agtcactgct   gaacaagttt   atgcgcgtat   tgatgcggtg   agttttgccc   23160   ctggtagcaa   tgcgaaagca   attacgattg   cagcggataa   agcattaaca   cttgctggta   23220   tcagtgctgc   tgatgtagct   agtgttgaag   cacatgcaag   tggttttagt   gccgaaaata   23280   atgctgaaaa   aaccgcgtta   ccgactttat   acccaagcgc   aagtatcagt   tcggtgaaag   23340   ccaatattgg   tcatacgttt   aatgcctcgg   gtatggcgag   tattattaaa   acggcgctgc   23400   tgttagatca   gaatacgagt   caagatcaga   aaagcaaaca   tattgctatt   aacggtctag   23460   gtcgtgataa   cagctgcgcg   catcttatct   tatcgagttc   agcgcaagcg   catcaagttg   23520   caccagcgcc   tgtatctggt   atggccaagc   aacgcccaca   gttagttaaa   accatcaaac   23580   tcggtggtca   gttaattagc   aacgcgattg   ttaacagtgc   gagttcatct   ttacacgcta   23640   ttaaagcgca   gtttgccggt   aagcacttaa   acaaagttaa   ccagccagtg   atgatggata   23700   acctgaagcc   
ccaaggtatt   agcgctcatg   caaccaatga   gtatgtggtg   actggagctg   23760   ctaacactca   agcttctaac   attcaagcat   ctcatgttca   agcgtcaagt   catgcacaag   23820   agatagcacc   aaaccaagtt   caaaatatgc   aagctacagc   agccgctgta   agttcacccc   23880   tttctcaaca   tcaacacaca   gcgcagcccg   tagcggcacc   gagcgttgtt   ggagtgactg   23940   tgaaacataa   agcaagtaac   caaattcatc   agcaagcgtc   tacgcataaa   gcatttttag   24000   aaagtcgttt   agctgcacag   aaaaacctat   cgcaacttgt   tgaattgcaa   accaagctgt   24060   caatccaaac   tggtagtgac   aatacatcta   acaatactgc   gtcaacaagc   aatacagtgc   24120   taacaaatcc   tgtatcagca   acgccattaa   cacttgtgta   taatgcgcct   gtagtagcga   24180   caaacctaac   cagtacagaa   gcaaaagcgc   aagcagctgc   tacacaagct   ggttttcaga   24240   taaaaggacc   tgttggttac   aactatccac   cgctgcagtt   aattgaacgt   tataataaac   24300   cagaaaacgt   gatttacgat   caagctgatt   tggttgaatt   cgctgaaggt   gatattggta   24360   aggtatttgg   tgctgaatac   aatattattg   atggctattc   gcgtcgtgta   cgtctgccaa   24420   cctcagatta   cttgttagta   acacgtgtta   ctgaacttga   tgccaaggtg   catgaataca   24480   agaaatcata   catgtgtact   gaatatgatg   tgcctgttga   tgcaccgttc   ttaattgatg   24540   gtcagatccc   ttggtctgtt   gccgtcgaat   caggccagtg   tgatttgatg   ttgatttcat   24600   atatcggtat   tgatttccaa   gcgaaaggcg   aacgtgttta   ccgtttactt   gattgtgaat   24660   taactttcct   tgaagagatg   gcttttggtg   gcgatacttt   acgttacgag   atccacattg   24720   attcgtatgc   acgtaacggc   gagcaattat   tattcttctt   ccattacgat   tgttacgtag   24780   gggataagaa   ggtacttatc   atgcgtaatg   gttgtgctgg   tttctttact   gacgaagaac   24840   tttctgatgg   taaaggcgtt   attcataacg   acaaagacaa   agctgagttt   agcaatgctg   24900   ttaaatcatc   attcacgccg   ttattacaac   ataaccgtgg   tcaatacgat   tataacgaca   24960   tgatgaagtt   ggttaatggt   gatgttgcca   gttgttttgg   tccgcaatat   gatcaaggtg   25020   gccgtaatcc   atcattgaaa   ttctcgtctg   agaagttctt   gatgattgaa   cgtattacca   25080   agatagaccc   aaccggtggt   
cattggggac   taggcctgtt   agaaggtcag   aaagatttag   25140   accctgagca   ttggtatttc   ccttgtcact   ttaaaggtga   tcaagtaatg   gctggttcgt   25200   tgatgtcgga   aggttgtggc   caaatggcga   tgttcttcat   gctgtctctt   ggtatgcata   25260   ccaatgtgaa   caacgctcgt   ttccaaccac   taccaggtga   atcacaaacg   gtacgttgtc   25320   gtgggcaagt   actgccacag   cgcaatacct   taacttaccg   tatggaagtt   actgcgatgg   25380   gtatgcatcc   acagccattc   atgaaagcta   atattgatat   tttgcttgac   ggtaaagtgg   25440     ttgttgattt   caaaaacttg   agcgtgatga   tcagcgaaca   agatgagcat   tcagattacc   25500   ctgtaacact   gccgagtaat   gtggcgctta   aagcgattac   tgcacctgtt   gcgtcagtag   25560   caccagcatc   ttcacccgct   aacagcgcgg   atctagacga   acgtggtgtt   gaaccgttta   25620   agtttcctga   acgtccgtta   atgcgtgttg   agtcagactt   gtctgcaccg   aaaagcaaag   25680   gtgtgacacc   gattaagcat   tttgaagcgc   ctgctgttgc   tggtcatcat   agagtgccta   25740   accaagcacc   gtttacacct   tggcatatgt   ttgagtttgc   gacgggtaat   atttctaact   25800   gtttcggtcc   tgattttgat   gtttatgaag   gtcgtattcc   acctcgtaca   ccttgtggcg   25860   atttacaagt   tgttactcag   gttgtagaag   tgcagggcga   acgtcttgat   cttaaaaatc   25920   catcaagctg   tgtagctgaa   tactatgtac   cggaagacgc   ttggtacttt   actaaaaaca   25980   gccatgaaaa   ctggatgcct   tattcattaa   tcatggaaat   tgcattgcaa   ccaaatggct   26040   ttatttctgg   ttacatgggc   acgacgctta   aataccctga   aaaagatctg   ttcttccgta   26100   accttgatgg   tagcggcacg   ttattaaagc   agattgattt   acgcggcaag   accattgtga   26160   ataaatcagt   cttggttagt   acggctattg   ctggtggcgc   gattattcaa   agtttcacgt   26220   ttgatatgtc   tgtagatggc   gagctatttt   atactggtaa   agctgtattt   ggttacttta   26280   gtggtgaatc   actgactaac   caactgggca   ttgataacgg   taaaacgact   aatgcgtggt   26340   ttgttgataa   caataccccc   gcagcgaata   ttgatgtgtt   tgatttaact   aatcagtcat   26400   tggctctgta   taaagcgcct   gtggataaac   cgcattataa   attggctggt   ggtcagatga   26460   actttatcga   tacagtgtca   gtggttgaag   
gcggtggtaa   agcgggcgtg   gcttatgttt   26520   atggcgaacg   tacgattgat   gctgatgatt   ggttcttccg   ttatcacttc   caccaagatc   26580   cggtgatgcc   aggttcatta   ggtgttgaag   ctattattga   gttgatgcag   acctatgcgc   26640   ttaaaaatga   tttgggtggc   aagtttgcta   acccacgttt   cattgcgccg   atgacgcaag   26700   ttgattggaa   ataccgtggg   caaattacgc   cgctgaataa   acagatgtca   ctggacgtgc   26760   atatcactga   gatcgtgaat   gacgctggtg   aagtgcgaat   cgttggtgat   gcgaatctgt   26820   ctaaagatgg   tctgcgtatt   tatgaagtta   aaaacatcgt   tttaagtatt   gttgaagcgt   26880   aaagggtcaa   gtgtaacgtg   cttaagcgcc   gcattggtta   aagacgcttt   gcacgccgtg   26940   aatccgtcca   tggaggcttg   gggttggcat   ccatgccaac   aacagcaagc   ttactttaat   27000   caatacggct   tggtgtccat   ttagacgcct   cgaacttagt   agttaataga   caaaataatt   27060   tagctgtgga   atgaatatag   taagtaatca   ttcggcagct   acaaaaaagg   aattaagaat   27120   gtcgagttta   ggttttaaca   ataacaacgc   aattaactgg   gcttggaaag   tagatccagc   27180   gtcagttcat   acacaagatg   cagaaattaa   agcagcttta   atggatctaa   ctaaacctct   27240   ctatgtggcg   aataattcag   gcgtaactgg   tatagctaat   catacgtcag   tagcaggtgc   27300   gatcagcaat   aacatcgatg   ttgatgtatt   ggcgtttgcg   caaaagttaa   acccagaaga   27360   tctgggtgat   gatgcttaca   agaaacagca   cggcgttaaa   tatgcttatc   atggcggtgc   27420   gatggcaaat   ggtattgcct   cggttgaatt   ggttgttgcg   ttaggtaaag   cagggctgtt   27480   atgttcattt   ggtgctgcag   gtctagtgcc   tgatgcggtt   gaagatgcaa   ttcgtcgtat   27540   tcaagctgaa   ttaccaaatg   gcccttatgc   ggttaacttg   atccatgcac   cagcagaaga   27600   agcattagag   cgtggcgcgg   ttgaacgttt   cctaaaactt   ggcgtcaaga   cggtagaggc   27660   ttcagcttac   cttggtttaa   ctgaacacat   tgtttggtat   cgtgctgctg   gtctaactaa   27720   aaacgcagat   ggcagtgtta   atatcggtaa   caaggttatc   gctaaagtat   cgcgtaccga   27780   agttggtcgc   cgctttatgg   aacctgcacc   gcaaaaatta   ctggataagt   tattagaaca   27840   aaataagatc   acccctgaac   aagctgcttt   agcgttgctt   
gtacctatgg   ctgatgatat   27900   tactggggaa   gcggattctg   gtggtcatac   agataaccgt   ccgtttttaa   cattattacc   27960   gacgattatt   ggtctgcgtg   atgaagtgca   agcgaagtat   aacttctctc   ctgcattacg   28020   tgttggtgct   ggtggtggta   tcggaacgcc   tgaagcagca   ctcgctgcat   ttaacatggg   28080   cgcggcttat   atcgttctgg   gttctgtgaa   tcaggcgtgt   gttgaagcgg   gtgcatctga   28140   atatactcgt   aaactgttat   cgacagttga   aatggctgat   gtgactatgg   cacctgctgc   28200   agatatgttt   gaaatgggtg   tgaagctgca   agtattaaaa   cgcggttcta   tgttcgcgat   28260   gcgtgcgaag   aaactgtatg   acttgtatgt   ggcttatgac   tcgattgaag   atatcccagc   28320   tgctgaacgt   gagaagattg   aaaaacaaat   cttccgtgca   aacctagacg   agatttggga   28380   tggcactatc   gctttcttta   ctgaacgcga   tccagaaatg   ctagcccgtg   caacgagtag   28440     tcctaaacgt   aaaatggcac   ttatcttccg   ttggtatctt   ggcctttctt   cacgctggtc   28500   aaacacaggc   gagaagggac   gtgaaatgga   ttatcagatt   tgggcaggcc   caagtttagg   28560   tgcattcaac   agctgggtga   aaggttctta   ccttgaagac   tatacccgcc   gtggcgctgt   28620   agatgttgct   ttgcatatgc   ttaaaggtgc   tgcgtattta   caacgtgtaa   accagttgaa   28680   attgcaaggt   gttagcttaa   gtacagaatt   ggcaagttat   cgtacgagtg   attaatgtta   28740   cttgatgata   tgtgaattaa   ttaaagcgcc   tgagggcgct   ttttttggtt   tttaactcag   28800   gtgttgtaac   tcgaaattgc   ccctttcaag   ttagatcgat   tactcactca   caatatgttg   28860   atatcgcact   tgccatatac   ttgctcatcc   aaagccctat   attgataatg   gtgttaatag   28920   tctttaatat   ccgagtcttt   cttcagcata   atactaatat   agagactcga   ccaatgttaa   28980   acacaacaaa   gaatatattc   ttgtgtactg   ccttattatt   aacgagtgcg   agtacgacag   29040   ctactacgct   aaacaattcg   atatcagcaa   ttgaacaacg   tatttctggt   cgtatcggtg   29100   tggctgtttt   agatacgcaa   aataaacaaa   cgtgggctta   caatggtgat   gcacattttc   29160   cgatgatgag   tacattcaaa   accctcgctt   gcgcgaaaat   gctaagtgaa   tcgacaaatg   29220   gtaatctgga   tcccagtact   agctcattga   taaaggctga   agaattaatc   
ccttggtcac   29280   cagtcactaa   aacgtttgtg   aataacacta   ttacagtggc   gaaagcgtgt   gaagcaacaa   29340   tgctgaccag   tgataatacc   gcggctaata   ttgttttaca   gtatatcgga   ggccctcaag   29400   gcgttactgc   attcttgcga   gaaattggtg   atgaagagag   tcagttagat   cgtatagaac   29460   ctgaattgaa   tgaagctaag   gtcggagact   tgcgtgatac   cacgacaccg   aaagccatag   29520   ttaccacgct   caacaaacta   ctacttggtg   atgttctact   tgatttggat   aaaaaccaac   29580   ttaaaacatg   gatgcaaaat   aataaagtgt   cagatccttt   actgcgttct   atattaccgc   29640   aaggctggtt   tattgccgac   cgctcaggtg   cgggtggtaa   tggttctcga   ggtataactg   29700   ctatgctttg   gcactccgag   cgtcaaccgc   taatcatcag   tatttattta   accgaaactg   29760   agttagcaat   ggcaatgcgc   aatgagatta   ttgttgagat   cggtaagctg   atattcaaag   29820   aatacgcggt   gaaataataa   gttatttttt   gataatactt   taacgagcgt   agctatcgaa   29880   gtgagggcgt   caattagaca   cctttgcttc   ccctacaaaa   tctaatgtgt   attacctcgg   29940   ctagtacaat   tgccctaagt   tatttctgtc   cagctttggc   ttagtgcaat   tgcgttagcc   30000   aatgtgaaca   ccaagggact   ttgtcgtacc   ataactacca   agcgactttg   tcgtttttat   30060   cttttcttag   acaaacagag   gttaaatgag   tgacgccttc   caaatcacag   gaatgaatcc   30120   gcatttcaat   aaaatctaac   ccgtaccaac   tccgtacaag   ttgatcttta   gttgtttaaa   30180   atctataata   aattcaatta   cggaattaat   ccgtacaact   ggaggtttta   tggctactgc   30240   aagacttgat   atccgtttgg   atgaagaaat   caaagctaag   gctgagaaag   catcagcttt   30300   actcggctta   aaaagtttaa   ccgaatacgt   tgttcgctta   atggacgaag   attcaactaa   30360   agtagtttct   gagcatgaga   gtattaccgt   tgaagcgaat   gtattcgacc   aatttatggc   30420   tgcttgtgat   gaagcgaaag   ccccaaataa   agcattactt   gaagccgctg   tatttactca   30480   gaatggtgag   tttaagtgag   ttattccaaa   cgtttcaaag   aactggataa   atcaaaacat   30540   gacagagcat   catttgactg   tggcgaaaaa   gagctaaatg   attttatcca   aactcaagca   30600   gccaaacata   tgcaagcagg   tattagccgc   actctggttt   tacctgcttc   tgcgccgtta   30660   
ccaaacaaaa   aatatccaat   ttgctcattt   tatagtatcg   cgccaagctc   aattagccgc   30720   gatacgttac   cacaagcaat   ggctaaaaag   ttaccacgtt   atcctatccc   tgtttttctt   30780   ttggctcaac   ttgccgtcca   taaagagttt   catgggagtg   ggttaggcaa   agttagctta   30840   attaaagcgt   tagagtacct   ttgggaaatt   aactctcaca   tgagagctta   cgccatcgtt   30900   gttgattgtt   taactgaaca   agctgagtca   ttctacgcta   aatatggttt   cgacgttctc   30960   tgcgaaataa   atggtcgagt   aagaatgttc   atatcaatga   aaacagtcaa   tcagttattc   31020   acttaacagt   aagagttagt   ataacagttg   tatgaattaa   atttattata   ttcggtaatc   31080   tcattgcgat   cacgctagaa   gtgcgagcgg   gtcagaccga   ggccacaata   gcagccgtta   31140   cgtttagggg   atgacttaaa   aagataacta   ctacgtcagt   ggcgatccta   gaggattaaa   31200   ggtttatgat   tcacaacatt   tatttattgt   gcttaatttt   ttctatccaa   tatgcgcaag   31260   ctgtaaatat   cactgaagta   gacttttatg   tcagtgatga   tatccctaaa   gatgttgcca   31320   aattaaagat   aggtgaatcc   ataacgaact   ccagccttat   tctaagtaac   tcatctattc   31380   cactctcgcg   ggagacgggt   aacatatatt   actcttcatc   aattgctaac   ttgaactatg   31440     actcgataga   atttgttatg   gctcaattga   tggccgaaga   ttccagcctt   tacaagatgc   31500   tggtaaatag   cgataggttg   tccgtgctag   taatgacatc   ttcccagtcc   acagtctcta   31560   tggctcgact   tactcggctt   attttcctaa   tgttgcggtc   atcgatttga   attgtgactc   31620   gctaacttta   gaacatgagc   tcggccatct   atacggagct   gaacatgaag   aaatatatga   31680   cgactatgtc   ttctatgctg   cgatatgtgg   agactatacg   actatcatga   actctatgca   31740   gcctgaaatg   aaagaaaaac   aaatgataaa   ggcatattca   ttccctgaat   taaaagtgga   31800   tggcttgcag   tgcggaaatg   aaaatacgaa   taacaaaaag   gttattttag   acaatattgg   31860   tcggtttaga   taggattggg   atattattct   cattcggctc   tacttagtgc   tgttattatg   31920   agtgccagtg   cttctatcta   cgatattggt   cttaacaagt   atttatctat   agacgctaag   31980   gtgttatgta   tttaagggat   gttcaagatg   aaactaggtg   taaacgatgt   atagttgtat   32040   aacatttttt   
caacggttgg   aacgttcgat   tctatcgggt   aacaagaccg   cgacgatccg   32100   cgataagtcc   gatagtcatt   acttagttgg   tcagatgtta   gatgcttgta   ctcacgaaga   32160   taatcggaaa   atgtgtcaaa   tagaaatact   gagcattgaa   tatgtgacgt   ttagtgaatt   32220   aaaccgtgcg   cacgccaatg   ctgaaggttt   accgtttttg   tttatgctta   agtggatagt   32280   tcgaaagatt   tatccgactt   caaatgattt   atttttcata   agtttcagag   ttgtaactat   32340   cgatatctta   taagtcttag   tgcacaaaac   agaactattt   atagcgctca   agaaggcgat   32400   aatttgataa   tgaattatcg   ccttgttact   attaagagac   tttaaatgac   tgagatataa   32460   gatatgacac   ggaagaacat   attgatcaca   ggcgcaagtt   cagggttggg   ccgaggtatg   32520   gccatcgaat   ttgcaaaatc   aggtcataac   ttagcacttt   gtgcacgtag   acttgataat   32580   ttagttgcac   tgaaagcaga   actcttagcc   ctcaatcctc   acatccaaat   cgaaataaaa   32640   cctcttgatg   tcaatgaaca   tgaacaagtc   ttcactgttt   tccatgaatt   caaagctgaa   32700   tttggtacgc   ttgatcgtat   tattgttaat   gctggattag   gcaagggtgg   atccgtcggt   32760   acaggttttt   tcaaagctaa   tctgcaaact   gcacaaacta   attttattgc   ggcgctcgca   32820   caatgtgaag   cggcgctcga   aatctttagg   gcgcaaaatg   ctgggcacct   agtgacgatt   32880   tcttctatca   gcgctgtacg   aggattccgc   cgtgcgttaa   ctgtgtatgc   agctactaaa   32940   tcggcactaa   catcattaac   tgaaggtatc   aggattgacg   tgatggatac   gccaatcaaa   33000   gtgagttgta   ttcatcctgg   atttattcgc   accgagatga   atgaaaaagt   aaaaacagca   33060   cctttcatga   tagatgctga   agcgggttgt   aaagcgatag   tgaaagcaat   taataaagaa   33120   aaagcgaata   gttatgtacc   tagttaccct   tgggctatta   tgcacttatt   actacgtgtg   33180   gcgccaacgc   gtttgatccg   cagaatgagt   taatatcaca   gacgcatcaa   taaaatttta   33240   aggttctaga   aatgatgaag   tctcatgttt   ggttcaaggc   cggtgtagtc   atcatatatg   33300   gctcatctat   agatgcctct   cctcatcgtc   atcatgcaat   tcaattagcg   gcggtgttac   33360   ccaatcccaa   gcgaatgtct   gcagcaaccc   cttcttctta   tgtgctcagc   cgtgcggcac   33420   aaatttaaga   ctcggtgcga   
tcattaggcg   gatctgttta   cctgaaaaac   ttataacaaa   33480   agctatcgac   tgttgaattt   atcctgaatg   ctttaataga   gtgggctggt   ggcattacat   33540   gattggaaag   ctgaaagaca   agtcgttata   tttgcaggca   gtaaaattaa   cactggtatg   33600   gatacttttg   attctgtaaa   gttcagagta   tcagcccctt   aacgagcttt   ggtataaaca   33660   aatatgaata   atcgacagcc   taagaaaacc   tcttcgacta   tatcgacgct   caacgaatta   33720   gcgacgttag   caaactattc   actcatggac   acgctaaact   gtgatcctga   tgcgacagaa   33780   aacggcgacg   atcacgcgcc   gagacaagtc   ctttacgggt   cattatgttc   ccgtaaaacc   33840   gactccaatc   aaagaccctg   aatatgtagc   gcatagcaaa   aatttatttt   ctgaacttgg   33900   ctttgccgac   agtatggctg   agtccgctga   ttttgtccgg   atgttctctg   gtgatatgtc   33960   aggggttcca   gtaccaatgc   gccaggtagg   ttgggcgagt   ggctatgcac   tttccattta   34020   tggcaccgag   tacacccaac   agtgcccgtt   ccaaactggt   aacggatatg   gagacggacg   34080   tgcaatttca   gtgcttgaga   ccctcatcaa   gggtcaacgc   tgggaaatgc   agctgaaagg   34140   cggtggtcgt   acaccatatt   gccgcggcgc   agacggtcgc   gctgttttac   ggtctagtat   34200   tcgcgagttc   ttggctcaag   atcacatgca   tgcgctcggg   gtacctacat   cacggtcttt   34260   aagtctgtac   gtttcaaaaa   cggagacagt   taagcgacct   tggtactcac   agggctcgcg   34320   ttcagagaat   cccgacatgc   ttatatctga   agctgtcgct   atctcgacgc   gtgttgcacc   34380   gtcgttcatc   cgtgttggtc   aactcgaact   tttcgcgcgc   cgcagccgta   gtaatgaaca   34440     cccgaaagcg   atggaagaac   tcgagaagat   tgtgctgcac   ttgatcgatc   gtgaatacgc   34500   tgacgttatc   gatacgcagc   tagccactcc   agaaaaaatc   gtgttgctgg   ctcgcgagtt   34560   tcgtggccgc   cttacctcaa   tggttgcgaa   ttggatccgt   gttggatttt   gccaaggtaa   34620   ctttaacagt   gataactgcg   cagccggtgg   ttttacactt   gattatggtc   cctttggttt   34680   ttgtgatgtg   tttaatccgt   attatcaacc   ttggacgggg   gggggtaatc   acttctcgtt   34740   catgaaccaa   ccaaatgcag   cacaacgaaa   tttcgatatg   ttttgttcgg   cgttacggcc   34800   gttactggta   tctcatcagc   aggatttgct   
cgcgtttgac   gagatccaaa   gtgaattttt   34860   agcagtaatg   gatacgaaaa   tgaaggcgat   gtgggctact   aaattgggtc   ttattaattt   34920   gaagactgag   tctgataaag   cactgtgtaa   cgtactcatc   aaagagctac   aaacactcat   34980   gatgcaagca   cctgttgatt   acactatttt   cttccgcgaa   ctatcctcaa   ttcctgacga   35040   tattggccca   ctgaagaaaa   gtttttacag   taatctatac   aatgatgcag   cggatgatcc   35100   agatacctta   gcgttagaaa   aatactggat   tgagtggctc   gaaaaatggc   aaatgctcct   35160   taacagtact   tgtgacgcga   aaggtatctc   gtcccgagcc   agtgaggaca   tcgctatgca   35220   gatgaaactc   gtcaacccta   aatacgtttt   gcgagaatgg   ttcgtgatgc   cggcttatca   35280   gcaagccact   gcgggtgatt   attctctcat   tcaagagctg   caggccgtaa   tgacacagcc   35340   atatgcagag   cagtcgaagg   agctagagga   taaatactat   cgattgaaac   cgcttgagtt   35400   ctttgaggta   ggtggattgt   cccatcttag   ttgctcgtcg   tgaacgataa   cgcgtcggta   35460   catgtgtatc   gacgtatggg   cgcttaattt   ttattaatat   tagaaacaaa   aatcgccagc   35520   aaatgctggc   gttttaaaga   ttaatgtcaa   ttattacatc   atgcctatat   cacgtaggag   35580   atgtggcgat   aagcctttta   attgaatatc   taaagatttt   tcttttttat   cactaaataa   35640   aatgtcttta   gtgtgtttaa   tcagtccttt   gatagaaaca   gcataagctt   ttgtatctaa   35700   agcttgtggg   atcatattga   tgtgcgctgc   gtgtgccatt   ttagcctcta   tctgaattta   35760   ataatttatg   ttttaaccag   gtgatgtatt   gctcatctgg   tgaacatagt   agcgcattaa   35820   ataaccatgc   aataatgata   aaaaataaca   ctaagcatta   gttttgataa   tgcattcggc   35880   gctgtgtgac   actgtttact   gttttataga   tattcattca   ctttaattgc   atataaattg   35940   aattgtttac   tccaaatgta   gttaaaataa   gcacttgtta   catcaatgca   acaattatac   36000   gctgttaaaa   tagccttgat   ataccaatga   taaataattc   tgagtcttta   atatttaaaa   36060   tagatgaatt   taattcatta   gatatactat   tacgttgaat   tgcgatttac   atgcgcattt   36120   agtgtgtttt   ttattaaatg   aaaattattt   tgacgatttt   attaacatat   ataagaaata   36180   tgtgacttag   atctaagtaa   acgttaattt   atcgccgata   
aagcagtagt   aagcatgttg   36240   catatcaaac   cctctctata   gatctcaact   agcctcaatt   atcatcaagt   taactgtggt   36300   tttatttatt   gctcgtgcgt   tcagttatgc   ttaaccatga   gttaacttca   ttctaatatt   36360   tttaacttac   agtgaggggt   atactctcgg   ctcttagaaa   tagagagcca   aaacatgttt   36420   gaattcgtta   ctaattcctc   attgaaaaca   cacctattgc   ttatcaataa   tggctatcaa   36480   tagtggttta   ttgtttctta   cgccacggct   tatttttctg   aaaatgtact   aaatagataa   36540   attatcaata   aaaacacaca   tcacattaac   cgatgtaaac   agggaacatc   cccatgtatg   36600   aaaatgaaga   aaaactaacg   aaagcatttg   ttattgccgc   cataatttgg   ggcgttatag   36660   gcatgtgcat   gggtttaatg   gcagctctgc   agctatatct   accgcaattg   aattttgcta   36720   atgagtatat   aaatttcggg   aaaataagac   ccttgcatac   taacgccatc   atttttgggt   36780   tggtttgtaa   ctttattatc   ggtctgtcgt   tatacatagt   ggcaaaaaca   tcagtcgtga   36840   atctagtatc   caaaggttta   tcgtggttct   tgttctgggg   ttggcagata   acattggtaa   36900   tcggccttat   ctcaatcgct   ttagggtata   catcaaccaa   agaatacgct   gaatttgagt   36960   ggccaattga   tatcgctatt   gtggttctct   ggttaacgtt   tggatatatc   ttttttggaa   37020   cgctagcgaa   aagaaaaaca   aagcatatat   ttgtttcaaa   ctggttcagt   ggcggtgtca   37080   ttattgttat   cggcttaatt   tacttgataa   acaatttagc   cattcccgtg   tatgcattta   37140   aaggttattc   aatattttct   ggtgcgagtg   atgcgcttgt   acagtggtgg   tggggacata   37200   atgcagttgg   cttcttattg   acagctggct   ttgtaggtac   caactactat   ttcattccca   37260   agttagttaa   tagacccatt   tattcatatc   gactgtcttt   aattactttt   tggggtctaa   37320   tcggctttta   tacttgggct   ggtacacacc   atttactctt   tacatccgtt   ccatcttgga   37380   ttcaaaatat   tggcgtagtg   atgtctattt   tattatggat   cccgtcatgg   gctggcgcat   37440     ttaacgcttg   gatgacgtgt   acttccaata   aagaagaatt   gaaaacaaat   cccgttgtct   37500   ggtttttctt   atcgtcaatt   gcctattacg   cattagcaac   gtttgaaggg   cctcttatgg   37560   ctatcagatg   gttcaatatg   atagctcaca   ataccagttg   ggttatcgga   
cacgttcact   37620   ctggggcgtt   aggttgggtt   ggcatgacgt   gtatagcaac   cttctactat   ttcattccta   37680   agctatacaa   aaaagaactc   tactcatatg   gcttagttaa   ggtgcatttt   gtactcgctc   37740   acataggcgt   actgttctac   atagtctccc   tgtggatagg   gggtataggt   caaggtgtta   37800   aatcgttaag   cctcactgag   tctggttctc   tgacttattc   gtttgttgat   attttacgat   37860   ttatggaacc   ttatatgctc   ggacgtgcaa   ttggcggggc   gctgtttatc   ttgggtatgt   37920   tagtgatggt   atataacctc   atcatgacgg   tgaacaaacc   acaaaaagta   gttattgaag   37980   gagcatatta   atggaagagt   caatatccaa   gtcagtaatg   gcttttatca   ctatcacgac   38040   agtcgtggtg   ttattttcat   tctttgtgtg   ggttttccca   gggttcttct   tcaccaacga   38100   tcttaaagaa   ataacgacag   ctaaaccata   cacagcctta   gagttagctg   gacgggatgt   38160   gtatatggct   gaaggttgcg   tggcatgcca   tacccagatg   gttagaaact   tggaaccgga   38220   aagaaaaaga   tacggtcgtc   ctaataaaat   ggaagatgat   gtttatgagt   ttaacttttt   38280   gtggagctca   caaagaactg   gccctgattt   aacgaatatt   ggtttgaagt   acacacaagg   38340   ctggcacaaa   cagcatctca   tcaatcctca   ggcagttgtt   ccagcctcaa   tcatgccaca   38400   atatccgtgg   ctgtttgaaa   agcaacttaa   cgttggtcat   gttattgctt   caatgaaagc   38460   gatgaaaaaa   ctaggtgtgc   cgtatacaga   cacgcaaatt   gaaaattcat   caagcaaagt   38520   ggaaggtaaa   acaaaaggtg   atgcgcttgt   tgcttacttg   atgagtcttg   gcgtagatac   38580   gcgtgaaaaa   ggtggggatt   taaattaatg   ggatccatga   acatattatc   aagcgtacta   38640   tcgattatct   tcttttttat   catggttgcc   gttatttatt   cacagttccg   taagaccaaa   38700   actgcagaca   gtaataaaac   agtagagcaa   tttgatggaa   tagatgaaaa   agatgcacca   38760   attcctaagg   ttttctttgt   tgcgtatctt   attgcgttta   taggcgcaat   tgtttacgtc   38820   cttctatacc   caagtttagc   ttcttggaaa   gggtttatcg   gttggaccga   gaacgatgac   38880   gcgtatgtag   ctaaatcaat   tgatataaac   aataacatta   acgcaataat   caacgcgaat   38940   accgatgaac   aagtctttac   gctgttacaa   aaagatccgc   ttgttttgca   gagtggtaaa   39000   
tcgttatttg   gtgataattg   ttctgcttgt   catggtcagg   atgctaaggg   gcaatataac   39060   tacccgagtt   tagttgataa   agattggtta   tacggcggct   cacctcaaga   tgtctatacg   39120   accatacata   atggacgtaa   gggtaaaatg   ccagcttgga   aaggtgtact   gagcggtaaa   39180   gacatagatg   agcttaccca   gtatgtgtct   gagctaaata   aaggaccatt   taaaagcaat   39240   gcgcttttcg   atgctaattg   ttcatcatgt   cacggtaaag   aggctcaagg   ttcacatagc   39300   gtaggagccc   ctaacttaac   gaatgatatc   tggcttcatg   gttcaaccaa   tgctgatatc   39360   aaacgtaata   ttgagaatgg   catgtataac   gaaatgcctg   attttggtca   acgccttagc   39420   agaaatcaaa   tattgtcttt   aacctcttat   attgtgtccc   tacagagtga   accacaagat   39480   aatatcgata   ttatgcaagc   gaacacttat   atcttctctc   gaaacgaaca   gcaattgccg   39540   gcagtgctaa   cgacttgtgt   ggcctgtcat   ggcgcagatg   gtcttggtac   tttacctgga   39600   gcgcctaagt   tagcaggatt   aaagcaagcg   tatatctata   accaattaca   cttgtttgta   39660   tctggtttaa   gaaaaaatgc   aacgatgcaa   aatatagttg   ccgacttaga   tgtgaaagac   39720   aagttacttg   ctgctagcta   tttcagttca   ctcgattcac   cggcgataag   taaaattacc   39780   ccagagaaat   cagctgacgg   tatcatcaaa   gatcctactg   agcgcctgat   atttcaaggt   39840   gattggcaac   gcgctattcc   tgcttgttct   acttgtcatg   gtcaagaaac   gcaaggtagc   39900   ccatcatttc   caagattggc   aggtcaatca   tctgactatt   tagagaaaca   attatttgac   39960   tggcgaacag   gcgatagaac   cggtgatcaa   ggtcatatga   tgcaaaacgt   cgttaacaag   40020   ctacaagatg   atgaaattaa   atccctgtcg   aaatatttat   caaaaatgaa   ataacctgtg   40080   agccagttaa   aggccaatag   atcgaaggtt   aacagctcaa   agattaatag   gatactgtaa   40140   ttatgaaaat   gaataagtta   agaagggaaa   tcattaaagc   tggtggctat   gtcgctttag   40200   ctgctgcacc   attaacggct   ttctctaaag   agtttatgaa   atacggcaaa   atgtattcag   40260   atggtgaggg   agttagctat   gccgatggcc   ctaagcctgt   attaagcaat   tttccgcaaa   40320   aagataatgt   tgtgatcgta   catactcgac   cacctcatct   tgaaacgcct   tttaatgtat   40380   tcaatgaagg   
gctaataaca   ccaaacaacc   gtttctttgt   tcgttatcat   ctagctgacg   40440     tccccgttgc   catagacact   gataagtaca   ctattactat   ttcaggggct   gttaatgagg   40500   aagtgacatt   aagcttggct   gaattaaagt   cgattgaagg   ccaacaagaa   attgtcgcgg   40560   tacaacagtg   tactggtaat   agtcgaggtt   attcatctcc   acgtgttttt   ggtgcgcaat   40620   taagtaatgg   cgctatgggg   aatgcgaagt   tcaaaggcgt   gccacttaaa   aatgtgttag   40680   ctaaagcggg   aatttctagt   gctgcgacaa   gtgtcattat   cgatggtttg   gataagccgg   40740   ttcgagatac   cacaccagac   tttcaaaaat   cattacctat   tgatcatatt   atgacgggcg   40800   aacctatgct   tgtttgggaa   atgaatggtg   aacctttacc   atttttaaat   ggctttccag   40860   tgaaattaat   cgttccgggt   tggtatgcaa   catattgggt   taaacatgta   tcgcacctta   40920   aagttataga   gggtgagttt   gataactttg   atgcgttctt   tatgacaact   gcataccgtc   40980   tacctgataa   cgattccaag   agtgaattac   caactgccag   agcgaaaaag   acgttacctg   41040   taaatcgttt   cccaataaga   agttttgtta   ctagcttaga   aaatggtgat   gaagttaatg   41100   ctgcaactag   tattgaaatt   aaagggatag   cttttgatag   tggtagtggt   atcaaaaaag   41160   ttgaagtttc   agtcgatggt   ggcaataagt   ggatgcaagc   agcgcttggt   gaaaatcttg   41220   gtcgtttttc   ctttcgaggt   tggaagttaa   gccataattt   taatgaaaaa   ggcagaacgc   41280   ttgtgatggt   aagagctaca   ggtaagagtg   gagagacaca   acctcttaat   gcctcttgga   41340   atcatggcgg   ttataaccga   aacgcgattg   aacgaacaag   tattaaggtg   gtttaaatgc   41400   ggtttttact   tattatatta   gcgctatgtt   cattgactgt   taaagctgag   atcgtatcaa   41460   ttaccttacc   tatggataat   accaagctta   agccgtcgac   attaccagga   tatggcctcg   41520   cgcaatctaa   atgtcacctt   tgtcattcag   tcgattacgt   tatgtatcaa   ccaccagaaa   41580   tggatcc   41587   <210> 2 <211> 7959 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. 
(7956) <400> 2 atg gct aaa aag aac acc aca tcg att aag cac gcc aag gat gtg tta 48 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 agt agt gat gat caa cag tta aat tct cgc ttg caa gaa tgt ccg att 96 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 gcc atc att ggt atg gca tcg gtt ttt gca gat gct aaa aac ttg gat 144 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 caa ttc tgg gat aac atc gtt gac tct gtg gac gct att att gat gtg 192 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 cct agc gat cgc tgg aac att gac gac cat tac tcg gct gat aaa aaa 240 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 gca gct gac aag aca tac tgc aaa cgc ggt ggt ttc att cca gag ctt 288 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 gat ttt gat ccg atg gag ttt ggt tta ccg cca aat atc ctc gag tta 336 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 act gac atc gct caa ttg ttg tca tta att gtt gct cgt gat gta tta 384 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 agt gat gct ggc att ggt agt gat tat gac cat gat aaa att ggt atc 432 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 acg ctg ggt gtc ggt ggt ggt cag aaa caa att tcg cca tta acg tcg 480 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 cgc cta caa ggc ccg gta tta gaa aaa gta tta aaa gcc tca ggc att 528 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 gat gaa gat gat cgc gct atg atc atc gac aaa ttt aaa aaa gcc tac 576 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 atc ggc tgg gaa gag aac tca ttc cca ggc atg cta ggt aac gtt att 624 Ile Gly Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 gct ggt cgt atc gcc aat cgt ttt gat ttt ggt ggt act aac tgt gtg 672 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 
210 215 220 gtt gat gcg gca tgc gct ggc tcc ctt gca gct gtt aaa atg gcg atc 720 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 tca gac tta ctt gaa tat cgt tca gaa gt atg ata tcg ggt ggt gta 768 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 tgt tgt gat aac tcg cca ttc atg tat atg tca ttc tcg aaa aca cca 816 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 gca ttt acc acc aat gat gat atc cgt ccg ttt gat gac gat tca aaa 864 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 ggc atg ctg gtt ggt gaa ggt att ggc atg atg gcg ttt aaa cgt ctt 912 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 gaa gat gct gaa cgt gac ggc gac aaa att tat tct 960 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 atc ggt aca tct tca gat ggt cgt ttc aaa tct att tac gct cca cgc 1008 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Il e Tyr Ala Pro Arg 325 330 335 cca gat ggc caa gca aaa gcg cta aaa cgt gct tat gaa gat gcc ggt 1056 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340 345 350 350 ttt gcc cct gaa tac ggt cta att gaa ggc cat ggt acg ggt acc 1104 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 aaa gcg ggt gat gcc gca gaa ttt gct ggc ttg acc aaa cac ttt ggc 1152 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 gcc gcc agt gat gaa aag caa tat atc gcc tta ggc tta gtt aaa tcg 1200 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 caa att ggt cat act aaa tct gcg gct ggc tct gcg ggt atg att aag 1248 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 gcg gca tta gcg ctg cat cat aaa atc tta cct gca acg atc cat atc 1296 Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 gat aaa cca agt gaa gcc ttg gat atc aaa aac agc ccg tta tac cta 1344 Asp Lys Pro Ser Glu Ala Le u Asp Ile Lys 
Asn Ser Pro Leu Tyr Leu 435 440 445 aac agcc gaa acg cgt cct tgg atg cca cgt gaa gat ggt att cca cgt 1392 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 cgt gca ggt atc agc tca ttt ggt ttt ggc ggc acc aac ttc cat att 1440 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 475 att tta tta gaa gag tat cgc cca ggt cac gat agc gca tta aac 1488 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 tca gtg agc caa act gtg ttg atc tcg gca aac gac caa caa ggt att 1536 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 gtt gct gag tta aat aac tgg cgt act aaa ctg gct gtc gat gct gat 1584 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 525 cat caa ggg ttt gta ttt aat gag tta gtg aca acg tgg cca tta aaa 1632 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 acc cca tcc gtt aac caa gct cgt tta ggt ttt gtt gcg cgt aat gca 1680 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 aat gaa gcg atc gcg atg att gat acg gca ttg aaa caa ttc aat gcg 1728 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys G Ala 565 570 575 aac gca gat aaa atg aca tgg tca gta cct acc ggg gtt tac tat cgt 1776 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 caa gcc ggt att gat gca aca ggt aaa gtg gtt gcg cta ttc tca ggg 1824 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 caa ggt tcg caa tac gtg aac atg ggt cgt gaa tta acc tgt aac ttc 1872 Gln Gly Ser Gln Tyr Val Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 cca agc atg atg cac agt gct gcg gcg atg gat aaa gag ttc agt gcc 1920 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 gct ggt tta ggc cag tta tct gca gtt act ttc cct atc cct gtt tat 1968 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 acg gat gcc gag cgt aag cta caa gaa gag caa tta cgt tta acg caa 2016 Thr Asp 
Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 cat gcg caa cca gcg att ggt agt ttg agt gtt ggt ctg ttc aaa acg 2064 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 ttt aag caa gca ggt ttt aaa gct gat ttt gct gcc ggt cat agt ttc 2112 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 ggt gag tta acc gca tta tgg gct gcc gat gta ttg agc gaa agc gat 2160 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 tac atg atg tta gcg cgt agt cgt ggt caa gca atg gct gcg cca gag 2 Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 caa caa gat ttt gat gca ggt aag atg gcc gct gtt gtt ggt gat cca 2256 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 aag caa gtc gct gtg atc att gat acc ctt gat gat gtc tct att gct 2304 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala 755 760 765 aac ttc aac tcg aat aac caa gtt g tt att gct ggt act acg gag cag 2352 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 gtt gct gta gcg gtt aca acc tta ggt aat gct ggt ttc aaa gtt gtg 2400 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 cca ctg ccg gta tct gct gcg ttc cat aca cct tta gtt cgt cac gcg 2448 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 caa aaa cca ttt gct aaa gcg gtt gat agc gct aaa ttt aaa gcg cca 2496 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 agc att cca gtg ttt gct aat ggc acaggc tca agc aaa 2544 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 ccg aat gac att aag aaa aac ctg aaa aac cac atg ctg gaa tct gtt 2592 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 cat ttc aat caa gaa att gac aac atc tat gct gat ggt ggc cgc gta 2640 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 ttt atc ga a ttt ggt cca aag aat gta tta act aaa ttg 
gtt gaa aac 2688 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 att ctc act gaa aaa tct gat gtg act gct atc gcg gtt aat gct 2736 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 cct aaa caa cct gcg gac gta caa atg cgc caa gct gcg ctg caa atg 2784 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 gca gtg ctt ggt gtc gca tta gac aat att gac ccg tac gac gcc gtt 2832 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 aag cgt cca ctt gtt gcg cc gca tca cca atg ttg atg aag tta 2880 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 tct gca gcg tct tat gtt agt ccg aaa acg aag aaa gcg ttt gct gat 2928 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 gca ttg act gat ggc tgg act gtt aag caa gcg aaa gct gta cct gct 2976 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 98 0 985 990 gtt gtg tca caa cca caa gtg att gaa aag atc gtt gaa gtt gaa aag 3024 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 ata gtt gaa cgc att gtc gaa gt gag cgt gtc gaa gta gaa aaa 3072 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 atc gtc tac gtt aat gct gac ggt tcg ctt ata tcg caa aat aat caa 3120 Ile Val Tyr Val Asn Ala Asp Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 gac gtt aac agc gct gtt gtt agc aac gtg act aat agc tca gtg act 3168 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 cat agc agt gat gct gac ctt gtt gcc tct att gaa cgc agt gtt ggt 3216 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 caa ttt gtt gca cac caa cag caa tta tta aat gta cat gaa cag ttt 3264 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 atg caa ggt cca caa gac tac gcg aaa aca gtg cag aac gta ctt gct 3312 Met Gln Gly Pro Gln Asp Tyr A la Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 gcg cag 
acg agc aat gaa tta ccg gaa agt tta gac cgt aca ttg tct 3360 Ala Gln Thr Ser Sern Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 atg tat aac gag ttc caa tca gaa acg cta cgt gta cat gaa acg tac 3408 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 ctg aac aat cag acg agc aac atg aac acc atg ctt act ggt gaa 3456 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 gct gat gtg cta gca acc cca ata act cag gta gtg aat aca gcc gtt 3504 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 gcc act agt cac aag gta gtt gct cca gtt att gct aat aca gtg acg 3552 Ala Thr Ser His Lys Val Val Ala Pro Val Ile Ala Asn Thr Val Thr 1170 1175 1180 aat gtt gta tct agt gtc agt aat aac gcg gcg gtt gca gtg caa act 3600 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 gtg gca tta gcg cct acg caa gaa atc gct cca aca gtc gct act acg 3648 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 cca gca ccc gca ttg gtt gct atc gtg gct gaa cct gtg att gtt gcg 3696 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 cat gtt gct aca gaa gtt gca cca att aca cca tca gtt aca cca gtt 3744 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 gtc gca act caa gcg gct atc gat gta gca act att aac aaa gta atg 3792 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 tta gaa gtt gtt gct gat aaa acc ggt tat cca acg gat atg ctg gaa 3840 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 ctg agc atg gac atg gaa gct gac tta ggt atc gac tca atc aaa cgt 3888 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 gtt gag ata tta ggc gca gta cag gaa ttg atc cct gac tta cct gaa 3936 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 ctt aat cc t gaa gat ctt gct gag cta cgc acg ctt ggt gag att gtc 3984 
Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 gat tac atg aat tca aaa gcc cag gct gta gct cct aca aca gta 4032 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 gta aca agt gca cct gtt tcg cct gca tct gct ggt att gat tta gcc 4080 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 cac atc caa aac gta atg tta gaa gtg gtt gca gac aaa acc ggt tac 4128 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 cca aca gac atg cta gaag agc atg gat atg gaa gct gac tta ggt 4176 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 att gat tca atc aag cgt gtg gaa atc tta ggt gca gta cag gag atc 4224 Ile Asp Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 ata act gat tta cct gag cta aac cct gaa gat ctt gtt gaa tta cgc 4272 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 acc cta ggt gaa atc gtt agt tac atg caa agc aaa gcg cca gtc gct 4320 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 gaa agt gcgcca gg acg gct cct gta gca aca agc tca gca ccg 4368 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 tct atc gat ttg aac cac att caa aca gtg atg atg gat gta gtt gca 4416 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 gat aag act ggt tat cca act gac atg cta gaa ctt ggc atg gac atg 4464 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 gaa gct gat tta ggt atc gat tca atc aaa cgt gtg gaa ata tta ggc 4512 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 gca gtg cag gag atc atc act gat c ctta g aac cca gaa gac 4560 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 ctc gct gaa tta cgc acg cta ggt gaa atc gtt agt tac atg caa agc 4608 L eu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 aaa gcg 
cca gtc gct gag agt gcg cca gta gcg acg gct tct gta gca 4656 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 aca agc tct gca ccg tct atc gat tta aac cat atc caa aca gtg atg 4704 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 atg gaa gtg gtt gca gac aaa acc ggt tat cca gta gac atg tta gaa 4752 Met Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 ctt gct atg gac atg gaa gct gac cta ggt atc gat tca atc aag cgt 4800 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 gta gaa att tta ggt gcg gta cag gaa atc att act gac tta cct gag 4848 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 ctt aac cct gaa gat ctt gct gaa cta cgt aca tta ggt gaa atc gtt 4896 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 agt tac atg caa agc aaa gcg c cc gta gct gaa gcg cct gca gta cct 4944 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 gtt gca gta gaa agt gca cct act agt gta aca agc tca gca ccg tct 4992 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 atc gat tta gac cac atc caa aat gta atg atg gat gtt gtt gct gat 5040 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 aag act ggt tat cct gcc aat atg ctt gaa tta gca atg gac atg gaa 5088 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 gcc gac ctt ggt att gat tca atc aag cgt gtt att cta ggc gcg 5136 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 gta cag gag atc att act gat tta cct gaa cta aac cca gaa gac tta 5184 Val Gln Glu Ile Ile Thr Asp Leu Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 gct gaa cta cgt acg tta gaa gaa att gta acc tac atg caa agc aag 5232 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 173 0 1735 1740 gcg agt ggt gtt act gta aat gta gtg gct agc cct gaa aat aat gct 5280 Ala Ser 
Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 gta tca gat gca ttt atg caa agc aat gtg gcg act atc aca gcg gcc 5328 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 gca gaa cat aag gcg gaa ttt aaa ccg gcg ccg agc gca acc gtt gct 5376 Ala Glu His Lys Alu Glu Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 atc tct cgt cta agc tct atc agt aaa ata agc caa gat tgt aaa ggt 5424 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 gct aac gcc tta atc gta gct gat ggc act gat aat gct gtg tta ctt 5472 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 gca gac cac cta ttg caa act ggc tgg aat gta act gca ttg caa 5520 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 act tgg gta gct gta aca acg acg aaa gca ttt aat aag tca gtg aac 5568 Thr Trp Val Ala Val T hr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 ctg gtg act tta aat ggc gtt gat gaa act gaa atc aac aac att att 5616 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 act gct aac gca caa ttg gat gca gtt atc tat ctg cac gca agt agc 5664 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 gaa att aat gct atc gaa tac cca caa gca tct aag caa ctg atg 5712 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 tta gcc ttc tta tta gcg aaa ttg agt aaa gta act caa gcc gct aaa 5760 Leu Ala Phe Leu Leu Ala Lys Leu Sers Thr Gln Ala Ala Lys 1905 1910 1915 1920 gtg cgt ggc gcc ttt atg att gtt act cag cag ggt ggt tca tta ggt 5808 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 ttt gat gat atc gat tct gct aca agt cat gat gtg aaa aca gac cta 5856 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 gta caa agc ggc tta aac ggt tta gtt aag aca ctg tct cac gag tgg 5904 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 gat aac 
gta ttc tgt cgt gcg gtt gat att gct tcg tca tta acg gct 5952 Asp Asn Val Phe Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 gaa caa gtt gca agc ctt gtt agt gat gaa cta ctt gat gct aac act 6000 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 gta tta aca gaa gtg ggt tat caa caa gct ggt aaa ggc ctt gaa cgt 6048 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 atc acg tta act ggt gtg gct act gac agc tat gca tta aca gct ggc 60 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 aat aac atc gat gct aac tcg gta ttt tta gtg agt ggt ggc gca aaa 6144 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 ggt gta act gca cat tgt gtt gct cgt ata gct aaa gaa tat cag tct 6192 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 aag ttc atc tta ttg gga cgt tca acg ttc tca agt gac gaa ccg agc 6240 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 tgg gca agt ggt att act gat gaa gcg gcg tta aag aaa atg 6288 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 cag tct ttg att aca gca ggt gat aaa cca aca ccc gtt aag atc gta 6336 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 cag cta atc aaa cca atc caa gct aat cgt gaa att gcg caa acc ttg 6384 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 tct gca att acc gct gct ggt ggc caa gct gaa tat gtt tct gca gat 6432 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 gta act aat gca gca agc gta caa atg gca gtc gct cca gct atc gct 6480 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 aag ttc ggt gca atc act ggc atc att cat ggc gcg ggt gtg tta gct 6528 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 gac caa ttc att gag caa aaa aca ctg agt gat ttt gag tct gtt tac 6576 Asp Gln 
Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 agc act aaa att gac ggt ttg tta tcg cta cta tca gtc act gaa gca 6624 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 agc aac atc aag caa ttg gta ttg ttc tcg tca gcg gct ggt ttct Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 ggt aac ccc ggc cag tct gat tac tcg att gcc aat gag atc tta aat 6720 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 aaa acc gca tac cgc ttt aaa tca ttg cac cca caa gct caa gta ttg 6768 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 agc ttt aac tgg ggt cct tgg gac ggt atg gta acg cct gag ctt 6816 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 aaa cgt atg ttt gac caa cgt ggt gtt tac att att cca ctt gat gca 686 4 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 ggt gca cag tta ttg ctg aat gaa cta gcc gct aat gat aac cgt tgt 6912 Gly Ala Gln Leu Leu Leu Asn Glu Lea Ala Asn Asn Arg Cys 2290 2295 2300 cca caa atc ctc gtg ggt aat gac tta tct aaa gat gct agc tct gat 6960 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 caa aag tct gat gaa aag act gct gta aaa aag cca caa gtt agt 7008 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 cgt tta tca gat gct tta gta act aaa agt atc aaa gcg act aac agt 7056 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 agc tct tta tca aac aag act agt gct tta tca gac agt agt gct ttt 7104 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 cag gtt aac gaa aac cac ttt tta gct gac cac atg atc aaa ggc aat 7152 Gln Val Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 cag gta tta cca acg gta tg c gcg att gct tgg atg agt gat gca gca 7200 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 aaa gcg act tat agt 
aac cga gac tgt gca ttg aag tat gtc ggt ttc 7248 Lys A Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 gaa gac tat aaa ttg ttt aaa ggt gtg gtt ttt gat ggc aat gag gcg 7296 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu 2420 2425 2430 gcg gat tac caa atc caa ttg tcg cct gtg aca agg gcg tca gaa cag 7344 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 gat tct gaa gtc cgt att gcc gca agc ctg aaa agt gac 7392 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 ggt aaa cct gtg ttt cat tat gca gcg aca ata ttg tta gca act cag 7440 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 cca ctt aat gct gtg aag gta gaa ctt ccg aca ttg aca gaa agt gtt 7488 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Se r Val 2485 2490 2495 gat agc aac aat aaa gta act gat gaa gca caa gcg tta tac agc aat 7536 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 ggc acc ttg ttc cac ggt gaagt cag ggc att aag cag ata tta 7584 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 agt tgt gac gac aag ggc ctg cta ttg gct tgt cag ata acc gat gtt 7632 Ser Cys Asp Asp Lys Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 gca aca gct aag cag gga tcc ttc ccg tta gct gac aac aat atc ttt 7680 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 gcc aat gat ttg gtt tat cag gct atg ttg gtc tgg gtg cgc aaa caa 7728 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 ttt ggt tta ggt agc tta cct tcg gtg agg ggt tat 7776 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 cgt gaa gtg gtt gta gat gaa gta ttt tat ctg caa ctt aat gtt gtt 7824 Arg Glu Val Val V al Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 gag cat gat cta ttg ggt tca cgc ggc agt aaa gcc cgt tgt gat att 7872 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys 
Asp Ile 2610 2615 2620 caa ttg att gct gct gat atg caa tta ctt gcc gaa gtg aaa tca gcg 7920 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 caa gtc agt gtc agt gac att atg tga 7959 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 3 <211> 2652 <212> PRT <213> Moritella marina <400> 3 Met Ala Lys Lys Asn Thr Thr Ser Ile Lys His Ala Lys Asp Val Leu 1 5 10 15 Ser Ser Asp Asp Gln Gln Leu Asn Ser Arg Leu Gln Glu Cys Pro Ile 20 25 30 Ala Ile Ile Gly Met Ala Ser Val Phe Ala Asp Ala Lys Asn Leu Asp 35 40 45 Gln Phe Trp Asp Asn Ile Val Asp Ser Val Asp Ala Ile Ile Asp Val 50 55 60 Pro Ser Asp Arg Trp Asn Ile Asp Asp His Tyr Ser Ala Asp Lys Lys 65 70 75 80 Ala Ala Asp Lys Thr Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Leu 85 90 95 Asp Phe Asp Pro Met Glu Phe Gly Leu Pro Pro Asn Ile Leu Glu Leu 100 105 110 Thr Asp Ile Ala Gln Leu Leu Ser Leu Ile Val Ala Arg Asp Val Leu 115 120 125 Ser Asp Ala Gly Ile Gly Ser Asp Tyr Asp His Asp Lys Ile Gly Ile 130 135 140 Thr Leu Gly Val Gly Gly Gly Gln Lys Gln Ile Ser Pro Leu Thr Ser 145 150 155 160 Arg Leu Gln Gly Pro Val Leu Glu Lys Val Leu Lys Ala Ser Gly Ile 165 170 175 Asp Glu Asp Asp Arg Ala Met Ile Ile Asp Lys Phe Lys Lys Ala Tyr 180 185 190 Ile Gly Trp Glu Glu Asn Ser Phe Pro Gly Met Leu Gly Asn Val Ile 195 200 205 Ala Gly Arg Ile Ala Asn Arg Phe Asp Phe Gly Gly Thr Asn Cys Val 210 215 220 Val Asp Ala Ala Cys Ala Gly Ser Leu Ala Ala Val Lys Met Ala Ile 225 230 235 240 Ser Asp Leu Leu Glu Tyr Arg Ser Glu Val Met Ile Ser Gly Gly Val 245 250 255 Cys Cys Asp Asn Ser Pro Phe Met Tyr Met Ser Phe Ser Lys Thr Pro 260 265 270 Ala Phe Thr Thr Asn Asp Asp Ile Arg Pro Phe Asp Asp Asp Ser Lys 275 280 285 Gly Met Leu Val Gly Glu Gly Ile Gly Met Met Ala Phe Lys Arg Leu 290 295 300 Glu Asp Ala Glu Arg Asp Gly Asp Lys Ile Tyr Ser Val Leu Lys Gly 305 310 315 320 Ile Gly Thr Ser Ser Asp Gly Arg Phe Lys Ser Ile Tyr Ala Pro Arg 325 330 335 Pro Asp Gly Gln Ala Lys Ala Leu Lys Arg Ala Tyr Glu Asp Ala Gly 340
345 350 Phe Ala Pro Glu Thr Cys Gly Leu Ile Glu Gly His Gly Thr Gly Thr 355 360 365 Lys Ala Gly Asp Ala Ala Glu Phe Ala Gly Leu Thr Lys His Phe Gly 370 375 380 Ala Ala Ser Asp Glu Lys Gln Tyr Ile Ala Leu Gly Leu Val Lys Ser 385 390 395 400 Gln Ile Gly His Thr Lys Ser Ala Ala Gly Ser Ala Gly Met Ile Lys 405 410 415 Ala Ala Leu Ala Leu His His Lys Ile Leu Pro Ala Thr Ile His Ile 420 425 430 Asp Lys Pro Ser Glu Ala Leu Asp Ile Lys Asn Ser Pro Leu Tyr Leu 435 440 445 Asn Ser Glu Thr Arg Pro Trp Met Pro Arg Glu Asp Gly Ile Pro Arg 450 455 460 Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Thr Asn Phe His Ile 465 470 475 480 Ile Leu Glu Glu Tyr Arg Pro Gly His Asp Ser Ala Tyr Arg Leu Asn 485 490 495 Ser Val Ser Gln Thr Val Leu Ile Ser Ala Asn Asp Gln Gln Gly Ile 500 505 510 Val Ala Glu Leu Asn Asn Trp Arg Thr Lys Leu Ala Val Asp Ala Asp 515 520 525 His Gln Gly Phe Val Phe Asn Glu Leu Val Thr Thr Trp Pro Leu Lys 530 535 540 Thr Pro Ser Val Asn Gln Ala Arg Leu Gly Phe Val Ala Arg Asn Ala 545 550 555 560 Asn Glu Ala Ile Ala Met Ile Asp Thr Ala Leu Lys Gln Phe Asn Ala 565 570 575 Asn Ala Asp Lys Met Thr Trp Ser Val Pro Thr Gly Val Tyr Tyr Arg 580 585 590 Gln Ala Gly Ile Asp Ala Thr Gly Lys Val Val Ala Leu Phe Ser Gly 595 600 605 Gln Gly Ser Gln Tyr Val Asn Met Gly Arg Glu Leu Thr Cys Asn Phe 610 615 620 Pro Ser Met Met His Ser Ala Ala Ala Met Asp Lys Glu Phe Ser Ala 625 630 635 640 Ala Gly Leu Gly Gln Leu Ser Ala Val Thr Phe Pro Ile Pro Val Tyr 645 650 655 Thr Asp Ala Glu Arg Lys Leu Gln Glu Glu Gln Leu Arg Leu Thr Gln 660 665 670 His Ala Gln Pro Ala Ile Gly Ser Leu Ser Val Gly Leu Phe Lys Thr 675 680 685 Phe Lys Gln Ala Gly Phe Lys Ala Asp Phe Ala Ala Gly His Ser Phe 690 695 700 Gly Glu Leu Thr Ala Leu Trp Ala Ala Asp Val Leu Ser Glu Ser Asp 705 710 715 720 Tyr Met Met Leu Ala Arg Ser Arg Gly Gln Ala Met Ala Ala Pro Glu 725 730 735 Gln Gln Asp Phe Asp Ala Gly Lys Met Ala Ala Val Val Gly Asp Pro 740 745 750 Lys Gln Val Ala Val Ile Ile Asp Thr Leu Asp Asp Val Ser Ile Ala
755 760 765 Asn Phe Asn Ser Asn Asn Gln Val Val Ile Ala Gly Thr Thr Glu Gln 770 775 780 Val Ala Val Ala Val Thr Thr Leu Gly Asn Ala Gly Phe Lys Val Val 785 790 795 800 Pro Leu Pro Val Ser Ala Ala Phe His Thr Pro Leu Val Arg His Ala 805 810 815 Gln Lys Pro Phe Ala Lys Ala Val Asp Ser Ala Lys Phe Lys Ala Pro 820 825 830 Ser Ile Pro Val Phe Ala Asn Gly Thr Gly Leu Val His Ser Ser Lys 835 840 845 Pro Asn Asp Ile Lys Lys Asn Leu Lys Asn His Met Leu Glu Ser Val 850 855 860 His Phe Asn Gln Glu Ile Asp Asn Ile Tyr Ala Asp Gly Gly Arg Val 865 870 875 880 Phe Ile Glu Phe Gly Pro Lys Asn Val Leu Thr Lys Leu Val Glu Asn 885 890 895 Ile Leu Thr Glu Lys Ser Asp Val Thr Ala Ile Ala Val Asn Ala Asn 900 905 910 Pro Lys Gln Pro Ala Asp Val Gln Met Arg Gln Ala Ala Leu Gln Met 915 920 925 Ala Val Leu Gly Val Ala Leu Asp Asn Ile Asp Pro Tyr Asp Ala Val 930 935 940 Lys Arg Pro Leu Val Ala Pro Lys Ala Ser Pro Met Leu Met Lys Leu 945 950 955 960 Ser Ala Ala Ser Tyr Val Ser Pro Lys Thr Lys Lys Ala Phe Ala Asp 965 970 975 Ala Leu Thr Asp Gly Trp Thr Val Lys Gln Ala Lys Ala Val Pro Ala 980 985 990 Val Val Ser Gln Pro Gln Val Ile Glu Lys Ile Val Glu Val Glu Lys 995 1000 1005 Ile Val Glu Arg Ile Val Glu Val Glu Arg Ile Val Glu Val Glu Lys 1010 1015 1020 Ile Val Tyr Val Asn Ala Asp Gly Ser Leu Ile Ser Gln Asn Asn Gln 1025 1030 1035 1040 Asp Val Asn Ser Ala Val Val Ser Asn Val Thr Asn Ser Ser Val Thr 1045 1050 1055 His Ser Ser Asp Ala Asp Leu Val Ala Ser Ile Glu Arg Ser Val Gly 1060 1065 1070 Gln Phe Val Ala His Gln Gln Gln Leu Leu Asn Val His Glu Gln Phe 1075 1080 1085 Met Gln Gly Pro Gln Asp Tyr Ala Lys Thr Val Gln Asn Val Leu Ala 1090 1095 1100 Ala Gln Thr Ser Asn Glu Leu Pro Glu Ser Leu Asp Arg Thr Leu Ser 1105 1110 1115 1120 Met Tyr Asn Glu Phe Gln Ser Glu Thr Leu Arg Val His Glu Thr Tyr 1125 1130 1135 Leu Asn Asn Gln Thr Ser Asn Met Asn Thr Met Leu Thr Gly Ala Glu 1140 1145 1150 Ala Asp Val Leu Ala Thr Pro Ile Thr Gln Val Val Asn Thr Ala Val 1155 1160 1165 Ala Thr Ser His Lys Val Val Ala
Pro Val Ile Ala Asn Thr Val Thr Thr 1170 1175 1180 Asn Val Val Ser Ser Val Ser Asn Asn Ala Ala Val Ala Val Gln Thr 1185 1190 1195 1200 Val Ala Leu Ala Pro Thr Gln Glu Ile Ala Pro Thr Val Ala Thr Thr 1205 1210 1215 Pro Ala Pro Ala Leu Val Ala Ile Val Ala Glu Pro Val Ile Val Ala 1220 1225 1230 His Val Ala Thr Glu Val Ala Pro Ile Thr Pro Ser Val Thr Pro Val 1235 1240 1245 Val Ala Thr Gln Ala Ala Ile Asp Val Ala Thr Ile Asn Lys Val Met 1250 1255 1260 Leu Glu Val Val Ala Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu 1265 1270 1275 1280 Leu Ser Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1285 1290 1295 Val Glu Ile Leu Gly Ala Val Gln Glu Leu Ile Pro Asp Leu Pro Glu 1300 1305 1310 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1315 1320 1325 Asp Tyr Met Asn Ser Lys Ala Gln Ala Val Ala Pro Thr Thr Val Pro 1330 1335 1340 Val Thr Ser Ala Pro Val Ser Pro Ala Ser Ala Gly Ile Asp Leu Ala 1345 1350 1355 1360 His Ile Gln Asn Val Met Leu Glu Val Val Ala Asp Lys Thr Gly Tyr 1365 1370 1375 Pro Thr Asp Met Leu Glu Leu Ser Met Asp Met Glu Ala Asp Leu Gly 1380 1385 1390 Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala Val Gln Glu Ile 1395 1400 1405 Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu Val Glu Leu Arg 1410 1415 1420 Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser Lys Ala Pro Val Ala 1425 1430 1435 1440 Glu Ser Ala Pro Val Ala Thr Ala Pro Val Ala Thr Ser Ser Ala Pro 1445 1450 1455 Ser Ile Asp Leu Asn His Ile Gln Thr Val Met Met Asp Val Val Ala 1460 1465 1470 Asp Lys Thr Gly Tyr Pro Thr Asp Met Leu Glu Leu Gly Met Asp Met 1475 1480 1485 Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly 1490 1495 1500 Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp 1505 1510 1515 1520 Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val Ser Tyr Met Gln Ser 1525 1530 1535 Lys Ala Pro Val Ala Glu Ser Ala Pro Val Ala Thr Ala Ser Val Ala 1540 1545 1550 Thr Ser Ser Ala Pro Ser Ile Asp Leu Asn His Ile Gln Thr Val Met 1555 1560 1565 Met Glu Val Val Ala Asp
Lys Thr Gly Tyr Pro Val Asp Met Leu Glu 1570 1575 1580 Leu Ala Met Asp Met Glu Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg 1585 1590 1595 1600 Val Glu Ile Leu Gly Ala Val Gln Glu Ile Ile Thr Asp Leu Pro Glu 1605 1610 1615 Leu Asn Pro Glu Asp Leu Ala Glu Leu Arg Thr Leu Gly Glu Ile Val 1620 1625 1630 Ser Tyr Met Gln Ser Lys Ala Pro Val Ala Glu Ala Pro Ala Val Pro 1635 1640 1645 Val Ala Val Glu Ser Ala Pro Thr Ser Val Thr Ser Ser Ala Pro Ser 1650 1655 1660 Ile Asp Leu Asp His Ile Gln Asn Val Met Met Asp Val Val Ala Asp 1665 1670 1675 1680 Lys Thr Gly Tyr Pro Ala Asn Met Leu Glu Leu Ala Met Asp Met Glu 1685 1690 1695 Ala Asp Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Gly Ala 1700 1705 1710 Val Gln Glu Ile Ile Thr Asp Leu Pro Glu Leu Asn Pro Glu Asp Leu 1715 1720 1725 Ala Glu Leu Arg Thr Leu Glu Glu Ile Val Thr Tyr Met Gln Ser Lys 1730 1735 1740 Ala Ser Gly Val Thr Val Asn Val Val Ala Ser Pro Glu Asn Asn Ala 1745 1750 1755 1760 Val Ser Asp Ala Phe Met Gln Ser Asn Val Ala Thr Ile Thr Ala Ala 1765 1770 1775 Ala Glu His Lys Ala Glu Phe Lys Pro Ala Pro Ser Ala Thr Val Ala 1780 1785 1790 Ile Ser Arg Leu Ser Ser Ile Ser Lys Ile Ser Gln Asp Cys Lys Gly 1795 1800 1805 Ala Asn Ala Leu Ile Val Ala Asp Gly Thr Asp Asn Ala Val Leu Leu 1810 1815 1820 Ala Asp His Leu Leu Gln Thr Gly Trp Asn Val Thr Ala Leu Gln Pro 1825 1830 1835 1840 Thr Trp Val Ala Val Thr Thr Thr Lys Ala Phe Asn Lys Ser Val Asn 1845 1850 1855 Leu Val Thr Leu Asn Gly Val Asp Glu Thr Glu Ile Asn Asn Ile Ile 1860 1865 1870 Thr Ala Asn Ala Gln Leu Asp Ala Val Ile Tyr Leu His Ala Ser Ser 1875 1880 1885 Glu Ile Asn Ala Ile Glu Tyr Pro Gln Ala Ser Lys Gln Gly Leu Met 1890 1895 1900 Leu Ala Phe Leu Leu Ala Lys Leu Ser Lys Val Thr Gln Ala Ala Lys 1905 1910 1915 1920 Val Arg Gly Ala Phe Met Ile Val Thr Gln Gln Gly Gly Ser Leu Gly 1925 1930 1935 Phe Asp Asp Ile Asp Ser Ala Thr Ser His Asp Val Lys Thr Asp Leu 1940 1945 1950 Val Gln Ser Gly Leu Asn Gly Leu Val Lys Thr Leu Ser His Glu Trp 1955 1960 1965 Asp Asn Val Phe
Cys Arg Ala Val Asp Ile Ala Ser Ser Leu Thr Ala 1970 1975 1980 Glu Gln Val Ala Ser Leu Val Ser Asp Glu Leu Leu Asp Ala Asn Thr 1985 1990 1995 2000 Val Leu Thr Glu Val Gly Tyr Gln Gln Ala Gly Lys Gly Leu Glu Arg 2005 2010 2015 Ile Thr Leu Thr Gly Val Ala Thr Asp Ser Tyr Ala Leu Thr Ala Gly 2020 2025 2030 Asn Asn Ile Asp Ala Asn Ser Val Phe Leu Val Ser Gly Gly Ala Lys 2035 2040 2045 Gly Val Thr Ala His Cys Val Ala Arg Ile Ala Lys Glu Tyr Gln Ser 2050 2055 2060 Lys Phe Ile Leu Leu Gly Arg Ser Thr Phe Ser Ser Asp Glu Pro Ser 2065 2070 2075 2080 Trp Ala Ser Gly Ile Thr Asp Glu Ala Ala Leu Lys Lys Ala Ala Met 2085 2090 2095 Gln Ser Leu Ile Thr Ala Gly Asp Lys Pro Thr Pro Val Lys Ile Val 2100 2105 2110 Gln Leu Ile Lys Pro Ile Gln Ala Asn Arg Glu Ile Ala Gln Thr Leu 2115 2120 2125 Ser Ala Ile Thr Ala Ala Gly Gly Gln Ala Glu Tyr Val Ser Ala Asp 2130 2135 2140 Val Thr Asn Ala Ala Ser Val Gln Met Ala Val Ala Pro Ala Ile Ala 2145 2150 2155 2160 Lys Phe Gly Ala Ile Thr Gly Ile Ile His Gly Ala Gly Val Leu Ala 2165 2170 2175 Asp Gln Phe Ile Glu Gln Lys Thr Leu Ser Asp Phe Glu Ser Val Tyr 2180 2185 2190 Ser Thr Lys Ile Asp Gly Leu Leu Ser Leu Leu Ser Val Thr Glu Ala 2195 2200 2205 Ser Asn Ile Lys Gln Leu Val Leu Phe Ser Ser Ala Ala Gly Phe Tyr 2210 2215 2220 Gly Asn Pro Gly Gln Ser Asp Tyr Ser Ile Ala Asn Glu Ile Leu Asn 2225 2230 2235 2240 Lys Thr Ala Tyr Arg Phe Lys Ser Leu His Pro Gln Ala Gln Val Leu 2245 2250 2255 Ser Phe Asn Trp Gly Pro Trp Asp Gly Gly Met Val Thr Pro Glu Leu 2260 2265 2270 Lys Arg Met Phe Asp Gln Arg Gly Val Tyr Ile Ile Pro Leu Asp Ala 2275 2280 2285 Gly Ala Gln Leu Leu Leu Asn Glu Leu Ala Ala Asn Asp Asn Arg Cys 2290 2295 2300 Pro Gln Ile Leu Val Gly Asn Asp Leu Ser Lys Asp Ala Ser Ser Asp 2305 2310 2315 2320 Gln Lys Ser Asp Glu Lys Ser Thr Ala Val Lys Lys Pro Gln Val Ser 2325 2330 2335 Arg Leu Ser Asp Ala Leu Val Thr Lys Ser Ile Lys Ala Thr Asn Ser 2340 2345 2350 Ser Ser Leu Ser Asn Lys Thr Ser Ala Leu Ser Asp Ser Ser Ala Phe 2355 2360 2365 Gln Val
Asn Glu Asn His Phe Leu Ala Asp His Met Ile Lys Gly Asn 2370 2375 2380 Gln Val Leu Pro Thr Val Cys Ala Ile Ala Trp Met Ser Asp Ala Ala 2385 2390 2395 2400 Lys Ala Thr Tyr Ser Asn Arg Asp Cys Ala Leu Lys Tyr Val Gly Phe 2405 2410 2415 Glu Asp Tyr Lys Leu Phe Lys Gly Val Val Phe Asp Gly Asn Glu Ala 2420 2425 2430 Ala Asp Tyr Gln Ile Gln Leu Ser Pro Val Thr Arg Ala Ser Glu Gln 2435 2440 2445 Asp Ser Glu Val Arg Ile Ala Ala Lys Ile Phe Ser Leu Lys Ser Asp 2450 2455 2460 Gly Lys Pro Val Phe His Tyr Ala Ala Thr Ile Leu Leu Ala Thr Gln 2465 2470 2475 2480 Pro Leu Asn Ala Val Lys Val Glu Leu Pro Thr Leu Thr Glu Ser Val 2485 2490 2495 Asp Ser Asn Asn Lys Val Thr Asp Glu Ala Gln Ala Leu Tyr Ser Asn 2500 2505 2510 Gly Thr Leu Phe His Gly Glu Ser Leu Gln Gly Ile Lys Gln Ile Leu 2515 2520 2525 Ser Cys Asp Asp Lys Gly Leu Leu Leu Ala Cys Gln Ile Thr Asp Val 2530 2535 2540 Ala Thr Ala Lys Gln Gly Ser Phe Pro Leu Ala Asp Asn Asn Ile Phe 2545 2550 2555 2560 Ala Asn Asp Leu Val Tyr Gln Ala Met Leu Val Trp Val Arg Lys Gln 2565 2570 2575 Phe Gly Leu Gly Ser Leu Pro Ser Val Thr Thr Ala Trp Thr Val Tyr 2580 2585 2590 Arg Glu Val Val Val Asp Glu Val Phe Tyr Leu Gln Leu Asn Val Val 2595 2600 2605 Glu His Asp Leu Leu Gly Ser Arg Gly Ser Lys Ala Arg Cys Asp Ile 2610 2615 2620 Gln Leu Ile Ala Ala Asp Met Gln Leu Leu Ala Glu Val Lys Ser Ala 2625 2630 2635 2640 Gln Val Ser Val Ser Asp Ile Leu Asn Asp Met Ser 2645 2650 <210> 4 <211> 2598 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) ..
(2595) <400> 4 atg acg gaa tta gct gtt att ggt atg gat gct aaa ttt agc gga caa 48 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 gac aat att gac cgt gtg gaa cgc gct ttc tat gaa ggt gct tat gta 96 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Gly Ala Tyr Val 20 25 30 ggt aat gtt agc cgc gtt agt acc gaa tct aat gtt att agc aat ggc 144 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 gaa gaa caa gtt att act gcc atg aca gtt ctt aac tct gtc agt cta 192 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 cta gcg caa acg aat cag tta aat ata gct gat atc gcg gtg ttg ctg 240 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 att gct gat gta aaa agt gct gat gat cag ctt gta gtc att gca 288 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 tca gca att gaa aaa cag tgt gcg agt tgt gtt gtt att gct gat tta 336 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 ggc caa gca tta aat caa gta gct gat tta gtt aat aac caa gac tgt 384 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 cct gtg gct gta att ggc atg aat aac tcg gtt tatat cgt cat 432 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 gat ctt gaa tct gta act gca aca atc agc ttt gat gaa acc ttc aat 480 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 ggt tat aac aat gta gct ggg ttc gcg agt tta ctt atc gct tca act 528 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 gcg ttt gcc aat gct aag caa tgt tat ata tac gcc aac att aag ggc 576 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 ttc gct caa tcg ggc gta aat gct caa ttt aac gtt gga aac att ag ag Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 gat act gca aag acc gca ttg cag caa gct agc ata act gca gag cag 672 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln210 215 220 
gtt ggt ttg tta gaa gtg tca gca gtc gct gat tcg gca atc gca ttg 720 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 tct gaa agc caa ggt tta atg tct gct cat cat acg caa act ttg 768 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 cat act gca tta agc agt gcc cgt agt gtg act ggt gaa ggc ggg tgt 816 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 ttt tca cag gtc gca ggt tta ttg aaa tgt gta att ggt tta cat caa 864 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 cgt tat att ccg gcg att aaa gat tgg caa caa ccg agt gac aat caa 912 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 atg tca cgg tgg cgg aat tca cca ttc tat atg cct gta gat gct 960 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 cct tgg ttc cca cat gct gat ggc tct gca cac att gcc gct tat agt 1008 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Il e Ala Ala Tyr Ser 325 330 335 tgt gtg act gct gac agc tat tgt cat att ctt tta caa gaa aac gtc 1056 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 tta caa gaa ctt gtt ttg aaa gaa aca gtc ttg caa gat aat gac tta 1104 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 act gaa agc aag ctt cag act ctt gaa caa aac aat cca gta gct gat 1152 Thr Glu Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 ctg cgc act aat ggt tac ttt gca tcg agc gag tta gca tta atc ata 1200 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 gta caa ggt aat gac gaa gca caa tta cgc tgt gaa tta gaa act att 1248 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 aca ggg cag tta agt act act ggc ata agt act atc agt att aaa cag 1296 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 atc gca gca gac tgt tat gcc cgt aat gat act aac aaa gcc tat agc 1344 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn 
Lys Ala Tyr Ser 435 440 445 gca gtg ctt att gcc gag act gct gaa gag tta agc aaa gaa ata acc 1392 Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 ttg gc ttt gct ggt atc gct agc gtg ttt aat gaa gat gct aaa gaa 1440 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 475 480 tgg aaa acc ccg aag ggc agt tat ttt acc gcag cag aaa 1488 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 cag gct gct aac agc aca cag aat ggt gtc acc ttc atg tac cca ggt 1536 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 att ggt gct aca tat gtt ggt tta ggg cgt gat cta ttt cat cta ttc 1584 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 cca cag att tat cag cct gta gcg gct tta gcc gat gac att ggc gaa 1632 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 agt cta aaa gat act tta ctt aat cca cgc agt att agt cgt cat agc 1680 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 ttt aaa gaa ctc aag cag ttg gat ctg gac ctg cgc ggt aac tta gcc 1728 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Ala 565 570 575 aat atc gct gaa gcc ggt gtg ggt ttt gct tgt gtg ttt acc aag gta 1776 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 ttt gaa gaa gtc ttt gcc gtt gtt aaa ttt gct aca ggt tat agc 1824 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 atg ggt gaa gta agc atg tat gca gca cta ggc tgc tgg cag caa ccg 1872 Met Gly Glu Val Ser Met Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 gga ttg atg agt gct cgc ctt gca caa tcg aat acc ttt aat cat caa 1920 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 630 635 640 ctt tgc ggc gag tta aga aca cta cgt cag cat tgg ggc atg gat gat 1968 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 gta gct aac ggt acg ttc gag cag atc tgg gaa acctat cc att aag 2016 Val Ala Asn Gly Thr Phe 
Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 gca acg att gaa cag gtc gaa att gcc tct gca gat gaa gat cgt gtg 2064 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 tat tgc acc att atc aat aca cct gat agc ttg ttg tta gcc ggt tat 2112 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 cca gaa gcc tgt ca cga gtc att aag aat tta ggt gtg cgt gca atg 2160 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 gca ttg aat atg gcg aac gca att cac agc gcg cca gc tat gcc Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 tac gat cat atg gtt gag cta tac cat atg gat gtt act cca cgt att 2256 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 aat acc aag atg tat tca agc tca tgt tat tta ccg att cca caa cgc 2304 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 agc aaa gcg att tcc cac agt att gc t aaa tgt ttg tgt gat gtg gtg 2352 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 gat ttc cca cgt ttg gtt aat acc tta cat gac aaa ggt gcg cgg gta 2400 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 ttc att gaa atg ggt cca ggt cgt tcg tta tgt agc tgg gta gat aag 2448 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 atc tta gtt aat ggc gat ggc gat aat aaa aag caa agc caa cat gta 2496 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 tct gtt cct gtg aat gcc aaa ggc acc agt gatga act tat att 2544 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 cgt gcg att gct aag tta att agt cat ggc gtg aat ttg aat tta gat 2592 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu Asn Leu Asp 850 855 860 agc tag 2598 Ser 865 <210> 5 <211> 865 <212> PRT <213> Moritella marina <400> 5 Met Thr Glu Leu Ala Val Ile Gly Met Asp Ala Lys Phe Ser Gly Gln 1 5 10 15 Asp Asn Ile Asp Arg Val Glu Arg Ala Phe Tyr Glu Glu Gly Ala Tyr 
Val 20 25 30 Gly Asn Val Ser Arg Val Ser Thr Glu Ser Asn Val Ile Ser Asn Gly 35 40 45 Glu Glu Gln Val Ile Thr Ala Met Thr Val Leu Asn Ser Val Ser Leu 50 55 60 Leu Ala Gln Thr Asn Gln Leu Asn Ile Ala Asp Ile Ala Val Leu Leu 65 70 75 80 Ile Ala Asp Val Lys Ser Ala Asp Asp Gln Leu Val Val Gln Ile Ala 85 90 95 Ser Ala Ile Glu Lys Gln Cys Ala Ser Cys Val Val Ile Ala Asp Leu 100 105 110 Gly Gln Ala Leu Asn Gln Val Ala Asp Leu Val Asn Asn Gln Asp Cys 115 120 125 Pro Val Ala Val Ile Gly Met Asn Asn Ser Val Asn Leu Ser Arg His 130 135 140 Asp Leu Glu Ser Val Thr Ala Thr Ile Ser Phe Asp Glu Thr Phe Asn 145 150 155 160 Gly Tyr Asn Asn Val Ala Gly Phe Ala Ser Leu Leu Ile Ala Ser Thr 165 170 175 Ala Phe Ala Asn Ala Lys Gln Cys Tyr Ile Tyr Ala Asn Ile Lys Gly 180 185 190 Phe Ala Gln Ser Gly Val Asn Ala Gln Phe Asn Val Gly Asn Ile Ser 195 200 205 Asp Thr Ala Lys Thr Ala Leu Gln Gln Ala Ser Ile Thr Ala Glu Gln 210 215 220 Val Gly Leu Leu Glu Val Ser Ala Val Ala Asp Ser Ala Ile Ala Leu 225 230 235 240 Ser Glu Ser Gln Gly Leu Met Ser Ala Tyr His His Thr Gln Thr Leu 245 250 255 His Thr Ala Leu Ser Ser Ala Arg Ser Val Thr Gly Glu Gly Gly Cys 260 265 270 Phe Ser Gln Val Ala Gly Leu Leu Lys Cys Val Ile Gly Leu His Gln 275 280 285 Arg Tyr Ile Pro Ala Ile Lys Asp Trp Gln Gln Pro Ser Asp Asn Gln 290 295 300 Met Ser Arg Trp Arg Asn Ser Pro Phe Tyr Met Pro Val Asp Ala Arg 305 310 315 320 Pro Trp Phe Pro His Ala Asp Gly Ser Ala His Ile Ala Ala Tyr Ser 325 330 335 Cys Val Thr Ala Asp Ser Tyr Cys His Ile Leu Leu Gln Glu Asn Val 340 345 350 Leu Gln Glu Leu Val Leu Lys Glu Thr Val Leu Gln Asp Asn Asp Leu 355 360 365 Thr Glu Ser Lys Leu Gln Thr Leu Glu Gln Asn Asn Pro Val Ala Asp 370 375 380 Leu Arg Thr Asn Gly Tyr Phe Ala Ser Ser Glu Leu Ala Leu Ile Ile 385 390 395 400 Val Gln Gly Asn Asp Glu Ala Gln Leu Arg Cys Glu Leu Glu Thr Ile 405 410 415 Thr Gly Gln Leu Ser Thr Thr Gly Ile Ser Thr Ile Ser Ile Lys Gln 420 425 430 Ile Ala Ala Asp Cys Tyr Ala Arg Asn Asp Thr Asn Lys Ala Tyr Ser 435 440 445
Ala Val Leu Ile Ala Glu Thr Ala Glu Glu Leu Ser Lys Glu Ile Thr 450 455 460 Leu Ala Phe Ala Gly Ile Ala Ser Val Phe Asn Glu Asp Ala Lys Glu 465 470 475 480 Trp Lys Thr Pro Lys Gly Ser Tyr Phe Thr Ala Gln Pro Ala Asn Lys 485 490 495 Gln Ala Ala Asn Ser Thr Gln Asn Gly Val Thr Phe Met Tyr Pro Gly 500 505 510 Ile Gly Ala Thr Tyr Val Gly Leu Gly Arg Asp Leu Phe His Leu Phe 515 520 525 Pro Gln Ile Tyr Gln Pro Val Ala Ala Leu Ala Asp Asp Ile Gly Glu 530 535 540 Ser Leu Lys Asp Thr Leu Leu Asn Pro Arg Ser Ile Ser Arg His Ser 545 550 555 560 Phe Lys Glu Leu Lys Gln Leu Asp Leu Asp Leu Arg Gly Asn Leu Ala 565 570 575 Asn Ile Ala Glu Ala Gly Val Gly Phe Ala Cys Val Phe Thr Lys Val 580 585 590 Phe Glu Glu Val Phe Ala Val Lys Ala Asp Phe Ala Thr Gly Tyr Ser 595 600 605 Met Gly Glu Val Ser Met Tyr Ala Ala Leu Gly Cys Trp Gln Gln Pro 610 615 620 Gly Leu Met Ser Ala Arg Leu Ala Gln Ser Asn Thr Phe Asn His Gln 625 630 635 640 Leu Cys Gly Glu Leu Arg Thr Leu Arg Gln His Trp Gly Met Asp Asp 645 650 655 Val Ala Asn Gly Thr Phe Glu Gln Ile Trp Glu Thr Tyr Thr Ile Lys 660 665 670 Ala Thr Ile Glu Gln Val Glu Ile Ala Ser Ala Asp Glu Asp Arg Val 675 680 685 Tyr Cys Thr Ile Ile Asn Thr Pro Asp Ser Leu Leu Leu Ala Gly Tyr 690 695 700 Pro Glu Ala Cys Gln Arg Val Ile Lys Asn Leu Gly Val Arg Ala Met 705 710 715 720 Ala Leu Asn Met Ala Asn Ala Ile His Ser Ala Pro Ala Tyr Ala Glu 725 730 735 Tyr Asp His Met Val Glu Leu Tyr His Met Asp Val Thr Pro Arg Ile 740 745 750 Asn Thr Lys Met Tyr Ser Ser Ser Cys Tyr Leu Pro Ile Pro Gln Arg 755 760 765 Ser Lys Ala Ile Ser His Ser Ile Ala Lys Cys Leu Cys Asp Val Val 770 775 780 Asp Phe Pro Arg Leu Val Asn Thr Leu His Asp Lys Gly Ala Arg Val 785 790 795 800 Phe Ile Glu Met Gly Pro Gly Arg Ser Leu Cys Ser Trp Val Asp Lys 805 810 815 Ile Leu Val Asn Gly Asp Gly Asp Asn Lys Lys Gln Ser Gln His Val 820 825 830 Ser Val Pro Val Asn Ala Lys Gly Thr Ser Asp Glu Leu Thr Tyr Ile 835 840 845 Arg Ala Ile Ala Lys Leu Ile Ser His Gly Val Asn Leu
Asn Leu Asp 850 855 860 Ser 865 <210> 6 <211> 6036 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. (6033) <400> 6 atg gaa aat att gca gta gta ggt att gct aat ttg ttc ccg ggc tca 48 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 caa gca ccg gat caa ttt tgg cag caa ttg ctt gaa caa caa gat tgc 96 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 cgc agt aag gcg acc gct gtt caa atg ggc gtt gat cct gct aaa tat 144 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 acc gcc aac aaa ggt gac aca gat aaa ttt tac tgt gtg cac ggc ggt 192 Thr Ala Asn Lys Gly Asp Thrasp As Lys Phe Tyr Cys Val His Gly Gly 50 55 60 tac atc agt gat ttc aat ttt gat gct tca ggt tat caa ctc gat aat 240 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 gat tat tta gcc ggt tta gat gac ctt aat caa tgg ggg ctt tat gtt 288 Asp Tyr Leu Ala Gly Leu Asp Asp Leu Asn Gln Trp Gly Leu Tyr Val 85 90 95 acg aaa caa gcc ctt acc gat gcg ggt tat tgg ggc agt act gca cta 336 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 gaa aac tgt ggt gtg att tta ggt aat ttg tca ttc cca act aaa tca 384 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 tct aat cag ctg ttt atg cct ttg tat cat caa gtt gtt gat aat gcc 432 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 tta aag gcg gta tta cat cct gat ttt caa tta acg cat tac aca gca 480 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 ccg aaa aaa aca cat gct gac aat gca tta gta gca ggt tat cca gct 528 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 gca ttg atc gcg caa gcg gcg ggt ctt ggt ggt tca cat ttt gca ctg 576 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 gat gcg gct tgt gct tca tct tgt tat agc gtt aag tta gcg tgt gat 624 Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 tac ctg cat acg 
ggt aaa gcc aac atg atg ctt gct ggt gcg gta tct 672 Tyr Leu His Thr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 gca gca gat cct atg ttc gta aat atg ggt ttc tcg ata ttc caa gct 720 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 tac cca gct aac aat gta cat gcc ccg ttt gac caa aat tca caa ggt 768 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 cta ttt gcc ggt gaa ggc gcg ggc atg atg gta ttg aaa cgt caa agt 816 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 270 gat gca gta cgt gat ggt gat cat att tac gcc att att aaa ggc ggc 864 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 gca tta tcg aat gac ggt aaa ggc gag ttt gta tta agc ccg aac acc 912 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 aag ggc caa gta tta gta tat gaa cgt gct tat gcc gat gca gat 960 Lys Gly Gln Val Leu Val Tyr Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 gac ccg agt aca gtt gac tat att gaa tgt cat gca acg ggc aca cct 1008 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His A la Thr Gly Thr Pro 325 330 335 aag ggt gac aat gtt gaa ttg cgt tcg atg gaa acc ttt ttc agt cgc 1056 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 gta aat aac aaa cca tta ctg ggc tcg gtt aaa tct aac ctt ggt cat 1104 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 ttg tta act gcc gct ggt atg cct ggc atg acc aaa gct atg tta gcg 1152 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 cta ggt aaa ggt ctt att cct gca acg att aac tta aag caa cca ctg 1200 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 caa tct aaa aac ggt tac ttt act ggc gag caa atg cca acg acg act 1248 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 gtg tct tgg cca aca act ccg ggt gcc aag gca gat aaa ccg cgt acc 1296 Val Ser Trp Pro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro 
Arg Thr 420 425 430 gca ggt gtg agc gta ttt ggt ttt ggt ggc agc aac gcc cat ttg gta 1344 Ala Gly Val Ser Val Phe Gl y Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 tta caa cag cca acg caa aca ctc gag act aat ttt agt gtt gct aaa 1392 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 cca cgt gag cct ttg gct att att ggt atg gac agc cat ttt ggt agt 1440 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 475 480 gcc agt aat tta gcg cag ttc aaa acc tta tta aat aat aat caa aat 1488 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asn Asn Asn Gln Asn 485 490 495 acc ttc cgt gaa tta cca gaa caa cgc tgg aaa ggc atg gaa agt aac 1536 Thr Phe Arg Glu Leu Pro Glu Gln Arg Gly Met Glu Ser Asn 500 505 510 gct aac gtc atg cag tcg tta caa tta cgc aaa gcg cct aaa ggc agt 1584 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 tac gtt gat gag cacta att gat ttc ttg cgt ttt aaa gta ccg cct 1632 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 aat gaa aaa gat tgc ttg atc ccg caa cag tta atg atg atg Asa glugg 1680 Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 560 gca gac aat gct gcg aaa gac gga ggt cta gtt gaa ggt cgt aat gtt 1728 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Asg Val 565 570 575 gcg gta tta gta gcg atg ggc atg gaa ctg gaa tta cat cag tat cgt 1776 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 ggt cgc gtt aat cta accga caat gac agc tta tta cag caa 1824 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 ggt att aac ctg act gtt gag caa cgt gaa gaa ctg acc aat att gct 1872 Gly Ile Asn Leu Thr Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 aaa gac ggt gtt gcc tcg gct gca cag cta aat cag tat acg agt ttc 1920 Lys Asp Gly Val Ala Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 630 630 att ggt aat att atg gcg tca cgt att tcg gcg tta tgg gat ttt tct 1968 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser 
Ala Leu Trp Asp Phe Ser 645 650 655 ggt cct gct att acc gta tcg gct gaa gaa aac tct tat cgt tgt 2016 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 gtt gaa tta gct gaa aat cta ttt caa acc agt gat gtt gaa gcc gtt 2064 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 att att gct gct gtt gat ttg tct ggt tca att gaa aac att act tta 2112 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 cgt cag cac tac ggt cca gtt aat gaa aag gga tct gta agt gaa tgt 2160 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 ggt ccg gtt aat gaa agc agt tca gta acc aac aat att ctt gat cag 2208 Gly Pro Val Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 caa caa tgg ctg gtg ggt gaa ggc gca gcg gct att gtc gtt aaa ccg 2256 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Vals Pro 740 745 750 tca tcg caa gtc act gct gaa caa gtt tat gcg cgt att gat gcg gtg 2304 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 agt ttt gcc cct ggt agc aat gc g aa gca att acg att gca gcg gat 2352 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 aaa gca tta aca ctt gct ggt atc agt gct gct gat gta gct agt gtt 2400 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Asp Val Ala Ser Val 785 790 795 800 gaa gca cat gca agt ggt ttt agt gcc gaa aat aat gct gaa aaa acc 2448 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 gcg tta ccg act tta tac cca agc gca agt atc agt tcg gtg aaa gcc 2496 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 aat att ggt cat acg ttt aat gcc tcg ggt atg gcgt att att aaa 2544 Asn Ile Gly His Thr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 acg gcg ctg ctg tta gat cag aat acg agt caa gat cag aaa agc aaa 2592 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 cat att gct att aac ggt cta ggt cgt gat aac agc tgc gcg cat ctt 2640 His Ile Ala 
Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 875 880 atc tta tc g agt tca gcg caa gcg cat caa gtt gca cca gcg cct gta 2688 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 tct ggt atg gcc aag caa cgc cca cag tta gtt aaa acc atc aaa ctc 2736 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 ggt ggt cag tta att agc aac gcg att gtt aac agt gcg agt tca tct 2784 Gly Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 tta cac gct att aaa gcg cag ttt gcc ggt aag cac tta aac aaa gtt 2832 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 aac cag cca gtg atg atg gat aac ctg aag ccc caa ggt att agc gct 2880 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 cat gca acc aat gag tat gtg gtg act gga gct gct aac act caa gct 2928 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 tct aac att caa gca tct cat gtt caa gcg tca agt cat gca caa gag 2976 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 ata gca cca aac caa gtt caa aat atg caa gct aca gca gcc gct gta 3024 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Ala Val 995 1000 1005 agt tca ccc ctt tct caa cat caa cac aca gcg cag ccc gta gcg gca 3072 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 ccg agc gtt gtt gga gtg act gtg aaa cat aaa gca agt aac caa att 3120 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 cat cag caa gcg tct acg cat aaa gca ttt tta gaa agt cgt tta gct 3168 His Gln Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 gca cag aaa aac cta tcg caa ctt gtt gaa ttg caa acc aag ctg tca 3216 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 atc caa act ggt agt gac aat aca tct aac aat act gcg 3ca aca ag Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 aat aca gtg 
cta aca aat cct gta tca gca acg cca tta aca ctt gtg 3312 Asn Thr Val Leu Thr Asn Pro V al Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 tat aat gcg cct gta gta gcg aca aac cta acc agt aca gaa gca aaa 3360 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 gcg caa gca gct gct aca caa gct ggt ttt cag ata aaa gga cct gtt 3408 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro Val 1125 1130 1135 ggt tac aac tat cca ccg ctg cag tta att gaa cgt tat a cca 3456 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 gaa aac gtg att tac gat caa gct gat ttg gtt gaa ttc gct gaa ggt 3504 Glu Asn Val Ile Tyr Asp Gln Ala Asp Le Phe Ala Glu Gly 1155 1160 1165 gat att ggt aag gta ttt ggt gct gaa tac aat att att gat ggc tat 3552 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 tcg cgt cgt gcac acc tca gat tac ttg tta gta aca cgt 3600 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 gtt act gaa ctt gat gcc aag gtg cat gaa tac aag aaa tca tac atg 3648 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Ser Tyr Met 1205 1210 1215 tgt act gaa tat gat gtg cct gtt gat gca ccg ttc tta att gat ggt 3696 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 cag atc cct tgg tct gtt gcc gtc gaa tca ggc cag tgt gat ttg atg 3744 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 ttg att tca tat at ggt att gat ttc caa gcg aaa ggc gaa cgt gtt 3792 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 tac cgt tta ctt gat tgt gaa tta act ttc ctt gaa gag atg gtt gtt Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 ggt ggc gat act tta cgt tac gag atc cac att gat tcg tat gca cgt 3888 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 aac ggc gag caa tta tta ttc ttc ttc cat tac gat tgt tac gta ggg 3936 Asn Gly Glu Gln Leu Leu Phe Phe Phe His 
Tyr Asp Cys Tyr Val Gly 1300 1305 1310 gat aag aag gta ctt atc atg cgt aat ggt tgt gct ggt ttc ttt act 3984 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 gac gaa gaa ctt tct gat ggt aaa ggc gtt att cat aac gac aaa g Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 aaa gct gag ttt agc aat gct gtt aaa tca tca ttc acg ccg tta tta 4080 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 caa cat aac cgt ggt caa tac gat tat aac gac atg atg aag ttg gtt 4128 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 aat ggt gat gtt gcc agt tgt ggt ccg caa tat gat caa ggt ggc 4176 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 cgt aat cca tca ttg aaa ttc tcg tct gag aag ttc ttg atg att gaa 4224 Arg Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 cgt att acc aag ata gac cca acc ggt ggt cat tgg gga cta ggc ctg 4272 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 tta gaa ggt cag aaa gat tta gac cct gag cat tgg tat ttc cct tgt 4320 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 cac ttt aaa ggt gat ca gta atg gct ggt tcg ttg atg tcg gaa ggt 4368 His Phe Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 tgt ggc caa atg gcg atg ttc ttc atg ctg tct ctt ggt atg cat Gcc 4416 Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 aat gtg aac aac gct cgt ttc caa cca cta cca ggt gaa tca caa acg 4464 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 gta cgt tgt cgt ggg caa gta ctg cca cag cgc aat acc tta act tac 4512 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 cgt atg gaa gtt act gcg atg ggt atg cat cca cag cca ttc atg aaa 4560 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 gct aat att gat att ttg ctt gac ggt aaa gtg gtt gtt gat ttc 
aaa 4608 Al a Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val Asp Phe Lys 1525 1530 1535 aac ttg agc gtg atg atc agc gaa caa gat gag cat tca gat tac cct 4656 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 gta aca ctg ccg agt aat gtg gcg ctt aaa gcg att act gca cct gtt 4704 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 gcg tca gta gca cca gca tct tca ccc gct aac agc gcg gat cta gac 4752 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 gaa cgt ggt gtt gaa ccg ttt aag ttt cct gaa cgt ccg tta atg cgt 4800 Glu Arg Gly Val Glu Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 gtt gag tca gac ttg tct gca ccg aaa agc aaa ggt gtg aca ccg att 4848 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 aag cat ttt gaa gcg cct gct gtt gct ggt cat cat aga gtg cct aac 4896 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 caa gca ccg ttt aca cct tgg c at atg ttt gag ttt gcg acg ggt aat 4944 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 att tct aac tgt ttc ggt cct gat ttt gat gtt tat gaa ggt cgt att 4992 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 cca cct cgt aca cct tgt ggc gat tta caa gtt gtt act cag gtt gta 5040 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 gaa gtg cag ggc gaa cgt ctt gat ctt aaa aat cca tca agc tgt gta 5088 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 gct gaa tac tat gta ccg gaa gac gct act aaa aac agc 5136 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 cat gaa aac tgg atg cct tat tca tta atc atg gaa att gca ttg caa 5184 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 cca aat ggc ttt att tct ggt tac atg ggc acg acg ctt aaa tac cct 5232 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 173 0 
1735 1740 gaa aaa gat ctg ttc ttc cgt aac ctt gat ggt agc ggc acg tta tta 5280 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 aag cag att gat c cag acc at gtg aat aaa tca gtc ttg 5328 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 gtt agt acg gct att gct ggt ggc gcg att att caa agt ttc acg ttt 5376 Val Ser Thr Ala Ile Ala Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 gat atg tct gta gat ggc gag cta ttt tat act ggt aaa gct gta ttt 5424 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 ggt tac ttt agt ggt gaa tca ctg act aac caa ctg ggc att gat aac 5472 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 ggt aaa acg act aat gcg tgg ttt gtt gat aac ac acc gcc gca 5520 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 aat att gat gtg ttt gat tta act aat cag tca ttg gct ctg tat aaa 5568 Asn Ile Asp Val Phe A sp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 gcg cct gtg gat aaa ccg cat tat aaa ttg gct ggt ggt cag atg aac 5616 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gln Met Asn 1860 1865 1870 ttt atc gat aca gtg tca gtg gtt gaa ggc ggt ggt aaa gcg ggc gtg 5664 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 gct tat gtt tat ggc gaa cgt acg att gat gct ttc ttc 5712 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 cgt tat cac ttc cac caa gat ccg gtg atg cca ggt tca tta ggt gtt 5760 Arg Tyr His Phe His Gln Asp Pro Val Met Gly Ser Leu Gly Val 1905 1910 1915 1920 gaa gct att att gag ttg atg cag acc tat gcg ctt aaa aat gat ttg 5808 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu Lys Asn Asp Leu 1925 1930 1935 ggt ggc aag t aac cca cgt ttc att gcg ccg atg acg caa gtt 5856 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 gat tgg aaa tac cgt ggg caa att acg ccg ctg aat aaa cag atg tca 5904 Asp Trp Lys Tyr Arg 
Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 ctg gac gtg cat atc act gag atc gtg aat gac gct ggt gaa gtg cga 5952 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 atc gtt ggt gat gcg aat ctg tct aaa gat ggt ctg cgt att tat gaa 6000 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 gtt aaa aac atc gtt tta agt att gtt gaa gcg taa 6036 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 7 <211> 2011 <212> PRT <213> Moritella marina <400> 7 Met Glu Asn Ile Ala Val Val Gly Ile Ala Asn Leu Phe Pro Gly Ser 1 5 10 15 Gln Ala Pro Asp Gln Phe Trp Gln Gln Leu Leu Glu Gln Gln Asp Cys 20 25 30 Arg Ser Lys Ala Thr Ala Val Gln Met Gly Val Asp Pro Ala Lys Tyr 35 40 45 Thr Ala Asn Lys Gly Asp Thr Asp Lys Phe Tyr Cys Val His Gly Gly 50 55 60 Tyr Ile Ser Asp Phe Asn Phe Asp Ala Ser Gly Tyr Gln Leu Asp Asn 65 70 75 80 Asp Tyr Leu Ala Gly Leu Asp Asp Lep Asn Gln Trp Gly Leu Tyr Val 85 90 95 Thr Lys Gln Ala Leu Thr Asp Ala Gly Tyr Trp Gly Ser Thr Ala Leu 100 105 110 Glu Asn Cys Gly Val Ile Leu Gly Asn Leu Ser Phe Pro Thr Lys Ser 115 120 125 Ser Asn Gln Leu Phe Met Pro Leu Tyr His Gln Val Val Asp Asn Ala 130 135 140 Leu Lys Ala Val Leu His Pro Asp Phe Gln Leu Thr His Tyr Thr Ala 145 150 155 160 Pro Lys Lys Thr His Ala Asp Asn Ala Leu Val Ala Gly Tyr Pro Ala 165 170 175 Ala Leu Ile Ala Gln Ala Ala Gly Leu Gly Gly Ser His Phe Ala Leu 180 185 190 Asp Ala Ala Cys Ala Ser Ser Cys Tyr Ser Val Lys Leu Ala Cys Asp 195 200 205 Tyr Leu His T hr Gly Lys Ala Asn Met Met Leu Ala Gly Ala Val Ser 210 215 220 Ala Ala Asp Pro Met Phe Val Asn Met Gly Phe Ser Ile Phe Gln Ala 225 230 235 240 Tyr Pro Ala Asn Asn Val His Ala Pro Phe Asp Gln Asn Ser Gln Gly 245 250 255 Leu Phe Ala Gly Glu Gly Ala Gly Met Met Val Leu Lys Arg Gln Ser 260 265 270 Asp Ala Val Arg Asp Gly Asp His Ile Tyr Ala Ile Ile Lys Gly Gly 275 280 285 285 Ala Leu Ser Asn Asp Gly Lys Gly Glu Phe Val Leu Ser Pro Asn Thr 290 295 300 Lys Gly Gln Val Leu Val Tyr 
Glu Arg Ala Tyr Ala Asp Ala Asp Val 305 310 315 320 Asp Pro Ser Thr Val Asp Tyr Ile Glu Cys His Ala Thr Gly Thr Pro 325 330 335 Lys Gly Asp Asn Val Glu Leu Arg Ser Met Glu Thr Phe Phe Ser Arg 340 345 350 Val Asn Asn Lys Pro Leu Leu Gly Ser Val Lys Ser Asn Leu Gly His 355 360 365 Leu Leu Thr Ala Ala Gly Met Pro Gly Met Thr Lys Ala Met Leu Ala 370 375 380 Leu Gly Lys Gly Leu Ile Pro Ala Thr Ile Asn Leu Lys Gln Pro Leu 385 390 395 400 Gln Ser Lys Asn Gly Tyr Phe Thr Gly Glu Gln Met Pro Thr Thr Thr 405 410 415 Val Ser Trp P ro Thr Thr Pro Gly Ala Lys Ala Asp Lys Pro Arg Thr 420 425 430 Ala Gly Val Ser Val Phe Gly Phe Gly Gly Ser Asn Ala His Leu Val 435 440 445 Leu Gln Gln Pro Thr Gln Thr Leu Glu Thr Asn Phe Ser Val Ala Lys 450 455 460 Pro Arg Glu Pro Leu Ala Ile Ile Gly Met Asp Ser His Phe Gly Ser 465 470 475 480 Ala Ser Asn Leu Ala Gln Phe Lys Thr Leu Leu Asu Asn Asn Asn Gln Asn 485 490 495 495 Thr Phe Arg Glu Leu Pro Glu Gln Arg Trp Lys Gly Met Glu Ser Asn 500 505 510 Ala Asn Val Met Gln Ser Leu Gln Leu Arg Lys Ala Pro Lys Gly Ser 515 520 525 Tyr Val Glu Gln Leu Asp Ile Asp Phe Leu Arg Phe Lys Val Pro Pro 530 535 540 Asn Glu Lys Asp Cys Leu Ile Pro Gln Gln Leu Met Met Met Gln Val 545 550 555 555 560 Ala Asp Asn Ala Ala Lys Asp Gly Gly Leu Val Glu Gly Arg Asn Val 565 570 575 Ala Val Leu Val Ala Met Gly Met Glu Leu Glu Leu His Gln Tyr Arg 580 585 590 Gly Arg Val Asn Leu Thr Thr Gln Ile Glu Asp Ser Leu Leu Gln Gln 595 600 605 Gly Ile Asn Leu Thr Val Glu Gln Arg Glu Glu Leu Thr Asn Ile Ala 610 615 620 lys Asp Gly ValAla Ser Ala Ala Gln Leu Asn Gln Tyr Thr Ser Phe 625 630 635 640 Ile Gly Asn Ile Met Ala Ser Arg Ile Ser Ala Leu Trp Asp Phe Ser 645 650 655 Gly Pro Ala Ile Thr Val Ser Ala Glu Glu Asn Ser Val Tyr Arg Cys 660 665 670 Val Glu Leu Ala Glu Asn Leu Phe Gln Thr Ser Asp Val Glu Ala Val 675 680 685 Ile Ile Ala Ala Val Asp Leu Ser Gly Ser Ile Glu Asn Ile Thr Leu 690 695 700 Arg Gln His Tyr Gly Pro Val Asn Glu Lys Gly Ser Val Ser Glu Cys 705 710 715 720 720 Gly Pro Val 
Asn Glu Ser Ser Ser Val Thr Asn Asn Ile Leu Asp Gln 725 730 735 Gln Gln Trp Leu Val Gly Glu Gly Ala Ala Ala Ile Val Val Lys Pro 740 745 750 Ser Ser Gln Val Thr Ala Glu Gln Val Tyr Ala Arg Ile Asp Ala Val 755 760 765 Ser Phe Ala Pro Gly Ser Asn Ala Lys Ala Ile Thr Ile Ala Ala Asp 770 775 780 Lys Ala Leu Thr Leu Ala Gly Ile Ser Ala Ala Ala Asp Val Ala Ser Val 785 790 795 800 Glu Ala His Ala Ser Gly Phe Ser Ala Glu Asn Asn Ala Glu Lys Thr 805 810 815 Ala Leu Pro Thr Leu Tyr Pro Ser Ala Ser Ile Ser Ser Val Lys Ala 820 825 830 Asn Ile Gly HisThr Phe Asn Ala Ser Gly Met Ala Ser Ile Ile Lys 835 840 845 Thr Ala Leu Leu Leu Asp Gln Asn Thr Ser Gln Asp Gln Lys Ser Lys 850 855 860 His Ile Ala Ile Asn Gly Leu Gly Arg Asp Asn Ser Cys Ala His Leu 865 870 875 880 Ile Leu Ser Ser Ser Ala Gln Ala His Gln Val Ala Pro Ala Pro Val 885 890 895 Ser Gly Met Ala Lys Gln Arg Pro Gln Leu Val Lys Thr Ile Lys Leu 900 905 910 Gly Gly Gln Leu Ile Ser Asn Ala Ile Val Asn Ser Ala Ser Ser Ser 915 920 925 Leu His Ala Ile Lys Ala Gln Phe Ala Gly Lys His Leu Asn Lys Val 930 935 940 Asn Gln Pro Val Met Met Asp Asn Leu Lys Pro Gln Gly Ile Ser Ala 945 950 955 960 His Ala Thr Asn Glu Tyr Val Val Thr Gly Ala Ala Asn Thr Gln Ala 965 970 975 Ser Asn Ile Gln Ala Ser His Val Gln Ala Ser Ser His Ala Gln Glu 980 985 990 Ile Ala Pro Asn Gln Val Gln Asn Met Gln Ala Thr Ala Ala Ala Val 995 1000 1005 Ser Ser Pro Leu Ser Gln His Gln His Thr Ala Gln Pro Val Ala Ala 1010 1015 1020 Pro Ser Val Val Gly Val Thr Val Lys His Lys Ala Ser Asn Gln Ile 1025 1030 1035 1040 His Gl n Gln Ala Ser Thr His Lys Ala Phe Leu Glu Ser Arg Leu Ala 1045 1050 1055 Ala Gln Lys Asn Leu Ser Gln Leu Val Glu Leu Gln Thr Lys Leu Ser 1060 1065 1070 Ile Gln Thr Gly Ser Asp Asn Thr Ser Asn Asn Thr Ala Ser Thr Ser 1075 1080 1085 Asn Thr Val Leu Thr Asn Pro Val Ser Ala Thr Pro Leu Thr Leu Val 1090 1095 1100 Tyr Asn Ala Pro Val Val Ala Thr Asn Leu Thr Ser Thr Glu Ala Lys 1105 1110 1115 1120 Ala Gln Ala Ala Ala Thr Gln Ala Gly Phe Gln Ile Lys Gly Pro 
Val 1125 1130 1135 Gly Tyr Asn Tyr Pro Pro Leu Gln Leu Ile Glu Arg Tyr Asn Lys Pro 1140 1145 1150 Glu Asn Val Ile Tyr Asp Gln Ala Asp Leu Val Glu Phe Ala Glu Gly 1155 1160 1165 Asp Ile Gly Lys Val Phe Gly Ala Glu Tyr Asn Ile Ile Asp Gly Tyr 1170 1175 1180 Ser Arg Arg Val Arg Leu Pro Thr Ser Asp Tyr Leu Leu Val Thr Arg 1185 1190 1195 1200 Val Thr Glu Leu Asp Ala Lys Val His Glu Tyr Lys Lys Lys Ser Tyr Met 1205 1210 1215 Cys Thr Glu Tyr Asp Val Pro Val Asp Ala Pro Phe Leu Ile Asp Gly 1220 1225 1230 Gln Ile Pro Trp Ser Val Ala Val Glu Ser Gly Gln Cys Asp Leu Met 1235 1240 1245 Leu Ile Ser Tyr Ile Gly Ile Asp Phe Gln Ala Lys Gly Glu Arg Val 1250 1255 1260 Tyr Arg Leu Leu Asp Cys Glu Leu Thr Phe Leu Glu Glu Met Ala Phe 1265 1270 1275 1280 Gly Gly Asp Thr Leu Arg Tyr Glu Ile His Ile Asp Ser Tyr Ala Arg 1285 1290 1295 Asn Gly Glu Gln Leu Leu Phe Phe Phe His Tyr Asp Cys Tyr Val Gly 1300 1305 1310 Asp Lys Lys Val Leu Ile Met Arg Asn Gly Cys Ala Gly Phe Phe Thr 1315 1320 1325 Asp Glu Glu Leu Ser Asp Gly Lys Gly Val Ile His Asn Asp Lys Asp 1330 1335 1340 Lys Ala Glu Phe Ser Asn Ala Val Lys Ser Ser Phe Thr Pro Leu Leu 1345 1350 1355 1360 Gln His Asn Arg Gly Gln Tyr Asp Tyr Asn Asp Met Met Lys Leu Val 1365 1370 1375 Asn Gly Asp Val Ala Ser Cys Phe Gly Pro Gln Tyr Asp Gln Gly Gly 1380 1385 1390 Arg Asn Pro Ser Leu Lys Phe Ser Ser Glu Lys Phe Leu Met Ile Glu 1395 1400 1405 Arg Ile Thr Lys Ile Asp Pro Thr Gly Gly His Trp Gly Leu Gly Leu 1410 1415 1420 Leu Glu Gly Gln Lys Asp Leu Asp Pro Glu His Trp Tyr Phe Pro Cys 1425 1430 1435 1440 His P he Lys Gly Asp Gln Val Met Ala Gly Ser Leu Met Ser Glu Gly 1445 1450 1455 Cys Gly Gln Met Ala Met Phe Phe Met Leu Ser Leu Gly Met His Thr 1460 1465 1470 Asn Val Asn Asn Ala Arg Phe Gln Pro Leu Pro Gly Glu Ser Gln Thr 1475 1480 1485 Val Arg Cys Arg Gly Gln Val Leu Pro Gln Arg Asn Thr Leu Thr Tyr 1490 1495 1500 Arg Met Glu Val Thr Ala Met Gly Met His Pro Gln Pro Phe Met Lys 1505 1510 1515 1520 Ala Asn Ile Asp Ile Leu Leu Asp Gly Lys Val Val Val 
Asp Phe Lys 1525 1530 1535 Asn Leu Ser Val Met Ile Ser Glu Gln Asp Glu His Ser Asp Tyr Pro 1540 1545 1550 Val Thr Leu Pro Ser Asn Val Ala Leu Lys Ala Ile Thr Ala Pro Val 1555 1560 1565 Ala Ser Val Ala Pro Ala Ser Ser Pro Ala Asn Ser Ala Asp Leu Asp 1570 1575 1580 Glu Arg Gly Val Glu Pro Phe Lys Phe Pro Glu Arg Pro Leu Met Arg 1585 1590 1595 1600 Val Glu Ser Asp Leu Ser Ala Pro Lys Ser Lys Gly Val Thr Pro Ile 1605 1610 1615 Lys His Phe Glu Ala Pro Ala Val Ala Gly His His Arg Val Pro Asn 1620 1625 1630 Gln Ala Pro Phe Thr Pro Trp His Met Phe Glu Phe Ala Thr Gly Asn 1635 1640 1645 Ile Ser Asn Cys Phe Gly Pro Asp Phe Asp Val Tyr Glu Gly Arg Ile 1650 1655 1660 Pro Pro Arg Thr Pro Cys Gly Asp Leu Gln Val Val Thr Gln Val Val 1665 1670 1675 1680 Glu Val Gln Gly Glu Arg Leu Asp Leu Lys Asn Pro Ser Ser Cys Val 1685 1690 1695 Ala Glu Tyr Tyr Val Pro Glu Asp Ala Trp Tyr Phe Thr Lys Asn Ser 1700 1705 1710 His Glu Asn Trp Met Pro Tyr Ser Leu Ile Met Glu Ile Ala Leu Gln 1715 1720 1725 Pro Asn Gly Phe Ile Ser Gly Tyr Met Gly Thr Thr Leu Lys Tyr Pro 1730 1735 1740 Glu Lys Asp Leu Phe Phe Arg Asn Leu Asp Gly Ser Gly Thr Leu Leu 1745 1750 1755 1760 Lys Gln Ile Asp Leu Arg Gly Lys Thr Ile Val Asn Lys Ser Val Leu 1765 1770 1775 Val Ser Thr Ala Ile Ala Gly Gly Ala Ile Ile Gln Ser Phe Thr Phe 1780 1785 1790 Asp Met Ser Val Asp Gly Glu Leu Phe Tyr Thr Gly Lys Ala Val Phe 1795 1800 1805 Gly Tyr Phe Ser Gly Glu Ser Leu Thr Asn Gln Leu Gly Ile Asp Asn 1810 1815 1820 Gly Lys Thr Thr Asn Ala Trp Phe Val Asp Asn Asn Thr Pro Ala Ala 1825 1830 1835 1840 Asn Ile Asp Val Phe Asp Leu Thr Asn Gln Ser Leu Ala Leu Tyr Lys 1845 1850 1855 Ala Pro Val Asp Lys Pro His Tyr Lys Leu Ala Gly Gly Gly Gln Met Asn 1860 1865 1870 Phe Ile Asp Thr Val Ser Val Val Glu Gly Gly Gly Lys Ala Gly Val 1875 1880 1885 Ala Tyr Val Tyr Gly Glu Arg Thr Ile Asp Ala Asp Asp Trp Phe Phe 1890 1895 1900 Arg Tyr His Phe His Gln Asp Pro Val Met Pro Gly Ser Leu Gly Val 1905 1910 1915 1920 Glu Ala Ile Ile Glu Leu Met Gln Thr Tyr Ala Leu 
Lys Asn Asp Leu 1925 1930 1935 Gly Gly Lys Phe Ala Asn Pro Arg Phe Ile Ala Pro Met Thr Gln Val 1940 1945 1950 Asp Trp Lys Tyr Arg Gly Gln Ile Thr Pro Leu Asn Lys Gln Met Ser 1955 1960 1965 Leu Asp Val His Ile Thr Glu Ile Val Asn Asp Ala Gly Glu Val Arg 1970 1975 1980 Ile Val Gly Asp Ala Asn Leu Ser Lys Asp Gly Leu Arg Ile Tyr Glu 1985 1990 1995 2000 Val Lys Asn Ile Val Leu Ser Ile Val Glu Ala 2005 2010 <210> 8 <211> 1617 <212> DNA <213> Moritella marina <220> <221> CDS <222> (1) .. (1614) <400> 8 atg tcg agt tta ggt ttt aac aat aac aac gca att aac tgg gct tgg 48 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 aaa gta gat cca gcg tca gtt cat aca caa gat gca gaa att aaa gca 96 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 gct tta atg gat cta act aaa cct ctc tat gtg gcg aat aat tca ggc 144 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 gta act ggt ata gct aat cat acg tca gta gca ggt gcg atc agc aat 192 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 aac atc gat gtt gat gta ttg gcg ttt gcg caa aag tta aac cca gaa 240 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 gat ctg ggt gat gat gct tac aag aaa cag cac ggc gtt aaa tat gct 288 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 tat cat ggc ggt gcg atg gca aat ggt att gcc tcg gtt gaa ttg gtt 336 Tyr His Gly Gly Ala Met Ala Asn Gly Ale Ser Val Glu Leu Val 100 105 110 gtt gcg tta ggt aaa gca ggg ctg tta tgt tca ttt ggt gct gca ggt 384 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 cta gtg cct gat gcg gtt gaa gat gca att cgt cgt gct gaa 432 Leu Val Pro Asp Ala Val Glu Asp Ala Ile Arg Arg Ile Gln Ala Glu 130 135 140 tta cca aat ggc cct tat gcg gtt aac ttg atc cat gca cca gca gaa 480 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 gaa gca tta gag cgt ggc gcg gtt gaa cgt ttc cta aaa ctt ggc gtc 528 Glu Ala Leu 
Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 aag acct gta tca gct tac ctt ggt tta act gaa cac att gtt 576 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 tgg tat cgt gct gct ggt cta act aaa aac gca gat ggc agt gtt aat 624 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 atc ggt aac aag gtt atc gct aaa gta tcg cgt acc gaa gtt ggt cgc 672 Ile Gly Asn Lys Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg210 215 220 cgc ttt atg gaa cct gca ccg caa aaa tta ctg gat aag tta tta gaa 720 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 caa aat aag atc acc cct gga caa gct tta gcg ttg ctt gta cct 768 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 atg gct gat gat att act ggg gaa gcg gat tct ggt ggt cat aca gat 816 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 aac cgt ccg ttt tta aca tta tta ccg acg att att ggt ctg cgt gat 864 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thrh Ile Ile Gly Leu Arg Asp 275 280 285 gaa gtg caa gcg aag tat aac ttc tct cct gca tta cgt gtt ggt gct 912 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 ggt ggt ggt atc gga acg cct gaa gca gca ctc gct gca ttt ata 960 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 ggc gcg gct tat atc gtt ctg ggt tct gtg aat cag gcg tgt gtt gaa 1008 Gly Ala Ala Tla Iyr Val Leu Gly Ser Val Asn Ala Cys Val Glu 325 330 335 gcg ggt gca tct gaa tat act cgt aaa ctg tta tcg aca gtt gaa atg 1056 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 gct gat gtg act atg gca cct gct gca gat atg ttt gaa atg ggt gtg 1104 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 aag ctg caa gta tta aaa cgc ggt tct atg ttc gcg atg cgt gcg aag 1152 Lys Le Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 aaa ctg tat gac ttg tat gtg gct tat gac tcg att gaa gat atc cca 1200 Lys Leu 
Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 395 400 gct gct gaa cgt gag aag att gaa aaa caa atc ttc cgt gca aac cta 1248 Ala Ala Glu Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 gac gag att tgg gat ggc act atc gct ttc gaa cgc gat cca 1296 Asp Glu Ile Trp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 gaa atg cta gcc cgt gca acg agt agt cct aaa cgt aaa atg gca ctt 1344 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 atc ttc cgt tgg tat ctt ggc ctt tct tca cgc tgg tca aac aca ggc 1392 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 gag aag gga cgt gaa atg gat tat cag att tgg gca ggc cca agt tta 1440 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 ggt gca ttc aac agc tgg gtg aaa ggt tct tac ctt acc 1488 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 cgc cgt ggc gct gta gat gtt gct ttg cat atg ctt aaa ggt gct gcg 1536 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 tat tta caa cgt gta aac cag ttg aaa ttg caa ggt gtt agc tta agt 1584 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 aca gaa ttg gca gtg agt acg agt gat taa 1617 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 9 <211> 538 <212> PRT <213> Moritella marina <400> 9 Met Ser Ser Leu Gly Phe Asn Asn Asn Asn Ala Ile Asn Trp Ala Trp 1 5 10 15 Lys Val Asp Pro Ala Ser Val His Thr Gln Asp Ala Glu Ile Lys Ala 20 25 30 Ala Leu Met Asp Leu Thr Lys Pro Leu Tyr Val Ala Asn Asn Ser Gly 35 40 45 Val Thr Gly Ile Ala Asn His Thr Ser Val Ala Gly Ala Ile Ser Asn 50 55 60 Asn Ile Asp Val Asp Val Leu Ala Phe Ala Gln Lys Leu Asn Pro Glu 65 70 75 80 Asp Leu Gly Asp Asp Ala Tyr Lys Lys Gln His Gly Val Lys Tyr Ala 85 90 95 Tyr His Gly Gly Ala Met Ala Asn Gly Ile Ala Ser Val Glu Leu Val 100 105 110 Val Ala Leu Gly Lys Ala Gly Leu Leu Cys Ser Phe Gly Ala Ala Gly 115 120 125 Leu Val Pro Asp Ala Val Glu Asp Ala Ile 
Arg Arg Ile Gln Ala Glu 130 135 140 Leu Pro Asn Gly Pro Tyr Ala Val Asn Leu Ile His Ala Pro Ala Glu 145 150 155 160 Glu Ala Leu Glu Arg Gly Ala Val Glu Arg Phe Leu Lys Leu Gly Val 165 170 175 Lys Thr Val Glu Ala Ser Ala Tyr Leu Gly Leu Thr Glu His Ile Val 180 185 190 Trp Tyr Arg Ala Ala Gly Leu Thr Lys Asn Ala Asp Gly Ser Val Asn 195 200 205 Ile Gly Asn Ly s Val Ile Ala Lys Val Ser Arg Thr Glu Val Gly Arg 210 215 220 Arg Phe Met Glu Pro Ala Pro Gln Lys Leu Leu Asp Lys Leu Leu Glu 225 230 235 240 Gln Asn Lys Ile Thr Pro Glu Gln Ala Ala Leu Ala Leu Leu Val Pro 245 250 255 Met Ala Asp Asp Ile Thr Gly Glu Ala Asp Ser Gly Gly His Thr Asp 260 265 270 Asn Arg Pro Phe Leu Thr Leu Leu Pro Thr Ile Ile Gly Leu Arg Asp 275 280 285 Glu Val Gln Ala Lys Tyr Asn Phe Ser Pro Ala Leu Arg Val Gly Ala 290 295 300 300 Gly Gly Gly Ile Gly Thr Pro Glu Ala Ala Leu Ala Ala Phe Asn Met 305 310 315 320 Gly Ala Ala Tyr Ile Val Leu Gly Ser Val Asn Gln Ala Cys Val Glu 325 330 335 Ala Gly Ala Ser Glu Tyr Thr Arg Lys Leu Leu Ser Thr Val Glu Met 340 345 350 Ala Asp Val Thr Met Ala Pro Ala Ala Asp Met Phe Glu Met Gly Val 355 360 365 Lys Leu Gln Val Leu Lys Arg Gly Ser Met Phe Ala Met Arg Ala Lys 370 375 380 Lys Leu Tyr Asp Leu Tyr Val Ala Tyr Asp Ser Ile Glu Asp Ile Pro 385 390 395 400 Ala Ala Gla Arg Glu Lys Ile Glu Lys Gln Ile Phe Arg Ala Asn Leu 405 410 415 Asp Glu Ile T rp Asp Gly Thr Ile Ala Phe Phe Thr Glu Arg Asp Pro 420 425 430 Glu Met Leu Ala Arg Ala Thr Ser Ser Pro Lys Arg Lys Met Ala Leu 435 440 445 Ile Phe Arg Trp Tyr Leu Gly Leu Ser Ser Arg Trp Ser Asn Thr Gly 450 455 460 Glu Lys Gly Arg Glu Met Asp Tyr Gln Ile Trp Ala Gly Pro Ser Leu 465 470 475 480 Gly Ala Phe Asn Ser Trp Val Lys Gly Ser Tyr Leu Glu Asp Tyr Thr 485 490 495 Arg Arg Gly Ala Val Asp Val Ala Leu His Met Leu Lys Gly Ala Ala 500 505 510 Tyr Leu Gln Arg Val Asn Gln Leu Lys Leu Gln Gly Val Ser Leu Ser 515 520 525 Thr Glu Leu Ala Ser Tyr Arg Thr Ser Asp 530 535 <210> 10 <211> 20 <212> DNA <213> Artificial Sequence <220> <223> 
Description of Artificial Sequence: Primer
<220>
<221> Degenerate
<222> (6)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (12)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (15)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (18)
<223> "n" is a, t, c or g
<400> 10
ttyggnttyg gnggnacnaa 20

<210> 11
<211> 21
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: Primer
<220>
<221> Degenerate
<222> (4)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (7)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (10)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (16)
<223> "n" is a, t, c or g
<220>
<221> Degenerate
<222> (19)
<223> "n" is a, t, c or g
<400> 11
ytcnccnarn swrtgnccng c 21

[0059]

[Sequence Listing Free Text]

[0060]

[SEQ ID NO: 10] Primer. The n at bases 6, 12, 15 and 18 is a, t, c or g.

[0061]

[SEQ ID NO: 11] Primer. The n at bases 4, 7, 10, 16 and 19 is a, t, c or g.
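
The free text for SEQ ID NOs: 10 and 11 can be cross-checked against the primer sequences themselves. The sketch below, which is illustrative and not part of the patent, locates every n in a primer and counts how many distinct oligonucleotides the degenerate sequence represents. It assumes the remaining symbols (y, r, s, w) follow the usual IUPAC nucleotide codes; the names `IUPAC`, `n_positions` and `degeneracy` are hypothetical.

```python
# IUPAC nucleotide codes mapped to the bases each symbol can stand for.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at",
    "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg",
    "n": "acgt",
}


def normalize(primer: str) -> str:
    """Drop the spacing used in the listing and lower-case the sequence."""
    return primer.replace(" ", "").lower()


def n_positions(primer: str) -> list[int]:
    """Return the 1-based positions at which the primer contains n."""
    return [i for i, base in enumerate(normalize(primer), 1) if base == "n"]


def degeneracy(primer: str) -> int:
    """Count the distinct sequences encoded by a degenerate primer."""
    total = 1
    for base in normalize(primer):
        total *= len(IUPAC[base])
    return total


SEQ_ID_10 = "ttyggnttyg gnggnacnaa"
SEQ_ID_11 = "ytcnccnarn swrtgnccng c"

for name, primer in (("SEQ ID NO: 10", SEQ_ID_10), ("SEQ ID NO: 11", SEQ_ID_11)):
    print(name, len(normalize(primer)), n_positions(primer), degeneracy(primer))
```

Under these assumptions the n positions come out as 6, 12, 15, 18 for SEQ ID NO: 10 and 4, 7, 10, 16, 19 for SEQ ID NO: 11, matching the free text, and the primers represent 1024 and 32768 sequences respectively.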

[Brief Description of the Drawings]

FIG. 1 is a diagram comparing the amino acid sequence of the KAS-MCT domain in ORF5 of the EPA synthase gene group of Shewanella strain SCRC2738 with the amino acid sequences of KAS domains and MCT domains derived from microorganisms.

FIG. 2 is a diagram comparing the amino acid sequence of the KAS-MCT fragment of Moritella marina strain MP-1 obtained by PCR with the amino acid sequence of the KAS-MCT domain in ORF5 of the EPA synthase gene group derived from Shewanella strain SCRC2738.

FIG. 3 is a diagram outlining the ORFs contained in cosmid clone p3D5.

FIG. 4 is a diagram comparing the domain structures found in cosmid clone p3D5 (A) and in the EPA synthase gene group of Shewanella strain SCRC2738 (B).

Continued from the front page
(51) Int.Cl.7, identification symbol, FI, theme code (reference): C12R 1:01) C12R 1:01)
(72) Inventor: Hidetoshi Okuyama, c/o Hokkaido National Industrial Research Institute, Agency of Industrial Science and Technology, 2-1, Tsukisamu-Higashi 2-jo 17-chome, Toyohira-ku, Sapporo, Hokkaido
F-terms (reference): 4B024 AA05 BA10 CA03 DA05 DA11 EA04 GA11; 4B050 CC03 DD02 LL02; 4B064 AD90 CA02 CA19 CC24 DA06 DA10; 4H045 AA10 BA10 CA11 DA89 EA01 FA72 FA74

Claims (16)

[Claims]
1. A DNA encoding a group of proteins similar to the icosapentaenoic acid biosynthetic enzyme group, the DNA being derived from a bacterium capable of producing docosahexaenoic acid.
2. The DNA according to claim 1, wherein the bacterium belongs to the genus Moritella.
3. The DNA according to claim 1, comprising:
(i) a base sequence encoding a protein that comprises an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 3, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group;
(ii) a base sequence encoding a protein that comprises an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 5, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group;
(iii) a base sequence encoding a protein that comprises an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 7, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group; and
(iv) a base sequence encoding a protein that comprises an amino acid sequence in which one or more amino acids may be substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 9, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
4. The DNA according to claim 1, comprising the nucleotide sequence represented by SEQ ID NO: 1.
5. A protein shown in (1) or (2) below:
(1) a protein comprising the amino acid sequence represented by SEQ ID NO: 3;
(2) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 3, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
6. A DNA encoding a protein shown in (1) or (2) below:
(1) a protein comprising the amino acid sequence represented by SEQ ID NO: 3;
(2) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 3, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
7. The DNA according to claim 6, comprising the nucleotide sequence represented by SEQ ID NO: 2.
8. A protein shown in (3) or (4) below:
(3) a protein comprising the amino acid sequence represented by SEQ ID NO: 5;
(4) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 5, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
9. A DNA encoding a protein shown in (3) or (4) below:
(3) a protein comprising the amino acid sequence represented by SEQ ID NO: 5;
(4) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 5, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
10. The DNA according to claim 9, comprising the nucleotide sequence represented by SEQ ID NO: 4.
11. A protein shown in (5) or (6) below:
(5) a protein comprising the amino acid sequence represented by SEQ ID NO: 7;
(6) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 7, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
12. A DNA encoding a protein shown in (5) or (6) below:
(5) a protein comprising the amino acid sequence represented by SEQ ID NO: 7;
(6) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 7, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
13. The DNA according to claim 12, comprising the nucleotide sequence represented by SEQ ID NO: 6.
14. A protein shown in (7) or (8) below:
(7) a protein comprising the amino acid sequence represented by SEQ ID NO: 9;
(8) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 9, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
15. A DNA encoding a protein shown in (7) or (8) below:
(7) a protein comprising the amino acid sequence represented by SEQ ID NO: 9;
(8) a protein that comprises an amino acid sequence in which one or more amino acids have been substituted, deleted, added or inserted in the amino acid sequence represented by SEQ ID NO: 9, and that can function as a member of the docosahexaenoic acid biosynthetic enzyme group.
16. The DNA according to claim 15, comprising the nucleotide sequence represented by SEQ ID NO: 8.
JP35661499A 1999-12-15 1999-12-15 Gene derived from docosahexaenoic acid-producing bacterium Pending JP2001169780A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP35661499A JP2001169780A (en) 1999-12-15 1999-12-15 Gene derived from docosahexaenoic acid-producing bacterium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP35661499A JP2001169780A (en) 1999-12-15 1999-12-15 Gene derived from docosahexaenoic acid-producing bacterium

Publications (1)

Publication Number Publication Date
JP2001169780A true JP2001169780A (en) 2001-06-26

Family

ID=18449912

Family Applications (1)

Application Number Title Priority Date Filing Date
JP35661499A Pending JP2001169780A (en) 1999-12-15 1999-12-15 Gene derived from docosahexaenoic acid-producing bacterium

Country Status (1)

Country Link
JP (1) JP2001169780A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7807849B2 (en) 2004-04-22 2010-10-05 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US7834250B2 (en) 2004-04-22 2010-11-16 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US8809559B2 (en) 2008-11-18 2014-08-19 Commonwealth Scientific And Industrial Research Organisation Enzymes and methods for producing omega-3 fatty acids
US8816111B2 (en) 2012-06-15 2014-08-26 Commonwealth Scientific And Industrial Research Organisation Lipid comprising polyunsaturated fatty acids
US8816106B2 (en) 2006-08-29 2014-08-26 Commonwealth Scientific And Industrial Research Organisation Synthesis of fatty acids
US9718759B2 (en) 2013-12-18 2017-08-01 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid
US10005713B2 (en) 2014-06-27 2018-06-26 Commonwealth Scientific And Industrial Research Organisation Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the sn-2 position

Cited By (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9453183B2 (en) 2004-04-22 2016-09-27 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US7834250B2 (en) 2004-04-22 2010-11-16 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US10443079B2 (en) 2004-04-22 2019-10-15 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US8071341B2 (en) 2004-04-22 2011-12-06 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US8106226B2 (en) 2004-04-22 2012-01-31 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US8158392B1 (en) 2004-04-22 2012-04-17 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US8535917B2 (en) 2004-04-22 2013-09-17 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US8575377B2 (en) 2004-04-22 2013-11-05 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US8778644B2 (en) 2004-04-22 2014-07-15 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US9458410B2 (en) 2004-04-22 2016-10-04 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US9963723B2 (en) 2004-04-22 2018-05-08 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US9970033B2 (en) 2004-04-22 2018-05-15 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US8853432B2 (en) 2004-04-22 2014-10-07 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US10781463B2 (en) 2004-04-22 2020-09-22 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US7932438B2 (en) 2004-04-22 2011-04-26 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US11220698B2 (en) 2004-04-22 2022-01-11 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US9951357B2 (en) 2004-04-22 2018-04-24 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US9994880B2 (en) 2004-04-22 2018-06-12 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US7807849B2 (en) 2004-04-22 2010-10-05 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US11597953B2 (en) 2004-04-22 2023-03-07 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cells
US9926579B2 (en) 2004-04-22 2018-03-27 Commonwealth Scientific And Industrial Research Organisation Synthesis of long-chain polyunsaturated fatty acids by recombinant cell
US8816106B2 (en) 2006-08-29 2014-08-26 Commonwealth Scientific And Industrial Research Organisation Synthesis of fatty acids
US10513717B2 (en) 2006-08-29 2019-12-24 Commonwealth Scientific And Industrial Research Organisation Synthesis of fatty acids
US9938486B2 (en) 2008-11-18 2018-04-10 Commonwealth Scientific And Industrial Research Organisation Enzymes and methods for producing omega-3 fatty acids
US8809559B2 (en) 2008-11-18 2014-08-19 Commonwealth Scientific And Industrial Research Organisation Enzymes and methods for producing omega-3 fatty acids
US9550718B2 (en) 2012-06-15 2017-01-24 Commonwealth Scientific And Industrial Research Organisation Lipid comprising polyunsaturated fatty acids
US9932289B2 (en) 2012-06-15 2018-04-03 Commonwealth Scientific And Industrial Research Organisation Process for producing ethyl esters of polyunsaturated fatty acids
US9556102B2 (en) 2012-06-15 2017-01-31 Commonwealth Scientific And Industrial Research Organisation Process for producing ethyl esters of polyunsaturated fatty acids
US10335386B2 (en) 2012-06-15 2019-07-02 Commonwealth Scientific And Industrial Research Organisation Lipid comprising polyunsaturated fatty acids
US8946460B2 (en) 2012-06-15 2015-02-03 Commonwealth Scientific And Industrial Research Organisation Process for producing polyunsaturated fatty acids in an esterified form
US8816111B2 (en) 2012-06-15 2014-08-26 Commonwealth Scientific And Industrial Research Organisation Lipid comprising polyunsaturated fatty acids
US9718759B2 (en) 2013-12-18 2017-08-01 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid
US10190073B2 (en) 2013-12-18 2019-01-29 Commonwealth Scientific And Industrial Research Organisation Lipid comprising long chain polyunsaturated fatty acids
US10125084B2 (en) 2013-12-18 2018-11-13 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid
US10800729B2 (en) 2013-12-18 2020-10-13 Commonwealth Scientific And Industrial Research Organisation Lipid comprising long chain polyunsaturated fatty acids
US9725399B2 (en) 2013-12-18 2017-08-08 Commonwealth Scientific And Industrial Research Organisation Lipid comprising long chain polyunsaturated fatty acids
US11623911B2 (en) 2013-12-18 2023-04-11 Commonwealth Scientific And Industrial Research Organisation Lipid comprising docosapentaenoic acid
US10793507B2 (en) 2014-06-27 2020-10-06 Commonwealth Scientific And Industrial Research Organisation Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the SN-2 position
US10005713B2 (en) 2014-06-27 2018-06-26 Commonwealth Scientific And Industrial Research Organisation Lipid compositions comprising triacylglycerol with long-chain polyunsaturated fatty acids at the sn-2 position

Similar Documents

Publication Publication Date Title
KR101539470B1 (en) Chimeric pufa polyketide synthase systems and uses thereof
KR101506347B1 (en) Plant seed oils containing polyunsaturated fatty acids
KR20070084187A (en) Pufa polyketide synthase systems and uses thereof
TW201038734A (en) Polyunsaturated fatty acid synthase nucleic acid molecules and polypeptides, compositions, and methods of making and uses thereof
MXPA01007153A (en) Schizochytrium pks genes.
KR101234198B1 (en) PUFA Polyketide Synthase Systems and Uses Thereof
CN108368491A (en) The algae mutant of lipid production rate with raising
AU673359B2 (en) Gene which codes for eicosapentaenoic acid synthetase group and process for producing eicosapentaenoic acid
US6908992B2 (en) Methanotrophic carbon metabolism pathway genes and enzymes
JP2001169780A (en) Gene derived from docosahexaenoic acid-producing bacterium
EP0836611A1 (en) Sequences for production of 2,4-diacetylphloroglucinol and methods
US6537786B2 (en) Genes encoding exopolysaccharide production
US20030157673A1 (en) Genes involved in cyclododecanone degradation pathway
JP4221476B2 (en) Plasmid cloned icosapentaenoic acid biosynthesis genes and cyanobacteria producing icosapentaenoic acid
CA2391131C (en) Genes and proteins for rosaramicin biosynthesis
CN1325959B (en) Genes from genome
US20030215930A1 (en) Genes involved in cyclododecanone degradation pathway
KR20130097538A (en) Chejuenolide biosynthetic gene cluster from hahella chejuensis
KR20110092510A (en) Tridecaptin synthetase and gene thereof
JP5110511B2 (en) Method for producing highly unsaturated fatty acids and highly unsaturated lipids using microorganisms
JPWO2009147984A1 (en) DNA encoding a polypeptide involved in the biosynthesis of herboxidiene
CN101142313A (en) Genes encoding the synthetic pathway for the production of disorazole
JPH0646864A (en) Gene capable of coding eicosapentaenoic acid synthase and production of eicosapentaenoic acid
JPH08242867A (en) Gene coding for biosynthetic enzyme group for eicosapentaenoic acid and production of eicosapentaenoic acid
JP2002315579A (en) Structural gene on gene cluster