US20140053299A1 - PUFA Polyketide Synthase Systems and Uses Thereof - Google Patents
PUFA Polyketide Synthase Systems and Uses Thereof Download PDFInfo
- Publication number
- US20140053299A1 US20140053299A1 US13/771,840 US201313771840A US2014053299A1 US 20140053299 A1 US20140053299 A1 US 20140053299A1 US 201313771840 A US201313771840 A US 201313771840A US 2014053299 A1 US2014053299 A1 US 2014053299A1
- Authority
- US
- United States
- Prior art keywords
- seq
- plant
- acid sequence
- nucleic acid
- pufa
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 235000020777 polyunsaturated fatty acids Nutrition 0.000 title claims abstract description 566
- 108010030975 Polyketide Synthases Proteins 0.000 title abstract description 351
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 288
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 163
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 163
- 239000012634 fragment Substances 0.000 claims abstract description 35
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 204
- MBMBGCFOFBJSGT-KUBAVDMBSA-N docosahexaenoic acid Natural products CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O MBMBGCFOFBJSGT-KUBAVDMBSA-N 0.000 claims description 174
- 150000004665 fatty acids Chemical class 0.000 claims description 145
- 239000000194 fatty acid Substances 0.000 claims description 142
- 235000014113 dietary fatty acids Nutrition 0.000 claims description 141
- 229930195729 fatty acid Natural products 0.000 claims description 141
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 124
- 230000000694 effects Effects 0.000 claims description 96
- 235000020669 docosahexaenoic acid Nutrition 0.000 claims description 91
- 150000001413 amino acids Chemical class 0.000 claims description 90
- 229940090949 docosahexaenoic acid Drugs 0.000 claims description 83
- YUFFSWGQGVEMMI-JLNKQSITSA-N (7Z,10Z,13Z,16Z,19Z)-docosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCCCC(O)=O YUFFSWGQGVEMMI-JLNKQSITSA-N 0.000 claims description 51
- 235000020664 gamma-linolenic acid Nutrition 0.000 claims description 46
- VZCCETWTMQHEPK-QNEBEIHSSA-N gamma-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCC(O)=O VZCCETWTMQHEPK-QNEBEIHSSA-N 0.000 claims description 46
- 239000011203 carbon fibre reinforced carbon Substances 0.000 claims description 42
- AVKOENOBFIYBSA-WMPRHZDHSA-N (4Z,7Z,10Z,13Z,16Z)-docosa-4,7,10,13,16-pentaenoic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCC(O)=O AVKOENOBFIYBSA-WMPRHZDHSA-N 0.000 claims description 23
- VZCCETWTMQHEPK-UHFFFAOYSA-N gamma-Linolensaeure Natural products CCCCCC=CCC=CCC=CCCCCC(O)=O VZCCETWTMQHEPK-UHFFFAOYSA-N 0.000 claims description 21
- 229960002733 gamolenic acid Drugs 0.000 claims description 21
- HOBAELRKJCKHQD-QNEBEIHSSA-N dihomo-γ-linolenic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/CCCCCCC(O)=O HOBAELRKJCKHQD-QNEBEIHSSA-N 0.000 claims description 15
- HOBAELRKJCKHQD-UHFFFAOYSA-N (8Z,11Z,14Z)-8,11,14-eicosatrienoic acid Natural products CCCCCC=CCC=CCC=CCCCCCCC(O)=O HOBAELRKJCKHQD-UHFFFAOYSA-N 0.000 claims description 14
- 239000000203 mixture Substances 0.000 claims description 13
- DTOSIQBPPRVQHS-PDBXOOCHSA-N alpha-linolenic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCC(O)=O DTOSIQBPPRVQHS-PDBXOOCHSA-N 0.000 claims description 12
- 235000021294 Docosapentaenoic acid Nutrition 0.000 claims description 8
- 235000021298 Dihomo-γ-linolenic acid Nutrition 0.000 claims description 6
- 235000013305 food Nutrition 0.000 claims description 6
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 claims description 6
- OYHQOLUKZRVURQ-HZJYTTRNSA-N Linoleic acid Chemical compound CCCCC\C=C/C\C=C/CCCCCCCC(O)=O OYHQOLUKZRVURQ-HZJYTTRNSA-N 0.000 claims description 5
- 235000020661 alpha-linolenic acid Nutrition 0.000 claims description 5
- 229960004488 linolenic acid Drugs 0.000 claims description 5
- WRIDQFICGBMAFQ-UHFFFAOYSA-N (E)-8-Octadecenoic acid Natural products CCCCCCCCCC=CCCCCCCC(O)=O WRIDQFICGBMAFQ-UHFFFAOYSA-N 0.000 claims description 4
- LQJBNNIYVWPHFW-UHFFFAOYSA-N 20:1omega9c fatty acid Natural products CCCCCCCCCCC=CCCCCCCCC(O)=O LQJBNNIYVWPHFW-UHFFFAOYSA-N 0.000 claims description 4
- QSBYPNXLFMSGKH-UHFFFAOYSA-N 9-Heptadecensaeure Natural products CCCCCCCC=CCCCCCCCC(O)=O QSBYPNXLFMSGKH-UHFFFAOYSA-N 0.000 claims description 4
- ZQPPMHVWECSIRJ-UHFFFAOYSA-N Oleic acid Natural products CCCCCCCCC=CCCCCCCCC(O)=O ZQPPMHVWECSIRJ-UHFFFAOYSA-N 0.000 claims description 4
- 239000005642 Oleic acid Substances 0.000 claims description 4
- QXJSBBXBKPUZAA-UHFFFAOYSA-N isooleic acid Natural products CCCCCCCC=CCCCCCCCCC(O)=O QXJSBBXBKPUZAA-UHFFFAOYSA-N 0.000 claims description 4
- KQQKGWQCNNTQJW-UHFFFAOYSA-N linolenic acid Natural products CC=CCCC=CCC=CCCCCCCCC(O)=O KQQKGWQCNNTQJW-UHFFFAOYSA-N 0.000 claims description 4
- 235000020778 linoleic acid Nutrition 0.000 claims description 3
- OYHQOLUKZRVURQ-IXWMQOLASA-N linoleic acid Natural products CCCCC\C=C/C\C=C\CCCCCCCC(O)=O OYHQOLUKZRVURQ-IXWMQOLASA-N 0.000 claims description 3
- VKOBVWXKNCXXDE-UHFFFAOYSA-N icosanoic acid Chemical compound CCCCCCCCCCCCCCCCCCCC(O)=O VKOBVWXKNCXXDE-UHFFFAOYSA-N 0.000 claims description 2
- 108090000623 proteins and genes Proteins 0.000 abstract description 263
- 102000004169 proteins and genes Human genes 0.000 abstract description 164
- 241000233671 Schizochytrium Species 0.000 abstract description 83
- 244000005700 microbiome Species 0.000 abstract description 73
- 238000000034 method Methods 0.000 abstract description 60
- 230000000975 bioactive effect Effects 0.000 abstract description 43
- 235000003869 genetically modified organism Nutrition 0.000 abstract description 19
- 150000002632 lipids Chemical class 0.000 abstract description 12
- 241000196324 Embryophyta Species 0.000 description 282
- 235000018102 proteins Nutrition 0.000 description 159
- 239000002773 nucleotide Substances 0.000 description 132
- 125000003729 nucleotide group Chemical group 0.000 description 131
- 235000001014 amino acid Nutrition 0.000 description 92
- 229940024606 amino acid Drugs 0.000 description 89
- 210000004027 cell Anatomy 0.000 description 87
- 239000000047 product Substances 0.000 description 86
- 102000004190 Enzymes Human genes 0.000 description 58
- 108090000790 Enzymes Proteins 0.000 description 58
- 101710146995 Acyl carrier protein Proteins 0.000 description 55
- 230000014509 gene expression Effects 0.000 description 55
- 238000004519 manufacturing process Methods 0.000 description 54
- 108010019608 3-Oxoacyl-(Acyl-Carrier-Protein) Synthase Proteins 0.000 description 52
- 102100037149 3-oxoacyl-[acyl-carrier-protein] synthase, mitochondrial Human genes 0.000 description 51
- 239000002299 complementary DNA Substances 0.000 description 50
- 239000003921 oil Substances 0.000 description 50
- 235000019198 oils Nutrition 0.000 description 50
- 238000012239 gene modification Methods 0.000 description 44
- 230000005017 genetic modification Effects 0.000 description 44
- 235000013617 genetically modified food Nutrition 0.000 description 44
- 108700016155 Acyl transferases Proteins 0.000 description 43
- 239000013598 vector Substances 0.000 description 43
- 102000057234 Acyl transferases Human genes 0.000 description 42
- 101001014220 Monascus pilosus Dehydrogenase mokE Proteins 0.000 description 42
- 101000573542 Penicillium citrinum Compactin nonaketide synthase, enoyl reductase component Proteins 0.000 description 42
- 239000013612 plasmid Substances 0.000 description 41
- 230000037361 pathway Effects 0.000 description 40
- 230000004071 biological effect Effects 0.000 description 38
- 230000004048 modification Effects 0.000 description 35
- 238000012986 modification Methods 0.000 description 35
- 241000588724 Escherichia coli Species 0.000 description 34
- 230000001580 bacterial effect Effects 0.000 description 32
- 108020004414 DNA Proteins 0.000 description 30
- 230000006870 function Effects 0.000 description 28
- 108010001814 phosphopantetheinyl transferase Proteins 0.000 description 27
- JAZBEHYOTPTENJ-JLNKQSITSA-N all-cis-5,8,11,14,17-icosapentaenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O JAZBEHYOTPTENJ-JLNKQSITSA-N 0.000 description 26
- 150000001875 compounds Chemical class 0.000 description 25
- 235000020673 eicosapentaenoic acid Nutrition 0.000 description 25
- 108700026244 Open Reading Frames Proteins 0.000 description 23
- 230000015572 biosynthetic process Effects 0.000 description 23
- 238000006243 chemical reaction Methods 0.000 description 23
- 239000002609 medium Substances 0.000 description 23
- 102100022089 Acyl-[acyl-carrier-protein] hydrolase Human genes 0.000 description 22
- 108010039731 Fatty Acid Synthases Proteins 0.000 description 22
- 241001467333 Thraustochytriaceae Species 0.000 description 22
- 239000006227 byproduct Substances 0.000 description 22
- 235000019387 fatty acid methyl ester Nutrition 0.000 description 21
- 238000009396 hybridization Methods 0.000 description 21
- 239000013067 intermediate product Substances 0.000 description 21
- 238000003786 synthesis reaction Methods 0.000 description 21
- 108020004705 Codon Proteins 0.000 description 20
- 241000598397 Schizochytrium sp. Species 0.000 description 20
- 241000894006 Bacteria Species 0.000 description 19
- YZXBAPSDXZZRGB-DOFZRALJSA-N arachidonic acid Chemical compound CCCCC\C=C/C\C=C/C\C=C/C\C=C/CCCC(O)=O YZXBAPSDXZZRGB-DOFZRALJSA-N 0.000 description 19
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 18
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 18
- LTYOQGRJFJAKNA-KKIMTKSISA-N Malonyl CoA Natural products S(C(=O)CC(=O)O)CCNC(=O)CCNC(=O)[C@@H](O)C(CO[P@](=O)(O[P@](=O)(OC[C@H]1[C@@H](OP(=O)(O)O)[C@@H](O)[C@@H](n2c3ncnc(N)c3nc2)O1)O)O)(C)C LTYOQGRJFJAKNA-KKIMTKSISA-N 0.000 description 17
- 230000008859 change Effects 0.000 description 17
- 235000020978 long-chain polyunsaturated fatty acids Nutrition 0.000 description 17
- LTYOQGRJFJAKNA-DVVLENMVSA-N malonyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LTYOQGRJFJAKNA-DVVLENMVSA-N 0.000 description 17
- 210000002706 plastid Anatomy 0.000 description 17
- 230000001105 regulatory effect Effects 0.000 description 17
- 102000000157 3-oxoacyl-(acyl-carrier-protein) reductase Human genes 0.000 description 16
- 108010055468 3-oxoacyl-(acyl-carrier-protein) reductase Proteins 0.000 description 16
- 239000007795 chemical reaction product Substances 0.000 description 16
- 238000010367 cloning Methods 0.000 description 16
- 230000009466 transformation Effects 0.000 description 16
- 108090000765 processed proteins & peptides Proteins 0.000 description 15
- 241000894007 species Species 0.000 description 15
- 238000004458 analytical method Methods 0.000 description 14
- 101000623276 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiBIM 5'region Proteins 0.000 description 13
- 101000623175 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiCIIM 5'region Proteins 0.000 description 13
- 101000626850 Herpetosiphon aurantiacus Uncharacterized 10.2 kDa protein in HgiEIM 5'region Proteins 0.000 description 13
- 108010076504 Protein Sorting Signals Proteins 0.000 description 13
- 125000000539 amino acid group Chemical group 0.000 description 13
- 239000000543 intermediate Substances 0.000 description 13
- 101710147134 Putative transposase for insertion sequence element IS986/IS6110 Proteins 0.000 description 12
- 101710099762 Uncharacterized protein in sodM 3'region Proteins 0.000 description 12
- 125000002252 acyl group Chemical group 0.000 description 12
- 229910052799 carbon Inorganic materials 0.000 description 12
- 230000000295 complement effect Effects 0.000 description 12
- 230000001965 increasing effect Effects 0.000 description 12
- 229920001184 polypeptide Polymers 0.000 description 12
- 102000004196 processed proteins & peptides Human genes 0.000 description 12
- 108700037654 Acyl carrier protein (ACP) Proteins 0.000 description 11
- 102000048456 Acyl carrier protein (ACP) Human genes 0.000 description 11
- 101000774529 Alkalihalobacillus pseudofirmus (strain ATCC BAA-2126 / JCM 17055 / OF4) Uncharacterized protein BpOF4_21049 Proteins 0.000 description 11
- 101000780391 Bacillus licheniformis Uncharacterized protein in ansA 5'region Proteins 0.000 description 11
- 101000818144 Bacillus subtilis (strain 168) Uncharacterized oxidoreductase YusZ Proteins 0.000 description 11
- -1 C18.0 Chemical compound 0.000 description 11
- 101000790711 Chlamydomonas reinhardtii Uncharacterized membrane protein ycf78 Proteins 0.000 description 11
- 101000950354 Clostridium perfringens (strain 13 / Type A) Uncharacterized protein CPE0190 Proteins 0.000 description 11
- 101000827225 Dichelobacter nodosus Uncharacterized protein in lpsA 5'region Proteins 0.000 description 11
- 101710198510 Enoyl-[acyl-carrier-protein] reductase [NADH] Proteins 0.000 description 11
- 101000599641 Escherichia coli (strain K12) Insertion element IS150 protein InsJ Proteins 0.000 description 11
- 101000819098 Escherichia coli Insertion element IS1397 uncharacterized 20.1 kDa protein Proteins 0.000 description 11
- 101000707213 Escherichia coli Insertion element IS150 uncharacterized 14 kDa protein Proteins 0.000 description 11
- 101000977747 Escherichia coli Uncharacterized 31.7 kDa protein in traX-finO intergenic region Proteins 0.000 description 11
- 101000763543 Escherichia coli Uncharacterized endonuclease Proteins 0.000 description 11
- 101000758678 Escherichia phage P1 Uncharacterized 36.0 kDa protein in doc-Gp10 intergenic region Proteins 0.000 description 11
- 101000721155 Herpetosiphon aurantiacus Control protein C.HgiCIP Proteins 0.000 description 11
- 101000828374 Lactobacillus johnsonii Insertion element IS1223 uncharacterized 20.7 kDa protein Proteins 0.000 description 11
- 101000750781 Listeria monocytogenes serovar 1/2a (strain ATCC BAA-679 / EGD-e) Uncharacterized oxidoreductase Lmo0432 Proteins 0.000 description 11
- 101000618025 Methylorubrum extorquens (strain ATCC 14718 / DSM 1338 / JCM 2805 / NCIMB 9133 / AM1) Uncharacterized oxidoreductase MexAM1_META1p0182 Proteins 0.000 description 11
- 101000778792 Mycoplasma capricolum subsp. capricolum (strain California kid / ATCC 27343 / NCTC 10154) Putative kinase MCAP_0235 Proteins 0.000 description 11
- 101000861628 Mycoplasma capricolum subsp. capricolum (strain California kid / ATCC 27343 / NCTC 10154) Uncharacterized lipoprotein MCAP_0231 Proteins 0.000 description 11
- 101000707209 Mycoplasma mycoides subsp. mycoides SC Insertion element IS1296 uncharacterized 21.4 kDa protein Proteins 0.000 description 11
- 101000819697 Podospora anserina (strain S / ATCC MYA-4624 / DSM 980 / FGSC 10383) Uncharacterized 34.3 kDa protein in ATP6 5'region Proteins 0.000 description 11
- 101000770286 Rhizobium meliloti Uncharacterized protein ORF8 in nfe locus Proteins 0.000 description 11
- 101000791677 Rhizobium meliloti Uncharacterized protein in ackA 5'region Proteins 0.000 description 11
- 101000759701 Thermus thermophilus Uncharacterized protein in scsB 5'region Proteins 0.000 description 11
- 101000623306 Trypanosoma brucei brucei Uncharacterized 1.9 kDa protein in aldolase locus Proteins 0.000 description 11
- 101000679337 Zea mays Putative AC transposase Proteins 0.000 description 11
- 210000000349 chromosome Anatomy 0.000 description 11
- 238000000855 fermentation Methods 0.000 description 11
- 230000004151 fermentation Effects 0.000 description 11
- 238000004817 gas chromatography Methods 0.000 description 11
- 231100000350 mutagenesis Toxicity 0.000 description 11
- 235000015112 vegetable and seed oil Nutrition 0.000 description 11
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical group [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 239000013604 expression vector Substances 0.000 description 10
- 238000012546 transfer Methods 0.000 description 10
- 241000219194 Arabidopsis Species 0.000 description 9
- 208000016444 Benign adult familial myoclonic epilepsy Diseases 0.000 description 9
- 241000206602 Eukaryota Species 0.000 description 9
- 241001491672 Labyrinthulaceae Species 0.000 description 9
- 241000863430 Shewanella Species 0.000 description 9
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- 208000016427 familial adult myoclonic epilepsy Diseases 0.000 description 9
- 230000002068 genetic effect Effects 0.000 description 9
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 9
- 238000002703 mutagenesis Methods 0.000 description 9
- 230000036961 partial effect Effects 0.000 description 9
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 9
- 235000004400 serine Nutrition 0.000 description 9
- 239000004474 valine Substances 0.000 description 9
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 8
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 8
- 240000006240 Linum usitatissimum Species 0.000 description 8
- 241001465754 Metazoa Species 0.000 description 8
- 108091034117 Oligonucleotide Proteins 0.000 description 8
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 8
- 108091081024 Start codon Proteins 0.000 description 8
- 229940009098 aspartate Drugs 0.000 description 8
- 230000003247 decreasing effect Effects 0.000 description 8
- 239000003814 drug Substances 0.000 description 8
- 229940079593 drug Drugs 0.000 description 8
- 235000004426 flaxseed Nutrition 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 230000000813 microbial effect Effects 0.000 description 8
- 150000003904 phospholipids Chemical class 0.000 description 8
- 239000000523 sample Substances 0.000 description 8
- 239000000758 substrate Substances 0.000 description 8
- 238000001890 transfection Methods 0.000 description 8
- 230000009261 transgenic effect Effects 0.000 description 8
- 238000011144 upstream manufacturing Methods 0.000 description 8
- 101001110310 Lentilactobacillus kefiri NADP-dependent (R)-specific alcohol dehydrogenase Proteins 0.000 description 7
- 241000382353 Pupa Species 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 229930195712 glutamate Natural products 0.000 description 7
- PEDCQBHIVMGVHV-UHFFFAOYSA-N glycerol group Chemical group OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 7
- 230000035772 mutation Effects 0.000 description 7
- 239000013600 plasmid vector Substances 0.000 description 7
- 108091008146 restriction endonucleases Proteins 0.000 description 7
- 230000008685 targeting Effects 0.000 description 7
- 210000001519 tissue Anatomy 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- ZHNUHDYFZUAESO-UHFFFAOYSA-N Formamide Chemical compound NC=O ZHNUHDYFZUAESO-UHFFFAOYSA-N 0.000 description 6
- 241000233866 Fungi Species 0.000 description 6
- 241000192673 Nostoc sp. Species 0.000 description 6
- ZNXZGRMVNNHPCA-UHFFFAOYSA-N Pantetheine Natural products OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS ZNXZGRMVNNHPCA-UHFFFAOYSA-N 0.000 description 6
- 241000233675 Thraustochytrium Species 0.000 description 6
- 108700019146 Transgenes Proteins 0.000 description 6
- HQPCSDADVLFHHO-LTKCOYKYSA-N all-cis-8,11,14,17-icosatetraenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/CCCCCCC(O)=O HQPCSDADVLFHHO-LTKCOYKYSA-N 0.000 description 6
- 239000003242 anti bacterial agent Substances 0.000 description 6
- 235000021342 arachidonic acid Nutrition 0.000 description 6
- 229940114079 arachidonic acid Drugs 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 238000010353 genetic engineering Methods 0.000 description 6
- 230000012010 growth Effects 0.000 description 6
- 239000012528 membrane Substances 0.000 description 6
- ZNXZGRMVNNHPCA-VIFPVBQESA-N pantetheine Chemical compound OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS ZNXZGRMVNNHPCA-VIFPVBQESA-N 0.000 description 6
- 238000003752 polymerase chain reaction Methods 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- DCXXMTOCNZCJGO-UHFFFAOYSA-N tristearoylglycerol Chemical compound CCCCCCCCCCCCCCCCCC(=O)OCC(OC(=O)CCCCCCCCCCCCCCCCC)COC(=O)CCCCCCCCCCCCCCCCC DCXXMTOCNZCJGO-UHFFFAOYSA-N 0.000 description 6
- 108091026890 Coding region Proteins 0.000 description 5
- 102100026384 L-aminoadipate-semialdehyde dehydrogenase-phosphopantetheinyl transferase Human genes 0.000 description 5
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 5
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 5
- 241001467308 Labyrinthuloides Species 0.000 description 5
- 235000004431 Linum usitatissimum Nutrition 0.000 description 5
- 241000294598 Moritella marina Species 0.000 description 5
- 108700021044 acyl-ACP thioesterase Proteins 0.000 description 5
- AHANXAKGNAKFSK-PDBXOOCHSA-N all-cis-icosa-11,14,17-trienoic acid Chemical compound CC\C=C/C\C=C/C\C=C/CCCCCCCCCC(O)=O AHANXAKGNAKFSK-PDBXOOCHSA-N 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 230000007423 decrease Effects 0.000 description 5
- PRHHYVQTPBEDFE-UHFFFAOYSA-N eicosatrienoic acid Natural products CCCCCC=CCC=CCCCCC=CCCCC(O)=O PRHHYVQTPBEDFE-UHFFFAOYSA-N 0.000 description 5
- 230000002255 enzymatic effect Effects 0.000 description 5
- 238000001727 in vivo Methods 0.000 description 5
- 229930182817 methionine Natural products 0.000 description 5
- 235000013336 milk Nutrition 0.000 description 5
- 210000004080 milk Anatomy 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 238000006467 substitution reaction Methods 0.000 description 5
- 238000013519 translation Methods 0.000 description 5
- 235000021122 unsaturated fatty acids Nutrition 0.000 description 5
- 150000004670 unsaturated fatty acids Chemical class 0.000 description 5
- UNSRRHDPHVZAHH-YOILPLPUSA-N (5Z,8Z,11Z)-icosatrienoic acid Chemical compound CCCCCCCC\C=C/C\C=C/C\C=C/CCCC(O)=O UNSRRHDPHVZAHH-YOILPLPUSA-N 0.000 description 4
- UNSRRHDPHVZAHH-UHFFFAOYSA-N 6beta,11alpha-Dihydroxy-3alpha,5alpha-cyclopregnan-20-on Natural products CCCCCCCCC=CCC=CCC=CCCCC(O)=O UNSRRHDPHVZAHH-UHFFFAOYSA-N 0.000 description 4
- 241000589158 Agrobacterium Species 0.000 description 4
- 244000063299 Bacillus subtilis Species 0.000 description 4
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 4
- 241000195493 Cryptophyta Species 0.000 description 4
- OPGOLNDOMSBSCW-CLNHMMGSSA-N Fursultiamine hydrochloride Chemical compound Cl.C1CCOC1CSSC(\CCO)=C(/C)N(C=O)CC1=CN=C(C)N=C1N OPGOLNDOMSBSCW-CLNHMMGSSA-N 0.000 description 4
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 4
- 238000009825 accumulation Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- JIWBIWFOSCKQMA-LTKCOYKYSA-N all-cis-octadeca-6,9,12,15-tetraenoic acid Chemical compound CC\C=C/C\C=C/C\C=C/C\C=C/CCCCC(O)=O JIWBIWFOSCKQMA-LTKCOYKYSA-N 0.000 description 4
- 210000004102 animal cell Anatomy 0.000 description 4
- 229940088710 antibiotic agent Drugs 0.000 description 4
- HVYWMOMLDIMFJA-DPAQBDIFSA-N cholesterol Chemical compound C1C=C2C[C@@H](O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H]([C@H](C)CCCC(C)C)[C@@]1(C)CC2 HVYWMOMLDIMFJA-DPAQBDIFSA-N 0.000 description 4
- 230000004186 co-expression Effects 0.000 description 4
- 238000012258 culturing Methods 0.000 description 4
- 210000000805 cytoplasm Anatomy 0.000 description 4
- 238000011161 development Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 230000004136 fatty acid synthesis Effects 0.000 description 4
- 150000002185 fatty acyl-CoAs Chemical class 0.000 description 4
- 125000001924 fatty-acyl group Chemical group 0.000 description 4
- 238000009472 formulation Methods 0.000 description 4
- 235000021588 free fatty acids Nutrition 0.000 description 4
- 239000001963 growth medium Substances 0.000 description 4
- 238000002955 isolation Methods 0.000 description 4
- 239000008267 milk Substances 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- WTJKGGKOPKCXLL-RRHRGVEJSA-N phosphatidylcholine Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@H](COP([O-])(=O)OCC[N+](C)(C)C)OC(=O)CCCCCCCC=CCCCCCCCC WTJKGGKOPKCXLL-RRHRGVEJSA-N 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000006722 reduction reaction Methods 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 4
- JIWBIWFOSCKQMA-UHFFFAOYSA-N stearidonic acid Natural products CCC=CCC=CCC=CCC=CCCCCC(O)=O JIWBIWFOSCKQMA-UHFFFAOYSA-N 0.000 description 4
- 230000009469 supplementation Effects 0.000 description 4
- 230000002194 synthesizing effect Effects 0.000 description 4
- 238000005406 washing Methods 0.000 description 4
- 241000251468 Actinopterygii Species 0.000 description 3
- 244000178993 Brassica juncea Species 0.000 description 3
- 235000011332 Brassica juncea Nutrition 0.000 description 3
- 235000014700 Brassica juncea var napiformis Nutrition 0.000 description 3
- 240000002791 Brassica napus Species 0.000 description 3
- 244000068988 Glycine max Species 0.000 description 3
- 235000010469 Glycine max Nutrition 0.000 description 3
- 108091029795 Intergenic region Proteins 0.000 description 3
- 241001491670 Labyrinthula Species 0.000 description 3
- 241000208204 Linum Species 0.000 description 3
- 241001491708 Macrocystis Species 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 244000061176 Nicotiana tabacum Species 0.000 description 3
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 3
- 241000192656 Nostoc Species 0.000 description 3
- 208000001132 Osteoporosis Diseases 0.000 description 3
- 108020004511 Recombinant DNA Proteins 0.000 description 3
- 241000490596 Shewanella sp. Species 0.000 description 3
- 241000864178 Sorodiplophrys Species 0.000 description 3
- 241000592344 Spermatophyta Species 0.000 description 3
- 102000004357 Transferases Human genes 0.000 description 3
- 108090000992 Transferases Proteins 0.000 description 3
- 241001491678 Ulkenia Species 0.000 description 3
- QTBSBXVTEAMEQO-HQMMCQRPSA-N acetic acid Chemical compound C[14C](O)=O QTBSBXVTEAMEQO-HQMMCQRPSA-N 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 238000001994 activation Methods 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 230000000903 blocking effect Effects 0.000 description 3
- 244000038559 crop plants Species 0.000 description 3
- 239000013078 crystal Substances 0.000 description 3
- 230000003412 degenerative effect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 238000010828 elution Methods 0.000 description 3
- 150000002148 esters Chemical class 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000000605 extraction Methods 0.000 description 3
- 230000002779 inactivation Effects 0.000 description 3
- 238000006317 isomerization reaction Methods 0.000 description 3
- 238000002372 labelling Methods 0.000 description 3
- 239000011159 matrix material Substances 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- VLKZOEOYAKHREP-UHFFFAOYSA-N n-Hexane Chemical compound CCCCCC VLKZOEOYAKHREP-UHFFFAOYSA-N 0.000 description 3
- 229930014626 natural product Natural products 0.000 description 3
- 208000015122 neurodegenerative disease Diseases 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 235000016709 nutrition Nutrition 0.000 description 3
- 235000020660 omega-3 fatty acid Nutrition 0.000 description 3
- 229930001119 polyketide Natural products 0.000 description 3
- 150000003881 polyketide derivatives Chemical class 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 210000001938 protoplast Anatomy 0.000 description 3
- 150000004671 saturated fatty acids Chemical class 0.000 description 3
- 235000003441 saturated fatty acids Nutrition 0.000 description 3
- 230000008117 seed development Effects 0.000 description 3
- 239000002689 soil Substances 0.000 description 3
- 150000007970 thio esters Chemical class 0.000 description 3
- 230000009452 underexpressoin Effects 0.000 description 3
- 230000003827 upregulation Effects 0.000 description 3
- 239000008158 vegetable oil Substances 0.000 description 3
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical compound O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 description 2
- 101000818089 Acholeplasma phage L2 Uncharacterized 25.6 kDa protein Proteins 0.000 description 2
- 101000935487 Agrobacterium fabrum (strain C58 / ATCC 33970) 3-oxopimeloyl-[acyl-carrier-protein] synthase Proteins 0.000 description 2
- 241000589156 Agrobacterium rhizogenes Species 0.000 description 2
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 2
- 241000003610 Aplanochytrium Species 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 101000770875 Autographa californica nuclear polyhedrosis virus Uncharacterized 14.2 kDa protein in PK1-LEF1 intergenic region Proteins 0.000 description 2
- 101150076489 B gene Proteins 0.000 description 2
- 235000014469 Bacillus subtilis Nutrition 0.000 description 2
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 2
- 235000006008 Brassica napus var napus Nutrition 0.000 description 2
- 240000000385 Brassica napus var. napus Species 0.000 description 2
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 2
- 101000736909 Campylobacter jejuni Probable nucleotidyltransferase Proteins 0.000 description 2
- 244000020518 Carthamus tinctorius Species 0.000 description 2
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 2
- 241001655287 Chlamydomyxa Species 0.000 description 2
- RGJOEKWQDUBAIZ-IBOSZNHHSA-N CoASH Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS)O[C@H]1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-IBOSZNHHSA-N 0.000 description 2
- 108700010070 Codon Usage Proteins 0.000 description 2
- 108020004635 Complementary DNA Proteins 0.000 description 2
- 241000989765 Diplophrys Species 0.000 description 2
- 235000021292 Docosatetraenoic acid Nutrition 0.000 description 2
- 108010087894 Fatty acid desaturases Proteins 0.000 description 2
- 102000009114 Fatty acid desaturases Human genes 0.000 description 2
- 229930186217 Glycolipid Natural products 0.000 description 2
- 101000748060 Haemophilus phage HP1 (strain HP1c1) Uncharacterized 8.3 kDa protein in rep-hol intergenic region Proteins 0.000 description 2
- 235000003222 Helianthus annuus Nutrition 0.000 description 2
- 102000004867 Hydro-Lyases Human genes 0.000 description 2
- 108090001042 Hydro-Lyases Proteins 0.000 description 2
- 241000003482 Japonochytrium Species 0.000 description 2
- 101000768313 Klebsiella pneumoniae Uncharacterized membrane protein in cps region Proteins 0.000 description 2
- SRBFZHDQGSBBOR-HWQSCIPKSA-N L-arabinopyranose Chemical compound O[C@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-HWQSCIPKSA-N 0.000 description 2
- 108091022912 Mannose-6-Phosphate Isomerase Proteins 0.000 description 2
- 102000048193 Mannose-6-phosphate isomerases Human genes 0.000 description 2
- 101000804418 Methanothermobacter thermautotrophicus (strain ATCC 29096 / DSM 1053 / JCM 10044 / NBRC 100330 / Delta H) Uncharacterized protein MTH_1463 Proteins 0.000 description 2
- 241000592260 Moritella Species 0.000 description 2
- 101000770870 Orgyia pseudotsugata multicapsid polyhedrosis virus Uncharacterized 37.2 kDa protein Proteins 0.000 description 2
- 102000004316 Oxidoreductases Human genes 0.000 description 2
- 108090000854 Oxidoreductases Proteins 0.000 description 2
- 240000009164 Petroselinum crispum Species 0.000 description 2
- 235000002770 Petroselinum crispum Nutrition 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 102000009105 Short Chain Dehydrogenase-Reductases Human genes 0.000 description 2
- 108010048287 Short Chain Dehydrogenase-Reductases Proteins 0.000 description 2
- 229930182558 Sterol Natural products 0.000 description 2
- 101710137500 T7 RNA polymerase Proteins 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102000044159 Ubiquitin Human genes 0.000 description 2
- 101710198378 Uncharacterized 10.8 kDa protein in cox-rep intergenic region Proteins 0.000 description 2
- 240000008042 Zea mays Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 2
- KNGQILZSJUUYIK-VIFPVBQESA-N [(2R)-4-hydroxy-3,3-dimethyl-1-oxo-1-[[3-oxo-3-(2-sulfanylethylamino)propyl]amino]butan-2-yl] dihydrogen phosphate Chemical group OCC(C)(C)[C@@H](OP(O)(O)=O)C(=O)NCCC(=O)NCCS KNGQILZSJUUYIK-VIFPVBQESA-N 0.000 description 2
- ZSLZBFCDCINBPY-ZSJPKINUSA-N acetyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)C)O[C@H]1N1C2=NC=NC(N)=C2N=C1 ZSLZBFCDCINBPY-ZSJPKINUSA-N 0.000 description 2
- 230000003213 activating effect Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 2
- 235000020244 animal milk Nutrition 0.000 description 2
- 230000001773 anti-convulsant effect Effects 0.000 description 2
- 230000001430 anti-depressive effect Effects 0.000 description 2
- 230000003110 anti-inflammatory effect Effects 0.000 description 2
- 239000001961 anticonvulsive agent Substances 0.000 description 2
- 239000000935 antidepressant agent Substances 0.000 description 2
- 229960003965 antiepileptics Drugs 0.000 description 2
- 239000002246 antineoplastic agent Substances 0.000 description 2
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 2
- 230000001851 biosynthetic effect Effects 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 230000032823 cell division Effects 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 235000012000 cholesterol Nutrition 0.000 description 2
- RGJOEKWQDUBAIZ-UHFFFAOYSA-N coenzime A Natural products OC1C(OP(O)(O)=O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 RGJOEKWQDUBAIZ-UHFFFAOYSA-N 0.000 description 2
- 239000005516 coenzyme A Substances 0.000 description 2
- 229940093530 coenzyme a Drugs 0.000 description 2
- 238000009833 condensation Methods 0.000 description 2
- 230000005494 condensation Effects 0.000 description 2
- 235000009508 confectionery Nutrition 0.000 description 2
- 238000011109 contamination Methods 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 235000018417 cysteine Nutrition 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 125000000151 cysteine group Chemical group N[C@@H](CS)C(=O)* 0.000 description 2
- 229940127089 cytotoxic agent Drugs 0.000 description 2
- 235000013365 dairy product Nutrition 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- KDTSHFARGAKYJN-UHFFFAOYSA-N dephosphocoenzyme A Natural products OC1C(O)C(COP(O)(=O)OP(O)(=O)OCC(C)(C)C(O)C(=O)NCCC(=O)NCCS)OC1N1C2=NC=NC(N)=C2N=C1 KDTSHFARGAKYJN-UHFFFAOYSA-N 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 235000015872 dietary supplement Nutrition 0.000 description 2
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 2
- 208000035475 disorder Diseases 0.000 description 2
- 230000003828 downregulation Effects 0.000 description 2
- 235000013399 edible fruits Nutrition 0.000 description 2
- JAZBEHYOTPTENJ-UHFFFAOYSA-N eicosapentaenoic acid Natural products CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O JAZBEHYOTPTENJ-UHFFFAOYSA-N 0.000 description 2
- 229960005135 eicosapentaenoic acid Drugs 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 125000004050 enoyl group Chemical group 0.000 description 2
- 238000006911 enzymatic reaction Methods 0.000 description 2
- 210000003495 flagella Anatomy 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- 230000004927 fusion Effects 0.000 description 2
- WHUUTDBJXJRKMK-VKHMYHEASA-L glutamate group Chemical group N[C@@H](CCC(=O)[O-])C(=O)[O-] WHUUTDBJXJRKMK-VKHMYHEASA-L 0.000 description 2
- 210000002288 golgi apparatus Anatomy 0.000 description 2
- 210000001990 heterocyst Anatomy 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 125000004435 hydrogen atom Chemical group [H]* 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 230000000977 initiatory effect Effects 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 208000019423 liver disease Diseases 0.000 description 2
- 150000004668 long chain fatty acids Chemical class 0.000 description 2
- 210000003712 lysosome Anatomy 0.000 description 2
- 230000001868 lysosomic effect Effects 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 210000003470 mitochondria Anatomy 0.000 description 2
- 230000011278 mitosis Effects 0.000 description 2
- 230000000869 mutational effect Effects 0.000 description 2
- 230000004770 neurodegeneration Effects 0.000 description 2
- 210000000633 nuclear envelope Anatomy 0.000 description 2
- 230000008520 organization Effects 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 238000012261 overproduction Methods 0.000 description 2
- 239000001301 oxygen Substances 0.000 description 2
- 229910052760 oxygen Inorganic materials 0.000 description 2
- 239000008188 pellet Substances 0.000 description 2
- 238000010647 peptide synthesis reaction Methods 0.000 description 2
- 239000008194 pharmaceutical composition Substances 0.000 description 2
- 239000000546 pharmaceutical excipient Substances 0.000 description 2
- 239000010773 plant oil Substances 0.000 description 2
- 229930001118 polyketide hybrid Natural products 0.000 description 2
- 125000003308 polyketide hybrid group Chemical group 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000037452 priming Effects 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 235000014347 soups Nutrition 0.000 description 2
- 235000003702 sterols Nutrition 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 239000000725 suspension Substances 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 150000003626 triacylglycerols Chemical class 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- DVSZKTAMJJTWFG-SKCDLICFSA-N (2e,4e,6e,8e,10e,12e)-docosa-2,4,6,8,10,12-hexaenoic acid Chemical compound CCCCCCCCC\C=C\C=C\C=C\C=C\C=C\C=C\C(O)=O DVSZKTAMJJTWFG-SKCDLICFSA-N 0.000 description 1
- FPRKGXIOSIUDSE-SYACGTDESA-N (2z,4z,6z,8z)-docosa-2,4,6,8-tetraenoic acid Chemical compound CCCCCCCCCCCCC\C=C/C=C\C=C/C=C\C(O)=O FPRKGXIOSIUDSE-SYACGTDESA-N 0.000 description 1
- TWSWSIQAPQLDBP-CGRWFSSPSA-N (7e,10e,13e,16e)-docosa-7,10,13,16-tetraenoic acid Chemical compound CCCCC\C=C\C\C=C\C\C=C\C\C=C\CCCCCC(O)=O TWSWSIQAPQLDBP-CGRWFSSPSA-N 0.000 description 1
- OYHQOLUKZRVURQ-NTGFUMLPSA-N (9Z,12Z)-9,10,12,13-tetratritiooctadeca-9,12-dienoic acid Chemical compound C(CCCCCCC\C(=C(/C\C(=C(/CCCCC)\[3H])\[3H])\[3H])\[3H])(=O)O OYHQOLUKZRVURQ-NTGFUMLPSA-N 0.000 description 1
- 239000001195 (9Z,12Z,15Z)-octadeca-9,12,15-trienoic acid Substances 0.000 description 1
- LTYOQGRJFJAKNA-GAYNQZCISA-N 3-[2-[3-[[(2R)-4-[[[(2R,3S,4R,5R)-5-(6-aminopurin-9-yl)-4-hydroxy-3-phosphonooxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-hydroxyphosphoryl]oxy-2-hydroxy-3,3-dimethylbutanoyl]amino]propanoylamino]ethylsulfanyl]-3-oxo(314C)propanoic acid Chemical compound [14C](CC(=O)O)(=O)SCCNC(CCNC([C@@H](C(COP(OP(OC[C@@H]1[C@H]([C@H]([C@@H](O1)N1C=NC=2C(N)=NC=NC1=2)O)OP(=O)(O)O)(=O)O)(=O)O)(C)C)O)=O)=O LTYOQGRJFJAKNA-GAYNQZCISA-N 0.000 description 1
- GZJLLYHBALOKEX-UHFFFAOYSA-N 6-Ketone, O18-Me-Ussuriedine Natural products CC=CCC=CCC=CCC=CCC=CCC=CCCCC(O)=O GZJLLYHBALOKEX-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical group CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 1
- 102100036426 Acid phosphatase type 7 Human genes 0.000 description 1
- 102100022734 Acyl carrier protein, mitochondrial Human genes 0.000 description 1
- 229930091051 Arenine Natural products 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000228212 Aspergillus Species 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 101710117545 C protein Proteins 0.000 description 1
- YBCVMFKXIKNREZ-UOEUYCPOSA-N C(C)(=O)O.[14C](C)(=O)O Chemical compound C(C)(=O)O.[14C](C)(=O)O YBCVMFKXIKNREZ-UOEUYCPOSA-N 0.000 description 1
- 206010006895 Cachexia Diseases 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 102000003813 Cis-trans-isomerases Human genes 0.000 description 1
- 108090000175 Cis-trans-isomerases Proteins 0.000 description 1
- 241001633026 Coenocystis Species 0.000 description 1
- NBSCHQHZLSJFNQ-QTVWNMPRSA-N D-Mannose-6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@@H]1O NBSCHQHZLSJFNQ-QTVWNMPRSA-N 0.000 description 1
- JDMUPRLRUUMCTL-VIFPVBQESA-N D-pantetheine 4'-phosphate Chemical compound OP(=O)(O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCS JDMUPRLRUUMCTL-VIFPVBQESA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 108700016256 Dihydropteroate synthases Proteins 0.000 description 1
- 241001649081 Dina Species 0.000 description 1
- 241001462977 Elina Species 0.000 description 1
- 241000672609 Escherichia coli BL21 Species 0.000 description 1
- 102000036181 Fatty Acid Elongases Human genes 0.000 description 1
- 108010058732 Fatty Acid Elongases Proteins 0.000 description 1
- 101150094690 GAL1 gene Proteins 0.000 description 1
- 102100028501 Galanin peptides Human genes 0.000 description 1
- 208000018522 Gastrointestinal disease Diseases 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 241000086388 Gyrodactylus marinus Species 0.000 description 1
- 241000208818 Helianthus Species 0.000 description 1
- 244000020551 Helianthus annuus Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000928881 Homo sapiens Acid phosphatase type 7 Proteins 0.000 description 1
- 101000678845 Homo sapiens Acyl carrier protein, mitochondrial Proteins 0.000 description 1
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 1
- 101000611240 Homo sapiens Low molecular weight phosphotyrosine protein phosphatase Proteins 0.000 description 1
- 101000620894 Homo sapiens Lysophosphatidic acid phosphatase type 6 Proteins 0.000 description 1
- 101001001294 Homo sapiens Lysosomal acid phosphatase Proteins 0.000 description 1
- 101001001272 Homo sapiens Prostatic acid phosphatase Proteins 0.000 description 1
- 101000620880 Homo sapiens Tartrate-resistant acid phosphatase type 5 Proteins 0.000 description 1
- 101001122914 Homo sapiens Testicular acid phosphatase Proteins 0.000 description 1
- 101150017040 I gene Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- 108010025815 Kanamycin Kinase Proteins 0.000 description 1
- 101100288095 Klebsiella pneumoniae neo gene Proteins 0.000 description 1
- 241000235649 Kluyveromyces Species 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 1
- 241001491666 Labyrinthulomycetes Species 0.000 description 1
- 241000186660 Lactobacillus Species 0.000 description 1
- 208000019693 Lung disease Diseases 0.000 description 1
- 102100022916 Lysophosphatidic acid phosphatase type 6 Human genes 0.000 description 1
- 102100035699 Lysosomal acid phosphatase Human genes 0.000 description 1
- OFOBLEOULBTSOW-UHFFFAOYSA-L Malonate Chemical compound [O-]C(=O)CC([O-])=O OFOBLEOULBTSOW-UHFFFAOYSA-L 0.000 description 1
- 241001492414 Marina Species 0.000 description 1
- 101001111653 Mus musculus Retinol dehydrogenase 11 Proteins 0.000 description 1
- VZUNGTLZRAYYDE-UHFFFAOYSA-N N-methyl-N'-nitro-N-nitrosoguanidine Chemical compound O=NN(C)C(=N)N[N+]([O-])=O VZUNGTLZRAYYDE-UHFFFAOYSA-N 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- 241000221960 Neurospora Species 0.000 description 1
- 241001478892 Nostoc sp. PCC 7120 Species 0.000 description 1
- 101710179994 ORF-B protein Proteins 0.000 description 1
- 101710087110 ORF6 protein Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 229910019142 PO4 Inorganic materials 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 241000228143 Penicillium Species 0.000 description 1
- 208000020547 Peroxisomal disease Diseases 0.000 description 1
- 241001260361 Photobacterium profundum Species 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 208000005107 Premature Birth Diseases 0.000 description 1
- 102100035703 Prostatic acid phosphatase Human genes 0.000 description 1
- 102100040307 Protein FAM3B Human genes 0.000 description 1
- 101000824284 Rattus norvegicus Acyl-[acyl-carrier-protein] hydrolase Proteins 0.000 description 1
- 235000003534 Saccharomyces carlsbergensis Nutrition 0.000 description 1
- 241001123227 Saccharomyces pastorianus Species 0.000 description 1
- 102100029437 Serine/threonine-protein kinase A-Raf Human genes 0.000 description 1
- 241000333170 Shewanella japonica Species 0.000 description 1
- 241000947863 Shewanella olleyana Species 0.000 description 1
- 241001466451 Stramenopiles Species 0.000 description 1
- 101000936720 Streptococcus gordonii Accessory secretory protein Asp5 Proteins 0.000 description 1
- 101000912941 Streptococcus gordonii DegV domain-containing protein Proteins 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 101000691656 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA1, modules 1 and 2 Proteins 0.000 description 1
- 101000691655 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA2, modules 3 and 4 Proteins 0.000 description 1
- 101000691658 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA3, module 5 Proteins 0.000 description 1
- 101001125873 Streptomyces venezuelae Narbonolide/10-deoxymethynolide synthase PikA4, module 6 Proteins 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 102100022919 Tartrate-resistant acid phosphatase type 5 Human genes 0.000 description 1
- 102100028526 Testicular acid phosphatase Human genes 0.000 description 1
- 102000005488 Thioesterase Human genes 0.000 description 1
- 241000144181 Thraustochytrium aureum Species 0.000 description 1
- 108010018022 Type II Fatty Acid Synthase Proteins 0.000 description 1
- 101710134973 Uncharacterized 9.7 kDa protein in cox-rep intergenic region Proteins 0.000 description 1
- 101710095001 Uncharacterized protein in nifU 5'region Proteins 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- 241000607598 Vibrio Species 0.000 description 1
- 101000916899 Walleye dermal sarcoma virus Retroviral cyclin Proteins 0.000 description 1
- 150000001242 acetic acid derivatives Chemical class 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000013543 active substance Substances 0.000 description 1
- 208000038016 acute inflammation Diseases 0.000 description 1
- 230000006022 acute inflammation Effects 0.000 description 1
- 102000045404 acyltransferase activity proteins Human genes 0.000 description 1
- 108700014220 acyltransferase activity proteins Proteins 0.000 description 1
- TWSWSIQAPQLDBP-UHFFFAOYSA-N adrenic acid Natural products CCCCCC=CCC=CCC=CCC=CCCCCCC(O)=O TWSWSIQAPQLDBP-UHFFFAOYSA-N 0.000 description 1
- 206010064930 age-related macular degeneration Diseases 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 230000003466 anti-cipated effect Effects 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-L aspartate group Chemical group N[C@@H](CC(=O)[O-])C(=O)[O-] CKLJMWTZIZZHCS-REOHCLBHSA-L 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 235000012487 bakery ware Nutrition 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 235000013361 beverage Nutrition 0.000 description 1
- 230000008827 biological function Effects 0.000 description 1
- 230000031018 biological processes and functions Effects 0.000 description 1
- 239000008280 blood Substances 0.000 description 1
- 210000004369 blood Anatomy 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 235000008429 bread Nutrition 0.000 description 1
- 235000015496 breakfast cereal Nutrition 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 201000011510 cancer Diseases 0.000 description 1
- 235000014171 carbonated beverage Nutrition 0.000 description 1
- 125000001721 carboxyacetyl group Chemical group 0.000 description 1
- 230000000747 cardiac effect Effects 0.000 description 1
- 235000021466 carotenoid Nutrition 0.000 description 1
- 150000001747 carotenoids Chemical class 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229940112822 chewing gum Drugs 0.000 description 1
- 235000015218 chewing gum Nutrition 0.000 description 1
- 235000019219 chocolate Nutrition 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 208000037976 chronic inflammation Diseases 0.000 description 1
- 230000006020 chronic inflammation Effects 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 239000013599 cloning vector Substances 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 235000013409 condiments Nutrition 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 238000002425 crystallisation Methods 0.000 description 1
- 230000008025 crystallization Effects 0.000 description 1
- 238000012136 culture method Methods 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 208000010643 digestive system disease Diseases 0.000 description 1
- 238000007598 dipping method Methods 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- KAUVQQXNCKESLC-UHFFFAOYSA-N docosahexaenoic acid (DHA) Natural products COC(=O)C(C)NOCC1=CC=CC=C1 KAUVQQXNCKESLC-UHFFFAOYSA-N 0.000 description 1
- IQLUYYHUNSSHIY-HZUMYPAESA-N eicosatetraenoic acid Chemical compound CCCCCCCCCCC\C=C\C=C\C=C\C=C\C(O)=O IQLUYYHUNSSHIY-HZUMYPAESA-N 0.000 description 1
- 238000000909 electrodialysis Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 230000009088 enzymatic function Effects 0.000 description 1
- UKFXDFUAPNAMPJ-UHFFFAOYSA-N ethylmalonic acid Chemical compound CCC(C(O)=O)C(O)=O UKFXDFUAPNAMPJ-UHFFFAOYSA-N 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 150000002190 fatty acyls Chemical group 0.000 description 1
- 235000021323 fish oil Nutrition 0.000 description 1
- 229940013317 fish oils Drugs 0.000 description 1
- 235000013332 fish product Nutrition 0.000 description 1
- 108010060641 flavanone synthetase Proteins 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 239000005417 food ingredient Substances 0.000 description 1
- 235000013350 formula milk Nutrition 0.000 description 1
- 235000019867 fractionated palm kernal oil Nutrition 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 235000013376 functional food Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 229930182830 galactose Natural products 0.000 description 1
- 238000001030 gas--liquid chromatography Methods 0.000 description 1
- 208000018685 gastrointestinal system disease Diseases 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 230000008571 general function Effects 0.000 description 1
- 230000004077 genetic alteration Effects 0.000 description 1
- 231100000118 genetic alteration Toxicity 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-M glufosinate-P zwitterion(1-) Chemical compound CP([O-])(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-M 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 229930004094 glycosylphosphatidylinositol Natural products 0.000 description 1
- 235000013882 gravy Nutrition 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 125000000623 heterocyclic group Chemical group 0.000 description 1
- 238000013090 high-throughput technology Methods 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 235000020256 human milk Nutrition 0.000 description 1
- 210000004251 human milk Anatomy 0.000 description 1
- 229930195733 hydrocarbon Natural products 0.000 description 1
- 150000002430 hydrocarbons Chemical class 0.000 description 1
- 230000001976 improved effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 229910052500 inorganic mineral Inorganic materials 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 235000008960 ketchup Nutrition 0.000 description 1
- 238000001638 lipofection Methods 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 208000002780 macular degeneration Diseases 0.000 description 1
- 108010083942 mannopine synthase Proteins 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 235000010746 mayonnaise Nutrition 0.000 description 1
- 239000008268 mayonnaise Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 150000002739 metals Chemical class 0.000 description 1
- 150000004702 methyl esters Chemical class 0.000 description 1
- ZIYVHBGGAOATLY-UHFFFAOYSA-N methylmalonic acid Chemical group OC(=O)C(C)C(O)=O ZIYVHBGGAOATLY-UHFFFAOYSA-N 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 239000011707 mineral Substances 0.000 description 1
- 238000003541 multi-stage reaction Methods 0.000 description 1
- 108091005763 multidomain proteins Proteins 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 210000001577 neostriatum Anatomy 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 235000010508 nut-based spreads Nutrition 0.000 description 1
- 235000021436 nutraceutical agent Nutrition 0.000 description 1
- 235000015097 nutrients Nutrition 0.000 description 1
- 230000035764 nutrition Effects 0.000 description 1
- 235000014571 nuts Nutrition 0.000 description 1
- 235000021313 oleic acid Nutrition 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 201000008482 osteoarthritis Diseases 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000013618 particulate matter Substances 0.000 description 1
- 235000015927 pasta Nutrition 0.000 description 1
- 101150113864 pat gene Proteins 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 239000010451 perlite Substances 0.000 description 1
- 235000019362 perlite Nutrition 0.000 description 1
- 239000008177 pharmaceutical agent Substances 0.000 description 1
- 239000010452 phosphate Substances 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 102000040430 polynucleotide Human genes 0.000 description 1
- 108091033319 polynucleotide Proteins 0.000 description 1
- 239000002157 polynucleotide Substances 0.000 description 1
- 108010055896 polyornithine Proteins 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 235000013606 potato chips Nutrition 0.000 description 1
- 235000013613 poultry product Nutrition 0.000 description 1
- 201000011461 pre-eclampsia Diseases 0.000 description 1
- 238000001556 precipitation Methods 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000013823 prenylation Effects 0.000 description 1
- 235000014059 processed cheese Nutrition 0.000 description 1
- 235000020991 processed meat Nutrition 0.000 description 1
- 230000004952 protein activity Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 235000011962 puddings Nutrition 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000008929 regeneration Effects 0.000 description 1
- 238000011069 regeneration method Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 208000037803 restenosis Diseases 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000001223 reverse osmosis Methods 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 239000011435 rock Substances 0.000 description 1
- 230000002786 root growth Effects 0.000 description 1
- 239000004576 sand Substances 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
- 235000015067 sauces Nutrition 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000010187 selection method Methods 0.000 description 1
- 239000006152 selective media Substances 0.000 description 1
- 101150056746 sfp gene Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 1
- 244000000000 soil microbiome Species 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 150000003408 sphingolipids Chemical class 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 150000003432 sterols Chemical class 0.000 description 1
- 229940031000 streptococcus pneumoniae Drugs 0.000 description 1
- 239000006228 supernatant Substances 0.000 description 1
- 239000012134 supernatant fraction Substances 0.000 description 1
- 235000013616 tea Nutrition 0.000 description 1
- 108020002982 thioesterase Proteins 0.000 description 1
- 230000006032 tissue transformation Effects 0.000 description 1
- 231100000331 toxic Toxicity 0.000 description 1
- 230000002588 toxic effect Effects 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000006276 transfer reaction Methods 0.000 description 1
- 125000003203 triacylglycerol group Chemical group 0.000 description 1
- 235000014388 unprocessed cheese Nutrition 0.000 description 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 235000013311 vegetables Nutrition 0.000 description 1
- 239000010455 vermiculite Substances 0.000 description 1
- 229910052902 vermiculite Inorganic materials 0.000 description 1
- 235000019354 vermiculite Nutrition 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 150000003735 xanthophylls Chemical class 0.000 description 1
- 235000008210 xanthophylls Nutrition 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 235000013618 yogurt Nutrition 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8247—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified lipid metabolism, e.g. seed oil composition
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/02—Nutrients, e.g. vitamins, minerals
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P3/00—Drugs for disorders of the metabolism
- A61P3/08—Drugs for disorders of the metabolism for glucose homeostasis
- A61P3/10—Drugs for disorders of the metabolism for glucose homeostasis for hyperglycaemia, e.g. antidiabetics
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/21—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Pseudomonadaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/28—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Vibrionaceae (F)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/405—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from algae
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
- C12P7/6427—Polyunsaturated fatty acids [PUFA], i.e. having two or more double bonds in their backbone
- C12P7/6432—Eicosapentaenoic acids [EPA]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6409—Fatty acids
- C12P7/6427—Polyunsaturated fatty acids [PUFA], i.e. having two or more double bonds in their backbone
- C12P7/6434—Docosahexenoic acids [DHA]
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P7/00—Preparation of oxygen-containing organic compounds
- C12P7/64—Fats; Fatty oils; Ester-type waxes; Higher fatty acids, i.e. having at least seven carbon atoms in an unbroken chain bound to a carboxyl group; Oxidised oils or fats
- C12P7/6436—Fatty acid esters
- C12P7/6445—Glycerides
- C12P7/6472—Glycerides containing polyunsaturated fatty acid [PUFA] residues, i.e. having two or more double bonds in their backbone
Definitions
- This invention relates to polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) systems from Schizochytrium . More particularly, this invention relates to nucleic acids encoding such PUFA PKS systems, to such PUFA PKS systems, to genetically modified organisms comprising such PUFA PKS systems, and to methods of making and using such PUFA PKS systems disclosed herein. This invention also relates to PUFA PKS systems from non-bacterial and bacterial organisms identified using the Schizochytrium PUFA PKS systems described herein.
- PUFA polyunsaturated fatty acid
- PKS polyketide synthase
- PKS Polyketide synthase
- FOS fatty acid synthase
- the PKS pathways for PUFA synthesis in the eukaryotic Thraustochytrid, Schizochytrium is described in detail in U.S. Pat. No. 6,566,583.
- the PKS pathways for PUFA synthesis in eukaryotes such as members of Thraustochytriales including the structural description of a PUFA PKS system in Schizochytrium and the identification of a PUFA PKS system in Thraustochytrium , including details regarding uses of these systems, are described in detail in U.S. Patent Application Publication No. 20020194641, published Dec. 19, 2002 (corresponding to U.S. patent application Ser. No. 10/124,800, filed Apr. 16, 2002).
- 20040235127 published Nov. 25, 2004 (corresponding to U.S. patent application Ser. No. 10/810,352, filed Mar. 24, 2004), discloses the structural description of a PUFA PKS system in Thraustochytrium , and further detail regarding the production of eicosapentaenoic acid (C20:5, ⁇ -3) (EPA) and other PUFAs using such systems.
- U.S. Patent Application Publication No. 20050100995, published May 12, 2005 discloses the structural and functional description of PUFA PKS systems in Shewanella olleyana and Shewanella japonica , and uses of such systems.
- Type I modular or iterative
- Type II polyketide synthase
- Type III polyketide synthase
- the Type II system is characterized by separable proteins, each of which carries out a distinct enzymatic reaction. The enzymes work in concert to produce the end product and each individual enzyme of the system typically participates several times in the production of the end product.
- Type I iterative PKS systems are similar to the Type II system in that the enzymes are used in an iterative fashion to produce the end product.
- the Type I iterative differs from Type II in that enzymatic activities, instead of being associated with separable proteins, occur as domains of larger proteins.
- This system is analogous to the Type I FAS systems found in animals and fungi.
- each enzyme domain is used only once in the production of the end product.
- the domains are found in very large proteins and the product of each reaction is passed on to another domain in the PKS protein.
- the PKS systems described above if a carbon-carbon double bond is incorporated into the end product, it is usually in the trans configuration.
- Type III systems have been more recently discovered and belong to the plant chalcone synthase family of condensing enzymes.
- Type III PKSs are distinct from type I and type II PKS systems and utilize free CoA substrates in iterative condensation reactions to usually produce a heterocyclic end product.
- PUFAs Polyunsaturated fatty acids
- the current supply of PUFAs from natural sources and from chemical synthesis is not sufficient for commercial needs.
- a major current source for PUFAs is from marine fish; however, fish stocks are declining, and this may not be a sustainable resource.
- contamination, from both heavy metals and toxic organic molecules, is a serious issue with oil derived from marine fish.
- Vegetable oils derived from oil seed crops are relatively inexpensive and do not have the contamination issues associated with fish oils.
- the PUFAs found in commercially developed plant oils are typically limited to linoleic acid (eighteen carbons with 2 double bonds, in the delta 9 and 12 positions—18:2 delta 9,12) and linolenic acid (18:3 delta 9,12,15).
- the conventional pathway i.e., the “standard” pathway or “classical” pathway
- medium chain-length saturated fatty acids products of a fatty acid synthase (FAS) system
- the substrates for the elongation reaction are fatty acyl-CoA (the fatty acid chain to be elongated) and malonyl-CoA (the source of the 2 carbons added during each elongation reaction).
- the product of the elongase reaction is a fatty acyl-CoA that has two additional carbons in the linear chain.
- the desaturases create cis double bonds in the preexisting fatty acid chain by extraction of 2 hydrogens in an oxygen-dependant reaction.
- the substrates for the desaturases are either acyl-CoA (in some animals) or the fatty acid that is esterified to the glycerol backbone of a phospholipid (e.g. phosphatidylcholine).
- lipids e.g., triacylglycerol (TAG) and phospholipid (PL)
- TAG triacylglycerol
- PL phospholipid
- One embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence selected from: SEQ ID NO:1, SEQ ID NO:3, and SEQ ID NO:5; (b) a nucleic acid sequence encoding an amino acid sequence selected from: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6; (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has ⁇ -keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine
- the nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has ⁇ -keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:4 or that is a fragment of SEQ ID NO:4, wherein said amino acid sequence has KS activity, chain length factor (CL
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence selected from SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6. In another aspect, the nucleic acid molecule comprises a nucleic acid sequence selected from: SEQ ID NO:1, SEQ ID NO:3, and SEQ ID NO:5.
- the nucleic acid molecule of (a) comprises a nucleic acid sequence encoding the amino acid sequence encoded by a plasmid selected from: pKJ1126 (ATCC Accession No. ______), pJK306 (ATCC Accession No. ______), and pJK320 (ATCC Accession No. ______).
- the nucleic acid molecule of (b) comprises a nucleic acid sequence encoding the amino acid sequence encoded by a plasmid selected from: pJK1129 (ATCC Accession No. ______), and pJK324 (ATCC Accession No. ______).
- the nucleic acid molecule of (c) comprises a nucleic acid sequence encoding the amino acid sequence encoded by a plasmid selected from: pJK1131 (ATCC Accession No. ______) and pBR002 (ATCC Accession No. ______).
- Another embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a first nucleic acid sequence encoding a first amino acid sequence that has ⁇ -keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, wherein the first nucleic acid sequence hybridizes under very high stringency conditions to the complement of a second nucleic acid sequence encoding a second amino acid sequence of SEQ ID NO:2, and wherein said first amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a first nucleic acid sequence encoding a first amino acid sequence that has KS activity, chain length factor (CLF) activity,
- nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence of SEQ ID NO:9; (b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:10; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:10 or that is a fragment of SEQ ID NO:10, wherein the amino acid sequence has malonyl-CoA:ACP acyltransferase (MAT) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 93 of SEQ ID NO:10 and a histidine at a position corresponding to amino acid 94 of SEQ ID NO:10.
- MAT malonyl-CoA:ACP acyltransferase
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:10 or that is a fragment of SEQ ID NO:10, wherein the amino acid sequence has malonyl-CoA:ACP acyltransferase (MAT) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 93 of SEQ ID NO:10 and a histidine at a position corresponding to amino acid 94 of SEQ ID NO:10.
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:10.
- Another embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence of SEQ ID NO:19; (b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:20; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:20 or that is a fragment of SEQ ID NO:20, wherein the amino acid sequence has ⁇ -keto acyl-ACP synthase (KS) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:20.
- KS ⁇ -keto acyl-ACP synthase
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:20 or that is a fragment of SEQ ID NO:20, wherein the amino acid sequence has ⁇ -keto acyl-ACP synthase (KS) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:20.
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:20.
- nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence of SEQ ID NO:29; (b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:30; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:30 or that is a fragment of SEQ ID NO:30, wherein the amino acid sequence has FabA-like ⁇ -hydroxy acyl-ACP dehydrase (DH) activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 426-440 of SEQ ID NO:30.
- DH FabA-like ⁇ -hydroxy acyl-ACP dehydrase
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:30 or that is a fragment of SEQ ID NO:30, wherein the amino acid sequence has FabA-like ⁇ -hydroxy acyl-ACP dehydrase (DH) activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 426-440 of SEQ ID NO:30.
- the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:30.
- Another embodiment of the invention relates to a recombinant nucleic acid molecule comprising any of the nucleic acid molecules described above, operatively linked to at least one transcription control sequence.
- Yet another embodiment of the invention relates to a recombinant cell transfected with any of the nucleic acid molecules described above.
- the recombinant cell is a microorganism.
- the recombinant cell is a plant cell.
- Another embodiment of the present invention relates to an isolated nucleic acid molecule consisting essentially of a nucleic acid sequence that is fully complementary to any of the nucleic acid molecules described above.
- Another embodiment of the present invention relates to a genetically modified microorganism that has been transformed with any of the nucleic acid molecules described above.
- Yet another embodiment of the present invention relates to a genetically modified plant that has been transformed with any of the nucleic acid molecules described above.
- Another embodiment of the present invention relates to a genetically modified microorganism that has been transformed with: (a) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has ⁇ -keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:4 or that is a fragment of SEQ
- the microorganism has been transformed with a nucleic acid molecule comprising a nucleic acid sequence encoding SEQ ID NO:2, a nucleic acid molecule comprising a nucleic acid sequence encoding SEQ ID NO:4, and a nucleic acid molecule comprising a nucleic acid sequence encoding SEQ ID NO:6.
- the microorganism endogenously expresses a PUFA PKS system.
- the microorganism has been further transformed with a recombinant nucleic acid molecule encoding a phosphopantetheine transferase.
- the microorganism can include, but is not limited to, a Thraustochytriales microorganism, a bacterium or a yeast.
- bioactive molecule is a polyunsaturated fatty acid (PUFA).
- PUFA polyunsaturated fatty acid
- Another embodiment of the present invention relates to a genetically modified plant or part of the plant, wherein said plant has been transformed with: (a) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has ⁇ -keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:4 or that is
- the plant has been further genetically modified to express a recombinant nucleic acid molecule encoding a phosphopantetheine transferase.
- the plant is a dicotyledonous plant, and in another aspect, the plant is a monocotyledonous plant.
- the plant is selected from: canola, soybean, rapeseed, linseed, corn, safflower, sunflower and tobacco.
- the plant is an oilseed plant and the part of the plant is a mature oilseed.
- the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one PUFA selected from DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), and wherein the total fatty acids produced as a result of transformation with said nucleic acid molecules, other than said at least one PUFA, comprise less than about 10% of the total fatty acids produced by said plant.
- DHA docosahexaenoic acid
- DPA docosapentaenoic acid
- the total fatty acids produced as a result of transformation with said nucleic acid molecules, other than said at least one PUFA comprise less than 5% by weight of the total fatty acids produced by said plant.
- the fatty acids consisting of gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds comprise less than 5% by weight of the total fatty acids produced by said plant.
- gamma-linolenic acid (GLA; 18:3, n-6) comprises less than 1% by weight of the total fatty acids produced by said plant.
- Yet another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises detectable amounts of DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), wherein the ratio of DPAn-6 to DHA is 1:1 or greater than 1:1.
- DHA docosahexaenoic acid
- DPA docosapentaenoic acid
- Another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises detectable amounts of DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), wherein the ratio of DPAn-6 to DHA is less than 1:1.
- DHA docosahexaenoic acid
- DPA docosapentaenoic acid
- the total fatty acid profile in the plant or part of the plant contains less than 5% by weight in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- GLA gamma-linolenic acid
- PUFAs having 18 carbons and four carbon-carbon double bonds PUFAs having 20 carbons and three carbon-carbon double bonds
- PUFAs having 22 carbons and two or three carbon-carbon double bonds PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- Yet another embodiment of the present invention relates to plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUPA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 5% in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- PUPA polyunsaturated fatty acid
- Another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 1% of each of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- PUFA polyunsaturated fatty acid
- Another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 2% of gamma-linolenic acid (GLA; 18:3, n-6) and dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6).
- PUFA polyunsaturated fatty acid
- Another embodiment of the present invention relates to seeds obtained from any of the plants or part of plants described above, a food product comprising such seeds, an oil obtained from such seeds, and a food product comprising such oil. Also included in the invention is an oil blend comprising such oil and another oil, such as, but not limited to, a microbial oil, a fish oil, and a vegetable oil.
- Yet another embodiment of the present invention relates to an oil comprising the following fatty acids: DHA (C22:6n-3), DPAn-6 (C22:5n-6), oleic acid (C18:1), linolenic acid (C18:3), linoleic acid (C18:2), C16:0, C18.0, C20:0, C20:1n-9, C20:2n-6, C22:1n-9; wherein the oil comprises less than 0.5% of any of the following fatty acids: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- DHA C22:6n-3
- DPAn-6 C22:5n-6
- oleic acid C18:1
- linolenic acid C18:3
- Another embodiment of the present invention relates to an oilseed plant that produces mature seeds in which the total seed fatty acid profile comprises at least 1.0% by weight of at least one polyunsaturated fatty acid selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 5% in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- GLA gamma-linolenic acid
- PUFAs having 18 carbons and four carbon-carbon double bonds PUFAs having 20 carbons and three carbon-carbon double bonds
- PUFAs having 22 carbons and two or three carbon-carbon double bonds gamma-linolenic acid
- Yet another embodiment of the present invention relates to an oilseed plant that produces mature seeds in which the total seed fatty acid profile comprises at least 1.0% by weight of at least one polyunsaturated fatty acid (PUFA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 1% of gamma-linolenic acid (GLA; 18:3, n-6).
- PUFA polyunsaturated fatty acid
- Another embodiment of the present invention relates to a method to produce a bioactive molecule, comprising growing under conditions effective to produce said bioactive molecule a genetically modified plant as described above.
- the bioactive molecule is a polyunsaturated fatty acid (PUFA).
- Yet another embodiment of the present invention relates to a method to produce a plant that has a polyunsaturated fatty acid (PUFA) profile that differs from the naturally occurring plant, comprising genetically modifying said plant to express a PUFA PKS system comprising at least one of any of the nucleic acid molecules as described above.
- PUFA polyunsaturated fatty acid
- Another embodiment of the present invention relates to a method to produce a recombinant microbe, comprising genetically modifying microbial cells to express at least one of any of the nucleic acid molecules as described above.
- FIG. 1 is a graphical representation of the domain structure of the Schizochytrium PUFA PKS system.
- FIG. 2 shows a comparison of PKS domains from Schizochytrium and Shewanella.
- FIG. 3 shows a GC FAME profile of control yeast and yeast expressing Orfs sA, sB, C and Het I.
- FIG. 4 shows a GC FAME profile of the PUFA region from FIG. 3 .
- FIG. 5 shows GC FAME profiles of wild-type Arabidopsis and Arabidopsis Line 269 (plastid targeted).
- FIG. 6 is a schematic diagram showing the construction of pSBS4107: Acyl-ACP transit peptide-HetI: Acyl-ACP transit peptide-ORFC.
- FIG. 7 is a schematic diagram showing the construction of pSBS5720: Acyl-ACP transit peptide-ORFB.
- FIG. 8 is a schematic diagram showing the construction of pSBS4757: Acyl-ACP transit peptide-ORFA.
- the present invention generally relates to polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) systems from Schizochytrium , to genetically modified organisms comprising Schizochytrium PUFA PKS systems, to methods of making and using such systems for the production of products of interest, including bioactive molecules, and to PUFA PKS systems identified using the structural information for the Schizochytrium PUFA PKS systems disclosed herein.
- the present invention relates to a method to produce PUFAs in an oil-seed plant that has been genetically modified to express a PUFA PKS system of the present invention.
- the oils produced by the plant contain at least one PUFA produced by the PUFA PKS system and are substantially free of the mixed shorter-chain and less unsaturated PUFAs that are fatty acid products produced by the modification of products of the FAS system.
- a PUFA PKS system (which may also be referred to as a PUFA synthase system or PUFA synthase) generally has the following identifying features: (1) it produces PUFAs, and particularly, long chain PUFAs, as a natural product of the system; and (2) it comprises several multifunctional proteins assembled into a complex that conducts both iterative processing of the fatty acid chain as well non-iterative processing, including trans-cis isomerization and enoyl reduction reactions in selected cycles.
- the ACP domains present in the PUFA synthase enzymes require activation by attachment of a cofactor (4-phosphopantetheine).
- PPTase phosphopantetheinyl transferases
- PUFAs polyunsaturated fatty acids
- products e.g., an organism that endogenously (naturally) contains such a PKS system makes PUFAs using this system.
- PUFAs are fatty acids with a carbon chain length of at least 16 carbons, and more preferably at least 18 carbons, and more preferably at least 20 carbons, and more preferably 22 or more carbons, with at least 3 or more double bonds, and preferably 4 or more, and more preferably 5 or more, and even more preferably 6 or more double bonds, wherein all double bonds are in the cis configuration.
- LCPUFAs long chain polyunsaturated fatty acids
- LCPUFAs of the omega-6 series include: gamma-linolenic acid (C18:3), di-homo-gammalinolenic acid (C20:3n-6), arachidonic acid (C20:4n-6), adrenic acid (also called docosatetraenoic acid or DTA) (C22:4n-6), and docosapentaenoic acid (C22:5n-6).
- the LCPUFAs of the omega-3 series include: alpha-linolenic acid (C18:3), eicosatrienoic acid (C20:3n-3), eicosatetraenoic acid (C20:4n-3), eicosapentaenoic acid (C20:5n-3), docosapentaenoic acid (C22:5n-3), and docosahexaenoic acid (C22:6n-3).
- the LCPUFAs also include fatty acids with greater than 22 carbons and 4 or more double bonds including but not limited to C28:8(n-3).
- a PUFA PKS system comprises several multifunctional proteins (and can include single function proteins, particularly for PUFA PKS systems from marine bacteria) that are assembled into a complex that conducts both iterative processing of the fatty acid chain as well non-iterative processing, including trans-cis isomerization and enoyl reduction reactions in selected cycles.
- These proteins can also be referred to herein as the core PUFA PKS enzyme complex or the core PUFA PKS system.
- the general functions of the domains and motifs contained within these proteins are individually known in the art and have been described in detail with regard to various PUFA PKS systems from marine bacteria and eukaryotic organisms (see, e.g., U.S. Pat. No.
- the domains may be found as a single protein (i.e., the domain and protein are synonymous) or as one of two or more (multiple) domains in a single protein, as mentioned above.
- the present inventors propose to use these features of the PUFA PKS system to produce a range of bioactive molecules that could not be produced by the previously described (Type I iterative or modular, Type II, or Type III) PKS systems.
- bioactive molecules include, but are not limited to, polyunsaturated fatty acids (PUFAs), antibiotics or other bioactive compounds, many of which will be discussed below.
- PUFAs polyunsaturated fatty acids
- antibiotics or other bioactive compounds
- a PUFA PKS system of the present invention comprises at least the following biologically active domains that are typically contained on three or more proteins: (a) at least one enoyl-ACP reductase (ER) domain; (b) multiple acyl carrier protein (ACP) domain(s) (e.g., at least from one to four, and preferably at least five ACP domains, and in some embodiments up to six, seven, eight, nine, or more than nine ACP domains); (c) at least two ⁇ -ketoacyl-ACP synthase (KS) domains; (d) at least one acyltransferase (AT) domain; (e) at least one ⁇ -ketoacyl-ACP reductase (KR) domain; (f) at least two FabA-like ⁇ -hydroxyacyl-ACP dehydrase (DH) domains; (g) at least one chain length factor (CLF) domain; (h) at least one malonyl-Co
- a Schizochytrium PUFA PKS system comprises at least the following biologically active domains: (a) two enoyl-ACP reductase (ER) domain; (b) nine acyl carrier protein (ACP) domains; (c) two ⁇ -ketoacyl-ACP synthase (KS) domains; (d) one acyltransferase (AT) domain; (e) one ⁇ -ketoacyl-ACP reductase (KR) domain; (f) two FabA-like ⁇ -hydroxyacyl-ACP dehydrase (DH) domains; (g) one chain length factor (CLF) domain; and (h) one malonyl-CoA:ACP acyltransferase (MAT) domain.
- a Schizochytrium PUFA PKS system also comprises at least one region or domain containing a dehydratase (DH) conserved active site motif that is not a part of a FabA-like DH domain.
- DH dehydratase
- a PUFA PKS system can additionally include one or more accessory proteins, which are defined herein as proteins that are not considered to be part of the core PUFA PKS system as described above (i.e., not part of the PUFA synthase enzyme complex itself), but which may be, or are, necessary for PUFA production or at least for efficient PUFA production using the core PUFA synthase enzyme complex of the present invention, particularly in certain host organisms (e.g., plants).
- a PUFA PKS system must work with an accessory protein that transfers a 4′-phosphopantetheinyl moiety from Coenzyme A to the acyl carrier protein (ACP) domain(s).
- a PUFA PKS system can be considered to include at least one 4′-phosphopantetheinyl transferase (PPTase) domain, or such a domain can be considered to be an accessory domain or protein to the PUFA PKS system.
- PPTase 4′-phosphopantetheinyl transferase
- some host organisms may endogenously express accessory proteins that are needed to work with the PUFA PKS to produce PUFAs (e.g., PPTases).
- some organisms may be transformed with nucleic acid molecules encoding one or more accessory proteins described herein to enable and/or to enhance production of PUFAs by the organism, even if the organism endogenously produces a homologous accessory protein (i.e., some heterologous accessory proteins may operate more effectively or efficiently with the transformed PUFA synthase proteins than the host cells' endogenous accessory protein).
- the present invention provides an example of bacteria, yeast and plants that have been genetically modified with the PUFA PKS system of the present invention that includes an accessory PPTase. Structural and functional characteristics of PPTases will be described in more detail below.
- a “standard” or “classical” pathway for the production of PUFAs refers to the fatty acid synthesis pathway where medium chain-length saturated fatty acids (products of a fatty acid synthase (FAS) system) are modified by a series of elongation and desaturation reactions.
- the substrates for the elongation reaction are fatty acyl-CoA (the fatty acid chain to be elongated) and malonyl-CoA (the source of the 2 carbons added during each elongation reaction).
- the product of the elongase reaction is a fatty acyl-CoA that has two additional carbons in the linear chain.
- the desaturases create cis double bonds in the preexisting fatty acid chain by extraction of 2 hydrogens in an oxygen-dependant reaction. Such pathways and the genes involved in such pathways are well-known in the literature.
- lipid includes phospholipids (PL); free fatty acids; esters of fatty acids; triacylglycerols (TAG); diacylglycerides; monoacylglycerides; phosphatides; waxes (esters of alcohols and fatty acids); sterols and sterol esters; carotenoids; xanthophylls (e.g., oxycarotenoids); hydrocarbons; and other lipids known to one of ordinary skill in the art.
- TAG triacylglycerols
- PUFA include not only the free fatty acid form, but other forms as well, such as the TAG form and the PL form.
- a PUFA PKS system described according to the present invention is a non-bacterial PUFA PKS system.
- the PUFA PKS system of the present invention is isolated from an organism that is not a bacteria, or is a homologue of or derived from a PUFA PKS system from an organism that is not a bacteria, such as a eukaryote or an archaebacterium. Eukaryotes are separated from prokaryotes based on the degree of differentiation of the cells, with eukaryotes being more differentiated than prokaryotes.
- prokaryotes do not possess a nuclear membrane, do not exhibit mitosis during cell division, have only one chromosome, their cytoplasm contains 70S ribosomes, they do not possess any mitochondria, endoplasmic reticulum, chloroplasts, lysosomes or golgi apparatus, their flagella (if present) consists of a single fibril.
- eukaryotes have a nuclear membrane, they do exhibit mitosis during cell division, they have many chromosomes, their cytoplasm contains 80S ribosomes, they do possess mitochondria, endoplasmic reticulum, chloroplasts (in algae), lysosomes and golgi apparatus, and their flagella (if present) consists of many fibrils.
- bacteria are prokaryotes, while algae, fungi, protist, protozoa and higher plants are eukaryotes.
- the PUFA PKS systems of the marine bacteria are not the basis of the present invention, although the present invention does contemplate the use of domains from these bacterial PUFA PKS systems in conjunction with domains from the non-bacterial (e.g., Schizochytrium ) PUFA PKS systems of the present invention.
- non-bacterial PUFA PKS systems e.g., Schizochytrium
- genetically modified organisms can be produced which incorporate non-bacterial PUFA PKS functional domains with bacterial PUFA PKS functional domains, as well as PKS functional domains or proteins from other PKS systems (Type I iterative or modular, Type II, or Type III) or FAS systems.
- Schizochytrium is a Thraustochytrid marine microorganism that accumulates large quantities of triacylglycerols rich in DHA and docosapentaenoic acid (DPA; 22:5%-6); e.g., 30% DHA+DPA by dry weight (Barclay et al., J. Appl. Phycol. 6, 123 (1994)).
- Nucleotides 390-4443 of the cDNA sequence containing the first partial open reading frame described in U.S. application Ser. No. 09/231,899 match nucleotides 4677-8730 (plus the stop codon) of the sequence denoted herein as OrfA (SEQ ID NO:1).
- a cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB3033-047-B5 comprises at least a portion of nucleotides 4677-8730 of SEQ ID NO:1 described herein, to the best of the present inventors' knowledge.
- cDNA clone LIB3033-047-B5 begins at nucleotide 6719 of SEQ ID NO:1 and extends to the end of the Orf (position 8730 of SEQ ID NO:1), plus about 71 nucleotides beyond the end of the Orf represented by SEQ ID NO:1.
- cDNA clone LIB3033-047-B5 (denoted cDNA clone LIB3033-047-B5 in the form of an E. coli plasmid vector containing “Orf6 homolog” partial gene sequence from Schizochytrium sp.) was deposited with the American Type Culture Collection (ATCC), University Boulevard, Manassas, Va. 20110-2209 USA on Jun.
- nucleotide sequence of cDNA clone LIB3033-047-B5, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention.
- This cDNA clone did not have a poly-A-tail, and therefore, was a partial cDNA with additional regions of the cDNA found downstream of the sequence.
- the nucleotide sequence of cDNA clone LIB3033-046-E6, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention.
- Nucleotides 1-4867 of the cDNA sequence containing the second partial open reading frame described in U.S. application Ser. No. 09/231,899 matches nucleotides 1311-6177 (plus the stop codon) of the sequence denoted herein as OrfB (SEQ ID NO:3), with the exception of the nucleotide at position 2933 of SEQ ID NO:71 of the '899 application, which corresponds to the nucleotide at position 4243 of SEQ ID NO:3 set forth herein.
- This single nucleotide change results in a single amino acid change in SEQ ID NO:4 disclosed herein, as compared to SEQ ID NO:72 of the '899 application. Specifically, the glutamine residue at position 978 of SEQ ID NO:72 in the '899 application is changed to a glutamate at position 1415 of SEQ ID NO:4. This amino acid occurs in the linker region between the AT domain and the ER domain of SEQ ID NO:4.
- cDNA clone LIB3033-046-D2 comprises nucleotides 1311-6177 of SEQ ID NO:3 described herein, plus about 382 additional nucleotides beyond the end of the Orf represented here as SEQ ID NO:3, to the best of the present inventors' knowledge.
- cDNA clone LIB3033-046-D2 (denoted cDNA clone LIB3033-046-D2 in the form of an E. coli plasmid vector containing “hglC/Orf7/Orf8/Orf9 homolog” gene from Schizochytrium ) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va.
- ATCC American Type Culture Collection
- Nucleotides 145-4653 of the cDNA sequence containing the complete open reading frame described in U.S. application Ser. No. 09/231,899 matches nucleotides 1-2624 and 2675-4506 of the sequence denoted herein as OrfC (SEQ ID NO:5). Sequencing of the genomic DNA encoding OrfC revealed that there is an additional nucleotide at each of positions 2769, 2806 and 2818 of SEQ ID NO:76 of the '899 application which resulted in a frame shift and a short change in the amino acid sequence of the corresponding protein.
- amino acid positions 924-939 of SEQ ID NO:73 of the '899 application represent an incorrect sequence.
- Positions 876-890 of SEQ ID NO:5 herein represent the correct amino acid sequence in this region. This sequence is located in the DH2 domain of OrfC (discussed below).
- a cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB81-042-B9 comprises a portion of the 5′ sequence of SEQ ID NO:5.
- the sequence of the insert in LIB81-042-B9 contains 145 nucleotides upstream of the start codon of SEQ ID NO:5 and extends 2361 nucleotides into the Orf.
- cDNA clone LIB81-042-B9 (denoted cDNA clone LIB81-042-B9 in the form of an E. coli plasmid vector containing “0118 homolog” partial gene sequence from Schizochytrium sp.) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of cDNA clone LIB81-042-B9, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention.
- N230D was one of more than 1,000 randomly-chosen survivors of chemically mutagenised (NTG; 1-methyl-3-nitro-1-nitrosoguanidine) Schizochytrium ATCC 20888 screened for variations in fatty acid content. This particular strain was valued for its improved DHA productivity. Further, the complete identification of the domains with homology to those in Shewanella (see FIG. 2 ) were identified.
- FIG. 1 is a graphical representation of the three open reading frames from the Schizochytrium PUFA PKS system, and includes the domain structure of this PUFA PKS system.
- the domain structure of each open reading frame is as follows:
- SEQ ID NO:1 The complete nucleotide sequence for OrfA is represented herein as SEQ ID NO:1.
- Nucleotides 4677-8730 of SEQ ID NO:1 correspond to nucleotides 390-4443 of the sequence denoted as SEQ ID NO:69 in U.S. application Ser. No. 09/231,899. Therefore, nucleotides 1-4676 of SEQ ID NO:1 represent additional sequence that was not disclosed in U.S. application Ser. No. 09/231,899.
- This novel region of SEQ ID NO:1 encodes the following domains in OrfA: (1) the ORFA-KS domain; (2) the ORFA-MAT domain; and (3) at least a portion of the ACP domain region (e.g., at least ACP domains 1-4).
- nucleotides 1-389 of SEQ ID NO:69 in U.S. application Ser. No. 09/231,899 do not exactly match with the 389 nucleotides that are upstream of position 4677 in SEQ ID NO:1 disclosed herein. Therefore, positions 1-389 of SEQ ID NO:69 in U.S. application Ser. No. 09/231,899 appear to be incorrectly placed next to nucleotides 390-4443 of that sequence. Most of these first 389 nucleotides (about positions 60-389) are a match with an upstream portion of OrfA (SEQ ID NO:1) of the present invention and therefore, it is believed that an error occurred in the effort to prepare the contig of the cDNA constructs in U.S. application Ser.
- OrfA is a 8730 nucleotide sequence (not including the stop codon) which encodes a 2910 amino acid sequence, represented herein as SEQ ID NO:2.
- Within OrfA are twelve domains: (a) one ⁇ -keto acyl-ACP synthase (KS) domain; (b) one malonyl-CoA:ACP acyltransferase (MAT) domain; (c) nine acyl carrier protein (ACP) domains; and (d) one ketoreductase (KR) domain.
- a nucleotide sequence for OrfA has been deposited with GenBank as Accession No. AF378327 (amino acid sequence Accession No. AAK728879).
- GenBank Accession No. AF378327 differs from the sequence represented herein as SEQ ID NO:1 by the point nucleotide changes: (1) at position 1999 (A to G, resulting in an amino acid change from an asparagine to an aspartic acid at position 667 of SEQ ID NO:2); (2) at position 2003 (C to A, resulting in an amino acid change from a proline to a histidine at position 668 of SEQ ID NO:2); and (3) at position 2238 (A to C, resulting in no amino acid change at position 746 of SEQ ID NO:2).
- GenBank Accession No. AAK728879 are located in the MAT domain (SEQ ID NO:10) of SEQ ID NO:2.
- Genomic clone pJK1126 (denoted pJK1126 OrfA genomic clone, in the form of an E. coli plasmid vector containing “OrfA” gene from Schizochytrium ATCC 20888) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pJK1126 OrfA genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- Genomic clone pJK306 OrfA genomic clone and pJK320 OrfA genomic clone, isolated from Schizochytrium sp. N230D, together (overlapping clones) comprise, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:1, and encode the amino acid sequence of SEQ ID NO:2.
- Genomic clone pJK306 (denoted pJK306 OrfA genomic clone, in the form of an E. coli plasmid containing 5′ portion of OrfA gene from Schizochytrium sp.
- N230D (2.2 kB overlap with pJK320) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pJK306 OrfA genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- Genomic clone pJK320 (denoted pJK320 OrfA genomic clone, in the form of an E. coli plasmid containing 3′ portion of OrfA gene from Schizochytrium sp.
- N230D (2.2 kB overlap with pJK306) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pJK320 OrfA genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- OrfA was compared with known sequences in a standard BLAST search (BLAST 2.0 Basic BLAST homology search using blastp for amino acid searches, blastn for nucleic acid searches, and blastX for nucleic acid searches and searches of the translated amino acid sequence in all 6 open reading frames with standard default parameters, wherein the query sequence is filtered for low complexity regions by default (described in Altschul, S.F., Madden, T. L., Shuäffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:3389-3402, incorporated herein by reference in its entirety)).
- OrfA has no significant homology to any known nucleotide sequence.
- sequences with the greatest degree of homology to ORFA were: Nostoc sp. 7120 heterocyst glycolipid synthase (Accession No. NC — 003272), which was 42% identical to ORFA over 1001 amino acid residues; and Moritella marinas ( Vibrio marinas ) ORF8 (Accession No. AB025342), which was 40% identical to ORFA over 993 amino acid residues.
- the first domain in OrfA is a KS domain, also referred to herein as ORFA-KS.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1 and 40 of SEQ ID NO:1 (OrfA) to an ending point of between about positions 1428 and 1500 of SEQ ID NO:1 (based on homology to other PUFA PKS domains, the position of the domain spans from about position 1 to about position 1500; based on Pfam analysis, a KS core region spans from about position 40 to about position 1428).
- the nucleotide sequence containing the sequence encoding the ORFA-KS domain is represented herein as SEQ ID NO:7 (positions 1-1500 of SEQ ID NO:1).
- the amino acid sequence containing the KS domain spans from a starting point of between about positions 1 and 14 of SEQ ID NO:2 (ORFA) to an ending point of between about positions 476 and 500 of SEQ ID NO:2 (again, referring to the overall homology to PUFA PKS KS domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFA-KS domain is represented herein as SEQ ID NO:8 (positions 1-500 of SEQ ID NO:2). It is noted that the ORFA-KS domain contains an active site motif: DXAC* (*acyl binding site C 215 ).
- a domain or protein having 3-keto acyl-ACP synthase (KS) biological activity is characterized as the enzyme that carries out the initial step of the FAS (and PKS) elongation reaction cycle.
- the acyl group destined for elongation is linked to a cysteine residue at the active site of the enzyme by a thioester bond.
- the acyl-enzyme undergoes condensation with malonyl-ACP to form -keto acyl-ACP, CO 2 and free enzyme.
- the KS plays a key role in the elongation cycle and in many systems has been shown to possess greater substrate specificity than other enzymes of the reaction cycle. For example, E.
- coli has three distinct KS enzymes—each with its own particular role in the physiology of the organism (Magnuson et al., Microbiol. Rev. 57, 522 (1993)).
- the two KS domains of the PUFA-PKS systems could have distinct roles in the PUFA biosynthetic reaction sequence.
- KS's As a class of enzymes, KS's have been well characterized. The sequences of many verified KS genes are know, the active site motifs have been identified and the crystal structures of several have been determined. Proteins (or domains of proteins) can be readily identified as belonging to the KS family of enzymes by homology to known KS sequences.
- the second domain in OrfA is a MAT domain, also referred to herein as ORFA-MAT.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1723 and 1798 of SEQ ID NO:1 (OrfA) to an ending point of between about positions 2805 and 3000 of SEQ ID NO:1 (based on homology to other PUFA PKS domains, the position of the MAT domain spans from about position 1723 to about position 3000; based on Pfam analysis, a MAT core region spans from about position 1798 to about position 2805).
- the nucleotide sequence containing the sequence encoding the ORFA-MAT domain is represented herein as SEQ ID NO:9 (positions 1723-3000 of SEQ ID NO:1).
- the amino acid sequence containing the MAT domain spans from a starting point of between about positions 575 and 600 of SEQ ID NO:2 (ORFA) to an ending point of between about positions 935 and 1000 of SEQ ID NO:2 (again, referring to the overall homology to PUFA PKS MAT domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFA-MAT domain is represented herein as SEQ ID NO:10 (positions 575-1000 of SEQ ID NO:2).
- the MAT domain comprises an aspartate at position 93 and a histidine at position 94 (corresponding to positions 667 and 668, respectively, of SEQ ID NO:2). It is noted that the ORFA-MAT domain contains an active site motif: GHS*XG (*acyl binding site S 706 ), represented herein as SEQ ID NO:11.
- a domain or protein having malonyl-CoA:ACP acyltransferase (MAT) biological activity is characterized as one that transfers the malonyl moiety from malonyl-CoA to ACP.
- these enzymes possess an extended motif R and Q amino acids in key positions) that identifies them as MAT enzymes (in contrast to the AT domain of Schizochytrium Orf B).
- MAT domains will preferentially load methyl- or ethyl-malonate on to the ACP group (from the corresponding CoA ester), thereby introducing branches into the linear carbon chain.
- MAT domains can be recognized by their homology to known MAT sequences and by their extended motif structure.
- Domains 3-11 of OrfA are nine tandem ACP domains, also referred to herein as ORFA-ACP (the first domain in the sequence is ORFA-ACP1, the second domain is ORFA-ACP2, the third domain is ORFA-ACP3, etc.).
- the first ACP domain, ORFA-ACP1 is contained within the nucleotide sequence spanning from about position 3343 to about position 3600 of SEQ ID NO:1 (OrfA).
- the nucleotide sequence containing the sequence encoding the ORFA-ACP1 domain is represented herein as SEQ ID NO:12 (positions 3343-3600 of SEQ ID NO:1).
- the amino acid sequence containing the first ACP domain spans from about position 1115 to about position 1200 of SEQ ID NO:2.
- the amino acid sequence containing the ORFA-ACP1 domain is represented herein as SEQ ID NO:13 (positions 1115-1200 of SEQ ID NO:2). It is noted that the ORFA-ACP1 domain contains an active site motif: LGIDS* (*pantetheine binding motif S 1157 ), represented herein by SEQ ID NO:14.
- nucleotide and amino acid sequences of all nine ACP domains are highly conserved and therefore, the sequence for each domain is not represented herein by an individual sequence identifier. However, based on the information disclosed herein, one of skill in the art can readily determine the sequence containing each of the other eight ACP domains (see discussion below).
- All nine ACP domains together span a region of OrfA of from about position 3283 to about position 6288 of SEQ ID NO:1, which corresponds to amino acid positions of from about 1095 to about 2096 of SEQ ID NO:2.
- the nucleotide sequence for the entire ACP region containing all nine domains is represented herein as SEQ ID NO:16.
- the region represented by SEQ ID NO:16 includes the linker segments between individual ACP domains.
- the repeat interval for the nine domains is approximately every 330 nucleotides of SEQ ID NO:16 (the actual number of amino acids measured between adjacent active site serines ranges from 104 to 116 amino acids).
- Each of the nine ACP domains contains a pantetheine binding motif LGIDS* (represented herein by SEQ ID NO:14), wherein S* is the pantetheine binding site serine (S).
- S* is the pantetheine binding site serine (S).
- S is located near the center of each ACP domain sequence.
- S is a region that is highly enriched for proline (P) and alanine (A), which is believed to be a linker region.
- P proline
- A alanine
- SEQ ID NO:15 is the sequence: APAPVKAAAPAAPVASAPAPA, represented herein as SEQ ID NO:15.
- an ACP domain is about 85 amino acids, excluding the linker, and about 110 amino acids including the linker, with the active site serine being approximately in the center of the domain, one of skill in the art can readily determine the positions of each of the nine ACP domains in OrfA.
- a domain or protein having acyl carrier protein (ACP) biological activity is characterized as being small polypeptides (typically, 80 to 100 amino acids long), that function as carriers for growing fatty acyl chains via a thioester linkage to a covalently bound co-factor of the protein. They occur as separate units or as domains within larger proteins.
- ACPs are converted from inactive apo-forms to functional holo-forms by transfer of the phosphopantetheinyl moiety of CoA to a highly conserved serine residue of the ACP.
- Acyl groups are attached to ACP by a thioester linkage at the free terminus of the phosphopantetheinyl moiety.
- ACPs can be identified by labeling with radioactive pantetheine and by sequence homology to known ACPs. The presence of variations of the above mentioned motif (LGIDS*) is also a signature of an ACP.
- Domain 12 in OrfA is a KR domain, also referred to herein as ORFA-KR.
- This domain is contained within the nucleotide sequence spanning from a starting point of about position 6598 of SEQ ID NO:1 to an ending point of about position 8730 of SEQ ID NO:1.
- the nucleotide sequence containing the sequence encoding the ORFA-KR domain is represented herein as SEQ ID NO:17 (positions 6598-8730 of SEQ ID NO:1).
- the amino acid sequence containing the KR domain spans from a starting point of about position 2200 of SEQ ID NO:2 (ORFA) to an ending point of about position 2910 of SEQ ID NO:2.
- the amino acid sequence containing the ORFA-KR domain is represented herein as SEQ ID NO:18 (positions 2200-2910 of SEQ ID NO:2).
- SEQ ID NO:18 positions 2200-2910 of SEQ ID NO:2.
- KR is a member of this family. This core region spans from about position 7198 to about position 7500 of SEQ ID NO:1, which corresponds to amino acid positions 2400-2500 of SEQ ID NO:2.
- a domain or protein having ketoreductase activity also referred to as 3-ketoacyl-ACP reductase (KR) biological activity (function) is characterized as one that catalyzes the pyridine-nucleotide-dependent reduction of 3-keto acyl forms of ACP. It is the first reductive step in the de novo fatty acid biosynthesis elongation cycle and a reaction often performed in polyketide biosynthesis. Significant sequence similarity is observed with one family of enoyl ACP reductases (ER), the other reductase of FAS (but not the ER family present in the PUFA PKS system), and the short-chain alcohol dehydrogenase family.
- ER enoyl ACP reductases
- Pfam analysis of the PUFA PKS region indicated above reveals the homology to the short-chain alcohol dehydrogenase family in the core region.
- Blast analysis of the same region reveals matches in the core area to known KR enzymes as well as an extended region of homology to domains from the other characterized PUFA PKS systems.
- SEQ ID NO:3 The complete nucleotide sequence for OrfB is represented herein as SEQ ID NO:3. Nucleotides 1311-6177 of SEQ ID NO:3 correspond to nucleotides 1-4867 of the sequence denoted as SEQ ID NO:71 in U.S. application Ser. No. 09/231,899, with the exception of the nucleotide at position 2933 of SEQ ID NO:71 of the '899 application or nucleotide 4243 of SEQ ID NO:3 herein, as discussed above.
- the cDNA sequence in U.S. application Ser. No. 09/231,899 contains about 345 additional nucleotides beyond the stop codon, including a polyA tail).
- nucleotides 1-1310 of SEQ ID NO:1 represent additional sequence that was not disclosed in U.S. application Ser. No. 09/231,899.
- This novel region of SEQ ID NO:3 contains most of the KS domain encoded by OrfB.
- OrfB is a 6177 nucleotide sequence (not including the stop codon) which encodes a 2059 amino acid sequence, represented herein as SEQ ID NO:4.
- SEQ ID NO:4 Within OrfB are four domains: (a) one ⁇ -keto acyl-ACP synthase (KS) domain; (b) one chain length factor (CLF) domain; (c) one acyl transferase (AT) domain; and, (d) one enoyl ACP-reductase (ER) domain.
- a nucleotide sequence for OrfB has been deposited with GenBank as Accession No. AF378328 (amino acid sequence Accession No. AAK728880).
- GenBank Accession No. AF378328 differs from the nucleotide sequence represented herein as SEQ ID NO:3 by the point nucleotide changes: (1) at position 852 (T to C, resulting in no amino acid change at position 284 of SEQ ID NO:4); (2) at position 1110 (S to C, resulting in no amino acid change at position 370 of SEQ ID NO:4); (3) at position 1112 (Y to T, resulting in the resolution of an ambiguous amino acid call to a definite valine call at position 371 of SEQ ID NO:4); and (4) at position 4243 (C to G, resulting in a change from a glutamine to a glutamate at position 1415 of SEQ ID NO:4).
- Genomic clone pJK1129 (denoted pJK1129 OrfB genomic clone, in the form of an E.
- coli plasmid vector containing “OrfB” gene from Schizochytrium ATCC 20888) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pJK1126 OrfB genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- a genomic clone described herein as pJK324 OrfB genomic clone, isolated from Schizochytrium sp. N230D, comprises, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:3, and encodes the amino acid sequence of SEQ ID NO:4.
- Genomic clone pJK324 (denoted pJK324 OrfB genomic clone, in the form of an E. coli plasmid containing the OrfB gene sequence from Schizochytrium sp. N230D) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pJK324 OrfB genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- OrfB was compared with known sequences in a standard BLAST search as described above. At the nucleic acid level, OrfB has no significant homology to any known nucleotide sequence. At the amino acid level, the sequences with the greatest degree of homology to ORFB were: Shewanella sp. hypothetical protein (Accession No. U73935), which was 53% identical to ORFB over 458 amino acid residues; Moritella marinus ( Vibrio marinus ) ORF11 (Accession No. AB025342), which was 53% identical to ORFB over 460 amino acid residues; Photobacterium profundum omega-3 polyunsaturated fatty acid synthase PfaD (Accession No.
- the first domain in OrfB is a KS domain, also referred to herein as ORFB-KS.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1 and 43 of SEQ ID NO:3 (OrfB) to an ending point of between about positions 1332 and 1350 of SEQ ID NO:3 (based on homology to other PUFA PKS domains, the position of the KS domain spans from about position 1 to about position 1350; based on Pfam analysis, a KS core region spans from about position 43 to about position 1332).
- the nucleotide sequence containing the sequence encoding the ORFB-KS domain is represented herein as SEQ ID NO:19 (positions 1-1350 of SEQ ID NO:3).
- the amino acid sequence containing the KS domain spans from a starting point of between about positions 1 and 15 of SEQ ID NO:4 (ORFB) to an ending point of between about positions 444 and 450 of SEQ ID NO:4 (again, referring to the overall homology to PUFA PKS KS domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFB-KS domain is represented herein as SEQ ID NO:20 (positions 1-450 of SEQ ID NO:4).
- This KS domain comprises a valine at position 371 of SEQ ID NO:20 (also position 371 of SEQ ID NO:20).
- the ORFB-KS domain contains an active site motif: DXAC* (*acyl binding site C 196 ).
- DXAC* active site motif
- the second domain in OrfB is a CLF domain, also referred to herein as ORFB-CLF.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1378 and 1402 of SEQ ID NO:3 (OrfB) to an ending point of between about positions 2682 and 2700 of SEQ ID NO:3 (based on homology to other PUFA PKS domains, the position of the CLF domain spans from about position 1378 to about position 2700; based on Pfam analysis, a CLF core region spans from about position 1402 to about position 2682).
- the nucleotide sequence containing the sequence encoding the ORFB-CLF domain is represented herein as SEQ ID NO:21 (positions 1378-2700 of SEQ ID NO:3).
- the amino acid sequence containing the CLF domain spans from a starting point of between about positions 460 and 468 of SEQ ID NO:4 (ORFB) to an ending point of between about positions 894 and 900 of SEQ ID NO:4 (again, referring to the overall homology to PUFA PKS CLF domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFB-CLF domain is represented herein as SEQ ID NO:22 (positions 460-900 of SEQ ID NO:4). It is noted that the ORFB-CLF domain contains a KS active site motif without the acyl-binding cysteine.
- a domain or protein is referred to as a chain length factor (CLF) based on the following rationale.
- CLF was originally described as characteristic of Type II (dissociated enzymes) PKS systems and was hypothesized to play a role in determining the number of elongation cycles, and hence the chain length, of the end product.
- CLF amino acid sequences show homology to KS domains (and are thought to form heterodimers with a KS protein), but they lack the active site cysteine. CLF's role in PKS systems is currently controversial. New evidence (C.
- the third domain in OrfB is an AT domain, also referred to herein as ORFB-AT.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 2701 and 3598 of SEQ ID NO:3 (OrfB) to an ending point of between about positions 3975 and 4200 of SEQ ID NO:3 (based on homology to other PUFA PKS domains, the position of the AT domain spans from about position 2701 to about position 4200; based on Pfam analysis, an AT core region spans from about position 3598 to about position 3975).
- the nucleotide sequence containing the sequence encoding the ORFB-AT domain is represented herein as SEQ ID NO:23 (positions 2701-4200 of SEQ ID NO:3).
- the amino acid sequence containing the AT domain spans from a starting point of between about positions 901 and 1200 of SEQ ID NO:4 (ORFB) to an ending point of between about positions 1325 and 1400 of SEQ ID NO:4 (again, referring to the overall homology to PUFA PKS AT domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFB-AT domain is represented herein as SEQ ID NO:24 (positions 901-1400 of SEQ ID NO:4). It is noted that the ORFB-AT domain contains an active site motif of GxS*xG (*acyl binding site S 1140 ) that is characteristic of acyltransferse (AT) proteins.
- acyltransferase or “AT” refers to a general class of enzymes that can carry out a number of distinct acyl transfer reactions.
- the Schizochytrium domain shows good homology to a domain present in all of the other PUFA PKS systems currently examined and very weak homology to some acyltransferases whose specific functions have been identified (e.g. to malonyl-CoA:ACP acyltransferase, MAT).
- this AT domain is not believed to function as a MAT because it does not possess an extended motif structure characteristic of such enzymes (see MAT domain description, above).
- the functions of the AT domain in a PUFA PKS system include, but are not limited to: transfer of the fatty acyl group from the ORFA ACP domain(s) to water (i.e. a thioesterase—releasing the fatty acyl group as a free fatty acid), transfer of a fatty acyl group to an acceptor such as CoA, transfer of the acyl group among the various ACP domains, or transfer of the fatty acyl group to a lipophilic acceptor molecule (e.g. to lysophosphadic acid).
- the fourth domain in OrfB is an ER domain, also referred to herein as ORFB-ER.
- This domain is contained within the nucleotide sequence spanning from a starting point of about position 4648 of SEQ ID NO:3 (OrfB) to an ending point of about position 6177 of SEQ ID NO:3.
- the nucleotide sequence containing the sequence encoding the ORFB-ER domain is represented herein as SEQ ID NO:25 (positions 4648-6177 of SEQ ID NO:3).
- the amino acid sequence containing the ER domain spans from a starting point of about position 1550 of SEQ ID NO:4 (ORFB) to an ending point of about position 2059 of SEQ ID NO:4.
- the amino acid sequence containing the ORFB-ER domain is represented herein as SEQ ID NO:26 (positions 1550-2059 of SEQ ID NO:4).
- this domain has enoyl reductase (ER) biological activity.
- the ER enzyme reduces the trans-double bond (introduced by the DH activity) in the fatty acyl-ACP, resulting in fully saturating those carbons.
- the ER domain in the PUFA-PKS shows homology to a newly characterized family of ER enzymes (Heath et al., Nature 406, 145 (2000)). Heath and Rock identified this new class of ER enzymes by cloning a gene of interest from Streptococcus pneumoniae , purifying a protein expressed from that gene, and showing that it had ER activity in an in vitro assay.
- the sequence of the Schizochytrium ER domain of OrfB shows homology to the S. pneumoniae ER protein. All of the PUFA PKS systems currently examined contain at least one domain with very high sequence homology to the Schizochytrium ER domain.
- the Schizochytrium PUFA PKS system contains two ER domains (one on OrfB and one on OrfC).
- the complete nucleotide sequence for OrfC is represented herein as SEQ ID NO:5.
- Nucleotides 1-4506 of SEQ ID NO:5 i.e., the entire open reading frame sequence, not including the stop codon
- SEQ ID NO:76 i.e., the entire open reading frame sequence, not including the stop codon
- the cDNA sequence in U.S. application Ser. No. 09/231,899 contains about 144 nucleotides upstream of the start codon for OrfC and about 110 nucleotides beyond the stop codon, including a polyA tail.
- nucleotides 145-4653 of the cDNA sequence containing the complete open reading frame described in U.S. application Ser. No. 09/231,899 match nucleotides 1-2624 and 2675-4506 of the sequence denoted herein as OrfC (SEQ ID NO:5).
- OrfC is a 4506 nucleotide sequence (not including the stop codon) which encodes a 1502 amino acid sequence, represented herein as SEQ ID NO:6.
- Within OrfC are three domains: (a) two FabA-like ⁇ -hydroxy acyl-ACP dehydrase (DH) domains; and (b) one enoyl ACP-reductase (ER) domain.
- a nucleotide sequence for OrfC has been deposited with GenBank as Accession No. AF378329 (amino acid sequence Accession No. AAK728881).
- the nucleotide sequence represented by AF378329 differs from the nucleotide sequence represented herein as SEQ ID NO:5 by the point nucleotide insertions: (1) at position 2625 (an insertion of an A); (2) at position 2662 (an insertion of a C); and (3) at position 2674 (an insertion of an A). This resulted in a frame shift out of frame at position 2625 and then back into frame at position 2675.
- AAK728881 differs from the amino acid sequence encoded by SEQ ID NO:5 (i.e., SEQ ID NO:6) in the region spanning from positions 876-891 of GenBank Accession No. AAK728881 or positions 876-890 of SEQ ID NO:6. This change in sequence occurs in the DH2 domain of OrfC (discussed below).
- Genomic DNA clones (plasmids) encoding OrfC from both Schizochytrium sp. ATCC 20888 and a daughter strain of ATCC 20888, denoted Schizochytrium sp., strain N230D, have been isolated and sequenced.
- Genomic clone pJK1131 (denoted pJK1131 OrfC genomic clone, in the form of an E.
- coli plasmid vector containing “OrfC” gene from Schizochytrium ATCC 20888) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pJK1131 OrfC genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- a genomic clone described herein as pBR002 OrfC genomic clone, isolated from Schizochytrium sp. N230D, comprises, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:5, and encodes the amino acid sequence of SEQ ID NO:6.
- Genomic clone pBR002 (denoted pBR002 OrfC genomic clone, in the form of an E. coli plasmid vector containing the OrfC gene sequence from Schizochytrium sp. N230D) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______.
- ATCC American Type Culture Collection
- the nucleotide sequence of pBR002 OrfC genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- OrfC was compared with known sequences in a standard BLAST search as described above. At the nucleic acid level, OrfC has no significant homology to any known nucleotide sequence. At the amino acid level (Blastp), the sequences with the greatest degree of homology to ORFC were: Moritella marinus ( Vibrio marinus ) ORF11 (Accession No. ABO25342), which is 45% identical to ORFC over 514 amino acid residues, Shewanella sp. hypothetical protein 8 (Accession No. U73935), which is 49% identical to ORFC over 447 amino acid residues, Nostoc sp. hypothetical protein (Accession No. NC — 003272), which is 49% identical to ORFC over 430 amino acid residues, and Shewanella sp. hypothetical protein 7 (Accession No. U73935), which is 37% identical to ORFC over 930 amino acid residues.
- the first domain in OrfC is a DH domain, also referred to herein as ORFC-DH1.
- This is one of two DH domains in OrfC, and therefore is designated DH1.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1 and 778 of SEQ ID NO:5 (OrfC) to an ending point of between about positions 1233 and 1350 of SEQ ID NO:5 (based on homology to other PUFA PKS domains, the position of the DH1 domain spans from about position 1 to about position 1350; based on Pfam analysis, a DH core region spans from about position 778 to about position 1233).
- the nucleotide sequence containing the sequence encoding the ORFC-DH1 domain is represented herein as SEQ ID NO:27 (positions 1-1350 of SEQ ID NO:5).
- the amino acid sequence containing the DH1 domain spans from a starting point of between about positions 1 and 260 of SEQ ID NO:6 (ORFC) to an ending point of between about positions 411 and 450 of SEQ ID NO:6 (again, referring to the overall homology to PUFA PKS DH domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFC-DH1 domain is represented herein as SEQ ID NO:28 (positions 1-450 of SEQ ID NO:6).
- DH domains both the DH domains (see below for DH 2) in the PUFA PKS systems have been described in the preceding sections.
- This class of enzyme removes HOH from a ⁇ -keto acyl-ACP and leaves a trans double bond in the carbon chain.
- the DH domains of the PUFA PKS systems show homology to bacterial DH enzymes associated with their FAS systems (rather than to the DH domains of other PKS systems).
- the second domain in OrfC is a DH domain, also referred to herein as ORFC-DH2.
- This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1351 and 2437 of SEQ ID NO:5 (OrfC) to an ending point of between about positions 2607 and 2847 of SEQ ID NO:5 (based on homology to other PUFA PKS domains, the position of the DH2 domain spans from about position 1351 to about position 2845; based on Pfam analysis, a DH core region spans from about position 2437 to about position 2847).
- the nucleotide sequence containing the sequence encoding the ORFC-DH2 domain is represented herein as SEQ ID NO:29 (positions 1351-2847 of SEQ ID NO:5).
- the amino acid sequence containing the DH2 domain spans from a starting point of between about positions 451 and 813 of SEQ ID NO:6 (ORFC) to an ending point of between about positions 869 and 949 of SEQ ID NO:6 (again, referring to the overall homology to PUFA PKS DH domains and to Pfam core regions, respectively).
- the amino acid sequence containing the ORFC-DH2 domain is represented herein as SEQ ID NO:30 (positions 451-949 of SEQ ID NO:6).
- This DH domain comprises the amino acids H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions 426-440 of SEQ ID NO:30. DH biological activity has been described above.
- the third domain in OrfC is an ER domain, also referred to herein as ORFC-ER.
- This domain is contained within the nucleotide sequence spanning from a starting point of about position 2995 of SEQ ID NO:5 (OrfC) to an ending point of about position 4506 of SEQ ID NO:5.
- the nucleotide sequence containing the sequence encoding the ORFC-ER domain is represented herein as SEQ ID NO:31 (positions 2995-4506 of SEQ ID NO:5).
- the amino acid sequence containing the ER domain spans from a starting point of about position 999 of SEQ ID NO:6 (ORFC) to an ending point of about position 1502 of SEQ ID NO:6.
- the amino acid sequence containing the ORFC-ER domain is represented herein as SEQ ID NO:32 (positions 999-1502 of SEQ ID NO:6).
- ER biological activity has been described above.
- a domain or protein having 4′-phosphopantetheinyl transferase (PPTase) biological activity is characterized as the enzyme that transfers a 4′-phosphopantetheinyl moiety from Coenzyme A to the acyl carrier protein (ACP).
- ACP acyl carrier protein
- This transfer to an invariant serine reside of the ACP activates the inactive apo-form to the holo-form.
- the phosphopantetheine group forms thioesters with the growing acyl chains.
- the PPTases are a family of enzymes that have been well characterized in fatty acid synthesis, polyketide synthesis, and non-ribosomal peptide synthesis.
- crystal structures have been determined (e.g., Reuter K, Mofid M R, Marahiel M A, Ficner R. “Crystal structure of the surfactin synthetase-activating enzyme sfp: a prototype of the 4′-phosphopantetheinyl transferase superfamily” EMBO J. 1999 Dec. 1; 18(23):6823-31) as well as mutational analysis of amino acid residues important for activity (Mofid M R, Finking R, Essen L O, Marahiel M A. “Structure-based mutational analysis of the 4′-phosphopantetheinyl transferases Sfp from Bacillus subtilis : carrier protein recognition and reaction mechanism” Biochemistry. 2004 Apr. 13; 43(14):4128-36).
- the present inventors have identified two sequences (genes) in the Arabidopsis whole genome database that are likely to encode PPTases. These sequences (GenBank Accession numbers; AAG51443 and AAC05345) are currently listed as encoding “Unknown Proteins”. They can be identified as putative PPTases based on the presence in the translated protein sequences of several signature motifs including; G(I/V)D and WxxKE(A/S)xxK (SEQ ID NO:33), (listed in Lambalot et al., 1996 as characteristic of all PPTases).
- these two putative proteins contain two additional motifs typically found in PPTases typically associated with PKS and non-ribosomal peptide synthesis systems; i.e., FN(I/L/V)SHS (SEQ ID NO:34) and (1N/L)G(I/L/V)D(I/L/V) (SEQ ID NO:35). Furthermore, these motifs occur in the expected relative positions in the protein sequences. It is likely that homologues of the Arabidopsis genes are present in other plants, such as tobacco.
- these genes can be cloned and expressed to see if the enzymes they encode can activate the Schizochytrium ORFA ACP domains, or alternatively, OrfA could be expressed directly in the transgenic plant (either targeted to the plastid or the cytoplasm).
- One embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence from a non-bacterial PUFA PKS system, a homologue thereof, a fragment thereof, and/or a nucleic acid sequence that is complementary to any of such nucleic acid sequences.
- the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: (a) a nucleic acid sequence encoding an amino acid sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and biologically active fragments thereof; (b) a nucleic acid sequence encoding an amino acid sequence selected from the group consisting of: SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, and biologically active fragments thereof; (c) a nucleic acid sequence encoding an amino acid sequence that is at least about 60% identical to at least 500 consecutive amino acids of said amino acid sequence of (a), wherein said amino acid sequence has a biological activity of at least one domain of a poly
- an amino acid sequence that has a biological activity of at least one domain of a PUFA PKS system is an amino acid sequence that has the biological activity of at least one domain of the PUFA PKS system described in detail herein, as exemplified by the Schizochytrium PUFA PKS system.
- the biological activities of the various domains within the Schizochytrium PUFA PKS system have been described in detail above. Therefore, an isolated nucleic acid molecule of the present invention can encode the translation product of any PUFA PKS open reading frame, PUFA PKS domain, biologically active fragment thereof, or any homologue of a naturally occurring PUFA PKS open reading frame or domain which has biological activity.
- a homologue of given protein or domain is a protein or polypeptide that has an amino acid sequence which differs from the naturally occurring reference amino acid sequence (i.e., of the reference protein or domain) in that at least one or a few, but not limited to one or a few, amino acids have been deleted (e.g., a truncated version of the protein, such as a peptide or fragment), inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosylphosphatidyl inositol).
- homologues of a PUFA PKS protein or domain are described in detail below. It is noted that homologues can include synthetically produced homologues, naturally occurring allelic variants of a given protein or domain, or homologous sequences from organisms other than the organism from which the reference sequence was derived.
- the biological activity or biological action of a protein or domain refers to any function(s) exhibited or performed by the protein or domain that is ascribed to the naturally occurring form of the protein or domain as measured or observed in vivo (i.e., in the natural physiological environment of the protein) or in vitro (i.e., under laboratory conditions).
- Biological activities of PUFA PKS systems and the individual proteins/domains that make up a PUFA PKS system have been described in detail elsewhere herein.
- Modifications of a protein or domain such as in a homologue or mimetic (discussed below), may result in proteins or domains having the same biological activity as the naturally occurring protein or domain, or in proteins or domains having decreased or increased biological activity as compared to the naturally occurring protein or domain.
- a functional domain of a PUFA PKS system is a domain (i.e., a domain can be a portion of a protein) that is capable of performing a biological function (i.e., has biological activity).
- an isolated nucleic acid molecule is a nucleic acid molecule that has been removed from its natural milieu (i.e., that has been subject to human manipulation), its natural milieu being the genome or chromosome in which the nucleic acid molecule is found in nature.
- isolated does not necessarily reflect the extent to which the nucleic acid molecule has been purified, but indicates that the molecule does not include an entire genome or an entire chromosome in which the nucleic acid molecule is found in nature.
- An isolated nucleic acid molecule can include a gene.
- An isolated nucleic acid molecule that includes a gene is not a fragment of a chromosome that includes such gene, but rather includes the coding region and regulatory regions associated with the gene, but no additional genes naturally found on the same chromosome.
- An isolated nucleic acid molecule can also include a specified nucleic acid sequence flanked by (i.e., at the 5′ and/or the 3′ end of the sequence) additional nucleic acids that do not normally flank the specified nucleic acid sequence in nature (i.e., heterologous sequences).
- Isolated nucleic acid molecule can include DNA, RNA (e.g., mRNA), or derivatives of either DNA or RNA (e.g., cDNA).
- nucleic acid molecule primarily refers to the physical nucleic acid molecule and the phrase “nucleic acid sequence” primarily refers to the sequence of nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a protein or domain of a protein.
- an isolated nucleic acid molecule of the present invention is produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis.
- Isolated nucleic acid molecules include natural nucleic acid molecules and homologues thereof, including, but not limited to, natural allelic variants and modified nucleic acid molecules in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications provide the desired effect on PUFA PKS system biological activity as described herein.
- Protein homologues e.g., proteins encoded by nucleic acid homologues
- a nucleic acid molecule homologue can be produced using a number of methods known to those skilled in the art (see, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Labs Press, 1989).
- nucleic acid molecules can be modified using a variety of techniques including, but not limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site-directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, synthesis of oligonucleotide mixtures and ligation of mixture groups to “build” a mixture of nucleic acid molecules and combinations thereof.
- Nucleic acid molecule homologues can be selected from a mixture of modified nucleic acids by screening for the function of the protein encoded by the nucleic acid and/or by
- the minimum size of a nucleic acid molecule of the present invention is a size sufficient to form a probe or oligonucleotide primer that is capable of forming a stable hybrid (e.g., under moderate, high or very high stringency conditions) with the complementary sequence of a nucleic acid molecule useful in the present invention, or of a size sufficient to encode an amino acid sequence having a biological activity of at least one domain of a PUFA PKS system according to the present invention.
- the size of the nucleic acid molecule encoding such a protein can be dependent on nucleic acid composition and percent homology or identity between the nucleic acid molecule and complementary sequence as well as upon hybridization conditions per se (e.g., temperature, salt concentration, and formamide concentration).
- the minimal size of a nucleic acid molecule that is used as an oligonucleotide primer or as a probe is typically at least about 12 to about 15 nucleotides in length if the nucleic acid molecules are GC-rich and at least about 15 to about 18 bases in length if they are AT-rich.
- nucleic acid molecule of the present invention can include a sequence sufficient to encode a biologically active fragment of a domain of a PUFA PKS system, an entire domain of a PUFA PKS system, several domains within an open reading frame (Orf) of a PUFA PKS system, an entire Orf of a PUFA PKS system, or more than one Orf of a PUFA PKS system.
- an isolated nucleic acid molecule comprises or consists essentially of a nucleic acid sequence encoding an amino acid sequence selected from the group of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, or biologically active fragments thereof.
- the nucleic acid sequence is selected from the group of: SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:12, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31.
- any of the above-described PUFA PKS amino acid sequences, as well as homologues of such sequences can be produced with from at least one, and up to about 20, additional heterologous amino acids flanking each of the C- and/or N-terminal end of the given amino acid sequence.
- the resulting protein or polypeptide can be referred to as “consisting essentially of” a given amino acid sequence.
- the heterologous amino acids are a sequence of amino acids that are not naturally found (i.e., not found in nature, in vivo) flanking the given amino acid sequence or which would not be encoded by the nucleotides that flank the naturally occurring nucleic acid sequence encoding the given amino acid sequence as it occurs in the gene, if such nucleotides in the naturally occurring sequence were translated using standard codon usage for the organism from which the given amino acid sequence is derived.
- the phrase “consisting essentially of”, when used with reference to a nucleic acid sequence herein, refers to a nucleic acid sequence encoding a given amino acid sequence that can be flanked by from at least one, and up to as many as about 60, additional heterologous nucleotides at each of the 5′ and/or the 3′ end of the nucleic acid sequence encoding the given amino acid sequence.
- the heterologous nucleotides are not naturally found (i.e., not found in nature, in vivo) flanking the nucleic acid sequence encoding the given amino acid sequence as it occurs in the natural gene.
- the present invention also includes an isolated nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence having a biological activity of at least one domain of a PUFA PKS system.
- a nucleic acid sequence encodes a homologue of any of the Schizochytrium PUFA PKS ORFs or domains, including: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32, wherein the homologue has a biological activity of at least one (or two, three, four or more) domain of a PUFA PKS system as described previously herein.
- a homologue of a Schizochytrium PUFA PKS protein or domain encompassed by the present invention comprises an amino acid sequence that is at least about 60% identical to at least 500 consecutive amino acids of an amino acid sequence chosen from: SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6; wherein said amino acid sequence has a biological activity of at least one domain of a PUFA PKS system.
- the amino acid sequence of the homologue is at least about 60% identical to at least about 600 consecutive amino acids, and more preferably to at least about 700 consecutive amino acids, and more preferably to at least about 800 consecutive amino acids, and more preferably to at least about 900 consecutive amino acids, and more preferably to at least about 1000 consecutive amino acids, and more preferably to at least about 1100 consecutive amino acids, and more preferably to at least about 1200 consecutive amino acids, and more preferably to at least about 1300 consecutive amino acids, and more preferably to at least about 1400 consecutive amino acids, and more preferably to at least about 1500 consecutive amino acids of any of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6, or to the full length of SEQ ID NO:6.
- the amino acid sequence of the homologue is at least about 60% identical to at least about 1600 consecutive amino acids, and more preferably to at least about 1700 consecutive amino acids, and more preferably to at least about 1800 consecutive amino acids, and more preferably to at least about 1900 consecutive amino acids, and more preferably to at least about 2000 consecutive amino acids of any of SEQ ID NO:2 or SEQ ID NO:4, or to the full length of SEQ ID NO:4.
- the amino acid sequence of the homologue is at least about 60% identical to at least about 2100 consecutive amino acids, and more preferably to at least about 2200 consecutive amino acids, and more preferably to at least about 2300 consecutive amino acids, and more preferably to at least about 2400 consecutive amino acids, and more preferably to at least about 2500 consecutive amino acids, and more preferably to at least about 2600 consecutive amino acids, and more preferably to at least about 2700 consecutive amino acids, and more preferably to at least about 2800 consecutive amino acids, and even more preferably, to the full length of SEQ ID NO:2.
- a homologue of a Schizochytrium PUFA PKS protein or domain encompassed by the present invention comprises an amino acid sequence that is at least about 65% identical, and more preferably at least about 70% identical, and more preferably at least about 75% identical, and more preferably at least about 80% identical, and more preferably at least about 85% identical, and more preferably at least about 90% identical, and more preferably at least about 95% identical, and more preferably at least about 96% identical, and more preferably at least about 97% identical, and more preferably at least about 98% identical, and more preferably at least about 99% identical to an amino acid sequence chosen from: SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, over any of the consecutive amino acid lengths described in the paragraph above, wherein the amino acid sequence has a biological activity of at least one domain of a PUFA PKS system.
- a homologue of a Schizochytrium PUFA PKS protein or domain encompassed by the present invention comprises an amino acid sequence that is at least about 60% identical to an amino acid sequence chosen from: SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32, wherein said amino acid sequence has a biological activity of at least one domain of a PUFA PKS system.
- the amino acid sequence of the homologue is at least about 65% identical, and more preferably at least about 70% identical, and more preferably at least about 75% identical, and more preferably at least about 80% identical, and more preferably at least about 85% identical, and more preferably at least about 90% identical, and more preferably at least about 95% identical, and more preferably at least about 96% identical, and more preferably at least about 97% identical, and more preferably at least about 98% identical, and more preferably at least about 99% identical to an amino acid sequence chosen from: SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, wherein the amino acid sequence has a biological activity of at least one domain of a PUFA PKS system.
- the term “contiguous” or “consecutive”, with regard to nucleic acid or amino acid sequences described herein, means to be connected in an unbroken sequence.
- a first sequence to comprise 30 contiguous (or consecutive) amino acids of a second sequence means that the first sequence includes an unbroken sequence of 30 amino acid residues that is 100% identical to an unbroken sequence of 30 amino acid residues in the second sequence.
- a first sequence to have “100% identity” with a second sequence means that the first sequence exactly matches the second sequence with no gaps between nucleotides or amino acids.
- reference to a percent (%) identity refers to an evaluation of homology which is performed using: (1) a BLAST 2.0 Basic BLAST homology search using blastp for amino acid searches, blastn for nucleic acid searches, and blastX for nucleic acid searches and searches of translated amino acids in all 6 open reading frames, all with standard default parameters, wherein the query sequence is filtered for low complexity regions by default (described in Altschul, S. F., Madden, T. L., Shuäffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res.
- BLAST 2 alignment using the parameters described below
- PSI-BLAST with the standard default parameters (Position-Specific Iterated BLAST). It is noted that due to some differences in the standard parameters between BLAST 2.0 Basic BLAST and BLAST 2, two specific sequences might be recognized as having significant homology using the BLAST 2 program, whereas a search performed in BLAST 2.0 Basic BLAST using one of the sequences as the query sequence may not identify the second sequence in the top matches.
- PSI-BLAST provides an automated, easy-to-use version of a “profile” search, which is a sensitive way to look for sequence homologues.
- the program first performs a gapped BLAST database search.
- the PSI-BLAST program uses the information from any significant alignments returned to construct a position-specific score matrix, which replaces the query sequence for the next round of database searching. Therefore, it is to be understood that percent identity can be determined by using any one of these programs.
- BLAST 2 sequence alignment is performed in blastp or blastn using the BLAST 2.0 algorithm to perform a Gapped BLAST search (BLAST 2.0) between the two sequences allowing for the introduction of gaps (deletions and insertions) in the resulting alignment.
- BLAST 2.0 Gapped BLAST search
- a BLAST 2 sequence alignment is performed using the standard default parameters as follows.
- gap x_dropoff 50) expect (10) word size (3) filter (on).
- an amino acid sequence having the biological activity of at least one domain of a PUFA PKS system of the present invention includes an amino acid sequence that is sufficiently similar to a naturally occurring PUFA PKS protein or polypeptide that a nucleic acid sequence encoding the amino acid sequence is capable of hybridizing under moderate, high, or very high stringency conditions (described below) to (i.e., with) a nucleic acid molecule encoding the naturally occurring PUFA PKS protein or polypeptide (i.e., to the complement of the nucleic acid strand encoding the naturally occurring PUFA PKS protein or polypeptide).
- an amino acid sequence having the biological activity of at least one domain of a PUFA PKS system of the present invention is encoded by a nucleic acid sequence that hybridizes under moderate, high or very high stringency conditions to the complement of a nucleic acid sequence that encodes a protein comprising an amino acid sequence represented by any of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32.
- a nucleotide sequence of the present invention is a nucleotide sequence isolated from (obtainable from), identical to, or a homologue of, the nucleotide sequence from a Schizochytrium , wherein the nucleotide sequence from a Schizochytrium (including either strand of a DNA molecule from Schizochytrium ) hybridizes under moderate, high, or very high stringency conditions to a nucleotide sequence encoding an amino acid sequence represented by any of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32.
- the Schizochytrium is Schizochytrium ATCC 20888.
- amino acid sequencing and nucleic acid sequencing technologies are not entirely error-free, the sequences presented herein, at best, represent apparent sequences of PUFA PKS domains and proteins of the present invention, or of the nucleotide sequences encoding such amino acid sequences.
- hybridization conditions refer to standard hybridization conditions under which nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid., is incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In addition, formulae to calculate the appropriate hybridization and wash conditions to achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al., 1984 , Anal. Biochem. 138, 267-284; Meinkoth et al., ibid., is incorporated by reference herein in its entirety.
- moderate stringency hybridization and washing conditions refer to conditions which permit isolation of nucleic acid molecules having at least about 70% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 30% or less mismatch of nucleotides).
- High stringency hybridization and washing conditions refer to conditions which permit isolation of nucleic acid molecules having at least about 80% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 20% or less mismatch of nucleotides).
- Very high stringency hybridization and washing conditions refer to conditions which permit isolation of nucleic acid molecules having at least about 90% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 10% or less mismatch of nucleotides).
- conditions permitting about 10% or less mismatch of nucleotides i.e., one of skill in the art can use the formulae in Meinkoth et al., ibid. to calculate the appropriate hybridization and wash conditions to achieve these particular levels of nucleotide mismatch. Such conditions will vary, depending on whether DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for DNA:DNA hybrids are 10° C. less than for DNA:RNA hybrids.
- stringent hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 6 ⁇ SSC (0.9 M Na + ) at a temperature of between about 20° C. and about 35° C. (lower stringency), more preferably, between about 28° C. and about 40° C. (more stringent), and even more preferably, between about 35° C. and about 45° C. (even more stringent), with appropriate wash conditions.
- stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic strength of 6 ⁇ SSC (0.9 M Na + ) at a temperature of between about 30° C. and about 45° C., more preferably, between about 38° C.
- wash conditions should be as stringent as possible, and should be appropriate for the chosen hybridization conditions.
- hybridization conditions can include a combination of salt and temperature conditions that are approximately 20-25° C. below the calculated T m of a particular hybrid, and wash conditions typically include a combination of salt and temperature conditions that are approximately 12-20° C.
- hybridization conditions suitable for use with DNA:DNA hybrids includes a 2-24 hour hybridization in 6 ⁇ SSC (50% formamide) at about 42° C., followed by washing steps that include one or more washes at room temperature in about 2 ⁇ SSC, followed by additional washes at higher temperatures and lower ionic strength (e.g., at least one wash as about 37° C. in about 0.1 ⁇ -0.5 ⁇ SSC, followed by at least one wash at about 68° C. in about 0.1 ⁇ -0.5 ⁇ SSC).
- nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that is identical to, or that is a homologue of (as defined above) the nucleic acid sequence of a cDNA plasmid clone selected from LIB3033-046-D2 (ATCC Accession No. ______), LIB3033-047-B5 (ATCC Accession No. ______), or LIB81-042-B9 (ATCC Accession No. ______).
- the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that is identical to, or that is a homologue of (as defined above) the nucleic acid sequence of a genomic plasmid selected from: pJK1126 (ATCC Accession No. ______), pJK1129 (ATCC Accession No. ______), pJK1131 (ATCC Accession No. ______), pJK306 (ATCC Accession No. ______), pJK320 (ATCC Accession No. ______), pJK324 (ATCC Accession No. ______), or pBR002 (ATCC Accession No. ______)
- nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that encodes an amino acid sequence that is identical to, or that is a homologue of (as defined above) the amino acid sequence encoded by a cDNA plasmid clone selected from LIB3033-046-D2 (ATCC Accession No. ______), LIB3033-047-B5 (ATCC Accession No. ______), or LIB81-042-B9 (ATCC Accession No. ______).
- the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that encodes an amino acid sequence that is identical to, or that is a homologue of (as defined above) the amino acid sequence encoded by a genomic plasmid selected from: pJK1126 (ATCC Accession No. ______), pJK1129 (ATCC Accession No. ______), pJK1131 (ATCC Accession No. ______), pJK306 (ATCC Accession No. ______), pJK320 (ATCC Accession No. ______), pJK324 (ATCC Accession No. ______), or pBR002 (ATCC Accession No. ______).
- a genomic plasmid selected from: pJK1126 (ATCC Accession No. ______), pJK1129 (ATCC Accession No. ______), pJK1131 (ATCC Accession No. ______), pJK306 (
- a recombinant nucleic acid molecule comprising a recombinant vector and a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence having a biological activity of at least one domain or protein of a PUFA PKS system as described herein.
- a recombinant vector is an engineered (i.e., artificially produced) nucleic acid molecule that is used as a tool for manipulating a nucleic acid sequence of choice and for introducing such a nucleic acid sequence into a host cell.
- the recombinant vector is therefore suitable for use in cloning, sequencing, and/or otherwise manipulating the nucleic acid sequence of choice, such as by expressing and/or delivering the nucleic acid sequence of choice into a host cell to form a recombinant cell.
- a vector typically contains heterologous nucleic acid sequences, that is nucleic acid sequences that are not naturally found adjacent to nucleic acid sequence to be cloned or delivered, although the vector can also contain regulatory nucleic acid sequences (e.g., promoters, untranslated regions) which are naturally found adjacent to nucleic acid molecules of the present invention or which are useful for expression of the nucleic acid molecules of the present invention (discussed in detail below).
- the vector can be either RNA or DNA, either prokaryotic or eukaryotic, and typically is a plasmid.
- the vector can be maintained as an extrachromosomal element (e.g., a plasmid) or it can be integrated into the chromosome of a recombinant organism (e.g., a microbe or a plant).
- the entire vector can remain in place within a host cell, or under certain conditions, the plasmid DNA can be deleted, leaving behind the nucleic acid molecule of the present invention.
- the integrated nucleic acid molecule can be under chromosomal promoter control, under native or plasmid promoter control, or under a combination of several promoter controls. Single or multiple copies of the nucleic acid molecule can be integrated into the chromosome.
- a recombinant vector of the present invention can contain at least one selectable marker.
- a recombinant vector used in a recombinant nucleic acid molecule of the present invention is an expression vector.
- expression vector is used to refer to a vector that is suitable for production of an encoded product (e.g., a protein of interest).
- a nucleic acid sequence encoding the product to be produced e.g., a PUFA PKS domain
- a PUFA PKS domain is inserted into the recombinant vector to produce a recombinant nucleic acid molecule.
- the nucleic acid sequence encoding the protein to be produced is inserted into the vector in a manner that operatively links the nucleic acid sequence to regulatory sequences in the vector which enable the transcription and translation of the nucleic acid sequence within the recombinant host cell.
- a recombinant vector used in a recombinant nucleic acid molecule of the present invention is a targeting vector.
- targeting vector is used to refer to a vector that is used to deliver a particular nucleic acid molecule into a recombinant host cell, wherein the nucleic acid molecule is used to delete or inactivate an endogenous gene within the host cell or microorganism (i.e., used for targeted gene disruption or knock-out technology).
- Such a vector may also be known in the art as a “knock-out” vector.
- a portion of the vector but more typically, the nucleic acid molecule inserted into the vector (i.e., the insert), has a nucleic acid sequence that is homologous to a nucleic acid sequence of a target gene in the host cell (i.e., a gene which is targeted to be deleted or inactivated).
- the nucleic acid sequence of the vector insert is designed to bind to the target gene such that the target gene and the insert undergo homologous recombination, whereby the endogenous target gene is deleted, inactivated or attenuated (i.e., by at least a portion of the endogenous target gene being mutated or deleted).
- a recombinant nucleic acid molecule includes at least one nucleic acid molecule of the present invention operatively linked to one or more transcription control sequences.
- the phrase “recombinant molecule” or “recombinant nucleic acid molecule” primarily refers to a nucleic acid molecule or nucleic acid sequence operatively linked to a transcription control sequence, but can be used interchangeably with the phrase “nucleic acid molecule”, when such nucleic acid molecule is a recombinant molecule as discussed herein.
- the phrase “operatively linked” refers to linking a nucleic acid molecule to a transcription control sequence in a manner such that the molecule is able to be expressed when transfected (i.e., transformed, transduced, transfected, conjugated or conducted) into a host cell.
- Transcription control sequences are sequences which control the initiation, elongation, or termination of transcription. Particularly important transcription control sequences are those which control transcription initiation, such as promoter, enhancer, operator and repressor sequences.
- Suitable transcription control sequences include any transcription control sequence that can function in a host cell or organism into which the recombinant nucleic acid molecule is to be introduced.
- Recombinant nucleic acid molecules of the present invention can also contain additional regulatory sequences, such as translation regulatory sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell.
- a recombinant molecule of the present invention including those which are integrated into the host cell chromosome, also contains secretory signals (i.e., signal segment nucleic acid sequences) to enable an expressed protein to be secreted from the cell that produces the protein.
- Suitable signal segments include a signal segment that is naturally associated with the protein to be expressed or any heterologous signal segment capable of directing the secretion of the protein according to the present invention.
- a recombinant molecule of the present invention comprises a leader sequence to enable an expressed protein to be delivered to and inserted into the membrane of a host cell.
- Suitable leader sequences include a leader sequence that is naturally associated with the protein, or any heterologous leader sequence capable of directing the delivery and insertion of the protein to the membrane of a cell.
- the present inventors have found that the Schizochytrium PUFA PKS Orfs A and B are closely linked in the genome and region between the Orfs has been sequenced.
- the Orfs are oriented in opposite directions and 4244 base pairs separate the start (ATG) codons (i.e. they are arranged as follows: 3′OrfA5′-4244 bp-5′OrfB3′).
- Examination of the 4244 bp intergenic region did not reveal any obvious Orfs (no significant matches were found on a BlastX search).
- Both Orfs A and B are highly expressed in Schizochytrium , at least during the time of oil production, implying that active promoter elements are embedded in this intergenic region.
- These genetic elements are believed to have utility as a bi-directional promoter sequence for transgenic applications. For example, in a preferred embodiment, one could clone this region, place any genes of interest at each end and introduce the construct into Schizochytrium (or some other host in which the promoters can be shown to function). It is predicted that the regulatory elements, under the appropriate conditions, would provide for coordinated, high level expression of the two introduced genes.
- the complete nucleotide sequence for the regulatory region containing Schizochytrium PUFA PKS regulatory elements (e.g., a promoter) is represented herein as SEQ ID NO:36.
- OrfC is highly expressed in Schizochytrium during the time of oil production and regulatory elements are expected to reside in the region upstream of its start codon.
- a region of genomic DNA upstream of OrfC has been cloned and sequenced and is represented herein as (SEQ ID NO:37). This sequence contains the 3886 nt immediately upstream of the OrfC start codon. Examination of this region did not reveal any obvious Orfs (i.e., no significant matches were found on a BlastX search). It is believed that regulatory elements contained in this region, under the appropriate conditions, will provide for high-level expression of a gene placed behind them. Additionally, under the appropriate conditions, the level of expression may be coordinated with genes under control of the A-B intergenic region (SEQ ID NO:36).
- a recombinant nucleic acid molecule useful in the present invention can include a PUFA PKS regulatory region contained within SEQ ID NO:36 and/or SEQ ID NO:37.
- a regulatory region can include any portion (fragment) of SEQ ID NO:36 and/or SEQ ID NO:37 that has at least basal PUFA PKS transcriptional activity (at least basal promoter activity).
- One or more recombinant molecules of the present invention can be used to produce an encoded product (e.g., a PUFA PKS domain, protein, or system) of the present invention.
- an encoded product is produced by expressing a nucleic acid molecule as described herein under conditions effective to produce the protein.
- a preferred method to produce an encoded protein is by transfecting a host cell with one or more recombinant molecules to form a recombinant cell. Suitable host cells to transfect include, but are not limited to, any bacterial, fungal (e.g., yeast), insect, plant or animal cell that can be transfected. Host cells can be either untransfected cells or cells that are already transfected with at least one other recombinant nucleic acid molecule.
- the term “transfection” is used to refer to any method by which an exogenous nucleic acid molecule (i.e., a recombinant nucleic acid molecule) can be inserted into a cell.
- the term “transformation” can be used interchangeably with the term “transfection” when such term is used to refer to the introduction of nucleic acid molecules into microbial cells, such as algae, bacteria and yeast.
- transfection is used to describe an inherited change due to the acquisition of exogenous nucleic acids by the microorganism and is essentially synonymous with the term “transfection.”
- transformation has acquired a second meaning which can refer to changes in the growth properties of cells in culture after they become cancerous, for example. Therefore, to avoid confusion, the term “transfection” is preferably used with regard to the introduction of exogenous nucleic acids into animal cells, and the term “transfection” will be used herein to generally encompass transfection of animal cells, plant cells and transformation of microbial cells, to the extent that the terms pertain to the introduction of exogenous nucleic acids into a cell. Therefore, transfection techniques include, but are not limited to, transformation, particle bombardment, electroporation, microinjection, lipofection, adsorption, infection and protoplast fusion.
- recombinant DNA technologies can improve control of expression of transfected nucleic acid molecules by manipulating, for example, the number of copies of the nucleic acid molecules within the host cell, the efficiency with which those nucleic acid molecules are transcribed, the efficiency with which the resultant transcripts are translated, and the efficiency of post-translational modifications.
- the promoter sequence might be genetically engineered to improve the level of expression as compared to the native promoter.
- Recombinant techniques useful for controlling the expression of nucleic acid molecules include, but are not limited to, integration of the nucleic acid molecules into one or more host cell chromosomes, addition of vector stability sequences to plasmids, substitutions or modifications of transcription control signals (e.g., promoters, operators, enhancers), substitutions or modifications of translational control signals (e.g., ribosome binding sites, Shine-Dalgarno sequences), modification of nucleic acid molecules to correspond to the codon usage of the host cell, and deletion of sequences that destabilize transcripts.
- transcription control signals e.g., promoters, operators, enhancers
- substitutions or modifications of translational control signals e.g., ribosome binding sites, Shine-Dalgarno sequences
- This invention also relates to PUFA PKS systems (and proteins or domains thereof) from microorganisms other than those described specifically herein that are homologous in structure, domain organization and/or function to a Schizochytrium PUFA PKS system (and proteins or domains thereof) as described herein.
- the microorganism is a non-bacterial microorganism, and preferably, the microorganism is a eukaryotic microorganism.
- this invention relates to use of these microorganisms and the PUFA PKS systems or components thereof from these microorganisms in the various applications for a PUFA PKS system (e.g., genetically modified organisms and methods of producing bioactive molecules) according to the present invention.
- Such microorganisms have the following characteristics: (a) produces at least one PUFA; and (b) has an ability to produce increased PUFAs under dissolved oxygen conditions of less than about 5% of saturation in the fermentation medium, as compared to production of PUFAs by said microorganism under dissolved oxygen conditions of greater than 5% of saturation, more preferably 10% of saturation, more preferably greater than 15% of saturation and more preferably greater than 20% of saturation in the fermentation medium.
- a screening process for identification of microorganisms comprising a PUFA PKS system is described in detail in U.S. Patent Application Publication No. 20020194641, supra.
- Thraustochytrid refers to any members of the order Thraustochytriales, which includes the family Thraustochytriaceae
- the term “Labyrinthulid” refers to any member of the order Labyrinthulales, which includes the family Labyrinthulaceae.
- Labyrinthulaceae have been considered to be members of the order Thraustochytriales, but in revisions of the taxonomy of such organisms, the family is now considered to be a member of the order Labyrinthulales, and both Labyrinthulales and Thraustochytriales are considered to be members of the phylum Labyrinthulomycota.
- Taxonomic theorists generally place Thraustochytrids with the algae or algae-like protists.
- Thraustochytrids include the following organisms: Order: Thraustochytriales; Family: Thraustochytriaceae; Genera: Thraustochytrium (Species: sp., arudimentale, aureum, benthicola, globosum, kinnei, motivum, multirudimentale, pachydermum, proliferum, roseum, striatum ), Ulkenia (previously considered by some to be a member of Thraustochytrium ) (Species: sp., amoeboidea, kerguelensis, minuta, profunda, radiata, sailens, sarkariana, schizochytrops, visurgensis, yorkensis ), Schizochytrium (Species: sp., aggregatum, limnaceum, mangrove
- Labyrinthulids include the following organisms: Order: Labyrinthulales, Family: Labyrinthulaceae, Genera: Labyrinthula (Species: sp., algeriensis, coenocystis, chattonii, macrocystis, macrocystis atlantica, macrocystis macrocystis, marina, minuta, roscoffensis, valkanovii, vitellina, vitellina pacifica, vitellina vitellina, zopfii ), Labyrinthuloides (Species: sp., haliotidis, yorkensis ), Labyrinthomyxa (Species: sp., marina ), Diplophrys (Species: sp., archeri ), Pyrrhosorus (Species: sp., marinus ), Sorodiplophrys
- an organism preferably a microorganism or a plant
- such an organism can endogenously contain and express a PUFA PKS system
- the genetic modification can be a genetic modification of one or more of the functional domains of the endogenous PUFA PKS system, whereby the modification has some effect on the activity of the PUFA PKS system.
- such an organism can endogenously contain and express a PUFA PKS system, and the genetic modification can be an introduction of at least one exogenous nucleic acid sequence (e.g., a recombinant nucleic acid molecule), wherein the exogenous nucleic acid sequence encodes at least one biologically active domain or protein from the same or a second PKS system and/or a protein that affects the activity of said PUFA PKS system (e.g., a phosphopantetheinyl transferases (PPTase), discussed below).
- exogenous nucleic acid sequence e.g., a recombinant nucleic acid molecule
- PPTase phosphopantetheinyl transferases
- the organism does not necessarily endogenously (naturally) contain a PUFA PKS system, but is genetically modified to introduce at least one recombinant nucleic acid molecule encoding an amino acid sequence having the biological activity of at least one domain of a PUFA PKS system.
- PUFA PKS activity is affected by introducing or increasing PUFA PKS activity in the organism.
- one embodiment relates to a genetically modified microorganism, wherein the microorganism expresses a PKS system comprising at least one biologically active domain of a polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) system.
- the at least one domain of the PUFA PKS system is encoded by a nucleic acid sequence described herein.
- the genetic modification affects the activity of the PKS system in the organism.
- the genetically modified microorganism can include any one or more of the above-identified nucleic acid sequences, and/or any of the other homologues of any of the Schizochytrium PUFA PKS ORFs or domains as described in detail above.
- a genetically modified microorganism can include a genetically modified bacterium, protist, microalgae, fungus, or other microbe, and particularly, any of the genera of the order Thraustochytriales (e.g., a Thraustochytrid) described herein.
- a genetically modified microorganism has a genome which is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form such that the desired result is achieved (i.e., increased or modified PUFA PKS activity and/or production of a desired product using the PUFA PKS system or component thereof).
- a genetically modified microorganism can include a microorganism in which nucleic acid molecules have been inserted, deleted or modified (i.e., mutated; e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such a manner that such modifications provide the desired effect within the microorganism.
- Preferred microorganism host cells to modify according to the present invention include, but are not limited to, any bacteria, protist, microalga, fungus, or protozoa.
- preferred microorganisms to genetically modify include, but are not limited to, any microorganism of the order Thraustochytriales or any microorganism of the order Labyrinthulales.
- Particularly preferred host cells for use in the present invention could include microorganisms from a genus including, but not limited to: Thraustochytrium, Ulkenia, Schizochytrium, Japonochytrium, Aplanochytrium, Althornia, Elina, Labyrinthula, Labyrinthuloides, Labyrinthomyxa, Diplophrys, Pyrrhosorus, Sorodiplophrys or Chlamydomyxa .
- suitable host microorganisms for genetic modification include, but are not limited to, yeast including Saccharomyces cerevisiae, Saccharomyces carlsbergensis , or other yeast such as Candida, Kluyveromyces , or other fungi, for example, filamentous fungi such as Aspergillus, Neurospora, Penicillium , etc.
- Bacterial cells also may be used as hosts. This includes Escherichia coli , which can be useful in fermentation processes. Alternatively, a host such as a Lactobacillus species or Bacillus species can be used as a host.
- Another embodiment of the present invention relates to a genetically modified plant or part of a plant (e.g., wherein the plant has been genetically modified to express a PUPA PKS system described herein), which includes at least the core PUFA PKS enzyme complex and, in one embodiment, at least one PUFA PKS accessory protein, (e.g., a PPTase), so that the plant produces PUFAs.
- the plant is an oil seed plant, wherein the oil seeds or oil in the oil seeds contain PUFAs produced by the PUFA PKS system.
- oils contain a detectable amount of at least one target or primary PUFA that is the product of the PUFA PKS system.
- Plants are not known to endogenously contain a PUFA PKS system, and therefore, the PUFA PKS systems of the present invention represent an opportunity to produce plants with unique fatty acid production capabilities. It is a particularly preferred embodiment of the present invention to genetically engineer plants to produce one or more PUFAs in the same plant, including, EPA, DHA, DPA, ARA, GLA, SDA and others.
- the present invention offers the ability to create any one of a number of “designer oils” in various ratios and forms.
- the disclosure of the PUFA PKS genes from the particular marine organisms described herein offer the opportunity to more readily extend the range of PUFA production and successfully produce such PUFAs within temperature ranges used to grow most crop plants.
- A. tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform plant cells.
- the Ti and Ri plasmids of A. tumefaciens and A. rhizogenes carry genes responsible for genetic transformation of the plant. See, for example, Kado, C. I., Crit. Rev. Plant. Sci. 10:1 (1991).
- Agrobacterium vector systems and methods for Agrobacterium -mediated gene transfer are provided by numerous references, including Gruber et al., supra, Miki et al., supra, Moloney et al., Plant Cell Reports 8:238 (1989), and U.S. Pat. Nos. 4,940,838 and 5,464,763.
- microprojectile-mediated transformation wherein DNA is carried on the surface of microprojectiles.
- the expression vector is introduced into plant tissues with a biolistic device that accelerates the microprojectiles to speeds sufficient to penetrate plant cell walls and membranes.
- a genetically modified plant can include any genetically modified plant including higher plants and particularly, any consumable plants or plants useful for producing a desired bioactive molecule of the present invention.
- Plant parts include any parts of a plant, including, but not limited to, seeds (immature or mature), oils, pollen, embryos, flowers, fruits, shoots, leaves, roots, stems, explants, etc.
- a genetically modified plant has a genome that is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form such that the desired result is achieved (e.g., PUFA PKS activity and production of PUFAs).
- Genetic modification of a plant can be accomplished using classical strain development and/or molecular genetic techniques. Methods for producing a transgenic plant, wherein a recombinant nucleic acid molecule encoding a desired amino acid sequence is incorporated into the genome of the plant, are known in the art.
- a preferred plant to genetically modify according to the present invention is preferably a plant suitable for consumption by animals, including humans.
- Preferred plants to genetically modify according to the present invention include, but are not limited to any higher plants, including both dicotyledonous and monocotyledonous plants, and particularly consumable plants, including crop plants and especially plants used for their oils.
- Such plants can include, for example: canola, soybeans, rapeseed, linseed, corn, safflowers, sunflowers and tobacco.
- Other preferred plants include those plants that are known to produce compounds used as pharmaceutical agents, flavoring agents, nutraceutical agents, functional food ingredients or cosmetically active agents or plants that are genetically engineered to produce these compounds/agents.
- a genetically modified microorganism or plant includes a microorganism or plant that has been modified using recombinant technology.
- genetic modifications that result in a decrease in gene expression, in the function of the gene, or in the function of the gene product (i.e., the protein encoded by the gene) can be referred to as inactivation (complete or partial), deletion, interruption, blockage or down-regulation of a gene.
- a genetic modification in a gene which results in a decrease in the function of the protein encoded by such gene can be the result of a complete deletion of the gene (i.e., the gene does not exist, and therefore the protein does not exist), a mutation in the gene which results in incomplete or no translation of the protein (e.g., the protein is not expressed), or a mutation in the gene which decreases or abolishes the natural function of the protein (e.g., a protein is expressed which has decreased or no enzymatic activity or action).
- Genetic modifications that result in an increase in gene expression or function can be referred to as amplification, overproduction, overexpression, activation, enhancement, addition, or up-regulation of a gene.
- the genetic modification of a microorganism or plant according to the present invention preferably affects the activity of the PKS system expressed by the plant, whether the PKS system is endogenous and genetically modified, endogenous with the introduction of recombinant nucleic acid molecules into the organism, or provided completely by recombinant technology.
- to “affect the activity of a PKS system” includes any genetic modification that causes any detectable or measurable change or modification in the PKS system expressed by the organism as compared to in the absence of the genetic modification.
- a detectable change or modification in the PKS system can include, but is not limited to: the introduction of PKS system activity into an organism such that the organism now has measurable/detectable PKS system activity (i.e., the organism did not contain a PKS system prior to the genetic modification), the introduction into the organism of a functional domain from a different PKS system than a PKS system endogenously expressed by the organism such that the PKS system activity is modified (e.g., a bacterial PUFA PKS domain or a type I PKS domain is introduced into an organism that endogenously expresses a non-bacterial PUFA PKS system), a change in the amount of a bioactive molecule produced by the PKS system (e.g., the system produces more (increased amount) or less (decreased amount) of a given product as compared to in the absence of the genetic modification), a change in the type of a bioactive molecule produced by the PKS system (e.g., the system produces a new or different product, or a variant
- reference to increasing the activity of a functional domain or protein in a PUFA PKS system refers to any genetic modification in the organism containing the domain or protein (or into which the domain or protein is to be introduced) which results in increased functionality of the domain or protein system and can include higher activity of the domain or protein (e.g., specific activity or in vivo enzymatic activity), reduced inhibition or degradation of the domain or protein system, and overexpression of the domain or protein.
- gene copy number can be increased
- expression levels can be increased by use of a promoter that gives higher levels of expression than that of the native promoter, or a gene can be altered by genetic engineering or classical mutagenesis to increase the activity of the domain or protein encoded by the gene.
- reference to decreasing the activity of a functional domain or protein in a PUFA PKS system refers to any genetic modification in the organism containing such domain or protein (or into which the domain or protein is to be introduced) which results in decreased functionality of the domain or protein and includes decreased activity of the domain or protein, increased inhibition or degradation of the domain or protein and a reduction or elimination of expression of the domain or protein.
- the action of domain or protein of the present invention can be decreased by blocking or reducing the production of the domain or protein, “knocking out” the gene or portion thereof encoding the domain or protein, reducing domain or protein activity, or inhibiting the activity of the domain or protein.
- Blocking or reducing the production of a domain or protein can include placing the gene encoding the domain or protein under the control of a promoter that requires the presence of an inducing compound in the growth medium. By establishing conditions such that the inducer becomes depleted from the medium, the expression of the gene encoding the domain or protein (and therefore, of protein synthesis) could be turned off.
- Blocking or reducing the activity of domain or protein could also include using an excision technology approach similar to that described in U.S. Pat. No. 4,743,546, incorporated herein by reference. To use this approach, the gene encoding the protein of interest is cloned between specific genetic sequences that allow specific, controlled excision of the gene from the genome. Excision could be prompted by, for example, a shift in the cultivation temperature of the culture, as in U.S. Pat. No. 4,743,546, or by some other physical or nutritional signal.
- a genetic modification includes a modification of a nucleic acid sequence encoding an amino acid sequence that has a biological activity of at least one domain of a non-bacterial PUFA PKS system as described herein.
- Such a modification can be to an amino acid sequence within an endogenously (naturally) expressed non-bacterial PUFA PKS system, whereby a microorganism that naturally contains such a system is genetically modified by, for example, classical mutagenesis and selection techniques and/or molecular genetic techniques, include genetic engineering techniques.
- Genetic engineering techniques can include, for example, using a targeting recombinant vector to delete a portion of an endogenous gene, or to replace a portion of an endogenous gene with a heterologous sequence.
- heterologous sequences that could be introduced into a host genome include sequences encoding at least one functional domain from another PKS system, such as a different non-bacterial PUFA PKS system, a bacterial PUFA PKS system, a type I PKS system, a type II PKS system, or a modular PKS system.
- Other heterologous sequences to introduce into the genome of a host includes a sequence encoding a protein or functional domain that is not a domain of a PKS system, but which will affect the activity of the endogenous PKS system. For example, one could introduce into the host genome a nucleic acid molecule encoding a phosphopantetheinyl transferase (discussed below). Specific modifications that could be made to an endogenous PUPA PKS system are discussed in detail below.
- the genetic modification can include: (1) the introduction of a recombinant nucleic acid molecule encoding an amino acid sequence having a biological activity of at least one domain of a non-bacterial PUFA PKS system; and/or (2) the introduction of a recombinant nucleic acid molecule encoding a protein or functional domain that affects the activity of a PUFA PKS system, into a host.
- the host can include: (1) a host cell that does not express any PKS system, wherein all functional domains of a PKS system are introduced into the host cell, and wherein at least one functional domain is from a non-bacterial PUFA PKS system; (2) a host cell that expresses a PKS system (endogenous or recombinant) having at least one functional domain of a non-bacterial PUFA PKS system, wherein the introduced recombinant nucleic acid molecule can encode at least one additional non-bacterial PUFA PKS domain function or another protein or domain that affects the activity of the host PKS system; and (3) a host cell that expresses a PKS system (endogenous or recombinant) which does not necessarily include a domain function from a non-bacterial PUFA PKS, and wherein the introduced recombinant nucleic acid molecule includes a nucleic acid sequence encoding at least one functional domain of a non-bacterial PUFA PKS system.
- the present invention intends to encompass any genetically modified organism (e.g., microorganism or plant), wherein the organism comprises at least one non-bacterial PUFA PKS domain function (either endogenously or by recombinant modification), and wherein the genetic modification has a measurable effect on the non-bacterial PUFA PKS domain function or on the PKS system when the organism comprises a functional PKS system.
- a genetically modified organism e.g., microorganism or plant
- the organism comprises at least one non-bacterial PUFA PKS domain function (either endogenously or by recombinant modification)
- the genetic modification has a measurable effect on the non-bacterial PUFA PKS domain function or on the PKS system when the organism comprises a functional PKS system.
- PUFA PKS systems of the present invention gene mixing can be used to extend the range of PUFA products (and ratios thereof) to include EPA, DPA, DHA, ARA, GLA, SDA and others, as well as to produce a wide variety of bioactive molecules, including antibiotics, other pharmaceutical compounds, and other desirable products.
- the method to obtain these bioactive molecules includes not only the mixing of genes from various organisms but also various methods of genetically modifying the non-bacterial PUFA PKS genes disclosed herein.
- Knowledge of the genetic basis and domain structure of the non-bacterial PUFA PKS system of the present invention provides a basis for designing novel genetically modified organisms which produce a variety of bioactive molecules.
- the current products of the Schizochytrium PUFA PKS system are DHA and DPA (C22:5 ⁇ 6). If one manipulated the system to produce C20 fatty acids, one would expect the products to be EPA and ARA (C20:4 ⁇ 6). This could provide a new source for ARA.
- encompassed by the present invention are methods to genetically modify microbial or plant cells by: genetically modifying at least one nucleic acid sequence in the organism that encodes an amino acid sequence having the biological activity of at least one functional domain of a PUFA PKS system according to the present invention, and/or expressing at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding such amino acid sequence.
- genetically modifying at least one nucleic acid sequence in the organism that encodes an amino acid sequence having the biological activity of at least one functional domain of a PUFA PKS system according to the present invention and/or expressing at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding such amino acid sequence.
- Various embodiments of such sequences, methods to genetically modify an organism, and specific modifications have been described in detail above.
- the method is used to produce a particular genetically modified organism that produces a particular bioactive molecule or molecules.
- a mutagenesis program could be combined with a selective screening process to obtain bioactive molecules of interest. This would include methods to search for a range of bioactive compounds. This search would not be restricted to production of those molecules with cis double bonds.
- the mutagenesis methods could include, but are not limited to: chemical mutagenesis, gene shuffling, switching regions of the genes encoding specific enzymatic domains, or mutagenesis restricted to specific regions of those genes, as well as other methods.
- high throughput mutagenesis methods could be used to influence or optimize production of the desired bioactive molecule.
- a product of interest e.g., ARA
- screening methods are used to identify additional non-bacterial organisms having novel PKS systems similar to the PUFA PKS system of Schizochytrium , as described herein (see above).
- Homologous PKS systems identified in such organisms can be used in methods similar to those described herein for the Schizochytrium , as well as for an additional source of genetic material from which to create, further modify and/or mutate a PUFA PKS system for expression in that microorganism, in another microorganism, or in a higher plant, to produce a variety of compounds.
- a preferred embodiment of the invention includes a system to select for only those modifications that do not block the ability of the PUFA PKS system to produce a product.
- the FabB-strain of E. coli is incapable of synthesizing unsaturated fatty acids and requires supplementation of the medium with fatty acids that can substitute for its normal unsaturated fatty acids in order to grow (see Metz et al., 2001, supra).
- this requirement for supplementation of the medium
- the transformed FabB-strain now requires a functional PUFA-PKS system (to produce the unsaturated fatty acids) for growth without supplementation.
- the key element in this example is that production of a wide range of unsaturated fatty acid will suffice (even unsaturated fatty acid substitutes such as branched chain fatty acids). Therefore, in another preferred embodiment of the invention, one could create a large number of mutations in one or more of the PUFA PKS genes disclosed herein, and then transform the appropriately modified FabB-strain (e.g.
- a genetically modified organism has a modification that changes at least one product produced by the endogenous PKS system, as compared to a wild-type organism.
- a genetically modified organism has been modified by transfecting the organism with a recombinant nucleic acid molecule encoding a protein that regulates the chain length of fatty acids produced by the PUFA PKS system.
- the protein that regulates the chain length of fatty acids produced by the PUFA PKS system can be a chain length factor that directs the synthesis of C20 units or C22 units.
- a genetically modified organism expresses a PUFA PKS system comprising a genetic modification in a domain selected from the group consisting of a domain encoding ⁇ -hydroxy acyl-ACP dehydrase (DH) and a domain encoding 3-ketoacyl-ACP synthase (KS), wherein the modification alters the ratio of long chain fatty acids produced by the PUFA PKS system as compared to in the absence of the modification.
- the modification is selected from the group consisting of a deletion of all or a part of the domain, a substitution of a homologous domain from a different organism for the domain, and a mutation of the domain.
- a genetically modified organism expresses a PUFA PKS system comprising a modification in an enoyl-ACP reductase (ER) domain, wherein the modification results in the production of a different compound as compared to in the absence of the modification.
- the modification is selected from the group consisting of a deletion of all or a part of the ER domain, a substitution of an ER domain from a different organism for the ER domain, and a mutation of the ER domain.
- the genetically modified organism produces a polyunsaturated fatty acid (PUFA) profile that differs from the naturally occurring organism without a genetic modification.
- PUFA polyunsaturated fatty acid
- a genetically modified microorganism or plant includes a microorganism or plant which has an enhanced ability to synthesize desired bioactive molecules (products) or which has a newly introduced ability to synthesize specific products (e.g., to synthesize a specific antibiotic).
- an enhanced ability to synthesize refers to any enhancement, or up-regulation, in a pathway related to the synthesis of the product such that the microorganism or plant produces an increased amount of the product (including any production of a product where there was none before) as compared to the wild-type microorganism or plant, cultured or grown, under the same conditions. Methods to produce such genetically modified organisms have been described in detail above.
- the present invention relates to a genetically modified plant or part of a plant (e.g., wherein the plant has been genetically modified to express a PUFA PKS system described herein), which includes at least the core PUFA PKS enzyme complex and, in one embodiment, at least one PUFA PKS accessory protein, (e.g., a PPTase), so that the plant produces PUFAs.
- the plant is an oil seed plant, wherein the oil seeds or oil in the oil seeds contain PUFAs produced by the PUFA PKS system.
- oils contain a detectable amount of at least one target or primary PUFA that is the product of the PUFA PKS system.
- the present inventors demonstrate herein the production of PUFAs in a plant that has been genetically modified to express the genes encoding a PUFA PKS system from Schizochytrium of the present invention and a PUFA PKS accessory enzyme, 4′-phosphopantetheinyl transferase (PPTase).
- the oils produced by these plants contain significant quantities of both DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), which are the predominant PUFAs (the primary PUFAs) produced by the Schizochytrium from which the PUFA PKS genes were derived.
- oils from plants that produce PUFAs using the PUFA PKS pathway have a different fatty acid profile than plants that are genetically engineered to produce the same PUFAs by the “standard” pathway described above.
- oils from plants that have been genetically engineered to produce specific PUFAs by the PUFA PKS pathway are substantially free of the various intermediate products and side products that accumulate in oils that are produced as a result of the use of the standard PUFA synthesis pathway. This characteristic is discussed in detail below.
- the free fatty acid is exported from the plastid and converted to an acyl-CoA.
- the 18:1 can be esterified to phosphatidylcholine (PC) and up to two more cis double bonds can be added.
- the newly introduced elongases can utilize substrates in the acyl-CoA pool to add carbons in two-carbon increments.
- Newly introduced desaturases can utilize either fatty acids esterified to PC, or those in the acyl-CoA pool, depending on the source of the enzyme.
- One consequence of this scheme for long chain PUFA production is that intermediates or side products in the pathway accumulate, which often represent the majority of the novel fatty acids in the plant oil, rather than the target long chain PUFA.
- the target PUFA product i.e., the PUFA product that one is targeting for production, trying to produce, attempting to produce, by using the standard pathway
- DHA or EPA for example (e.g., produced using elongases and desaturases that will produce the DHA or EPA from the products of the FAS system)
- intermediate products and side products will be produced in addition to the DHA or EPA, and these intermediate or side products frequently represent the majority of the products produced by the pathway, or are at least present in significant amounts in the lipids of the production organism.
- Such intermediate and side products include, but are not limited to, fatty acids having fewer carbons and/or fewer double bonds than the target, or primary PUFA, and can include unusual fatty acid side products that may have the same number of carbons as the target or primary PUFA, but which may have double bonds in unusual positions.
- fatty acids having fewer carbons and/or fewer double bonds than the target, or primary PUFA and can include unusual fatty acid side products that may have the same number of carbons as the target or primary PUFA, but which may have double bonds in unusual positions.
- the oils produced by the system include a variety of intermediate and side products including: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9) and various other intermediate or side products, such as 20:0; 20:1 ( ⁇ 5); 20:1 ( ⁇ 11): 20:2 ( ⁇ 8,11); 20:2 ( ⁇ 11,14); 20:3 ( ⁇ 5,11,14); 20:3 ( ⁇ 11,14,17); mead acid (20:0; 20:1 ( ⁇ 5); 20:1 ( ⁇ 11): 20:2 ( ⁇ 8,11); 20:2 ( ⁇ 11,14); 20:3 ( ⁇ 5,11,14); 20:3 ( ⁇ 11,14,17); mead
- the PUFA PKS synthase of the present invention does not utilize the fatty acid products of FAS systems. Instead, it produces the final PUFA product (the primary PUFA product) from the same small precursor molecule that is utilized by FASB and elongases (malonyl-CoA). Therefore, intermediates in the synthesis cycle are not released in any significant amount, and the PUFA product (also referred to herein as the primary PUFA product) is efficiently transferred to phospholipids (PL) and triacylglycerol (TAG) fractions of the lipids.
- PL phospholipids
- TAG triacylglycerol
- a PUFA PKS system may produce two target or primary PUFA products (e.g., the PUFA PKS system from Schizochytrium produces both DHA and DPA n-6 as primary products), but DPA is not an intermediate in the pathway to produce DHA. Rather, each is a separate product of the same PUFA PKS system. Therefore, the PUFA PKS genes of the present invention are an excellent means of producing oils containing PUFAs, and particularly, LCPUFAs in a heterologous host, such as a plant, wherein the oils are substantially free (defined below) of the intermediates and side products that contaminate oils produced by the “standard” PUFA pathway.
- PUFAs that can be produced by the present invention include, but are not limited to, DHA (docosahexaenoic acid (C22:6, n-3)), ARA (eicosatetraenoic acid or arachidonic acid (C20:4, n-6)), DPA (docosapentaenoic acid (C22:5, n-6 or n-3)), and EPA (eicosapentaenoic acid (C20:5, n-3)).
- DHA docosahexaenoic acid
- ARA eicosatetraenoic acid or arachidonic acid
- DPA docosapentaenoic acid (C22:5, n-6 or n-3)
- EPA eicosapentaenoic acid
- the present invention allows for the production of commercially valuable lipids enriched in one or more desired (target or primary) PUFAs by the present inventors' development of genetically modified plants through the use of the polyketide synthase system of the present invention, as well as components thereof, that produces PUFAs.
- reference to a “primary PUFA”, “target PUFA”, “intended PUFA”, or “desired PUFA” refers to the particular PUFA or PUFAs that are the intended product of the enzyme pathway that is used to produce the PUFA(s).
- a target or desired PUFA e.g., DHA or EPA.
- target or desired PUFA produced by the standard pathway may not actually be a “primary” PUFA in terms of the amount of PUFA as a percentage of total fatty acids produced by the system, due to the formation of intermediates and side products that can actually represent the majority of products produced by the system.
- primary PUFA even in that instance to refer to the target or intended PUFA product produced by the elongases or desaturases used in the system.
- PUFA PKS system In contrast to the classical pathway for PUFA production, when using a PUFA PKS system, a given PUFA PKS system derived from a particular organism (or created from combining proteins and domains from PUFA PKS systems) will produce particular PUFA(s), such that selection of a PUFA PKS system from a particular organism will result in the production of specified target or primary PUFAs. For example, use of a PUFA PKS system from Schizochytrium according to the present invention will result in the production of DHA and DPAn-6 as the target or primary PUFAs.
- oils produced by the organism are substantially free of intermediate or side products that are not the target or primary PUFA products and that are not naturally produced by the endogenous FAS system in the wild-type organism (e.g., wild-type plants produce some shorter or medium chain PUFAs, such as 18 carbon PUFAs, via the FAS system, but there will be new, or additional, fatty acids produced in the plant as a result of genetic modification with a PUFA PKS system).
- the majority of additional fatty acids in the profile of total fatty acids produced by plants that have been genetically modified with the PUFA PKS system of the present invention (or a component thereof), comprise the target or intended PUFA products of the PUFA PKS system (i.e., the majority of additional fatty acids in the total fatty acids that are produced by the genetically modified plant are the target PUFA(s)).
- intermediate products or “side products” of an enzyme system that produces PUFAs refers to any products, and particularly, fatty acid products, that are produced by the enzyme system as a result of the production of the target or primary PUFA of the system.
- Intermediate and side products are particularly significant in the standard pathway for PUFA synthesis and are substantially less significant in the PUFA PKS pathway, as discussed above.
- a primary or target PUFA of one enzyme system may be an intermediate of a different enzyme system where the primary or target product is a different PUFA, and this is particularly true of products of the standard pathway of PUFA production, since the PUFA PKS system of the present invention substantially avoids the production of intermediates.
- fatty acids such as GLA, DGLA and SDA are produced as intermediate products in significant quantities (e.g., U.S. Patent Application Publication 2004/0172682 illustrates this point).
- U.S. Patent Application Publication 2004/0172682 when using the standard pathway to produce DHA, in addition to the fatty acids mentioned above, ETA and EPA (notably the target PUFA in the first example above) are produced in significant quantities and in fact, may be present in significantly greater quantities relative to the total fatty acid product than the target PUFA itself. This latter point is shown in U.S. Patent Application Publication 2004/0172682, where a plant that was engineered to produce DHA by the standard pathway produces more EPA as a percentage of total fatty acids than DHA.
- any intermediate or side product fatty acids that are produced in the genetically modified plant (and/or parts of plants and/or seed oil fraction) as a result of the enzyme system for producing PUFAS i.e., that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic modification
- any intermediate or side product fatty acids that are produced in the genetically modified plant (and/or parts of plants and/or seed oil fraction) as a result of the enzyme system for producing PUFAS i.e., that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic modification
- PUFAS i.e., that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic modification
- intermediate products and side products that are not present in substantial amounts in the total lipids of plants genetically modified with such PUFA PKS can include, but are not limited to: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9) and various other intermediate or side products, such as 20:0; 20:1 ( ⁇ 5); 20:1 ( ⁇ 11); 20:2 ( ⁇ 8,11); 20:2 ( ⁇ 11,14); 20:3 ( ⁇ 5,11,14); 20:
- the target product is a particular PUFA, such as DHA
- the intermediate products and side products that are not present in substantial amounts in the total lipids of the genetically modified plants also include other PUFAs, including other PUFAs that are a natural product of a different PUFA PKS system, such as EPA in this example.
- the PUFA PKS system of the present invention can also be used, if desired, to produce as a target PUFA a PUFA that can include GLA, SDA or DGLA (referring to embodiments where oils are produced using components of a PUFA PKS system described herein).
- the present inventors Using the knowledge of the genetic basis and domain structure of the PUFA PKS system described herein, the present inventors have designed and produced constructs encoding such a PUFA PKS system and have successfully produced transgenic plants expressing the PUFA PKS system.
- the transgenic plants produce oils containing PUFAs, and the oils are substantially free of intermediate products that accumulate in a standard PUFA pathway (see Example 3).
- the present inventors have also demonstrated the use of the constructs to produce PUFAs in another eukaryote, yeast, as a proof-of-concept experiment prior to the production of the transgenic plants (see Example 2).
- one embodiment of the present invention is a method to produce desired bioactive molecules (also referred to as products or compounds) by growing or culturing a genetically modified microorganism or a genetically modified plant of the present invention (described in detail above).
- a method includes the step of culturing in a fermentation medium or growing in a suitable environment, such as soil, a microorganism or plant, respectively, that has a genetic modification as described previously herein and in accordance with the present invention.
- method to produce bioactive molecules of the present invention includes the step of culturing under conditions effective to produce the bioactive molecule a genetically modified organism that expresses a PKS system comprising at least one biologically active domain of a polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) system as described herein.
- PKS polyunsaturated fatty acid
- a genetically modified microorganism is cultured or grown in a suitable medium, under conditions effective to produce the bioactive compound.
- An appropriate, or effective, medium refers to any medium in which a genetically modified microorganism of the present invention, when cultured, is capable of producing the desired product.
- a medium is typically an aqueous medium comprising assimilable carbon, nitrogen and phosphate sources.
- Such a medium can also include appropriate salts, minerals, metals and other nutrients.
- Microorganisms of the present invention can be cultured in conventional fermentation bioreactors. The microorganisms can be cultured by any fermentation process which includes, but is not limited to, batch, fed-batch, cell recycle, and continuous fermentation.
- the desired bioactive molecules produced by the genetically modified microorganism can be recovered from the fermentation medium using conventional separation and purification techniques.
- the fermentation medium can be filtered or centrifuged to remove microorganisms, cell debris and other particulate matter, and the product can be recovered from the cell-free supernatant by conventional methods, such as, for example, ion exchange, chromatography, extraction, solvent extraction, membrane separation, electrodialysis, reverse osmosis, distillation, chemical derivatization and crystallization.
- microorganisms producing the desired compound, or extracts and various fractions thereof can be used without removal of the microorganism components from the product.
- a genetically modified plant is cultured in a fermentation medium or grown in a suitable medium such as soil.
- a suitable growth medium for higher plants includes any growth medium for plants, including, but not limited to, soil, sand, any other particulate media that support root growth (e.g. vermiculite, perlite, etc.) or Hydroponic culture, as well as suitable light, water and nutritional supplements which optimize the growth of the higher plant.
- the genetically modified plants of the present invention are engineered to produce significant quantities of the desired product through the activity of the PKS system that is genetically modified according to the present invention.
- the compounds can be recovered through purification processes which extract the compounds from the plant.
- the compound is recovered by harvesting the plant.
- the plant can be consumed in its natural state or further processed into consumable products.
- Bioactive molecules include any molecules (compounds, products, etc.) that have a biological activity, and that can be produced by a PKS system that comprises at least one amino acid sequence having a biological activity of at least one functional domain of a non-bacterial PUFA PKS system as described herein.
- Such bioactive molecules can include, but are not limited to: a polyunsaturated fatty acid (PUFA), an anti-inflammatory formulation, a chemotherapeutic agent, an active excipient, an osteoporosis drug, an anti-depressant, an anti-convulsant, an anti- Heliobactor pylori drug, a drug for treatment of neurodegenerative disease, a drug for treatment of degenerative liver disease, an antibiotic, and a cholesterol lowering formulation.
- PUFA polyunsaturated fatty acid
- bioactive compounds of interest are produced by the genetically modified microorganism in an amount that is greater than about 0.05%, and preferably greater than about 0.1%, and more preferably greater than about 0.25%, and more preferably greater than about 0.5%, and more preferably greater than about 0.75%, and more preferably greater than about 1%, and more preferably greater than about 2.5%, and more preferably greater than about 5%, and more preferably greater than about 10%, and more preferably greater than about 15%, and even more preferably greater than about 20% of the dry weight of the microorganism.
- lipid compounds preferably, such compounds are produced in an amount that is greater than about 5% of the dry weight of the microorganism.
- bioactive compounds such as antibiotics or compounds that are synthesized in smaller amounts
- those strains possessing such compounds at of the dry weight of the microorganism are identified as predictably containing a novel.
- PKS system of the type described above.
- particular bioactive molecules compounds
- bioactive molecules are secreted by the microorganism, rather than accumulating. Therefore, such bioactive molecules are generally recovered from the culture medium and the concentration of molecule produced will vary depending on the microorganism and the size of the culture.
- a genetically modified organism e.g., microorganism or plant
- produces one or more polyunsaturated fatty acids including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), DPA (C22:5, n-6 or n-3), ARA (C20:4, n-6), GLA (C18:3, n-6), ALA (C18:3, n-3), and/or SDA (C18:4, n-3)), and more preferably, one or more long chain fatty acids, including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), DPA (C22:5, n-6 or n-3), or DTA (C22:4, n-6).
- EPA C20:5, n-3
- DHA C22:6, n-3
- DPA C22:5, n-6 or n-3
- DTA C22:4, n-6
- a genetically modified organism of the invention produces one or more polyunsaturated fatty acids including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), and/or DPA (C22:5, n-6 or n-3).
- a genetically modified organism of the invention produces at least one PUFA (the target PUPA), wherein the total fatty acid profile in the organism (or a part of the organism that accumulates PUFAs, such as mature seeds or oil from such seeds, if the organism is an oil seed plant), comprises a detectable amount of this PUFA or PUFAs.
- the PUFA is at least a 20 carbon PUFA and comprises at least 3 double bonds, and more preferably at least 4 double bonds, and even more preferably, at least 5 double bonds.
- the PUFA is a PUFA that is not naturally produced by the organism (i.e., the wild-type organism in the absence of genetic modification or the parent organism used as a recipient for the indicated genetic modification).
- the total fatty acid profile in the organism comprises at least 0.1% of the target PUFA(s) by weight of the total fatty acids, and more preferably at least about 0.2%, and more preferably at least about 0.3%, and more preferably at least about 0.4%, and more preferably at least about 0.5%, and more preferably at least about 1%, and more preferably at least about 2%, and more preferably at least about 3%, and more preferably at least about 4%, and more preferably at least about 5%, and more preferably at least about 10%, and more preferably at least about 15%, and more preferably at least about 20%, and more preferably at least about 25%, and more preferably at least about 30%, and more preferably at least about 35%, and more preferably at least about 40%, and more preferably at least about 45%, and more preferably at least about 50%, and more preferably at least about 55%, and more preferably at least about 60%, and more preferably at least about 65%, and more more preferably at least about 45%, and more preferably at least about 50%,
- total fatty acids produced by a plant are presented as a weight percent as determined by gas chromatography (GC) analysis of a fatty acid methyl ester (FAME) preparation.
- GC gas chromatography
- PUFA PKS total fatty acids produced by a plant (and/or parts of plants or seed oil fraction) that has been genetically modified to express a PUFA PKS of the present invention that these total fatty acids produced by the plant comprise less than about 10% by weight of any fatty acids other than the target PUFA(s) that are produced by the enzyme complex that produces the target PUFA(s) (e.g., DHA and DPAn-6 are the target PUFAs if the entire PUFA PKS system of the invention is used).
- DHA and DPAn-6 are the target PUFAs if the entire PUFA PKS system of the invention is used.
- any fatty acids that are produced by the enzyme complex that produces the target PUFA(s) other than the target PUFA(s) are present at less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% by weight of the total fatty acids produced by the plant.
- any fatty acids that are produced by the enzyme complex that produces the target PUFA(s) other than the target PUFA(s) are present at less than about 10% by weight of the total fatty acids that are produced by the enzyme complex that produces the target PUFA(s) in the plant (i.e., this measurement is limited to those total fatty acids that are produced by the enzyme complex that produces the target PUFAs), and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% by weight of the total fatty acids that are produced by the enzyme complex that produces the target PUFA(s) in the plant.
- the total fatty acids produced by the plant contain less than (or do not contain any more than) 10% PUFAs having 18 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification.
- the total fatty acids produced by the plant (and/or parts of plants or seed oil fraction) contain less than 9% PUFAs having 18 or more carbons, or less than 8% PUFAs having 18 or more carbons, or less than 7% PUFAs having 18 or more carbons, or less than 6% PUFAs having 18 or more carbons, or less than 5% PUFAs having 18 or more carbons, or less than 4% PUFAs having 18 or more carbons, or less than 3% PUFAs having 18 or more carbons, or less than 2% PUFAs having 18 or more carbons, or less than 1% PUFAs having 18 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification.
- the total fatty acids produced by the plant contain less than (or do not contain any more than) 10% PUFAs having 20 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification.
- the total fatty acids produced by the plant (and/or parts of plants or seed oil fraction) contain less than 9% PUFAs having 20 or more carbons, or less than 8% PUFAs having 20 or more carbons, or less than 7% PUFAs having 20 or more carbons, or less than 6% PUFAs having 20 or more carbons, or less than 5% PUFAs having 20 or more carbons, or less than 4% PUFAs having 20 or more carbons, or less than 3% PUFAs having 20 or more carbons, or less than 2% PUFAs having 20 or more carbons, or less than 1% PUFAs having 20 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification.
- the total fatty acids in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of a fatty acid selected from any one or more of: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9) and various other gamma-linolenic acid (GLA; 18
- the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of a fatty acid selected from: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid
- GLA gamma-linolenic acid
- ETA 20:3, n-9 and various other fatty acids, such as 20:0; 20:1 ( ⁇ 5); 20:1 ( ⁇ 11); 20:2 ( ⁇ 8,11); 20:2 ( ⁇ 11,14); 20:3 ( ⁇ 5,11,14); 20:3 ( ⁇ 11,14,17); mead acid (20:3; ⁇ 5,8,11); or 20:4 ( ⁇ 5,1,14,17).
- the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- GLA gamma-linolenic acid
- PUFAs having 18 carbons and four carbon-carbon double bonds PUFAs having 20 carbons and
- the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of each of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- GLA gamma-linolenic acid
- PUFAs having 18 carbons and four carbon-carbon double bonds PUFAs having 20 carbons and
- the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of any one or more of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- GLA gamma-linolenic acid
- PUFAs having 18 carbons and four carbon-carbon double bonds PUFAs having 20
- a genetically modified plant produces at least two target PUFAs (e.g., DHA and DPAn-6), and the total fatty acid profile in the plant, or the part of the plant that accumulates PUFAs (including oils from the oil seeds), comprises a detectable amount of these PUFAs.
- the PUFAs are preferably each at least a 20 carbon PUFA and comprise at least 3 double bonds, and more preferably at least 4 double bonds, and even more preferably, at least 5 double bonds.
- Such PUFAs are most preferably chosen from DHA, DPAn-6 and EPA.
- the plant produces DHA and DPAn-6 (the products of a PUFA PKS system described herein), and the ratio of DHA to DPAn-6 is from about 1:10 to about 10:1, including any ratio in between. In a one embodiment, the ratio of DHA to DPA is from about 1:1 to about 3:1, and in another embodiment, about 2.5:1.
- the plant produces the total fatty acid profile represented by FIG. 5 .
- the invention further includes any seeds produced by the plants described above, as well as any oils produced by the plants or seeds described above.
- the invention also includes any products produced using the plants, seed or oils described herein.
- One embodiment of the present invention relates to a method to modify an endproduct containing at least one fatty acid, comprising adding to said endproduct an oil produced by a recombinant host cell that expresses at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding at least one biologically active domain of a PUFA PKS system as described herein.
- the endproduct is selected from the group consisting of a food, a dietary supplement, a pharmaceutical formulation, a humanized animal milk, and an infant formula.
- suitable pharmaceutical formulations include, but are not limited to, an anti-inflammatory formulation, a chemotherapeutic agent, an active excipient, an osteoporosis drug, an anti-depressant, an anti-convulsant, an anti- Heliobactor pylori drug, a drug for treatment of neurodegenerative disease, a drug for treatment of degenerative liver disease, an antibiotic, and a cholesterol lowering formulation.
- the endproduct is used to treat a condition selected from the group consisting of: chronic inflammation, acute inflammation, gastrointestinal disorder, cancer, cachexia, cardiac restenosis, neurodegenerative disorder, degenerative disorder of the liver, blood lipid disorder, osteoporosis, osteoarthritis, autoimmune disease, preeclampsia, preterm birth, age related maculopathy, pulmonary disorder, and peroxisomal disorder.
- a condition selected from the group consisting of: chronic inflammation, acute inflammation, gastrointestinal disorder, cancer, cachexia, cardiac restenosis, neurodegenerative disorder, degenerative disorder of the liver, blood lipid disorder, osteoporosis, osteoarthritis, autoimmune disease, preeclampsia, preterm birth, age related maculopathy, pulmonary disorder, and peroxisomal disorder.
- Suitable food products include, but are not limited to, fine bakery wares, bread and rolls, breakfast cereals, processed and unprocessed cheese, condiments (ketchup, mayonnaise, etc.), dairy products (milk, yogurt), puddings and gelatine desserts, carbonated drinks, teas, powdered beverage mixes, processed fish products, fruit-based drinks, chewing gum, hard confectionery, frozen dairy products, processed meat products, nut and nut-based spreads, pasta, processed poultry products, gravies and sauces, potato chips and other chips or crisps, chocolate and other confectionery, soups and soup mixes, soya based products (milks, drinks, creams, whiteners), vegetable oil-based spreads, and vegetable-based drinks.
- Yet another embodiment of the present invention relates to a method to produce a humanized animal milk.
- This method includes the steps of genetically modifying milk-producing cells of a milk-producing animal with at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding at least one biologically active domain of a PUFA PKS system as described herein.
- Methods to genetically modify a host cell and to produce a genetically modified non-human, milk-producing animal are known in the art.
- host animals to modify include cattle, sheep, pigs, goats, yaks, etc., which are amenable to genetic manipulation and cloning for rapid expansion of a transgene expressing population.
- PKS-like transgenes can be adapted for expression in target organelles, tissues and body fluids through modification of the gene regulatory regions. Of particular interest is the production of PUFAs in the breast milk of the host animal.
- the three genes encoding the Schizochytrium PUFA PKS system that produce DHA and DPA were cloned into a single E. coli expression vector (derived from pET21c (Novagen)).
- the genes are transcribed as a single message (by the T7 RNA-polymerase), and a ribosome-binding site cloned in front of each of the genes initiates translation. Modification of the Orf B coding sequence was needed to obtain production of a full-length Orf B protein in E. coli (see below).
- An accessory gene, encoding a PPTase was cloned into a second plasmid (derived from pACYC184, New England Biolabs).
- the Orf B gene is predicted to encode a protein with a mass of ⁇ 224 kDa.
- Initial attempts at expression of the gene in E. coli resulted in accumulation of a protein with an apparent molecular mass of ⁇ 165 kDa (as judged by comparison to proteins of known mass during SDS-PAGE).
- Examination of the Orf B nucleotide sequence revealed a region containing 15 sequential serine codons—all of them being the TCT codon.
- the genetic code contains 6 different serine codons, and three of these are used frequently in E. coli .
- the inventors used four overlapping oligonucleotides in combination with a polymerase chain reaction protocol to resynthesize a small portion of the Orf B gene (a ⁇ 195 base pair, BspHI to SacII restriction enzyme fragment) that contained the serine codon repeat region.
- a synthetic Orf B fragment a random mixture of the 3 serine codons commonly used by E. coli was used, and some other potentially problematic codons were changed as well (i.e., other codons rarely used by E. coli ).
- the BspHI to SacII fragment present in the original Orf B was replaced by the resynthesized fragment (to yield Orf B*) and the modified gene was cloned into the relevant expression vectors.
- the modified OrfB* still encodes the amino acid sequence of SEQ ID NO:4.
- Expression of the modified Orf B* clone in E. coli resulted in the appearance of a ⁇ 224 kDa protein, indicating that the full-length product of OrfB was produced.
- the sequence of the resynthesized Orf B* BspHI to SacII fragment is represented herein as SEQ ID NO:38. Referring to SEQ ID NO:38, the nucleotide sequence of the resynthesized BspHI to SacII region of Orf B is shown. The BspHI restriction site and the SacII restriction site are identified. The BspHI site starts at nucleotide 4415 of the Orf B CDS (SEQ ID NO:3) (note: there are a total of three BspHI sites in the Orf B CDS, while the SacII site is unique).
- the ACP domains of the Orf A protein must be activated by addition of phosphopantetheine group in order to function.
- the enzymes that catalyze this general type of reaction are called phosphopantetheine transferases (PPTases).
- E. coli contains two endogenous PPTases, but it was anticipated that they would not recognize the Orf A ACP domains from Schizochytrium . This was confirmed by expressing Orfs A, B* (see above) and C in E. coli without an additional PPTase. In this transformant, no DHA production was detected.
- sfp PPTase has been well characterized and is widely used due to its ability to recognize a broad range of substrates. Based on published sequence information (Nakana, et al., 1992 , Molecular and General Genetics 232: 313-321), an expression vector for sfp was built by cloning the coding region, along with defined up- and downstream flanking DNA sequences, into a pACYC-184 cloning vector. Oligonucleotides were used to amplify the region of interest from genomic B. subtilus DNA. The oligonucleotides:
- the additional restriction sites (BglLL, Nod, SmaI, PmelI, HindIII, SpeI and XhoI) were included to facilitate the cloning process.
- the functionality of this expression vector cassette was tested by using PCR to generate a version of sfp with a NdeI site at the 5′ end and an XhoI site ate the 3′ end. This fragment was cloned into the expression cassette and transferred into E. coli along with the Orf A, B* and C expression vector. Under appropriate conditions, these cells accumulated DHA, demonstrating that a functional sfp had been produced.
- Het I is present in a cluster of genes in Nostoc known to be responsible for the synthesis of long chain hydroxy-fatty acids that are a component of a glyco-lipid layer present in heterocysts of that organism (Black and Wolk, 1994 , J. Bacteriol. 176, 2282-2292; Campbell et al., 1997 , Arch. Microbial. 167, 251-258). Het I activates the ACP domains of a protein, Hgl E, present in that cluster. The two ACP domains of Hgl E have a high degree of sequence homology to the ACP domains found in Schizochytrium Orf A. A Het I expression construct was made using PCR.
- SEQ ID NO:41 represents the amino acid sequence of the Nostoc Het I protein.
- the endogenous start codon of Het I has not been identified (there is no methionine present in the putative protein).
- There are several potential alternative start codons e.g., TTG and ATT) near the 5′ end of the open reading frame.
- No methionine codons are present in the sequence.
- a Het I expression construct was made by using PCR to replace the furthest 5′ potential alternative start codon (TTG) with a methionine codon (ATG, as part of the above described NdeI restriction enzyme recognition site), and introducing an XhoI site at the 3′ end of the coding sequence.
- the modified HetI coding sequence was then inserted into the NdeI and XhoI sites of the pACYC184 vector construct containing the sfp regulatory elements. Expression of this Het I construct in E. coli resulted in the appearance of a new protein of the size expected from the sequence data. Co-expression of Het I with Schizochytrium Orfs A, B*, C in E. coli under several conditions resulted in the accumulation of DHA and DPA in those cells. In all of the experiments in which sfp and Het I were compared, more DHA and DPA accumulated in the cells containing the Het I construct than in cells containing the sfp construct.
- FAME methyl-esters
- DHA and DPA were detected in E. coli cells expressing the Schizochytrium PUFA PKS genes, plus either of the two heterologous PPTases (data not shown).
- No DHA or DPA was detected in FAMEs prepared from control cells (i.e., cells transformed with a plasmid lacking one of the Oils).
- the ratio of DHA to DPA observed in E. coli approximates that of the endogenous DHA and DPA production observed in Schizochytrium .
- the highest level of PUFA (DHA plus DPA) representing ⁇ 17% of the total FAME, was found in cells grown at 32° C. in 765 medium (recipe available from the American Type Culture Collection) supplemented with 10% (by weight) glycerol.
- PUFA accumulation was also observed when cells were grown in Luria Broth supplemented with 5 or 10% glycerol, and when grown at 20° C. Selection for the presence of the respective plasmids was maintained by inclusion of the appropriate antibiotics during the growth, and IPTG (to a final concentration of 0.5 mM) was used to induce expression of Orfs A, B* and C. Co-expression of Het I or sfp with Schizochytrium Orfs A, B*, C in E. coli under several conditions resulted in the accumulation of DHA and DPA in those cells. In all of the experiments in which sfp and Het I were compared, more DHA and DPA accumulated in the cells containing the Het I construct than in cells containing the sfp construct.
- the following example shows the expression of genes encoding the Schizochytrium PUFA synthase (sOrfA, sOrfB and native Orf C) along with Het I in baker's yeast ( Saccharomyces cerevisiae ).
- the Schizochytrium PUFA synthase genes and Het I were expressed in yeast using materials obtained from Invitrogen (Invitrogen Corporation, Carlsbad, Calif.).
- the INVsc1 strain of Saccharomyces cerevisiae was used along with the following transformation vectors: pYESLeu (sOrfA), pYES3/CT (sOrfB), pYES2/CT (OrfC) and pYESHis (HetI).
- pYESLeu sOrfA
- sOrfB pYES3/CT
- OrfC pYES2/CT
- HetI pYESHis
- the nucleotide sequence for the resynthesized OrfA (contained in pYESLeu), designated sOrfA, is represented herein by SEQ ID NO:43.
- SEQ ID NO:43 still encodes the OrfA amino acid sequence of SEQ ID NO:2.
- the nucleotide sequence for the resynthesized OrfB (contained in pYES3/CT), designated sOrfB, is represented herein by SEQ ID NO:44.
- SEQ ID NO:44 still encodes the OrfB amino acid sequence of SEQ ID NO:4.
- the OrfC nucleotide sequence used in these experiments (contained in pYES2/CT) is the wild-type OrfC, represented by SEQ ID NO:5, and encoding SEQ ID NO:6.
- Some of the vectors were modified to accommodate specific cloning requirements (e.g., restriction sites for cloning).
- Appropriate selection media were used (as specified by Invitrogen), depending on the particular experiment.
- the genes were cloned, in each case, behind a GAL1 promoter and expression was induced by re-suspension of washed cells in media containing galactose according to guidelines provide by Invitrogen. Cells were grown at 30° C. and harvested (by centrifugation) after being transferred to the induction medium. The cell pellets were freeze dried and FAMEs were prepared using acidic methanol, extracted into hexane and analyzed by GC.
- FIG. 4 shows the region of the GC chromatogram of FIG. 3 which contains the PUFA FAMEs.
- Both the control cells and the cell expressing the PUFA synthase contain a peak that elutes near the DHA FAME. This has been identified as C26:0 FAME and (based on literature references) is derived from sphingolipids. Although it elutes close to the DHA peak, the resolution is sufficient so that it does not interfere with the quantitation of DHA.
- the DPA n-6 peak is well separated from other endogenous yeast lipids in the FAME profile.
- the Schizochytrium Orfs A, B* (see Example 1) and C along with Het I were cloned (separately or in various combinations including all 4 genes on one Super-construct) into the appropriate binary vectors for introduction of the genes into plants.
- Each gene was cloned behind a linin promoter and was followed by a linin terminator sequence (Chaudhary et al., 2001; PCT Publication Number No. WO 01/16340 A1).
- a linin terminator sequence Chode
- a plastid targeting sequence derived from a Brassica napus acyl-ACP thioesterase were added to the Orfs.
- the amino acid sequence of the encoded targeting peptide is: MLKLSCNVTNHLHTFSFFSDSSLFIPVNRRTLAVS (SEQ ID NO:42).
- the nucleotide sequences encoding this peptide were placed in frame with the start methionine codons of each PUFA synthase Orf as well as the start codon of Het I.
- constructs and plants were prepared as follows:
- This plant binary vector contained a double expression cassette which targeted the co-expression of HetI (SEQ ID NO:41) and ORFC (SEQ ID NO:6) to the plastid.
- the first expression cassette began with a signal peptide (SEQ ID NO:42) derived from an acyl-ACP thioesterase gene from Brassica juncea (GenBank Accession No. AJ294419) to target expression of the polypeptides to the plastid.
- the signal peptide was synthesized from two overlapping oligos with an engineered AfiIII site at the 5′ end and an NcoI/Swal/XmaI multiple cloning site at the 3′ end.
- the second expression cassette also began with the acyl-ACP signal peptide followed immediately in-frame with a cDNA encoding ORFC (SEQ ID NO:5).
- the backbone of this plasmid, pSBS4055 was based on the plant binary vector, pPZP200, described by Hajdukiewicz et al. ( Plant Molecular Biology, 1994, 25:989-994).
- pPZP200 plant binary vector
- a pat gene conferring host plant phosphinothricine resistance (Wohlleben et al., 1988, Gene 70:25-37) driven by the ubiquitin promoter/terminator from Petroselinum crispum (Kawalleck et al., 1993 , Plant. Mol. Bio., 21:673-684) was inserted between the left and right border sequences.
- Linin promoter/terminators in tandem from Linum usitatissumum (Flax or Linseed) (Chaudhary et al., 2001; PCT Publication Number No. WO 01/16340 A1) were used to drive expression of ACP-HetI and ACP-ORFC. Standard restriction cloning was used to fuse the synthetic Acyl-ACP signal peptide in-frame with cDNAs encoding for either HetI or ORFC using NcoI/XmaI and NcoI/SwaI restriction endonuclease sites, respectively, to the 3′ end of the Linin promoter.
- plasmid pSBS4107 a DNA sequence encoding the Acyl-ACP signal peptide-HetI and Acyl-ACP signal peptide-ORFC polypeptides being placed in a binary vector under expression control of the linin promoter/terminator.
- the linin promoter controls the specific-temporal and tissue-specific expression of the transgene during seed development.
- the Acyl-ACP signal peptide targets the expression of the protein to the plastid (Loader et al., 1993 , Plant Mol Biol 23:769-778).
- the complete plasmid map with annotated elements is shown in FIG. 6 .
- This plant binary vector contained an expression cassette which targeted the expression of ORFB (SEQ ID NO:4) to the plastid.
- the expression cassette began with a signal peptide derived from an acyl-ACP thioesterase gene from Brassica juncea (SEQ ID NO:42) to target expression of the polypeptide to the plastid.
- the signal peptide was synthesized as above.
- a cDNA sequence encoding for ORFB SEQ ID NO:3, except for the resynthesized BspHI to SacII region of Orf B, represented by SEQ ID NO:38; see Example 1 above).
- the backbone of this plasmid, pSBS4055 was based on the plant binary vector, pPZP200, described by Hajdukiewicz et al. ( Plant Molecular Biology, 1994, 25:989-994).
- a phosphomannose isomerase (PMI) gene conferring host plant positive selection for mannose-6-phosphate driven by the ubiquitin promoter/terminator from Petroselinum crispum (Kawalleck et al., 1993 , Plant. Mol. Bio., 21:673-684), was inserted between the left and right border sequences.
- a Linin promoter/terminator from Linum usitatissumum (Flax or Linseed) (Chaudhary et al., 2001; PCT Publication Number WO 01/16340 A1) was used to drive expression of ACP-ORFB.
- Standard restriction cloning was used to fuse the synthetic Acyl-ACP signal peptide in-frame with cDNAs encoding for ORFB, to the 3′ end of the Linin promoter.
- the result was plasmid pSBS5720: a DNA sequence encoding the Acyl-ACP signal peptide-ORFB polypeptide being placed in a binary vector under expression control of the linin promoter/terminator.
- the linin promoter controls the specific-temporal and tissue-specific expression of the transgene during seed development.
- the Acyl-ACP signal peptide targets the expression of the protein to the plastid (Loader et al., 1993 , Plant Mol Biol 23:769-778).
- the complete plasmid map with annotated elements is shown in FIG. 7 .
- This plant binary vector contained an expression cassette which targeted the expression of ORFA (SEQ ID NO:2) to the plastid.
- the expression cassette began with a signal peptide derived from an acyl-ACP thioesterase gene from Brassica juncea (SEQ ID NO:42) to target expression of the polypeptide to the plastid.
- the signal peptide was synthesized as above.
- a cDNA sequence encoding for ORFA SEQ ID NO:1.
- neomycin phosphotransferase (nptII) gene conferring host plant Kanamycin resistance driven by the mannopine synthase promoter/terminator was inserted between the left and right border sequences.
- Linin promoter/terminator from Linum usitatissumum (Flax or Linseed) (Chaudhary et al., 2001; PCT Publication Number WO 01/16340 A1) were used to drive expression of ACP-ORFA. Standard restriction cloning was used to fuse the synthetic Acyl-ACP signal peptide in-frame with a cDNA encoding for ORFA to the 3′ end of the Linin promoter. The result was plasmid pSBS4757: a DNA sequence encoding the Acyl-ACP signal peptide-ORFA polypeptide being placed in a binary vector under expression control of the linin promoter/terminator.
- the linin promoter controls the specific-temporal and tissue-specific expression of the transgene during seed development.
- the Acyl-ACP signal peptide targets the expression of the protein to the plastid (Loader et al., 1993 , Plant Mol Biol 23:769-778).
- the complete plasmid map with annotated elements is shown in FIG. 8 .
- the top panel of FIG. 5 shows the typical fatty acid profile of wild type Arabidopsis seeds as represented by GC separation and FID detection of FAMEs prepared from a pooled seed sample.
- the predominant fatty acids of wild type Arabidopsis seeds as represented by GC separation and FID detection of FAMEs prepared from a pooled seed sample are: 16:0, 18:0, 16:1, 18:1, 20:1, 20:2 and 22:1. No DHA or DPA n-6 are present in the samples from wild type seed.
- the lower panel of FIG. 5 shows the fatty acid profile of a pooled seed sample from one of the transgenic Arabidopsis lines (line 269) expressing the Schizochytrium PUFA synthase genes and Het I gene.
- the proteins expressed from these transgenes contain plastid targeting sequences.
- Two FAME peaks are present in the profile from the transgenic plant seeds that are not present in the profile from wild type seeds.
- the elution pattern of these two peaks exactly corresponds to the elution of authentic DHA and DPA n-6 (using FAMEs prepared from Schizochytrium oil as standards, as well as a commercially purchased DHA standard from NuCheck Prep).
- the DHA peak represents 0.8% of total calculated FAMEs while the DPA n-6 peak represents 1.7%.
- the sum of novel PUFAs is 2.5% of total FAMEs.
- the appearance of DHA and DPA n-6 in the seed fatty acid profile demonstrates that introduced Schizochytrium PUFA synthase system functions when expressed in the plant cell and the proteins are targeted to the plastid.
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biotechnology (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Biochemistry (AREA)
- General Engineering & Computer Science (AREA)
- Molecular Biology (AREA)
- Biomedical Technology (AREA)
- Microbiology (AREA)
- Biophysics (AREA)
- Oil, Petroleum & Natural Gas (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Gastroenterology & Hepatology (AREA)
- Plant Pathology (AREA)
- Physics & Mathematics (AREA)
- Diabetes (AREA)
- Nutrition Science (AREA)
- Cell Biology (AREA)
- Obesity (AREA)
- Hematology (AREA)
- Animal Behavior & Ethology (AREA)
- Pharmacology & Pharmacy (AREA)
- Public Health (AREA)
- Veterinary Medicine (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- Emergency Medicine (AREA)
- Endocrinology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
Disclosed are the complete polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) systems from Schizochytrium, and biologically active fragments and homologues thereof. More particularly, this invention relates to nucleic acids encoding such PUFA PKS systems, to proteins and domains thereof that comprise such PUFA PKS systems, to genetically modified organisms (plants and microorganisms) comprising such PUFA PKS systems, and to methods of making and using the PUFA PKS systems disclosed herein. This invention also relates to genetically modified plants and microorganisms and methods to efficiently produce lipids enriched in various polyunsaturated fatty acids (PUFAs) as well as other bioactive molecules by manipulation of a PUFA polyketide synthase (PKS) system.
Description
- This application is a Continuation of U.S. application Ser. No. 11/452,138, filed Jun. 12, 2006, which claims the benefit of priority under 35 U.S.C. §119(e) from U.S. Provisional Application No. 60/784,616, filed Mar. 21, 2006, and from U.S. Provisional Application No. 60/689,167, filed Jun. 10, 2005. U.S. application Ser. No. 11/452,138 is also a continuation-in-part of U.S. patent application Ser. No. 10/124,800, filed Apr. 16, 2002, which claims priority under 35 U.S.C. §119(e) from: U.S. Provisional Application Ser. No. 60/284,066, filed Apr. 16, 2001; U.S. Provisional Application Ser. No. 60/298,796, filed Jun. 15, 2001; and U.S. Provisional Application Ser. No. 60/323,269, filed Sep. 18, 2001. U.S. patent application Ser. No. 10/124,800 is also a continuation-in-part of U.S. application Ser. No. 09/231,899, filed Jan. 14, 1999, now U.S. Pat. No. 6,566,583. Each of the above-identified patent applications is incorporated herein by reference in its entirety for all purposes.
- U.S. application Ser. No. 11/452,138 does not claim the benefit of priority from U.S. application Ser. No. 09/090,793, filed Jun. 4, 1998, now U.S. Pat. No. 6,140,486, although U.S. application Ser. No. 09/090,793 is incorporated herein by reference in its entirety.
- This application contains a Sequence Listing submitted on a compact disc, in duplicate. Each of the two compact discs, which are identical to each other pursuant to 37 CFR §1.52(e)(4), contains the following file: “Sequence Listing”, having a size in bytes of 301 KB, recorded on 12 Jun. 2006. The information contained on the compact disc is hereby incorporated by reference in its entirety pursuant to 37 CFR §1.77(b)(4).
- This invention relates to polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) systems from Schizochytrium. More particularly, this invention relates to nucleic acids encoding such PUFA PKS systems, to such PUFA PKS systems, to genetically modified organisms comprising such PUFA PKS systems, and to methods of making and using such PUFA PKS systems disclosed herein. This invention also relates to PUFA PKS systems from non-bacterial and bacterial organisms identified using the Schizochytrium PUFA PKS systems described herein.
- Polyketide synthase (PKS) systems are generally known in the art as enzyme complexes related to fatty acid synthase (FAS) systems, but which are often highly modified to produce specialized products that typically show little resemblance to fatty acids. It has now been shown, however, that polyketide synthase systems exist in marine bacteria and certain eukaryotic organisms that are capable of synthesizing polyunsaturated fatty acids (PUFAs) from acetyl-CoA and malonyl-CoA. The PKS pathways for PUFA synthesis in Shewanella and another marine bacteria, Vibrio marinus, are described in detail in U.S. Pat. No. 6,140,486. The PKS pathways for PUFA synthesis in the eukaryotic Thraustochytrid, Schizochytrium is described in detail in U.S. Pat. No. 6,566,583. The PKS pathways for PUFA synthesis in eukaryotes such as members of Thraustochytriales, including the structural description of a PUFA PKS system in Schizochytrium and the identification of a PUFA PKS system in Thraustochytrium, including details regarding uses of these systems, are described in detail in U.S. Patent Application Publication No. 20020194641, published Dec. 19, 2002 (corresponding to U.S. patent application Ser. No. 10/124,800, filed Apr. 16, 2002). U.S. Patent Application Publication No. 20040235127, published Nov. 25, 2004 (corresponding to U.S. patent application Ser. No. 10/810,352, filed Mar. 24, 2004), discloses the structural description of a PUFA PKS system in Thraustochytrium, and further detail regarding the production of eicosapentaenoic acid (C20:5, ω-3) (EPA) and other PUFAs using such systems. U.S. Patent Application Publication No. 20050100995, published May 12, 2005 (corresponding to U.S. patent application Ser. No. 10/965,017, filed Oct. 13, 2004), discloses the structural and functional description of PUFA PKS systems in Shewanella olleyana and Shewanella japonica, and uses of such systems. These applications also disclose the genetic modification of organisms, including microorganisms and plants, with the genes comprising the PUFA PKS pathway and the production of PUFAs by such organisms. Furthermore, PCT Patent Publication No. WO 05/097982 describes a PUFA PKS system in Ulkenia, U.S. Patent Application Publication No. 20050014231 describes PUFA PKS genes and proteins from Thraustochytrium aureum.
- Researchers have attempted to exploit polyketide synthase (PKS) systems that have been traditionally described in the literature as falling into one of three basic types, typically referred to as: Type I (modular or iterative), Type II, and Type III. For purposes of clarity, it is noted that the Type I modular PKS system has previously also been referred to as simply a “modular” PKS system, and the Type I iterative PKS system has previously also been referred to simply as a “Type I” PKS system. The Type II system is characterized by separable proteins, each of which carries out a distinct enzymatic reaction. The enzymes work in concert to produce the end product and each individual enzyme of the system typically participates several times in the production of the end product. This type of system operates in a manner analogous to the fatty acid synthase (FAS) systems found in plants and bacteria. Type I iterative PKS systems are similar to the Type II system in that the enzymes are used in an iterative fashion to produce the end product. The Type I iterative differs from Type II in that enzymatic activities, instead of being associated with separable proteins, occur as domains of larger proteins. This system is analogous to the Type I FAS systems found in animals and fungi.
- In contrast to the Type II systems, in Type I modular PKS systems, each enzyme domain is used only once in the production of the end product. The domains are found in very large proteins and the product of each reaction is passed on to another domain in the PKS protein. Additionally, in the PKS systems described above, if a carbon-carbon double bond is incorporated into the end product, it is usually in the trans configuration.
- Type III systems have been more recently discovered and belong to the plant chalcone synthase family of condensing enzymes. Type III PKSs are distinct from type I and type II PKS systems and utilize free CoA substrates in iterative condensation reactions to usually produce a heterocyclic end product.
- Polyunsaturated fatty acids (PUFAs) are considered to be useful for nutritional, pharmaceutical, industrial, and other purposes. The current supply of PUFAs from natural sources and from chemical synthesis is not sufficient for commercial needs. A major current source for PUFAs is from marine fish; however, fish stocks are declining, and this may not be a sustainable resource. Additionally, contamination, from both heavy metals and toxic organic molecules, is a serious issue with oil derived from marine fish. Vegetable oils derived from oil seed crops are relatively inexpensive and do not have the contamination issues associated with fish oils. However, the PUFAs found in commercially developed plant oils are typically limited to linoleic acid (eighteen carbons with 2 double bonds, in the delta 9 and 12 positions—18:2 delta 9,12) and linolenic acid (18:3 delta 9,12,15). In the conventional pathway (i.e., the “standard” pathway or “classical” pathway) for PUFA synthesis, medium chain-length saturated fatty acids (products of a fatty acid synthase (FAS) system) are modified by a series of elongation and desaturation reactions. The substrates for the elongation reaction are fatty acyl-CoA (the fatty acid chain to be elongated) and malonyl-CoA (the source of the 2 carbons added during each elongation reaction). The product of the elongase reaction is a fatty acyl-CoA that has two additional carbons in the linear chain. The desaturases create cis double bonds in the preexisting fatty acid chain by extraction of 2 hydrogens in an oxygen-dependant reaction. The substrates for the desaturases are either acyl-CoA (in some animals) or the fatty acid that is esterified to the glycerol backbone of a phospholipid (e.g. phosphatidylcholine).
- Therefore, because a number of separate desaturase and elongase enzymes are required for fatty acid synthesis from linoleic and linolenic acids to produce the more unsaturated and longer chain PUFAs, engineering plant host cells for the expression of PUFAs such as EPA and docosahexaenoic acid (DHA) may require expression of several separate enzymes to achieve synthesis. Additionally, for production of useable quantities of such PUFAs, additional engineering efforts may be required. Therefore, it is of interest to obtain genetic material involved in PUPA biosynthesis from species that naturally produce these fatty acids (e.g., from a PUFA PKS system) and to express the isolated material alone or in combination in a heterologous system which can be manipulated to allow production of commercial quantities of PUFAs.
- There have been many efforts to produce PUFAs in oil-seed crop plants by modification of the endogenously-produced fatty acids. Genetic modification of these plants with various individual genes for fatty acid elongases and desaturases has produced leaves or seeds containing measurable levels of PUFAs such as EPA, but also containing significant levels of mixed shorter-chain and less unsaturated PUFAs (Qi et al., Nature Biotech. 22:739 (2004); PCT Publication No. WO 04/071467; Abbadi et al., Plant Cell 16:1 (2004)); Napier and Sayanova, Proceedings of the Nutrition Society (2005), 64:387-393; Robert et al., Functional Plant Biology (2005) 32:473-479; or U.S. Patent Application Publication 2004/0172682.
- Improvement in both microbial and plant production of PUFAs is a highly desirable commercial goal. Therefore, there remains a need in the art for a method to efficiently and effectively produce quantities of lipids (e.g., triacylglycerol (TAG) and phospholipid (PL)) enriched in desired PUFAs, particularly in commercially useful organisms such as microorganisms and oil-seed plants.
- One embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence selected from: SEQ ID NO:1, SEQ ID NO:3, and SEQ ID NO:5; (b) a nucleic acid sequence encoding an amino acid sequence selected from: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6; (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has β-keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (d) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:4 or that is a fragment of SEQ ID NO:4, wherein said amino acid sequence has KS activity, chain length factor (CLF) activity, acyl transferase (AT) activity, and enoyl ACP-reductase (ER) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:4 and a glutamate at a position corresponding to amino acid 1415 of SEQ ID NO:4; and (e) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:6 or that is a fragment of SEQ ID NO:6, wherein said amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity and ER activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 876-890 of SEQ ID NO:6.
- In one aspect, the nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has β-keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:4 or that is a fragment of SEQ ID NO:4, wherein said amino acid sequence has KS activity, chain length factor (CLF) activity, acyl transferase (AT) activity, and enoyl ACP-reductase (ER) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:4 and a glutamate at a position corresponding to amino acid 1415 of SEQ ID NO:4; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:6 or that is a fragment of SEQ ID NO:6, wherein said amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity and ER activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 876-890 of SEQ ID NO:6.
- In one aspect, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence selected from SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6. In another aspect, the nucleic acid molecule comprises a nucleic acid sequence selected from: SEQ ID NO:1, SEQ ID NO:3, and SEQ ID NO:5.
- In one aspect of this embodiment, the nucleic acid molecule of (a) comprises a nucleic acid sequence encoding the amino acid sequence encoded by a plasmid selected from: pKJ1126 (ATCC Accession No. ______), pJK306 (ATCC Accession No. ______), and pJK320 (ATCC Accession No. ______). In one aspect of this embodiment, the nucleic acid molecule of (b) comprises a nucleic acid sequence encoding the amino acid sequence encoded by a plasmid selected from: pJK1129 (ATCC Accession No. ______), and pJK324 (ATCC Accession No. ______). In another aspect of this embodiment, the nucleic acid molecule of (c) comprises a nucleic acid sequence encoding the amino acid sequence encoded by a plasmid selected from: pJK1131 (ATCC Accession No. ______) and pBR002 (ATCC Accession No. ______).
- Another embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a first nucleic acid sequence encoding a first amino acid sequence that has β-keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, wherein the first nucleic acid sequence hybridizes under very high stringency conditions to the complement of a second nucleic acid sequence encoding a second amino acid sequence of SEQ ID NO:2, and wherein said first amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a first nucleic acid sequence encoding a first amino acid sequence that has KS activity, chain length factor (CLF) activity, acyl transferase (AT) activity, and enoyl ACP-reductase (ER) activity, wherein the first nucleic acid sequence hybridizes under very high stringency conditions to the complement of a second nucleic acid sequence encoding a second amino acid sequence of SEQ ID NO:4, and wherein said first amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:4 and a glutamate at a position corresponding to amino acid 1415 of SEQ ID NO:4; and (c) a first nucleic acid sequence encoding a first amino acid sequence that has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity and ER activity, wherein the first nucleic acid sequence hybridizes under very high stringency conditions to the complement of a second nucleic acid sequence encoding a second amino acid sequence of SEQ ID NO:6, and wherein said first amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 876-890 of SEQ ID NO:6. In one aspect of this embodiment, the first nucleic acid sequence is isolated from a Schizochytrium, such as, but not limited to, Schizochytrium ATCC 20888.
- Yet another embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence of SEQ ID NO:9; (b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:10; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:10 or that is a fragment of SEQ ID NO:10, wherein the amino acid sequence has malonyl-CoA:ACP acyltransferase (MAT) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 93 of SEQ ID NO:10 and a histidine at a position corresponding to amino acid 94 of SEQ ID NO:10. In one aspect of this embodiment, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:10 or that is a fragment of SEQ ID NO:10, wherein the amino acid sequence has malonyl-CoA:ACP acyltransferase (MAT) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 93 of SEQ ID NO:10 and a histidine at a position corresponding to amino acid 94 of SEQ ID NO:10. In one aspect, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:10.
- Another embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence of SEQ ID NO:19; (b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:20; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:20 or that is a fragment of SEQ ID NO:20, wherein the amino acid sequence has β-keto acyl-ACP synthase (KS) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:20. In one aspect of this embodiment, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:20 or that is a fragment of SEQ ID NO:20, wherein the amino acid sequence has β-keto acyl-ACP synthase (KS) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:20. In one aspect, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:20.
- Yet another embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from: (a) a nucleic acid sequence of SEQ ID NO:29; (b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:30; and (c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:30 or that is a fragment of SEQ ID NO:30, wherein the amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 426-440 of SEQ ID NO:30. In one aspect, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence that is at least 95% identical to SEQ ID NO:30 or that is a fragment of SEQ ID NO:30, wherein the amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 426-440 of SEQ ID NO:30. In one aspect, the nucleic acid molecule comprises a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:30.
- Another embodiment of the invention relates to a recombinant nucleic acid molecule comprising any of the nucleic acid molecules described above, operatively linked to at least one transcription control sequence.
- Yet another embodiment of the invention relates to a recombinant cell transfected with any of the nucleic acid molecules described above. In one aspect, the recombinant cell is a microorganism. In another aspect, the recombinant cell is a plant cell.
- Another embodiment of the present invention relates to an isolated nucleic acid molecule consisting essentially of a nucleic acid sequence that is fully complementary to any of the nucleic acid molecules described above.
- Another embodiment of the present invention relates to a genetically modified microorganism that has been transformed with any of the nucleic acid molecules described above.
- Yet another embodiment of the present invention relates to a genetically modified plant that has been transformed with any of the nucleic acid molecules described above.
- Another embodiment of the present invention relates to a genetically modified microorganism that has been transformed with: (a) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has β-keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:4 or that is a fragment of SEQ ID NO:4, wherein said amino acid sequence has KS activity, chain length factor (CLF) activity, acyl transferase (AT) activity, and enoyl ACP-reductase (ER) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:4 and a glutamate at a position corresponding to amino acid 1415 of SEQ ID NO:4; and (c) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:6 or that is a fragment of SEQ ID NO:6, wherein said amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity and ER activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 876-890 of SEQ ID NO:6. In one aspect, the microorganism has been transformed with a nucleic acid molecule comprising a nucleic acid sequence encoding SEQ ID NO:2, a nucleic acid molecule comprising a nucleic acid sequence encoding SEQ ID NO:4, and a nucleic acid molecule comprising a nucleic acid sequence encoding SEQ ID NO:6. In one aspect, the microorganism endogenously expresses a PUFA PKS system. In another aspect, the microorganism has been further transformed with a recombinant nucleic acid molecule encoding a phosphopantetheine transferase. The microorganism can include, but is not limited to, a Thraustochytriales microorganism, a bacterium or a yeast.
- Yet another embodiment of the present invention relates to a method to produce a bioactive molecule, comprising culturing under conditions effective to produce said bioactive molecule a genetically modified organism described above. In one aspect, the bioactive molecule is a polyunsaturated fatty acid (PUFA).
- Another embodiment of the present invention relates to a genetically modified plant or part of the plant, wherein said plant has been transformed with: (a) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:2 or that is a fragment of SEQ ID NO:2, wherein said amino acid sequence has β-keto acyl-ACP synthase (KS) activity, malonyl-CoA:ACP acyltransferase (MAT) activity, acyl carrier protein (ACP) activity and ketoreductase (KR) activity, and wherein said amino acid sequence comprises an aspartate at a position corresponding to amino acid 667 of SEQ ID NO:2 and a histidine at a position corresponding to amino acid 668 of SEQ ID NO:2; (b) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:4 or that is a fragment of SEQ ID NO:4, wherein said amino acid sequence has KS activity, chain length factor (CLF) activity, acyl transferase (AT) activity, and enoyl ACP-reductase (ER) activity, and wherein said amino acid sequence comprises a valine at a position corresponding to amino acid 371 of SEQ ID NO:4 and a glutamate at a position corresponding to amino acid 1415 of SEQ ID NO:4; and (c) a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:6 or that is a fragment of SEQ ID NO:6, wherein said amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity and ER activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 876-890 of SEQ ID NO:6. In one aspect, the plant has been further genetically modified to express a recombinant nucleic acid molecule encoding a phosphopantetheine transferase. In one aspect, the plant is a dicotyledonous plant, and in another aspect, the plant is a monocotyledonous plant. In another aspect, the plant is selected from: canola, soybean, rapeseed, linseed, corn, safflower, sunflower and tobacco.
- In one aspect, the plant is an oilseed plant and the part of the plant is a mature oilseed. In one aspect, the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one PUFA selected from DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), and wherein the total fatty acids produced as a result of transformation with said nucleic acid molecules, other than said at least one PUFA, comprise less than about 10% of the total fatty acids produced by said plant. In one aspect, the total fatty acids produced as a result of transformation with said nucleic acid molecules, other than said at least one PUFA, comprise less than 5% by weight of the total fatty acids produced by said plant. In another aspect, the fatty acids consisting of gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds, comprise less than 5% by weight of the total fatty acids produced by said plant. In another aspect, gamma-linolenic acid (GLA; 18:3, n-6) comprises less than 1% by weight of the total fatty acids produced by said plant.
- Yet another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises detectable amounts of DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), wherein the ratio of DPAn-6 to DHA is 1:1 or greater than 1:1.
- Another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises detectable amounts of DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), wherein the ratio of DPAn-6 to DHA is less than 1:1. In either of the two embodiments above, in one aspect, the total fatty acid profile in the plant or part of the plant contains less than 5% by weight in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- Yet another embodiment of the present invention relates to plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUPA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 5% in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- Another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 1% of each of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- Another embodiment of the present invention relates to a plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 2% of gamma-linolenic acid (GLA; 18:3, n-6) and dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6).
- Another embodiment of the present invention relates to seeds obtained from any of the plants or part of plants described above, a food product comprising such seeds, an oil obtained from such seeds, and a food product comprising such oil. Also included in the invention is an oil blend comprising such oil and another oil, such as, but not limited to, a microbial oil, a fish oil, and a vegetable oil.
- Yet another embodiment of the present invention relates to an oil comprising the following fatty acids: DHA (C22:6n-3), DPAn-6 (C22:5n-6), oleic acid (C18:1), linolenic acid (C18:3), linoleic acid (C18:2), C16:0, C18.0, C20:0, C20:1n-9, C20:2n-6, C22:1n-9; wherein the oil comprises less than 0.5% of any of the following fatty acids: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- Another embodiment of the present invention relates to an oilseed plant that produces mature seeds in which the total seed fatty acid profile comprises at least 1.0% by weight of at least one polyunsaturated fatty acid selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 5% in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- Yet another embodiment of the present invention relates to an oilseed plant that produces mature seeds in which the total seed fatty acid profile comprises at least 1.0% by weight of at least one polyunsaturated fatty acid (PUFA) selected from DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 1% of gamma-linolenic acid (GLA; 18:3, n-6).
- Another embodiment of the present invention relates to a method to produce a bioactive molecule, comprising growing under conditions effective to produce said bioactive molecule a genetically modified plant as described above. In one aspect, the bioactive molecule is a polyunsaturated fatty acid (PUFA).
- Yet another embodiment of the present invention relates to a method to produce a plant that has a polyunsaturated fatty acid (PUFA) profile that differs from the naturally occurring plant, comprising genetically modifying said plant to express a PUFA PKS system comprising at least one of any of the nucleic acid molecules as described above.
- Another embodiment of the present invention relates to a method to produce a recombinant microbe, comprising genetically modifying microbial cells to express at least one of any of the nucleic acid molecules as described above.
-
FIG. 1 is a graphical representation of the domain structure of the Schizochytrium PUFA PKS system. -
FIG. 2 shows a comparison of PKS domains from Schizochytrium and Shewanella. -
FIG. 3 shows a GC FAME profile of control yeast and yeast expressing Orfs sA, sB, C and Het I. -
FIG. 4 shows a GC FAME profile of the PUFA region fromFIG. 3 . -
FIG. 5 shows GC FAME profiles of wild-type Arabidopsis and Arabidopsis Line 269 (plastid targeted). -
FIG. 6 is a schematic diagram showing the construction of pSBS4107: Acyl-ACP transit peptide-HetI: Acyl-ACP transit peptide-ORFC. -
FIG. 7 is a schematic diagram showing the construction of pSBS5720: Acyl-ACP transit peptide-ORFB. -
FIG. 8 is a schematic diagram showing the construction of pSBS4757: Acyl-ACP transit peptide-ORFA. - The present invention generally relates to polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) systems from Schizochytrium, to genetically modified organisms comprising Schizochytrium PUFA PKS systems, to methods of making and using such systems for the production of products of interest, including bioactive molecules, and to PUFA PKS systems identified using the structural information for the Schizochytrium PUFA PKS systems disclosed herein. In one preferred embodiment, the present invention relates to a method to produce PUFAs in an oil-seed plant that has been genetically modified to express a PUFA PKS system of the present invention. The oils produced by the plant contain at least one PUFA produced by the PUFA PKS system and are substantially free of the mixed shorter-chain and less unsaturated PUFAs that are fatty acid products produced by the modification of products of the FAS system.
- As used herein, a PUFA PKS system (which may also be referred to as a PUFA synthase system or PUFA synthase) generally has the following identifying features: (1) it produces PUFAs, and particularly, long chain PUFAs, as a natural product of the system; and (2) it comprises several multifunctional proteins assembled into a complex that conducts both iterative processing of the fatty acid chain as well non-iterative processing, including trans-cis isomerization and enoyl reduction reactions in selected cycles. In addition, the ACP domains present in the PUFA synthase enzymes require activation by attachment of a cofactor (4-phosphopantetheine). Attachment of this cofactor is carried out by phosphopantetheinyl transferases (PPTase). If the endogenous PPTases of the host organism are incapable of activating the PUFA synthase ACP domains, then it is necessary to provide a PPTase that is capable of carrying out that function. The inventors have identified the Het I enzyme of Nostoc sp. as an exemplary and suitable PPTase for activating PUFA synthase ACP domains. Reference to a PUFA PKS system or a PUFA synthase refers collectively to all of the genes and their encoded products that work in a complex to produce PUFAs in an organism. Therefore, the PUFA PKS system refers specifically to a PKS system for which the natural products are PUFAs.
- More specifically, first, a PUFA PKS system that forms the basis of this invention produces polyunsaturated fatty acids (PUFAs) and particularly, long chain PUFAs, as products (e.g., an organism that endogenously (naturally) contains such a PKS system makes PUFAs using this system). According to the present invention, PUFAs are fatty acids with a carbon chain length of at least 16 carbons, and more preferably at least 18 carbons, and more preferably at least 20 carbons, and more preferably 22 or more carbons, with at least 3 or more double bonds, and preferably 4 or more, and more preferably 5 or more, and even more preferably 6 or more double bonds, wherein all double bonds are in the cis configuration. Reference to long chain polyunsaturated fatty acids (LCPUFAs) herein more particularly refers to fatty acids of 18 and more carbon chain length, and preferably 20 and more carbon chain length, containing 3 or more double bonds. LCPUFAs of the omega-6 series include: gamma-linolenic acid (C18:3), di-homo-gammalinolenic acid (C20:3n-6), arachidonic acid (C20:4n-6), adrenic acid (also called docosatetraenoic acid or DTA) (C22:4n-6), and docosapentaenoic acid (C22:5n-6). The LCPUFAs of the omega-3 series include: alpha-linolenic acid (C18:3), eicosatrienoic acid (C20:3n-3), eicosatetraenoic acid (C20:4n-3), eicosapentaenoic acid (C20:5n-3), docosapentaenoic acid (C22:5n-3), and docosahexaenoic acid (C22:6n-3). The LCPUFAs also include fatty acids with greater than 22 carbons and 4 or more double bonds including but not limited to C28:8(n-3).
- Second, a PUFA PKS system according to the present invention comprises several multifunctional proteins (and can include single function proteins, particularly for PUFA PKS systems from marine bacteria) that are assembled into a complex that conducts both iterative processing of the fatty acid chain as well non-iterative processing, including trans-cis isomerization and enoyl reduction reactions in selected cycles. These proteins can also be referred to herein as the core PUFA PKS enzyme complex or the core PUFA PKS system. The general functions of the domains and motifs contained within these proteins are individually known in the art and have been described in detail with regard to various PUFA PKS systems from marine bacteria and eukaryotic organisms (see, e.g., U.S. Pat. No. 6,140,486; U.S. Pat. No. 6,566,583; Metz et al., Science 293:290-293 (2001); U.S. Patent Application Publication No. 20020194641; U.S. Patent Application Publication No. 20040235127; and U.S. Patent Application Publication No. 20050100995). The domains may be found as a single protein (i.e., the domain and protein are synonymous) or as one of two or more (multiple) domains in a single protein, as mentioned above.
- Before the discovery of a PUFA PKS system in marine bacteria (see U.S. Pat. No. 6,140,486), PKS systems were not known to possess this combination of iterative and selective enzymatic reactions, and they were not thought of as being able to produce carbon-carbon double bonds in the cis configuration. However, the PUFA PKS system described by the present invention has the capacity to introduce cis double bonds and the capacity to vary the reaction sequence in the cycle.
- The present inventors propose to use these features of the PUFA PKS system to produce a range of bioactive molecules that could not be produced by the previously described (Type I iterative or modular, Type II, or Type III) PKS systems. These bioactive molecules include, but are not limited to, polyunsaturated fatty acids (PUFAs), antibiotics or other bioactive compounds, many of which will be discussed below. For example, using the knowledge of the PUFA PKS gene structures described herein, any of a number of methods can be used to alter the PUFA PKS genes, or combine portions of these genes with other synthesis systems, including other PKS systems, such that new products are produced. The inherent ability of this particular type of system to do both iterative and selective reactions will enable this system to yield products that would not be found if similar methods were applied to other types of PKS systems.
- Preferably, a PUFA PKS system of the present invention comprises at least the following biologically active domains that are typically contained on three or more proteins: (a) at least one enoyl-ACP reductase (ER) domain; (b) multiple acyl carrier protein (ACP) domain(s) (e.g., at least from one to four, and preferably at least five ACP domains, and in some embodiments up to six, seven, eight, nine, or more than nine ACP domains); (c) at least two β-ketoacyl-ACP synthase (KS) domains; (d) at least one acyltransferase (AT) domain; (e) at least one β-ketoacyl-ACP reductase (KR) domain; (f) at least two FabA-like β-hydroxyacyl-ACP dehydrase (DH) domains; (g) at least one chain length factor (CLF) domain; (h) at least one malonyl-CoA:ACP acyltransferase (MAT) domain. In one embodiment, a PUFA PKS system according to the present invention also comprises at least one region containing a dehydratase (DH) conserved active site motif.
- In a preferred embodiment, a Schizochytrium PUFA PKS system comprises at least the following biologically active domains: (a) two enoyl-ACP reductase (ER) domain; (b) nine acyl carrier protein (ACP) domains; (c) two β-ketoacyl-ACP synthase (KS) domains; (d) one acyltransferase (AT) domain; (e) one β-ketoacyl-ACP reductase (KR) domain; (f) two FabA-like β-hydroxyacyl-ACP dehydrase (DH) domains; (g) one chain length factor (CLF) domain; and (h) one malonyl-CoA:ACP acyltransferase (MAT) domain. In one embodiment, a Schizochytrium PUFA PKS system according to the present invention also comprises at least one region or domain containing a dehydratase (DH) conserved active site motif that is not a part of a FabA-like DH domain. The structural and functional characteristics of these domains are generally individually known in the art and will be described in detail below with regard to the PUFA PKS systems of the present invention.
- A PUFA PKS system can additionally include one or more accessory proteins, which are defined herein as proteins that are not considered to be part of the core PUFA PKS system as described above (i.e., not part of the PUFA synthase enzyme complex itself), but which may be, or are, necessary for PUFA production or at least for efficient PUFA production using the core PUFA synthase enzyme complex of the present invention, particularly in certain host organisms (e.g., plants). For example, in order to produce PUFAs, a PUFA PKS system must work with an accessory protein that transfers a 4′-phosphopantetheinyl moiety from Coenzyme A to the acyl carrier protein (ACP) domain(s). Therefore, a PUFA PKS system can be considered to include at least one 4′-phosphopantetheinyl transferase (PPTase) domain, or such a domain can be considered to be an accessory domain or protein to the PUFA PKS system. When genetically modifying organisms (e.g., microorganisms or plants) to express a PUFA PKS system according to the present invention, some host organisms may endogenously express accessory proteins that are needed to work with the PUFA PKS to produce PUFAs (e.g., PPTases). However, some organisms may be transformed with nucleic acid molecules encoding one or more accessory proteins described herein to enable and/or to enhance production of PUFAs by the organism, even if the organism endogenously produces a homologous accessory protein (i.e., some heterologous accessory proteins may operate more effectively or efficiently with the transformed PUFA synthase proteins than the host cells' endogenous accessory protein). The present invention provides an example of bacteria, yeast and plants that have been genetically modified with the PUFA PKS system of the present invention that includes an accessory PPTase. Structural and functional characteristics of PPTases will be described in more detail below.
- According to the present invention, reference to a “standard” or “classical” pathway for the production of PUFAs refers to the fatty acid synthesis pathway where medium chain-length saturated fatty acids (products of a fatty acid synthase (FAS) system) are modified by a series of elongation and desaturation reactions. The substrates for the elongation reaction are fatty acyl-CoA (the fatty acid chain to be elongated) and malonyl-CoA (the source of the 2 carbons added during each elongation reaction). The product of the elongase reaction is a fatty acyl-CoA that has two additional carbons in the linear chain. The desaturases create cis double bonds in the preexisting fatty acid chain by extraction of 2 hydrogens in an oxygen-dependant reaction. Such pathways and the genes involved in such pathways are well-known in the literature.
- As used herein, the term “lipid” includes phospholipids (PL); free fatty acids; esters of fatty acids; triacylglycerols (TAG); diacylglycerides; monoacylglycerides; phosphatides; waxes (esters of alcohols and fatty acids); sterols and sterol esters; carotenoids; xanthophylls (e.g., oxycarotenoids); hydrocarbons; and other lipids known to one of ordinary skill in the art. The terms “polyunsaturated fatty acid” and “PUFA” include not only the free fatty acid form, but other forms as well, such as the TAG form and the PL form.
- A PUFA PKS system described according to the present invention is a non-bacterial PUFA PKS system. In other words, the PUFA PKS system of the present invention is isolated from an organism that is not a bacteria, or is a homologue of or derived from a PUFA PKS system from an organism that is not a bacteria, such as a eukaryote or an archaebacterium. Eukaryotes are separated from prokaryotes based on the degree of differentiation of the cells, with eukaryotes being more differentiated than prokaryotes. In general, prokaryotes do not possess a nuclear membrane, do not exhibit mitosis during cell division, have only one chromosome, their cytoplasm contains 70S ribosomes, they do not possess any mitochondria, endoplasmic reticulum, chloroplasts, lysosomes or golgi apparatus, their flagella (if present) consists of a single fibril. In contrast, eukaryotes have a nuclear membrane, they do exhibit mitosis during cell division, they have many chromosomes, their cytoplasm contains 80S ribosomes, they do possess mitochondria, endoplasmic reticulum, chloroplasts (in algae), lysosomes and golgi apparatus, and their flagella (if present) consists of many fibrils. In general, bacteria are prokaryotes, while algae, fungi, protist, protozoa and higher plants are eukaryotes. The PUFA PKS systems of the marine bacteria (e.g., Shewanella and Vibrio marinus) are not the basis of the present invention, although the present invention does contemplate the use of domains from these bacterial PUFA PKS systems in conjunction with domains from the non-bacterial (e.g., Schizochytrium) PUFA PKS systems of the present invention. For example, according to the present invention, genetically modified organisms can be produced which incorporate non-bacterial PUFA PKS functional domains with bacterial PUFA PKS functional domains, as well as PKS functional domains or proteins from other PKS systems (Type I iterative or modular, Type II, or Type III) or FAS systems.
- Schizochytrium is a Thraustochytrid marine microorganism that accumulates large quantities of triacylglycerols rich in DHA and docosapentaenoic acid (DPA; 22:5%-6); e.g., 30% DHA+DPA by dry weight (Barclay et al., J. Appl. Phycol. 6, 123 (1994)). In eukaryotes that synthesize 20- and 22-carbon PUFAs by an elongation/desaturation pathway, the pools of 18-, 20- and 22-carbon intermediates are relatively large so that in vivo labeling experiments using [14C]-acetate reveal clear precursor-product kinetics for the predicted intermediates (Gellerman et al., Biochim. Biophys. Acta 573:23 (1979)). Furthermore, radiolabeled intermediates provided exogenously to such organisms are converted to the final PUFA products. The present inventors have shown that [1-14C]-acetate was rapidly taken up by Schizochytrium cells and incorporated into fatty acids, but at the shortest labeling time (1 min), DHA contained 31% of the label recovered in fatty acids, and this percentage remained essentially unchanged during the 10-15 min of [14C]-acetate acetate incorporation and the subsequent 24 hours of culture growth (See U.S. Patent Application Publication No. 20020194641, supra). Similarly, DPA represented 10% of the label throughout the experiment. There is no evidence for a precursor-product relationship between 16- or 18-carbon fatty acids and the 22-carbon polyunsaturated fatty acids. These results are consistent with rapid synthesis of DHA from [14C]-acetate involving very small (possibly enzyme-bound) pools of intermediates. A cell-free homogenate derived from Schizochytrium cultures incorporated [1-14C]-malonyl-CoA into DHA, DPA, and saturated fatty acids. The same biosynthetic activities were retained by a 100,000×g supernatant fraction but were not present in the membrane pellet. Thus, DHA and DPA synthesis in Schizochytrium does not involve membrane-bound desaturases or fatty acid elongation enzymes like those described for other eukaryotes (Parker-Barnes et al., 2000, supra; Shanklin et al., 1998, supra). These fractionation data contrast with those obtained from the Shewanella enzymes (See Metz et al., 2001, supra) and may indicate use of a different (soluble) acyl acceptor molecule, such as CoA, by the Schizochytrium enzyme.
- As described in U.S. Pat. No. 6,566,583, a cDNA library from Schizochytrium was constructed and approximately 8,000 random clones (ESTs) were sequenced. Within this dataset, only one moderately expressed gene (0.3% of all sequences) was identified as a fatty acid desaturase, although a second putative desaturase was represented by a single clone (0.01%). By contrast, sequences that exhibited homology to 8 of the 11 domains of the Shewanella PKS genes shown in
FIG. 2 were all identified at frequencies of 0.2-0.5%. In U.S. Pat. No. 6,566,583, several cDNA clones showing homology to the Shewanella PKS genes were sequenced, and various clones were assembled into nucleic acid sequences representing two partial open reading frames and one complete open reading frame. - Nucleotides 390-4443 of the cDNA sequence containing the first partial open reading frame described in U.S. application Ser. No. 09/231,899 (denoted therein as SEQ ID NO:69) match nucleotides 4677-8730 (plus the stop codon) of the sequence denoted herein as OrfA (SEQ ID NO:1). A cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB3033-047-B5 comprises at least a portion of nucleotides 4677-8730 of SEQ ID NO:1 described herein, to the best of the present inventors' knowledge. Specifically, the sequence of the insert in cDNA clone LIB3033-047-B5 begins at nucleotide 6719 of SEQ ID NO:1 and extends to the end of the Orf (position 8730 of SEQ ID NO:1), plus about 71 nucleotides beyond the end of the Orf represented by SEQ ID NO:1. cDNA clone LIB3033-047-B5 (denoted cDNA clone LIB3033-047-B5 in the form of an E. coli plasmid vector containing “Orf6 homolog” partial gene sequence from Schizochytrium sp.) was deposited with the American Type Culture Collection (ATCC), University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of cDNA clone LIB3033-047-B5, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention. A second cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB3033-046-E6 shared homology to the ACP domain of ORF6, contained 6 ACP repeats, and comprises at least a portion of nucleotides of SEQ ID NO:1 of the present invention. This cDNA clone did not have a poly-A-tail, and therefore, was a partial cDNA with additional regions of the cDNA found downstream of the sequence. The nucleotide sequence of cDNA clone LIB3033-046-E6, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention.
- Nucleotides 1-4867 of the cDNA sequence containing the second partial open reading frame described in U.S. application Ser. No. 09/231,899 (denoted therein as SEQ ID NO:71) matches nucleotides 1311-6177 (plus the stop codon) of the sequence denoted herein as OrfB (SEQ ID NO:3), with the exception of the nucleotide at position 2933 of SEQ ID NO:71 of the '899 application, which corresponds to the nucleotide at position 4243 of SEQ ID NO:3 set forth herein. This single nucleotide change (C to G) results in a single amino acid change in SEQ ID NO:4 disclosed herein, as compared to SEQ ID NO:72 of the '899 application. Specifically, the glutamine residue at position 978 of SEQ ID NO:72 in the '899 application is changed to a glutamate at position 1415 of SEQ ID NO:4. This amino acid occurs in the linker region between the AT domain and the ER domain of SEQ ID NO:4. A cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB3033-046-D2 comprises nucleotides 1311-6177 of SEQ ID NO:3 described herein, plus about 382 additional nucleotides beyond the end of the Orf represented here as SEQ ID NO:3, to the best of the present inventors' knowledge. cDNA clone LIB3033-046-D2 (denoted cDNA clone LIB3033-046-D2 in the form of an E. coli plasmid vector containing “hglC/Orf7/Orf8/Orf9 homolog” gene from Schizochytrium) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of cDNA clone LIB3033-046-D2, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention.
- Nucleotides 145-4653 of the cDNA sequence containing the complete open reading frame described in U.S. application Ser. No. 09/231,899 (denoted therein as SEQ ID NO:76 and incorrectly designated as a partial open reading frame) matches nucleotides 1-2624 and 2675-4506 of the sequence denoted herein as OrfC (SEQ ID NO:5). Sequencing of the genomic DNA encoding OrfC revealed that there is an additional nucleotide at each of positions 2769, 2806 and 2818 of SEQ ID NO:76 of the '899 application which resulted in a frame shift and a short change in the amino acid sequence of the corresponding protein. Therefore, amino acid positions 924-939 of SEQ ID NO:73 of the '899 application represent an incorrect sequence. Positions 876-890 of SEQ ID NO:5 herein represent the correct amino acid sequence in this region. This sequence is located in the DH2 domain of OrfC (discussed below). A cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB81-042-B9 comprises a portion of the 5′ sequence of SEQ ID NO:5. To the best of the present inventors' knowledge, the sequence of the insert in LIB81-042-B9 contains 145 nucleotides upstream of the start codon of SEQ ID NO:5 and extends 2361 nucleotides into the Orf. cDNA clone LIB81-042-B9 (denoted cDNA clone LIB81-042-B9 in the form of an E. coli plasmid vector containing “0118 homolog” partial gene sequence from Schizochytrium sp.) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of cDNA clone LIB81-042-B9, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention. A second cDNA clone described in U.S. application Ser. No. 09/231,899 as cDNA clone LIB81-015-D5 aligns with Shewanella ORF8 and also with Shewanella ORF9. The open reading frame of LIB81-015-D5 aligns with SEQ ID NO:5 beginning at nucleotide 2526 of SEQ ID NO:5 and extends to the end of the Orf (i.e., position 4506), plus about 115 bp including a poly A tail beyond SEQ ID NO:5. The nucleotide sequence of cDNA clone LIB81-015-D5, and the amino acid sequence encoded by this cDNA clone are encompassed by the present invention.
- Further sequencing of cDNA and genomic clones by the present inventors allowed the identification of the full-length genomic sequence of each of OrfA, OrfB and OrfC in Schizochytrium, including in Schizochytrium sp. ATCC 20888 and the mutated daughter strain of ATCC 20888, denoted Schizochytrium sp., strain N230D. N230D was one of more than 1,000 randomly-chosen survivors of chemically mutagenised (NTG; 1-methyl-3-nitro-1-nitrosoguanidine) Schizochytrium ATCC 20888 screened for variations in fatty acid content. This particular strain was valued for its improved DHA productivity. Further, the complete identification of the domains with homology to those in Shewanella (see
FIG. 2 ) were identified. It is noted that in Schizochytrium, the Orfs of the genomic DNA and cDNA are identical, due to the lack of introns in the organism genome, to the best of the present inventors' knowledge. Therefore, reference to a nucleotide sequence of Orfs from Schizochytrium can refer to genomic DNA or cDNA. -
FIG. 1 is a graphical representation of the three open reading frames from the Schizochytrium PUFA PKS system, and includes the domain structure of this PUFA PKS system. The domain structure of each open reading frame is as follows: - The complete nucleotide sequence for OrfA is represented herein as SEQ ID NO:1. Nucleotides 4677-8730 of SEQ ID NO:1 correspond to nucleotides 390-4443 of the sequence denoted as SEQ ID NO:69 in U.S. application Ser. No. 09/231,899. Therefore, nucleotides 1-4676 of SEQ ID NO:1 represent additional sequence that was not disclosed in U.S. application Ser. No. 09/231,899. This novel region of SEQ ID NO:1 encodes the following domains in OrfA: (1) the ORFA-KS domain; (2) the ORFA-MAT domain; and (3) at least a portion of the ACP domain region (e.g., at least ACP domains 1-4). It is noted that nucleotides 1-389 of SEQ ID NO:69 in U.S. application Ser. No. 09/231,899 do not exactly match with the 389 nucleotides that are upstream of position 4677 in SEQ ID NO:1 disclosed herein. Therefore, positions 1-389 of SEQ ID NO:69 in U.S. application Ser. No. 09/231,899 appear to be incorrectly placed next to nucleotides 390-4443 of that sequence. Most of these first 389 nucleotides (about positions 60-389) are a match with an upstream portion of OrfA (SEQ ID NO:1) of the present invention and therefore, it is believed that an error occurred in the effort to prepare the contig of the cDNA constructs in U.S. application Ser. No. 09/231,899. The region in which the alignment error occurred in U.S. application Ser. No. 09/231,899 is within the region of highly repetitive sequence (i.e., the ACP region, discussed below), which probably created some confusion in the assembly of that sequence from various cDNA clones.
- OrfA is a 8730 nucleotide sequence (not including the stop codon) which encodes a 2910 amino acid sequence, represented herein as SEQ ID NO:2. Within OrfA are twelve domains: (a) one β-keto acyl-ACP synthase (KS) domain; (b) one malonyl-CoA:ACP acyltransferase (MAT) domain; (c) nine acyl carrier protein (ACP) domains; and (d) one ketoreductase (KR) domain.
- A nucleotide sequence for OrfA has been deposited with GenBank as Accession No. AF378327 (amino acid sequence Accession No. AAK728879). The nucleotide sequence represented by GenBank Accession No. AF378327 differs from the sequence represented herein as SEQ ID NO:1 by the point nucleotide changes: (1) at position 1999 (A to G, resulting in an amino acid change from an asparagine to an aspartic acid at position 667 of SEQ ID NO:2); (2) at position 2003 (C to A, resulting in an amino acid change from a proline to a histidine at position 668 of SEQ ID NO:2); and (3) at position 2238 (A to C, resulting in no amino acid change at position 746 of SEQ ID NO:2). Each of the two amino acid changes from the amino acid sequence encoded by GenBank Accession No. AAK728879 are located in the MAT domain (SEQ ID NO:10) of SEQ ID NO:2.
- Genomic DNA clones (plasmids) encoding OrfA from both Schizochytrium sp. ATCC 20888 and a daughter strain of ATCC 20888, denoted Schizochytrium sp., strain N230D, have been isolated and sequenced. A genomic clone described herein as JK1126, isolated from Schizochytrium sp. ATCC 20888, comprises, to the best of the present inventors' knowledge, the nucleotide sequence spanning from
position 1 to 4119 and from position 5498 to 8730 of SEQ ID NO:1, and encodes the corresponding amino acid sequence of SEQ ID NO:2. Indeed, it is expected that JK1126 comprises SEQ ID NO:1 in its entirety and encodes SEQ ID NO:2. Genomic clone pJK1126 (denoted pJK1126 OrfA genomic clone, in the form of an E. coli plasmid vector containing “OrfA” gene from Schizochytrium ATCC 20888) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pJK1126 OrfA genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention. - Two genomic clones described herein as pJK306 OrfA genomic clone and pJK320 OrfA genomic clone, isolated from Schizochytrium sp. N230D, together (overlapping clones) comprise, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:1, and encode the amino acid sequence of SEQ ID NO:2. Genomic clone pJK306 (denoted pJK306 OrfA genomic clone, in the form of an E. coli plasmid containing 5′ portion of OrfA gene from Schizochytrium sp. N230D (2.2 kB overlap with pJK320)) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pJK306 OrfA genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention. Genomic clone pJK320 (denoted pJK320 OrfA genomic clone, in the form of an E. coli plasmid containing 3′ portion of OrfA gene from Schizochytrium sp. N230D (2.2 kB overlap with pJK306)) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pJK320 OrfA genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- OrfA was compared with known sequences in a standard BLAST search (BLAST 2.0 Basic BLAST homology search using blastp for amino acid searches, blastn for nucleic acid searches, and blastX for nucleic acid searches and searches of the translated amino acid sequence in all 6 open reading frames with standard default parameters, wherein the query sequence is filtered for low complexity regions by default (described in Altschul, S.F., Madden, T. L., Schääffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:3389-3402, incorporated herein by reference in its entirety)). At the nucleic acid level, OrfA has no significant homology to any known nucleotide sequence. At the amino acid level, the sequences with the greatest degree of homology to ORFA were: Nostoc sp. 7120 heterocyst glycolipid synthase (Accession No. NC—003272), which was 42% identical to ORFA over 1001 amino acid residues; and Moritella marinas (Vibrio marinas) ORF8 (Accession No. AB025342), which was 40% identical to ORFA over 993 amino acid residues.
- The first domain in OrfA is a KS domain, also referred to herein as ORFA-KS. This domain is contained within the nucleotide sequence spanning from a starting point of between about
positions 1 and 40 of SEQ ID NO:1 (OrfA) to an ending point of between about positions 1428 and 1500 of SEQ ID NO:1 (based on homology to other PUFA PKS domains, the position of the domain spans from aboutposition 1 to about position 1500; based on Pfam analysis, a KS core region spans from about position 40 to about position 1428). The nucleotide sequence containing the sequence encoding the ORFA-KS domain is represented herein as SEQ ID NO:7 (positions 1-1500 of SEQ ID NO:1). The amino acid sequence containing the KS domain spans from a starting point of between aboutpositions - According to the present invention, a domain or protein having 3-keto acyl-ACP synthase (KS) biological activity (function) is characterized as the enzyme that carries out the initial step of the FAS (and PKS) elongation reaction cycle. The acyl group destined for elongation is linked to a cysteine residue at the active site of the enzyme by a thioester bond. In the multi-step reaction, the acyl-enzyme undergoes condensation with malonyl-ACP to form -keto acyl-ACP, CO2 and free enzyme. The KS plays a key role in the elongation cycle and in many systems has been shown to possess greater substrate specificity than other enzymes of the reaction cycle. For example, E. coli has three distinct KS enzymes—each with its own particular role in the physiology of the organism (Magnuson et al., Microbiol. Rev. 57, 522 (1993)). The two KS domains of the PUFA-PKS systems could have distinct roles in the PUFA biosynthetic reaction sequence.
- As a class of enzymes, KS's have been well characterized. The sequences of many verified KS genes are know, the active site motifs have been identified and the crystal structures of several have been determined. Proteins (or domains of proteins) can be readily identified as belonging to the KS family of enzymes by homology to known KS sequences.
- The second domain in OrfA is a MAT domain, also referred to herein as ORFA-MAT. This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1723 and 1798 of SEQ ID NO:1 (OrfA) to an ending point of between about positions 2805 and 3000 of SEQ ID NO:1 (based on homology to other PUFA PKS domains, the position of the MAT domain spans from about position 1723 to about position 3000; based on Pfam analysis, a MAT core region spans from about position 1798 to about position 2805). The nucleotide sequence containing the sequence encoding the ORFA-MAT domain is represented herein as SEQ ID NO:9 (positions 1723-3000 of SEQ ID NO:1). The amino acid sequence containing the MAT domain spans from a starting point of between about positions 575 and 600 of SEQ ID NO:2 (ORFA) to an ending point of between about positions 935 and 1000 of SEQ ID NO:2 (again, referring to the overall homology to PUFA PKS MAT domains and to Pfam core regions, respectively). The amino acid sequence containing the ORFA-MAT domain is represented herein as SEQ ID NO:10 (positions 575-1000 of SEQ ID NO:2). The MAT domain comprises an aspartate at position 93 and a histidine at position 94 (corresponding to positions 667 and 668, respectively, of SEQ ID NO:2). It is noted that the ORFA-MAT domain contains an active site motif: GHS*XG (*acyl binding site S706), represented herein as SEQ ID NO:11.
- According to the present invention, a domain or protein having malonyl-CoA:ACP acyltransferase (MAT) biological activity (function) is characterized as one that transfers the malonyl moiety from malonyl-CoA to ACP. In addition to the active site motif (GxSxG), these enzymes possess an extended motif R and Q amino acids in key positions) that identifies them as MAT enzymes (in contrast to the AT domain of SchizochytriumOrf B). In some PKS systems (but not the PUFA PKS domain) MAT domains will preferentially load methyl- or ethyl-malonate on to the ACP group (from the corresponding CoA ester), thereby introducing branches into the linear carbon chain. MAT domains can be recognized by their homology to known MAT sequences and by their extended motif structure.
- Domains 3-11 of OrfA are nine tandem ACP domains, also referred to herein as ORFA-ACP (the first domain in the sequence is ORFA-ACP1, the second domain is ORFA-ACP2, the third domain is ORFA-ACP3, etc.). The first ACP domain, ORFA-ACP1, is contained within the nucleotide sequence spanning from about position 3343 to about position 3600 of SEQ ID NO:1 (OrfA). The nucleotide sequence containing the sequence encoding the ORFA-ACP1 domain is represented herein as SEQ ID NO:12 (positions 3343-3600 of SEQ ID NO:1). The amino acid sequence containing the first ACP domain spans from about position 1115 to about position 1200 of SEQ ID NO:2. The amino acid sequence containing the ORFA-ACP1 domain is represented herein as SEQ ID NO:13 (positions 1115-1200 of SEQ ID NO:2). It is noted that the ORFA-ACP1 domain contains an active site motif: LGIDS* (*pantetheine binding motif S1157), represented herein by SEQ ID NO:14.
- The nucleotide and amino acid sequences of all nine ACP domains are highly conserved and therefore, the sequence for each domain is not represented herein by an individual sequence identifier. However, based on the information disclosed herein, one of skill in the art can readily determine the sequence containing each of the other eight ACP domains (see discussion below).
- All nine ACP domains together span a region of OrfA of from about position 3283 to about position 6288 of SEQ ID NO:1, which corresponds to amino acid positions of from about 1095 to about 2096 of SEQ ID NO:2. The nucleotide sequence for the entire ACP region containing all nine domains is represented herein as SEQ ID NO:16. The region represented by SEQ ID NO:16 includes the linker segments between individual ACP domains. The repeat interval for the nine domains is approximately every 330 nucleotides of SEQ ID NO:16 (the actual number of amino acids measured between adjacent active site serines ranges from 104 to 116 amino acids). Each of the nine ACP domains contains a pantetheine binding motif LGIDS* (represented herein by SEQ ID NO:14), wherein S* is the pantetheine binding site serine (S). The pantetheine binding site serine (S) is located near the center of each ACP domain sequence. At each end of the ACP domain region and between each ACP domain is a region that is highly enriched for proline (P) and alanine (A), which is believed to be a linker region. For example, between
ACP domains 1 and 2 is the sequence: APAPVKAAAPAAPVASAPAPA, represented herein as SEQ ID NO:15. The locations of the active site serine residues (i.e., the pantetheine binding site) for each of the nine ACP domains, with respect to the amino acid sequence of SEQ ID NO:2, are as follows: ACP1=S1157; ACP2=S1266; ACP3=S1377; ACP4=S1488; ACP5=S1604; ACP6=S1715; ACP7=S1819; ACP8=S1930; and ACP9=S2034. Given that the average size of an ACP domain is about 85 amino acids, excluding the linker, and about 110 amino acids including the linker, with the active site serine being approximately in the center of the domain, one of skill in the art can readily determine the positions of each of the nine ACP domains in OrfA. - According to the present invention, a domain or protein having acyl carrier protein (ACP) biological activity (function) is characterized as being small polypeptides (typically, 80 to 100 amino acids long), that function as carriers for growing fatty acyl chains via a thioester linkage to a covalently bound co-factor of the protein. They occur as separate units or as domains within larger proteins. ACPs are converted from inactive apo-forms to functional holo-forms by transfer of the phosphopantetheinyl moiety of CoA to a highly conserved serine residue of the ACP. Acyl groups are attached to ACP by a thioester linkage at the free terminus of the phosphopantetheinyl moiety. ACPs can be identified by labeling with radioactive pantetheine and by sequence homology to known ACPs. The presence of variations of the above mentioned motif (LGIDS*) is also a signature of an ACP.
- Domain 12 in OrfA is a KR domain, also referred to herein as ORFA-KR. This domain is contained within the nucleotide sequence spanning from a starting point of about position 6598 of SEQ ID NO:1 to an ending point of about position 8730 of SEQ ID NO:1. The nucleotide sequence containing the sequence encoding the ORFA-KR domain is represented herein as SEQ ID NO:17 (positions 6598-8730 of SEQ ID NO:1). The amino acid sequence containing the KR domain spans from a starting point of about position 2200 of SEQ ID NO:2 (ORFA) to an ending point of about position 2910 of SEQ ID NO:2. The amino acid sequence containing the ORFA-KR domain is represented herein as SEQ ID NO:18 (positions 2200-2910 of SEQ ID NO:2). Within the KR domain is a core region with homology to short chain aldehyde-dehydrogenases (KR is a member of this family). This core region spans from about position 7198 to about position 7500 of SEQ ID NO:1, which corresponds to amino acid positions 2400-2500 of SEQ ID NO:2.
- According to the present invention, a domain or protein having ketoreductase activity, also referred to as 3-ketoacyl-ACP reductase (KR) biological activity (function), is characterized as one that catalyzes the pyridine-nucleotide-dependent reduction of 3-keto acyl forms of ACP. It is the first reductive step in the de novo fatty acid biosynthesis elongation cycle and a reaction often performed in polyketide biosynthesis. Significant sequence similarity is observed with one family of enoyl ACP reductases (ER), the other reductase of FAS (but not the ER family present in the PUFA PKS system), and the short-chain alcohol dehydrogenase family. Pfam analysis of the PUFA PKS region indicated above reveals the homology to the short-chain alcohol dehydrogenase family in the core region. Blast analysis of the same region reveals matches in the core area to known KR enzymes as well as an extended region of homology to domains from the other characterized PUFA PKS systems.
- The complete nucleotide sequence for OrfB is represented herein as SEQ ID NO:3. Nucleotides 1311-6177 of SEQ ID NO:3 correspond to nucleotides 1-4867 of the sequence denoted as SEQ ID NO:71 in U.S. application Ser. No. 09/231,899, with the exception of the nucleotide at position 2933 of SEQ ID NO:71 of the '899 application or nucleotide 4243 of SEQ ID NO:3 herein, as discussed above. The cDNA sequence in U.S. application Ser. No. 09/231,899 contains about 345 additional nucleotides beyond the stop codon, including a polyA tail). Therefore, nucleotides 1-1310 of SEQ ID NO:1 represent additional sequence that was not disclosed in U.S. application Ser. No. 09/231,899. This novel region of SEQ ID NO:3 contains most of the KS domain encoded by OrfB.
- OrfB is a 6177 nucleotide sequence (not including the stop codon) which encodes a 2059 amino acid sequence, represented herein as SEQ ID NO:4. Within OrfB are four domains: (a) one β-keto acyl-ACP synthase (KS) domain; (b) one chain length factor (CLF) domain; (c) one acyl transferase (AT) domain; and, (d) one enoyl ACP-reductase (ER) domain.
- A nucleotide sequence for OrfB has been deposited with GenBank as Accession No. AF378328 (amino acid sequence Accession No. AAK728880). The nucleotide sequence represented by GenBank Accession No. AF378328 differs from the nucleotide sequence represented herein as SEQ ID NO:3 by the point nucleotide changes: (1) at position 852 (T to C, resulting in no amino acid change at position 284 of SEQ ID NO:4); (2) at position 1110 (S to C, resulting in no amino acid change at position 370 of SEQ ID NO:4); (3) at position 1112 (Y to T, resulting in the resolution of an ambiguous amino acid call to a definite valine call at position 371 of SEQ ID NO:4); and (4) at position 4243 (C to G, resulting in a change from a glutamine to a glutamate at position 1415 of SEQ ID NO:4). The single amino acid change from the amino acid sequence encoded by GenBank Accession No. AAK728880 is located in the linker region located between the AT domain and the ER domain of SEQ ID NO:4.
- Genomic DNA clones (plasmids) encoding OrfB from both Schizochytrium sp. ATCC 20888 and a daughter strain of ATCC 20888, denoted Schizochytrium sp., strain N230D, have been isolated and sequenced. A genomic clone described herein as pJK1129, isolated from Schizochytrium sp. ATCC 20888, comprises, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:3, and encodes the amino acid sequence of SEQ ID NO:4. Genomic clone pJK1129 (denoted pJK1129 OrfB genomic clone, in the form of an E. coli plasmid vector containing “OrfB” gene from Schizochytrium ATCC 20888) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pJK1126 OrfB genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- A genomic clone described herein as pJK324 OrfB genomic clone, isolated from Schizochytrium sp. N230D, comprises, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:3, and encodes the amino acid sequence of SEQ ID NO:4. Genomic clone pJK324 (denoted pJK324 OrfB genomic clone, in the form of an E. coli plasmid containing the OrfB gene sequence from Schizochytrium sp. N230D) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pJK324 OrfB genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- OrfB was compared with known sequences in a standard BLAST search as described above. At the nucleic acid level, OrfB has no significant homology to any known nucleotide sequence. At the amino acid level, the sequences with the greatest degree of homology to ORFB were: Shewanella sp. hypothetical protein (Accession No. U73935), which was 53% identical to ORFB over 458 amino acid residues; Moritella marinus (Vibrio marinus) ORF11 (Accession No. AB025342), which was 53% identical to ORFB over 460 amino acid residues; Photobacterium profundum omega-3 polyunsaturated fatty acid synthase PfaD (Accession No. AF409100), which was 52% identical to ORFB over 457 amino acid residues; and Nostoc sp. 7120 hypothetical protein (Accession No. NC—003272), which was 53% identical to ORFB over 430 amino acid residues.
- The first domain in OrfB is a KS domain, also referred to herein as ORFB-KS. This domain is contained within the nucleotide sequence spanning from a starting point of between about
positions 1 and 43 of SEQ ID NO:3 (OrfB) to an ending point of between about positions 1332 and 1350 of SEQ ID NO:3 (based on homology to other PUFA PKS domains, the position of the KS domain spans from aboutposition 1 to about position 1350; based on Pfam analysis, a KS core region spans from about position 43 to about position 1332). The nucleotide sequence containing the sequence encoding the ORFB-KS domain is represented herein as SEQ ID NO:19 (positions 1-1350 of SEQ ID NO:3). The amino acid sequence containing the KS domain spans from a starting point of between aboutpositions 1 and 15 of SEQ ID NO:4 (ORFB) to an ending point of between about positions 444 and 450 of SEQ ID NO:4 (again, referring to the overall homology to PUFA PKS KS domains and to Pfam core regions, respectively). The amino acid sequence containing the ORFB-KS domain is represented herein as SEQ ID NO:20 (positions 1-450 of SEQ ID NO:4). This KS domain comprises a valine at position 371 of SEQ ID NO:20 (also position 371 of SEQ ID NO:20). It is noted that the ORFB-KS domain contains an active site motif: DXAC* (*acyl binding site C196). KS biological activity and methods of identifying proteins or domains having such activity is described above. - The second domain in OrfB is a CLF domain, also referred to herein as ORFB-CLF. This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1378 and 1402 of SEQ ID NO:3 (OrfB) to an ending point of between about positions 2682 and 2700 of SEQ ID NO:3 (based on homology to other PUFA PKS domains, the position of the CLF domain spans from about position 1378 to about position 2700; based on Pfam analysis, a CLF core region spans from about position 1402 to about position 2682). The nucleotide sequence containing the sequence encoding the ORFB-CLF domain is represented herein as SEQ ID NO:21 (positions 1378-2700 of SEQ ID NO:3). The amino acid sequence containing the CLF domain spans from a starting point of between about positions 460 and 468 of SEQ ID NO:4 (ORFB) to an ending point of between about positions 894 and 900 of SEQ ID NO:4 (again, referring to the overall homology to PUFA PKS CLF domains and to Pfam core regions, respectively). The amino acid sequence containing the ORFB-CLF domain is represented herein as SEQ ID NO:22 (positions 460-900 of SEQ ID NO:4). It is noted that the ORFB-CLF domain contains a KS active site motif without the acyl-binding cysteine.
- According to the present invention, a domain or protein is referred to as a chain length factor (CLF) based on the following rationale. The CLF was originally described as characteristic of Type II (dissociated enzymes) PKS systems and was hypothesized to play a role in determining the number of elongation cycles, and hence the chain length, of the end product. CLF amino acid sequences show homology to KS domains (and are thought to form heterodimers with a KS protein), but they lack the active site cysteine. CLF's role in PKS systems is currently controversial. New evidence (C. Bisang et al., Nature 401, 502 (1999)) suggests a role in priming (providing the initial acyl group to be elongated) the PKS systems. In this role the CLF domain is thought to decarboxylate malonate (as malonyl-ACP), thus forming an acetate group that can be transferred to the KS active site. This acetate therefore acts as the ‘priming’ molecule that can undergo the initial elongation (condensation) reaction. Homologues of the Type II CLF have been identified as ‘loading’ domains in some modular PKS systems. A domain with the sequence features of the CLF is found in all currently identified PUFA PKS systems and in each case is found as part of a multidomain protein.
- The third domain in OrfB is an AT domain, also referred to herein as ORFB-AT. This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 2701 and 3598 of SEQ ID NO:3 (OrfB) to an ending point of between about positions 3975 and 4200 of SEQ ID NO:3 (based on homology to other PUFA PKS domains, the position of the AT domain spans from about position 2701 to about position 4200; based on Pfam analysis, an AT core region spans from about position 3598 to about position 3975). The nucleotide sequence containing the sequence encoding the ORFB-AT domain is represented herein as SEQ ID NO:23 (positions 2701-4200 of SEQ ID NO:3). The amino acid sequence containing the AT domain spans from a starting point of between about positions 901 and 1200 of SEQ ID NO:4 (ORFB) to an ending point of between about positions 1325 and 1400 of SEQ ID NO:4 (again, referring to the overall homology to PUFA PKS AT domains and to Pfam core regions, respectively). The amino acid sequence containing the ORFB-AT domain is represented herein as SEQ ID NO:24 (positions 901-1400 of SEQ ID NO:4). It is noted that the ORFB-AT domain contains an active site motif of GxS*xG (*acyl binding site S1140) that is characteristic of acyltransferse (AT) proteins.
- An “acyltransferase” or “AT” refers to a general class of enzymes that can carry out a number of distinct acyl transfer reactions. The Schizochytrium domain shows good homology to a domain present in all of the other PUFA PKS systems currently examined and very weak homology to some acyltransferases whose specific functions have been identified (e.g. to malonyl-CoA:ACP acyltransferase, MAT). In spite of the weak homology to MAT, this AT domain is not believed to function as a MAT because it does not possess an extended motif structure characteristic of such enzymes (see MAT domain description, above). For the purposes of this disclosure, the functions of the AT domain in a PUFA PKS system include, but are not limited to: transfer of the fatty acyl group from the ORFA ACP domain(s) to water (i.e. a thioesterase—releasing the fatty acyl group as a free fatty acid), transfer of a fatty acyl group to an acceptor such as CoA, transfer of the acyl group among the various ACP domains, or transfer of the fatty acyl group to a lipophilic acceptor molecule (e.g. to lysophosphadic acid).
- The fourth domain in OrfB is an ER domain, also referred to herein as ORFB-ER. This domain is contained within the nucleotide sequence spanning from a starting point of about position 4648 of SEQ ID NO:3 (OrfB) to an ending point of about position 6177 of SEQ ID NO:3. The nucleotide sequence containing the sequence encoding the ORFB-ER domain is represented herein as SEQ ID NO:25 (positions 4648-6177 of SEQ ID NO:3). The amino acid sequence containing the ER domain spans from a starting point of about position 1550 of SEQ ID NO:4 (ORFB) to an ending point of about position 2059 of SEQ ID NO:4. The amino acid sequence containing the ORFB-ER domain is represented herein as SEQ ID NO:26 (positions 1550-2059 of SEQ ID NO:4).
- According to the present invention, this domain has enoyl reductase (ER) biological activity. The ER enzyme reduces the trans-double bond (introduced by the DH activity) in the fatty acyl-ACP, resulting in fully saturating those carbons. The ER domain in the PUFA-PKS shows homology to a newly characterized family of ER enzymes (Heath et al., Nature 406, 145 (2000)). Heath and Rock identified this new class of ER enzymes by cloning a gene of interest from Streptococcus pneumoniae, purifying a protein expressed from that gene, and showing that it had ER activity in an in vitro assay. The sequence of the Schizochytrium ER domain of OrfB shows homology to the S. pneumoniae ER protein. All of the PUFA PKS systems currently examined contain at least one domain with very high sequence homology to the Schizochytrium ER domain. The Schizochytrium PUFA PKS system contains two ER domains (one on OrfB and one on OrfC).
- The complete nucleotide sequence for OrfC is represented herein as SEQ ID NO:5. Nucleotides 1-4506 of SEQ ID NO:5 (i.e., the entire open reading frame sequence, not including the stop codon) nearly correspond to nucleotides 145-4653 of the sequence denoted as SEQ ID NO:76 in U.S. application Ser. No. 09/231,899. The cDNA sequence in U.S. application Ser. No. 09/231,899 contains about 144 nucleotides upstream of the start codon for OrfC and about 110 nucleotides beyond the stop codon, including a polyA tail. In addition, as discussed above, nucleotides 145-4653 of the cDNA sequence containing the complete open reading frame described in U.S. application Ser. No. 09/231,899 (denoted therein as SEQ ID NO:76) match nucleotides 1-2624 and 2675-4506 of the sequence denoted herein as OrfC (SEQ ID NO:5). OrfC is a 4506 nucleotide sequence (not including the stop codon) which encodes a 1502 amino acid sequence, represented herein as SEQ ID NO:6. Within OrfC are three domains: (a) two FabA-like β-hydroxy acyl-ACP dehydrase (DH) domains; and (b) one enoyl ACP-reductase (ER) domain.
- A nucleotide sequence for OrfC has been deposited with GenBank as Accession No. AF378329 (amino acid sequence Accession No. AAK728881). The nucleotide sequence represented by AF378329 differs from the nucleotide sequence represented herein as SEQ ID NO:5 by the point nucleotide insertions: (1) at position 2625 (an insertion of an A); (2) at position 2662 (an insertion of a C); and (3) at position 2674 (an insertion of an A). This resulted in a frame shift out of frame at position 2625 and then back into frame at position 2675. The amino acid sequence encoded by GenBank Accession No. AAK728881 differs from the amino acid sequence encoded by SEQ ID NO:5 (i.e., SEQ ID NO:6) in the region spanning from positions 876-891 of GenBank Accession No. AAK728881 or positions 876-890 of SEQ ID NO:6. This change in sequence occurs in the DH2 domain of OrfC (discussed below).
- Genomic DNA clones (plasmids) encoding OrfC from both Schizochytrium sp. ATCC 20888 and a daughter strain of ATCC 20888, denoted Schizochytrium sp., strain N230D, have been isolated and sequenced. A genomic clone described herein as pJK1131, isolated from Schizochytrium sp. ATCC 20888, comprises, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:5, and encodes the amino acid sequence of SEQ ID NO:6. Genomic clone pJK1131 (denoted pJK1131 OrfC genomic clone, in the form of an E. coli plasmid vector containing “OrfC” gene from Schizochytrium ATCC 20888) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pJK1131 OrfC genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- A genomic clone described herein as pBR002 OrfC genomic clone, isolated from Schizochytrium sp. N230D, comprises, to the best of the present inventors' knowledge, the nucleotide sequence of SEQ ID NO:5, and encodes the amino acid sequence of SEQ ID NO:6. Genomic clone pBR002 (denoted pBR002 OrfC genomic clone, in the form of an E. coli plasmid vector containing the OrfC gene sequence from Schizochytrium sp. N230D) was deposited with the American Type Culture Collection (ATCC), 10801 University Boulevard, Manassas, Va. 20110-2209 USA on Jun. 8, 2006, and assigned ATCC Accession No. ______. The nucleotide sequence of pBR002 OrfC genomic clone, and the amino acid sequence encoded by this plasmid are encompassed by the present invention.
- OrfC was compared with known sequences in a standard BLAST search as described above. At the nucleic acid level, OrfC has no significant homology to any known nucleotide sequence. At the amino acid level (Blastp), the sequences with the greatest degree of homology to ORFC were: Moritella marinus (Vibrio marinus) ORF11 (Accession No. ABO25342), which is 45% identical to ORFC over 514 amino acid residues, Shewanella sp. hypothetical protein 8 (Accession No. U73935), which is 49% identical to ORFC over 447 amino acid residues, Nostoc sp. hypothetical protein (Accession No. NC—003272), which is 49% identical to ORFC over 430 amino acid residues, and Shewanella sp. hypothetical protein 7 (Accession No. U73935), which is 37% identical to ORFC over 930 amino acid residues.
- The first domain in OrfC is a DH domain, also referred to herein as ORFC-DH1. This is one of two DH domains in OrfC, and therefore is designated DH1. This domain is contained within the nucleotide sequence spanning from a starting point of between about
positions 1 and 778 of SEQ ID NO:5 (OrfC) to an ending point of between about positions 1233 and 1350 of SEQ ID NO:5 (based on homology to other PUFA PKS domains, the position of the DH1 domain spans from aboutposition 1 to about position 1350; based on Pfam analysis, a DH core region spans from about position 778 to about position 1233). The nucleotide sequence containing the sequence encoding the ORFC-DH1 domain is represented herein as SEQ ID NO:27 (positions 1-1350 of SEQ ID NO:5). The amino acid sequence containing the DH1 domain spans from a starting point of between aboutpositions 1 and 260 of SEQ ID NO:6 (ORFC) to an ending point of between about positions 411 and 450 of SEQ ID NO:6 (again, referring to the overall homology to PUFA PKS DH domains and to Pfam core regions, respectively). The amino acid sequence containing the ORFC-DH1 domain is represented herein as SEQ ID NO:28 (positions 1-450 of SEQ ID NO:6). - The characteristics of both the DH domains (see below for DH 2) in the PUFA PKS systems have been described in the preceding sections. This class of enzyme removes HOH from a β-keto acyl-ACP and leaves a trans double bond in the carbon chain. The DH domains of the PUFA PKS systems show homology to bacterial DH enzymes associated with their FAS systems (rather than to the DH domains of other PKS systems). A subset of bacterial DH's, the FabA-like DH's, possesses cis-trans isomerase activity (Heath et al., J. Biol. Chem., 271, 27795 (1996)). It is the homologies to the FabA-like DH's that indicate that one or both of the DH domains is responsible for insertion of the cis double bonds in the PUFA PKS products.
- The second domain in OrfC is a DH domain, also referred to herein as ORFC-DH2. This is the second of two DH domains in OrfC, and therefore is designated DH2. This domain is contained within the nucleotide sequence spanning from a starting point of between about positions 1351 and 2437 of SEQ ID NO:5 (OrfC) to an ending point of between about positions 2607 and 2847 of SEQ ID NO:5 (based on homology to other PUFA PKS domains, the position of the DH2 domain spans from about position 1351 to about position 2845; based on Pfam analysis, a DH core region spans from about position 2437 to about position 2847). The nucleotide sequence containing the sequence encoding the ORFC-DH2 domain is represented herein as SEQ ID NO:29 (positions 1351-2847 of SEQ ID NO:5). The amino acid sequence containing the DH2 domain spans from a starting point of between about positions 451 and 813 of SEQ ID NO:6 (ORFC) to an ending point of between about positions 869 and 949 of SEQ ID NO:6 (again, referring to the overall homology to PUFA PKS DH domains and to Pfam core regions, respectively). The amino acid sequence containing the ORFC-DH2 domain is represented herein as SEQ ID NO:30 (positions 451-949 of SEQ ID NO:6). This DH domain comprises the amino acids H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions 426-440 of SEQ ID NO:30. DH biological activity has been described above.
- The third domain in OrfC is an ER domain, also referred to herein as ORFC-ER. This domain is contained within the nucleotide sequence spanning from a starting point of about position 2995 of SEQ ID NO:5 (OrfC) to an ending point of about position 4506 of SEQ ID NO:5. The nucleotide sequence containing the sequence encoding the ORFC-ER domain is represented herein as SEQ ID NO:31 (positions 2995-4506 of SEQ ID NO:5). The amino acid sequence containing the ER domain spans from a starting point of about position 999 of SEQ ID NO:6 (ORFC) to an ending point of about position 1502 of SEQ ID NO:6. The amino acid sequence containing the ORFC-ER domain is represented herein as SEQ ID NO:32 (positions 999-1502 of SEQ ID NO:6). ER biological activity has been described above.
- According to the present invention, a domain or protein having 4′-phosphopantetheinyl transferase (PPTase) biological activity (function) is characterized as the enzyme that transfers a 4′-phosphopantetheinyl moiety from Coenzyme A to the acyl carrier protein (ACP). This transfer to an invariant serine reside of the ACP activates the inactive apo-form to the holo-form. In both polyketide and fatty acid synthesis, the phosphopantetheine group forms thioesters with the growing acyl chains. The PPTases are a family of enzymes that have been well characterized in fatty acid synthesis, polyketide synthesis, and non-ribosomal peptide synthesis. The sequences of many PPTases are known, and crystal structures have been determined (e.g., Reuter K, Mofid M R, Marahiel M A, Ficner R. “Crystal structure of the surfactin synthetase-activating enzyme sfp: a prototype of the 4′-phosphopantetheinyl transferase superfamily” EMBO J. 1999 Dec. 1; 18(23):6823-31) as well as mutational analysis of amino acid residues important for activity (Mofid M R, Finking R, Essen L O, Marahiel M A. “Structure-based mutational analysis of the 4′-phosphopantetheinyl transferases Sfp from Bacillus subtilis: carrier protein recognition and reaction mechanism” Biochemistry. 2004 Apr. 13; 43(14):4128-36).
- The present inventors have identified two sequences (genes) in the Arabidopsis whole genome database that are likely to encode PPTases. These sequences (GenBank Accession numbers; AAG51443 and AAC05345) are currently listed as encoding “Unknown Proteins”. They can be identified as putative PPTases based on the presence in the translated protein sequences of several signature motifs including; G(I/V)D and WxxKE(A/S)xxK (SEQ ID NO:33), (listed in Lambalot et al., 1996 as characteristic of all PPTases). In addition, these two putative proteins contain two additional motifs typically found in PPTases typically associated with PKS and non-ribosomal peptide synthesis systems; i.e., FN(I/L/V)SHS (SEQ ID NO:34) and (1N/L)G(I/L/V)D(I/L/V) (SEQ ID NO:35). Furthermore, these motifs occur in the expected relative positions in the protein sequences. It is likely that homologues of the Arabidopsis genes are present in other plants, such as tobacco. Again, these genes can be cloned and expressed to see if the enzymes they encode can activate the Schizochytrium ORFA ACP domains, or alternatively, OrfA could be expressed directly in the transgenic plant (either targeted to the plastid or the cytoplasm).
- Another heterologous PPTase which has been demonstrated by the inventors to recognize the OrfA ACP domains described herein as substrates is the Het I protein of Nostoc sp. PCC 7120 (formerly called Anabaena sp. PCC 7120).
- One embodiment of the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence from a non-bacterial PUFA PKS system, a homologue thereof, a fragment thereof, and/or a nucleic acid sequence that is complementary to any of such nucleic acid sequences. In one aspect, the present invention relates to an isolated nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of: (a) a nucleic acid sequence encoding an amino acid sequence selected from the group consisting of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, and biologically active fragments thereof; (b) a nucleic acid sequence encoding an amino acid sequence selected from the group consisting of: SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, and biologically active fragments thereof; (c) a nucleic acid sequence encoding an amino acid sequence that is at least about 60% identical to at least 500 consecutive amino acids of said amino acid sequence of (a), wherein said amino acid sequence has a biological activity of at least one domain of a polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) system; (d) a nucleic acid sequence encoding an amino acid sequence that is at least about 60% identical to said amino acid sequence of (b), wherein said amino acid sequence has a biological activity of at least one domain of a polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) system; or (e) a nucleic acid sequence that is fully complementary to the nucleic acid sequence of (a), (b), (c), or (d). In a further embodiment, nucleic acid sequences including a sequence encoding the active site domains or other functional motifs described above for several of the PUFA PKS domains are encompassed by the invention.
- According to the present invention, an amino acid sequence that has a biological activity of at least one domain of a PUFA PKS system is an amino acid sequence that has the biological activity of at least one domain of the PUFA PKS system described in detail herein, as exemplified by the Schizochytrium PUFA PKS system. The biological activities of the various domains within the Schizochytrium PUFA PKS system have been described in detail above. Therefore, an isolated nucleic acid molecule of the present invention can encode the translation product of any PUFA PKS open reading frame, PUFA PKS domain, biologically active fragment thereof, or any homologue of a naturally occurring PUFA PKS open reading frame or domain which has biological activity. A homologue of given protein or domain is a protein or polypeptide that has an amino acid sequence which differs from the naturally occurring reference amino acid sequence (i.e., of the reference protein or domain) in that at least one or a few, but not limited to one or a few, amino acids have been deleted (e.g., a truncated version of the protein, such as a peptide or fragment), inserted, inverted, substituted and/or derivatized (e.g., by glycosylation, phosphorylation, acetylation, myristoylation, prenylation, palmitation, amidation and/or addition of glycosylphosphatidyl inositol). Preferred homologues of a PUFA PKS protein or domain are described in detail below. It is noted that homologues can include synthetically produced homologues, naturally occurring allelic variants of a given protein or domain, or homologous sequences from organisms other than the organism from which the reference sequence was derived.
- In general, the biological activity or biological action of a protein or domain refers to any function(s) exhibited or performed by the protein or domain that is ascribed to the naturally occurring form of the protein or domain as measured or observed in vivo (i.e., in the natural physiological environment of the protein) or in vitro (i.e., under laboratory conditions). Biological activities of PUFA PKS systems and the individual proteins/domains that make up a PUFA PKS system have been described in detail elsewhere herein. Modifications of a protein or domain, such as in a homologue or mimetic (discussed below), may result in proteins or domains having the same biological activity as the naturally occurring protein or domain, or in proteins or domains having decreased or increased biological activity as compared to the naturally occurring protein or domain. Modifications which result in a decrease in expression or a decrease in the activity of the protein or domain, can be referred to as inactivation (complete or partial), down-regulation, or decreased action of a protein or domain. Similarly, modifications which result in an increase in expression or an increase in the activity of the protein or domain, can be referred to as amplification, overproduction, activation, enhancement, up-regulation or increased action of a protein or domain. A functional domain of a PUFA PKS system is a domain (i.e., a domain can be a portion of a protein) that is capable of performing a biological function (i.e., has biological activity).
- In accordance with the present invention, an isolated nucleic acid molecule is a nucleic acid molecule that has been removed from its natural milieu (i.e., that has been subject to human manipulation), its natural milieu being the genome or chromosome in which the nucleic acid molecule is found in nature. As such, “isolated” does not necessarily reflect the extent to which the nucleic acid molecule has been purified, but indicates that the molecule does not include an entire genome or an entire chromosome in which the nucleic acid molecule is found in nature. An isolated nucleic acid molecule can include a gene. An isolated nucleic acid molecule that includes a gene is not a fragment of a chromosome that includes such gene, but rather includes the coding region and regulatory regions associated with the gene, but no additional genes naturally found on the same chromosome. An isolated nucleic acid molecule can also include a specified nucleic acid sequence flanked by (i.e., at the 5′ and/or the 3′ end of the sequence) additional nucleic acids that do not normally flank the specified nucleic acid sequence in nature (i.e., heterologous sequences). Isolated nucleic acid molecule can include DNA, RNA (e.g., mRNA), or derivatives of either DNA or RNA (e.g., cDNA). Although the phrase “nucleic acid molecule” primarily refers to the physical nucleic acid molecule and the phrase “nucleic acid sequence” primarily refers to the sequence of nucleotides on the nucleic acid molecule, the two phrases can be used interchangeably, especially with respect to a nucleic acid molecule, or a nucleic acid sequence, being capable of encoding a protein or domain of a protein.
- Preferably, an isolated nucleic acid molecule of the present invention is produced using recombinant DNA technology (e.g., polymerase chain reaction (PCR) amplification, cloning) or chemical synthesis. Isolated nucleic acid molecules include natural nucleic acid molecules and homologues thereof, including, but not limited to, natural allelic variants and modified nucleic acid molecules in which nucleotides have been inserted, deleted, substituted, and/or inverted in such a manner that such modifications provide the desired effect on PUFA PKS system biological activity as described herein. Protein homologues (e.g., proteins encoded by nucleic acid homologues) have been discussed in detail above.
- A nucleic acid molecule homologue can be produced using a number of methods known to those skilled in the art (see, for example, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press, 1989). For example, nucleic acid molecules can be modified using a variety of techniques including, but not limited to, classic mutagenesis techniques and recombinant DNA techniques, such as site-directed mutagenesis, chemical treatment of a nucleic acid molecule to induce mutations, restriction enzyme cleavage of a nucleic acid fragment, ligation of nucleic acid fragments, PCR amplification and/or mutagenesis of selected regions of a nucleic acid sequence, synthesis of oligonucleotide mixtures and ligation of mixture groups to “build” a mixture of nucleic acid molecules and combinations thereof. Nucleic acid molecule homologues can be selected from a mixture of modified nucleic acids by screening for the function of the protein encoded by the nucleic acid and/or by hybridization with a wild-type gene.
- The minimum size of a nucleic acid molecule of the present invention is a size sufficient to form a probe or oligonucleotide primer that is capable of forming a stable hybrid (e.g., under moderate, high or very high stringency conditions) with the complementary sequence of a nucleic acid molecule useful in the present invention, or of a size sufficient to encode an amino acid sequence having a biological activity of at least one domain of a PUFA PKS system according to the present invention. As such, the size of the nucleic acid molecule encoding such a protein can be dependent on nucleic acid composition and percent homology or identity between the nucleic acid molecule and complementary sequence as well as upon hybridization conditions per se (e.g., temperature, salt concentration, and formamide concentration). The minimal size of a nucleic acid molecule that is used as an oligonucleotide primer or as a probe is typically at least about 12 to about 15 nucleotides in length if the nucleic acid molecules are GC-rich and at least about 15 to about 18 bases in length if they are AT-rich. There is no limit, other than a practical limit, on the maximal size of a nucleic acid molecule of the present invention, in that the nucleic acid molecule can include a sequence sufficient to encode a biologically active fragment of a domain of a PUFA PKS system, an entire domain of a PUFA PKS system, several domains within an open reading frame (Orf) of a PUFA PKS system, an entire Orf of a PUFA PKS system, or more than one Orf of a PUFA PKS system.
- In one embodiment of the present invention, an isolated nucleic acid molecule comprises or consists essentially of a nucleic acid sequence encoding an amino acid sequence selected from the group of: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, or biologically active fragments thereof. In one aspect, the nucleic acid sequence is selected from the group of: SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:12, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, and SEQ ID NO:31.
- In one embodiment of the present invention, any of the above-described PUFA PKS amino acid sequences, as well as homologues of such sequences, can be produced with from at least one, and up to about 20, additional heterologous amino acids flanking each of the C- and/or N-terminal end of the given amino acid sequence. The resulting protein or polypeptide can be referred to as “consisting essentially of” a given amino acid sequence. According to the present invention, the heterologous amino acids are a sequence of amino acids that are not naturally found (i.e., not found in nature, in vivo) flanking the given amino acid sequence or which would not be encoded by the nucleotides that flank the naturally occurring nucleic acid sequence encoding the given amino acid sequence as it occurs in the gene, if such nucleotides in the naturally occurring sequence were translated using standard codon usage for the organism from which the given amino acid sequence is derived. Similarly, the phrase “consisting essentially of”, when used with reference to a nucleic acid sequence herein, refers to a nucleic acid sequence encoding a given amino acid sequence that can be flanked by from at least one, and up to as many as about 60, additional heterologous nucleotides at each of the 5′ and/or the 3′ end of the nucleic acid sequence encoding the given amino acid sequence. The heterologous nucleotides are not naturally found (i.e., not found in nature, in vivo) flanking the nucleic acid sequence encoding the given amino acid sequence as it occurs in the natural gene.
- The present invention also includes an isolated nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence having a biological activity of at least one domain of a PUFA PKS system. In one aspect, such a nucleic acid sequence encodes a homologue of any of the Schizochytrium PUFA PKS ORFs or domains, including: SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32, wherein the homologue has a biological activity of at least one (or two, three, four or more) domain of a PUFA PKS system as described previously herein.
- In one aspect of the invention, a homologue of a Schizochytrium PUFA PKS protein or domain encompassed by the present invention comprises an amino acid sequence that is at least about 60% identical to at least 500 consecutive amino acids of an amino acid sequence chosen from: SEQ ID NO:2, SEQ ID NO:4, and SEQ ID NO:6; wherein said amino acid sequence has a biological activity of at least one domain of a PUFA PKS system. In a further aspect, the amino acid sequence of the homologue is at least about 60% identical to at least about 600 consecutive amino acids, and more preferably to at least about 700 consecutive amino acids, and more preferably to at least about 800 consecutive amino acids, and more preferably to at least about 900 consecutive amino acids, and more preferably to at least about 1000 consecutive amino acids, and more preferably to at least about 1100 consecutive amino acids, and more preferably to at least about 1200 consecutive amino acids, and more preferably to at least about 1300 consecutive amino acids, and more preferably to at least about 1400 consecutive amino acids, and more preferably to at least about 1500 consecutive amino acids of any of SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO:6, or to the full length of SEQ ID NO:6. In a further aspect, the amino acid sequence of the homologue is at least about 60% identical to at least about 1600 consecutive amino acids, and more preferably to at least about 1700 consecutive amino acids, and more preferably to at least about 1800 consecutive amino acids, and more preferably to at least about 1900 consecutive amino acids, and more preferably to at least about 2000 consecutive amino acids of any of SEQ ID NO:2 or SEQ ID NO:4, or to the full length of SEQ ID NO:4. In a further aspect, the amino acid sequence of the homologue is at least about 60% identical to at least about 2100 consecutive amino acids, and more preferably to at least about 2200 consecutive amino acids, and more preferably to at least about 2300 consecutive amino acids, and more preferably to at least about 2400 consecutive amino acids, and more preferably to at least about 2500 consecutive amino acids, and more preferably to at least about 2600 consecutive amino acids, and more preferably to at least about 2700 consecutive amino acids, and more preferably to at least about 2800 consecutive amino acids, and even more preferably, to the full length of SEQ ID NO:2.
- In another aspect, a homologue of a Schizochytrium PUFA PKS protein or domain encompassed by the present invention comprises an amino acid sequence that is at least about 65% identical, and more preferably at least about 70% identical, and more preferably at least about 75% identical, and more preferably at least about 80% identical, and more preferably at least about 85% identical, and more preferably at least about 90% identical, and more preferably at least about 95% identical, and more preferably at least about 96% identical, and more preferably at least about 97% identical, and more preferably at least about 98% identical, and more preferably at least about 99% identical to an amino acid sequence chosen from: SEQ ID NO:2, SEQ ID NO:4, or SEQ ID NO:6, over any of the consecutive amino acid lengths described in the paragraph above, wherein the amino acid sequence has a biological activity of at least one domain of a PUFA PKS system.
- In one aspect of the invention, a homologue of a Schizochytrium PUFA PKS protein or domain encompassed by the present invention comprises an amino acid sequence that is at least about 60% identical to an amino acid sequence chosen from: SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32, wherein said amino acid sequence has a biological activity of at least one domain of a PUFA PKS system. In a further aspect, the amino acid sequence of the homologue is at least about 65% identical, and more preferably at least about 70% identical, and more preferably at least about 75% identical, and more preferably at least about 80% identical, and more preferably at least about 85% identical, and more preferably at least about 90% identical, and more preferably at least about 95% identical, and more preferably at least about 96% identical, and more preferably at least about 97% identical, and more preferably at least about 98% identical, and more preferably at least about 99% identical to an amino acid sequence chosen from: SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, wherein the amino acid sequence has a biological activity of at least one domain of a PUFA PKS system.
- According to the present invention, the term “contiguous” or “consecutive”, with regard to nucleic acid or amino acid sequences described herein, means to be connected in an unbroken sequence. For example, for a first sequence to comprise 30 contiguous (or consecutive) amino acids of a second sequence, means that the first sequence includes an unbroken sequence of 30 amino acid residues that is 100% identical to an unbroken sequence of 30 amino acid residues in the second sequence. Similarly, for a first sequence to have “100% identity” with a second sequence means that the first sequence exactly matches the second sequence with no gaps between nucleotides or amino acids.
- As used herein, unless otherwise specified, reference to a percent (%) identity refers to an evaluation of homology which is performed using: (1) a BLAST 2.0 Basic BLAST homology search using blastp for amino acid searches, blastn for nucleic acid searches, and blastX for nucleic acid searches and searches of translated amino acids in all 6 open reading frames, all with standard default parameters, wherein the query sequence is filtered for low complexity regions by default (described in Altschul, S. F., Madden, T. L., Schääffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:3389-3402, incorporated herein by reference in its entirety); (2) a BLAST 2 alignment (using the parameters described below); (3) and/or PSI-BLAST with the standard default parameters (Position-Specific Iterated BLAST). It is noted that due to some differences in the standard parameters between BLAST 2.0 Basic BLAST and BLAST 2, two specific sequences might be recognized as having significant homology using the BLAST 2 program, whereas a search performed in BLAST 2.0 Basic BLAST using one of the sequences as the query sequence may not identify the second sequence in the top matches. In addition, PSI-BLAST provides an automated, easy-to-use version of a “profile” search, which is a sensitive way to look for sequence homologues. The program first performs a gapped BLAST database search. The PSI-BLAST program uses the information from any significant alignments returned to construct a position-specific score matrix, which replaces the query sequence for the next round of database searching. Therefore, it is to be understood that percent identity can be determined by using any one of these programs.
- Two specific sequences can be aligned to one another using BLAST 2 sequence as described in Tatusova and Madden, (1999), “Blast 2 sequences—a new tool for comparing protein and nucleotide sequences”, FEMS Microbiol Lett. 174:247-250, incorporated herein by reference in its entirety. BLAST 2 sequence alignment is performed in blastp or blastn using the BLAST 2.0 algorithm to perform a Gapped BLAST search (BLAST 2.0) between the two sequences allowing for the introduction of gaps (deletions and insertions) in the resulting alignment. For purposes of clarity herein, a BLAST 2 sequence alignment is performed using the standard default parameters as follows.
- For blastn, using 0 BLOSUM62 matrix:
- Reward for match=1
- Penalty for mismatch=−2
- Open gap (5) and extension gap (2) penalties
- gap x_dropoff (50) expect (10) word size (11) filter (on)
- For blastp, using 0 BLOSUM62 matrix:
- Open gap (11) and extension gap (1) penalties
- gap x_dropoff (50) expect (10) word size (3) filter (on).
- In another embodiment of the invention, an amino acid sequence having the biological activity of at least one domain of a PUFA PKS system of the present invention includes an amino acid sequence that is sufficiently similar to a naturally occurring PUFA PKS protein or polypeptide that a nucleic acid sequence encoding the amino acid sequence is capable of hybridizing under moderate, high, or very high stringency conditions (described below) to (i.e., with) a nucleic acid molecule encoding the naturally occurring PUFA PKS protein or polypeptide (i.e., to the complement of the nucleic acid strand encoding the naturally occurring PUFA PKS protein or polypeptide). Preferably, an amino acid sequence having the biological activity of at least one domain of a PUFA PKS system of the present invention is encoded by a nucleic acid sequence that hybridizes under moderate, high or very high stringency conditions to the complement of a nucleic acid sequence that encodes a protein comprising an amino acid sequence represented by any of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32.
- In another embodiment of the invention, a nucleotide sequence of the present invention is a nucleotide sequence isolated from (obtainable from), identical to, or a homologue of, the nucleotide sequence from a Schizochytrium, wherein the nucleotide sequence from a Schizochytrium (including either strand of a DNA molecule from Schizochytrium) hybridizes under moderate, high, or very high stringency conditions to a nucleotide sequence encoding an amino acid sequence represented by any of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:13, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, or SEQ ID NO:32. In one embodiment, the Schizochytrium is Schizochytrium ATCC 20888. In another embodiment, the Schizochytrium is a daughter strain of Schizochytrium 20888, including mutated strains thereof (e.g., N230D).
- Methods to deduce a complementary sequence are known to those skilled in the art.
- It should be noted that since amino acid sequencing and nucleic acid sequencing technologies are not entirely error-free, the sequences presented herein, at best, represent apparent sequences of PUFA PKS domains and proteins of the present invention, or of the nucleotide sequences encoding such amino acid sequences.
- As used herein, hybridization conditions refer to standard hybridization conditions under which nucleic acid molecules are used to identify similar nucleic acid molecules. Such standard conditions are disclosed, for example, in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press, 1989. Sambrook et al., ibid., is incorporated by reference herein in its entirety (see specifically, pages 9.31-9.62). In addition, formulae to calculate the appropriate hybridization and wash conditions to achieve hybridization permitting varying degrees of mismatch of nucleotides are disclosed, for example, in Meinkoth et al., 1984, Anal. Biochem. 138, 267-284; Meinkoth et al., ibid., is incorporated by reference herein in its entirety.
- More particularly, moderate stringency hybridization and washing conditions, as referred to herein, refer to conditions which permit isolation of nucleic acid molecules having at least about 70% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 30% or less mismatch of nucleotides). High stringency hybridization and washing conditions, as referred to herein, refer to conditions which permit isolation of nucleic acid molecules having at least about 80% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 20% or less mismatch of nucleotides). Very high stringency hybridization and washing conditions, as referred to herein, refer to conditions which permit isolation of nucleic acid molecules having at least about 90% nucleic acid sequence identity with the nucleic acid molecule being used to probe in the hybridization reaction (i.e., conditions permitting about 10% or less mismatch of nucleotides). As discussed above, one of skill in the art can use the formulae in Meinkoth et al., ibid. to calculate the appropriate hybridization and wash conditions to achieve these particular levels of nucleotide mismatch. Such conditions will vary, depending on whether DNA:RNA or DNA:DNA hybrids are being formed. Calculated melting temperatures for DNA:DNA hybrids are 10° C. less than for DNA:RNA hybrids. In particular embodiments, stringent hybridization conditions for DNA:DNA hybrids include hybridization at an ionic strength of 6×SSC (0.9 M Na+) at a temperature of between about 20° C. and about 35° C. (lower stringency), more preferably, between about 28° C. and about 40° C. (more stringent), and even more preferably, between about 35° C. and about 45° C. (even more stringent), with appropriate wash conditions. In particular embodiments, stringent hybridization conditions for DNA:RNA hybrids include hybridization at an ionic strength of 6×SSC (0.9 M Na+) at a temperature of between about 30° C. and about 45° C., more preferably, between about 38° C. and about 50° C., and even more preferably, between about 45° C. and about 55° C., with similarly stringent wash conditions. These values are based on calculations of a melting temperature for molecules larger than about 100 nucleotides, 0% formamide and a G+C content of about 40%. Alternatively, Tm can be calculated empirically as set forth in Sambrook et al., supra, pages 9.31 to 9.62. In general, the wash conditions should be as stringent as possible, and should be appropriate for the chosen hybridization conditions. For example, hybridization conditions can include a combination of salt and temperature conditions that are approximately 20-25° C. below the calculated Tm of a particular hybrid, and wash conditions typically include a combination of salt and temperature conditions that are approximately 12-20° C. below the calculated Tm of the particular hybrid. One example of hybridization conditions suitable for use with DNA:DNA hybrids includes a 2-24 hour hybridization in 6×SSC (50% formamide) at about 42° C., followed by washing steps that include one or more washes at room temperature in about 2×SSC, followed by additional washes at higher temperatures and lower ionic strength (e.g., at least one wash as about 37° C. in about 0.1×-0.5×SSC, followed by at least one wash at about 68° C. in about 0.1×-0.5×SSC).
- Yet another embodiment of the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that is identical to, or that is a homologue of (as defined above) the nucleic acid sequence of a cDNA plasmid clone selected from LIB3033-046-D2 (ATCC Accession No. ______), LIB3033-047-B5 (ATCC Accession No. ______), or LIB81-042-B9 (ATCC Accession No. ______). In another embodiment, the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that is identical to, or that is a homologue of (as defined above) the nucleic acid sequence of a genomic plasmid selected from: pJK1126 (ATCC Accession No. ______), pJK1129 (ATCC Accession No. ______), pJK1131 (ATCC Accession No. ______), pJK306 (ATCC Accession No. ______), pJK320 (ATCC Accession No. ______), pJK324 (ATCC Accession No. ______), or pBR002 (ATCC Accession No. ______)
- Yet another embodiment of the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that encodes an amino acid sequence that is identical to, or that is a homologue of (as defined above) the amino acid sequence encoded by a cDNA plasmid clone selected from LIB3033-046-D2 (ATCC Accession No. ______), LIB3033-047-B5 (ATCC Accession No. ______), or LIB81-042-B9 (ATCC Accession No. ______). In another embodiment, the present invention includes a nucleic acid molecule comprising, consisting essentially of, or consisting of, a nucleic acid sequence that encodes an amino acid sequence that is identical to, or that is a homologue of (as defined above) the amino acid sequence encoded by a genomic plasmid selected from: pJK1126 (ATCC Accession No. ______), pJK1129 (ATCC Accession No. ______), pJK1131 (ATCC Accession No. ______), pJK306 (ATCC Accession No. ______), pJK320 (ATCC Accession No. ______), pJK324 (ATCC Accession No. ______), or pBR002 (ATCC Accession No. ______).
- Another embodiment of the present invention includes a recombinant nucleic acid molecule comprising a recombinant vector and a nucleic acid molecule comprising a nucleic acid sequence encoding an amino acid sequence having a biological activity of at least one domain or protein of a PUFA PKS system as described herein. Such nucleic acid sequences and domains or proteins are described in detail above. According to the present invention, a recombinant vector is an engineered (i.e., artificially produced) nucleic acid molecule that is used as a tool for manipulating a nucleic acid sequence of choice and for introducing such a nucleic acid sequence into a host cell. The recombinant vector is therefore suitable for use in cloning, sequencing, and/or otherwise manipulating the nucleic acid sequence of choice, such as by expressing and/or delivering the nucleic acid sequence of choice into a host cell to form a recombinant cell. Such a vector typically contains heterologous nucleic acid sequences, that is nucleic acid sequences that are not naturally found adjacent to nucleic acid sequence to be cloned or delivered, although the vector can also contain regulatory nucleic acid sequences (e.g., promoters, untranslated regions) which are naturally found adjacent to nucleic acid molecules of the present invention or which are useful for expression of the nucleic acid molecules of the present invention (discussed in detail below). The vector can be either RNA or DNA, either prokaryotic or eukaryotic, and typically is a plasmid. The vector can be maintained as an extrachromosomal element (e.g., a plasmid) or it can be integrated into the chromosome of a recombinant organism (e.g., a microbe or a plant). The entire vector can remain in place within a host cell, or under certain conditions, the plasmid DNA can be deleted, leaving behind the nucleic acid molecule of the present invention. The integrated nucleic acid molecule can be under chromosomal promoter control, under native or plasmid promoter control, or under a combination of several promoter controls. Single or multiple copies of the nucleic acid molecule can be integrated into the chromosome. A recombinant vector of the present invention can contain at least one selectable marker.
- In one embodiment, a recombinant vector used in a recombinant nucleic acid molecule of the present invention is an expression vector. As used herein, the phrase “expression vector” is used to refer to a vector that is suitable for production of an encoded product (e.g., a protein of interest). In this embodiment, a nucleic acid sequence encoding the product to be produced (e.g., a PUFA PKS domain) is inserted into the recombinant vector to produce a recombinant nucleic acid molecule. The nucleic acid sequence encoding the protein to be produced is inserted into the vector in a manner that operatively links the nucleic acid sequence to regulatory sequences in the vector which enable the transcription and translation of the nucleic acid sequence within the recombinant host cell.
- In another embodiment, a recombinant vector used in a recombinant nucleic acid molecule of the present invention is a targeting vector. As used herein, the phrase “targeting vector” is used to refer to a vector that is used to deliver a particular nucleic acid molecule into a recombinant host cell, wherein the nucleic acid molecule is used to delete or inactivate an endogenous gene within the host cell or microorganism (i.e., used for targeted gene disruption or knock-out technology). Such a vector may also be known in the art as a “knock-out” vector. In one aspect of this embodiment, a portion of the vector, but more typically, the nucleic acid molecule inserted into the vector (i.e., the insert), has a nucleic acid sequence that is homologous to a nucleic acid sequence of a target gene in the host cell (i.e., a gene which is targeted to be deleted or inactivated). The nucleic acid sequence of the vector insert is designed to bind to the target gene such that the target gene and the insert undergo homologous recombination, whereby the endogenous target gene is deleted, inactivated or attenuated (i.e., by at least a portion of the endogenous target gene being mutated or deleted).
- Typically, a recombinant nucleic acid molecule includes at least one nucleic acid molecule of the present invention operatively linked to one or more transcription control sequences. As used herein, the phrase “recombinant molecule” or “recombinant nucleic acid molecule” primarily refers to a nucleic acid molecule or nucleic acid sequence operatively linked to a transcription control sequence, but can be used interchangeably with the phrase “nucleic acid molecule”, when such nucleic acid molecule is a recombinant molecule as discussed herein. According to the present invention, the phrase “operatively linked” refers to linking a nucleic acid molecule to a transcription control sequence in a manner such that the molecule is able to be expressed when transfected (i.e., transformed, transduced, transfected, conjugated or conduced) into a host cell. Transcription control sequences are sequences which control the initiation, elongation, or termination of transcription. Particularly important transcription control sequences are those which control transcription initiation, such as promoter, enhancer, operator and repressor sequences. Suitable transcription control sequences include any transcription control sequence that can function in a host cell or organism into which the recombinant nucleic acid molecule is to be introduced.
- Recombinant nucleic acid molecules of the present invention can also contain additional regulatory sequences, such as translation regulatory sequences, origins of replication, and other regulatory sequences that are compatible with the recombinant cell. In one embodiment, a recombinant molecule of the present invention, including those which are integrated into the host cell chromosome, also contains secretory signals (i.e., signal segment nucleic acid sequences) to enable an expressed protein to be secreted from the cell that produces the protein. Suitable signal segments include a signal segment that is naturally associated with the protein to be expressed or any heterologous signal segment capable of directing the secretion of the protein according to the present invention. In another embodiment, a recombinant molecule of the present invention comprises a leader sequence to enable an expressed protein to be delivered to and inserted into the membrane of a host cell. Suitable leader sequences include a leader sequence that is naturally associated with the protein, or any heterologous leader sequence capable of directing the delivery and insertion of the protein to the membrane of a cell.
- The present inventors have found that the Schizochytrium PUFA PKS Orfs A and B are closely linked in the genome and region between the Orfs has been sequenced. The Orfs are oriented in opposite directions and 4244 base pairs separate the start (ATG) codons (i.e. they are arranged as follows: 3′OrfA5′-4244 bp-5′OrfB3′). Examination of the 4244 bp intergenic region did not reveal any obvious Orfs (no significant matches were found on a BlastX search). Both Orfs A and B are highly expressed in Schizochytrium, at least during the time of oil production, implying that active promoter elements are embedded in this intergenic region. These genetic elements are believed to have utility as a bi-directional promoter sequence for transgenic applications. For example, in a preferred embodiment, one could clone this region, place any genes of interest at each end and introduce the construct into Schizochytrium (or some other host in which the promoters can be shown to function). It is predicted that the regulatory elements, under the appropriate conditions, would provide for coordinated, high level expression of the two introduced genes. The complete nucleotide sequence for the regulatory region containing Schizochytrium PUFA PKS regulatory elements (e.g., a promoter) is represented herein as SEQ ID NO:36.
- In a similar manner, OrfC is highly expressed in Schizochytrium during the time of oil production and regulatory elements are expected to reside in the region upstream of its start codon. A region of genomic DNA upstream of OrfC has been cloned and sequenced and is represented herein as (SEQ ID NO:37). This sequence contains the 3886 nt immediately upstream of the OrfC start codon. Examination of this region did not reveal any obvious Orfs (i.e., no significant matches were found on a BlastX search). It is believed that regulatory elements contained in this region, under the appropriate conditions, will provide for high-level expression of a gene placed behind them. Additionally, under the appropriate conditions, the level of expression may be coordinated with genes under control of the A-B intergenic region (SEQ ID NO:36).
- Therefore, in one embodiment, a recombinant nucleic acid molecule useful in the present invention, as disclosed herein, can include a PUFA PKS regulatory region contained within SEQ ID NO:36 and/or SEQ ID NO:37. Such a regulatory region can include any portion (fragment) of SEQ ID NO:36 and/or SEQ ID NO:37 that has at least basal PUFA PKS transcriptional activity (at least basal promoter activity).
- One or more recombinant molecules of the present invention can be used to produce an encoded product (e.g., a PUFA PKS domain, protein, or system) of the present invention. In one embodiment, an encoded product is produced by expressing a nucleic acid molecule as described herein under conditions effective to produce the protein. A preferred method to produce an encoded protein is by transfecting a host cell with one or more recombinant molecules to form a recombinant cell. Suitable host cells to transfect include, but are not limited to, any bacterial, fungal (e.g., yeast), insect, plant or animal cell that can be transfected. Host cells can be either untransfected cells or cells that are already transfected with at least one other recombinant nucleic acid molecule.
- According to the present invention, the term “transfection” is used to refer to any method by which an exogenous nucleic acid molecule (i.e., a recombinant nucleic acid molecule) can be inserted into a cell. The term “transformation” can be used interchangeably with the term “transfection” when such term is used to refer to the introduction of nucleic acid molecules into microbial cells, such as algae, bacteria and yeast. In microbial systems, the term “transformation” is used to describe an inherited change due to the acquisition of exogenous nucleic acids by the microorganism and is essentially synonymous with the term “transfection.” However, in animal cells, transformation has acquired a second meaning which can refer to changes in the growth properties of cells in culture after they become cancerous, for example. Therefore, to avoid confusion, the term “transfection” is preferably used with regard to the introduction of exogenous nucleic acids into animal cells, and the term “transfection” will be used herein to generally encompass transfection of animal cells, plant cells and transformation of microbial cells, to the extent that the terms pertain to the introduction of exogenous nucleic acids into a cell. Therefore, transfection techniques include, but are not limited to, transformation, particle bombardment, electroporation, microinjection, lipofection, adsorption, infection and protoplast fusion.
- It will be appreciated by one skilled in the art that use of recombinant DNA technologies can improve control of expression of transfected nucleic acid molecules by manipulating, for example, the number of copies of the nucleic acid molecules within the host cell, the efficiency with which those nucleic acid molecules are transcribed, the efficiency with which the resultant transcripts are translated, and the efficiency of post-translational modifications. Additionally, the promoter sequence might be genetically engineered to improve the level of expression as compared to the native promoter. Recombinant techniques useful for controlling the expression of nucleic acid molecules include, but are not limited to, integration of the nucleic acid molecules into one or more host cell chromosomes, addition of vector stability sequences to plasmids, substitutions or modifications of transcription control signals (e.g., promoters, operators, enhancers), substitutions or modifications of translational control signals (e.g., ribosome binding sites, Shine-Dalgarno sequences), modification of nucleic acid molecules to correspond to the codon usage of the host cell, and deletion of sequences that destabilize transcripts.
- General discussion above with regard to recombinant nucleic acid molecules and transfection of host cells is intended to be applied to any recombinant nucleic acid molecule discussed herein, including those encoding any amino acid sequence having a biological activity of at least one domain from a PUFA PKS, those encoding amino acid sequences from other PKS systems, and those encoding other proteins or domains.
- This invention also relates to PUFA PKS systems (and proteins or domains thereof) from microorganisms other than those described specifically herein that are homologous in structure, domain organization and/or function to a Schizochytrium PUFA PKS system (and proteins or domains thereof) as described herein. In one embodiment, the microorganism is a non-bacterial microorganism, and preferably, the microorganism is a eukaryotic microorganism. In addition, this invention relates to use of these microorganisms and the PUFA PKS systems or components thereof from these microorganisms in the various applications for a PUFA PKS system (e.g., genetically modified organisms and methods of producing bioactive molecules) according to the present invention. Such microorganisms have the following characteristics: (a) produces at least one PUFA; and (b) has an ability to produce increased PUFAs under dissolved oxygen conditions of less than about 5% of saturation in the fermentation medium, as compared to production of PUFAs by said microorganism under dissolved oxygen conditions of greater than 5% of saturation, more preferably 10% of saturation, more preferably greater than 15% of saturation and more preferably greater than 20% of saturation in the fermentation medium. A screening process for identification of microorganisms comprising a PUFA PKS system is described in detail in U.S. Patent Application Publication No. 20020194641, supra. The knowledge of the structure and function of the PUFA PKS proteins and domains described herein, and the nucleotide sequence encoding the same, are useful tools for the identification, confirmation, and/or isolation of homologues of such proteins or polynucleotides.
- According to the present invention, the term “Thraustochytrid” refers to any members of the order Thraustochytriales, which includes the family Thraustochytriaceae, and the term “Labyrinthulid” refers to any member of the order Labyrinthulales, which includes the family Labyrinthulaceae. The members of the family Labyrinthulaceae have been considered to be members of the order Thraustochytriales, but in revisions of the taxonomy of such organisms, the family is now considered to be a member of the order Labyrinthulales, and both Labyrinthulales and Thraustochytriales are considered to be members of the phylum Labyrinthulomycota.
- Developments have resulted in frequent revision of the taxonomy of the Thraustochytrids (thraustochytrids). Taxonomic theorists generally place Thraustochytrids with the algae or algae-like protists. However, because of taxonomic uncertainty, it would be best for the purposes of the present invention to consider the strains described in the present invention as Thraustochytrids to include the following organisms: Order: Thraustochytriales; Family: Thraustochytriaceae; Genera: Thraustochytrium (Species: sp., arudimentale, aureum, benthicola, globosum, kinnei, motivum, multirudimentale, pachydermum, proliferum, roseum, striatum), Ulkenia (previously considered by some to be a member of Thraustochytrium) (Species: sp., amoeboidea, kerguelensis, minuta, profunda, radiata, sailens, sarkariana, schizochytrops, visurgensis, yorkensis), Schizochytrium (Species: sp., aggregatum, limnaceum, mangrovei, minutum, octosporum), Japonochytrium (Species: sp., marinum), Aplanochytrium (Species: sp., haliotidis, kerguelensis, profunda, stocchinoi), Althornia (Species: sp., crouchii), or Dina (Species: sp., marisalba, sinorifica).
- Strains described in the present invention as Labyrinthulids include the following organisms: Order: Labyrinthulales, Family: Labyrinthulaceae, Genera: Labyrinthula (Species: sp., algeriensis, coenocystis, chattonii, macrocystis, macrocystis atlantica, macrocystis macrocystis, marina, minuta, roscoffensis, valkanovii, vitellina, vitellina pacifica, vitellina vitellina, zopfii), Labyrinthuloides (Species: sp., haliotidis, yorkensis), Labyrinthomyxa (Species: sp., marina), Diplophrys (Species: sp., archeri), Pyrrhosorus (Species: sp., marinus), Sorodiplophrys (Species: sp., stercorea) or Chlainydomyxa (Species: sp., labyrinthuloides, montana) (although there is currently not a consensus on the exact taxonomic placement of Pyrrhosorus, Sorodiplophrys or Chlamydomyxa).
- It is recognized that at the time of this invention, revision in the taxonomy of Thraustochytrids places the genus Labyrinthuloides in the family of Labyrinthulaceae and confirms the placement of the two families Thraustochytriaceae and Labyrinthulaceae within the Stramenopile lineage. It is noted that the Labyrinthulaceae are sometimes commonly called labyrinthulids or labyrinthula, or labyrinthuloides and the Thraustochytriaceae are commonly called thraustochytrids.
- To produce significantly high yields of various bioactive molecules using the PUFA PKS system of the present invention, an organism, preferably a microorganism or a plant, can be genetically modified to affect the activity of a PUFA PKS system. In one aspect, such an organism can endogenously contain and express a PUFA PKS system, and the genetic modification can be a genetic modification of one or more of the functional domains of the endogenous PUFA PKS system, whereby the modification has some effect on the activity of the PUFA PKS system. In another aspect, such an organism can endogenously contain and express a PUFA PKS system, and the genetic modification can be an introduction of at least one exogenous nucleic acid sequence (e.g., a recombinant nucleic acid molecule), wherein the exogenous nucleic acid sequence encodes at least one biologically active domain or protein from the same or a second PKS system and/or a protein that affects the activity of said PUFA PKS system (e.g., a phosphopantetheinyl transferases (PPTase), discussed below). In yet another aspect, the organism does not necessarily endogenously (naturally) contain a PUFA PKS system, but is genetically modified to introduce at least one recombinant nucleic acid molecule encoding an amino acid sequence having the biological activity of at least one domain of a PUFA PKS system. In this aspect, PUFA PKS activity is affected by introducing or increasing PUFA PKS activity in the organism. Various embodiments associated with each of these aspects will be discussed in greater detail below.
- Therefore, according to the present invention, one embodiment relates to a genetically modified microorganism, wherein the microorganism expresses a PKS system comprising at least one biologically active domain of a polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) system. The at least one domain of the PUFA PKS system is encoded by a nucleic acid sequence described herein. The genetic modification affects the activity of the PKS system in the organism. The genetically modified microorganism can include any one or more of the above-identified nucleic acid sequences, and/or any of the other homologues of any of the Schizochytrium PUFA PKS ORFs or domains as described in detail above.
- As used herein, a genetically modified microorganism can include a genetically modified bacterium, protist, microalgae, fungus, or other microbe, and particularly, any of the genera of the order Thraustochytriales (e.g., a Thraustochytrid) described herein. Such a genetically modified microorganism has a genome which is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form such that the desired result is achieved (i.e., increased or modified PUFA PKS activity and/or production of a desired product using the PUFA PKS system or component thereof). Genetic modification of a microorganism can be accomplished using classical strain development and/or molecular genetic techniques. Such techniques known in the art and are generally disclosed for microorganisms, for example, in Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Labs Press. The reference Sambrook et al., ibid., is incorporated by reference herein in its entirety. A genetically modified microorganism can include a microorganism in which nucleic acid molecules have been inserted, deleted or modified (i.e., mutated; e.g., by insertion, deletion, substitution, and/or inversion of nucleotides), in such a manner that such modifications provide the desired effect within the microorganism.
- Preferred microorganism host cells to modify according to the present invention include, but are not limited to, any bacteria, protist, microalga, fungus, or protozoa. In one aspect, preferred microorganisms to genetically modify include, but are not limited to, any microorganism of the order Thraustochytriales or any microorganism of the order Labyrinthulales. Particularly preferred host cells for use in the present invention could include microorganisms from a genus including, but not limited to: Thraustochytrium, Ulkenia, Schizochytrium, Japonochytrium, Aplanochytrium, Althornia, Elina, Labyrinthula, Labyrinthuloides, Labyrinthomyxa, Diplophrys, Pyrrhosorus, Sorodiplophrys or Chlamydomyxa. Other examples of suitable host microorganisms for genetic modification include, but are not limited to, yeast including Saccharomyces cerevisiae, Saccharomyces carlsbergensis, or other yeast such as Candida, Kluyveromyces, or other fungi, for example, filamentous fungi such as Aspergillus, Neurospora, Penicillium, etc. Bacterial cells also may be used as hosts. This includes Escherichia coli, which can be useful in fermentation processes. Alternatively, a host such as a Lactobacillus species or Bacillus species can be used as a host.
- Another embodiment of the present invention relates to a genetically modified plant or part of a plant (e.g., wherein the plant has been genetically modified to express a PUPA PKS system described herein), which includes at least the core PUFA PKS enzyme complex and, in one embodiment, at least one PUFA PKS accessory protein, (e.g., a PPTase), so that the plant produces PUFAs. Preferably, the plant is an oil seed plant, wherein the oil seeds or oil in the oil seeds contain PUFAs produced by the PUFA PKS system. Such oils contain a detectable amount of at least one target or primary PUFA that is the product of the PUFA PKS system. Plants are not known to endogenously contain a PUFA PKS system, and therefore, the PUFA PKS systems of the present invention represent an opportunity to produce plants with unique fatty acid production capabilities. It is a particularly preferred embodiment of the present invention to genetically engineer plants to produce one or more PUFAs in the same plant, including, EPA, DHA, DPA, ARA, GLA, SDA and others. The present invention offers the ability to create any one of a number of “designer oils” in various ratios and forms. Moreover, the disclosure of the PUFA PKS genes from the particular marine organisms described herein offer the opportunity to more readily extend the range of PUFA production and successfully produce such PUFAs within temperature ranges used to grow most crop plants.
- Methods for the genetic engineering of plants are well known in the art. For instance, numerous methods for plant transformation have been developed, including biological and physical transformation protocols. See, for example, Miki et al., “Procedures for Introducing Foreign DNA into Plants” in Methods in Plant Molecular Biology and Biotechnology, Glick, B. R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca Raton, 1993) pp. 67-88. In addition, vectors and in vitro culture methods for plant cell or tissue transformation and regeneration of plants are available. See, for example, Gruber et al., “Vectors for Plant Transformation” in Methods in Plant Molecular Biology and Biotechnology, Glick, B. R. and Thompson, J. E. Eds. (CRC Press, Inc., Boca Raton, 1993) pp. 89-119.
- The most widely utilized method for introducing an expression vector into plants is based on the natural transformation system of Agrobacterium. See, for example, Borsch et al., Science 227:1229 (1985). A. tumefaciens and A. rhizogenes are plant pathogenic soil bacteria which genetically transform plant cells. The Ti and Ri plasmids of A. tumefaciens and A. rhizogenes, respectively, carry genes responsible for genetic transformation of the plant. See, for example, Kado, C. I., Crit. Rev. Plant. Sci. 10:1 (1991). Descriptions of Agrobacterium vector systems and methods for Agrobacterium-mediated gene transfer are provided by numerous references, including Gruber et al., supra, Miki et al., supra, Moloney et al., Plant Cell Reports 8:238 (1989), and U.S. Pat. Nos. 4,940,838 and 5,464,763.
- Another generally applicable method of plant transformation is microprojectile-mediated transformation wherein DNA is carried on the surface of microprojectiles. The expression vector is introduced into plant tissues with a biolistic device that accelerates the microprojectiles to speeds sufficient to penetrate plant cell walls and membranes. Sanford et al., Part. Sci. Technol. 5:27 (1987), Sanford, J. C., Trends Biotech. 6:299 (1988), Sanford, J. C., Physiol. Plant 79:206 (1990), Klein et al., Biotechnology 10:268 (1992).
- Another method for physical delivery of DNA to plants is sonication of target cells. Zhang et al., Bio/Technology 9:996 (1991). Alternatively, liposome or spheroplast fusion have been used to introduce expression vectors into plants. Deshayes et al., EMBO J., 4:2731 (1985), Christou et al., Proc Natl. Acad. Sci. USA 84:3962 (1987). Direct uptake of DNA into protoplasts using CaCl2 precipitation, polyvinyl alcohol or poly-L-ornithine have also been reported. Hain et al., Mol. Gen. Genet. 199:161 (1985) and Draper et al., Plant Cell Physiol. 23:451 (1982). Electroporation of protoplasts and whole cells and tissues have also been described. Donn et al., In Abstracts of VIIth International Congress on Plant Cell and Tissue Culture IAPTC, A2-38, p. 53 (1990); D'Halluin et al., Plant Cell 4:1495-1505 (1992) and Spencer et al., Plant Mol. Biol. 24:51-61 (1994).
- As used herein, a genetically modified plant can include any genetically modified plant including higher plants and particularly, any consumable plants or plants useful for producing a desired bioactive molecule of the present invention. “Plant parts”, as used herein, include any parts of a plant, including, but not limited to, seeds (immature or mature), oils, pollen, embryos, flowers, fruits, shoots, leaves, roots, stems, explants, etc. A genetically modified plant has a genome that is modified (i.e., mutated or changed) from its normal (i.e., wild-type or naturally occurring) form such that the desired result is achieved (e.g., PUFA PKS activity and production of PUFAs). Genetic modification of a plant can be accomplished using classical strain development and/or molecular genetic techniques. Methods for producing a transgenic plant, wherein a recombinant nucleic acid molecule encoding a desired amino acid sequence is incorporated into the genome of the plant, are known in the art. A preferred plant to genetically modify according to the present invention is preferably a plant suitable for consumption by animals, including humans.
- Preferred plants to genetically modify according to the present invention (i.e., plant host cells) include, but are not limited to any higher plants, including both dicotyledonous and monocotyledonous plants, and particularly consumable plants, including crop plants and especially plants used for their oils. Such plants can include, for example: canola, soybeans, rapeseed, linseed, corn, safflowers, sunflowers and tobacco. Other preferred plants include those plants that are known to produce compounds used as pharmaceutical agents, flavoring agents, nutraceutical agents, functional food ingredients or cosmetically active agents or plants that are genetically engineered to produce these compounds/agents.
- According to the present invention, a genetically modified microorganism or plant includes a microorganism or plant that has been modified using recombinant technology. As used herein, genetic modifications that result in a decrease in gene expression, in the function of the gene, or in the function of the gene product (i.e., the protein encoded by the gene) can be referred to as inactivation (complete or partial), deletion, interruption, blockage or down-regulation of a gene. For example, a genetic modification in a gene which results in a decrease in the function of the protein encoded by such gene, can be the result of a complete deletion of the gene (i.e., the gene does not exist, and therefore the protein does not exist), a mutation in the gene which results in incomplete or no translation of the protein (e.g., the protein is not expressed), or a mutation in the gene which decreases or abolishes the natural function of the protein (e.g., a protein is expressed which has decreased or no enzymatic activity or action). Genetic modifications that result in an increase in gene expression or function can be referred to as amplification, overproduction, overexpression, activation, enhancement, addition, or up-regulation of a gene.
- The genetic modification of a microorganism or plant according to the present invention preferably affects the activity of the PKS system expressed by the plant, whether the PKS system is endogenous and genetically modified, endogenous with the introduction of recombinant nucleic acid molecules into the organism, or provided completely by recombinant technology. According to the present invention, to “affect the activity of a PKS system” includes any genetic modification that causes any detectable or measurable change or modification in the PKS system expressed by the organism as compared to in the absence of the genetic modification. A detectable change or modification in the PKS system can include, but is not limited to: the introduction of PKS system activity into an organism such that the organism now has measurable/detectable PKS system activity (i.e., the organism did not contain a PKS system prior to the genetic modification), the introduction into the organism of a functional domain from a different PKS system than a PKS system endogenously expressed by the organism such that the PKS system activity is modified (e.g., a bacterial PUFA PKS domain or a type I PKS domain is introduced into an organism that endogenously expresses a non-bacterial PUFA PKS system), a change in the amount of a bioactive molecule produced by the PKS system (e.g., the system produces more (increased amount) or less (decreased amount) of a given product as compared to in the absence of the genetic modification), a change in the type of a bioactive molecule produced by the PKS system (e.g., the system produces a new or different product, or a variant of a product that is naturally produced by the system), and/or a change in the ratio of multiple bioactive molecules produced by the PKS system (e.g., the system produces a different ratio of one PUFA to another PUFA, produces a completely different lipid profile as compared to in the absence of the genetic modification, or places various PUFAs in different positions in a triacylglycerol as compared to the natural configuration). Such a genetic modification includes any type of genetic modification and specifically includes modifications made by recombinant technology and by classical mutagenesis.
- It should be noted that reference to increasing the activity of a functional domain or protein in a PUFA PKS system refers to any genetic modification in the organism containing the domain or protein (or into which the domain or protein is to be introduced) which results in increased functionality of the domain or protein system and can include higher activity of the domain or protein (e.g., specific activity or in vivo enzymatic activity), reduced inhibition or degradation of the domain or protein system, and overexpression of the domain or protein. For example, gene copy number can be increased, expression levels can be increased by use of a promoter that gives higher levels of expression than that of the native promoter, or a gene can be altered by genetic engineering or classical mutagenesis to increase the activity of the domain or protein encoded by the gene.
- Similarly, reference to decreasing the activity of a functional domain or protein in a PUFA PKS system refers to any genetic modification in the organism containing such domain or protein (or into which the domain or protein is to be introduced) which results in decreased functionality of the domain or protein and includes decreased activity of the domain or protein, increased inhibition or degradation of the domain or protein and a reduction or elimination of expression of the domain or protein. For example, the action of domain or protein of the present invention can be decreased by blocking or reducing the production of the domain or protein, “knocking out” the gene or portion thereof encoding the domain or protein, reducing domain or protein activity, or inhibiting the activity of the domain or protein. Blocking or reducing the production of a domain or protein can include placing the gene encoding the domain or protein under the control of a promoter that requires the presence of an inducing compound in the growth medium. By establishing conditions such that the inducer becomes depleted from the medium, the expression of the gene encoding the domain or protein (and therefore, of protein synthesis) could be turned off. Blocking or reducing the activity of domain or protein could also include using an excision technology approach similar to that described in U.S. Pat. No. 4,743,546, incorporated herein by reference. To use this approach, the gene encoding the protein of interest is cloned between specific genetic sequences that allow specific, controlled excision of the gene from the genome. Excision could be prompted by, for example, a shift in the cultivation temperature of the culture, as in U.S. Pat. No. 4,743,546, or by some other physical or nutritional signal.
- In one embodiment of the present invention, a genetic modification includes a modification of a nucleic acid sequence encoding an amino acid sequence that has a biological activity of at least one domain of a non-bacterial PUFA PKS system as described herein. Such a modification can be to an amino acid sequence within an endogenously (naturally) expressed non-bacterial PUFA PKS system, whereby a microorganism that naturally contains such a system is genetically modified by, for example, classical mutagenesis and selection techniques and/or molecular genetic techniques, include genetic engineering techniques. Genetic engineering techniques can include, for example, using a targeting recombinant vector to delete a portion of an endogenous gene, or to replace a portion of an endogenous gene with a heterologous sequence. Examples of heterologous sequences that could be introduced into a host genome include sequences encoding at least one functional domain from another PKS system, such as a different non-bacterial PUFA PKS system, a bacterial PUFA PKS system, a type I PKS system, a type II PKS system, or a modular PKS system. Other heterologous sequences to introduce into the genome of a host includes a sequence encoding a protein or functional domain that is not a domain of a PKS system, but which will affect the activity of the endogenous PKS system. For example, one could introduce into the host genome a nucleic acid molecule encoding a phosphopantetheinyl transferase (discussed below). Specific modifications that could be made to an endogenous PUPA PKS system are discussed in detail below.
- In another aspect of this embodiment of the invention, the genetic modification can include: (1) the introduction of a recombinant nucleic acid molecule encoding an amino acid sequence having a biological activity of at least one domain of a non-bacterial PUFA PKS system; and/or (2) the introduction of a recombinant nucleic acid molecule encoding a protein or functional domain that affects the activity of a PUFA PKS system, into a host. The host can include: (1) a host cell that does not express any PKS system, wherein all functional domains of a PKS system are introduced into the host cell, and wherein at least one functional domain is from a non-bacterial PUFA PKS system; (2) a host cell that expresses a PKS system (endogenous or recombinant) having at least one functional domain of a non-bacterial PUFA PKS system, wherein the introduced recombinant nucleic acid molecule can encode at least one additional non-bacterial PUFA PKS domain function or another protein or domain that affects the activity of the host PKS system; and (3) a host cell that expresses a PKS system (endogenous or recombinant) which does not necessarily include a domain function from a non-bacterial PUFA PKS, and wherein the introduced recombinant nucleic acid molecule includes a nucleic acid sequence encoding at least one functional domain of a non-bacterial PUFA PKS system. In other words, the present invention intends to encompass any genetically modified organism (e.g., microorganism or plant), wherein the organism comprises at least one non-bacterial PUFA PKS domain function (either endogenously or by recombinant modification), and wherein the genetic modification has a measurable effect on the non-bacterial PUFA PKS domain function or on the PKS system when the organism comprises a functional PKS system.
- Therefore, using the PUFA PKS systems of the present invention, gene mixing can be used to extend the range of PUFA products (and ratios thereof) to include EPA, DPA, DHA, ARA, GLA, SDA and others, as well as to produce a wide variety of bioactive molecules, including antibiotics, other pharmaceutical compounds, and other desirable products. The method to obtain these bioactive molecules includes not only the mixing of genes from various organisms but also various methods of genetically modifying the non-bacterial PUFA PKS genes disclosed herein. Knowledge of the genetic basis and domain structure of the non-bacterial PUFA PKS system of the present invention provides a basis for designing novel genetically modified organisms which produce a variety of bioactive molecules. Although mixing and modification of any PKS domains and related genes are contemplated by the present inventors, by way of example, various possible manipulations of the PUFA-PKS system are discussed in U.S. Patent Application Publication No. 20020194641, U.S. Patent Application Publication No. 20040235127, and U.S. Patent Application Publication No. 20050100995, supra with regard to genetic modification and bioactive molecule production.
- The comparison of the Schizochytrium PUFA PKS architecture (domain organization) with other PUFA PKS system architecture illustrates nature's ability to alter domain order as well as incorporate new domains to create novel end products. In addition, the genes can now be manipulated in the laboratory to create new products. Proposed herein is the manipulation of PUFA PKS systems in either a directed or random way to influence the end products. For example, in a preferred embodiment, one could envision substituting one of the DH (FabA-like) domains of the PUFA-PKS system for a DH domain that did not posses isomerization activity, potentially creating a molecule with a mix of cis- and trans-double bonds. The current products of the Schizochytrium PUFA PKS system are DHA and DPA (C22:5 ω6). If one manipulated the system to produce C20 fatty acids, one would expect the products to be EPA and ARA (C20:4 ω6). This could provide a new source for ARA. One could also substitute domains from related PUFA-PKS systems that produced a different DHA to DPA ratio, for example, by using genes from Thraustochytrium 23B.
- Additionally, one could envision specifically altering one of the ER domains (e.g. removing, or inactivating) in the Schizochytrium PUFA PKS system (other PUFA PKS systems described so far do not have two ER domains) to affect the end product profile. Similar strategies could be attempted in a directed manner for each of the distinct domains of the PUFA-PKS proteins using more or less sophisticated approaches. Of course one would not be limited to the manipulation of single domains. Finally, one could extend the approach by mixing domains from the PUFA-PKS system and other PKS or FAS systems (e.g., type I, type II, type III) to create an entire range of new end products. For example, one could introduce the PUFA-PKS DH domains into systems that do not normally incorporate cis double bonds into their end products.
- Accordingly, encompassed by the present invention are methods to genetically modify microbial or plant cells by: genetically modifying at least one nucleic acid sequence in the organism that encodes an amino acid sequence having the biological activity of at least one functional domain of a PUFA PKS system according to the present invention, and/or expressing at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding such amino acid sequence. Various embodiments of such sequences, methods to genetically modify an organism, and specific modifications have been described in detail above. Typically, the method is used to produce a particular genetically modified organism that produces a particular bioactive molecule or molecules.
- In one embodiment of the present invention, it is contemplated that a mutagenesis program could be combined with a selective screening process to obtain bioactive molecules of interest. This would include methods to search for a range of bioactive compounds. This search would not be restricted to production of those molecules with cis double bonds. The mutagenesis methods could include, but are not limited to: chemical mutagenesis, gene shuffling, switching regions of the genes encoding specific enzymatic domains, or mutagenesis restricted to specific regions of those genes, as well as other methods.
- For example, high throughput mutagenesis methods could be used to influence or optimize production of the desired bioactive molecule. Once an effective model system has been developed, one could modify these genes in a high throughput manner. Utilization of these technologies can be envisioned on two levels. First, if a sufficiently selective screen for production of a product of interest (e.g., ARA) can be devised, it could be used to attempt to alter the system to produce this product (e.g., in lieu of, or in concert with, other strategies such as those discussed above). Additionally, if the strategies outlined above resulted in a set of genes that did produce the product of interest, the high throughput technologies could then be used to optimize the system. For example, if the introduced domain only functioned at relatively low temperatures, selection methods could be devised to permit removing that limitation. In one embodiment of the invention, screening methods are used to identify additional non-bacterial organisms having novel PKS systems similar to the PUFA PKS system of Schizochytrium, as described herein (see above). Homologous PKS systems identified in such organisms can be used in methods similar to those described herein for the Schizochytrium, as well as for an additional source of genetic material from which to create, further modify and/or mutate a PUFA PKS system for expression in that microorganism, in another microorganism, or in a higher plant, to produce a variety of compounds.
- It is recognized that many genetic alterations, either random or directed, which one may introduce into a native (endogenous, natural) PUFA PKS system, will result in an inactivation of enzymatic functions. A preferred embodiment of the invention includes a system to select for only those modifications that do not block the ability of the PUFA PKS system to produce a product. For example, the FabB-strain of E. coli is incapable of synthesizing unsaturated fatty acids and requires supplementation of the medium with fatty acids that can substitute for its normal unsaturated fatty acids in order to grow (see Metz et al., 2001, supra). However, this requirement (for supplementation of the medium) can be removed when the strain is transformed with a functional PUFA-PKS system (i.e. one that produces a PUFA product in the E. coli host—see (Metz et al., 2001, supra,
FIG. 2A ). The transformed FabB-strain now requires a functional PUFA-PKS system (to produce the unsaturated fatty acids) for growth without supplementation. The key element in this example is that production of a wide range of unsaturated fatty acid will suffice (even unsaturated fatty acid substitutes such as branched chain fatty acids). Therefore, in another preferred embodiment of the invention, one could create a large number of mutations in one or more of the PUFA PKS genes disclosed herein, and then transform the appropriately modified FabB-strain (e.g. create mutations in an expression construct containing an ER domain and transform a FabB-strain having the other essential domains on a separate plasmid—or integrated into the chromosome) and select only for those transformants that grow without supplementation of the medium (i.e., that still possessed an ability to produce a molecule that could complement the FabB-defect). Additional screens could be developed to look for particular compounds (e.g. use of GC for fatty acids) being produced in this selective subset of an active PKS system. One could envision a number of similar selective screens for bioactive molecules of interest. - In one embodiment of invention, a genetically modified organism has a modification that changes at least one product produced by the endogenous PKS system, as compared to a wild-type organism.
- In one embodiment, a genetically modified organism has been modified by transfecting the organism with a recombinant nucleic acid molecule encoding a protein that regulates the chain length of fatty acids produced by the PUFA PKS system. For example, the protein that regulates the chain length of fatty acids produced by the PUFA PKS system can be a chain length factor that directs the synthesis of C20 units or C22 units.
- In another embodiment, a genetically modified organism expresses a PUFA PKS system comprising a genetic modification in a domain selected from the group consisting of a domain encoding β-hydroxy acyl-ACP dehydrase (DH) and a domain encoding 3-ketoacyl-ACP synthase (KS), wherein the modification alters the ratio of long chain fatty acids produced by the PUFA PKS system as compared to in the absence of the modification. In one aspect of this embodiment, the modification is selected from the group consisting of a deletion of all or a part of the domain, a substitution of a homologous domain from a different organism for the domain, and a mutation of the domain.
- In another embodiment, a genetically modified organism expresses a PUFA PKS system comprising a modification in an enoyl-ACP reductase (ER) domain, wherein the modification results in the production of a different compound as compared to in the absence of the modification. In one aspect of this embodiment, the modification is selected from the group consisting of a deletion of all or a part of the ER domain, a substitution of an ER domain from a different organism for the ER domain, and a mutation of the ER domain.
- In one embodiment of the invention, the genetically modified organism produces a polyunsaturated fatty acid (PUFA) profile that differs from the naturally occurring organism without a genetic modification.
- Many other genetic modifications useful for producing bioactive molecules will be apparent to those of skill in the art, given the present disclosure, and various other modifications have been discussed previously herein. The present invention contemplates any genetic modification related to a PUFA PKS system as described herein which results in the production of a desired bioactive molecule.
- As described above, in one embodiment of the present invention, a genetically modified microorganism or plant includes a microorganism or plant which has an enhanced ability to synthesize desired bioactive molecules (products) or which has a newly introduced ability to synthesize specific products (e.g., to synthesize a specific antibiotic). According to the present invention, “an enhanced ability to synthesize” a product refers to any enhancement, or up-regulation, in a pathway related to the synthesis of the product such that the microorganism or plant produces an increased amount of the product (including any production of a product where there was none before) as compared to the wild-type microorganism or plant, cultured or grown, under the same conditions. Methods to produce such genetically modified organisms have been described in detail above. In one preferred embodiment, the present invention relates to a genetically modified plant or part of a plant (e.g., wherein the plant has been genetically modified to express a PUFA PKS system described herein), which includes at least the core PUFA PKS enzyme complex and, in one embodiment, at least one PUFA PKS accessory protein, (e.g., a PPTase), so that the plant produces PUFAs. Preferably, the plant is an oil seed plant, wherein the oil seeds or oil in the oil seeds contain PUFAs produced by the PUFA PKS system. Such oils contain a detectable amount of at least one target or primary PUFA that is the product of the PUFA PKS system.
- The present inventors demonstrate herein the production of PUFAs in a plant that has been genetically modified to express the genes encoding a PUFA PKS system from Schizochytrium of the present invention and a PUFA PKS accessory enzyme, 4′-phosphopantetheinyl transferase (PPTase). The oils produced by these plants contain significant quantities of both DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n-6), which are the predominant PUFAs (the primary PUFAs) produced by the Schizochytrium from which the PUFA PKS genes were derived. Significantly, oils from plants that produce PUFAs using the PUFA PKS pathway have a different fatty acid profile than plants that are genetically engineered to produce the same PUFAs by the “standard” pathway described above. In particular, oils from plants that have been genetically engineered to produce specific PUFAs by the PUFA PKS pathway are substantially free of the various intermediate products and side products that accumulate in oils that are produced as a result of the use of the standard PUFA synthesis pathway. This characteristic is discussed in detail below.
- More particularly, efforts to produce long chain PUFAs in plants by the “standard” pathway have all taken the same basic approach, which is dictated by this synthesis pathway. These efforts relied on modification of the plants' endogenous fatty acids by introduction of genes encoding various elongases and desaturases. Plants typically produce 18 carbon fatty acids (e.g., oleic acid, linoleic acid, linolenic acid) via the Type II fatty acid synthase (FAS) in its plastids. Often, a single double bond is formed while that fatty acid is attached to ACP, and then the oleic acid (18:1) is cleaved from the ACP by the action of an acyl-ACP thioesterase. The free fatty acid is exported from the plastid and converted to an acyl-CoA. The 18:1 can be esterified to phosphatidylcholine (PC) and up to two more cis double bonds can be added. The newly introduced elongases can utilize substrates in the acyl-CoA pool to add carbons in two-carbon increments. Newly introduced desaturases can utilize either fatty acids esterified to PC, or those in the acyl-CoA pool, depending on the source of the enzyme. One consequence of this scheme for long chain PUFA production, however, is that intermediates or side products in the pathway accumulate, which often represent the majority of the novel fatty acids in the plant oil, rather than the target long chain PUFA.
- For example, using the standard or classical pathway as described above, when the target PUFA product (i.e., the PUFA product that one is targeting for production, trying to produce, attempting to produce, by using the standard pathway) is DHA or EPA, for example (e.g., produced using elongases and desaturases that will produce the DHA or EPA from the products of the FAS system), a variety of intermediate products and side products will be produced in addition to the DHA or EPA, and these intermediate or side products frequently represent the majority of the products produced by the pathway, or are at least present in significant amounts in the lipids of the production organism. Such intermediate and side products include, but are not limited to, fatty acids having fewer carbons and/or fewer double bonds than the target, or primary PUFA, and can include unusual fatty acid side products that may have the same number of carbons as the target or primary PUFA, but which may have double bonds in unusual positions. By way of example, in the production of EPA using the standard pathway (e.g., see U.S. Patent Application Publication 2004/0172682), while the target PUFA of the pathway is EPA (i.e., due to the use of elongases and desaturases that specifically act on the products of the FAS system to produce EPA), the oils produced by the system include a variety of intermediate and side products including: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9) and various other intermediate or side products, such as 20:0; 20:1 (Δ5); 20:1 (Δ11): 20:2 (Δ8,11); 20:2 (Δ11,14); 20:3 (Δ5,11,14); 20:3 (Δ11,14,17); mead acid (20:3; Δ5,8,11); or 20:4 (Δ5,1,14,17). Intermediates of the system can also include long chain PUFAs that are not the target of the genetic modification (e.g., a standard pathway enzyme system for producing DHA can actually produce more EPA as an intermediate product than DHA).
- In contrast, the PUFA PKS synthase of the present invention does not utilize the fatty acid products of FAS systems. Instead, it produces the final PUFA product (the primary PUFA product) from the same small precursor molecule that is utilized by FASB and elongases (malonyl-CoA). Therefore, intermediates in the synthesis cycle are not released in any significant amount, and the PUFA product (also referred to herein as the primary PUFA product) is efficiently transferred to phospholipids (PL) and triacylglycerol (TAG) fractions of the lipids. Indeed, a PUFA PKS system may produce two target or primary PUFA products (e.g., the PUFA PKS system from Schizochytrium produces both DHA and DPA n-6 as primary products), but DPA is not an intermediate in the pathway to produce DHA. Rather, each is a separate product of the same PUFA PKS system. Therefore, the PUFA PKS genes of the present invention are an excellent means of producing oils containing PUFAs, and particularly, LCPUFAs in a heterologous host, such as a plant, wherein the oils are substantially free (defined below) of the intermediates and side products that contaminate oils produced by the “standard” PUFA pathway.
- Therefore, it is an object of the present invention to produce, via the genetic manipulation of plants as described herein, polyunsaturated fatty acids and, by extension, oils obtained from such plants (e.g., obtained from the oil seeds of such plants) comprising these PUFAs. Examples of PUFAs that can be produced by the present invention include, but are not limited to, DHA (docosahexaenoic acid (C22:6, n-3)), ARA (eicosatetraenoic acid or arachidonic acid (C20:4, n-6)), DPA (docosapentaenoic acid (C22:5, n-6 or n-3)), and EPA (eicosapentaenoic acid (C20:5, n-3)). The present invention allows for the production of commercially valuable lipids enriched in one or more desired (target or primary) PUFAs by the present inventors' development of genetically modified plants through the use of the polyketide synthase system of the present invention, as well as components thereof, that produces PUFAs.
- According to the present invention, reference to a “primary PUFA”, “target PUFA”, “intended PUFA”, or “desired PUFA” refers to the particular PUFA or PUFAs that are the intended product of the enzyme pathway that is used to produce the PUFA(s). For example, when using elongases and desaturases to modify products of the FAS system in the classical pathway for PUFA production, one can select particular combinations of elongases and desaturases that, when used together, will produce a target or desired PUFA (e.g., DHA or EPA). As discussed above, such target or desired PUFA produced by the standard pathway may not actually be a “primary” PUFA in terms of the amount of PUFA as a percentage of total fatty acids produced by the system, due to the formation of intermediates and side products that can actually represent the majority of products produced by the system. However, one may use the term “primary PUFA” even in that instance to refer to the target or intended PUFA product produced by the elongases or desaturases used in the system.
- In contrast to the classical pathway for PUFA production, when using a PUFA PKS system, a given PUFA PKS system derived from a particular organism (or created from combining proteins and domains from PUFA PKS systems) will produce particular PUFA(s), such that selection of a PUFA PKS system from a particular organism will result in the production of specified target or primary PUFAs. For example, use of a PUFA PKS system from Schizochytrium according to the present invention will result in the production of DHA and DPAn-6 as the target or primary PUFAs. However, as discussed above, the use of various proteins and domains with proteins and domains from other PUPA PKS systems or other PKS systems (that produce bioactive molecules other than PUFAs) can be combined (“mixed and matched”) to result in the production of different PUFA profiles.
- When using a PUFA PKS system of the present invention, oils produced by the organism, such as a plant, are substantially free of intermediate or side products that are not the target or primary PUFA products and that are not naturally produced by the endogenous FAS system in the wild-type organism (e.g., wild-type plants produce some shorter or medium chain PUFAs, such as 18 carbon PUFAs, via the FAS system, but there will be new, or additional, fatty acids produced in the plant as a result of genetic modification with a PUFA PKS system). In other words, as compared to the profile of total fatty acids from the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification, the majority of additional fatty acids in the profile of total fatty acids produced by plants that have been genetically modified with the PUFA PKS system of the present invention (or a component thereof), comprise the target or intended PUFA products of the PUFA PKS system (i.e., the majority of additional fatty acids in the total fatty acids that are produced by the genetically modified plant are the target PUFA(s)).
- According to the present invention, reference to “intermediate products” or “side products” of an enzyme system that produces PUFAs refers to any products, and particularly, fatty acid products, that are produced by the enzyme system as a result of the production of the target or primary PUFA of the system. Intermediate and side products are particularly significant in the standard pathway for PUFA synthesis and are substantially less significant in the PUFA PKS pathway, as discussed above. It is noted that a primary or target PUFA of one enzyme system may be an intermediate of a different enzyme system where the primary or target product is a different PUFA, and this is particularly true of products of the standard pathway of PUFA production, since the PUFA PKS system of the present invention substantially avoids the production of intermediates. For example, when using the standard pathway to produce EPA, fatty acids such as GLA, DGLA and SDA are produced as intermediate products in significant quantities (e.g., U.S. Patent Application Publication 2004/0172682 illustrates this point). Similarly, and also illustrated by U.S. Patent Application Publication 2004/0172682, when using the standard pathway to produce DHA, in addition to the fatty acids mentioned above, ETA and EPA (notably the target PUFA in the first example above) are produced in significant quantities and in fact, may be present in significantly greater quantities relative to the total fatty acid product than the target PUFA itself. This latter point is shown in U.S. Patent Application Publication 2004/0172682, where a plant that was engineered to produce DHA by the standard pathway produces more EPA as a percentage of total fatty acids than DHA.
- Furthermore, to be “substantially free” of intermediate or side products of the system for synthesizing PUFAs, or to not have intermediate or side products present in substantial amounts, means that any intermediate or side product fatty acids that are produced in the genetically modified plant (and/or parts of plants and/or seed oil fraction) as a result of the enzyme system for producing PUFAS (i.e., that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic modification), are present in a quantity that is less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% by weight of the total fatty acids produced by the plant.
- In a preferred embodiment, to be “substantially free” of intermediate or side products of the system for synthesizing PUFAs, or to not have intermediate or side products present in substantial amounts, means that any intermediate or side product fatty acids that are produced in the genetically modified plant (and/or parts of plants and/or seed oil fraction) as a result of the enzyme system for producing PUFAS (i.e., that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic modification), are present in a quantity that is less than about 10% by weight of the total additional fatty acids produced by the plant (additional fatty acids being those that are not produced by the wild-type plant or the parent plant used as a recipient for the indicated genetic modification), and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of the total additional fatty acids produced by the plant. Therefore, in contrast to the fatty acid profile of plants that have been genetically modified to produce PUFAs via the standard pathway, the majority of fatty acid products resulting from the genetic modification with a PUFA PKS system will be the target or intended fatty acid products.
- When the target product of a PUFA PKS system is a long chain PUFA, such as DHA or DPA (n-6 or n-3) produced by the PUFA PKS system of the invention described herein, intermediate products and side products that are not present in substantial amounts in the total lipids of plants genetically modified with such PUFA PKS can include, but are not limited to: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9) and various other intermediate or side products, such as 20:0; 20:1 (Δ5); 20:1 (Δ11); 20:2 (Δ8,11); 20:2 (Δ11,14); 20:3 (Δ5,11,14); 20:3 (Δ11,14,17); mead acid (20:3; Δ5,8,11); or 20:4 (Δ5,1,14,17). In addition, when the target product is a particular PUFA, such as DHA, the intermediate products and side products that are not present in substantial amounts in the total lipids of the genetically modified plants also include other PUFAs, including other PUFAs that are a natural product of a different PUFA PKS system, such as EPA in this example. It is to be noted that the PUFA PKS system of the present invention can also be used, if desired, to produce as a target PUFA a PUFA that can include GLA, SDA or DGLA (referring to embodiments where oils are produced using components of a PUFA PKS system described herein).
- Using the knowledge of the genetic basis and domain structure of the PUFA PKS system described herein, the present inventors have designed and produced constructs encoding such a PUFA PKS system and have successfully produced transgenic plants expressing the PUFA PKS system. The transgenic plants produce oils containing PUFAs, and the oils are substantially free of intermediate products that accumulate in a standard PUFA pathway (see Example 3). The present inventors have also demonstrated the use of the constructs to produce PUFAs in another eukaryote, yeast, as a proof-of-concept experiment prior to the production of the transgenic plants (see Example 2). The examples demonstrate that transformation of both yeast and plants with a PUFA PKS system that produces DHA and DPAn-6 as the target PUFAs produces both of these PUFAs as the primary additional fatty acids in the total fatty acids of the plant (i.e., subtracting fatty acids that are produced in the wild-type plant), and in the yeast and further, that any other fatty acids that are not present in the fatty acids of the wild-type plant are virtually undetectable. Specific characteristics of genetically modified plants and parts and oils thereof of the present invention are described in detail elsewhere herein.
- Accordingly, one embodiment of the present invention is a method to produce desired bioactive molecules (also referred to as products or compounds) by growing or culturing a genetically modified microorganism or a genetically modified plant of the present invention (described in detail above). Such a method includes the step of culturing in a fermentation medium or growing in a suitable environment, such as soil, a microorganism or plant, respectively, that has a genetic modification as described previously herein and in accordance with the present invention. In a preferred embodiment, method to produce bioactive molecules of the present invention includes the step of culturing under conditions effective to produce the bioactive molecule a genetically modified organism that expresses a PKS system comprising at least one biologically active domain of a polyunsaturated fatty acid (PUFA) polyketide synthase (PKS) system as described herein.
- In the method of production of desired bioactive compounds of the present invention, a genetically modified microorganism is cultured or grown in a suitable medium, under conditions effective to produce the bioactive compound. An appropriate, or effective, medium refers to any medium in which a genetically modified microorganism of the present invention, when cultured, is capable of producing the desired product. Such a medium is typically an aqueous medium comprising assimilable carbon, nitrogen and phosphate sources. Such a medium can also include appropriate salts, minerals, metals and other nutrients. Microorganisms of the present invention can be cultured in conventional fermentation bioreactors. The microorganisms can be cultured by any fermentation process which includes, but is not limited to, batch, fed-batch, cell recycle, and continuous fermentation. Preferred growth conditions for potential host microorganisms according to the present invention are well known in the art. The desired bioactive molecules produced by the genetically modified microorganism can be recovered from the fermentation medium using conventional separation and purification techniques. For example, the fermentation medium can be filtered or centrifuged to remove microorganisms, cell debris and other particulate matter, and the product can be recovered from the cell-free supernatant by conventional methods, such as, for example, ion exchange, chromatography, extraction, solvent extraction, membrane separation, electrodialysis, reverse osmosis, distillation, chemical derivatization and crystallization. Alternatively, microorganisms producing the desired compound, or extracts and various fractions thereof, can be used without removal of the microorganism components from the product.
- In the method for production of desired bioactive compounds of the present invention, a genetically modified plant is cultured in a fermentation medium or grown in a suitable medium such as soil. An appropriate, or effective, fermentation medium has been discussed in detail above. A suitable growth medium for higher plants includes any growth medium for plants, including, but not limited to, soil, sand, any other particulate media that support root growth (e.g. vermiculite, perlite, etc.) or Hydroponic culture, as well as suitable light, water and nutritional supplements which optimize the growth of the higher plant. The genetically modified plants of the present invention are engineered to produce significant quantities of the desired product through the activity of the PKS system that is genetically modified according to the present invention. The compounds can be recovered through purification processes which extract the compounds from the plant. In a preferred embodiment, the compound is recovered by harvesting the plant. In this embodiment, the plant can be consumed in its natural state or further processed into consumable products.
- Bioactive molecules, according to the present invention, include any molecules (compounds, products, etc.) that have a biological activity, and that can be produced by a PKS system that comprises at least one amino acid sequence having a biological activity of at least one functional domain of a non-bacterial PUFA PKS system as described herein. Such bioactive molecules can include, but are not limited to: a polyunsaturated fatty acid (PUFA), an anti-inflammatory formulation, a chemotherapeutic agent, an active excipient, an osteoporosis drug, an anti-depressant, an anti-convulsant, an anti-Heliobactor pylori drug, a drug for treatment of neurodegenerative disease, a drug for treatment of degenerative liver disease, an antibiotic, and a cholesterol lowering formulation. One advantage of the non-bacterial PUFA PKS system of the present invention is the ability of such a system to introduce carbon-carbon double bonds in the cis configuration, and molecules including a double bond at every third carbon. This ability can be utilized to produce a variety of compounds.
- With respect to microorganisms, preferably, bioactive compounds of interest are produced by the genetically modified microorganism in an amount that is greater than about 0.05%, and preferably greater than about 0.1%, and more preferably greater than about 0.25%, and more preferably greater than about 0.5%, and more preferably greater than about 0.75%, and more preferably greater than about 1%, and more preferably greater than about 2.5%, and more preferably greater than about 5%, and more preferably greater than about 10%, and more preferably greater than about 15%, and even more preferably greater than about 20% of the dry weight of the microorganism. For lipid compounds, preferably, such compounds are produced in an amount that is greater than about 5% of the dry weight of the microorganism. For other bioactive compounds, such as antibiotics or compounds that are synthesized in smaller amounts, those strains possessing such compounds at of the dry weight of the microorganism are identified as predictably containing a novel. PKS system of the type described above. In some embodiments, particular bioactive molecules (compounds) are secreted by the microorganism, rather than accumulating. Therefore, such bioactive molecules are generally recovered from the culture medium and the concentration of molecule produced will vary depending on the microorganism and the size of the culture.
- Preferably, a genetically modified organism (e.g., microorganism or plant) of the invention produces one or more polyunsaturated fatty acids including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), DPA (C22:5, n-6 or n-3), ARA (C20:4, n-6), GLA (C18:3, n-6), ALA (C18:3, n-3), and/or SDA (C18:4, n-3)), and more preferably, one or more long chain fatty acids, including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), DPA (C22:5, n-6 or n-3), or DTA (C22:4, n-6). In a particularly preferred embodiment, a genetically modified organism of the invention produces one or more polyunsaturated fatty acids including, but not limited to, EPA (C20:5, n-3), DHA (C22:6, n-3), and/or DPA (C22:5, n-6 or n-3).
- Preferably, a genetically modified organism of the invention produces at least one PUFA (the target PUPA), wherein the total fatty acid profile in the organism (or a part of the organism that accumulates PUFAs, such as mature seeds or oil from such seeds, if the organism is an oil seed plant), comprises a detectable amount of this PUFA or PUFAs. Preferably, the PUFA is at least a 20 carbon PUFA and comprises at least 3 double bonds, and more preferably at least 4 double bonds, and even more preferably, at least 5 double bonds. In one embodiment, the PUFA is a PUFA that is not naturally produced by the organism (i.e., the wild-type organism in the absence of genetic modification or the parent organism used as a recipient for the indicated genetic modification).
- Preferably, the total fatty acid profile in the organism (or part of the organism that accumulates PUFAs) comprises at least 0.1% of the target PUFA(s) by weight of the total fatty acids, and more preferably at least about 0.2%, and more preferably at least about 0.3%, and more preferably at least about 0.4%, and more preferably at least about 0.5%, and more preferably at least about 1%, and more preferably at least about 2%, and more preferably at least about 3%, and more preferably at least about 4%, and more preferably at least about 5%, and more preferably at least about 10%, and more preferably at least about 15%, and more preferably at least about 20%, and more preferably at least about 25%, and more preferably at least about 30%, and more preferably at least about 35%, and more preferably at least about 40%, and more preferably at least about 45%, and more preferably at least about 50%, and more preferably at least about 55%, and more preferably at least about 60%, and more preferably at least about 65%, and more preferably at least about 70%, and more preferably at least about 75%, and more preferably more that 75% of at least one polyunsaturated fatty acid (the target PUFA) by weight of the total fatty acids, or any percentage from 0.1% to 75%, or greater than 75% (up to 100% or about 100%), in 0.1% increments, of the target PUFA(s). As generally used herein, reference to a percentage amount of PUFA production is by weight of the total fatty acids produced by the organism, unless otherwise stated (e.g., in some cases, percentage by weight is relative to the total fatty acids produced by an enzyme complex, such as a PUFA PKS system). In one embodiment, total fatty acids produced by a plant are presented as a weight percent as determined by gas chromatography (GC) analysis of a fatty acid methyl ester (FAME) preparation.
- As described above, it is an additional characteristic of the total fatty acids produced by a plant (and/or parts of plants or seed oil fraction) that has been genetically modified to express a PUFA PKS of the present invention that these total fatty acids produced by the plant comprise less than about 10% by weight of any fatty acids other than the target PUFA(s) that are produced by the enzyme complex that produces the target PUFA(s) (e.g., DHA and DPAn-6 are the target PUFAs if the entire PUFA PKS system of the invention is used). Preferably, any fatty acids that are produced by the enzyme complex that produces the target PUFA(s) other than the target PUFA(s) are present at less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% by weight of the total fatty acids produced by the plant.
- In another embodiment, any fatty acids that are produced by the enzyme complex that produces the target PUFA(s) other than the target PUFA(s) are present at less than about 10% by weight of the total fatty acids that are produced by the enzyme complex that produces the target PUFA(s) in the plant (i.e., this measurement is limited to those total fatty acids that are produced by the enzyme complex that produces the target PUFAs), and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% by weight of the total fatty acids that are produced by the enzyme complex that produces the target PUFA(s) in the plant.
- In another aspect of this embodiment of the invention, the total fatty acids produced by the plant (and/or parts of plants or seed oil fraction) contain less than (or do not contain any more than) 10% PUFAs having 18 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification. In further aspects, the total fatty acids produced by the plant (and/or parts of plants or seed oil fraction) contain less than 9% PUFAs having 18 or more carbons, or less than 8% PUFAs having 18 or more carbons, or less than 7% PUFAs having 18 or more carbons, or less than 6% PUFAs having 18 or more carbons, or less than 5% PUFAs having 18 or more carbons, or less than 4% PUFAs having 18 or more carbons, or less than 3% PUFAs having 18 or more carbons, or less than 2% PUFAs having 18 or more carbons, or less than 1% PUFAs having 18 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification.
- In another aspect of this embodiment of the invention, the total fatty acids produced by the plant (and/or parts of plants or seed oil fraction) contain less than (or do not contain any more than) 10% PUFAs having 20 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification. In further aspects, the total fatty acids produced by the plant (and/or parts of plants or seed oil fraction) contain less than 9% PUFAs having 20 or more carbons, or less than 8% PUFAs having 20 or more carbons, or less than 7% PUFAs having 20 or more carbons, or less than 6% PUFAs having 20 or more carbons, or less than 5% PUFAs having 20 or more carbons, or less than 4% PUFAs having 20 or more carbons, or less than 3% PUFAs having 20 or more carbons, or less than 2% PUFAs having 20 or more carbons, or less than 1% PUFAs having 20 or more carbons by weight of the total fatty acids produced by the plant, other than the target PUFA(s) or the PUFAs that are present in the wild-type plant (not genetically modified) or the parent plant used as a recipient for the indicated genetic modification.
- In one embodiment, the total fatty acids in the plant (and/or parts of plants or seed oil fraction) contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of a fatty acid selected from any one or more of: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid (ETA; 20:3, n-9) and various other fatty acids, such as 20:0; 20:1 (Δ5); 20:1 (Δ11); 20:2 (Δ8,11); 20:2 (Δ11,14); 20:3 (Δ5,11,14); 20:3 (Δ11,14,17); mead acid (20:3; Δ5,8,11); or 20:4 (Δ5,1,14,17).
- In another embodiment, the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of a fatty acid selected from: gamma-linolenic acid (GLA; 18:3, n-6); stearidonic acid (STA or SDA; 18:4, n-3); dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6), arachidonic acid (ARA, C20:4, n-6); eicosatrienoic acid
- (ETA; 20:3, n-9) and various other fatty acids, such as 20:0; 20:1 (Δ5); 20:1 (Δ11); 20:2 (Δ8,11); 20:2 (Δ11,14); 20:3 (Δ5,11,14); 20:3 (Δ11,14,17); mead acid (20:3; Δ5,8,11); or 20:4 (Δ5,1,14,17).
- In another embodiment, the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- In another embodiment, the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of each of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- In another embodiment, the fatty acids that are produced by the enzyme system that produces the long chain PUFAs in the plant contain less than about 10% by weight of the total fatty acids produced by the plant, and more preferably less than about 9%, and more preferably less than about 8%, and more preferably less than about 7%, and more preferably less than about 6%, and more preferably less than about 5%, and more preferably less than about 4%, and more preferably less than about 3%, and more preferably less than about 2%, and more preferably less than about 1% of any one or more of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
- In one aspect of this embodiment of the invention, a genetically modified plant produces at least two target PUFAs (e.g., DHA and DPAn-6), and the total fatty acid profile in the plant, or the part of the plant that accumulates PUFAs (including oils from the oil seeds), comprises a detectable amount of these PUFAs. In this embodiment, the PUFAs are preferably each at least a 20 carbon PUFA and comprise at least 3 double bonds, and more preferably at least 4 double bonds, and even more preferably, at least 5 double bonds. Such PUFAs are most preferably chosen from DHA, DPAn-6 and EPA. In one aspect, the plant produces DHA and DPAn-6 (the products of a PUFA PKS system described herein), and the ratio of DHA to DPAn-6 is from about 1:10 to about 10:1, including any ratio in between. In a one embodiment, the ratio of DHA to DPA is from about 1:1 to about 3:1, and in another embodiment, about 2.5:1.
- In another aspect of this embodiment of the invention, the plant produces the total fatty acid profile represented by
FIG. 5 . - The invention further includes any seeds produced by the plants described above, as well as any oils produced by the plants or seeds described above. The invention also includes any products produced using the plants, seed or oils described herein.
- One embodiment of the present invention relates to a method to modify an endproduct containing at least one fatty acid, comprising adding to said endproduct an oil produced by a recombinant host cell that expresses at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding at least one biologically active domain of a PUFA PKS system as described herein.
- Preferably, the endproduct is selected from the group consisting of a food, a dietary supplement, a pharmaceutical formulation, a humanized animal milk, and an infant formula. Suitable pharmaceutical formulations include, but are not limited to, an anti-inflammatory formulation, a chemotherapeutic agent, an active excipient, an osteoporosis drug, an anti-depressant, an anti-convulsant, an anti-Heliobactor pylori drug, a drug for treatment of neurodegenerative disease, a drug for treatment of degenerative liver disease, an antibiotic, and a cholesterol lowering formulation. In one embodiment, the endproduct is used to treat a condition selected from the group consisting of: chronic inflammation, acute inflammation, gastrointestinal disorder, cancer, cachexia, cardiac restenosis, neurodegenerative disorder, degenerative disorder of the liver, blood lipid disorder, osteoporosis, osteoarthritis, autoimmune disease, preeclampsia, preterm birth, age related maculopathy, pulmonary disorder, and peroxisomal disorder.
- Suitable food products include, but are not limited to, fine bakery wares, bread and rolls, breakfast cereals, processed and unprocessed cheese, condiments (ketchup, mayonnaise, etc.), dairy products (milk, yogurt), puddings and gelatine desserts, carbonated drinks, teas, powdered beverage mixes, processed fish products, fruit-based drinks, chewing gum, hard confectionery, frozen dairy products, processed meat products, nut and nut-based spreads, pasta, processed poultry products, gravies and sauces, potato chips and other chips or crisps, chocolate and other confectionery, soups and soup mixes, soya based products (milks, drinks, creams, whiteners), vegetable oil-based spreads, and vegetable-based drinks.
- Yet another embodiment of the present invention relates to a method to produce a humanized animal milk. This method includes the steps of genetically modifying milk-producing cells of a milk-producing animal with at least one recombinant nucleic acid molecule comprising a nucleic acid sequence encoding at least one biologically active domain of a PUFA PKS system as described herein.
- Methods to genetically modify a host cell and to produce a genetically modified non-human, milk-producing animal, are known in the art. Examples of host animals to modify include cattle, sheep, pigs, goats, yaks, etc., which are amenable to genetic manipulation and cloning for rapid expansion of a transgene expressing population. For animals, PKS-like transgenes can be adapted for expression in target organelles, tissues and body fluids through modification of the gene regulatory regions. Of particular interest is the production of PUFAs in the breast milk of the host animal.
- Each publication or reference cited herein is incorporated herein by reference in its entirety.
- The following examples are provided for the purpose of illustration and are not intended to limit the scope of the present invention.
- The following example demonstrates that SchizochytriumOrfs A, B and C encode a functional DHA/DPA synthesis enzyme via functional expression in E. coli.
- General Preparation of E. coli Transformants
- The three genes encoding the Schizochytrium PUFA PKS system that produce DHA and DPA (Orfs A, B & C; SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:5, respectively) were cloned into a single E. coli expression vector (derived from pET21c (Novagen)). The genes are transcribed as a single message (by the T7 RNA-polymerase), and a ribosome-binding site cloned in front of each of the genes initiates translation. Modification of the Orf B coding sequence was needed to obtain production of a full-length Orf B protein in E. coli (see below). An accessory gene, encoding a PPTase (see below) was cloned into a second plasmid (derived from pACYC184, New England Biolabs).
- The Orf B gene is predicted to encode a protein with a mass of ˜224 kDa. Initial attempts at expression of the gene in E. coli resulted in accumulation of a protein with an apparent molecular mass of ˜165 kDa (as judged by comparison to proteins of known mass during SDS-PAGE). Examination of the Orf B nucleotide sequence revealed a region containing 15 sequential serine codons—all of them being the TCT codon. The genetic code contains 6 different serine codons, and three of these are used frequently in E. coli. The inventors used four overlapping oligonucleotides in combination with a polymerase chain reaction protocol to resynthesize a small portion of the Orf B gene (a ˜195 base pair, BspHI to SacII restriction enzyme fragment) that contained the serine codon repeat region. In the synthetic Orf B fragment, a random mixture of the 3 serine codons commonly used by E. coli was used, and some other potentially problematic codons were changed as well (i.e., other codons rarely used by E. coli). The BspHI to SacII fragment present in the original Orf B was replaced by the resynthesized fragment (to yield Orf B*) and the modified gene was cloned into the relevant expression vectors. The modified OrfB* still encodes the amino acid sequence of SEQ ID NO:4. Expression of the modified Orf B* clone in E. coli resulted in the appearance of a ˜224 kDa protein, indicating that the full-length product of OrfB was produced. The sequence of the resynthesized Orf B* BspHI to SacII fragment is represented herein as SEQ ID NO:38. Referring to SEQ ID NO:38, the nucleotide sequence of the resynthesized BspHI to SacII region of Orf B is shown. The BspHI restriction site and the SacII restriction site are identified. The BspHI site starts at nucleotide 4415 of the Orf B CDS (SEQ ID NO:3) (note: there are a total of three BspHI sites in the Orf B CDS, while the SacII site is unique).
- The ACP domains of the Orf A protein (SEQ ID NO:2 in Schizochytrium) must be activated by addition of phosphopantetheine group in order to function. The enzymes that catalyze this general type of reaction are called phosphopantetheine transferases (PPTases). E. coli contains two endogenous PPTases, but it was anticipated that they would not recognize the Orf A ACP domains from Schizochytrium. This was confirmed by expressing Orfs A, B* (see above) and C in E. coli without an additional PPTase. In this transformant, no DHA production was detected. The inventors tested two heterologous PPTases in the E. coli PUFA PKS expression system: (1) sfp (derived from Bacillus subtilis) and (2) Het I (from the cyanobacterium Nostoc strain 7120).
- The sfp PPTase has been well characterized and is widely used due to its ability to recognize a broad range of substrates. Based on published sequence information (Nakana, et al., 1992, Molecular and General Genetics 232: 313-321), an expression vector for sfp was built by cloning the coding region, along with defined up- and downstream flanking DNA sequences, into a pACYC-184 cloning vector. Oligonucleotides were used to amplify the region of interest from genomic B. subtilus DNA. The oligonucleotides:
-
(forward; SEQ ID NO: 39) CGGGGTACCCGGGAGCCGCCTTGGCTTTGT; and (reverse; SEQ ID NO: 40) AAACTGCAGCCCGGGTCCAGCTGGCAGGCACCCTG,
were used to amplify the region of interest from genomic B. subtilus DNA. Convenient restriction enzyme sites were included in the oligonucleotides to facilitate cloning in an intermediate, high copy number vector and finally into the EcoRV site of pACYC184 to create the plasmid: pBR301. Examination of extracts of E. coli transformed with this plasmid revealed the presence of a novel protein with the mobility expected for sfp. Co-expression of the sfp construct in cells expressing the Orf A, B*, C proteins, under certain conditions, resulted in DHA production. This experiment demonstrated that sfp was able to activate the SchizochytriumOrf A ACP domains. In addition, the regulatory elements associated with the sfp gene were used to create an expression cassette into which other genes could be inserted. Specifically, the sfp coding region (along with three nucleotides immediately upstream of the ATG) in pBR301 was replaced with a 53 base pair section of DNA designed so that it contains several unique (for this construct) restriction enzyme sites. The initial restriction enzyme site in this region is NdeI. The ATG sequence embedded in this site is utilized as the initiation methionine codon for introduced genes. The additional restriction sites (BglLL, Nod, SmaI, PmelI, HindIII, SpeI and XhoI) were included to facilitate the cloning process. The functionality of this expression vector cassette was tested by using PCR to generate a version of sfp with a NdeI site at the 5′ end and an XhoI site ate the 3′ end. This fragment was cloned into the expression cassette and transferred into E. coli along with the Orf A, B* and C expression vector. Under appropriate conditions, these cells accumulated DHA, demonstrating that a functional sfp had been produced. - Het I is present in a cluster of genes in Nostoc known to be responsible for the synthesis of long chain hydroxy-fatty acids that are a component of a glyco-lipid layer present in heterocysts of that organism (Black and Wolk, 1994, J. Bacteriol. 176, 2282-2292; Campbell et al., 1997, Arch. Microbial. 167, 251-258). Het I activates the ACP domains of a protein, Hgl E, present in that cluster. The two ACP domains of Hgl E have a high degree of sequence homology to the ACP domains found in SchizochytriumOrf A. A Het I expression construct was made using PCR. Specifically, SEQ ID NO:41 represents the amino acid sequence of the Nostoc Het I protein. The endogenous start codon of Het I has not been identified (there is no methionine present in the putative protein). There are several potential alternative start codons (e.g., TTG and ATT) near the 5′ end of the open reading frame. No methionine codons (ATG) are present in the sequence. A Het I expression construct was made by using PCR to replace the furthest 5′ potential alternative start codon (TTG) with a methionine codon (ATG, as part of the above described NdeI restriction enzyme recognition site), and introducing an XhoI site at the 3′ end of the coding sequence. The modified HetI coding sequence was then inserted into the NdeI and XhoI sites of the pACYC184 vector construct containing the sfp regulatory elements. Expression of this Het I construct in E. coli resulted in the appearance of a new protein of the size expected from the sequence data. Co-expression of Het I with SchizochytriumOrfs A, B*, C in E. coli under several conditions resulted in the accumulation of DHA and DPA in those cells. In all of the experiments in which sfp and Het I were compared, more DHA and DPA accumulated in the cells containing the Het I construct than in cells containing the sfp construct.
- The two plasmids encoding: (1) the Schizochytrium PUFA PKS genes (Orfs A, B* and C) and (2) the PPTase (from sfp or from Het I) were transformed into E. coli strain BL21 which contains an inducible T7 RNA polymerase gene. Synthesis of the Schizochytrium proteins was induced by addition of IPTG to the medium, while PPTase expression was controlled by a separate regulatory element (see above). Cells were grown under various defined conditions and using either of the two heterologous PPTase genes. The cells were harvested and the fatty acids were converted to methyl-esters (FAME) and analyzed using gas-liquid chromatography.
- Under several conditions, DHA and DPA were detected in E. coli cells expressing the Schizochytrium PUFA PKS genes, plus either of the two heterologous PPTases (data not shown). No DHA or DPA was detected in FAMEs prepared from control cells (i.e., cells transformed with a plasmid lacking one of the Oils). The ratio of DHA to DPA observed in E. coli approximates that of the endogenous DHA and DPA production observed in Schizochytrium. The highest level of PUFA (DHA plus DPA), representing ˜17% of the total FAME, was found in cells grown at 32° C. in 765 medium (recipe available from the American Type Culture Collection) supplemented with 10% (by weight) glycerol. PUFA accumulation was also observed when cells were grown in Luria Broth supplemented with 5 or 10% glycerol, and when grown at 20° C. Selection for the presence of the respective plasmids was maintained by inclusion of the appropriate antibiotics during the growth, and IPTG (to a final concentration of 0.5 mM) was used to induce expression of Orfs A, B* and C. Co-expression of Het I or sfp with SchizochytriumOrfs A, B*, C in E. coli under several conditions resulted in the accumulation of DHA and DPA in those cells. In all of the experiments in which sfp and Het I were compared, more DHA and DPA accumulated in the cells containing the Het I construct than in cells containing the sfp construct.
- The following example shows the expression of genes encoding the Schizochytrium PUFA synthase (sOrfA, sOrfB and native Orf C) along with Het I in baker's yeast (Saccharomyces cerevisiae).
- The Schizochytrium PUFA synthase genes and Het I were expressed in yeast using materials obtained from Invitrogen (Invitrogen Corporation, Carlsbad, Calif.). The INVsc1 strain of Saccharomyces cerevisiae was used along with the following transformation vectors: pYESLeu (sOrfA), pYES3/CT (sOrfB), pYES2/CT (OrfC) and pYESHis (HetI). To accommodate yeast codon useage, the nucleotide sequences for OrfA (SEQ ID NO:1) and for OrfB (SEQ ID NO:3) were resynthesized. The nucleotide sequence for the resynthesized OrfA (contained in pYESLeu), designated sOrfA, is represented herein by SEQ ID NO:43. SEQ ID NO:43 still encodes the OrfA amino acid sequence of SEQ ID NO:2. The nucleotide sequence for the resynthesized OrfB (contained in pYES3/CT), designated sOrfB, is represented herein by SEQ ID NO:44. SEQ ID NO:44 still encodes the OrfB amino acid sequence of SEQ ID NO:4. The OrfC nucleotide sequence used in these experiments (contained in pYES2/CT) is the wild-type OrfC, represented by SEQ ID NO:5, and encoding SEQ ID NO:6.
- Some of the vectors were modified to accommodate specific cloning requirements (e.g., restriction sites for cloning). Appropriate selection media were used (as specified by Invitrogen), depending on the particular experiment. The genes were cloned, in each case, behind a GAL1 promoter and expression was induced by re-suspension of washed cells in media containing galactose according to guidelines provide by Invitrogen. Cells were grown at 30° C. and harvested (by centrifugation) after being transferred to the induction medium. The cell pellets were freeze dried and FAMEs were prepared using acidic methanol, extracted into hexane and analyzed by GC.
- A comparison of the fatty acid profile from yeast cells expressing the Schizochytrium PUFA synthase system (sOrfA, sOrfB, OrfC and Het I) and one obtained from control cells (lacking the sOrfA gene) collected ˜20 hrs after induction, showed that two novel FAME peaks have appeared in the profile of the strain expressing the complete PUFA synthase system (
FIG. 3 ). These two peaks were identified as DPAn-6 and DHA by comparison of the elution time with authentic standards and subsequently by MS analyses. As predicted from the characterization of the Schizochytrium PUPA synthase, aside from DHA and DPAn-6, no other novel peaks are evident in the profile.FIG. 4 shows the region of the GC chromatogram ofFIG. 3 which contains the PUFA FAMEs. Both the control cells and the cell expressing the PUFA synthase contain a peak that elutes near the DHA FAME. This has been identified as C26:0 FAME and (based on literature references) is derived from sphingolipids. Although it elutes close to the DHA peak, the resolution is sufficient so that it does not interfere with the quantitation of DHA. The DPA n-6 peak is well separated from other endogenous yeast lipids in the FAME profile. In this particular example, the cells expressing the Schizochytrium PUFA synthase system accumulated 2.4% DHA and 2.0% DPA n-6 (as a percentage of the total FAMEs). The sum of DHA and DPA n-6=4.4% of the measured fatty acids in the cells. The ratio of DHA to DPA n-6 observed in the cells was ˜1.2:1. - The following examples describes the expression of genes encoding the Schizochytrium PUFA synthase (Orf A, Orf B* and Orf C) along with Het I in Arabidopsis.
- The Schizochytrium Orfs A, B* (see Example 1) and C along with Het I were cloned (separately or in various combinations including all 4 genes on one Super-construct) into the appropriate binary vectors for introduction of the genes into plants. Each gene was cloned behind a linin promoter and was followed by a linin terminator sequence (Chaudhary et al., 2001; PCT Publication Number No. WO 01/16340 A1). For localization of the PUFA synthase in the cytoplasm of plant cells, no additional protein encoding sequences were appended to the 5′ end of the Orfs. For directing the proteins to the plastid, additional 5′ sequences encoding a plastid targeting sequence derived from a Brassica napus acyl-ACP thioesterase were added to the Orfs. The amino acid sequence of the encoded targeting peptide is: MLKLSCNVTNHLHTFSFFSDSSLFIPVNRRTLAVS (SEQ ID NO:42). The nucleotide sequences encoding this peptide were placed in frame with the start methionine codons of each PUFA synthase Orf as well as the start codon of Het I.
- More specifically, for one experiment described herein, the constructs and plants were prepared as follows:
- Construction of pSBS4107: Acyl-ACP Transit Peptide-HetI: Acyl-ACP Transit Peptide-ORFC
- This plant binary vector contained a double expression cassette which targeted the co-expression of HetI (SEQ ID NO:41) and ORFC (SEQ ID NO:6) to the plastid. The first expression cassette began with a signal peptide (SEQ ID NO:42) derived from an acyl-ACP thioesterase gene from Brassica juncea (GenBank Accession No. AJ294419) to target expression of the polypeptides to the plastid. The signal peptide was synthesized from two overlapping oligos with an engineered AfiIII site at the 5′ end and an NcoI/Swal/XmaI multiple cloning site at the 3′ end. Immediately downstream was a sequence encoding for PPTase from Nostoc, encoded by HetI, to enable DHA to bind phosphopantetheine attachment sites. The second expression cassette also began with the acyl-ACP signal peptide followed immediately in-frame with a cDNA encoding ORFC (SEQ ID NO:5).
- The backbone of this plasmid, pSBS4055, was based on the plant binary vector, pPZP200, described by Hajdukiewicz et al. (Plant Molecular Biology, 1994, 25:989-994). In place of the described multiple cloning site, a pat gene conferring host plant phosphinothricine resistance (Wohlleben et al., 1988, Gene 70:25-37) driven by the ubiquitin promoter/terminator from Petroselinum crispum (Kawalleck et al., 1993, Plant. Mol. Bio., 21:673-684), was inserted between the left and right border sequences. In addition to this cassette, two separate Linin promoter/terminators in tandem from Linum usitatissumum (Flax or Linseed) (Chaudhary et al., 2001; PCT Publication Number No. WO 01/16340 A1) were used to drive expression of ACP-HetI and ACP-ORFC. Standard restriction cloning was used to fuse the synthetic Acyl-ACP signal peptide in-frame with cDNAs encoding for either HetI or ORFC using NcoI/XmaI and NcoI/SwaI restriction endonuclease sites, respectively, to the 3′ end of the Linin promoter. The result was plasmid pSBS4107: a DNA sequence encoding the Acyl-ACP signal peptide-HetI and Acyl-ACP signal peptide-ORFC polypeptides being placed in a binary vector under expression control of the linin promoter/terminator. The linin promoter controls the specific-temporal and tissue-specific expression of the transgene during seed development. The Acyl-ACP signal peptide targets the expression of the protein to the plastid (Loader et al., 1993, Plant Mol Biol 23:769-778). The complete plasmid map with annotated elements is shown in
FIG. 6 . - Construction of pSBS5720: Acyl-ACP Transit Peptide-ORFB
- This plant binary vector contained an expression cassette which targeted the expression of ORFB (SEQ ID NO:4) to the plastid. Again, the expression cassette began with a signal peptide derived from an acyl-ACP thioesterase gene from Brassica juncea (SEQ ID NO:42) to target expression of the polypeptide to the plastid. The signal peptide was synthesized as above. Immediately downstream was a cDNA sequence encoding for ORFB (SEQ ID NO:3, except for the resynthesized BspHI to SacII region of Orf B, represented by SEQ ID NO:38; see Example 1 above).
- The backbone of this plasmid, pSBS4055, was based on the plant binary vector, pPZP200, described by Hajdukiewicz et al. (Plant Molecular Biology, 1994, 25:989-994). In place of the described multiple cloning site, a phosphomannose isomerase (PMI) gene conferring host plant positive selection for mannose-6-phosphate driven by the ubiquitin promoter/terminator from Petroselinum crispum (Kawalleck et al., 1993, Plant. Mol. Bio., 21:673-684), was inserted between the left and right border sequences. In addition to this cassette, a Linin promoter/terminator from Linum usitatissumum (Flax or Linseed) (Chaudhary et al., 2001; PCT Publication Number WO 01/16340 A1) was used to drive expression of ACP-ORFB. Standard restriction cloning was used to fuse the synthetic Acyl-ACP signal peptide in-frame with cDNAs encoding for ORFB, to the 3′ end of the Linin promoter. The result was plasmid pSBS5720: a DNA sequence encoding the Acyl-ACP signal peptide-ORFB polypeptide being placed in a binary vector under expression control of the linin promoter/terminator. The linin promoter controls the specific-temporal and tissue-specific expression of the transgene during seed development. The Acyl-ACP signal peptide targets the expression of the protein to the plastid (Loader et al., 1993, Plant Mol Biol 23:769-778). The complete plasmid map with annotated elements is shown in
FIG. 7 . - Construction of pSBS4757: Acyl-ACP Transit Peptide-ORFA
- This plant binary vector contained an expression cassette which targeted the expression of ORFA (SEQ ID NO:2) to the plastid. Again the expression cassette began with a signal peptide derived from an acyl-ACP thioesterase gene from Brassica juncea (SEQ ID NO:42) to target expression of the polypeptide to the plastid. The signal peptide was synthesized as above. Immediately downstream was a cDNA sequence encoding for ORFA (SEQ ID NO:1).
- The backbone of this plasmid, pSBS4055, was based on the plant binary vector, pPZP200, described by Hajdukiewicz et al. (Plant Molecular Biology, 1994, 25:989-994). In place of the described multiple cloning site, a neomycin phosphotransferase (nptII) gene conferring host plant Kanamycin resistance driven by the mannopine synthase promoter/terminator, was inserted between the left and right border sequences. In addition, a Linin promoter/terminator from Linum usitatissumum (Flax or Linseed) (Chaudhary et al., 2001; PCT Publication Number WO 01/16340 A1) were used to drive expression of ACP-ORFA. Standard restriction cloning was used to fuse the synthetic Acyl-ACP signal peptide in-frame with a cDNA encoding for ORFA to the 3′ end of the Linin promoter. The result was plasmid pSBS4757: a DNA sequence encoding the Acyl-ACP signal peptide-ORFA polypeptide being placed in a binary vector under expression control of the linin promoter/terminator. The linin promoter controls the specific-temporal and tissue-specific expression of the transgene during seed development. The Acyl-ACP signal peptide targets the expression of the protein to the plastid (Loader et al., 1993, Plant Mol Biol 23:769-778). The complete plasmid map with annotated elements is shown in
FIG. 8 . - Standard methods were used for introduction of the genes into Arabidopsis (floral dipping into suspension of Agrobacterium strains containing the appropriate vectors; see Clough and Bent, Plant J.; 16(6):735-43, 1998 December). Seeds obtained from those plants were plated on selective medium and allowed to germinate. Some of the plants that grew were taken to maturity and the seeds analyzed for PUFA content. Based on PUFA content some of those seeds were taken forward to the next generation. Pooled seeds obtained from those plants were analyzed for their fatty acid content. Analysis of a plant line transformed with the constructs specifically described above and denoted 269 is described in more detail below.
- The top panel of
FIG. 5 shows the typical fatty acid profile of wild type Arabidopsis seeds as represented by GC separation and FID detection of FAMEs prepared from a pooled seed sample. The predominant fatty acids of wild type Arabidopsis seeds as represented by GC separation and FID detection of FAMEs prepared from a pooled seed sample are: 16:0, 18:0, 16:1, 18:1, 20:1, 20:2 and 22:1. No DHA or DPA n-6 are present in the samples from wild type seed. The lower panel ofFIG. 5 shows the fatty acid profile of a pooled seed sample from one of the transgenic Arabidopsis lines (line 269) expressing the Schizochytrium PUFA synthase genes and Het I gene. The proteins expressed from these transgenes contain plastid targeting sequences. Two FAME peaks are present in the profile from the transgenic plant seeds that are not present in the profile from wild type seeds. The elution pattern of these two peaks exactly corresponds to the elution of authentic DHA and DPA n-6 (using FAMEs prepared from Schizochytrium oil as standards, as well as a commercially purchased DHA standard from NuCheck Prep). In this particular example, the DHA peak represents 0.8% of total calculated FAMEs while the DPA n-6 peak represents 1.7%. The sum of novel PUFAs is 2.5% of total FAMEs. The appearance of DHA and DPA n-6 in the seed fatty acid profile demonstrates that introduced Schizochytrium PUFA synthase system functions when expressed in the plant cell and the proteins are targeted to the plastid. - While various embodiments of the present invention have been described in detail, it is apparent that modifications and adaptations of those embodiments will occur to those skilled in the art. It is to be expressly understood, however, that such modifications and adaptations are within the scope of the present invention, as set forth in the following claims.
Claims (18)
1-16. (canceled)
17. An isolated nucleic acid molecule comprising a nucleic acid sequence selected from the group consisting of:
a) a nucleic acid sequence of SEQ ID NO:29;
b) a nucleic acid sequence encoding an amino acid sequence of SEQ ID NO:30; and
c) a nucleic acid sequence encoding an amino acid sequence that is at least 90% identical to SEQ ID NO:30 or that is a fragment of SEQ ID NO:30, wherein the amino acid sequence has FabA-like β-hydroxy acyl-ACP dehydrase (DH) activity, and wherein said amino acid sequence comprises the sequence of H-G-I-A-N-P-T-F-V-H-A-P-G-K-I (positions 876-890 of SEQ ID NO:6) at positions corresponding to amino acids 426-440 of SEQ ID NO:30.
18-45. (canceled)
46. A plant or a part of the plant, wherein the total fatty acid profile in the plant or part of the plant comprises detectable amounts of DHA (docosahexaenoic acid (C22:6, n-3)) and DPA (docosapentaenoic acid (C22:5, n 6).
47. The plant or a part of the plant of claim 46 , wherein the ratio of DPAn-6 to DHA is less than 1:1.
48. The plant or a part of the plant of claim 46 , wherein the total fatty acid profile in the plant or part of the plant contains less than 5% by weight in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
49. The plant or a part of the plant of claim 46 , wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from the group consisting of DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 5% in total of all of the following PUFAs: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
50. The plant or a part of the plant of claim 46 , wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from the group consisting of DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
51. The plant or a part of the plant of claim 46 , wherein the total fatty acid profile in the plant or part of the plant comprises at least about 0.5% by weight of at least one polyunsaturated fatty acid (PUFA) selected from the group consisting of DHA (C22:6n-3) and DPAn-6 (C22:5n-6), and wherein the total fatty acid profile in the plant or part of the plant contains less than 2% of gamma-linolenic acid (GLA; 18:3, n-6) and dihomo-gamma-linolenic acid (DGLA or HGLA; 20:3, n-6).
52. Seeds obtained from the plant or part of the plant of claim 46 .
53. A food product comprising the seeds of claim 52 .
54. An oil obtained from seeds of the plant of claim 46 any.
55. A food product comprising the oil of claim 54 .
56. An oil blend comprising the oil of claim 54 and another oil.
57-59. (canceled)
60. The oil of claim 54 , comprising the following fatty acids: DHA (C22:6n-3), DPAn-6 (C22:5n-6), oleic acid (C18:1), linolenic acid (C18:3), linoleic acid (C18:2), C16:0, C18:0, C20:0, C20:1n-9, C20:2n-6, C22:1n-9; wherein the oil comprises less than 0.5% of any of the following fatty acids: gamma-linolenic acid (GLA; 18:3, n-6), PUFAs having 18 carbons and four carbon-carbon double bonds, PUFAs having 20 carbons and three carbon-carbon double bonds, and PUFAs having 22 carbons and two or three carbon-carbon double bonds.
61-66. (canceled)
67. The plant or a part of the plant of claim 46 , wherein the ratio of DPAn-6 to DHA is 1:1 or greater than 1:1.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/771,840 US20140053299A1 (en) | 1999-01-14 | 2013-02-20 | PUFA Polyketide Synthase Systems and Uses Thereof |
Applications Claiming Priority (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/231,899 US6566583B1 (en) | 1997-06-04 | 1999-01-14 | Schizochytrium PKS genes |
US28406601P | 2001-04-16 | 2001-04-16 | |
US29879601P | 2001-06-15 | 2001-06-15 | |
US32326901P | 2001-09-18 | 2001-09-18 | |
US10/124,800 US7247461B2 (en) | 1999-01-14 | 2002-04-16 | Nucleic acid molecule encoding ORFA of a PUFA polyketide synthase system and uses thereof |
US68916705P | 2005-06-10 | 2005-06-10 | |
US78461606P | 2006-03-21 | 2006-03-21 | |
US11/452,138 US7271315B2 (en) | 1999-01-14 | 2006-06-12 | PUFA polyketide synthase systems and uses thereof |
US11/770,474 US7897844B2 (en) | 1999-01-14 | 2007-06-28 | PUFA polyketide synthase systems and uses thereof |
US13/008,756 US20110250342A1 (en) | 1999-01-14 | 2011-01-18 | PUFA Polyketide Synthase Systems and Uses Thereof |
US13/771,840 US20140053299A1 (en) | 1999-01-14 | 2013-02-20 | PUFA Polyketide Synthase Systems and Uses Thereof |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/008,756 Continuation US20110250342A1 (en) | 1999-01-14 | 2011-01-18 | PUFA Polyketide Synthase Systems and Uses Thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
US20140053299A1 true US20140053299A1 (en) | 2014-02-20 |
Family
ID=22871074
Family Applications (6)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/231,899 Expired - Lifetime US6566583B1 (en) | 1997-06-04 | 1999-01-14 | Schizochytrium PKS genes |
US10/331,061 Expired - Lifetime US7214853B2 (en) | 1997-06-04 | 2002-12-27 | Schizochytrium PKS genes |
US11/674,574 Expired - Fee Related US8829274B2 (en) | 1997-06-04 | 2007-02-13 | Schizochytrium PKS genes |
US11/770,474 Expired - Fee Related US7897844B2 (en) | 1999-01-14 | 2007-06-28 | PUFA polyketide synthase systems and uses thereof |
US13/008,756 Abandoned US20110250342A1 (en) | 1999-01-14 | 2011-01-18 | PUFA Polyketide Synthase Systems and Uses Thereof |
US13/771,840 Abandoned US20140053299A1 (en) | 1999-01-14 | 2013-02-20 | PUFA Polyketide Synthase Systems and Uses Thereof |
Family Applications Before (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/231,899 Expired - Lifetime US6566583B1 (en) | 1997-06-04 | 1999-01-14 | Schizochytrium PKS genes |
US10/331,061 Expired - Lifetime US7214853B2 (en) | 1997-06-04 | 2002-12-27 | Schizochytrium PKS genes |
US11/674,574 Expired - Fee Related US8829274B2 (en) | 1997-06-04 | 2007-02-13 | Schizochytrium PKS genes |
US11/770,474 Expired - Fee Related US7897844B2 (en) | 1999-01-14 | 2007-06-28 | PUFA polyketide synthase systems and uses thereof |
US13/008,756 Abandoned US20110250342A1 (en) | 1999-01-14 | 2011-01-18 | PUFA Polyketide Synthase Systems and Uses Thereof |
Country Status (10)
Country | Link |
---|---|
US (6) | US6566583B1 (en) |
EP (4) | EP2333073A1 (en) |
JP (3) | JP2002534123A (en) |
BR (1) | BR0008760A (en) |
CA (1) | CA2359629C (en) |
ES (1) | ES2427017T3 (en) |
HK (2) | HK1041292A1 (en) |
IL (4) | IL144269A0 (en) |
MX (1) | MX252154B (en) |
WO (1) | WO2000042195A2 (en) |
Families Citing this family (78)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6566583B1 (en) * | 1997-06-04 | 2003-05-20 | Daniel Facciotti | Schizochytrium PKS genes |
US7211418B2 (en) * | 1999-01-14 | 2007-05-01 | Martek Biosciences Corporation | PUFA polyketide synthase systems and uses thereof |
US7247461B2 (en) * | 1999-01-14 | 2007-07-24 | Martek Biosciences Corporation | Nucleic acid molecule encoding ORFA of a PUFA polyketide synthase system and uses thereof |
US20070244192A1 (en) * | 1999-01-14 | 2007-10-18 | Martek Biosciences Corporation | Plant seed oils containing polyunsaturated fatty acids |
US8003772B2 (en) * | 1999-01-14 | 2011-08-23 | Martek Biosciences Corporation | Chimeric PUFA polyketide synthase systems and uses thereof |
US7217856B2 (en) * | 1999-01-14 | 2007-05-15 | Martek Biosciences Corporation | PUFA polyketide synthase systems and uses thereof |
US7271315B2 (en) * | 1999-01-14 | 2007-09-18 | Martek Biosciences Corporation | PUFA polyketide synthase systems and uses thereof |
MX232239B (en) | 2000-01-19 | 2005-11-18 | Omegatech Inc | Solventless extraction process. |
RU2326171C2 (en) | 2000-01-28 | 2008-06-10 | Мартек Биосайенсис Корпорейшн | Method for producing lipides containing polyunsaturated fatty acids (variants) and method for cultivating microorganisms producing said lipids |
TWI337619B (en) | 2001-04-16 | 2011-02-21 | Martek Biosciences Corp | Pufa polyketide synthase systems and uses thereof |
US20060165735A1 (en) | 2002-06-18 | 2006-07-27 | Abril Jesus R | Stable emulsions of oils in aqueous solutions and methods for producing same |
ES2525212T3 (en) | 2002-12-19 | 2014-12-19 | University Of Bristol | Novel procedure for the production of polyunsaturated fatty acids |
ATE517984T1 (en) | 2003-02-27 | 2011-08-15 | Basf Plant Science Gmbh | METHOD FOR PRODUCING POLYUNSATURATED FATTY ACIDS |
CA2518197A1 (en) * | 2003-03-07 | 2004-09-23 | Advanced Bionutrition Corporation | Feed formulation for terrestrial and aquatic animals |
AU2004225485B2 (en) * | 2003-03-26 | 2008-08-21 | Dsm Ip Assets B.V. | PUFA polyketide synthase systems and uses thereof |
CA2868312A1 (en) | 2003-03-31 | 2004-10-14 | University Of Bristol | Novel plant acyltransferases specific for long-chained, multiply unsaturated fatty acids |
US7208590B2 (en) * | 2003-07-15 | 2007-04-24 | Abbott Laboratories | Genes involved in polyketide synthase pathways and uses thereof |
US7646815B2 (en) * | 2003-07-15 | 2010-01-12 | Lsi Corporation | Intra estimation chroma mode 0 sub-block dependent prediction |
DE102004017369A1 (en) * | 2004-04-08 | 2005-11-03 | Nutrinova Nutrition Specialties & Food Ingredients Gmbh | Screening method for the identification of PUFA-PKS in samples |
DE102004017370A1 (en) * | 2004-04-08 | 2005-10-27 | Nutrinova Nutrition Specialties & Food Ingredients Gmbh | PUFA-PKS Genes from Ulkenia |
ES2715640T3 (en) | 2004-04-22 | 2019-06-05 | Commw Scient Ind Res Org | Synthesis of long chain polyunsaturated fatty acids by recombinant cells |
CN102559364B (en) | 2004-04-22 | 2016-08-17 | 联邦科学技术研究组织 | Use recombinant cell synthesis of long-chain polyunsaturated fatty acids |
EP1871883A1 (en) * | 2005-03-02 | 2008-01-02 | Metanomics GmbH | Process for the production of fine chemicals |
PL2447356T3 (en) | 2005-06-07 | 2016-10-31 | Eukaryotic microorganisms for producing lipids and antioxidants | |
SG163551A1 (en) | 2005-07-01 | 2010-08-30 | Martek Biosciences Corp | Polyunsaturated fatty acid-containing oil product and uses and production thereof |
JP4943678B2 (en) * | 2005-08-01 | 2012-05-30 | 国立大学法人北海道大学 | Method for modifying fatty acid composition of cells and use thereof |
US8795744B2 (en) | 2005-11-18 | 2014-08-05 | Commonwealth Scientific And Industrial Research Organisation | Feedstuffs for aquaculture comprising stearidonic acid |
MX339812B (en) * | 2006-03-15 | 2016-06-08 | Dsm Ip Assets B V * | Polyunsaturated fatty acid production in heterologous organisms using pufa polyketide synthase systems. |
AU2007238131B2 (en) * | 2006-04-11 | 2010-09-09 | Dsm Ip Assets B.V. | Food products comprising long chain polyunsaturated fatty acids and methods for preparing the same |
PL2034973T5 (en) | 2006-06-22 | 2023-02-20 | Dsm Ip Assets B.V. | Encapsulated labile compound compositions and methods of making the same |
CA2659603C (en) * | 2006-08-01 | 2017-03-07 | Ocean Nutrition Canada Limited | Oil producing microbes and methods of modification thereof |
EP2059588A4 (en) | 2006-08-29 | 2010-07-28 | Commw Scient Ind Res Org | Synthesis of fatty acids |
EP2430926A3 (en) | 2006-08-29 | 2012-06-13 | Martek Biosciences Corporation | Use of DPA(n-6) oils in infant formula |
BRPI0806695A2 (en) * | 2007-01-12 | 2014-06-03 | Univ Colorado | COMPOSITIONS AND METHODS FOR INCREASING TOLERANCE TO THE PRODUCTION OF CHEMICAL PRODUCTS PRODUCED BY MICROORGANISMS |
US20090023808A1 (en) * | 2007-06-29 | 2009-01-22 | Martek Biosciences Corporation | Production and Purification of Esters of Polyunsaturated Fatty Acids |
AU2008282862B2 (en) | 2007-07-26 | 2014-07-31 | Pacific Biosciences Of California, Inc. | Molecular redundant sequencing |
EP2214481B1 (en) | 2007-10-15 | 2019-05-01 | United Animal Health, Inc. | Method for increasing performance of offspring |
US8048624B1 (en) | 2007-12-04 | 2011-11-01 | Opx Biotechnologies, Inc. | Compositions and methods for 3-hydroxypropionate bio-production from biomass |
US20110076379A1 (en) * | 2008-06-13 | 2011-03-31 | Means Michael M | High omega saturated fat free meat products |
JP6068800B2 (en) | 2008-11-18 | 2017-01-25 | コモンウェルス サイエンティフィック アンド インダストリアル リサーチ オーガナイゼーション | Enzymes and methods for producing omega-3 fatty acids |
BRPI1006435B1 (en) | 2009-03-19 | 2021-01-19 | Dsm Ip Assets B.V. | recombinant nucleic acid molecule and microbial host cell |
EP2821492A3 (en) | 2009-05-13 | 2015-04-08 | BASF Plant Science Company GmbH | Acyltransferases and uses thereof in fatty acid production |
US8809027B1 (en) | 2009-09-27 | 2014-08-19 | Opx Biotechnologies, Inc. | Genetically modified organisms for increased microbial production of 3-hydroxypropionic acid involving an oxaloacetate alpha-decarboxylase |
BR112012006801A2 (en) | 2009-09-27 | 2018-04-10 | Opx Biotechnologies Inc | method for production of 3-hydroxypropionic acid and other products |
US20110125118A1 (en) * | 2009-11-20 | 2011-05-26 | Opx Biotechnologies, Inc. | Production of an Organic Acid and/or Related Chemicals |
IN2012DN06617A (en) * | 2010-01-27 | 2015-10-23 | Opx Biotechnologies Inc | |
TW201144442A (en) | 2010-05-17 | 2011-12-16 | Dow Agrosciences Llc | Production of DHA and other LC-PUFAs in plants |
US10392578B2 (en) * | 2010-06-01 | 2019-08-27 | Dsm Ip Assets B.V. | Extraction of lipid from cells and products therefrom |
NO2585603T3 (en) | 2010-06-25 | 2018-05-19 | ||
TW201307553A (en) | 2011-07-26 | 2013-02-16 | Dow Agrosciences Llc | Production of DHA and other LC-PUFAs in plants |
CN104853596B (en) | 2012-06-15 | 2018-10-09 | 联邦科学技术研究组织 | Long-chain polyunsaturated fatty acid is generated in plant cell |
US9879234B2 (en) | 2012-08-03 | 2018-01-30 | Basf Plant Science Company Gmbh | Enzymes, enzyme components and uses thereof |
KR20150040359A (en) | 2012-08-10 | 2015-04-14 | 오피엑스 바이오테크놀로지스, 인크. | Microorganisms and methods for the production of fatty acids and fatty acid derived products |
DK2970926T3 (en) | 2013-03-13 | 2018-04-16 | Dsm Nutritional Products Ag | GENMANIPULATION OF MICROorganisms |
BR112015023583A2 (en) | 2013-03-15 | 2017-08-22 | Dow Global Technologies Llc | METHOD FOR 3-HP VAPORIZATION FROM A SOLUTION, VAPOR, CONDENSED 3-HP SOLUTION, COMPOSITION, DOWNSTREAM CHEMICAL, CONSUMER PRODUCT, SYSTEM FOR 3-HP PURIFICATION, METHOD FOR PRODUCING A 3-HP SOLUTION, METHOD FOR DECOMPOSING 3-HP AMMONIA (A3-HP) AND METHOD FOR DECOMPOSING A3-HP |
WO2014146026A1 (en) | 2013-03-15 | 2014-09-18 | Opx Biotechnologies, Inc. | Bioproduction of chemicals |
US11408013B2 (en) | 2013-07-19 | 2022-08-09 | Cargill, Incorporated | Microorganisms and methods for the production of fatty acids and fatty acid derived products |
EP3022310B1 (en) | 2013-07-19 | 2019-10-16 | Cargill, Incorporated | Microorganisms and methods for the production of fatty acids and fatty acid derived products |
TW201525136A (en) | 2013-11-26 | 2015-07-01 | Dow Agrosciences Llc | Production of omega-3 long-chain polyunsaturated fatty acids in oilseed crops by a thraustochytrid PUFA synthase |
KR102386838B1 (en) | 2013-12-18 | 2022-04-15 | 커먼웰쓰 사이언티픽 앤 인더스트리알 리서치 오거니제이션 | Lipid comprising long chain polyunsaturated fatty acids |
WO2015095696A1 (en) | 2013-12-20 | 2015-06-25 | Dsm Ip Assets B.V. | Processes for obtaining microbial oil from microbial cells |
WO2015095688A1 (en) | 2013-12-20 | 2015-06-25 | Dsm Ip Assets B.V. | Processes for obtaining microbial oil from microbial cells |
WO2015095694A1 (en) | 2013-12-20 | 2015-06-25 | Dsm Ip Assets B.V. | Processes for obtaining microbial oil from microbial cells |
WO2015095690A2 (en) | 2013-12-20 | 2015-06-25 | Dsm Ip Assets B.V. | Processes for obtaining microbial oil from microbial cells |
MX2016008228A (en) | 2013-12-20 | 2016-11-28 | Dsm Ip Assets Bv | Processes for obtaining microbial oil from microbial cells. |
EP3099782B1 (en) | 2014-01-28 | 2019-03-20 | DSM IP Assets B.V. | Factors for the production and accumulation of polyunsaturated fatty acids (pufas) derived from pufa synthases |
US10131897B2 (en) | 2014-03-03 | 2018-11-20 | Synthetics Genomics, Inc. | Molecules associated with fatty acid biosynthetic pathways and uses thereof |
CN105219789B (en) | 2014-06-27 | 2023-04-07 | 联邦科学技术研究组织 | Extracted plant lipids comprising docosapentaenoic acid |
EP2993228B1 (en) | 2014-09-02 | 2019-10-09 | Cargill, Incorporated | Production of fatty acid esters |
AU2016228545A1 (en) * | 2015-03-12 | 2017-09-21 | Synthetic Genomics. Inc. | Microorganisms for fatty acid production using elongase and desaturase enzymes |
JP6966040B2 (en) | 2016-05-12 | 2021-11-10 | ディーエスエム アイピー アセッツ ビー.ブイ.Dsm Ip Assets B.V. | How to Increase Omega-3 Polyunsaturated Fatty Acid Production in Microalgae |
AR109245A1 (en) | 2016-05-12 | 2018-11-14 | Basf Plant Science Co Gmbh | METHODS TO OPTIMIZE THE PRODUCTION OF METABOLITES IN GENETICALLY MODIFIED PLANTS AND TO PROCESS THESE PLANTS |
US11345938B2 (en) | 2017-02-02 | 2022-05-31 | Cargill, Incorporated | Genetically modified cells that produce C6-C10 fatty acid derivatives |
WO2018205165A1 (en) * | 2017-05-10 | 2018-11-15 | 南京工业大学 | Schizochytrium limacinum strain, building method therefor and application thereof |
WO2019217572A1 (en) | 2018-05-08 | 2019-11-14 | Sanford Burnham Prebys Medical Discovery Institute | Role of pvt1 in the diagnosis and treatment of myc-driven cancer |
US20210309987A1 (en) * | 2018-08-10 | 2021-10-07 | Kyowa Hakko Bio Co., Ltd. | Microorganism producing polyunsaturated fatty acid and method for producing polyunsaturated fatty acid |
CN112601808A (en) * | 2018-08-10 | 2021-04-02 | 协和发酵生化株式会社 | Microorganism producing eicosapentaenoic acid and process for producing eicosapentaenoic acid |
US20230058305A1 (en) * | 2019-09-19 | 2023-02-23 | Sanford Burnham Prebys Medical Discovery Institute | Methods and compositions for treating myc-driven cancers |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030134400A1 (en) * | 2001-05-04 | 2003-07-17 | Pradip Mukerji | Delta4-desaturase genes and uses thereof |
Family Cites Families (71)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5034322A (en) | 1983-01-17 | 1991-07-23 | Monsanto Company | Chimeric genes suitable for expression in plant cells |
US4792523A (en) | 1984-08-31 | 1988-12-20 | Cetus Corporation | 3'Expression enhancing fragments and method |
US4743548A (en) | 1984-09-25 | 1988-05-10 | Calgene, Inc. | Plant cell microinjection technique |
US4943674A (en) | 1987-05-26 | 1990-07-24 | Calgene, Inc. | Fruit specific transcriptional factors |
US5420034A (en) | 1986-07-31 | 1995-05-30 | Calgene, Inc. | Seed-specific transcriptional regulation |
GB2178752B (en) | 1985-07-12 | 1989-10-11 | Unilever Plc | Substitute milk fat |
US4940835A (en) | 1985-10-29 | 1990-07-10 | Monsanto Company | Glyphosate-resistant plants |
US5068193A (en) | 1985-11-06 | 1991-11-26 | Calgene, Inc. | Novel method and compositions for introducing alien DNA in vivo |
US4795855A (en) | 1985-11-14 | 1989-01-03 | Joanne Fillatti | Transformation and foreign gene expression with woody species |
US5188958A (en) | 1986-05-29 | 1993-02-23 | Calgene, Inc. | Transformation and foreign gene expression in brassica species |
US5565347A (en) | 1986-06-10 | 1996-10-15 | Calgene, Inc. | Transformation and foreign gene expression with plant species |
US5246841A (en) | 1986-12-26 | 1993-09-21 | Sagami Chemical Research Center | Microbial process for production of eicosapentaenoic acid |
AU612133B2 (en) | 1987-02-20 | 1991-07-04 | Natinco Nv | Production of proteins in active forms |
DE3719097C1 (en) | 1987-06-06 | 1988-06-09 | Fratzer Uwe | Medicament containing eicosapentaenoic acid and docosahexaenoic acid as unsaturated fatty acids as well as vitamin E. |
US5011770A (en) | 1987-09-04 | 1991-04-30 | Molecular Devices, Inc. | DNA detection method |
US4999422A (en) | 1988-04-15 | 1991-03-12 | Biogen, N.V. | Continuous method of refolding proteins |
US5565346A (en) | 1988-07-27 | 1996-10-15 | Calgene, Inc. | Transformation and regeneration system for legumes |
US5130242A (en) * | 1988-09-07 | 1992-07-14 | Phycotech, Inc. | Process for the heterotrophic production of microbial products with high concentrations of omega-3 highly unsaturated fatty acids |
US5340742A (en) * | 1988-09-07 | 1994-08-23 | Omegatech Inc. | Process for growing thraustochytrium and schizochytrium using non-chloride salts to produce a microfloral biomass having omega-3-highly unsaturated fatty acids |
US5162215A (en) | 1988-09-22 | 1992-11-10 | Amgen Inc. | Method of gene transfer into chickens and other avian species |
BR9007159A (en) | 1989-02-24 | 1991-12-10 | Monsanto Co | SYNTHETIC GENES OF PLANTS AND PROCESS FOR THE PREPARATION OF THE SAME |
US5106739A (en) | 1989-04-18 | 1992-04-21 | Calgene, Inc. | CaMv 355 enhanced mannopine synthase promoter and method for using same |
US5175095A (en) | 1989-07-19 | 1992-12-29 | Calgene, Inc. | Ovary-tissue transcriptional factors |
US5639790A (en) | 1991-05-21 | 1997-06-17 | Calgene, Inc. | Plant medium-chain thioesterases |
EP0594868A4 (en) | 1992-05-15 | 1997-01-02 | Sagami Chem Res | Gene which codes for eicosapentaenoic acid synthetase group and process for producing eicosapentaenoic acid |
US5683898A (en) | 1992-05-15 | 1997-11-04 | Sagami Chemical Research Center | Gene coding for eicosapentaenoic acid synthesizing enzymes and process for production of eicosapentaenoic acid |
US5798259A (en) * | 1992-05-15 | 1998-08-25 | Sagami Chemical Research Center | Gene coding for eicosapentaenoic acid synthesizing enzymes and process for production of eicosapentaenoic acid |
US5310242A (en) * | 1992-09-28 | 1994-05-10 | Golder Kimberly A | Portable infant seat |
WO1994019477A1 (en) | 1993-02-26 | 1994-09-01 | Calgene Inc. | Geminivirus-based gene expression system |
DE4323727A1 (en) * | 1993-07-15 | 1995-03-09 | Boehringer Mannheim Gmbh | Method of identifying human and animal cells capable of unlimited proliferation or tumor formation |
US5672491A (en) | 1993-09-20 | 1997-09-30 | The Leland Stanford Junior University | Recombinant production of novel polyketides |
CA2184687A1 (en) | 1994-03-09 | 1995-09-14 | Pedro Antonio Prieto | Humanized milk |
DE69525456T2 (en) | 1994-08-24 | 2002-10-10 | Meiji Milk Products Co. Ltd., Tokio/Tokyo | METHOD FOR CULTIVATING BIRD CELLS |
FR2726003B1 (en) | 1994-10-21 | 2002-10-18 | Agronomique Inst Nat Rech | CULTURE MEDIUM OF AVIAN TOTIPOTENT EMBRYONIC CELLS, METHOD FOR CULTURING THESE CELLS, AND AVIAN TOTIPOTENT EMBRYONIC CELLS |
WO1996021735A1 (en) | 1995-01-13 | 1996-07-18 | Sagami Chemical Research Center | Genes coding for biosynthetic enzyme group for icosapentaenoic acid and process for producing icosapentaenoic acid |
DE69637953D1 (en) * | 1995-04-17 | 2009-07-30 | Nat Inst Of Advanced Ind Scien | HIGHLY UNSATURATED FATTY ACID-PRODUCING MICRO-ORGANISMS AND METHOD FOR PRODUCING HIGH-UNSATURATED FATTY ACIDS THROUGH THE USE OF THESE MICRO-ORGANISMS |
GB9508023D0 (en) | 1995-04-20 | 1995-06-07 | Scotia Holdings Plc | Fatty acid derivatives |
AU727694B2 (en) * | 1996-07-10 | 2000-12-21 | Sagami Chemical Research Center | Process for producing icosapentaenoic acid by genetic recombination |
US6033883A (en) * | 1996-12-18 | 2000-03-07 | Kosan Biosciences, Inc. | Production of polyketides in bacteria and yeast |
AU720677B2 (en) * | 1997-04-11 | 2000-06-08 | Abbott Laboratories | Methods and compositions for synthesis of long chain polyunsaturated fatty acids in plants |
US6566583B1 (en) * | 1997-06-04 | 2003-05-20 | Daniel Facciotti | Schizochytrium PKS genes |
IL133272A0 (en) * | 1997-06-04 | 2001-04-30 | Calgene Llc | Production of polyunsaturated fatty acid by expression of polyketide-like synthesis genes in plants |
US6677145B2 (en) * | 1998-09-02 | 2004-01-13 | Abbott Laboratories | Elongase genes and uses thereof |
US7271315B2 (en) * | 1999-01-14 | 2007-09-18 | Martek Biosciences Corporation | PUFA polyketide synthase systems and uses thereof |
US7247461B2 (en) * | 1999-01-14 | 2007-07-24 | Martek Biosciences Corporation | Nucleic acid molecule encoding ORFA of a PUFA polyketide synthase system and uses thereof |
US8003772B2 (en) * | 1999-01-14 | 2011-08-23 | Martek Biosciences Corporation | Chimeric PUFA polyketide synthase systems and uses thereof |
US7217856B2 (en) * | 1999-01-14 | 2007-05-15 | Martek Biosciences Corporation | PUFA polyketide synthase systems and uses thereof |
US7211418B2 (en) * | 1999-01-14 | 2007-05-01 | Martek Biosciences Corporation | PUFA polyketide synthase systems and uses thereof |
US20070244192A1 (en) * | 1999-01-14 | 2007-10-18 | Martek Biosciences Corporation | Plant seed oils containing polyunsaturated fatty acids |
AU2001268296B2 (en) * | 2000-06-08 | 2006-05-25 | Miami University | Fatty acid elongase 3-ketoacyl coa synthase polypeptides |
US20040010817A1 (en) * | 2000-07-21 | 2004-01-15 | Washington State University Research Foundation | Plant acyl-CoA synthetases |
JP4071622B2 (en) * | 2000-09-28 | 2008-04-02 | バイオリジナル フード アンド サイエンス コーポレーション | FAD4, FAD5, FAD5-2 and FAD6 which are members of the fatty acid desaturase family, and methods for their use |
TWI377253B (en) * | 2001-04-16 | 2012-11-21 | Martek Biosciences Corp | Product and process for transformation of thraustochytriales microorganisms |
TWI337619B (en) | 2001-04-16 | 2011-02-21 | Martek Biosciences Corp | Pufa polyketide synthase systems and uses thereof |
EP2255668A3 (en) * | 2001-05-14 | 2012-04-04 | Martek Biosciences Corporation | Production and Use of a Polar Lipid-Rich Fraction Containing Omega-3 and/or Omega-6 Highly Unsaturated Fatty Acids from Microbes, Genetically Modified Plant Seeds and Marine Organisms |
US20040005672A1 (en) * | 2002-02-22 | 2004-01-08 | Santi Daniel V. | Heterologous production of polyketides |
GB2385852A (en) * | 2002-02-27 | 2003-09-03 | Rothamsted Ex Station | Delta 6-desaturases from Primulaceae |
US7705202B2 (en) * | 2002-03-16 | 2010-04-27 | The University Of York | Transgenic plants expressing enzymes involved in fatty acid biosynthesis |
US8084074B2 (en) * | 2003-02-12 | 2011-12-27 | E. I. Du Pont De Nemours And Company | Production of very long chain polyunsaturated fatty acids in oil seed plants |
US20040172682A1 (en) | 2003-02-12 | 2004-09-02 | Kinney Anthony J. | Production of very long chain polyunsaturated fatty acids in oilseed plants |
AU2004225485B2 (en) | 2003-03-26 | 2008-08-21 | Dsm Ip Assets B.V. | PUFA polyketide synthase systems and uses thereof |
CA2868312A1 (en) | 2003-03-31 | 2004-10-14 | University Of Bristol | Novel plant acyltransferases specific for long-chained, multiply unsaturated fatty acids |
US7125672B2 (en) * | 2003-05-07 | 2006-10-24 | E. I. Du Pont De Nemours And Company | Codon-optimized genes for the production of polyunsaturated fatty acids in oleaginous yeasts |
US7208590B2 (en) * | 2003-07-15 | 2007-04-24 | Abbott Laboratories | Genes involved in polyketide synthase pathways and uses thereof |
DE102004060340A1 (en) | 2004-07-16 | 2006-02-09 | Basf Plant Science Gmbh | Method for increasing the content of polyunsaturated long-chain fatty acids in transgenic organisms |
RU2007114780A (en) | 2004-09-20 | 2008-10-27 | БАСФ ПЛАНТ САЙЕНС ГмбХ (DE) | ARABIDOPSIS GENES ENCODING PROTEINS PARTICIPATING IN SUGAR AND LIPID EXCHANGE, AND WAYS OF THEIR APPLICATION |
US20080260933A1 (en) * | 2004-10-08 | 2008-10-23 | Dow Agroscience Llc | Certain Plants with "No Saturate" or Reduced Saturate Levels of Fatty Acids in Seeds, and Oil Derived from the Seeds |
JP5941606B2 (en) * | 2004-11-04 | 2016-06-29 | モンサント テクノロジー エルエルシー | High PUFA oil composition |
AR059376A1 (en) * | 2006-02-21 | 2008-03-26 | Basf Plant Science Gmbh | PROCEDURE FOR THE PRODUCTION OF POLYINSATURATED FATTY ACIDS |
MX339812B (en) * | 2006-03-15 | 2016-06-08 | Dsm Ip Assets B V * | Polyunsaturated fatty acid production in heterologous organisms using pufa polyketide synthase systems. |
EP2197289A4 (en) * | 2007-08-31 | 2013-08-28 | Dsm Ip Assets Bv | Polyunsaturated fatty acid-containing solid fat compositions and uses and production thereof |
-
1999
- 1999-01-14 US US09/231,899 patent/US6566583B1/en not_active Expired - Lifetime
-
2000
- 2000-01-14 WO PCT/US2000/000956 patent/WO2000042195A2/en active Application Filing
- 2000-01-14 BR BR0008760-2A patent/BR0008760A/en active Search and Examination
- 2000-01-14 JP JP2000593752A patent/JP2002534123A/en not_active Withdrawn
- 2000-01-14 EP EP10010710A patent/EP2333073A1/en not_active Withdrawn
- 2000-01-14 EP EP10010708A patent/EP2333071A1/en not_active Withdrawn
- 2000-01-14 EP EP00904357A patent/EP1147197A2/en not_active Ceased
- 2000-01-14 EP EP10010709.3A patent/EP2333072B1/en not_active Expired - Lifetime
- 2000-01-14 IL IL14426900A patent/IL144269A0/en unknown
- 2000-01-14 MX MXPA01007153 patent/MX252154B/en active IP Right Grant
- 2000-01-14 CA CA2359629A patent/CA2359629C/en not_active Expired - Lifetime
- 2000-01-14 ES ES10010709T patent/ES2427017T3/en not_active Expired - Lifetime
-
2001
- 2001-07-11 IL IL144269A patent/IL144269A/en not_active IP Right Cessation
-
2002
- 2002-04-23 HK HK02103037.7A patent/HK1041292A1/en unknown
- 2002-12-27 US US10/331,061 patent/US7214853B2/en not_active Expired - Lifetime
-
2007
- 2007-02-13 US US11/674,574 patent/US8829274B2/en not_active Expired - Fee Related
- 2007-06-28 US US11/770,474 patent/US7897844B2/en not_active Expired - Fee Related
- 2007-11-07 JP JP2007289187A patent/JP2008099697A/en not_active Withdrawn
-
2008
- 2008-01-03 IL IL188590A patent/IL188590A/en not_active IP Right Cessation
-
2010
- 2010-08-09 JP JP2010178345A patent/JP2010252811A/en active Pending
-
2011
- 2011-01-18 US US13/008,756 patent/US20110250342A1/en not_active Abandoned
- 2011-12-15 HK HK11113542.3A patent/HK1159166A1/en not_active IP Right Cessation
-
2013
- 2013-02-20 US US13/771,840 patent/US20140053299A1/en not_active Abandoned
- 2013-02-21 IL IL224871A patent/IL224871A/en not_active IP Right Cessation
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030134400A1 (en) * | 2001-05-04 | 2003-07-17 | Pradip Mukerji | Delta4-desaturase genes and uses thereof |
Also Published As
Publication number | Publication date |
---|---|
CA2359629C (en) | 2015-02-24 |
US7214853B2 (en) | 2007-05-08 |
WO2000042195B1 (en) | 2000-11-09 |
WO2000042195A3 (en) | 2000-09-28 |
US20080148433A1 (en) | 2008-06-19 |
US20090098622A1 (en) | 2009-04-16 |
HK1159166A1 (en) | 2012-07-27 |
EP1147197A2 (en) | 2001-10-24 |
IL144269A0 (en) | 2002-05-23 |
IL188590A (en) | 2013-06-27 |
IL144269A (en) | 2010-04-15 |
JP2008099697A (en) | 2008-05-01 |
US8829274B2 (en) | 2014-09-09 |
EP2333072A1 (en) | 2011-06-15 |
EP2333073A1 (en) | 2011-06-15 |
US20030101486A1 (en) | 2003-05-29 |
JP2002534123A (en) | 2002-10-15 |
IL188590A0 (en) | 2008-04-13 |
BR0008760A (en) | 2002-10-08 |
MXPA01007153A (en) | 2003-07-21 |
US7897844B2 (en) | 2011-03-01 |
ES2427017T3 (en) | 2013-10-28 |
IL224871A (en) | 2017-11-30 |
US6566583B1 (en) | 2003-05-20 |
CA2359629A1 (en) | 2000-07-20 |
US20110250342A1 (en) | 2011-10-13 |
EP2333072B1 (en) | 2013-06-19 |
EP2333071A1 (en) | 2011-06-15 |
WO2000042195A2 (en) | 2000-07-20 |
MX252154B (en) | 2007-12-04 |
JP2010252811A (en) | 2010-11-11 |
HK1041292A1 (en) | 2002-07-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7897844B2 (en) | PUFA polyketide synthase systems and uses thereof | |
US7271315B2 (en) | PUFA polyketide synthase systems and uses thereof | |
US20080005811A1 (en) | Pufa polyketide synthase systems and uses thereof | |
US8309796B2 (en) | Chimeric PUFA polyketide synthase systems and uses thereof | |
US20070220634A1 (en) | Plant seed oils containing polyunsaturated fatty acids | |
US20070244192A1 (en) | Plant seed oils containing polyunsaturated fatty acids | |
US7919320B2 (en) | PUFA polyketide synthase systems and uses thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: MARTEK BIOSCIENCES CORPORATION, MARYLAND Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:METZ, JAMES G.;FLATT, JAMES H.;KUNER, JERRY M.;AND OTHERS;SIGNING DATES FROM 20061212 TO 20061214;REEL/FRAME:031535/0858 Owner name: DSM IP ASSETS B.V., NETHERLANDS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MARTEK BIOSCIENCES CORPORATION;REEL/FRAME:031535/0897 Effective date: 20120625 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |