Abstract
The codeword design problem is an important problem in DNA computing and its applications. Several theoretical analyses as well as practical solutions for short oligonucleotides (up to 20-mers) have been generated recently. These solutions have, in turn, suggested new applications to DNA-based indexing and natural language processing, in addition to the obvious applications to the problems of reliability and scalability that generated them. Here we continue the exploration of this type of DNA-based indexing for biological applications and show that DNA noncrosshybridizing (nxh) sets can be successfully applied to infer ab initio phylogenetic trees by providing a way to measure distances among different genomes indexed by sets of short oligonucleotides selected so as to minimize crosshybridization. These phylogenies are solidly established and well accepted in biology. The new technique is much more effective in terms of signal-to-noise ratio, cost and time than current methods. Second, it can be scaled up to newly available universal DNA chips readily available both in vitro and in silico. In particular, we show how a recently obtained such set of nxh 16-mers can be used as a universal coordinate system in DNA spaces to characterize very large groups (families, genera, and even phylla) of organisms on a uniform reference system, a veritable and comprehensive “Atlas of Life”, as it is or as it could be on earth.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Adleman, L.: Molecular computation of solutions of combinatorial problems. Science 266, 1021–1024 (1994)
Blain, D., Garzon, M.H., Shin, S.Y., Zhang, B.T., Kashiwamura, S., Yamamoto, M., Kameda, A., Ohuchi, A.: Development, Evaluation and Benchmarking of Simulation Software for Biomolecule-based Computing. J. of Natural Computing 3(4), 427–442 (2004)
Bi, H., Chen, J., Deaton, R., Garzon, M., Rubin, H., Wood, D.H.: A PCR Protocol for in Vitro Selection of Non-Crosshybridizing Oligonucleotides. J. of Natural Computing 2(3), 417–426 (2003)
Bobba, K.C., Neel, A.J., Phan, V., Garzon, M.H.: “Reasoning” and “Talking” DNA: Can DNA understand english? In: Mao, C., Yokomori, T. (eds.) DNA12. LNCS, vol. 4287, pp. 337–349. Springer, Heidelberg (2006)
Chen, J., Deaton, R., Garzon, M., Wood, D.H., Bi, H., Carpenter, D., Wang, Y.Z.: Characterization of Non-Crosshybridizing DNA Oligonucleotides Manufactured in vitro. J. of Natural Computing 5(2), 165–181 (2006)
DasGupta, K.M., Konwar, I.I., Shvartsman, A.A.: Highly scalable algorithms for robust string barcoding. Int. J. Bioinformatics Research and Applications 1(2) (2005)
Deaton, J., Chen, J., Garzon, M., Wood, D.H.: Test Tube Selection of Large Independent Sets of DNA Oligonucleotides R, pp. 152–166. World Publishing Co., Singapore (2006); (Volume dedicated to Ned Seeman on occasion of his 60th birthday)
8 M.H. Garzon. V. Phan, A. Neel (2009). Optimal Codes for Computing and Self-Assembly, Int. J. of Nanotechnology and Molecular Computing 1, 1-17.
Garzon, M.H., Yan, H. (eds.): DNA 2007. LNCS, vol. 4848. Springer, Heidelberg (2008)
Garzon, M.H., Phan, V., Roy, S., Neel, A.J.: In Search of Optimal Codes for DNA-Computing. In: Mao, C., Yokomori, T. (eds.) DNA12. LNCS, vol. 4287, pp. 143–156. Springer, Heidelberg (2006)
Garzon, M.H., Bobba, K., Phan, V., Kontham, R.: Sensitivity and Capacity of Microarray Encodings. In: Carbone, A., Pierce, N.A. (eds.) DNA 2005. LNCS, vol. 3892, pp. 81–95. Springer, Heidelberg (2006)
Garzon, M.H., Blain, D., Neel, A.J.: Virtual Test Tubes for Biomolecular Computing. J. of Natural Computing 3(4), 461–477 (2004)
Garzon, M., Bobba, K., Hyde, B.: Digital information encoding on DNA. In: Jonoska, N., Păun, G., Rozenberg, G. (eds.) Aspects of Molecular Computing. LNCS, vol. 2950, pp. 152–166. Springer, Heidelberg (2003)
Hennig, W.: Grundzüge einer Theorie der Phylogenetischen Systematik English revision, Phylogenetic Systematics (tr. D. Davis and R. Zangerl). Univ. of Illinois Press, Urbana 1966, reprinted 1979 (1950)
Henz, S.R., Huson, D.H., Auch, A.F., Nieselt-Struwe, K., Schuster, S.C.: Whole-genome prokaryotic phylogeny. Bioinformatics 21(10), 2329–2335 (2005)
Liu, T.T., Lee, R.E.B., Barker, K.S., Lee, R.E., Wei, L., Homayouni, R., Rogers, P.D.: Genome-wide expression profiling of the response to azole, polyene, echinocandin, and pyrimidine antifungal agents in Candida albicans. Antimicrobial Agents and Chemotherapy 49(6), 2226–2236 (2005)
Margulis, L.: Symbiosis in Cell Evolution, 2nd edn. Freeman, New York (1993)
Neel, A.J., Garzon, M.H.: DNA-based Memories: A Survey. Studies in Computational Intelligence, vol. 113, pp. 259–275. Springer, Heidelberg (2008)
Neel, A., Garzon, M.: Semantic Retrieval in DNA-Based Memories with Gibbs Energy Models. Biotechnology Progress 22(1), 86–90 (2006)
Qiu, Q., Mukre, P., Bishop, M., Bruns, D., Wu, Q.: Hardware Accelerator for Thermodynamic Constrained DNA Code Generation. In: Garzon, Y. (ed.), pp. 201–210 (2008)
Reif, J.H., LaBean, T.M., Pirrung, M., Rana, V.S., Guo, B., Kingsfor, C., Wickman, G.S.: Experimental Construction of Very-Large Scale Databases with Associative Search Capability. In: Jonoska, N., Seeman, N.C. (eds.) DNA 2001. LNCS, vol. 2340, p. 231. Springer, Heidelberg (2002)
Seeman, N.: DNA in a material world. Nature 421, 427–431 (2003)
Stekel, D.: Microarray Bioinformatics. Cambridge University Press, Cambridge (2003)
Tulpan, D., Andronescu, M., Chang, S.B., Shortreed, M.R., Condon, A., Hoos, H.H., Smith, L.M.: Thermodynamically based DNA strand design. Nucleic Acids Res. 33(15), 4951–4964 (2005)
Watkins, N.E., SantaLucia Jr., J.: Nearest-neighbor thermodynamics of deoxyinosine pairs in DNA duplexes. Nucleic Acids Research 33(19), 6258–6267 (2005)
Winfree, E., Liu, F., Wenzler, L.A., Seeman, N.C.: Design and self-assembly of two-dimensional DNA crystals. Nature 394, 539–544 (1998)
Woese, C., Fox, G.: Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proc. Natl. Acad. Sci. USA 74, 5088–5090 (1977)
Wong, T.Y., Fernandes, S., Sankhon, N., Leong, P.P., Kuo, J., Liu, J.K.: On the role of premature stop codons in bacterial evolution. J. Bacteriology (in press, 2009)
Zhou, F., Olman, V., Xu, Y.: Barcodes for Genomes and Applications. Bioinformatics 9, 546 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Garzon, M.H., Wong, TY., Phan, V. (2009). DNA Chips for Species Identification and Biological Phylogenies. In: Deaton, R., Suyama, A. (eds) DNA Computing and Molecular Programming. DNA 2009. Lecture Notes in Computer Science, vol 5877. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10604-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-10604-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10603-3
Online ISBN: 978-3-642-10604-0
eBook Packages: Computer ScienceComputer Science (R0)