Abstract
Bioinformatics applications are naturally distributed, due to distribution of involved data sets, experimental data and biological databases. They require high computing power, owing to the large size of data sets and the complexity of basic computations, may access heterogeneous data, where heterogeneity is in data format, access policy, distribution, etc., and require a secure infrastructure, because they could access private data owned by different organizations. The Problem Solving Environment (PSE) is an approach and a technology that can fulfil such bioinformatics requirements. The PSE can be used for the definition and composition of complex applications, hiding programming and configuration details to the user that can concentrate only on the specific problem. Moreover, Grids can be used for building geographically distributed collaborative problem solving environments and Grid aware PSEs can search and use dispersed high performance computing, networking, and data resources. In this work, the PSE solution has been chosen as the integration platform of bioinformatics tools and data sources. In particular an experiment of multiple sequence alignment on large scale, supported by the LIBI PSE, is presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bairoch, A., Apweiler, R., Wu, C.H., Barker, W.C., Boeckmann, B., Ferro, S., et al.: The Universal Protein Resource (UniProt). Nucleic Acids Res. 33, 154–159 (2005), http://www.uniprot.org
Rost, B., Sander, C.: Progress of 1D protein structure prediction at last. Proteins 23, 295–300 (1995)
Van der Spoel, D., Lindahl, E., Hess, B., Groenhof, G., Mark, A.E., Berendsen, H.J.C.: GROMACS: Fast, flexible, and free. J. Comput. Chem. 26, 1701–1718 (2005)
Erwin, D.W., Snelling, D.: UNICORE: A Grid Computing Environment. In: Sakellariou, R., Keane, J.A., Gurd, J.R., Freeman, L. (eds.) Euro-Par 2001. LNCS, vol. 2150, Springer, Heidelberg (2001)
Houstis, E., Gallopoulos, E., Bramley, R., Rice, J.: Problem-Solving Environments for Computational Science. IEEE Comput. Sci. Eng. 4(3), 18–21 (1997)
Berman, F., Hey, A.J.G., Fox, G.: Grid Computing: Making The Global Infrastructure a Reality. Wiley & Sons, Chichester (2003)
Ronquist, F., Huelsenbeck, J.P.: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19, 1572–1574 (2003)
Aloisio, G., Cafaro, M., Lezzi, D., Van Engelen, R.: The GSI plug-in for gSOAP: Enhanced Security, Performance, and Reliability. In: Proceedings of Information Technology Coding and Computing (ITCC 2005), vol. I, pp. 304–309. IEEE Computer Society Press, Los Alamitos (2005)
Aloisio, G., Cafaro, M., Carteni, G., Epicoco, I., Fiore, S., Lezzi, D., Mirto, M., Mocavero, S.: The Grid Resource Broker Portal. In: Concurrency and Computation: Practice and Experience, Special Issue on Grid Computing Environments (to appear, 2006)
Aloisio, G., Cafaro, M., Fiore, S., Mirto, M.: The Grid Relational Catalog Project. In: Grandinetti, L. (ed.) Advances in Parallel Computing, Grid Computing: The New Frontiers of High Performance Computing, pp. 129–155 (2005)
Grillo, G., Licciulli, F., Liuni, S., Sbisà, E., Pesole, G.: PatSearch: A program for the detection of patterns and structural motifs in nucleotide sequences. Nucleic Acids Res. 31, 3608–3612 (2003)
Von Laszewski, G., Foster, I., Gawor, J., Lane, P., Rehn, N., Russell, M.: Designing Grid-based Problem Solving Environments and Portals. In: Proceedings of International Conference on System Sciences (HICSS-34) (2001)
gLite Project: http://glite.web.cern.ch/glite/documentation/
Foster, I., Kesselman, C.: Globus Toolkit Version 4: Software for Service-Oriented Systems. In: Jin, H., Reed, D., Jiang, W. (eds.) NPC 2005. LNCS, vol. 3779, Springer, Heidelberg (2005)
Cafaro, M., Epicoco, I., Mirto, M., Lezzi, D., Aloisio, G.: The Grid Resource Broker Workflow Engine. In: IEEE Proceedings of The 6th International Conference on Grid and Cooperative Computing, Urumchi, Xinjiang, China, August 16-18, IEEE Computer Society Press, Los Alamitos (to appear, 2007)
Workflow management coalition reference model: http://www.wfmc.org/
The LIBI Project: http://www.libi.it
Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 17(25), 3389–3402 (1997), http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-psi
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mirto, M. et al. (2007). High Throughput Protein Similarity Searches in the LIBI Grid Problem Solving Environment. In: Thulasiraman, P., He, X., Xu, T.L., Denko, M.K., Thulasiram, R.K., Yang, L.T. (eds) Frontiers of High Performance Computing and Networking ISPA 2007 Workshops. ISPA 2007. Lecture Notes in Computer Science, vol 4743. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74767-3_43
Download citation
DOI: https://doi.org/10.1007/978-3-540-74767-3_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74766-6
Online ISBN: 978-3-540-74767-3
eBook Packages: Computer ScienceComputer Science (R0)