Algorithmic and Complexity Issues of Three Clustering Methods in Microarray Data Analysis

Jinsong Tan²,
Kok Seng Chua³ &
Louxin Zhang²

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3595))

Included in the following conference series:

International Computing and Combinatorics Conference

1860 Accesses

Abstract

The complexity, approximation and algorithmic issues of several clustering problems are studied. These non-traditional clustering problems arise from recent studies in microarray data analysis. We prove the following results. (1) Two variants of the Order-Preserving Submatrix problem are NP-hard. There are polynomial-time algorithms for the Order-Preserving Submatrix Problem when the condition or gene sets are given. (2) The Smooth Subset problem cannot be approximable with ratio 0.5 +δ for any constant δ >0 unless NP=P. (3) Inferring plaid model problem is NP-hard.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 71.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 89.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Recovering all generalized order-preserving submatrices: new exact formulations and algorithms

Article 25 March 2016

Clustering Methods for Microarray Data Sets

A Comparative Analysis of Clustering and Biclustering Algorithms in Gene Analysis

References

Alizadeh, A., et al.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503–510 (2000)
Article Google Scholar
Ausiello, G., et al.: Complexity and Approximation. Springer, Heidelberg (1999)
Book MATH Google Scholar
Ben-Dor, A., Yakhini, Z.: Clustering gene expression patterns. In: Proc. RECOMB 1999, pp. 33–42 (1999)
Google Scholar
Ben-Dor, A., Chor, B., Karp, R., Yakhini, Z.: Discovering local structure in gene expression data: The order-preserving submatrix problem. In: Proceedings of RECOMB 2002, pp. 49–57 (2002)
Google Scholar
Berman, P., DasGupta, B., Muthukrishnan, S., Ramaswami, S.: Efficient approximation algorithm for tiling and packing problems with rectangles. J. Alg. 41, 443–470 (2001)
Article MathSciNet MATH Google Scholar
Chen, Y., Dougherty, E., Bitter, M.: Ratio-based decisions and the quantitative analysis of cDNA microarray images. J. Biomed. Optics 2, 364–374 (1997)
Article Google Scholar
Cheng, Y., Church, G.: Biclustering of expression data. In: Proceedings of ISMB 2000, pp. 93-103 (2000)
Google Scholar
Cormen, T.H., et al.: Introduction to Algorithms, 2nd edn. McGraw-Hill, New York (2001)
MATH Google Scholar
Eisen, M.B., et al.: Clustering Analysis and display of genome-wide expression pattern. Proc. Natl. Amer. Sci. 95, 14863–14868 (1998)
Article Google Scholar
Garey, M.R., Johnson, D.: Computers and Intractability: A Guide to the Theory of NP-completeness. Freeman, San Francisco (1979)
MATH Google Scholar
Hartuv, E., et al.: An algorithm for clustering cDNAs for gene expression analysis. In: Proceedings of Recomb 1999, pp. 188–197 (1999)
Google Scholar
Hedenfalk, I., et al.: Gene-expression profiles in hereditary breast cancer. New England Journal of Medicine 344, 539–548 (2001)
Article Google Scholar
Hochbaum, D.S.: Approximation Algorithms for NP-hard Problems. PWS Publishing Co. (1995)
Google Scholar
Kolda, T.G., O’Leary, D.P.: A semidiscrete matrix decomposition for latent semantic indexing in information retrieval. ACM Trans. on Information Systems 16, 322–346 (1998)
Article Google Scholar
Lawler, E.L.: Combinatorial Optimization: Networks and Matroids. Holt, Rinehart and Winston Inc. (1976)
MATH Google Scholar
Liu, J., Yang, J., Wang, W.: Biclustering in gene expression data by tendency. In: Proceedings of CSB 2004, pp. 182–193 (2004)
Google Scholar
Lazzeroni, L., Owen, A.: Plaid Models for Gene Expression Data. Statistica Sinica 12, 61–86 (2002); See http://www-stat.stanford.edu/~owen for more about Plaid model.
MathSciNet MATH Google Scholar
Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)
Article Google Scholar
Peeters, R.: The maximum edge biclique problem is NP-complete. Discrete Applied Mathematics 131, 651–654 (2003)
Article MathSciNet MATH Google Scholar
Tamayo, P., et al.: Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. Proc. Natl. Acad. Sci. 96, 2907–2912 (1999)
Article Google Scholar
Troyanskaya, O., et al.: Missing value estimation methods for DNA microarrays. Bioinformatics 17, 520–525 (2001)
Article Google Scholar
Yannakakis, M.: Node-and edge-deletion NP-complete problems. In: Proceedings of the 10th Annual STOC, pp. 253–264 (1978)
Google Scholar
Zhang, L., Zhu, S.: Complexity Study on Two Clustering Problems. In: Proceedings of the Annual Inter. Symposium on Alg. and Comput., pp. 660–669 (2001)
Google Scholar
Zhang, L., Zhu, S.: A new approach to clustering gene expression data. In: Proceedings of IEEE Symposium on Bioinformatics, pp. 268–275 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, National University of Singapore, Singapore, 117543
Jinsong Tan & Louxin Zhang
The Inst. of High Performance Computing, Singapore, 117528
Kok Seng Chua

Authors

Jinsong Tan
View author publications
You can also search for this author in PubMed Google Scholar
Kok Seng Chua
View author publications
You can also search for this author in PubMed Google Scholar
Louxin Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong
Lusheng Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tan, J., Chua, K.S., Zhang, L. (2005). Algorithmic and Complexity Issues of Three Clustering Methods in Microarray Data Analysis. In: Wang, L. (eds) Computing and Combinatorics. COCOON 2005. Lecture Notes in Computer Science, vol 3595. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11533719_10

Download citation

DOI: https://doi.org/10.1007/11533719_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28061-3
Online ISBN: 978-3-540-31806-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Algorithmic and Complexity Issues of Three Clustering Methods in Microarray Data Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Recovering all generalized order-preserving submatrices: new exact formulations and algorithms

Clustering Methods for Microarray Data Sets

A Comparative Analysis of Clustering and Biclustering Algorithms in Gene Analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Algorithmic and Complexity Issues of Three Clustering Methods in Microarray Data Analysis

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Recovering all generalized order-preserving submatrices: new exact formulations and algorithms

Clustering Methods for Microarray Data Sets

A Comparative Analysis of Clustering and Biclustering Algorithms in Gene Analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation