Complexity of Comparing Hidden Markov Models

Rune B. Lyngsø⁶ &
Christian N. S. Pedersen⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2223))

Included in the following conference series:

International Symposium on Algorithms and Computation

1660 Accesses
5 Citations
3 Altmetric

Abstract

The basic theory of hidden Markov models was developed and applied to problems in speech recognition in the late 1960’s, and has since then been applied to numerous problems, e.g. biological sequence analysis. In this paper we consider the problem of computing the most likely string generated by a given model, and its implications on the complexity of comparing hidden Markov models. We show that computing the most likely string, and approximating its probability within any constant factor, is NP-hard, and establish the NP-hardness of comparing two hidden Markov models under the Lα- and L ₁-norms. We discuss the applicability of the technique used to other measures of distance between probability distributions. In particular we show that it cannot be used to prove NP-hardness of determining the Kullback-Leibler distance between the probability distributions of two hidden Markov models, or of comparing them under the L _k-norm for any xed even integer k.

Supported by grants from Carlsbergfondet and the Prog. in Math. and Mol. Biology

Partially supported by the IST Programme of the EU under contract number IST-1999-14186 (ALCOM-FT)

funded by the University of Aarhus Research Foundation

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 71.50; Price includes VAT (United Kingdom)

Softcover Book: GBP 89.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Bounds and Estimates on the Average Edit Distance

Lower Bounds on the Generalized Central Moments of the Optimal Alignments Score of Random Sequences

Article 08 December 2016

Hypothesis Testing, Information Divergence and Computational Geometry

References

N. Abe and M. K. Warmuth. On the computational complexity of approximating distributions by probabilistic automata. Machine Learning, 9:205–260, 1992.
MATH Google Scholar
K. Asai, S. Hayamizu, and K. Handa. Prediction of protein secondary structure by the hidden markov model. Comp. Appl. in the Biosciences, 9:141–146, 1993.
Google Scholar
A. Bateman, E. Birney, R. Durbin, S. R. Eddy, K. L. Howe, and E. L. L. Sonnhammer. The Pfam protein families database. Nucleic Acid Research, 28:263–266, 2000.
Article Google Scholar
T. Batu, L. Fortnow, R. Rubinfeld, W. D. Smith, and P. White. Testing that distributions are close. In Proc. 15th STOC, pages 259–269, 2000.
Google Scholar
G. A. Churchill. Stochastic models for heterogeneous DNA sequences. Bull. Math. Biol., 51:79–94, 1989.
Article MathSciNet MATH Google Scholar
T. M. Cover and J. A. Thomas. Elements of Information Theory. John Wiley & Sons, New York, 1991.
Book MATH Google Scholar
L. Engebretsen and J. Holmerin. Clique is hard to approximate within n^(1-o(1)). In Proc. 27th ICALP, volume 1853 of Lecture Notes in Computer Science, pages 2–12, 2000.
MATH Google Scholar
J. Feigenbaum, S. Kannan, M. Strauss, and M. Viswanathan. An approximate l ¹-difference algorithm for massive data streams. In Proc. 40th FOCS, pages 501–511, 1999.
Google Scholar
J. Fong and M. Strauss. An approximate l ^p-difference algorithm for massive data streams. In Proc. 17th STACS, 2000.
Google Scholar
J. Håstad. Clique is hard to approximate within n ^1-∈. Acta Mathematica, 182:105–142, 1999.
Article MathSciNet Google Scholar
A. Krogh. Two methods for improving performance of an HMM and their application for gene nding. In Proc. 5th ISMB, pages 179–186, 1997.
Google Scholar
A. Krogh, M. Brown, I. S. Mian, K. Sjölander, and D. Haussler. Hidden markov models in computational biology: Applications to protein modeling. Jour. Mol. Biol, 235:1501–1531, 1994.
Article Google Scholar
R. B. Lyngsø, C. N. S. Pedersen, and H. Nielsen. Metrics and similarity measures for hidden Markov models. In Proc. 7th ISMB, pages 178–186, 1999.
Google Scholar
L. R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. In Proc. of the IEEE, volume 77, pages 257–286, 1989.
Google Scholar
Y. Singer and M. K. Warmuth. Training algorithms for hidden Markov models using entropy based distance functions. In Proc. 9th NIPS, pages 641–647, 1996.
Google Scholar
E. L. L. Sonnhammer, G. von Heijne, and A. Krogh. A hidden Markov model for predicting transmembrane helices in protein sequences. In Proc. 6th ISMB, 1998.
Google Scholar

Download references

Author information

Authors and Affiliations

Baskin Center for Computer Science and Engineering, University of California, 95064, Santa Cruz, CA, USA
Rune B. Lyngsø
BiRC - Bioinformatics Research Center, Department of Computer Science, University of Aarhus, Ny Munkegade, 8000, Århus C, DK, Denmark
Christian N. S. Pedersen

Authors

Rune B. Lyngsø
View author publications
You can also search for this author in PubMed Google Scholar
Christian N. S. Pedersen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Basser Department of Computer Science, University of Sydney, Madsen Building F09, 2006, Sydney, NSW, Australia
Peter Eades
Department of Computer Science, University of Canterbury, Private Bag 4800, Christchurch, New Zealand
Tadao Takaoka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lyngsø, R.B., Pedersen, C.N.S. (2001). Complexity of Comparing Hidden Markov Models. In: Eades, P., Takaoka, T. (eds) Algorithms and Computation. ISAAC 2001. Lecture Notes in Computer Science, vol 2223. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45678-3_36

Download citation

DOI: https://doi.org/10.1007/3-540-45678-3_36
Published: 04 December 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42985-2
Online ISBN: 978-3-540-45678-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Complexity of Comparing Hidden Markov Models

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Bounds and Estimates on the Average Edit Distance

Lower Bounds on the Generalized Central Moments of the Optimal Alignments Score of Random Sequences

Hypothesis Testing, Information Divergence and Computational Geometry

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Complexity of Comparing Hidden Markov Models

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Bounds and Estimates on the Average Edit Distance

Lower Bounds on the Generalized Central Moments of the Optimal Alignments Score of Random Sequences

Hypothesis Testing, Information Divergence and Computational Geometry

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation