Abstract
We present a new retrieval method based on multiple-Bernoulli model and multinomial model in this paper. We use the multiple-Bernoulli model and multinomial model to estimate the term probabilities by importing the conjugate prior and the term frequencies, and use Dirchlet method to smooth the models for solving the ”zero probability” problem of the language model.
Supported by the science research foundation program of Henan University of Science and Technology, China (2004ZY041) and the natural science foundation program of the Henan Educational Department, China (200410464004).
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lafferty, J., Zhai, C.: Document language models, query models, and risk minimization for information retrieval. In: Proceedings of SIGIR 2001, pp. 111–119 (2001)
Metzler, D., Lavrenko, V., Croft, W.B.: Formal multiple-Bernoulli models for language modeling. In: Proceedings of ACM SIGIR 2004, pp. 231–235 (2004)
Miller, D.H., Leek, T., Schwartz, R.: A hidden Markov model information retrieval system. In: Proceedings of ACM SIGIR 1999, pp. 214–221 (1999)
Ponte, J., Croft, W.B.: A language modeling approach to information retrieval. In: Proceedings of ACM SIGIR 1998, pp. 275–281 (1998)
Zaragoza, H., Hiemstra, D., et al.: Bayesian extension to the language model for ad hoc information retrieval. In: Proceedings of ACM SIGIR 2003, pp. 325–327 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Huo, H., Liu, J., Feng, B. (2005). Multinomial Approach and Multiple-Bernoulli Approach for Information Retrieval Based on Language Modeling. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3613. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539506_72
Download citation
DOI: https://doi.org/10.1007/11539506_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28312-6
Online ISBN: 978-3-540-31830-9
eBook Packages: Computer ScienceComputer Science (R0)