SVD-LDA: Topic Modeling for Full-Text Recommender Systems

Sergey Nikolenko^16,17,18

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9414))

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

1700 Accesses
2 Citations

Abstract

In recommender systems, matrix decompositions, in particular singular value decomposition (SVD), represent users and items as vectors of features and allow for additional terms in the decomposition to account for other available information. In text mining, topic modeling, in particular latent Dirichlet allocation (LDA), are designed to extract topical content of a large corpus of documents. In this work, we present a unified SVD-LDA model that aims to improve SVD-based recommendations for items with textual content with topic modeling of this content. We develop a training algorithm for SVD-LDA based on a first order approximation to Gibbs sampling and show significant improvements in recommendation quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: GBP 19.95; Price includes VAT (United Kingdom)

eBook: GBP 35.99; Price includes VAT (United Kingdom)

Softcover Book: GBP 44.99; Price includes VAT (United Kingdom)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Personalized topic modeling for recommending user-generated content

Article 27 May 2017

Scalable Moment-Based Inference for Latent Dirichlet Allocation

Addressing the user cold start with cross-domain collaborative filtering: exploiting item metadata in matrix factorization

Article 01 January 2019

Notes

1.
http://surfingbird.ru.

References

Resnick, P., Iacovou, N., Sushak, M., Bergstrom, P., Riedl, J.T.: Grouplens: an open architecture for collaborative filtering of netnews. In: 1994 ACM Conference on Computer Supported Collaborative Work Conference, pp. 175–186, Chapel Hill, NC, Association of Computing Machinery (1994)
Google Scholar
Said, A., Jain, B.J., Albayrak, S.: Analyzing weighting schemes in collaborative filtering: cold start, post cold start and power users. In: Proceedings of the 27th Annual ACM Symposium on Applied Computing, SAC 2012, pp. 2035–2040, New York (2012)
Google Scholar
Koren, Y., Bell, R.M.: Advances in collaborative filtering. In: Ricci, F., Rokach, L., Shapira, B., Kantor, P.B. (eds.) Recommender Systems Handbook, pp. 145–186. Springer, US (2011)
Chapter Google Scholar
Hu, Y., Koren, Y., Volinsky, C.: Collaborative filtering for implicit feedback datasets. In: Proceedings of the 8th IEEE International Conference on Data Mining, pp. 263–272, Pisa, Italy. IEEE Computer Society (2008)
Google Scholar
Hoffmann, T.: Unsupervised learning by probabilistic latent semantic analysis. Mach. Learn. 42, 177–196 (2001)
Article Google Scholar
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Griffiths, T., Steyvers, M.: Finding scientific topics. Proc. Nat. Acad. Sci. 101(Suppl. 1), 5228–5335 (2004)
Article Google Scholar
Blei, D.M., McAuliffe, J.D.: Supervised topic models. In: Advances in Neural Information Processing Systems, vol. 22 (2007)
Google Scholar
Linden, G., Smith, B., York, J.: Amazon.com recommendations: item-to-item collaborative filtering. IEEE Internet Computing 7(1), 76–80 (2003)
Article Google Scholar
Agarwal, D., Chen, B.C.: fLDA: matrix factorization through latent Dirichlet allocation. In: Proceedings of the 3rd WSDM, pp. 91–100, New York. ACM (2010)
Google Scholar
Agarwal, D., Chen, B.C.: Regression-based latent factor models. In: Proceedings of the 15th KDD, pp. 19–28 New York. ACM (2009)
Google Scholar
Jarvelin, K., Kekalainen, J.: Cumulated gain-based evaluation of IR techniques. ACM Trans. Inf. Syst. 20, 422–446 (2002)
Article Google Scholar
Fawcett, T.: An introduction to ROC analysis. Pattern Recogn. Lett. 27, 861–874 (2006)
Article Google Scholar
Ling, C.X., Huang, J., Zhang, H.: AUC: a statistically consistent and more discriminating measure than accuracy. In: Proceedings of the International Joint Conference on Artificial Intelligence 2003, pp. 519–526 (2003)
Google Scholar
Potapenko, A., Vorontsov, K.: Robust PLSA performs better than LDA. In: Serdyukov, P., Braslavski, P., Kuznetsov, S.O., Kamps, J., Rüger, S., Agichtein, E., Segalovich, I., Yilmaz, E. (eds.) ECIR 2013. LNCS, vol. 7814, pp. 784–787. Springer, Heidelberg (2013)
Chapter Google Scholar
Vorontsov, K.: Additive regularization for topic models of text collections. Doklady Mathematics 89, 301–304 (2014)
Article MathSciNet MATH Google Scholar
Cao, L., Fei-Fei, L.: Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes. In: IEEE 11th International Conference on Computer Vision, ICCV 2007, pp. 1–8 (2007)
Google Scholar
Hu, D., Saul, L.K.: A probabilistic topic model for unsupervised learning of musical key-profiles. In: Hirata, K., Tzanetakis, G., Yoshii, K. (eds.) ISMIR, International Society for Music Information Retrieval, pp. 441–446 (2009)
Google Scholar

Download references

Acknowledgements

This work was supported by the Samsung Research Center grant “Recommendation Systems based on Probabilistic Graphical Models”, the Government of the Russian Federation grant 14.Z50.31.0030, and the Russian Foundation for Basic Research grant no. 15-29-01173.

Author information

Authors and Affiliations

Steklov Institute of Mathematics at St. Petersburg, St. Petersburg, Russia
Sergey Nikolenko
Laboratory for Internet Studies, National Research University – Higher School of Economics, St. Petersburg, Russia
Sergey Nikolenko
Kazan (Volga Region) Federal University, Kazan, Russia
Sergey Nikolenko

Authors

Sergey Nikolenko
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sergey Nikolenko .

Editor information

Editors and Affiliations

Unidad Profesional Interdisciplinaria, México DF, Mexico
Obdulia Pichardo Lagunas
Universidad Autónoma Metropolitana, México DF, Mexico
Oscar Herrera Alcántara
Instituto de Investigaciones Eléctricas, Cuernavaca, Morelos, Mexico
Gustavo Arroyo Figueroa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nikolenko, S. (2015). SVD-LDA: Topic Modeling for Full-Text Recommender Systems. In: Pichardo Lagunas, O., Herrera Alcántara, O., Arroyo Figueroa, G. (eds) Advances in Artificial Intelligence and Its Applications. MICAI 2015. Lecture Notes in Computer Science(), vol 9414. Springer, Cham. https://doi.org/10.1007/978-3-319-27101-9_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-27101-9_5
Published: 10 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27100-2
Online ISBN: 978-3-319-27101-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

SVD-LDA: Topic Modeling for Full-Text Recommender Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Personalized topic modeling for recommending user-generated content

Scalable Moment-Based Inference for Latent Dirichlet Allocation

Addressing the user cold start with cross-domain collaborative filtering: exploiting item metadata in matrix factorization

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

SVD-LDA: Topic Modeling for Full-Text Recommender Systems

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Personalized topic modeling for recommending user-generated content

Scalable Moment-Based Inference for Latent Dirichlet Allocation

Addressing the user cold start with cross-domain collaborative filtering: exploiting item metadata in matrix factorization

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation