More Web Proxy on the site http://driver.im/

research-article

Free access

Variational Autoencoders for Collaborative Filtering

Authors:

Rahul G. Krishnan,

Matthew D. Hoffman,

Tony JebaraAuthors Info & Claims

WWW '18: Proceedings of the 2018 World Wide Web Conference

Pages 689 - 698

https://doi.org/10.1145/3178876.3186150

Published: 23 April 2018 Publication History

All formats PDF

Abstract

We extend variational autoencoders (VAEs) to collaborative filtering for implicit feedback. This non-linear probabilistic model enables us to go beyond the limited modeling capacity of linear factor models which still largely dominate collaborative filtering research.We introduce a generative model with multinomial likelihood and use Bayesian inference for parameter estimation. Despite widespread use in language modeling and economics, the multinomial likelihood receives less attention in the recommender systems literature. We introduce a different regularization parameter for the learning objective, which proves to be crucial for achieving competitive performance. Remarkably, there is an efficient way to tune the parameter using annealing. The resulting model and learning algorithm has information-theoretic connections to maximum entropy discrimination and the information bottleneck principle. Empirically, we show that the proposed approach significantly outperforms several state-of-the-art baselines, including two recently-proposed neural network approaches, on several real-world datasets. We also provide extended experiments comparing the multinomial likelihood with other commonly used likelihood functions in the latent factor collaborative filtering literature and show favorable results. Finally, we identify the pros and cons of employing a principled Bayesian inference approach and characterize settings where it provides the most significant improvements.

References

[1]

Alexander Alemi, Ian Fischer, Joshua Dillon, and Kevin Murphy. 2017. Deep Variational Information Bottleneck. In 5th International Conference on Learning Representations.

[2]

Amjad Almahairi, Kyle Kastner, Kyunghyun Cho, and Aaron Courville. 2015. Learning distributed representations from reviews for collaborative filtering Proceedings of the 9th ACM Conference on Recommender Systems. ACM, 147--154.

Digital Library

[3]

Thierry Bertin-Mahieux, Daniel P.W. Ellis, Brian Whitman, and Paul Lamere. 2011. The Million Song Dataset. In ISMIR, Vol. Vol. 2. 10.

[4]

David M. Blei, Alp Kucukelbir, and Jon D. McAuliffe. 2017. Variational Inference: A Review for Statisticians. J. Amer. Statist. Assoc. Vol. 112, 518 (2017), 859--877.

[5]

David M. Blei, Andrew Y. Ng, and Michael I. Jordan. 2003. Latent dirichlet allocation. Journal of Machine Learning Research Vol. 3, Jan (2003), 993--1022.

Digital Library

[6]

Aleksandar Botev, Bowen Zheng, and David Barber. 2017. Complementary Sum Sampling for Likelihood Approximation in Large Scale Classification. In Proceedings of the 20th International Conference on Artificial Intelligence and Statistics. 1030--1038.

[7]

Samuel R. Bowman, Luke Vilnis, Oriol Vinyals, Andrew M. Dai, Rafal Jozefowicz, and Samy Bengio. 2015. Generating sentences from a continuous space. arXiv preprint arXiv:1511.06349 (2015).

[8]

Sotirios Chatzis, Panayiotis Christodoulou, and Andreas S. Andreou. 2017. Recurrent Latent Variable Networks for Session-Based Recommendation Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems.

Digital Library

[9]

Paul Covington, Jay Adams, and Emre Sargin. 2016. Deep neural networks for youtube recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, 191--198.

Digital Library

[10]

Carl Doersch. 2016. Tutorial on variational autoencoders. arXiv preprint arXiv:1606.05908 (2016).

[11]

Kostadin Georgiev and Preslav Nakov. 2013. A non-IID Framework for Collaborative Filtering with Restricted Boltzmann Machines Proceedings of the 30th International Conference on Machine Learning. 1148--1156.

Digital Library

[12]

Samuel Gershman and Noah Goodman. 2014. Amortized inference in probabilistic reasoning. In Proceedings of the Cognitive Science Society, Vol. Vol. 36.

[13]

Prem Gopalan, Jake M. Hofman, and David M. Blei. 2015. Scalable Recommendation with Hierarchical Poisson Factorization Uncertainty in Artificial Intelligence.

Digital Library

[14]

Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web. 173--182.

Digital Library

[15]

Balázs Hidasi and Alexandros Karatzoglou. 2017. Recurrent Neural Networks with Top-k Gains for Session-based Recommendations. arXiv preprint arXiv:1706.03847 (2017).

[16]

Balázs Hidasi, Alexandros Karatzoglou, Linas Baltrunas, and Domonkos Tikk. 2015. Session-based recommendations with recurrent neural networks. arXiv preprint arXiv:1511.06939 (2015).

[17]

Irina Higgins, Loic Matthey, Arka Pal, Christopher Burgess, Xavier Glorot, Matthew Botvinick, Shakir Mohamed, and Alexander Lerchner. 2017. β-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework 5th International Conference on Learning Representations.

[18]

Matthew D. Hoffman and Matthew J. Johnson. 2016. ELBO surgery: yet another way to carve up the variational evidence lower bound Workshop in Advances in Approximate Bayesian Inference, NIPS.

[19]

Yifan Hu, Yehuda Koren, and Chris Volinsky. 2008. Collaborative filtering for implicit feedback datasets Data Mining, 2008. ICDM'08. Eighth IEEE International Conference on. 263--272.

Digital Library

[20]

Tommi Jaakkola, Marina Meila, and Tony Jebara. 2000. Maximum entropy discrimination. In Advances in Neural Information Processing Systems. 470--476.

Digital Library

[21]

Kalervo J"arvelin and Jaana Kek"al"ainen. 2002. Cumulated gain-based evaluation of IR techniques. ACM Transactions on Information Systems (TOIS) Vol. 20, 4 (2002), 422--446.

Digital Library

[22]

Michael I. Jordan, Zoubin Ghahramani, Tommi S. Jaakkola, and Lawrence K. Saul. 1999. An introduction to variational methods for graphical models. Machine learning Vol. 37, 2 (1999), 183--233.

Digital Library

[23]

Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[24]

Diederik P. Kingma and Max Welling. 2013. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114 (2013).

[25]

Rahul G. Krishnan, Dawen Liang, and Matthew D. Hoffman. 2017. On the challenges of learning with inference networks on sparse, high-dimensional data. arXiv preprint arXiv:1710.06085 (2017).

[26]

Mark Levy and Kris Jack. 2013. Efficient top-n recommendation by linear regression RecSys Large Scale Recommender Systems Workshop.

[27]

Dawen Liang, Jaan Altosaar, Laurent Charlin, and David M. Blei. 2016. Factorization meets the item embedding: Regularizing matrix factorization with item co-occurrence. In Proceedings of the 10th ACM conference on recommender systems. 59--66.

Digital Library

[28]

Dawen Liang, Minshu Zhan, and Daniel P.W. Ellis. 2015. Content-Aware Collaborative Music Recommendation Using Pre-trained Neural Networks. ISMIR. 295--301.

[29]

Benjamin Marlin. 2004. Collaborative filtering: A machine learning perspective. University of Toronto.

[30]

Daniel McFadden et almbox. 1973. Conditional logit analysis of qualitative choice behavior. (1973), bibinfonumpages105--142 pages.

[31]

Yishu Miao, Lei Yu, and Phil Blunsom. 2016. Neural variational inference for text processing. In International Conference on Machine Learning. 1727--1736.

Digital Library

[32]

Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality Advances in neural information processing systems. 3111--3119.

Digital Library

[33]

Xia Ning and George Karypis. 2011. Slim: Sparse linear methods for top-n recommender systems Data Mining (ICDM), 2011 IEEE 11th International Conference on. 497--506.

Digital Library

[34]

Rong Pan, Yunhong Zhou, Bin Cao, Nathan N. Liu, Rajan Lukose, Martin Scholz, and Qiang Yang. 2008. One-class collaborative filtering. In Data Mining, 2008. ICDM'08. Eighth IEEE International Conference on. 502--511.

Digital Library

[35]

Arkadiusz Paterek. 2007. Improving regularized singular value decomposition for collaborative filtering Proceedings of KDD cup and workshop, Vol. Vol. 2007. 5--8.

[36]

Steffen Rendle, Christoph Freudenthaler, Zeno Gantner, and Lars Schmidt-Thieme. 2009. BPR: Bayesian personalized ranking from implicit feedback Proceedings of the twenty-fifth conference on uncertainty in artificial intelligence. 452--461.

Digital Library

[37]

Danilo Jimenez Rezende, Shakir Mohamed, and Daan Wierstra. 2014. Stochastic Backpropagation and Approximate Inference in Deep Generative Models. In Proceedings of the 31st International Conference on Machine Learning. 1278--1286.

Digital Library

[38]

Ruslan Salakhutdinov and Andriy Mnih. 2008. Probabilistic matrix factorization. Advances in neural information processing systems (2008), 1257--1264.

Digital Library

[39]

Ruslan Salakhutdinov, Andriy Mnih, and Geoffrey Hinton. 2007. Restricted Boltzmann machines for collaborative filtering Proceedings of the 24th International Conference on Machine Learning. 791--798.

Digital Library

[40]

Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Darius Braziunas. 2016. On the Effectiveness of Linear Models for One-Class Collaborative Filtering. AAAI.

Digital Library

[41]

Suvash Sedhain, Aditya Krishna Menon, Scott Sanner, and Lexing Xie. 2015. Autorec: Autoencoders meet collaborative filtering Proceedings of the 24th International Conference on World Wide Web. 111--112.

Digital Library

[42]

Elena Smirnova and Flavian Vasile. 2017. Contextual Sequence Modeling for Recommendation with Recurrent Neural Networks Proceedings of the 2nd Workshop on Deep Learning for Recommender Systems.

Digital Library

[43]

Nitish Srivastava, Geoffrey E. Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. Journal of machine learning research Vol. 15, 1 (2014), 1929--1958.

Digital Library

[44]

Harald Steck. 2015. Gaussian ranking by matrix factorization. In Proceedings of the 9th ACM Conference on Recommender Systems. ACM, 115--122.

Digital Library

[45]

Yong Kiam Tan, Xinxing Xu, and Yong Liu. 2016. Improved recurrent neural networks for session-based recommendations Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 17--22.

Digital Library

[46]

Naftali Tishby, Fernando Pereira, and William Bialek. 2000. The information bottleneck method. arXiv preprint physics/0004057 (2000).

[47]

Aaron van den Oord, Sander Dieleman, and Benjamin Schrauwen. 2013. Deep content-based music recommendation. In Advances in Neural Information Processing Systems 26. 2643--2651.

Digital Library

[48]

Hao Wang, Naiyan Wang, and Dit-Yan Yeung. 2015. Collaborative deep learning for recommender systems Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1235--1244.

Digital Library

[49]

Markus Weimer, Alexandros Karatzoglou, Quoc V Le, and Alex J Smola. 2008. Cofi rank-maximum margin matrix factorization for collaborative ranking Advances in neural information processing systems. 1593--1600.

Digital Library

[50]

Jason Weston, Samy Bengio, and Nicolas Usunier. 2011. Wsabie: Scaling up to large vocabulary image annotation IJCAI, Vol. Vol. 11. 2764--2770.

Digital Library

[51]

Yao Wu, Christopher DuBois, Alice X. Zheng, and Martin Ester. 2016. Collaborative denoising auto-encoders for top-n recommender systems Proceedings of the Ninth ACM International Conference on Web Search and Data Mining. 153--162.

Digital Library

[52]

Puyang Xu, Asela Gunawardana, and Sanjeev Khudanpur. 2011. Efficient subsampling for training complex language models Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1128--1136.

Digital Library

[53]

Shuang-Hong Yang, Bo Long, Alexander J. Smola, Hongyuan Zha, and Zhaohui Zheng. 2011. Collaborative competitive filtering: learning recommender using context of user choice. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval. ACM, 295--304.

Digital Library

[54]

Yin Zheng, Bangsheng Tang, Wenkui Ding, and Hanning Zhou. 2016. A Neural Autoregressive Approach to Collaborative Filtering Proceedings of The 33rd International Conference on Machine Learning. 764--773.

Digital Library

Cited By

Aljunid MD.H. MHooshmand MAli WShetty AAlzoubah S(2025)A collaborative filtering recommender systems: SurveyNeurocomputing10.1016/j.neucom.2024.128718617(128718)Online publication date: Feb-2025
https://doi.org/10.1016/j.neucom.2024.128718
Liu Y(2025)A generative and discriminative model for diversity-promoting recommendationInformation Systems10.1016/j.is.2024.102488128(102488)Online publication date: Feb-2025
https://doi.org/10.1016/j.is.2024.102488
Niu YXing XJia ZLiu RXin M(2025)Implicit local–global feature extraction for diffusion sequence recommendationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109471139(109471)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109471
Show More Cited By

Index Terms

Variational Autoencoders for Collaborative Filtering
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
      1. Learning in probabilistic graphical models
        Latent variable models
2. Information systems
  1. Information retrieval
    1. Retrieval tasks and goals
      1. Recommender systems
  2. Information systems applications
    1. Data mining
      1. Collaborative filtering

Recommendations

Bayesian Inference via Variational Approximation for Collaborative Filtering

Variational approximation method finds wide applicability in approximating difficult-to-compute probability distributions, a problem that is especially important in Bayesian inference to estimate posterior distributions. Latent factor model is a ...
Stochastic-Expert Variational Autoencoder for Collaborative Filtering
WWW '22: Proceedings of the ACM Web Conference 2022

Motivated by the recent successes of deep generative models used for collaborative filtering, we propose a novel framework of VAE for collaborative filtering using multiple experts and stochastic expert selection, which allows the model to learn a ...
Bilateral Variational Autoencoder for Collaborative Filtering
WSDM '21: Proceedings of the 14th ACM International Conference on Web Search and Data Mining

Preference data is a form of dyadic data, with measurements associated with pairs of elements arising from two discrete sets of objects. These are users and items, as well as their interactions, e.g., ratings. We are interested in learning ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences

WWW '18: Proceedings of the 2018 World Wide Web Conference

April 2018

2000 pages

ISBN:9781450356398

General Chairs:
Pierre-Antoine Champin
Universitè Claude Bernard Lyon 1, France
,
Fabien Gandon
Inria, Université Côte d'Azur, CNRS, I3S, France
,
Lionel Médini
Université Claude Bernard Lyon 1, France
,
Program Chairs:
Mounia Lalmas
Spotify, UK
,
Panagiotis G. Ipeirotis
New York University, USA

Copyright © 2018 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

IW3C2: International World Wide Web Conference Committee

In-Cooperation

SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web

Publisher

International World Wide Web Conferences Steering Committee

Republic and Canton of Geneva, Switzerland

Publication History

Published: 23 April 2018

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

WWW '18

Sponsor:

IW3C2

WWW '18: The Web Conference 2018

April 23 - 27, 2018

Lyon, France

Acceptance Rates

WWW '18 Paper Acceptance Rate 170 of 1,155 submissions, 15%;

Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

772
Total Citations
View Citations
11,249
Total Downloads

Downloads (Last 12 months)2,801
Downloads (Last 6 weeks)452

Reflects downloads up to 11 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Aljunid MD.H. MHooshmand MAli WShetty AAlzoubah S(2025)A collaborative filtering recommender systems: SurveyNeurocomputing10.1016/j.neucom.2024.128718617(128718)Online publication date: Feb-2025
https://doi.org/10.1016/j.neucom.2024.128718
Liu Y(2025)A generative and discriminative model for diversity-promoting recommendationInformation Systems10.1016/j.is.2024.102488128(102488)Online publication date: Feb-2025
https://doi.org/10.1016/j.is.2024.102488
Niu YXing XJia ZLiu RXin M(2025)Implicit local–global feature extraction for diffusion sequence recommendationEngineering Applications of Artificial Intelligence10.1016/j.engappai.2024.109471139(109471)Online publication date: Jan-2025
https://doi.org/10.1016/j.engappai.2024.109471
Sinha SDatta SKumar RBhattacharya SSarkar ADas K(2024)Exploring CreativityThe Pioneering Applications of Generative AI10.4018/979-8-3693-3278-8.ch008(167-198)Online publication date: 28-Jun-2024
https://doi.org/10.4018/979-8-3693-3278-8.ch008
Kumari RSingh R(2024)Influence of Generative AI on Healthcare Industries Enhanced by the MetaverseExamining the Metaverse in Healthcare10.4018/979-8-3693-1515-6.ch006(129-166)Online publication date: 14-Jun-2024
https://doi.org/10.4018/979-8-3693-1515-6.ch006
Akinola OAkinola AVictor Ifeanyi IOyerinde OJoseph Adewole OSulaimon BOyekan Oyekan B(2024)Artificial Intelligence and Machine Learning Techniques for Anomaly Detection and Threat Mitigation in Cloud-Connected Medical DevicesInternational Journal of Scientific Research and Modern Technology (IJSRMT)10.38124/ijsrmt.v3i3.26(1-13)Online publication date: 28-Mar-2024
https://doi.org/10.38124/ijsrmt.v3i3.26
Akinola OAkinola AVictor Ifeanyi IOyerinde OJoseph Adewole OSulaimon BSulaimon B(2024)Artificial Intelligence and Machine Learning Techniques for Anomaly Detection and Threat Mitigation in Cloud-Connected Medical DevicesInternational Journal of Innovative Science and Research Technology (IJISRT)10.38124/ijisrt/IJISRT24MAR1231(1886-1898)Online publication date: 3-Apr-2024
https://doi.org/10.38124/ijisrt/IJISRT24MAR1231
Li JYu YLiu YOuyang C(2024)GCN-based Interest and Popularity Disentangled Representations Framework for RecommendationData Intelligence10.3724/2096-7004.di.2024.0031Online publication date: 22-Nov-2024
https://doi.org/10.3724/2096-7004.di.2024.0031
Amiri MSarani Rad FLi J(2024)Delighting Palates with AI: Reinforcement Learning’s Triumph in Crafting Personalized Meal Plans with High User AcceptanceNutrients10.3390/nu1603034616:3(346)Online publication date: 24-Jan-2024
https://doi.org/10.3390/nu16030346
Dong JZhou YHao SFeng DZheng HXu Z(2024)Improving Graph Collaborative Filtering from the Perspective of User–Item Interaction Directly Using Contrastive LearningMathematics10.3390/math1213205712:13(2057)Online publication date: 30-Jun-2024
https://doi.org/10.3390/math12132057
Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents