[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2124295.2124313acmconferencesArticle/Chapter ViewAbstractPublication PageswsdmConference Proceedingsconference-collections
research-article

Learning recommender systems with adaptive regularization

Published: 08 February 2012 Publication History

Abstract

Many factorization models like matrix or tensor factorization have been proposed for the important application of recommender systems. The success of such factorization models depends largely on the choice of good values for the regularization parameters. Without a careful selection they result in poor prediction quality as they either underfit or overfit the data. Regularization values are typically determined by an expensive search that requires learning the model parameters several times: once for each tuple of candidate values for the regularization parameters. In this paper, we present a new method that adapts the regularization automatically while training the model parameters. To achieve this, we optimize simultaneously for two criteria: (1) as usual the model parameters for the regularized objective and (2) the regularization of future parameter updates for the best predictive quality on a validation set. We develop this for the generic model class of Factorization Machines which subsumes a wide variety of factorization models. We show empirically, that the advantages of our adaptive regularization method compared to expensive hyperparameter search do not come to the price of worse predictive quality. In total with our method, learning regularization parameters is as easy as learning model parameters and thus there is no need for any time-consuming search of regularization values because they are found on-the-fly. This makes our method highly attractive for practical use.

References

[1]
C.-C. Chang and C.-J. Lin. LIBSVM: a library for support vector machines, 2001. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[2]
O. Chapelle, V. Vapnik, O. Bousquet, and S. Mukherjee. Choosing multiple parameters for support vector machines. Machine Learning, 46:131--159, 2002. 10.1023/A:1012450327387.
[3]
A. Gelman, J. B. Carlin, H. S. Stern, and D. B. Rubin. Bayesian Data Analysis. Chapman and Hall/CRC, 2nd edition, 2003.
[4]
R. A. Harshman. Foundations of the parafac procedure: models and conditions for an 'exploratory' multimodal factor analysis. UCLA Working Papers in Phonetics, pages 1--84, 1970.
[5]
T. Hastie, S. Rosset, R. Tibshirani, and J. Zhu. The entire regularization path for the support vector machine. Journal of Machine Learning Research, 5:1391--1415, December 2004.
[6]
T. Hofmann. Latent semantic models for collaborative filtering. ACM Trans. Inf. Syst., 22(1):89--115, 2004.
[7]
Y. Koren. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In KDD '08: Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 426--434, New York, NY, USA, 2008. ACM.
[8]
Y. Koren. Collaborative filtering with temporal dynamics. In KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 447--456, New York, NY, USA, 2009. ACM.
[9]
J. Larsen, C. Svarer, and L. N. Andersen. Adaptive regularization in neural network modeling. In Neural Networks: Tricks of the Trade, Lecture Notes in Computer Science, volume 1524, pages 113--132. Springer-Verlag, 1998.
[10]
L. D. Lathauwer, B. D. Moor, and J. Vandewalle. A multilinear singular value decomposition. SIAM J. Matrix Anal. Appl., 21(4):1253--1278, 2000.
[11]
H. Ma, H. Yang, M. R. Lyu, and I. King. Sorec: social recommendation using probabilistic matrix factorization. In Proceeding of the 17th ACM conference on Information and knowledge management, CIKM '08, pages 931--940, New York, NY, USA, 2008. ACM.
[12]
A. Paterek. Improving regularized singular value decomposition for collaborative filtering. In Proc. KDD Cup Workshop at SIGKDD'07, 13th ACM Int. Conf. on Knowledge Discovery and Data Mining, pages 39--42, 2007.
[13]
S. Rendle. Factorization machines. In Proceedings of the 10th IEEE International Conference on Data Mining. IEEE Computer Society, 2010.
[14]
S. Rendle, L. B. Marinho, A. Nanopoulos, and L. Schmidt-Thieme. Learning optimal ranking with tensor factorization for tag recommendation. In KDD '09: Proceeding of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, New York, NY, USA, 2009. ACM.
[15]
S. Rendle and L. Schmidt-Thieme. Pairwise interaction tensor factorization for personalized tag recommendation. In WSDM '10: Proceedings of the third ACM international conference on Web search and data mining, pages 81--90, New York, NY, USA, 2010. ACM.
[16]
J. D. M. Rennie and N. Srebro. Fast maximum margin matrix factorization for collaborative prediction. In ICML '05: Proceedings of the 22nd international conference on Machine learning, pages 713--719. ACM, 2005.
[17]
R. Salakhutdinov and A. Mnih. Bayesian probabilistic matrix factorization using Markov chain Monte Carlo. In Proceedings of the 25th International Conference on Machine Learning, volume~25, 2008.
[18]
R. Salakhutdinov and A. Mnih. Probabilistic matrix factorization. In Advances in Neural Information Processing Systems, volume~20, 2008.
[19]
N. Srebro, J. D. M. Rennie, and T. S. Jaakola. Maximum-margin matrix factorization. In Advances in Neural Information Processing Systems 17, pages 1329--1336. MIT Press, 2005.
[20]
L. Tucker. Some mathematical notes on three-mode factor analysis. Psychometrika, 31:279--311, 1966.
[21]
L. Xiong, X. Chen, T.-K. Huang, J. Schneider, and J. G. Carbonell. Temporal collaborative filtering with bayesian probabilistic tensor factorization. 2010.

Cited By

View all
  • (2024)Pseudo Gradient-Adjusted Particle Swarm Optimization for Accurate Adaptive Latent Factor AnalysisIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2023.334091954:4(2213-2226)Online publication date: Apr-2024
  • (2024)Purposeful Regularization with Reinforcement Learning for Facial Expression Recognition In-the-Wild2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW63382.2024.00464(4615-4624)Online publication date: 17-Jun-2024
  • (2023)Scaling Machine Learning with a Ring-based Distributed FrameworkProceedings of the 2023 7th International Conference on Computer Science and Artificial Intelligence10.1145/3638584.3638667(23-32)Online publication date: 8-Dec-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WSDM '12: Proceedings of the fifth ACM international conference on Web search and data mining
February 2012
792 pages
ISBN:9781450307475
DOI:10.1145/2124295
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 February 2012

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. matrix factorization
  2. regularization
  3. tensor factorization

Qualifiers

  • Research-article

Conference

Acceptance Rates

Overall Acceptance Rate 498 of 2,863 submissions, 17%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)26
  • Downloads (Last 6 weeks)4
Reflects downloads up to 10 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Pseudo Gradient-Adjusted Particle Swarm Optimization for Accurate Adaptive Latent Factor AnalysisIEEE Transactions on Systems, Man, and Cybernetics: Systems10.1109/TSMC.2023.334091954:4(2213-2226)Online publication date: Apr-2024
  • (2024)Purposeful Regularization with Reinforcement Learning for Facial Expression Recognition In-the-Wild2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)10.1109/CVPRW63382.2024.00464(4615-4624)Online publication date: 17-Jun-2024
  • (2023)Scaling Machine Learning with a Ring-based Distributed FrameworkProceedings of the 2023 7th International Conference on Computer Science and Artificial Intelligence10.1145/3638584.3638667(23-32)Online publication date: 8-Dec-2023
  • (2023)Weighted Knowledge Graph EmbeddingProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591784(867-877)Online publication date: 19-Jul-2023
  • (2023)Hyperparameter Learning for Deep Learning-Based Recommender SystemsIEEE Transactions on Services Computing10.1109/TSC.2023.323462316:4(2699-2712)Online publication date: 1-Jul-2023
  • (2023)Scaling Machine Learning with an Efficient Hybrid Distributed FrameworkWeb Information Systems Engineering – WISE 202310.1007/978-981-99-7254-8_72(926-936)Online publication date: 21-Oct-2023
  • (2022)Adapting Triplet Importance of Implicit Feedback for Personalized RecommendationProceedings of the 31st ACM International Conference on Information & Knowledge Management10.1145/3511808.3557229(2148-2157)Online publication date: 17-Oct-2022
  • (2022)Adaptive Latent Factor Analysis via Generalized Momentum-Incorporated Particle Swarm Optimization2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)10.1109/ICNSC55942.2022.10004062(1-6)Online publication date: 15-Dec-2022
  • (2022)Bayesian feature interaction selection for factorization machinesArtificial Intelligence10.1016/j.artint.2021.103589302:COnline publication date: 1-Jan-2022
  • (2022)IntroductionLatent Factor Analysis for High-dimensional and Sparse Matrices10.1007/978-981-19-6703-0_1(1-10)Online publication date: 16-Nov-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media