DOI: 10.1145/3219819.3220023

xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems

Published: 19 July 2018

Abstract

Combinatorial features are essential for the success of many commercial models. Manually crafting these features usually comes at high cost due to the variety, volume and velocity of raw data in web-scale systems. Factorization-based models, which measure interactions in terms of vector products, can learn patterns of combinatorial features automatically and generalize to unseen features as well. With the great success of deep neural networks (DNNs) in various fields, researchers have recently proposed several DNN-based factorization models to learn both low- and high-order feature interactions. Despite their powerful ability to learn an arbitrary function from data, plain DNNs generate feature interactions implicitly and at the bit-wise level. In this paper, we propose a novel Compressed Interaction Network (CIN), which aims to generate feature interactions in an explicit fashion and at the vector-wise level. We show that the CIN shares some functionalities with convolutional neural networks (CNNs) and recurrent neural networks (RNNs). We further combine a CIN and a classical DNN into one unified model, and name this new model eXtreme Deep Factorization Machine (xDeepFM). On one hand, the xDeepFM is able to learn certain bounded-degree feature interactions explicitly; on the other hand, it can learn arbitrary low- and high-order feature interactions implicitly. We conduct comprehensive experiments on three real-world datasets. Our results demonstrate that xDeepFM outperforms state-of-the-art models. We have released the source code of xDeepFM at https://github.com/Leavingseason/xDeepFM.
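
To make "explicit, vector-wise" interactions concrete, here is a minimal NumPy sketch of a CIN-style layer. The abstract does not spell out the layer, so the sketch rests on an assumption: each output feature map is a weighted sum of element-wise products between the previous layer's feature maps and the input field embeddings. The names `cin_layer` and `cin_forward`, the weight shapes, and the sum pooling are illustrative choices, not the released implementation.

```python
import numpy as np


def cin_layer(x_prev, x0, weights):
    """One CIN-style layer (illustrative sketch, not the released code).

    x_prev:  (h_prev, d) feature maps from the previous layer
    x0:      (m, d) field embeddings from the input layer
    weights: (h_next, h_prev, m) one weight matrix per output feature map
    returns: (h_next, d) feature maps for the next layer
    """
    # Vector-wise interaction: pair every previous feature map with every
    # input field and multiply element-wise along the embedding dimension.
    z = np.einsum("id,jd->ijd", x_prev, x0)
    # Compression: each output map is a weighted sum over all (i, j) pairs,
    # so the embedding dimension d is preserved.
    return np.einsum("hij,ijd->hd", weights, z)


def cin_forward(x0, weight_list):
    """Stack layers and sum-pool each feature map over the embedding dimension.

    Sum pooling and concatenation are assumptions made for this sketch.
    """
    x_prev, pooled = x0, []
    for w in weight_list:
        x_prev = cin_layer(x_prev, x0, w)
        pooled.append(x_prev.sum(axis=1))
    return np.concatenate(pooled)


# Toy input: m = 2 fields, embedding size d = 3.
x0 = np.array([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
out = cin_layer(x0, x0, np.ones((1, 2, 2)))
features = cin_forward(x0, [np.ones((1, 2, 2)), np.ones((2, 1, 2))])
```

With all-ones weights, each entry of `out` is the square of the per-dimension sum over fields (25, 49, 81 here), which shows that the first layer produces degree-2 interactions. Each further layer multiplies by `x0` once more, so the interaction degree grows by one per layer and stays bounded by the depth.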

Supplementary Material

MP4 File (lian_xdeepfm.mp4)


Published In

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
July 2018
2925 pages
ISBN:9781450355520
DOI:10.1145/3219819
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. deep learning
  2. factorization machines
  3. feature interactions
  4. neural network
  5. recommender systems

Qualifiers

  • Research-article

Funding Sources

  • Youth Innovation Promotion Association of CAS

Conference

KDD '18
Acceptance Rates

KDD '18 Paper Acceptance Rate 107 of 983 submissions, 11%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months): 304
  • Downloads (Last 6 weeks): 27
Reflects downloads up to 09 Dec 2024

Cited By

  • (2025) Attention-Enhanced and Knowledge-Fused Dual Item Representations Network for Recommendation. Tsinghua Science and Technology 30:2 (585-599). DOI: 10.26599/TST.2023.9010143. Online publication date: Apr-2025
  • (2025) ENCODE: Breaking the Trade-Off Between Performance and Efficiency in Long-Term User Behavior Modeling. IEEE Transactions on Knowledge and Data Engineering 37:1 (265-277). DOI: 10.1109/TKDE.2024.3486445. Online publication date: Jan-2025
  • (2025) Upper bound on the predictability of rating prediction in recommender systems. Information Processing & Management 62:1 (103950). DOI: 10.1016/j.ipm.2024.103950. Online publication date: Jan-2025
  • (2025) A real-time energy and cost efficient vehicle route assignment neural recommender system. Expert Systems with Applications 263 (125634). DOI: 10.1016/j.eswa.2024.125634. Online publication date: Mar-2025
  • (2025) Knowledge-driven hierarchical intents modeling for recommendation. Expert Systems with Applications 259 (125361). DOI: 10.1016/j.eswa.2024.125361. Online publication date: Jan-2025
  • (2025) Multi-perspective API call sequence behavior analysis and fusion for malware classification. Computers & Security 148 (104177). DOI: 10.1016/j.cose.2024.104177. Online publication date: Jan-2025
  • (2024) Explainable Neural Tensor Factorization for Commercial Alley Revenues Prediction. Electronics 13:16 (3279). DOI: 10.3390/electronics13163279. Online publication date: 19-Aug-2024
  • (2024) Popularity-Debiased Graph Self-Supervised for Recommendation. Electronics 13:4 (677). DOI: 10.3390/electronics13040677. Online publication date: 6-Feb-2024
  • (2024) Calibration of Typhoon Track Forecasts Based on Deep Learning Methods. Atmosphere 15:9 (1125). DOI: 10.3390/atmos15091125. Online publication date: 17-Sep-2024
  • (2024) Feature-Interaction-Enhanced Sequential Transformer for Click-Through Rate Prediction. Applied Sciences 14:7 (2760). DOI: 10.3390/app14072760. Online publication date: 26-Mar-2024
