[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

On the User Behavior Leakage from Recommender System Exposure

Published: 07 February 2023 Publication History

Abstract

Modern recommender systems are trained to predict users’ potential future interactions from users’ historical behavior data. During the interaction process, despite the data coming from the user side, recommender systems also generate exposure data to provide users with personalized recommendation slates. Compared with the sparse user behavior data, the system exposure data are much larger in volume since only very few exposed items would be clicked by the user. In addition, user historical behavior data are privacy sensitive and commonly protected with careful access authorization. However, the large volume of recommender exposure data generated by the service provider itself usually receives less attention and could be accessed within a relatively larger scope of various information seekers or even potential adversaries.
In this article, we investigate the problem of user behavior data leakage in the field of recommender systems. We show that the privacy-sensitive user past behavior data can be inferred through the modeling of system exposure. In other words, one can infer which items the userhas clicked just from the observation of current systemexposure for this user. Given the fact that system exposure data could be widely accessed from a relatively larger scope, we believe that user past behavior privacy has a high risk of leakage in recommender systems. More precisely, we conduct an attack model whose input is the current recommended item slate (i.e., system exposure) for the user while the output is the user’s historical behavior. Specifically, we exploit an encoder-decoder structure to construct the attack model and apply different encoding and decoding strategies to verify attack performance. Experimental results on two real-world datasets indicate a great danger of user behavior data leakage. To address the risk, we propose a two-stage privacy-protection mechanism that first selects a subset of items from the exposure slate and then replaces the selected items with uniform or popularity-based exposure. Experimental evaluation reveals a trade-off effect between the recommendation accuracy and the privacy disclosure risk, which is an interesting and important topic for privacy concerns in recommender systems.

References

[1]
Muhammad Ammad-ud din, Elena Ivannikova, A. Suleiman Khan, Were Oyomno, Qiang Fu, Eeik Kuan Tan, and Adrian Flanagan. 2019. Federated collaborative filtering for privacy-preserving personalized recommendation system. arXiv: Information Retrieval (2019).
[2]
Ghazaleh Beigi, Ahmadreza Mosallanezhad, Ruocheng Guo, Hamidreza Alvari, Alexander Nou, and Huan Liu. 2020. Privacy-aware recommendation with private-attribute protection using adversarial learning. In 13th ACM International Conference on Web Search and Data Mining (WSDM’20), Houston, TX, February, 2020 (2020), 34–42.
[3]
Di Chai, Leye Wang, Kai Chen, and Qiang Yang. 2021. Secure federated matrix factorization. IEEE Intell. Syst. 36, 5 (2021), 11–20.
[4]
Jiawei Chen, Hande Dong, Yang Qiu, Xiangnan He, Xin Xin, Liang Chen, Guli Lin, and Keping Yang. 2021. AutoDebias: Learning to debias for recommendation. In SIGIR. ACM, 21–30.
[5]
Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He. 2020. Bias and debias in recommender system: A survey and future directions. ACM Trans. Inf. Syst. Just Accepted (October 2022).
[6]
Minmin Chen, Alex Beutel, Paul Covington, Sagar Jain, Francois Belletti, and Ed H. Chi. 2019. Top-k off-policy correction for a REINFORCE recommender system. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining. ACM, 456–464.
[7]
Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, et al. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 7–10.
[8]
Kyunghyun Cho, Bart van Merrienboer, Çaglar Gülçehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In EMNLP. ACL, 1724–1734.
[9]
Yashar Deldjoo, Tommaso Di Noia, and Felice Antonio Merra. 2021. A survey on adversarial recommender systems: from attack/defense strategies to generative adversarial networks. Computing Surveys (2021).
[10]
Jingtao Ding, Fuli Feng, Xiangnan He, Guanghui Yu, Yong Li, and Depeng Jin. 2018. An improved sampler for Bayesian personalized ranking by leveraging view data. In Companion Proceedings of the The Web Conference 2018 (Lyon, France) (WWW’18). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 13–14.
[11]
Jingtao Ding, Yuhan Quan, Xiangnan He, Yong Li, and Depeng Jin. 2019. Reinforced negative sampling for recommendation with exposure data. In IJCAI. ijcai.org, 2230–2236.
[12]
Chen Gao, Chao Huang, Dongsheng Lin, Depeng Jin, and Yong Li. 2020. DPLCF: Differentially private local collaborative filtering. In SIGIR. ACM, 961–970.
[13]
Zhengqiang Ge, Xinyu Liu, Qiang Li, Yu Li, and Dong Guo. 2021. PrivItem2Vec: A privacy-preserving algorithm for top-N recommendation. International Journal of Distributed Sensor Networks 17, 12 (2021).
[14]
Jialiang Han, Yun Ma, Qiaozhu Mei, and Xuanzhe Liu. 2021. DeepRec: On-device deep learning for privacy-preserving sequential recommendation in mobile commerce. In Proceedings of the Web Conference 2021. 900–911.
[15]
Bin Hao, Min Zhang, Weizhi Ma, Shaoyun Shi, Xinxing Yu, Houzhi Shan, Yiqun Liu, and Shaoping Ma. 2021. large-scale rich context query and recommendation dataset in online knowledge-sharing. arxiv:2106.06467 [cs.IR]
[16]
Tong He, Zhi Zhang, Hang Zhang, Zhongyue Zhang, Junyuan Xie, and Mu Li. 2019. Bag of tricks for image classification with convolutional neural networks. In CVPR. Computer Vision Foundation/IEEE, 558–567.
[17]
Xiangnan He and Tat-Seng Chua. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. 355–364.
[18]
Xiangnan He, Kuan Deng, Xiang Wang, Yan Li, Yongdong Zhang, and Meng Wang. 2020. LightGC N: Simplifying and powering graph convolution network for recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 639–648.
[19]
Xiangnan He, Lizi Liao, Hanwang Zhang, Liqiang Nie, Xia Hu, and Tat-Seng Chua. 2017. Neural collaborative filtering. In Proceedings of the 26th International Conference on World Wide Web. 173–182.
[20]
Balázs Hidasi, Alexandros Karatzoglou, Linas Baltrunas, and Domonkos Tikk. 2016. Session-based recommendations with recurrent neural networks. In ICLR (Poster).
[21]
Balázs Hidasi and Domonkos Tikk. 2016. General factorization framework for context-aware recommendations. Data Mining and Knowledge Discovery 30, 2 (2016), 342–371.
[22]
Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, and Yinghui Xu. 2018. Reinforcement learning to rank in e-commerce search engine: Formalization, analysis, and application. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 368–377.
[23]
Hakan Inan, Khashayar Khosravi, and Richard Socher. 2017. Tying word vectors and word classifiers: A loss framework for language modeling. In ICLR (Poster). OpenReview.net.
[24]
Arjan J. P. Jeckmans, Michael Beye, Zekeriya Erkin, Pieter Hartel, Reginald L. Lagendijk, and Qiang Tang. 2013. Privacy in recommender systems. In Social Media Retrieval. Springer, 263–281.
[25]
Ray Jiang, Silvia Chiappa, Tor Lattimore, András György, and Pushmeet Kohli. 2019. Degenerate feedback loops in recommender systems. In Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society (AIES’19) (Honolulu, HI). ACM, New York, NY, USA, 383–390.
[26]
Santosh Kabbur, Xia Ning, and George Karypis. 2013. FISM: Factored item similarity models for top-N recommender systems. In Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 659–667.
[27]
Wang-Cheng Kang and Julian McAuley. 2018. Self-attentive sequential recommendation. In 2018 IEEE International Conference on Data Mining (ICDM’18). IEEE, 197–206.
[28]
Farwa K. Khan, Adrian Flanagan, Kuan Eeik Tan, Zareen Alamgir, and Muhammad Ammad-Ud-Din. 2021. A payload optimization method for federated recommender systems. In Recsys. 432–442.
[29]
Jinsu Kim, Dongyoung Koo, Yuna Kim, Hyunsoo Yoon, Junbum Shin, and Sungwook Kim. 2018. Efficient privacy-preserving matrix factorization for recommendation via fully homomorphic encryption. ACM Transactions on Privacy and Security. 21, 4 (2018), 17:1–17:30.
[30]
Diederik P. Kingma and Jimmy Ba. 2015. Adam: A method for stochastic optimization. In ICLR (Poster).
[31]
Yehuda Koren, Robert Bell, and Chris Volinsky. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30–37.
[32]
Jianxun Lian, Fuzheng Zhang, Min Hou, Hongwei Wang, Xing Xie, and Guangzhong Sun. 2017. Practical lessons for job recommendations in the cold-start scenario. In Proceedings of the Recommender Systems Challenge 2017 (Como, Italy) (RecSys Challenge’17). ACM, New York, NY, Article 4, 6 pages.
[33]
Guanyu Lin, Feng Liang, Weike Pan, and Zhong Ming. 2020. Fedrec: Federated recommendation with explicit feedback. IEEE Intelligent Systems (2020).
[34]
Dugang Liu, Pengxiang Cheng, Zhenhua Dong, Xiuqiang He, Weike Pan, and Zhong Ming. 2020. A general knowledge distillation framework for counterfactual recommendation via uniform data. In SIGIR. ACM, 831–840.
[35]
Lorenzo Minto, Moritz Haller, Benjamin Livshits, and Hamed Haddadi. 2021. Stronger privacy for federated collaborative filtering with implicit feedback. In RecSys. ACM, 342–350.
[36]
Khalil Muhammad, Qinqin Wang, Diarmuid O’Reilly-Morgan, Elias Tragos, Barry Smyth, Neil Hurley, James Geraci, and Aonghus Lawlor. 2020. Fedfast: Going beyond average for faster training of federated recommender systems. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1234–1242.
[37]
Xia Ning and George Karypis. 2011. Slim: Sparse linear methods for top-N recommender systems. In IEEE 11th International Conference on Data Mining. IEEE, 497–506.
[38]
Marta Otto. 2018. Regulation (EU) 2016/679 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data (general data protection regulation–GDPR). In International and European Labour Law. Nomos Verlagsgesellschaft mbH & Co. KG, 958–981.
[39]
Michael J. Pazzani and Daniel Billsus. 2007. Content-based recommendation systems. In The Adaptive Web(Lecture Notes in Computer Science), Vol. 4321. Springer, 325–341.
[40]
Ofir Press and Lior Wolf. 2017. Using the output embedding to improve language models. In EACL (2). Association for Computational Linguistics, 157–163.
[41]
Tao Qi, Fangzhao Wu, Chuhan Wu, Yongfeng Huang, and Xing Xie. 2020. Privacy-preserving news recommendation model learning. In EMNLP(Findings of ACL, Vol. EMNLP’20). Association for Computational Linguistics, 1423–1432.
[42]
Steffen Rendle, Christoph Freudenthaler, and Lars Schmidt-Thieme. 2010. Factorizing personalized Markov chains for next-basket recommendation. In WWW. ACM, 811–820.
[43]
Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, and Kazuhide Nakata. 2020. Unbiased recommender learning from missing-not-at-random implicit feedback. In WSDM. ACM, 501–509.
[44]
Hasim Sak, Andrew W. Senior, and Françoise Beaufays. 2014. Long short-term memory recurrent neural network architectures for large scale acoustic modeling. In INTERSPEECH. ISCA, 338–342.
[45]
Badrul Sarwar, George Karypis, Joseph Konstan, and John Riedl. 2001. Item-based collaborative filtering recommendation algorithms. In Proceedings of the 10th International Conference on World Wide Web. 285–295.
[46]
Hyejin Shin, Sungwook Kim, Junbum Shin, and Xiaokui Xiao. 2018. Privacy enhanced matrix factorization for recommendation with local differential privacy. IEEE Transactions on Knowledge and Data Engineering 30, 9 (2018), 1770–1782.
[47]
Reza Shokri, Marco Stronati, Congzheng Song, and Vitaly Shmatikov. 2017. Membership inference attacks against machine learning models. In 2017 IEEE Symposium on Security and Privacy (SP’17). IEEE, 3–18.
[48]
Lisa J. Sotto, Bridget C. Treacy, and Melinda L. McLellan. 2010. Privacy and data security risks in cloud computing.World Communications Regulation Report 5, 2 (2010), 38.
[49]
Fei Sun, Jun Liu, Jian Wu, Changhua Pei, Xiao Lin, Wenwu Ou, and Peng Jiang. 2019. BERT4Rec: Sequential recommendation with bidirectional encoder representations from transformer. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1441–1450.
[50]
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, and Zbigniew Wojna. 2016. Rethinking the inception architecture for computer vision. In CVPR. IEEE Computer Society, 2818–2826.
[51]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In NIPS. 5998–6008.
[52]
Qinyong Wang, Hongzhi Yin, Tong Chen, Junliang Yu, Alexander Zhou, and Xiangliang Zhang. 2021. Fast-adapting and privacy-preserving federated recommender system. The VLDB Journal (2021), 1–20.
[53]
Xiang Wang, Xiangnan He, Meng Wang, Fuli Feng, and Tat-Seng Chua. 2019. Neural graph collaborative filtering. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 165–174.
[54]
Xiang Wang, Yaokun Xu, Xiangnan He, Yixin Cao, Meng Wang, and Tat-Seng Chua. 2020. Reinforced negative sampling over knowledge graph for recommendation. In Proceedings of The Web Conference 2020. 99–109.
[55]
Chuhan Wu, Fangzhao Wu, Yang Cao, Yongfeng Huang, and Xing Xie. 2021. FedGNN: Federated graph neural network for privacy-preserving recommendation. arXiv preprint arXiv:2102.04925 (2021).
[56]
Fangzhao Wu, Ying Qiao, Jiun-Hung Chen, Chuhan Wu, Tao Qi, Jianxun Lian, Danyang Liu, Xing Xie, Jianfeng Gao, Winnie Wu, and Ming Zhou. 2020. MIND: A large-scale dataset for news recommendation. In ACL. Association for Computational Linguistics, 3597–3606.
[57]
Yao Wu, Christopher DuBois, Alice X. Zheng, and Martin Ester. 2016. Collaborative denoising auto-encoders for top-N recommender systems. In Proceedings of the 9th ACM International Conference on Web Search and Data Mining. 153–162.
[58]
Xin Xin, Alexandros Karatzoglou, Ioannis Arapakis, and Joemon M. Jose. 2020. Self-supervised reinforcement learning for recommender systems. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 931–940.
[59]
Jheng-Hong Yang, Chih-Ming Chen, Chuan-Ju Wang, and Ming-Feng Tsai. 2018. HOP-rec: High-order proximity for implicit recommendation. In Recsys. 140–144.
[60]
Qiang Yang, Yang Liu, Yong Cheng, Yan Kang, Tianjian Chen, and Han Yu. 2019. Federated learning. Synthesis Lectures on Artificial Intelligence and Machine Learning 13, 3 (2019), 1–207.
[61]
Fajie Yuan, Xiangnan He, Alexandros Karatzoglou, and Liguang Zhang. 2020. Parameter-efficient transfer from sequential behaviors for user modeling and recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1469–1478.
[62]
Fajie Yuan, Alexandros Karatzoglou, Ioannis Arapakis, Joemon M. Jose, and Xiangnan He. 2019. A simple convolutional generative network for next item recommendation. In Proceedings of the 12th ACM International Conference on Web Search and Data Mining. ACM, 582–590.
[63]
Minxing Zhang, Zhaochun Ren, Zihan Wang, Pengjie Ren, Zhumin Chen, Pengfei Hu, and Yang Zhang. 2021. Membership inference attacks against recommender systems. In CCS. ACM, 864–879.
[64]
Shijie Zhang and Hongzhi Yin. 2022. Comprehensive privacy analysis on federated recommender system against attribute inference attacks. CoRR abs/2205.11857 (2022).
[65]
Shijie Zhang, Hongzhi Yin, Tong Chen, Zi Huang, Lizhen Cui, and Xiangliang Zhang. 2021. Graph embedding for recommendation against attribute inference attacks. In WWW. ACM/IW3C2, 3002–3014.
[66]
Yang Zhang, Fuli Feng, Xiangnan He, Tianxin Wei, Chonggang Song, Guohui Ling, and Yongdong Zhang. 2021. Causal intervention for leveraging popularity bias in recommendation. In SIGIR. ACM, 11–20.

Cited By

View all
  • (2025)Landmark-v6: A stable IPv6 landmark representation method based on multi-feature clusteringInformation Processing & Management10.1016/j.ipm.2024.10392162:1(103921)Online publication date: Jan-2025
  • (2024)Average User-Side Counterfactual Fairness for Collaborative FilteringACM Transactions on Information Systems10.1145/365663942:5(1-26)Online publication date: 13-May-2024
  • (2024)Report on the Workshop on Learning and Evaluating Recommendations with Impressions (LERI) at RecSys 2023ACM SIGIR Forum10.1145/3642979.364300157:2(1-8)Online publication date: 22-Jan-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Information Systems
ACM Transactions on Information Systems  Volume 41, Issue 3
July 2023
890 pages
ISSN:1046-8188
EISSN:1558-2868
DOI:10.1145/3582880
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 February 2023
Online AM: 21 October 2022
Accepted: 04 October 2022
Revised: 15 August 2022
Received: 31 May 2022
Published in TOIS Volume 41, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Recommender system
  2. privacy leakage
  3. privacy protection
  4. information security

Qualifiers

  • Research-article
  • Refereed

Funding Sources

  • Natural Science Foundation of China
  • Key Scientific and Technological Innovation Program of Shandong Province
  • Fundamental Research Funds of Shandong University, Meituan, and the Tencent WeChat Rhino-Bird Focused Research Program

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)344
  • Downloads (Last 6 weeks)34
Reflects downloads up to 03 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Landmark-v6: A stable IPv6 landmark representation method based on multi-feature clusteringInformation Processing & Management10.1016/j.ipm.2024.10392162:1(103921)Online publication date: Jan-2025
  • (2024)Average User-Side Counterfactual Fairness for Collaborative FilteringACM Transactions on Information Systems10.1145/365663942:5(1-26)Online publication date: 13-May-2024
  • (2024)Report on the Workshop on Learning and Evaluating Recommendations with Impressions (LERI) at RecSys 2023ACM SIGIR Forum10.1145/3642979.364300157:2(1-8)Online publication date: 22-Jan-2024
  • (2024)Debiasing Sequential Recommenders through Distributionally Robust Optimization over System ExposureProceedings of the 17th ACM International Conference on Web Search and Data Mining10.1145/3616855.3635848(882-890)Online publication date: 4-Mar-2024
  • (2024)User perceptions of algorithmic persuasion in OTT platforms: A scoping review2024 IEEE International Symposium on Technology and Society (ISTAS)10.1109/ISTAS61960.2024.10732741(1-7)Online publication date: 18-Sep-2024
  • (2024)Deep Leakage From Horizontal Federated Sequential Recommender SystemsIEEE Access10.1109/ACCESS.2024.349869912(173037-173046)Online publication date: 2024
  • (2024)Collaborative denoised graph contrastive learning for multi-modal recommendationInformation Sciences10.1016/j.ins.2024.121017679(121017)Online publication date: Sep-2024
  • (2024)Dynamic Hierarchical Attention Network for news recommendationExpert Systems with Applications: An International Journal10.1016/j.eswa.2024.124667255:PCOnline publication date: 1-Dec-2024
  • (2024)BayesSentiRSExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.121930238:PBOnline publication date: 27-Feb-2024
  • (2024)Achieving EEG-based depression recognition using Decentralized-Centralized structureBiomedical Signal Processing and Control10.1016/j.bspc.2024.10640295(106402)Online publication date: Sep-2024
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

Full Text

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media