[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Side Information Fusion for Recommender Systems over Heterogeneous Information Network

Published: 10 June 2021 Publication History

Abstract

Collaborative filtering (CF) has been one of the most important and popular recommendation methods, which aims at predicting users’ preferences (ratings) based on their past behaviors. Recently, various types of side information beyond the explicit ratings users give to items, such as social connections among users and metadata of items, have been introduced into CF and shown to be useful for improving recommendation performance. However, previous works process different types of information separately, thus failing to capture the correlations that might exist across them. To address this problem, in this work, we study the application of heterogeneous information network (HIN), which offers a unifying and flexible representation of different types of side information, to enhance CF-based recommendation methods. However, we face challenging issues in HIN-based recommendation, i.e., how to capture similarities of complex semantics between users and items in a HIN, and how to effectively fuse these similarities to improve final recommendation performance. To address these issues, we apply metagraph to similarity computation and solve the information fusion problem with a “matrix factorization (MF) + factorization machine (FM)” framework. For the MF part, we obtain the user-item similarity matrix from each metagraph and then apply low-rank matrix approximation to obtain latent features for both users and items. For the FM part, we apply FM with Group lasso (FMG) on the features obtained from the MF part to train the recommending model and, at the same time, identify the useful metagraphs. Besides FMG, a two-stage method, we further propose an end-to-end method, hierarchical attention fusing, to fuse metagraph-based similarities for the final recommendation. Experimental results on four large real-world datasets show that the two proposed frameworks significantly outperform existing state-of-the-art methods in terms of recommendation performance.

References

[1]
Z. Allen-Zhu and E. Hazan. 2016. Variance reduction for faster non-convex optimization. In ICML. 699–707.
[2]
P. W. Battaglia, J. B. Hamrick, V. Bapst, A. Sanchez-Gonzalez, V. Zambaldi, M. Malinowski, A. Tacchetti, D. Raposo, A. Santoro, R. Faulkner, C. Gulcehre, F. Song, A. Ballard, J. Gilmer, G. Dahl, A. Vaswani, K. Allen, C. Nash, V. Langston, C. Dyer, N. Heess, D. Wierstra, P. Kohli, M. Botvinick, O. Vinyals, Y. Li, and R. Pascanu. 2018. Relational inductive biases, deep learning, and graph networks. arXiv preprint arXiv:1806.01261 (2018).
[3]
D. P. Bertsekas. 1999. Nonlinear Programming. Athena Scientific.
[4]
E. Candès and B. Recht. 2009. Exact matrix completion via convex optimization. Foundations of Computational Mathematics 9, 6 (2009), 717.
[5]
E. J. Candès, M. B. Wakin, and S. P. Boyd. 2008. Enhancing sparsity by reweighted ℓ1 minimization. Journal of Fourier Analysis and Applications 14, 5–6 (2008), 877–905.
[6]
Yuhui Ding, Quanming Yao, and Tong Zhang. 2020. Propagation Model Search for Graph Neural Networks. Technical Report.
[7]
Y. Dong, N. V Chawla, and A. Swami. 2017. metapath2vec: Scalable representation learning for heterogeneous networks. In SIGKDD. 135–144.
[8]
Y. Dong, Z. Hu, K. Wang, Y. Sun, and J. Tang. 2020. Heterogeneous network representation learning. In IJCAI. 4861–4867.
[9]
S. Fan, J. Zhu, X. Han, C. Shi, L. Hu, B. Ma, and Y. Li. 2019. Metapath-guided heterogeneous graph neural network for intent recommendation. In KDD. 2478–2486
[10]
Y. Fan, S. Hou, Y. Zhang, Y. Ye, and M. Abdulhayoglu. 2018a. Gotcha-sly malware!: Scorpion a Metagraph2vec based malware detection system. In SIGKDD. 253–262.
[11]
Y. Fan, Y. Zhang, Y. Ye, and X. Li. 2018b. Automatic opioid user detection from Twitter: Transductive ensemble built on different meta-graph based similarities over heterogeneous information network. In IJCAI. 3357–3363.
[12]
Y. Fang, W. Lin, V. W. Zheng, M. Wu, J. Shi, K. Chang, and X. Li. 2019. Metagraph-based learning on heterogeneous graphs. IEEE Transactions on Knowledge and Data Engineering 33, 1 (2019), 154–168.
[13]
Y. Fang, W. Lin, W. Zheng, M. Wu, K. Chang, and X. Li. 2016. Semantic proximity search on graphs with meta graph-based learning. In ICDE. 277–288.
[14]
G. Guo, J. Zhang, Z. Sun, and N. Yorke-Smith. 2015. LibRec: A Java Library for Recommender Systems. Technical Report. School of Information Systems, Singapore Management University.
[15]
R. He and J. McAuley. 2016. Ups and downs: Modeling the visual evolution of fashion trends with one-class collaborative filtering. In WebConf. 507–517.
[16]
Y. He, Y. Song, J. Li, C. Ji, J. Peng, and H. Peng. 2019. HeteSpaceyWalk: A heterogeneous spacey random walk for heterogeneous information network embedding. In CIKM. 639–648.
[17]
J. Herlocker, J. Konstan, A. Borchers, and J. Riedl. 1999. An algorithmic framework for performing collaborative filtering. In SIGIR. 230–237.
[18]
L. Hong, A. S Doumith, and B. D. Davison. 2013. Co-factorization machines: Modeling user interests and predicting individual decisions in Twitter. In WSDM. 557–566.
[19]
S. Hou, Y. Ye, Y. Song, and M. Abdulhayoglu. 2017. HinDroid: An intelligent Android malware detection system based on structured heterogeneous information network. In SIGKDD. 1507–1515.
[20]
B. Hu, C. Shi, W. X. Zhao, and P. S. Yu. 2018. Leveraging meta-path based context for top-N recommendation with a neural co-attention model. In SIGKDD. 1531–1540.
[21]
B. Hu, Z. Zhang, C. Shi, J. Zhou, X. Li, and Y. Qi. 2019. Cash-out user detection based on attributed heterogeneous information network with a hierarchical attention mechanism. In AAAI. 946–953.
[22]
Z. Huang, Y. Zheng, R. Cheng, Y. Sun, N. Mamoulis, and X. Li. 2016. Meta structure: Computing relevance in large heterogeneous information networks. In SIGKDD. 1595–1604.
[23]
L. Jacob, G. Obozinski, and J. Vert. 2009. Group lasso with overlap and graph lasso. In ICML. 433–440.
[24]
H. Jiang, Y. Song, C. Wang, M. Zhang, and Y. Sun. 2017. Semi-supervised learning over heterogeneous information networks by ensemble of meta-graph guided random walks. In IJCAI. 1944–1950.
[25]
J. Jin, J. Qin, Y. Fang, K. Du, W. Zhang, Y. Yu, Z. Zhang, and A.J Smola. 2020. An efficient neighborhood-based interaction model for recommendation on heterogeneous graph. In SIGKDD. 75–84.
[26]
W. Kang, M. Wan, and J. McAuley. 2018. Recommendation through mixtures of heterogeneous item relationships. In CIKM. 1143–1152.
[27]
X. Kong, B. Cao, and P. S. Yu. 2013a. Multi-label classification by mining label and instance correlations from heterogeneous information networks. In SIGKDD. 614–622.
[28]
X. Kong, J. Zhang, and P. S. Yu. 2013b. Inferring anchor links across multiple heterogeneous social networks. In CIKM. 179–188.
[29]
Y. Koren. 2008. Factorization meets the neighborhood: A multifaceted collaborative filtering model. In SIGKDD. 426–434.
[30]
N. Lao and W. Cohen. 2010. Relational retrieval using a combination of path-constrained random walks. Machine Learning 81, 1 (2010), 53–67.
[31]
H. Li and Z. Lin. 2015. Accelerated proximal gradient methods for nonconvex programming. In NeurIPS. 379–387.
[32]
G. Ling, M. R. Lyu, and I. King. 2014. Ratings meet reviews, a combined approach to recommend. In RecSys. 105–112.
[33]
H. Ma, D. Zhou, C. Liu, M. R. Lyu, and I. King. 2011. Recommender systems with social regularization. In WSDM. 287–296.
[34]
J. McAuley and J. Leskovec. 2013. Hidden factors and hidden topics: Understanding rating dimensions with review text. In RecSys. 165–172.
[35]
A. Mnih and R. Salakhutdinov. 2007. Probabilistic matrix factorization. In NeurIPS. 1257–1264.
[36]
W. Pan. 2016. A survey of transfer learning for collaborative recommendation with auxiliary data. Neurocomputing 177, C (2016), 447–453.
[37]
N. Parikh and S. Boyd. 2014. Proximal algorithms. Foundations and Trends in Optimization 1, 3 (2014), 127–239.
[38]
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Köpf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala. 2019. PyTorch: An imperative style, high-performance deep learning library. In NeurIPS. 8024–8035.
[39]
A. Paterek. 2007. Improving Regularized Singular Value Decomposition for Collaborative Filtering. Technical Report. Institute of Informatics, Warsaw University. 5–8.
[40]
A. Pfadler, H. Zhao, J. Wang, L. Wang, P. Huang, and D. L. Lee. 2020. Billion-scale recommendation with heterogeneous side information at Taobao. In ICDE. 1667–1676.
[41]
S. Reddi, A. Hefny, S. Sra, B. Poczos, and A. Smola. 2016. Stochastic variance reduction for nonconvex optimization. In ICML. 314–323.
[42]
R. Rehurek and P. Sojka. 2010. Software Framework for Topic Modelling with Large Corpora. Technical Report.
[43]
S. Rendle. 2012. Factorization machines with libFM. ACM Transactions on Intelligent Systems and Technology 3, 3 (2012), 57:1–57:22.
[44]
C. Shi, X. Han, S. Li, X. Wang, S. Wang, J. Du, and P. S. Yu. 2021. Deep collaborative filtering with multi-aspect information in heterogeneous networks. IEEE Transactions on Knowledge and Data Engineering 33, 4 (2021), 1413–1425.
[45]
C. Shi, B. Hu, X. Zhao, and P. S. Yu. 2018. Heterogeneous information network embedding for recommendation. IEEE Transactions on Knowledge and Data Engineering 31, 2 (2018), 357–370.
[46]
C. Shi, X. Kong, Y. Huang, P. S. Yu, and B. Wu. 2014. HeteSim: A general framework for relevance measure in heterogeneous networks. IEEE Transactions on Knowledge and Data Engineering 26, 10 (2014), 2479–2492.
[47]
C. Shi, Y. Li, J. Zhang, Y. Sun, and Y. Philip. 2017. A survey of heterogeneous information network analysis. IEEE Transactions on Knowledge and Data Engineering 29, 1 (2017), 17–37.
[48]
C. Shi, Z. Zhang, P. Luo, P. S. Yu, Y. Yue, and B. Wu. 2015. Semantic path based personalized recommendation on weighted heterogeneous information networks. In CIKM. 453–462.
[49]
Y. Sun and J. Han. 2013. Mining heterogeneous information networks: A structural analysis approach. ACM SIGKDD Explorations Newsletter 14, 2 (2013), 20–28.
[50]
Y. Sun, J. Han, C. C. Aggarwal, and N. V. Chawla. 2012. When will it happen?: Relationship prediction in heterogeneous information networks. In WSDM. 663–672.
[51]
Y. Sun, J. Han, X. Yan, P. S. Yu, and T. Wu. 2011. PathSim: Meta path-based top-k similarity search in heterogeneous information networks. In PVLDB. 992–1003.
[52]
Y. Sun, B. Norick, J. Han, X. Yan, P. S. Yu, and X. Yu. 2013. PathSelClus: Integrating meta-path selection with user-guided object clustering in heterogeneous information networks. ACM Transactions on Knowledge Discovery from Data 7, 3 (2013), 11:1–11:23.
[53]
P. Veličković, G. Cucurull, A. Casanova, A. Romero, P. Lio, and Y. Bengio. 2018. Graph attention networks. In ICLR.
[54]
C. Wang, Y. Song, A. El-Kishky, D. Roth, M. Zhang, and J. Han. 2015a. Incorporating world knowledge to document clustering via heterogeneous information networks. In SIGKDD. 1215–1224.
[55]
C. Wang, Y. Song, H. Li, Y. Sun, Z. Zhang, and J. Han. 2017. Distant meta-path similarities for text-based heterogeneous information networks. In CIKM. 1629–1638.
[56]
C. Wang, Y. Song, H. Li, M. Zhang, and J. Han. 2015b. KnowSim: A document similarity measure on structured heterogeneous information networks. In ICDM. 1015–1020.
[57]
H. Wang, F. Zhang, J. Wang, M. Zhao, W. Li, X. Xie, and M. Guo. 2019c. Exploring high-order user preference on the knowledge graph for recommender systems. ACM Transactions on Information Systems 37, 3 (2019), 32.
[58]
J. Wang, P. Huang, H. Zhao, Z. Zhang, B. Zhao, and D. L. Lee. 2018. Billion-scale commodity embedding for E-commerce recommendation in Alibaba. In SIGKDD. 839–848.
[59]
M. Wang, L. Yu, D. Zheng, Q. Gan, Y. Gai, Z. Ye, M. Li, J. Zhou, Q. Huang, C. Ma, Z. Huang, Q. Guo, H. Zhang, H. Lin, J. Zhao, J. Li, A. Smola, and Z. Zhang. 2019b. Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs. Technical Report.
[60]
X. Wang, H. Ji, C. Shi, B. Wang, Y. Ye, P. Cui, and P. S. Yu. 2019a. Heterogeneous graph attention network. In WebConf. 2022–2032.
[61]
L. Xiao and T. Zhang. 2014. A proximal stochastic gradient method with progressive variance reduction. SIAM Journal on Optimization 24, 4 (2014), 2057–2075.
[62]
W. Xiao, H. Zhao, H. Pan, Y. Song, V. W. Zheng, and Q. Yang. 2019. Beyond personalization: Social content recommendation for creator equality and consumer satisfaction. In SIGKDD. 235–245.
[63]
W. Xiao, H. Zhao, H. Pan, Y. Song, V. W. Zheng, and Q. Yang. 2021. Social explorative attention based recommendation for content distribution platforms. Data Mining and Knowledge Discovery 35 (2021), 533–567.
[64]
L. Yan, W. Li, G. Xue, and D. Han. 2014. Coupled group lasso for web-scale ctr prediction in display advertising. In ICML. 802–810.
[65]
C. Yang, Y. Feng, P. Li, Y. Shi, and J. Han. 2018. Meta-graph based HIN spectral embedding: Methods, analyses, and insights. In ICDM. 657–666.
[66]
C. Yang, Y. Xiao, Y. Zhang, Y. Sun, and J. Han. 2020. Heterogeneous network representation learning: Survey, benchmark, evaluation, and beyond. arXiv preprint arXiv:2004.00216 (2020).
[67]
Q. Yao and J. Kwok. 2015. Accelerated inexact soft-impute for fast large-scale matrix completion. In IJCAI. 4002–4008.
[68]
Q. Yao and J. Kwok. 2016. Efficient learning with a family of nonconvex regularizers by redistributing nonconvexity. In ICML. 2645–2654.
[69]
Q. Yao, J. Kwok, F. Gao, W. Chen, and T.-Y. Liu. 2017. Efficient inexact proximal gradient algorithm for nonconvex problems. In IJCAI. 3308–3314.
[70]
Q. Yao, J. T. Kwok, T. Wang, and T. Y. Liu. 2018. Large-scale low-rank matrix learning with nonconvex regularizers. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 11 (2018), 2628–2643.
[71]
Q. Yao and M. Wang. 2018. Taking Human Out of Learning Applications: A Survey on Automated Machine Learning. Technical Report.
[72]
M. Ye, P. Yin, W. C. Lee, and D. L. Lee. 2011. Exploiting geographical influence for collaborative point-of-interest recommendation. In SIGIR. 325–334.
[73]
X. Yu, X. Ren, Q. Gu, Y. Sun, and J. Han. 2013. Collaborative Filtering with Entity Similarity Regularization in Heterogeneous Information Networks. Technical Report. University of Illinois at Urbana-Champaign.
[74]
X. Yu, X. Ren, Y. Sun, Q. Gu, B. Sturt, U. Khandelwal, B. Norick, and J. Han. 2014. Personalized entity recommendation: A heterogeneous information network approach. In WSDM. 283–292.
[75]
D. Zhang, J. Yin, X. Zhu, and C. Zhang. 2018. Metagraph2vec: Complex semantic path augmented heterogeneous network embedding. In PAKDD. 196–208.
[76]
J. Zhang, P. S. Yu, and Z. Zhou. 2014. Meta-path based multi-network collective link prediction. In SIGKDD. 1286–1295.
[77]
T. Zhang. 2010. Analysis of multi-stage convex relaxation for sparse regularization. Journal of Machine Learning Research 11, 35 (2010), 1081–1107.
[78]
W. Zhang, Y. Fang, Z. Liu, M. Wu, and X. Zhang. 2020. mg2vec: Learning relationship-preserving heterogeneous graph representations via metagraph embedding. IEEE Transactions on Knowledge and Data Engineering (2020).
[79]
Huan Zhao, Lanning Wei, and Quanming Yao. 2020. Simplifying Architecture Search for Graph Neural Network. Technical Report.
[80]
H. Zhao, Q. Yao, J. Kwok, and D. Lee. 2017a. Collaborative filtering with social local models. In ICDM. 645–654.
[81]
H. Zhao, Q. Yao, J. Li, Y. Song, and D. Lee. 2017b. Meta-graph based recommendation fusion over heterogeneous information networks. In SIGKDD. 635–644.
[82]
H. Zhao, Y. Zhou, Y. Song, and D. L. Lee. 2019. Motif enhanced recommendation over heterogeneous information network. In CIKM. 2189–2192.
[83]
V. W. Zheng, Y. Zheng, X. Xie, and Q. Yang. 2012. Towards mobile intelligence: Learning from GPS history data for collaborative recommendation. Artificial Intelligence Journal 184 (2012), 17–37.

Cited By

View all
  • (2024)Application of recommendation system based on fuzzy logic in music culture communication and music teaching managementJournal of Intelligent & Fuzzy Systems10.3233/JIFS-232422(1-15)Online publication date: 30-Apr-2024
  • (2024)Hybrid recommender system model for digital library from multiple online publishersF1000Research10.12688/f1000research.133013.312(1140)Online publication date: 18-Nov-2024
  • (2024)Hybrid recommender system model for digital library from multiple online publishersF1000Research10.12688/f1000research.133013.212(1140)Online publication date: 4-Apr-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Transactions on Knowledge Discovery from Data
ACM Transactions on Knowledge Discovery from Data  Volume 15, Issue 4
August 2021
486 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3458847
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 June 2021
Accepted: 01 December 2020
Revised: 01 August 2020
Received: 01 September 2019
Published in TKDD Volume 15, Issue 4

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Recommender systems
  2. collaborative filtering
  3. factorization machine
  4. graph attention networks
  5. heterogeneous information networks
  6. matrix factorization

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • General Research Fund
  • Hong Kong RGC including Early Career Scheme
  • Research Impact Fund
  • NSFC
  • WeBank-HKUST Joint Lab
  • Research Grants Council HKSAR GRF

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)6
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Application of recommendation system based on fuzzy logic in music culture communication and music teaching managementJournal of Intelligent & Fuzzy Systems10.3233/JIFS-232422(1-15)Online publication date: 30-Apr-2024
  • (2024)Hybrid recommender system model for digital library from multiple online publishersF1000Research10.12688/f1000research.133013.312(1140)Online publication date: 18-Nov-2024
  • (2024)Hybrid recommender system model for digital library from multiple online publishersF1000Research10.12688/f1000research.133013.212(1140)Online publication date: 4-Apr-2024
  • (2024)A Systematic Review of the Impact of Auxiliary Information on Recommender SystemsIEEE Access10.1109/ACCESS.2024.346275012(139524-139539)Online publication date: 2024
  • (2024)Multidimensional Insights into Recommender Systems: A Systematic Review of Evaluation Metrics and Thematic ApplicationsSoftware Engineering Methods Design and Application10.1007/978-3-031-70285-3_29(382-403)Online publication date: 23-Oct-2024
  • (2023)Hybrid recommender system model for digital library from multiple online publishersF1000Research10.12688/f1000research.133013.112(1140)Online publication date: 12-Sep-2023
  • (2023)Sequential and Graphical Cross-Domain Recommendations with a Multi-View Hierarchical Transfer GateACM Transactions on Knowledge Discovery from Data10.1145/360461518:1(1-28)Online publication date: 10-Aug-2023
  • (2023)Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged PhotosACM Transactions on Knowledge Discovery from Data10.1145/358201517:7(1-18)Online publication date: 6-Apr-2023
  • (2023)Exploring Time-aware Multi-pattern Group Venue Recommendation in LBSNsACM Transactions on Information Systems10.1145/356428041:3(1-31)Online publication date: 7-Feb-2023
  • (2023)Curriculum Pre-training Heterogeneous Subgraph Transformer for Top-N RecommendationACM Transactions on Information Systems10.1145/352866741:1(1-28)Online publication date: 10-Jan-2023
  • Show More Cited By

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media