[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2740908.2742000acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Fast and Accurate Maximum Inner Product Recommendations on Map-Reduce

Published: 18 May 2015 Publication History

Abstract

Personalization has become a predominant theme in online advertising; the internet allows advertisers to target only those users with the greatest chances of engagement, maximizing the probability of success and user happiness. However, a na{\"i}ve approach to matching users with their most suitable content scales proportionally to the product of the cardinalities of the user and content sets. For advertisers with large portfolios, this quickly becomes intractable. In this work, we address this more general {\em top-$k$ personalization} problem, giving a scalable method to produce recommendations based on personalization models where the affinity between a user and an item is captured by an inner product (i.e., most matrix factorization models). We first transform the problem into finding the $k$-nearest neighbors among the items for each user, then approximate the solution via a method which is particularly suited for use on a map-reduce cluster. We empirically show that our method is between 1 and 2 orders of magnitude faster than previous work, while maintaining excellent approximation quality. Additionally, we provide an open-source implementation of our proposed method, this implementation is used in production at Etsy for a number of large scale personalization systems, and is the same code as used in the experiments below.

References

[1]
A. Andoni and P. Indyk. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Commun. ACM, 51(1):117--122, Jan. 2008.
[2]
J. Attenberg, S. Pandey, and T. Suel. Modeling and predicting user behavior in sponsored search. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '09, pages 1067--1076, New York, NY, USA, 2009. ACM.
[3]
J. Bennett and S. Lanning. The netix prize. In In KDD Cup and Workshop in conjunction with KDD, 2007.
[4]
P. Gopalan, J. M. Hofman, and D. M. Blei. Scalable recommendation with poisson factorization. CoRR, abs/1311.1704, 2013.
[5]
N. Halko, P.-G. Martinsson, and J. A. Tropp. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions. SIAM Review, 53(2):217--288, 2011.
[6]
D. J. Hu, R. Hall, and J. Attenberg. Style in the long tail: Discovering unique interests with latent variable models in large scale social e-commerce. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '14, pages 1640--1649, New York, NY, USA, 2014. ACM.
[7]
Y. Hu, Y. Koren, and C. Volinsky. Collaborative filtering for implicit feedback datasets. In Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, ICDM '08, pages 263--272, Washington, DC, USA, 2008. IEEE Computer Society.
[8]
P. Indyk. Nearest neighbors in high-dimensional spaces. In J. E. Goodman and J. O'Rourke, editors, Handbook of Discrete and Computational Geometry. CRC Press LLC, Boca Raton, FL, 2nd edition, April 2004.
[9]
B. Kang and K. Jung. Robust and efficient locality sensitive hashing for nearest neighbor search in large data sets, 2012.
[10]
Y. Koren, R. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8):30--37, Aug. 2009.
[11]
D. D. Lee and H. S. Seung. Algorithms for non-negative matrix factorization. In NIPS, pages 556--562. MIT Press, 2000.
[12]
C.-J. Lin. Projected gradient methods for nonnegative matrix factorization. Neural Comput., 19(10):2756--2779, Oct. 2007.
[13]
G. Linden, B. Smith, and J. York. Amazon. com recommendations: Item-to-item collaborative filtering. Internet Computing, IEEE, 7(1):76--80, 2003.
[14]
W. Lu, Y. Shen, S. Chen, and B. C. Ooi. Efficient processing of k nearest neighbor joins using mapreduce. Proc. VLDB Endow., 5(10):1016--1027, June 2012.
[15]
D. Oard and J. Kim. Implicit feedback for recommender systems. In in Proceedings of the AAAI Workshop on Recommender Systems, pages 81--83, 1998.
[16]
L. Peska and P. Vojtas. Negative implicit feedback in e-commerce recommender systems. In Proceedings of the 3rd International Conference on Web Intelligence, Mining and Semantics, WIMS '13, pages 45:1--45:4, New York, NY, USA, 2013.
[17]
P. Ram and A. G. Gray. Maximum inner-product search using cone trees. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '12, pages 931--939, New York, NY, USA, 2012.
[18]
F. Ricci, L. Rokach, B. Shapira, and P. B. Kantor, editors. Recommender Systems Handbook. Springer, 2011.
[19]
J. B. Schafer, J. Konstan, and J. Riedl. Recommender systems in e-commerce. In Proceedings of the 1st ACM conference on Electronic commerce, pages 158--166. ACM, 1999.
[20]
S. Schroed, A. Kesari, and L. Neumeyer. Personalized ad placement in web search. In Proceedings of ADKDD'10, ADKDD'10, 2010.
[21]
G. Shakhnarovich, T. Darrell, and P. Indyk. Nearest-Neighbor Methods in Learning and Vision: Theory and Practice. The MIT Press, 2006.
[22]
M. Shatnawi and N. Mohamed. Statistical techniques for online personalized advertising: A survey. In Proceedings of the 27th Annual ACM Symposium on Applied Computing, SAC '12, pages 680--687, New York, NY, USA, 2012. ACM.
[23]
Y. Shi, A. Karatzoglou, L. Baltrunas, M. Larson, N. Oliver, and A. Hanjalic. Climf: Learning to maximize reciprocal rank with collaborative less-is-more filtering. In Proceedings of the Sixth ACM Conference on Recommender Systems, RecSys '12, pages 139--146, New York, NY, USA, 2012. ACM.
[24]
A. Shrivastava and P. Li. Asymmetric lsh (alsh) for sublinear time maximum inner product search (mips). In Z. Ghahramani, M. Welling, C. Cortes, N. Lawrence, and K. Weinberger, editors, Advances in Neural Information Processing Systems 27, pages 2321--2329. Curran Associates, Inc., 2014.
[25]
J. W. Stamos and H. C. Young. A symmetric fragment and replicate algorithm for distributed joins. IEEE Trans. Parallel Distrib. Syst., 4(12):1345--1354, 1993.

Cited By

View all
  • (2023)Reverse Maximum Inner Product Search: Formulation, Algorithms, and AnalysisACM Transactions on the Web10.1145/3587215Online publication date: 16-Mar-2023
  • (2021)Norm Adjusted Proximity Graph for Fast Inner Product RetrievalProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining10.1145/3447548.3467412(1552-1560)Online publication date: 14-Aug-2021

Index Terms

  1. Fast and Accurate Maximum Inner Product Recommendations on Map-Reduce

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    WWW '15 Companion: Proceedings of the 24th International Conference on World Wide Web
    May 2015
    1602 pages
    ISBN:9781450334730
    DOI:10.1145/2740908

    Sponsors

    • IW3C2: International World Wide Web Conference Committee

    In-Cooperation

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 18 May 2015

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. map-reduce
    2. maximum inner product search
    3. recommender systems

    Qualifiers

    • Research-article

    Conference

    WWW '15
    Sponsor:
    • IW3C2

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 03 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Reverse Maximum Inner Product Search: Formulation, Algorithms, and AnalysisACM Transactions on the Web10.1145/3587215Online publication date: 16-Mar-2023
    • (2021)Norm Adjusted Proximity Graph for Fast Inner Product RetrievalProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining10.1145/3447548.3467412(1552-1560)Online publication date: 14-Aug-2021

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media