[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1835804.1835834acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Estimating rates of rare events with multiple hierarchies through scalable log-linear models

Published: 25 July 2010 Publication History

Abstract

We consider the problem of estimating rates of rare events for high dimensional, multivariate categorical data where several dimensions are hierarchical. Such problems are routine in several data mining applications including computational advertising, our main focus in this paper. We propose LMMH, a novel log-linear modeling method that scales to massive data applications with billions of training records and several million potential predictors in a map-reduce framework. Our method exploits correlations in aggregates observed at multiple resolutions when working with multiple hierarchies; stable estimates at coarser resolution provide informative prior information to improve estimates at finer resolutions. Other than prediction accuracy and scalability, our method has an inbuilt variable screening procedure based on a "spike and slab prior" that provides parsimony by removing non-informative predictors without hurting predictive accuracy. We perform large scale experiments on data from real computational advertising applications and illustrate our approach on datasets with several billion records and hundreds of millions of predictors. Extensive comparisons with other benchmark methods show significant improvements in prediction accuracy.

Supplementary Material

JPG File (kdd2010_agarwal_erre_01.jpg)
MOV File (kdd2010_agarwal_erre_01.mov)

References

[1]
D. Agarwal, A. Z. Broder, D. Chakrabarti, D. Diklic, V. Josifovski, and M. Sayyadian. Estimating rates of rare events at multiple resolutions. In KDD '07, pages 16--25, 2007.
[2]
D. Agarwal and B.-C. Chen. Regression-based latent factor models. In KDD '09, pages 19--28, 2009.
[3]
A.Gelman and J.Hill. Data Analysis using Regression/Multi-level Hierarchical Models. Cambridge University Press, 2007.
[4]
D. Bates and D. Sarkar. lme4: Linear mixed-effects models using S4 classes, 2007.
[5]
A. Broder. Computational advertising. In SODA '08: Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms, pages 992--992, 2008.
[6]
A. C. Cameron and P. K. Trivedi. Regression Analysis of Count Data. Cambridge University Press, 1998.
[7]
J. Dean and S. Ghemawat. Mapreduce: a flexible data processing tool. Commun. CACM, 53(1):72--77, 2010.
[8]
K. Dembczynski, W.Kotlowski, and D.Weiss. Predicting ads' click-through rate with decision rules. In WWW '08, 2008.
[9]
D.G.Clayton and J.Kaldor. Empirical bayes estimates of age-standardized relative risks for use in disease mapping. Biometrics, 43:671--681, 1987.
[10]
M. Dudik, D. M. Blei, and R. E. Schapire. Hierarchical maximum entropy density estimation. In ICML '07: Proceedings of the 24th international conference on Machine learning, pages 249--256, 2007.
[11]
W. DuMouchel and D. Pregibon. Empirical bayes screening for multi-item associations. In KDD '01, pages 67--76, 2001.
[12]
H.Ishwaran and J. Rao. Spike and slab variable selection: Frequentist and bayesian strategies. Annals of Statistics, 33(2):730--773, 2005.
[13]
J.Besag. On the statistical analysis of dirty pictures. Journal of the Royal Statistical Society, Series B, 48:259--302, 1986.
[14]
O.Papaspiliopoulos, G. O. Roberts, and M. Skold. A general framework for the parametrization of hierarchical models. Statistical Science, 22(1):59--73, 2007.
[15]
M. Richardson, E. Dominowska, and R. Ragno. Predicting clicks: estimating the click-through rate for new ads. In WWW '07, 2007.
[16]
D. H. Stern, R. Herbrich, and T. Graepel. Matchbox: large scale online bayesian recommendations. In WWW '09, pages 111--120, 2009.
[17]
K. Weinberger, A. Dasgupta, J. Langford, A. Smola, and J. Attenberg. Feature hashing for large scale multitask learning. In ICML '09: Proceedings of the 26th Annual International Conference on Machine Learning, pages 1113--1120, 2009.
[18]
L. Zhang and D. Agarwal. Fast computation of posterior mode in multi-level hierarchical models. In NIPS, pages 1913--1920, 2008.

Cited By

View all
  • (2024)Utilizing Non-click Samples via Semi-supervised Learning for Conversion Rate PredictionProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688151(350-359)Online publication date: 8-Oct-2024
  • (2023)Masked Multi-Domain Network: Multi-Type and Multi-Scenario Conversion Rate Prediction with a Single ModelProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614697(4773-4779)Online publication date: 21-Oct-2023
  • (2023)Contrastive Learning for Conversion Rate PredictionProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591968(1909-1913)Online publication date: 19-Jul-2023
  • Show More Cited By

Index Terms

  1. Estimating rates of rare events with multiple hierarchies through scalable log-linear models

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
    July 2010
    1240 pages
    ISBN:9781450300551
    DOI:10.1145/1835804
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 July 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. computational advertising
    2. count data
    3. display advertising
    4. gamma-poisson
    5. spars contingency tables
    6. spike and slab prior

    Qualifiers

    • Research-article

    Conference

    KDD '10
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Upcoming Conference

    KDD '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)35
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 25 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Utilizing Non-click Samples via Semi-supervised Learning for Conversion Rate PredictionProceedings of the 18th ACM Conference on Recommender Systems10.1145/3640457.3688151(350-359)Online publication date: 8-Oct-2024
    • (2023)Masked Multi-Domain Network: Multi-Type and Multi-Scenario Conversion Rate Prediction with a Single ModelProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614697(4773-4779)Online publication date: 21-Oct-2023
    • (2023)Contrastive Learning for Conversion Rate PredictionProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591968(1909-1913)Online publication date: 19-Jul-2023
    • (2023)Audience Prospecting for Dynamic-Product-Ads in Native Advertising2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386796(1571-1580)Online publication date: 15-Dec-2023
    • (2023)Recommending on graphs: a comprehensive review from a data perspectiveUser Modeling and User-Adapted Interaction10.1007/s11257-023-09359-w33:4(803-888)Online publication date: 13-Mar-2023
    • (2021)Conversion Rate Prediction Based on Text Readability Analysis of Landing PagesEntropy10.3390/e2311138823:11(1388)Online publication date: 23-Oct-2021
    • (2021)User Response Prediction in Online AdvertisingACM Computing Surveys10.1145/344666254:3(1-43)Online publication date: 8-May-2021
    • (2021)Dynamically Optimizing Display Advertising Profits under Diverse Budget SettingsIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.3077699(1-1)Online publication date: 2021
    • (2021)Addressing Stability in Classifier Explanations2021 IEEE International Conference on Big Data (Big Data)10.1109/BigData52589.2021.9671458(1920-1927)Online publication date: 15-Dec-2021
    • (2020)Online Display Advertising MarketsInformation Systems Research10.1287/isre.2019.090231:2(556-575)Online publication date: 1-Jun-2020
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media