[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2517288.2517291acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

KDD Cup 2013 - author-paper identification challenge: second place team

Published: 11 August 2013 Publication History

Abstract

This paper describes our submission to the KDD Cup 2013 Track 1 Challenge: Author-Paper Indentification in the Microsoft Academic Search database. Our approach is based on Gradient Boosting Machine (GBM) of Friedman ([5]) and deep feature engineering. The method was second in the final standings with Mean Average Precision (MAP) of 0.98144, while the winning submission scored 0.98259.

References

[1]
C. J. C. Burges, R. Ragno, and Q. Le. Learning to Rank with Nonsmooth Cost Functions. In NIPS, pages 193--200, 2006.
[2]
M. Diez, A. Varona, M. Penagarikano, L. Rodriguez-Fuentes, and G. Bordel. On the use of phone log-likelihood ratios as features in spoken language recognition. In Spoken Language Technology Workshop (SLT), 2012 IEEE, pages 274--279, 2012.
[3]
Y. Freund, R. Iyer, R. E. Shapire, and Y. Singer. An efficient boosting algorithm for combining preferences. Journal of Machine Learning Research, 4:933--969, 2003.
[4]
Y. Freund and R. Schapire. A decision-theoretic generalization of online learning and an application to boosting. J. Comput. System Sciences, 55:119--139, 1997.
[5]
J. Friedman. Greedy function approximation: a gradient boosting machine. The Annals of Statistics, 29:1189--1232, 2001.
[6]
A. Galecki and T. Burzykowski. Linear Mixed-Effects Models Using R. Springer, 2013.
[7]
Q. Wu, C. J. C. Burges, K. M. Svore, and J. Gao. Ranking, boosting, and model adaptation, 2008.
[8]
G. Salton and C. Buckley. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5):513--523, 1988.
[9]
S. B. Roy, M. Cock, V. Mandava, B. Dalessandro, C. Perlich, W. Cukierski, and B. Hamner. The microsoft academic search dataset and kdd cup 2013, 2013.
[10]
S. Wang, H. Chen, and X. Yao. Negative correlation learning for classification ensembles. In WCCI 2010, Barcelona, Spain, pages 2893--2900. IEEE, 2010.

Cited By

View all
  • (2022)Blackmarket-Driven Collusion on Online Media: A SurveyACM/IMS Transactions on Data Science10.1145/35179312:4(1-37)Online publication date: 17-May-2022
  • (2022)CONNA: Addressing Name Disambiguation on the FlyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.302125634:7(3139-3152)Online publication date: 1-Jul-2022
  • (2021)DropMonitorProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/34634965:2(1-22)Online publication date: 24-Jun-2021
  • Show More Cited By

Index Terms

  1. KDD Cup 2013 - author-paper identification challenge: second place team

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      KDD Cup '13: Proceedings of the 2013 KDD Cup 2013 Workshop
      August 2013
      69 pages
      ISBN:9781450324953
      DOI:10.1145/2517288
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 11 August 2013

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. algorithms
      2. collaborative filtering
      3. cross-validation
      4. decision trees
      5. ensembling
      6. feature engineering

      Qualifiers

      • Research-article

      Conference

      KDD' 13
      Sponsor:

      Upcoming Conference

      KDD '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 04 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Blackmarket-Driven Collusion on Online Media: A SurveyACM/IMS Transactions on Data Science10.1145/35179312:4(1-37)Online publication date: 17-May-2022
      • (2022)CONNA: Addressing Name Disambiguation on the FlyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2020.302125634:7(3139-3152)Online publication date: 1-Jul-2022
      • (2021)DropMonitorProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/34634965:2(1-22)Online publication date: 24-Jun-2021
      • (2021)AmbientBreathProceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies10.1145/34634935:2(1-30)Online publication date: 24-Jun-2021
      • (2021)Fine-Grained Visual Textual Alignment for Cross-Modal Retrieval Using Transformer EncodersACM Transactions on Multimedia Computing, Communications, and Applications10.1145/345139017:4(1-23)Online publication date: 12-Nov-2021
      • (2021)Inductive Contextual Relation Learning for PersonalizationACM Transactions on Information Systems10.1145/345035339:3(1-22)Online publication date: 25-May-2021
      • (2021)Predicting Performance Anomalies in Software Systems at Run-timeACM Transactions on Software Engineering and Methodology10.1145/344075730:3(1-33)Online publication date: 23-Apr-2021
      • (2021)A Hybrid Approach to Formal Verification of Higher-Order Masked Arithmetic ProgramsACM Transactions on Software Engineering and Methodology10.1145/342801530:3(1-42)Online publication date: 11-Feb-2021
      • (2019)Task-Guided Pair Embedding in Heterogeneous NetworkProceedings of the 28th ACM International Conference on Information and Knowledge Management10.1145/3357384.3357982(489-498)Online publication date: 3-Nov-2019
      • (2018)Task-guided and semantic-aware ranking for academic author-paper correlation inferenceProceedings of the 27th International Joint Conference on Artificial Intelligence10.5555/3304222.3304274(3641-3647)Online publication date: 13-Jul-2018
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media