[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

Topic modeling for expert finding using latent Dirichlet allocation

Published: 01 September 2013 Publication History

Abstract

The task of expert finding is to rank the experts in the search space given a field of expertise as an input query. In this paper, we propose a topic modeling approach for this task. The proposed model uses latent Dirichlet allocation LDA to induce probabilistic topics. In the first step of our algorithm, the main topics of a document collection are extracted using LDA. The extracted topics present the connection between expert candidates and user queries. In the second step, the topics are used as a bridge to find the probability of selecting each candidate for a given query. The candidates are then ranked based on these probabilities. The experimental results on the Text REtrieval Conference TREC Enterprise track for 2005 and 2006 show that the proposed topic-based approach outperforms the state-of-the-art profile- and document-based models, which use information retrieval methods to rank experts. Moreover, we present the superiority of the proposed topic-based approach to the improved document-based expert finding systems, which consider additional information such as local context, candidate prior, and query expansion.

References

[1]
Zhu H, Chen E, Cao H. Finding experts in tag based knowledge sharing communities. In: Xiong H, LeeW, eds. Knowledge Science, Engineering and Management. Lecture Notes in Computer Science, volume 7091. Berlin: Springer; 2011, pp.183-195.
[2]
Balog K, Azzopardi L, de Rijke M. A Language modeling framework for expert finding. Inform Process Manag 2008, Volume 45: pp.1-19.
[3]
Craswell N, Vries APD, Soboroff I. Overview of the TREC-2005 Enterprise Track. In: Proceedings of the Text REtrieval Conference TREC. Gaithersburg, MD; 2006.
[4]
Soboroff I, Vries APD, Craswell N. Overview of the TREC 2006 Enterprise Track. In: Proceedings of the Text REtrieval Conference TREC. Gaithersburg, MD; 2007.
[5]
Blei DM, Ng AY, Jordan MI, Lafferty J. Latent Dirichlet allocation. J Mach Learn Res Volume 3, 2003.
[6]
Fang H, Zhai C. Probabilistic models for expert finding. In: Proceedings of European Conference on Information Retrieval ECIR. Berlin: Springer; 2007, pp.418-430.
[7]
Craswell N, Hawking D, Vercoustre A-M, Wilkins P. P@NOPTIC Expert: searching for experts not just for documents. In: Proceedings of the Australian World Wide Web Conference Ausweb. Coffs Harbour, NSW; 2001, pp.21-25.
[8]
Petkova D, Croft WB. Hierarchical language models for expert finding in Enterprise Corpora. In: Proceedings of the IEEE International Conference on Tools With Artificial Intelligence ICTAI. Piscataway, NJ: IEEE; 2006.
[9]
Balog K, de Rijke M. Determining expert profiles with an application to expert finding. In: Proceedings of the International Joint Conference on Artifical Intelligence IJCAI. Hyderabad, India; 2007, pp.2657-2662.
[10]
Mattox D, Maybury M, Morey D. Enterprise expert and knowledge discovery. In: Proceedings of the 8th International Conference on Human-Computer Interaction HCI. Munich, Germany; 1999, pp.303-307.
[11]
Serdyukov P, Hiemstra D. Modeling documents as mixtures of persons for expert finding. In: Proceedings of European Conference on Information Retrieval ECIR. Berlin: Springer; 2008, pp.309-320.
[12]
Balog K, Bogers T, Azzopardi L, de Rijke M, van den Bosch A. Broad expertise retrieval in sparse data environments. In: Proceedings of the International ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM; 2007, pp.551-558.
[13]
Macdonald C, Ounis I. Voting techniques for expert search. Knowl Inform Syst 2008, Volume 16: pp.259-280.
[14]
Rode H, Serdyukov P, Hiemstra D, Zaragoza H. Entity ranking on graphs: studies on expert finding. Technical Report TR-CTIT-07-81, University of Twente, Enschede, the Netherlands, 2007.
[15]
Li Y, Ma S, Zhang Y, Huang R. Expertise network discovery via topic and link analysis in online communities. In: IEEE 12th International Conference on Advanced Learning Technologies ICALT, 2012; Piscataway, NJ: IEEE; 2012, pp.311-315.
[16]
Wei X, Croft WB. LDA-based document models for ad-hoc retrieval. In: Proceedings of the international ACM SIGIR Conference on Research and Development in Information Retrieval. New York: ACM; 2006, pp.178-185.
[17]
Griffiths TL, Steyvers M. Finding scientific topics. Proc Nat Acad Sci USA 2004, Volume 101: pp.5228-5235.
[18]
Rosen-Zvi M, Chemudugunta C, Griffiths T, Smyth P, Steyvers M. Learning author-topic models from text corpora. ACM Trans Inform Syst 2010, pp.28.
[19]
Linstead E, Rigor P, Bajrachary S, Lopes C, Baldi P. Mining Eclipse developer contributions via author-topic models. In: Fourth International Workshop on Mining Software Repositories MSR. Minneapolis, MN; 2007, pp.30-33.
[20]
Lin C, He Y. Joint sentiment/topic model for sentiment analysis. In: Proceedings of the ACM Conference on Information and Knowledge Management CIKM. New York: ACM; 2009, pp.375-384.
[21]
Chrupała G. Efficient induction of probabilistic word classes with LDA. In: Proceedings of the International Joint Conference on Natural Language Processing IJCNLP. Chiang Mai, Thailand; 2011, pp.363-372.
[22]
Wu H, Pei Y, Yu J. Hidden topic analysis based formal framework for finding experts in metadata corpus. In: Proceedings of the IEEE/ACIS International Conference on Computer and Information Science. Washington, DC: IEEE Computer Society; 2009, pp.369-374.
[23]
Kongthon A, Haruechaiyasak C, Thaiprayoon S. Expert identification for multidisciplinary R&D project collaboration. In: Portland International Conference on Management of Engineering and Technology PICMET. Portland, OR; 2009, pp.1474-1480.
[24]
Jelinek F, Mercer R. Interpolated estimation of Markov source parameters from sparse data. In: Proceedings of the International Workshop on Pattern Recognition in Practice. Amsterdam, The Netherlands; 1980, pp.381-397.
[25]
Zhai C, Lafferty J. A study of smoothing methods for language models applied to information retrieval. ACM Trans Inform Syst 2004, Volume 22 Issue 2: pp.179-214.
[26]
Zhu J. W3C corpus annotated with W3C people identity, 2006, "http://ir.nist.gov/w3c/contrib/W3Ctagged.html". Accessed 2012.

Cited By

View all
  • (2024)It Takes a Team to Triumph: Collaborative Expert Finding in Community QA NetworksProceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3673791.3698404(164-174)Online publication date: 8-Dec-2024
  • (2024)A semantic modular framework for events topic modeling in social mediaMultimedia Tools and Applications10.1007/s11042-023-15745-883:4(10755-10778)Online publication date: 1-Jan-2024
  • (2023)Efficient and Effective Academic Expert Finding on Heterogeneous Graphs through (k, 𝒫)-Core based EmbeddingACM Transactions on Knowledge Discovery from Data10.1145/357836517:6(1-35)Online publication date: 22-Mar-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery  Volume 3, Issue 5
September 2013
47 pages

Publisher

John Wiley & Sons, Inc.

United States

Publication History

Published: 01 September 2013

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)It Takes a Team to Triumph: Collaborative Expert Finding in Community QA NetworksProceedings of the 2024 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3673791.3698404(164-174)Online publication date: 8-Dec-2024
  • (2024)A semantic modular framework for events topic modeling in social mediaMultimedia Tools and Applications10.1007/s11042-023-15745-883:4(10755-10778)Online publication date: 1-Jan-2024
  • (2023)Efficient and Effective Academic Expert Finding on Heterogeneous Graphs through (k, 𝒫)-Core based EmbeddingACM Transactions on Knowledge Discovery from Data10.1145/357836517:6(1-35)Online publication date: 22-Mar-2023
  • (2022)Domain expertise extraction for finding rising starsScientometrics10.1007/s11192-022-04492-6127:9(5475-5495)Online publication date: 1-Sep-2022
  • (2021)User Embedding for Expert Finding in Community Question AnsweringACM Transactions on Knowledge Discovery from Data10.1145/344130215:4(1-16)Online publication date: 26-Mar-2021
  • (2021)LDA-based term profiles for expert finding in a political settingJournal of Intelligent Information Systems10.1007/s10844-021-00636-x56:3(529-559)Online publication date: 1-Jun-2021
  • (2020)Expert finding in community question answering: a reviewArtificial Intelligence Review10.1007/s10462-018-09680-653:2(843-874)Online publication date: 1-Feb-2020
  • (2019)Translations Diversification for Expert FindingACM Transactions on Knowledge Discovery from Data10.1145/332048913:3(1-20)Online publication date: 29-May-2019
  • (2017)Skill Translation Models in Expert FindingProceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3077136.3080719(1057-1060)Online publication date: 7-Aug-2017
  • (2016)Find an ExpertProceedings of the 2016 CHI Conference on Human Factors in Computing Systems10.1145/2858036.2858131(3038-3048)Online publication date: 7-May-2016
  • Show More Cited By

View Options

View options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media