[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1183614.1183671acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
Article

Voting for candidates: adapting data fusion techniques for an expert search task

Published: 06 November 2006 Publication History

Abstract

In an expert search task, the users' need is to identify people who have relevant expertise to a topic of interest. An expert search system predicts and ranks the expertise of a set of candidate persons with respect to the users' query. In this paper, we propose a novel approach for predicting and ranking candidate expertise with respect to a query. We see the problem of ranking experts as a voting problem, which we model by adapting eleven data fusion techniques.We investigate the effectiveness of the voting approach and the associated data fusion techniques across a range of document weighting models, in the context of the TREC 2005 Enterprise track. The evaluation results show that the voting paradigm is very effective, without using any collection specific heuristics. Moreover, we show that improving the quality of the underlying document representation can significantly improve the retrieval performance of the data fusion techniques on an expert search task. In particular, we demonstrate that applying field-based weighting models improves the ranking of candidates. Finally, we demonstrate that the relative performance of the adapted data fusion techniques for the proposed approach is stable regardless of the used weighting models.

References

[1]
G. Amati. Probabilistic Models for Information Retrieval based on Divergence from Randomness. PhD thesis, University of Glasgow, 2003.
[2]
G. Amati. Frequentist and Bayesian Approach to Information Retrieval. In Proceedings of ECIR 2006, volume 3936 Lecture Notes in Computer Science, pages 13--24, Springer, 2006.
[3]
K. Balog and M. de Rijke. Finding experts and their details in e-mail corpora. In 15th International World Wide Web Conference (WWW2006), Edinburgh, Scotland, 2006.
[4]
J. A. Aslam and M. Montague. Models for metasearch. In Proceedings of ACM SIGIR 2001, pages 276--284, New Orleans LA, 2001.
[5]
C. S. Campbell, P. P. Maglio, A. Cozzi, and B. Dom. Expertise identification using email communications. In Proceedings of ACM CIKM 2003, pages 528--531, New Orleans, LA, 2003.
[6]
N. Craswell, A. P. de Vries, and I. Soboro. Overview of the TREC-2005 Enterprise Track. In Proceedings of TREC-2005, Gaithersburg, MD, 2005.
[7]
N. Craswell, D. Hawking, A.-M. Vercoustre, and P. Wilkins. Panoptic expert: Searching for experts not just for documents. In Ausweb Poster Proceedings, Queensland, Australia, 2001.
[8]
B. Dom, I. Eiron, A. Cozzi, and Y. Zhang. Graph-based ranking algorithms for e-mail expertise analysis. In Proceedings of ACM SIGMOD DMKD Workshop 2003, pages 42--48, San Diego, CA, 2003.
[9]
S. T. Dumais and J. Nielsen. Automating the assignment of submitted manuscripts to reviewers. In Proceedings of ACM SIGIR 1992, pages 233--244, Copenhagen, Denmark, 1992.
[10]
E. A. Fox and J. A. Shaw. Combination of multiple searches. In Proceedings of TREC-2, Gaithersburg, MD, 1994.
[11]
M. Hertzum and A. M. Pejtersen. The information-seeking practises of engineers: searching for documents as well as for people. Inf. Process. Manage., 36(5):761--778, 2000.
[12]
M. G. Kendall. Rank Correlation Methods, 2nd ed. Charles Griffin & Co. Ltd., London WC2, 1955.
[13]
J. M. Kleinberg. Authoritative sources in a hyperlinked environment. J. ACM, 46(5):604--632, 1999.
[14]
J. H. Lee. Analyses of multiple evidence combination. In Proceedings of ACM SIGIR 1997, pages 267--276, Philadelphia, PA, 1997.
[15]
X. Liu, W. B. Croft, and M. Koll. Finding experts in community-based question-answering services. In Proceedings of ACM CIKM 2005, pages 315--316, Bremen, Germany, 2005.
[16]
C. Macdonald, B. He, V. Plachouras, and I. Ounis. University of Glasgow at TREC 2005: Experiments in Terabyte and Enterprise tracks with Terrier. In Proceedings of TREC-2004, Gaithersburg, MD, 2004.
[17]
C. Macdonald and I. Ounis. Searching for expertise using the Terrier platform. In Proceedings of SIGIR 2006, Seattle, WA, 2006.
[18]
C. Macdonald, V. Plachouras, B. He, C. Lioma, and I. Ounis. University of Glasgow at WebCLEF 2005: Experiments in per-field normalisation and language specific stemming. In Proceedings of CLEF Workshop 2005, volume 4022 Lecture Notes in Computer Science, 2006.
[19]
R. Manmatha, T. Rath, and F. Feng. Modelling score distributions for combining the outputs of search engines. In Proceedings of ACM SIGIR 2001, pages 267--275, New Orleans, LA, 2001.
[20]
M. Maybury, R. D'Amore, and D. House. Expert finding for collaborative virtual environments. Commun. ACM, 44(12):55--56, 2001.
[21]
A. McLean, A.-M. Vercoustre, and M. Wu. Enterprise PeopleFinder: Combining Evidence from Web Pages and Corporate Data. In The 8th Australasian Document Computing Conference (ADCS'03), 2003.
[22]
M. Montague and J. A. Aslam. Metasearch consistency. In Proceedings of ACM SIGIR 2001, pages 386--387, New Orleans, LA, 2001.
[23]
M. Montague and J. A. Aslam. Relevance score normalization for metasearch. In Proceedings of ACM CIKM 2001, pages 427--433, Atlanta, GA, 2001.
[24]
M. Montague and J. A. Aslam. Condorcet fusion for improved retrieval. In Proceedings of ACM CIKM 2002, pages 538--548, McLean, VA, 2002.
[25]
P. Ogilvie and J. Callan. Combining document representations for known-item search. In Proceedings of ACM SIGIR 2003, pages 143--150, Toronto, Canada, 2003.
[26]
I. Ounis, G. Amati, Plachouras V., B. He, C. Macdonald, and D. Johnson. Terrier Information Retrieval Platform. In Proceedings of ECIR 2005, volume 3408 Lecture Notes in Computer Science, pages 517--519, Springer, 2005.
[27]
I. Ounis, G. Amati, Plachouras V., B. He, C. Macdonald, and C. Lioma. Terrier: A High Performance and Scalable Information Retrieval Platform. In Proceedings of the OSIR Workshop 2006, pages 18--25, Seattle, WA, 2006.
[28]
V. Plachouras, B. He, and I. Ounis. University of Glasgow at TREC2004: Experiments in Web, Robust and Terabyte tracks with Terrier. In Proceedings of TREC-2004, Gaithersburg, MD, 2004.
[29]
S. Robertson, H. Zaragoza, and M. Taylor. Simple BM25 extension to multiple weighted fields. In Proceedings of ACM CIKM 2004, pages 42--49, Washington, DC, 2004.
[30]
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, M. Gatford, and A. Payne. Okapi at TREC-4. In Proceedings of TREC-4. Gaithersburg, MD, 1995.
[31]
S. E. Robertson, S. Walker, M. Hancock-Beaulieu, A. Gull, and M. Lau. Okapi at TREC. In Proceedings of TREC-1, Gaithersburg, MD, 1992.
[32]
J. Savoy, A. L. Calvé, and D. Vrajitoru. Report on the TREC-5 experiment: data fusion and collection fusion. In Proceedings of TREC-5, Gaithersburg, MD, 1997.
[33]
J. A. Shaw and E. A. Fox. Combination of multiple searches. In Proceedings of TREC-3, Gaithersburg, MD, 1994.
[34]
W. Sihn and F. Heeren. Xpert finder - Expert finding within specified subject areas through analysis of E-mail communication. In Proceedings of Euromedia 2001, pages 279--283, 2001.
[35]
J. Wang, Z. Chen, L. Tao, W.-Y. Ma, and L. Wenyin. Ranking user's relevance to a topic through link analysis on web logs. In Proceedings of WIDM 2002 workshop, pages 49--54, McLean, VA, 2002.
[36]
D. Yimam-Seid and A. Kobsa. Expert finding systems for organizations: Problem and domain analysis and the DEMOIR approach. Journal of Organizational Computing and Electronic Commerce 13(1):1-24, 2003.
[37]
E. Yom-Tov, S. Fine, D. Carmel, and A. Darlow. Learning to estimate query difficulty: including applications to missing content detection and distributed information retrieval. In Proceedings of ACM SIGIR 2005, pages 512--519, Salvador, Brazil, 2005.
[38]
H. Zaragoza, N. Craswell, M. Taylor, S. Saria, and S. Robertson. Microsoft Cambridge at TREC-13: Web and HARD tracks. In Proceedings of TREC-2004, Gaithersburg, MD, 2004.
[39]
M. Zhang, R. Song, C. Lin, S. Ma, Z. Jang, Y. Lin, Y. Liu, and L. Zhao. Expansion-based technologies in finding relevant and new information: THU TREC2002: Novelty Track experiments. In Proceedings of TREC-2002, Gaithersburg, MD, 2002.

Cited By

View all
  • (2024)Estimating Average Vehicle Mileage for Various Vehicle Classes Using Polynomial Models in Deep ClassifiersIEEE Access10.1109/ACCESS.2024.335999012(17404-17418)Online publication date: 2024
  • (2024)State recognition and temperature rise time prediction of tobacco curing using multi-sensor data-fusion method based on feature impact factorExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.121591237:PCOnline publication date: 1-Mar-2024
  • (2024)KnowFIRES: A Knowledge-Graph Framework for Interpreting Retrieved Entities from SearchAdvances in Information Retrieval10.1007/978-3-031-56069-9_15(182-188)Online publication date: 23-Mar-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIKM '06: Proceedings of the 15th ACM international conference on Information and knowledge management
November 2006
916 pages
ISBN:1595934332
DOI:10.1145/1183614
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data fusion
  2. expert finding
  3. expert search
  4. expertise modelling
  5. information retrieval
  6. ranking
  7. voting

Qualifiers

  • Article

Conference

CIKM06
CIKM06: Conference on Information and Knowledge Management
November 6 - 11, 2006
Virginia, Arlington, USA

Acceptance Rates

Overall Acceptance Rate 1,861 of 8,427 submissions, 22%

Upcoming Conference

CIKM '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Estimating Average Vehicle Mileage for Various Vehicle Classes Using Polynomial Models in Deep ClassifiersIEEE Access10.1109/ACCESS.2024.335999012(17404-17418)Online publication date: 2024
  • (2024)State recognition and temperature rise time prediction of tobacco curing using multi-sensor data-fusion method based on feature impact factorExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.121591237:PCOnline publication date: 1-Mar-2024
  • (2024)KnowFIRES: A Knowledge-Graph Framework for Interpreting Retrieved Entities from SearchAdvances in Information Retrieval10.1007/978-3-031-56069-9_15(182-188)Online publication date: 23-Mar-2024
  • (2024)LaQuE: Enabling Entity Search at ScaleAdvances in Information Retrieval10.1007/978-3-031-56060-6_18(270-285)Online publication date: 16-Mar-2024
  • (2023)SE-PEF: a Resource for Personalized Expert FindingProceedings of the Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region10.1145/3624918.3625335(288-309)Online publication date: 26-Nov-2023
  • (2023)Efficient and Effective Academic Expert Finding on Heterogeneous Graphs through (k, 𝒫)-Core based EmbeddingACM Transactions on Knowledge Discovery from Data10.1145/357836517:6(1-35)Online publication date: 22-Mar-2023
  • (2023)Smooth Operators for Effective Systematic Review QueriesProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591768(580-590)Online publication date: 19-Jul-2023
  • (2023)Personalized First Issue Recommender for Newcomers in Open Source ProjectsProceedings of the 38th IEEE/ACM International Conference on Automated Software Engineering10.1109/ASE56229.2023.00158(800-812)Online publication date: 11-Nov-2023
  • (2023)ExpFinder: A hybrid model for expert finding from text-based expertise dataExpert Systems with Applications10.1016/j.eswa.2022.118691211(118691)Online publication date: Jan-2023
  • (2023)An Architecture to Develop an Automated Expert Finding System for Academic EventsProceedings of the International Conference on Paradigms of Computing, Communication and Data Sciences10.1007/978-981-19-8742-7_25(297-306)Online publication date: 24-Feb-2023
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media