[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2815833.2815837acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesk-capConference Proceedingsconference-collections
research-article

Capturing Researcher Expertise through MeSH Classification

Published: 07 October 2015 Publication History

Abstract

For a large research institution and a broad research discipline such as the life sciences, it is a highly important and very challenging task to capture each researcher's expertise, and to match researchers by expertise to assist in identifying inter-disciplinary collaboration opportunities and in making informed policy decisions. The challenges are multi-dimensional, stemming from the needs to (a) provide thorough coverage of the breadth and depth of the disciplinary areas, (b) develop accurate representation of researcher's expertise, and (c) process large volumes of data efficiently. Medical Subject Headings (MeSH), a comprehensive taxonomy for the life sciences, has been widely used for indexing MEDLINE publications. In this paper, we present a novel framework for capturing and matching research expertise based on knowledge encoded in MeSH. Specifically, (1) we design a novel and effective hybrid MeSH classification algorithm by combining state-of-the-art methods, and (2) using MeSH terms aggregated from a researcher's publications, we design a researcher matching algorithm based on semantic similarity that takes into consideration the structure of the MeSH taxonomy.

References

[1]
On optimization of expertise matching with various constraints. Neurocomputing, 76(1):71 -- 83, 2012.
[2]
G. Adomavicius and A. Tuzhilin. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. Knowledge and Data Engineering, IEEE Transactions on, 17(6):734--749, 2005.
[3]
B. Aljaber, D. Martinez, N. Stokes, and J. Bailey. Improving MeSH classification of biomedical articles using citation contexts. Journal of Biomedical Informatics, 44(5):881--896, 2011.
[4]
A. Aronson, J. Mork, C. Gay, S. Humphrey, and W. Rogers. The NLM Indexing Initiative's Medical Text Indexer. Medinfo, 11(1):268--272, 2004.
[5]
A. Bellogín. Performance prediction in recommender systems: application to the dynamic optimisation of aggregative methods. Master's thesis, Escuela Politécnica Superior, Universidad Autónoma de Madrid, Madrid, Spain, July 2009.
[6]
L. Breiman. Random forests. Mach. Learn., 45(1):5--32, Oct. 2001.
[7]
R. Burke. Hybrid recommender systems: Survey and experiments. User Modeling and User-Adapted, Interaction, 12(4):331--370, Nov. 2002.
[8]
M. Cobo, A. López-Herrera, E. Herrera-Viedma, and F. Herrera. Science mapping software tools: Review, analysis, and cooperative study among tools. Journal of the American Society for Information Science and Technology, 62(7):1382--1402, 2011.
[9]
J. Cohen. Statistical Power-Analysis for the Behavioral Sciences (2nd Edition). 1988.
[10]
H. Fang and C. Zhai. Probabilistic models for expert finding. In Proceedings of the 29th European Conference on IR Research, ECIR'07, pages 418--430, Berlin, Heidelberg, 2007. Springer-Verlag.
[11]
P. Ganesan, H. Garcia-Molina, and J. Widom. Exploiting hierarchical domain structure to compute similarity. ACM Trans. Inf. Syst., 21(1):64--93, Jan. 2003.
[12]
R. Genuer, J.-M. Poggi, and C. Tuleau-Malot. Variable selection using random forests. Pattern Recogn. Lett., 31(14):2225--2236, Oct. 2010.
[13]
M. Huang, A. Névéol, and Z. Lu. Recommending MeSH terms for annotating biomedical articles. Journal of the American Medical Informatics Association, 18(5):660--667, 2011.
[14]
A. Jimeno-Yepes, L. Plaza, J. Mork, A. Aronson, and A. Díaz. MeSH indexing based on automatically generated summaries. BMC Bioinformatics, page 208, 2013.
[15]
Y.-B. Kang, A. Zaslavsky, S. Krishnaswamy, and C. Bartolini. A knowledge-rich similarity measure for improving it incident resolution process. In Proceedings of the 2010 ACM Symposium on Applied Computing, SAC '10, pages 1781--1788, New York, NY, USA, 2010. ACM.
[16]
R. Kavuluru and Z. He. Unsupervised Medical Subject Heading Assignment Using Output Label Co-occurrence Statistics and Semantic Predications. In Natural Language Processing and Information Systems. 2013.
[17]
D. Lindberg, B. Humphreys, and A. McCray. The unified medical language system. Methods of Information in Medicine, 32(4):281--291, 1993.
[18]
S. A. Morris and B. Van der Veer Martens. Mapping research specialties. Annual Review of Information Science and Technology, 42(1):213--295, 2008.
[19]
T. Pedersen, S. V. S. Pakhomov, S. Patwardhan, and C. G. Chute. Measures of semantic similarity and relatedness in the biomedical domain. J. of Biomedical Informatics, 40(3):288--299, June 2007.
[20]
A. Porter and I. Rafols. Is science becoming more interdisciplinary? Measuring and mapping six research fields over time. Scientometrics, 81(3):719--745, Dec. 2009.
[21]
F. Provost. Machine learning from imbalanced data sets 101. Proceedings of the AAAI-2000 Workshop on Imbalanced Data Sets, 2000.
[22]
P. Resnik. Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11:95--130, 1999.
[23]
T. C. Rindflesch and M. Fiszman. The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text. J. of Biomedical Informatics, 36(6):462 -- 477, 2003.
[24]
G. Salton, editor. Automatic text processing. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1988.
[25]
N. Seco, T. Veale, and J. Hayes. An Intrinsic Information Content Metric for Semantic Similarity in WordNet. In Proceedings of European Conference on Artificial Intelligence, pages 1089--1090, 2004.
[26]
G. Tsoumakas, I. Katakis, and I. Vlahavas. Random k-labelsets for multilabel classification. IEEE Trans. on Knowl. and Data Eng., 23(7):1079--1089, July 2011.
[27]
V. Vasuki and T. Cohen. Reflective random indexing for semi-automatic indexing of the biomedical literature. Journal of Biomedical Informatics, 43(5):694--700, Oct. 2010.
[28]
M. Wahle, D. Widdows, J. R. Herskovic, E. V. Bernstam, and T. Cohen. Deterministic Binary Vectors for Efficient Automated Indexing of MEDLINE/ Abstracts. Proceedings of AMIA Symposium, 2012:940--949, 2012.
[29]
X. Wei and W. B. Croft. Lda-based document models for ad-hoc retrieval. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR '06, pages 178--185, New York, NY, USA, 2006. ACM.

Cited By

View all
  • (2017)On Using Disparate Scholarly Data to Identify Potential Members for Interdisciplinary Research Groups2017 IEEE International Conference on Information Reuse and Integration (IRI)10.1109/IRI.2017.33(59-68)Online publication date: 4-Aug-2017

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
K-CAP '15: Proceedings of the 8th International Conference on Knowledge Capture
October 2015
209 pages
ISBN:9781450338493
DOI:10.1145/2815833
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 October 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Expertise matching
  2. MeSH
  3. Researcher profile

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

K-CAP 2015
K-CAP 2015: Knowledge Capture Conference
October 7 - 10, 2015
NY, Palisades, USA

Acceptance Rates

K-CAP '15 Paper Acceptance Rate 16 of 56 submissions, 29%;
Overall Acceptance Rate 55 of 198 submissions, 28%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2017)On Using Disparate Scholarly Data to Identify Potential Members for Interdisciplinary Research Groups2017 IEEE International Conference on Information Reuse and Integration (IRI)10.1109/IRI.2017.33(59-68)Online publication date: 4-Aug-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media