[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.5555/1944566.1944592dlproceedingsArticle/Chapter ViewAbstractPublication PagescolingConference Proceedingsconference-collections
research-article
Free access

Global ranking via data fusion

Published: 23 August 2010 Publication History

Abstract

Global ranking, a new information retrieval (IR) technology, uses a ranking model for cases in which there exist relationships between the objects to be ranked. In the ranking task, the ranking model is defined as a function of the properties of the objects as well as the relations between the objects. Existing global ranking approaches address the problem by "learning to rank". In this paper, we propose a global ranking framework that solves the problem via data fusion. The idea is to take each retrieved document as a pseudo-IR system. Each document generates a pseudo-ranked list by a global function. The data fusion algorithm is then adapted to generate the final ranked list. Taking a biomedical information extraction task, namely, interactor normalization task (INT), as an example, we explain how the problem can be formulated as a global ranking problem, and demonstrate how the proposed fusion-based framework outperforms baseline methods. By using the proposed framework, we improve the performance of the top 1 INT system by 3.2% using the official evaluation metric of the BioCreAtIvE challenge. In addition, by employing the standard ranking quality measure, NDCG, we demonstrate that the proposed framework can be cascaded with different local ranking models and improve their ranking results.

References

[1]
Adler, P., R. Kolde, M. Kull, A. Tkachenko, H. Peterson, J. Reimand and J. Vilo (2009). Mining for coexpression across hundreds of datasets using novel rank aggregation and visualization methods. Genome Biology 10(R139).
[2]
Aslam, J. A. and M. Montague (2001). Models for metasearch. Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, New Orleans, Louisiana, United States, ACM.
[3]
Bartell, B. T., G. W. Cottrell and R. K. Belew (1994). Automatic combination of multiple ranked retrieval systems. Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, Dublin, Ireland Springer-Verlag New York, Inc.
[4]
Borda, J. (1781). Mémoire sur les élections au scrutin. Histoire del'Acad'emie Royale des Sciences 2: 13.
[5]
Chowdhury, G. (2007). TREC: Experiment and Evaluation in Information Retrieval. Online Information Review 31(5): 462.
[6]
Dai, H.-J., P.-T. Lai and R. T.-H. Tsai (2010). Multi-stage gene normalization and SVM-based ranking for protein inter actor extraction in full-text articles. IEEE TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFOPMATICS 14 May. 2010. IEEE computer Society Digital Library. IEEE Computer Society.
[7]
Fox, E. A. and J. A. Shaw (1994). Combination of Multiple Searches. 1994, Proceedings of the Second Text REtrieval Conference (TREC 2)
[8]
Jenssen, T.-K., A. Lagreid, J. Komorowski and E. Hovig (2001). A literature network of human genes for high-throughput analysis of gene expression. Nature Genetics 28(1): 21--28.
[9]
Knaus, D., E. Mittendorf and P. Schäuble (1995). Improving a basic retrieval method by links and passage level evidence. NIST Special Publication 500--225: Overview of the Third Text REtrieval Conference (TREC-3).
[10]
Krallinger, M., F. Leitner and A. Valencia (2009). The BioCreative II.5 challenge overview. Proceedings of the BioCreative II.5 Workshop 2009 on Digital Annotations, Madrid, Spain.
[11]
Kwok, K. L. (1984). A document-document similarity measure based on cited titles and probability theory, and its application to relevance feedback retrieval. SIGIR'84.
[12]
Lee, J. H. (1997). Analyses of multiple evidence combination. Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval, Philadelphia, Pennsylvania, United States, ACM.
[13]
Lin, S. and J. Ding (2008). Integration of Ranked Lists via Cross Entropy Monte Carlo with Applications to mRNA and microRNA Studies. Biometrics 65(1): 9--18.
[14]
Liu, Y.-T., T.-Y. Liu, T. Qin, Z.-M. Ma and H. Li (2007). Supervised rank aggregation. Proceedings of the 16th international conference on World Wide Web, Banff, Alberta, Canada, ACM.
[15]
Mardis, S., F. Leitner and L. Hirschman (2009). BioCreative II.5: Evaluation and ensemble system performance. Proceedings of the BioCreative II.5 Workshop 2009 on Digital Annotations, Madrid, Spain.
[16]
Nuray, R. and F. Can (2006). Automatic ranking of information retrieval systems using data fusion. Inf. Process. Manage. 42(3): 595--614.
[17]
Pihura, V., S. Dattaa and S. Datta (2008). Finding common genes in multiple cancer types through meta-analysis of microarray experiments: A rank aggregation approach Genomics 92(6): 400--403
[18]
Qin, T., T.-Y. Liu, X.-D. Zhang, D.-S. Wang and H. Li (2008). Global Ranking Using Continuous Conditional Random Fields. Proceedings of the Twenty-Second Annual Conference on Neural Information Processing Systems (NIPS 2008), Vancouver, Canada.
[19]
Qin, T., T. Liu, X. Zhang, D. Wang, W. Xiong and H. Li (2008). Learning to rank relational objects and its application to web search, ACM.
[20]
Vogt, C. and G. Cottrell (1999). Fusion via a linear combination of scores. Information Retrieval 1(3): 151--173.
[21]
Vogt, C. C. and G. W. Cottrell (1998). Predicting the performance of linearly combined IR systems. Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, Melbourne, Australia ACM.
[22]
Zhao, Z., J. Wang, S. Sharma, N. Agarwal, H. Liu and Y. Chang (2010). An Integrative Approach to Identifying Biologically Relevant Genes. Proceedings of SIAM International Conference on Data Mining (SDM).

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics: Posters
August 2010
1588 pages

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 23 August 2010

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 1,537 of 1,537 submissions, 100%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 185
    Total Downloads
  • Downloads (Last 12 months)57
  • Downloads (Last 6 weeks)12
Reflects downloads up to 29 Jan 2025

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media