More Web Proxy on the site http://driver.im/

research-article

Free access

Global ranking via data fusion

Authors:

Richard Tzong-Han Tsai,

Wen-Lian HsuAuthors Info & Claims

COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics: Posters

Pages 223 - 231

Published: 23 August 2010 Publication History

Abstract

Global ranking, a new information retrieval (IR) technology, uses a ranking model for cases in which there exist relationships between the objects to be ranked. In the ranking task, the ranking model is defined as a function of the properties of the objects as well as the relations between the objects. Existing global ranking approaches address the problem by "learning to rank". In this paper, we propose a global ranking framework that solves the problem via data fusion. The idea is to take each retrieved document as a pseudo-IR system. Each document generates a pseudo-ranked list by a global function. The data fusion algorithm is then adapted to generate the final ranked list. Taking a biomedical information extraction task, namely, interactor normalization task (INT), as an example, we explain how the problem can be formulated as a global ranking problem, and demonstrate how the proposed fusion-based framework outperforms baseline methods. By using the proposed framework, we improve the performance of the top 1 INT system by 3.2% using the official evaluation metric of the BioCreAtIvE challenge. In addition, by employing the standard ranking quality measure, NDCG, we demonstrate that the proposed framework can be cascaded with different local ranking models and improve their ranking results.

References

[1]

Adler, P., R. Kolde, M. Kull, A. Tkachenko, H. Peterson, J. Reimand and J. Vilo (2009). Mining for coexpression across hundreds of datasets using novel rank aggregation and visualization methods. Genome Biology 10(R139).

[2]

Aslam, J. A. and M. Montague (2001). Models for metasearch. Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, New Orleans, Louisiana, United States, ACM.

Digital Library

[3]

Bartell, B. T., G. W. Cottrell and R. K. Belew (1994). Automatic combination of multiple ranked retrieval systems. Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval, Dublin, Ireland Springer-Verlag New York, Inc.

Digital Library

[4]

Borda, J. (1781). Mémoire sur les élections au scrutin. Histoire del'Acad'emie Royale des Sciences 2: 13.

[5]

Chowdhury, G. (2007). TREC: Experiment and Evaluation in Information Retrieval. Online Information Review 31(5): 462.

[6]

Dai, H.-J., P.-T. Lai and R. T.-H. Tsai (2010). Multi-stage gene normalization and SVM-based ranking for protein inter actor extraction in full-text articles. IEEE TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFOPMATICS 14 May. 2010. IEEE computer Society Digital Library. IEEE Computer Society.

Digital Library

[7]

Fox, E. A. and J. A. Shaw (1994). Combination of Multiple Searches. 1994, Proceedings of the Second Text REtrieval Conference (TREC 2)

[8]

Jenssen, T.-K., A. Lagreid, J. Komorowski and E. Hovig (2001). A literature network of human genes for high-throughput analysis of gene expression. Nature Genetics 28(1): 21--28.

[9]

Knaus, D., E. Mittendorf and P. Schäuble (1995). Improving a basic retrieval method by links and passage level evidence. NIST Special Publication 500--225: Overview of the Third Text REtrieval Conference (TREC-3).

[10]

Krallinger, M., F. Leitner and A. Valencia (2009). The BioCreative II.5 challenge overview. Proceedings of the BioCreative II.5 Workshop 2009 on Digital Annotations, Madrid, Spain.

[11]

Kwok, K. L. (1984). A document-document similarity measure based on cited titles and probability theory, and its application to relevance feedback retrieval. SIGIR'84.

Digital Library

[12]

Lee, J. H. (1997). Analyses of multiple evidence combination. Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval, Philadelphia, Pennsylvania, United States, ACM.

Digital Library

[13]

Lin, S. and J. Ding (2008). Integration of Ranked Lists via Cross Entropy Monte Carlo with Applications to mRNA and microRNA Studies. Biometrics 65(1): 9--18.

[14]

Liu, Y.-T., T.-Y. Liu, T. Qin, Z.-M. Ma and H. Li (2007). Supervised rank aggregation. Proceedings of the 16th international conference on World Wide Web, Banff, Alberta, Canada, ACM.

Digital Library

[15]

Mardis, S., F. Leitner and L. Hirschman (2009). BioCreative II.5: Evaluation and ensemble system performance. Proceedings of the BioCreative II.5 Workshop 2009 on Digital Annotations, Madrid, Spain.

[16]

Nuray, R. and F. Can (2006). Automatic ranking of information retrieval systems using data fusion. Inf. Process. Manage. 42(3): 595--614.

Digital Library

[17]

Pihura, V., S. Dattaa and S. Datta (2008). Finding common genes in multiple cancer types through meta-analysis of microarray experiments: A rank aggregation approach Genomics 92(6): 400--403

[18]

Qin, T., T.-Y. Liu, X.-D. Zhang, D.-S. Wang and H. Li (2008). Global Ranking Using Continuous Conditional Random Fields. Proceedings of the Twenty-Second Annual Conference on Neural Information Processing Systems (NIPS 2008), Vancouver, Canada.

Digital Library

[19]

Qin, T., T. Liu, X. Zhang, D. Wang, W. Xiong and H. Li (2008). Learning to rank relational objects and its application to web search, ACM.

[20]

Vogt, C. and G. Cottrell (1999). Fusion via a linear combination of scores. Information Retrieval 1(3): 151--173.

Digital Library

[21]

Vogt, C. C. and G. W. Cottrell (1998). Predicting the performance of linearly combined IR systems. Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval, Melbourne, Australia ACM.

Digital Library

[22]

Zhao, Z., J. Wang, S. Sharma, N. Agarwal, H. Liu and Y. Chang (2010). An Integrative Approach to Identifying Biologically Relevant Genes. Proceedings of SIAM International Conference on Data Mining (SDM).

Index Terms

Global ranking via data fusion
1. Applied computing
  1. Arts and humanities
    1. Language translation
2. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing

Recommendations

Authority and ranking effects in data fusion

This paper provides empirical support for some of the key assumptions guiding the design of data fusion methods. It computes and analyzes the overlap structures between the search results of retrieval systems that participated in the short, long, and ...
Improving recency ranking using twitter data
Special section on twitter and microblogging services, social recommender systems, and CAMRa2010: Movie recommendation in context

In Web search and vertical search, recency ranking refers to retrieving and ranking documents by both relevance and freshness. As impoverished in-links and click information is the the biggest challenge for recency ranking, we advocate the use of ...
Global ranking by exploiting user clicks
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval

It is now widely recognized that user interactions with search results can provide substantial relevance information on the documents displayed in the search results. In this paper, we focus on extracting relevance information from one source of user ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings

COLING '10: Proceedings of the 23rd International Conference on Computational Linguistics: Posters

August 2010

1588 pages

General Chair:
Aravind K. Joshi
University of Pennsylvania
,
Program Chairs:
Chu-Ren Huang
The Hong Kong Polytechnic University
,
Dan Jurafsky
Stanford University

Publisher

Association for Computational Linguistics

United States

Publication History

Published: 23 August 2010

Qualifiers

Research-article

Acceptance Rates

Overall Acceptance Rate 1,537 of 1,537 submissions, 100%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
185
Total Downloads

Downloads (Last 12 months)57
Downloads (Last 6 weeks)12

Reflects downloads up to 29 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Figures

Tables

Media

View Table of Conten