[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/1460096.1460125acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Late fusion of heterogeneous methods for multimedia image retrieval

Published: 30 October 2008 Publication History

Abstract

Late fusion of independent retrieval methods is the simpler approach and a widely used one for combining visual and textual information for the search process. Usually each retrieval method is based on a single modality, or even, when several methods are considered per modality, all of them use the same information for indexing/querying. The latter reduces the diversity and complementariness of documents considered for the fusion, as a consequence the performance of the fusion approach is poor.
In this paper we study the combination of multiple heterogeneous methods for image retrieval in annotated collections. Heterogeneousness is considered in terms of i) the modality in which the methods are based on, ii) in the information they use for indexing/querying and iii) in the individual performance of the methods. Different settings for the fusion are considered including weighted, global, per-modality and hierarchical. We report experimental results, in an image retrieval benchmark, that show that the proposed combination outperforms significantly any of the individual methods we consider. Retrieval performance is comparable to the best performance obtained in the context of ImageCLEF2007. An interesting result is that even methods that perform poor (individually) resulted very useful to the fusion strategy. Furthermore, opposed to work reported in the literature, better results were obtained by assigning a low weight to text-based methods. The main contribution of this paper is experimental, several interesting findings are reported that motivate further research on diverse subjects.

References

[1]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Pearson E. L., 1999.
[2]
R. Besancon and C. Millet. Merging results from different media: Lic2m experiments at imageclef 2005. In Working notes of the CLEF 2005. CLEF.
[3]
Y. Chang and H. Chen. Approaches of using a word-image ontology and an annotated image corpus as intermedia for cross-language image retrieval. In Working Notes of the CLEF. CLEF, 2006.
[4]
P. Clough, M. Grubinger, T. Deselaers, A. Hanbury, and H. Müller. Overview of the imageclef 2007 photographic retrieval task. In CLEF 2007, volume 5152 of LNCS. CLEF, Springer-Verlag, 2008.
[5]
H. J. Escalante and et al. Towards annotation-based query and document expansion for image retrieval. In CLEF 2007, volume 5152 of LNCS, pages 546--553. Springer-Verlag, 2008.
[6]
T. Gass, T. Weyand, T. Deselaers, and H. Ney. Fire in imageclef 2007: Support vector machines and logistic regression to fuse image descriptors in for photo retrieval. volume 5152 of LNCS. Springer-Verlag, 2008.
[7]
A. Goodrum. Image information retrieval: An overview of current research. Journal of Informing Science, 3(2), 2000.
[8]
M. Grubinger, P. Clough, H. Müller, and T. Deselaers. The iapr tc-12 benchmark: A new evaluation resource for visual information systems. In Proc. of the Intl. Workshop OntoImage'2006 Language Resources for CBIR, Genoa, Italy, 2006.
[9]
C. Hernández and L. E. Sucar. Markov random fields and spatial information to improve automatic image annotation. In Proc. of the the 2007 Pacific-Rim Symposium on Image and Video Technology, volume 4872 of LNCS, pages 879--892. Springer, 2007.
[10]
R. Izquierdo-Beviá, D. Tomás, M. Saiz-Noeda, and J. L. Vicedo. University of alicante in imageclef2005. In Working Notes of the CLEF. CLEF, 2005.
[11]
M. M. Rautiainen and T. Seppdnen. Comparison of visual features and fusion techniques in automatic detection of concepts from news video. In Proceedings of the IEEE ICME, pages 932--935, 2005.
[12]
P. Over and A. F. Smeaton., editors. Proc. of the international workshop on TRECVID video summarization., Augsburg, Bavaria, Germany., 2007.
[13]
V. Peinado, F. López-Ostenero, and J. Gonzalo. Uned at imageclef 2005: Automatically structured queries with named entities over metadata. In Working Notes of the CLEF. CLEF, 2005.
[14]
J. L. R. Datta, D. Joshi and J. Z. Wang. Image retrieval: Ideas, in uences, and trends of the new age. ACM Computing Surveys, to appear, 2008.
[15]
M. Rautiainen, T. Ojala, and S. Tapio. Analyzing the performance of visual, concept and text features in content-based video retrieval. In MIR'04: Proc. of the 6th ACM workshop on Multimedia information retrieval, pages 197--204, New York, NY, USA, 2004. ACM Press.
[16]
S. Sclaroff, M. L. Cascia, and S. Sethi. Unifying textual and visual cues for content-based image retrieval on the world wide web. Computer Vision, 75(1/2):86--98, July/August 1999.
[17]
C. Snoek, M. Worring, and A. Smeulders. Early versus late fusion in semantic video analysis. In Proc. of the 13th Annual ACM Conference on Multimedia, pages 399--402, Singapore, 2005. ACM.
[18]
D. Zeimpekis and E. Gallopoulos. Tmg: A matlab toolbox for generating term-document matrices from text collections. In Recent Advances in Clustering, pages 187--210. Springer, 2005.

Cited By

View all
  • (2024)Multimodal Analysis of BioimagesReference Module in Life Sciences10.1016/B978-0-323-95502-7.00213-XOnline publication date: 2024
  • (2023)A deep learning approach for image and text classification using neutrosophyInternational Journal of Information Technology10.1007/s41870-023-01529-816:2(853-859)Online publication date: 13-Oct-2023
  • (2022)Influence of Late Fusion of High-Level Features on User Relevance Feedback for VideosProceedings of the 2nd International Workshop on Interactive Multimedia Retrieval10.1145/3552467.3554795(17-24)Online publication date: 14-Oct-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MIR '08: Proceedings of the 1st ACM international conference on Multimedia information retrieval
October 2008
506 pages
ISBN:9781605583129
DOI:10.1145/1460096
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 October 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. image retrieval
  2. late fusion

Qualifiers

  • Research-article

Conference

MM08
Sponsor:
MM08: ACM Multimedia Conference 2008
October 30 - 31, 2008
British Columbia, Vancouver, Canada

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)6
  • Downloads (Last 6 weeks)0
Reflects downloads up to 11 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Multimodal Analysis of BioimagesReference Module in Life Sciences10.1016/B978-0-323-95502-7.00213-XOnline publication date: 2024
  • (2023)A deep learning approach for image and text classification using neutrosophyInternational Journal of Information Technology10.1007/s41870-023-01529-816:2(853-859)Online publication date: 13-Oct-2023
  • (2022)Influence of Late Fusion of High-Level Features on User Relevance Feedback for VideosProceedings of the 2nd International Workshop on Interactive Multimedia Retrieval10.1145/3552467.3554795(17-24)Online publication date: 14-Oct-2022
  • (2022)Efficient and Effective Multi-Modal Queries Through Heterogeneous Network EmbeddingIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2021.305287134:11(5307-5320)Online publication date: 1-Nov-2022
  • (2020)A Multi-Stream Recurrent Neural Network for Social Role Detection in Multiparty InteractionsIEEE Journal of Selected Topics in Signal Processing10.1109/JSTSP.2020.299239414:3(554-567)Online publication date: Mar-2020
  • (2020)Graph Embeddings for One-pass Processing of Heterogeneous Queries2020 IEEE 36th International Conference on Data Engineering (ICDE)10.1109/ICDE48307.2020.00222(1994-1997)Online publication date: Apr-2020
  • (2020)Convolutional neural networks for relevance feedback in content based image retrievalMultimedia Tools and Applications10.1007/s11042-020-09292-9Online publication date: 21-Jul-2020
  • (2020)Recurrent Image Annotation with Explicit Inter-label DependenciesComputer Vision – ECCV 202010.1007/978-3-030-58526-6_12(191-207)Online publication date: 7-Oct-2020
  • (2018)OpenseaProceedings of the 9th ACM Multimedia Systems Conference10.1145/3204949.3208128(363-368)Online publication date: 12-Jun-2018
  • (2018)Cross-media retrieval based on semi-supervised regularization and correlation learningMultimedia Tools and Applications10.1007/s11042-018-6037-y77:17(22455-22473)Online publication date: 1-Sep-2018
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media