More Web Proxy on the site http://driver.im/

research-article

Tag completion based on belief theory and neighbor voting

Authors:

Hervé Le Borgne,

Céline HudelotAuthors Info & Claims

ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval

Pages 49 - 56

https://doi.org/10.1145/2461466.2461476

Published: 16 April 2013 Publication History

Abstract

We address the problem of tag completion for automatic image annotation. Our method consists in two main steps: creating a list of "candidate tags" from the visual neighbors of the untagged image then using them as pieces of evidence to be combined to provide the final list of predicted tags. Both steps introduce a scheme to tackle with imprecision and uncertainty. First, a bag-of-words (BOW) signature is generated for each neighbor using local soft coding. Second, a sum-pooling operation across the BOW of the k nearest neighbors provides the list of "candidate tags". Finally, we use neighbors as pieces of evidence to be combined according to the Dempster's rule to predict the more relevant tags. The method is evaluated in the context of image classification and that of tag suggestion. The database used for visual neighbors search contains 1.2 million images extracted from Flickr. Classification is evaluated on the well known Pascal VOC 2007 and MIR Flickr datasets, on which we obtain similar or better results than the state-of-the-art. For tag suggestion, we manually annotated 241 queries. As well, we obtain competitive results on this task.

References

[1]

M. Ames and M. Naaman. Why we tag: motivations for annotation in mobile and online media. In Proceedings of the SIGCHI, pages 971--980, New York, NY, USA, 2007. ACM.

Digital Library

[2]

S. Ayache and G. Quénot. Evaluation of active learning strategies for video indexing. Journal of Image Communication, 22(7--8):692--704, Aug. 2007.

Digital Library

[3]

A. Binder, W. Samek, M. Kloft, C. Müller, K.-R. Müller, and M. Kawanabe. The Joint Submission of the TU Berlin and Fraunhofer FIRST (TUBFI) to the ImageCLEF2011 Photo Annotation Task. In CLEF (Notebook Papers/Labs/Workshop), 2011.

[4]

T. Denoeux. A k-nearest neighbor classification rule based on dempster-shafer theory. IEEE Transaction on systems, man and cybernetics, 25:804--813, 1995.

[5]

P. Duygulu, K. Barnard, J. F. G. d. Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. ECCV '02, pages 97--112, London, UK, UK, 2002. Springer-Verlag.

Digital Library

[6]

M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results.

[7]

M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation. ICCV'09, pages 309--316, Kyoto, Japon, Sept. 2009. IEEE Computer society.

[8]

M. Guillaumin, J. Verbeek, and C. Schmid. Multimodal semi-supervised learning for image classification. CVPR '10, pages 902--909, 2010.

[9]

J. Huang, S. R. Kumar, M. Mitra, W.-J. Zhu, and R. Zabih. Image indexing using color correlograms. CVPR '97, Washington, DC, USA, 1997. IEEE Computer Society.

Digital Library

[10]

M. J. Huiskes and M. S. Lew. The MIR flickr retrieval evaluation. In ACM international conference on Multimedia information retrieval (ICMR), pages 39--43, 2008.

Digital Library

[11]

H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, and C. Schmid. Aggregating local image descriptors into compact codes. IEEE Transactions on Pattern Analysis and Machine Intelligence, Sept. 2012.

Digital Library

[12]

L. S. Kennedy, S. fu Chang, and I. V. Kozintsev. To search or to label?: predicting the performance of search-based automatic image classifiers. MIR '06, pages 249--258, 2006.

Digital Library

[13]

J. Li and J. Z. Wang. Real-time computerized annotation of pictures. IEEE Trans. Pattern Anal. Mach. Intell., 30(6):985--1002, June 2008.

Digital Library

[14]

X. Li, C. G. M. Snoek, and M. Worring. Learning social tag relevance by neighbor voting. IEEE Transactions on Multimedia, 11(7):1310--1322, November 2009.

Digital Library

[15]

L. Liu, L. Wang, and X. Liu. In Defense of Soft-assignment Coding. ICCV '11, 2011.

[16]

A. Makadia, V. Pavlovic, and S. Kumar. Baselines for image annotation. International Journal of Computer Vision, 90(1):88--105, 2010.

Digital Library

[17]

F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. Proceedings of the eleventh ACM international conference on Multimedia, pages 275--278, New York, NY, USA, 2003. ACM.

Digital Library

[18]

A. Popescu and G. Grefenstette. Social media driven image retrieval. In ACM International Conference on Multimedia Retrieval (ICMR), pages 33:1--33:8, 2011.

Digital Library

[19]

G. Shafer. A Mathematical Theory of Evidence. Princeton University Press, Princeton, 1976.

[20]

B. Sigurbjörnsson and R. van Zwol. Flickr tag recommendation based on collective knowledge. WWW '08, pages 327--336, New York, NY, USA, 2008. ACM.

Digital Library

[21]

J. Tang, R. Hong, S. Yan, T.-S. Chua, G.-J. Qi, and R. Jain. Image annotation by knn-sparse graph-based label propagation over noisily tagged web images. ACM Transactions on Intelligent Systems and Technology (TIST), 2(2):14, 2011.

Digital Library

[22]

A. Torralba, R. Fergus, and W. T. Freeman. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell., 30(11):1958--1970, Nov. 2008.

Digital Library

[23]

C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Scalable search-based image annotation. Multimedia Syst., 14(4):205--220, 2008.

Digital Library

[24]

G. Wang, D. Hoiem, and D. A. Forsyth. Building text features for object image classification. In CVPR, pages 1367--1374, 2009.

[25]

L. Wu, R. Jin, and A. K. Jain. Tag completion for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 99(PrePrints), 2012.

Digital Library

[26]

H. Yu, M. Li, H.-J. Zhang, and J. Feng. Color texture moments for content-based image retrieval. ICIP '10, pages 24--28, 2003.

[27]

A. Znaidia, A. Shabou, A. Popescu, H. Le Borgne, and C. Hudelot. Multimodal feature generation framework for semantic image classification. In ICMR, International Conference on Multimedia Retrieval, ICMR '12, Hong Kong, China, June 5--8, 2012, page 38, 2012.

Digital Library

Cited By

Li ZTang J(2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
https://doi.org/10.1360/TB-2022-0938
Du XLiu QLi ZQin ZTang J(2019)Cauchy Matrix Factorization for Tag-Based Social Image RetrievalIEEE Access10.1109/ACCESS.2019.29405987(132302-132310)Online publication date: 2019
https://doi.org/10.1109/ACCESS.2019.2940598
Xu CDai YLin RWang S(2019)Stacked Autoencoder Based Weak Supervision for Social Image UnderstandingIEEE Access10.1109/ACCESS.2019.28989917(21777-21786)Online publication date: 2019
https://doi.org/10.1109/ACCESS.2019.2898991
Show More Cited By

Index Terms

Tag completion based on belief theory and neighbor voting
1. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Recommendations

Tag suggestion and localization in user-generated videos based on social knowledge
WSM '10: Proceedings of second ACM SIGMM workshop on Social media

Nowadays, almost any web site that provides means for sharing user-generated multimedia content, like Flickr, Facebook, YouTube and Vimeo, has tagging functionalities to let users annotate the material that they want to share. The tags are then used to ...
Tag suggestion using visual content and social tag
ICUIMC '11: Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication

With the popularity of social media sharing sites such as Flickr or YouTube, tagging has become a more important task to describe the content of the multimedia object. Recently, automatic tagging or tag recommendation has studied to automatically provide ...
Tag Completion for Image Retrieval

Many social image search engines are based on keyword/tag matching. This is because tag-based image retrieval (TBIR) is not only efficient but also effective. The performance of TBIR is highly dependent on the availability and quality of manual tags. ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval

April 2013

362 pages

ISBN:9781450320337

DOI:10.1145/2461466

General Chairs:
Ramesh Jain
University of California, Irvine, USA
,
Balakrisknan Prabhakaran
University of Texas at Dallas, USA
,
Program Chairs:
Marcel Worring
University of Amsterdam, The Netherlands
,
John Smith
IBM Research, New York, USA
,
Tat-Seng Chua
National University of Singapore

Copyright © 2013 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 April 2013

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

ICMR'13

Sponsor:

SIGMM

ICMR'13: International Conference on Multimedia Retrieval

April 16 - 20, 2013

Texas, Dallas, USA

Acceptance Rates

ICMR '13 Paper Acceptance Rate 38 of 96 submissions, 40%;

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

10
Total Citations
View Citations
229
Total Downloads

Downloads (Last 12 months)3
Downloads (Last 6 weeks)1

Reflects downloads up to 18 Dec 2024

Other Metrics

View Author Metrics

Citations

Cited By

Li ZTang J(2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
https://doi.org/10.1360/TB-2022-0938
Du XLiu QLi ZQin ZTang J(2019)Cauchy Matrix Factorization for Tag-Based Social Image RetrievalIEEE Access10.1109/ACCESS.2019.29405987(132302-132310)Online publication date: 2019
https://doi.org/10.1109/ACCESS.2019.2940598
Xu CDai YLin RWang S(2019)Stacked Autoencoder Based Weak Supervision for Social Image UnderstandingIEEE Access10.1109/ACCESS.2019.28989917(21777-21786)Online publication date: 2019
https://doi.org/10.1109/ACCESS.2019.2898991
Li ZTang JSingh SMarkovitch S(2017)Weakly-supervised deep nonnegative low-rank model for social image tag refinement and assignmentProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298023.3298171(4154-4160)Online publication date: 4-Feb-2017
https://dl.acm.org/doi/10.5555/3298023.3298171
Li ZTang J(2017)Weakly Supervised Deep Matrix Factorization for Social Image UnderstandingIEEE Transactions on Image Processing10.1109/TIP.2016.262414026:1(276-288)Online publication date: 1-Jan-2017
https://dl.acm.org/doi/10.1109/TIP.2016.2624140
Li XUricchio TBallan LBertini MSnoek CBimbo A(2016)Socializing the Semantic GapACM Computing Surveys10.1145/290615249:1(1-39)Online publication date: 6-Jun-2016
https://dl.acm.org/doi/10.1145/2906152
Yang XYang F(2016)Completing tags by local learningNeural Computing and Applications10.1007/s00521-015-1983-z27:8(2407-2416)Online publication date: 1-Nov-2016
https://dl.acm.org/doi/10.1007/s00521-015-1983-z
Xia ZFeng XPeng JWu JFan J(2015)A regularized optimization framework for tag completion and image retrievalNeurocomputing10.1016/j.neucom.2014.06.028147(500-508)Online publication date: Jan-2015
https://doi.org/10.1016/j.neucom.2014.06.028
Guisado-Gámez JDominguez-Sal DLarriba-Pey JKankanhalli MRueger SManmatha RJose Jvan Rijsbergen K(2014)Massive Query Expansion by Exploiting Graph Knowledge Bases for Image RetrievalProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578737(33-40)Online publication date: 1-Apr-2014
https://dl.acm.org/doi/10.1145/2578726.2578737
Ballan LUricchio TSeidenari LDel Bimbo AKankanhalli MRueger SManmatha RJose Jvan Rijsbergen K(2014)A Cross-media Model for Automatic Image AnnotationProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578728(73-80)Online publication date: 1-Apr-2014
https://dl.acm.org/doi/10.1145/2578726.2578728

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents