[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2461466.2461476acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

Tag completion based on belief theory and neighbor voting

Published: 16 April 2013 Publication History

Abstract

We address the problem of tag completion for automatic image annotation. Our method consists in two main steps: creating a list of "candidate tags" from the visual neighbors of the untagged image then using them as pieces of evidence to be combined to provide the final list of predicted tags. Both steps introduce a scheme to tackle with imprecision and uncertainty. First, a bag-of-words (BOW) signature is generated for each neighbor using local soft coding. Second, a sum-pooling operation across the BOW of the k nearest neighbors provides the list of "candidate tags". Finally, we use neighbors as pieces of evidence to be combined according to the Dempster's rule to predict the more relevant tags. The method is evaluated in the context of image classification and that of tag suggestion. The database used for visual neighbors search contains 1.2 million images extracted from Flickr. Classification is evaluated on the well known Pascal VOC 2007 and MIR Flickr datasets, on which we obtain similar or better results than the state-of-the-art. For tag suggestion, we manually annotated 241 queries. As well, we obtain competitive results on this task.

References

[1]
M. Ames and M. Naaman. Why we tag: motivations for annotation in mobile and online media. In Proceedings of the SIGCHI, pages 971--980, New York, NY, USA, 2007. ACM.
[2]
S. Ayache and G. Quénot. Evaluation of active learning strategies for video indexing. Journal of Image Communication, 22(7--8):692--704, Aug. 2007.
[3]
A. Binder, W. Samek, M. Kloft, C. Müller, K.-R. Müller, and M. Kawanabe. The Joint Submission of the TU Berlin and Fraunhofer FIRST (TUBFI) to the ImageCLEF2011 Photo Annotation Task. In CLEF (Notebook Papers/Labs/Workshop), 2011.
[4]
T. Denoeux. A k-nearest neighbor classification rule based on dempster-shafer theory. IEEE Transaction on systems, man and cybernetics, 25:804--813, 1995.
[5]
P. Duygulu, K. Barnard, J. F. G. d. Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. ECCV '02, pages 97--112, London, UK, UK, 2002. Springer-Verlag.
[6]
M. Everingham, L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results.
[7]
M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid. TagProp: discriminative metric learning in nearest neighbor models for image auto-annotation. ICCV'09, pages 309--316, Kyoto, Japon, Sept. 2009. IEEE Computer society.
[8]
M. Guillaumin, J. Verbeek, and C. Schmid. Multimodal semi-supervised learning for image classification. CVPR '10, pages 902--909, 2010.
[9]
J. Huang, S. R. Kumar, M. Mitra, W.-J. Zhu, and R. Zabih. Image indexing using color correlograms. CVPR '97, Washington, DC, USA, 1997. IEEE Computer Society.
[10]
M. J. Huiskes and M. S. Lew. The MIR flickr retrieval evaluation. In ACM international conference on Multimedia information retrieval (ICMR), pages 39--43, 2008.
[11]
H. Jégou, F. Perronnin, M. Douze, J. Sánchez, P. Pérez, and C. Schmid. Aggregating local image descriptors into compact codes. IEEE Transactions on Pattern Analysis and Machine Intelligence, Sept. 2012.
[12]
L. S. Kennedy, S. fu Chang, and I. V. Kozintsev. To search or to label?: predicting the performance of search-based automatic image classifiers. MIR '06, pages 249--258, 2006.
[13]
J. Li and J. Z. Wang. Real-time computerized annotation of pictures. IEEE Trans. Pattern Anal. Mach. Intell., 30(6):985--1002, June 2008.
[14]
X. Li, C. G. M. Snoek, and M. Worring. Learning social tag relevance by neighbor voting. IEEE Transactions on Multimedia, 11(7):1310--1322, November 2009.
[15]
L. Liu, L. Wang, and X. Liu. In Defense of Soft-assignment Coding. ICCV '11, 2011.
[16]
A. Makadia, V. Pavlovic, and S. Kumar. Baselines for image annotation. International Journal of Computer Vision, 90(1):88--105, 2010.
[17]
F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. Proceedings of the eleventh ACM international conference on Multimedia, pages 275--278, New York, NY, USA, 2003. ACM.
[18]
A. Popescu and G. Grefenstette. Social media driven image retrieval. In ACM International Conference on Multimedia Retrieval (ICMR), pages 33:1--33:8, 2011.
[19]
G. Shafer. A Mathematical Theory of Evidence. Princeton University Press, Princeton, 1976.
[20]
B. Sigurbjörnsson and R. van Zwol. Flickr tag recommendation based on collective knowledge. WWW '08, pages 327--336, New York, NY, USA, 2008. ACM.
[21]
J. Tang, R. Hong, S. Yan, T.-S. Chua, G.-J. Qi, and R. Jain. Image annotation by knn-sparse graph-based label propagation over noisily tagged web images. ACM Transactions on Intelligent Systems and Technology (TIST), 2(2):14, 2011.
[22]
A. Torralba, R. Fergus, and W. T. Freeman. 80 million tiny images: A large data set for nonparametric object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell., 30(11):1958--1970, Nov. 2008.
[23]
C. Wang, F. Jing, L. Zhang, and H.-J. Zhang. Scalable search-based image annotation. Multimedia Syst., 14(4):205--220, 2008.
[24]
G. Wang, D. Hoiem, and D. A. Forsyth. Building text features for object image classification. In CVPR, pages 1367--1374, 2009.
[25]
L. Wu, R. Jin, and A. K. Jain. Tag completion for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 99(PrePrints), 2012.
[26]
H. Yu, M. Li, H.-J. Zhang, and J. Feng. Color texture moments for content-based image retrieval. ICIP '10, pages 24--28, 2003.
[27]
A. Znaidia, A. Shabou, A. Popescu, H. Le Borgne, and C. Hudelot. Multimodal feature generation framework for semantic image classification. In ICMR, International Conference on Multimedia Retrieval, ICMR '12, Hong Kong, China, June 5--8, 2012, page 38, 2012.

Cited By

View all

Index Terms

  1. Tag completion based on belief theory and neighbor voting

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
    April 2013
    362 pages
    ISBN:9781450320337
    DOI:10.1145/2461466
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 16 April 2013

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. bag of words
    2. belief theory
    3. classification
    4. image annotation
    5. local soft coding
    6. tag completion
    7. tag suggestion

    Qualifiers

    • Research-article

    Conference

    ICMR'13
    Sponsor:

    Acceptance Rates

    ICMR '13 Paper Acceptance Rate 38 of 96 submissions, 40%;
    Overall Acceptance Rate 254 of 830 submissions, 31%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 18 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)A survey on social image semantic analysisChinese Science Bulletin10.1360/TB-2022-093868:25(3368-3384)Online publication date: 11-Nov-2022
    • (2019)Cauchy Matrix Factorization for Tag-Based Social Image RetrievalIEEE Access10.1109/ACCESS.2019.29405987(132302-132310)Online publication date: 2019
    • (2019)Stacked Autoencoder Based Weak Supervision for Social Image UnderstandingIEEE Access10.1109/ACCESS.2019.28989917(21777-21786)Online publication date: 2019
    • (2017)Weakly-supervised deep nonnegative low-rank model for social image tag refinement and assignmentProceedings of the Thirty-First AAAI Conference on Artificial Intelligence10.5555/3298023.3298171(4154-4160)Online publication date: 4-Feb-2017
    • (2017)Weakly Supervised Deep Matrix Factorization for Social Image UnderstandingIEEE Transactions on Image Processing10.1109/TIP.2016.262414026:1(276-288)Online publication date: 1-Jan-2017
    • (2016)Socializing the Semantic GapACM Computing Surveys10.1145/290615249:1(1-39)Online publication date: 6-Jun-2016
    • (2016)Completing tags by local learningNeural Computing and Applications10.1007/s00521-015-1983-z27:8(2407-2416)Online publication date: 1-Nov-2016
    • (2015)A regularized optimization framework for tag completion and image retrievalNeurocomputing10.1016/j.neucom.2014.06.028147(500-508)Online publication date: Jan-2015
    • (2014)Massive Query Expansion by Exploiting Graph Knowledge Bases for Image RetrievalProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578737(33-40)Online publication date: 1-Apr-2014
    • (2014)A Cross-media Model for Automatic Image AnnotationProceedings of International Conference on Multimedia Retrieval10.1145/2578726.2578728(73-80)Online publication date: 1-Apr-2014

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media