[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Kernel and Spectral Methods for Learning the Semantics of Images

  • Conference paper
  • First Online:
Computer and Information Sciences

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 62))

  • 833 Accesses

Abstract

In order to bridge the semantic gap, learning the semantics of images automatically using visual features alone has been an area of active research. Recently, visual keywords extracted from images have been shown to provide a useful intermediate representation for image characterization and retrieval. A challenging problem is to find effectively ways of extracting, representing and using the context of visual keyword for learning image semantic. In this paper, we will present a number of kernel and spectral methods which our research group has developed for learning the semantics of images, which can be applied to a variety of image annotation, categorization and retrieval tasks. To capture the context of visual keywords, we propose two contextual kernels, called spatial Markov kernel and spatial mismatch kernel, respectively. The first kernel is defined based on Markov models, while the second kernel is motivated from the concept of string kernel and derived without the use of any generative models. The experimental results show that the context captured by our kernels is very effective for learning the semantics of images. Moreover, to learn a semantically compact (or high level) vocabulary, we further propose a spectral embedding method to capture the local intrinsic geometric (i.e. manifold) structure of the original abundant visual keywords. This spectral method can also be applied to manifold learning on textual keywords for image annotation refinement. The experimental results show that our spectral methods lead to significant improvement in performance by capturing the manifold structure of visual or textual keywords.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
GBP 19.95
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
GBP 143.50
Price includes VAT (United Kingdom)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
GBP 179.99
Price includes VAT (United Kingdom)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
GBP 179.99
Price includes VAT (United Kingdom)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Feng, S., Manmatha, R, Lavrenko, V.: Multiple Bernoulli Relevance Models for Image and Video Annotation. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1002–1009. IEEE Press (2004)

    Google Scholar 

  2. Lu, Z., Peng, Y., Ip, H.: Image Categorization via Robust pLSA. Pattern Recognition Letters, 31 (1), 36–43 (2010)

    Article  Google Scholar 

  3. Lu, Z., Ip, H.: Generalized Relevance Models for Automatic Image Annotation. In: Pacific-Rim Conference on Multimedia (PCM), pp. 245–255. Springer Press (2009)

    Google Scholar 

  4. Lu, Z., Ip, H., He, Q.: Context-Based Multi-Label Image Annotation. In: ACM International Conference on Image and Video Retrieval (CIVR). ACM Press (2009)

    Google Scholar 

  5. Li, J., Wang, J.: Automatic Linguistic Indexing of Pictures by a Statistical Modeling Approach. IEEE Trans. on Pattern Analysis and Machine Intelligence, 25(9), 1075—1088 (2003)

    Article  Google Scholar 

  6. Yu, F., Ip H.: Automatic Semantic Annotation of Images Using Spatial Hidden Markov Model. In: International Conference on Multimedia and Expo (ICME), pp. 305–308. IEEE Press (2006)

    Google Scholar 

  7. Wang, L., Lu, Z., Ip, H.: Image Categorization Based on a Hierarchical Spatial Markov Model. In: International Conference on Computer Analysis of Images and Patterns (CAIP), pp. 766–773. Springer Press (2009)

    Google Scholar 

  8. Lu, Z., Ip, H.: Combining Context, Consistency, and Diversity Cues for Interactive Image Categorization. IEEE Trans. on Multimedia, 12(3), 194–203 (2010)

    Article  Google Scholar 

  9. Lu, Z., Ip, H.: Image Categorization with Spatial Mismatch Kernels. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 397–404. IEEE Press (2009)

    Google Scholar 

  10. Lu, Z., Ip, H.: Learning the Semantics of Images Using Visual and Semantic Context. IEEE Trans. on Multimedia. (Under second round review)

    Google Scholar 

  11. Lazebnik, S., Schmid, C., Ponce, J.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2169–2178. IEEE Press (2006)

    Google Scholar 

  12. Leslie, C., Eskin, E., Noble, W.: The Spectrum Kernel: A String Kernel for SVM Protein Classification. In: Pacific Symposium on Biocomputing, pp. 566–575. (2002)

    Google Scholar 

  13. Rodgers, J., Nicewander, W: Thirteen Ways to Look at the Correlation Coefficient. The American Statistician, 42 (1), 59–66 (1988)

    Article  Google Scholar 

  14. Liu, J., Yang, Y., Shah, M.: Learning Semantic Visual Vocabularies Using Diffusion Distance. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 461–468. IEEE Press (2009).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Horace H. S. Ip .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer Science+Business Media B.V.

About this paper

Cite this paper

Ip, H.H.S. (2011). Kernel and Spectral Methods for Learning the Semantics of Images. In: Gelenbe, E., Lent, R., Sakellari, G., Sacan, A., Toroslu, H., Yazici, A. (eds) Computer and Information Sciences. Lecture Notes in Electrical Engineering, vol 62. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-9794-1_60

Download citation

  • DOI: https://doi.org/10.1007/978-90-481-9794-1_60

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-9793-4

  • Online ISBN: 978-90-481-9794-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics