Automatic Annotation of Images from the Practitioner Perspective

Peter G. B. Enser²¹,
Christine J. Sandom²¹ &
Paul H. Lewis²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3568))

Included in the following conference series:

International Conference on Image and Video Retrieval

1192 Accesses

Abstract

This paper describes an ongoing project which seeks to contribute to a wider understanding of the realities of bridging the semantic gap in visual image retrieval. A comprehensive survey of the means by which real image retrieval transactions are realised is being undertaken. An image taxonomy has been developed, in order to provide a framework within which account may be taken of the plurality of image types, user needs and forms of textual metadata. Significant limitations exhibited by current automatic annotation techniques are discussed, and a possible way forward using ontologically supported automatic content annotation is briefly considered as a potential means of mitigating these limitations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A Content-Based Visual Information Retrieval Approach for Automated Image Annotation

Image annotation: the effects of content, lexicon and annotation method

Article 02 March 2020

Review: Automatic Image Annotation for Semantic Image Retrieval

References

Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)
Article Google Scholar
Zhao, R., Grosky, W.I.: Bridging the semantic gap in image retrieval. In: Shih, T.K. (ed.) Distributed multimedia databases: techniques & applications, pp. 14–36. Idea Group Publishing, Hershey (2002)
Google Scholar
Jőrgensen, C.: Image retrieval: theory and research. The Scarecrow Press, Lanham (2003)
Google Scholar
Enser, P.G.B.: Pictorial information retrieval (Progress in Documentation). Journal of Documentation 51(2), 126–170 (1995)
Article Google Scholar
Rasmussen, E.M.: Indexing images. In: Williams, M.E. (ed.) Annual Review of Information Science 32. Information Today (ASIS), Information Today, Medford, New Jersey, pp. 169–196 (1997)
Google Scholar
Sandore, B. (ed.): Progress in visual information access and retrieval. Library Trends, 48(2), 283–524 (1999)
Google Scholar
Shatford, S.: Analysing the subject of a picture; a theoretical approach. Cataloging & Classification Quarterly 6(3), 39–62 (1986)
Article Google Scholar
Barnard, K., Duygulu, P., Forsyth, D., De Freitas, N., Blei, D.M., Jordan, M.I.: Matching Words and Pictures. Journal of Machine Learning Research 3(6), 1107–1135
Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval, pp. 119–126. ACM Press, New York (2003), http://ciir.cs.umass.edu/pubfiles/mm-41.pdf
Google Scholar
Fan, J., Hangzai Luo, Y.G., Xu, G.: Automatic image annotation by using concept-sensitive salient objects for image content representation. In: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval, pp. 361–368. ACM Press, New York (2004)
Google Scholar
Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Seventeenth Annual Conference on Neural Information Processing Systems (2003)
Google Scholar
Zhao, R., Grosky, W.I.: From Features to Semantics: Some Preliminary Results. In: IEEE International Conference on Multimedia and Expo, New York (2000), http://www.cs.sunysb.edu/~rzhao/publications/ICME00.pdf
Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. ACM Multimedia, 275–278 (2003)
Google Scholar
Kosinov, S., Marchand-Maillet, S.: Hierarchical ensemble learning for multimedia categorisation and autoannotation. In: Proceedings IEEE Machine Learning for Signal Processing workshop (MLSP), Sao Luis, Brazil (2004)
Google Scholar
Enser, P.G.B.: Query Analysis in a Visual Information Retrieval Context. Journal of Document and Text Management 1(1), 25–52 (1993)
Google Scholar
Armitage, L.H., Enser, P.G.B.: Analysis of user need in image archives. Journal of Information Science 23(4), 287–299 (1997)
Article Google Scholar
Enser, P., Sandom, C.: Retrieval of Archival Moving Imagery - CBIR Outside the Frame? In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, pp. 202–214. Springer, Berlin (2002)
Chapter Google Scholar
Panofsky, E.: Meaning in the visual arts. Doubleday Anchor Books, Garden City (1955)
Google Scholar
Cawkell, A.E.: Selected aspects of image processing and management: review and future prospects. Journal of Information Science 18(3), 179–192 (1992)
Article Google Scholar
Enser, P.: Visual image retrieval: seeking the alliance of concept-based and content-based paradigms. Journal of Information Science 26(4), 199–210 (2000)
Article Google Scholar
Edina: Education Image Gallery, http://edina.ac.uk/eig/
Wellcome Trust: Medical Photographic Library, http://medphoto.wellcome.ac.uk
Science & Society Picture Library, http://www.scienceandsociety.co.uk
Corporation of London: Talisweb, http://librarycatalogue.cityoflondon.gov.uk:8001/
Town, C., Sinclair, D.: Language-based querying of image collections on the basis of an extensible ontology. Image and Vision Computing 22(3), 251–267 (2003)
Article Google Scholar
Jaimes, A., Smith, J.R.: Semi-automatic, Data-driven Construction of Multimedia Ontologies. In: Proceedings of the IEEE International Conference on Multimedia and Expo (2003), http://mia.ece.uic.edu/~papers/MediaBot/pdf00002.pdf
Hollink, L., Schreiber, A., Wielemaker Th., J., Wielinga, B.: Semantic Annotation of Image Collections. In: Proceedings of the KCAP 2003 Workshop on Knowledge Capture and Semantic Annotation, Florida (2003), http://www.cs.vu.nl/~guus/papers/Hollink03b.pdf
Goodall, S., Lewis, P.H., Martinez, K., Sinclair, P.A.S., Giorgini, F., Addis, M.J., Laharnier, C., Stevenson, J.: Knowledge-based exploration of multimedia museum collections. In: Proceedings of the European workshop on the integration of knowledge semantics and digital media technology, London, pp. 415–422 (2004)
Google Scholar
Addis, M., Boniface, M., Goodall, S., Grimwood, P., Kim, S., Lewis, P., Martinez, K., Stevenson, A.: SCULPTEUR: Towards a New Paradigm for Multimedia Museum Information Handling. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 582–596. Springer, Heidelberg (2003)
Chapter Google Scholar
Hu, B., Dasmahapatra, S., Lewis, P., Shadbolt, N.: Ontology-based Medical Image Annotation with Description Logics. In: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence, Sacramento, CA, USA (2003) (in press)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing, Mathematical and Information Sciences, University of Brighton,
Peter G. B. Enser & Christine J. Sandom
Department of Electronics and Computer Science, University of Southampton,
Paul H. Lewis

Authors

Peter G. B. Enser
View author publications
You can also search for this author in PubMed Google Scholar
Christine J. Sandom
View author publications
You can also search for this author in PubMed Google Scholar
Paul H. Lewis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science, National University of Singapore, Computing 1, 117590, Singapore
Wee-Kheng Leow
LIACS Media Lab, Leiden University,
Michael S. Lew & Erwin M. Bakker &
National University of Singapore, 3 Science Dr, 117543, Singapore
Tat-Seng Chua
Microsoft Research Asia, 4F, Sigma Center, No.49, Zhichun Road, 100080, Beijing, P.R.China
Wei-Ying Ma
School of Computing, National University of Singapore, 3 Science Drive 2, 117543, Singapore
Lekha Chaisorn

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Enser, P.G.B., Sandom, C.J., Lewis, P.H. (2005). Automatic Annotation of Images from the Practitioner Perspective. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_53

Download citation

DOI: https://doi.org/10.1007/11526346_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics