Abstract
This paper describes an ongoing project which seeks to contribute to a wider understanding of the realities of bridging the semantic gap in visual image retrieval. A comprehensive survey of the means by which real image retrieval transactions are realised is being undertaken. An image taxonomy has been developed, in order to provide a framework within which account may be taken of the plurality of image types, user needs and forms of textual metadata. Significant limitations exhibited by current automatic annotation techniques are discussed, and a possible way forward using ontologically supported automatic content annotation is briefly considered as a potential means of mitigating these limitations.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(12), 1349–1380 (2000)
Zhao, R., Grosky, W.I.: Bridging the semantic gap in image retrieval. In: Shih, T.K. (ed.) Distributed multimedia databases: techniques & applications, pp. 14–36. Idea Group Publishing, Hershey (2002)
Jőrgensen, C.: Image retrieval: theory and research. The Scarecrow Press, Lanham (2003)
Enser, P.G.B.: Pictorial information retrieval (Progress in Documentation). Journal of Documentation 51(2), 126–170 (1995)
Rasmussen, E.M.: Indexing images. In: Williams, M.E. (ed.) Annual Review of Information Science 32. Information Today (ASIS), Information Today, Medford, New Jersey, pp. 169–196 (1997)
Sandore, B. (ed.): Progress in visual information access and retrieval. Library Trends, 48(2), 283–524 (1999)
Shatford, S.: Analysing the subject of a picture; a theoretical approach. Cataloging & Classification Quarterly 6(3), 39–62 (1986)
Barnard, K., Duygulu, P., Forsyth, D., De Freitas, N., Blei, D.M., Jordan, M.I.: Matching Words and Pictures. Journal of Machine Learning Research 3(6), 1107–1135
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the 26th annual international ACM SIGIR conference on research and development in information retrieval, pp. 119–126. ACM Press, New York (2003), http://ciir.cs.umass.edu/pubfiles/mm-41.pdf
Fan, J., Hangzai Luo, Y.G., Xu, G.: Automatic image annotation by using concept-sensitive salient objects for image content representation. In: Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval, pp. 361–368. ACM Press, New York (2004)
Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Seventeenth Annual Conference on Neural Information Processing Systems (2003)
Zhao, R., Grosky, W.I.: From Features to Semantics: Some Preliminary Results. In: IEEE International Conference on Multimedia and Expo, New York (2000), http://www.cs.sunysb.edu/~rzhao/publications/ICME00.pdf
Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. ACM Multimedia, 275–278 (2003)
Kosinov, S., Marchand-Maillet, S.: Hierarchical ensemble learning for multimedia categorisation and autoannotation. In: Proceedings IEEE Machine Learning for Signal Processing workshop (MLSP), Sao Luis, Brazil (2004)
Enser, P.G.B.: Query Analysis in a Visual Information Retrieval Context. Journal of Document and Text Management 1(1), 25–52 (1993)
Armitage, L.H., Enser, P.G.B.: Analysis of user need in image archives. Journal of Information Science 23(4), 287–299 (1997)
Enser, P., Sandom, C.: Retrieval of Archival Moving Imagery - CBIR Outside the Frame? In: Lew, M., Sebe, N., Eakins, J.P. (eds.) CIVR 2002. LNCS, vol. 2383, pp. 202–214. Springer, Berlin (2002)
Panofsky, E.: Meaning in the visual arts. Doubleday Anchor Books, Garden City (1955)
Cawkell, A.E.: Selected aspects of image processing and management: review and future prospects. Journal of Information Science 18(3), 179–192 (1992)
Enser, P.: Visual image retrieval: seeking the alliance of concept-based and content-based paradigms. Journal of Information Science 26(4), 199–210 (2000)
Edina: Education Image Gallery, http://edina.ac.uk/eig/
Wellcome Trust: Medical Photographic Library, http://medphoto.wellcome.ac.uk
Science & Society Picture Library, http://www.scienceandsociety.co.uk
Corporation of London: Talisweb, http://librarycatalogue.cityoflondon.gov.uk:8001/
Town, C., Sinclair, D.: Language-based querying of image collections on the basis of an extensible ontology. Image and Vision Computing 22(3), 251–267 (2003)
Jaimes, A., Smith, J.R.: Semi-automatic, Data-driven Construction of Multimedia Ontologies. In: Proceedings of the IEEE International Conference on Multimedia and Expo (2003), http://mia.ece.uic.edu/~papers/MediaBot/pdf00002.pdf
Hollink, L., Schreiber, A., Wielemaker Th., J., Wielinga, B.: Semantic Annotation of Image Collections. In: Proceedings of the KCAP 2003 Workshop on Knowledge Capture and Semantic Annotation, Florida (2003), http://www.cs.vu.nl/~guus/papers/Hollink03b.pdf
Goodall, S., Lewis, P.H., Martinez, K., Sinclair, P.A.S., Giorgini, F., Addis, M.J., Laharnier, C., Stevenson, J.: Knowledge-based exploration of multimedia museum collections. In: Proceedings of the European workshop on the integration of knowledge semantics and digital media technology, London, pp. 415–422 (2004)
Addis, M., Boniface, M., Goodall, S., Grimwood, P., Kim, S., Lewis, P., Martinez, K., Stevenson, A.: SCULPTEUR: Towards a New Paradigm for Multimedia Museum Information Handling. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 582–596. Springer, Heidelberg (2003)
Hu, B., Dasmahapatra, S., Lewis, P., Shadbolt, N.: Ontology-based Medical Image Annotation with Description Logics. In: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence, Sacramento, CA, USA (2003) (in press)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Enser, P.G.B., Sandom, C.J., Lewis, P.H. (2005). Automatic Annotation of Images from the Practitioner Perspective. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_53
Download citation
DOI: https://doi.org/10.1007/11526346_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer ScienceComputer Science (R0)