[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
article

A Bayesian network-based framework for semantic image understanding

Published: 01 June 2005 Publication History

Abstract

Current research in content-based semantic image understanding is largely confined to exemplar-based approaches built on low-level feature extraction and classification. The ability to extract both low-level and semantic features and perform knowledge integration of different types of features is expected to raise semantic image understanding to a new level. Belief networks, or Bayesian networks (BN), have proven to be an effective knowledge representation and inference engine in artificial intelligence and expert systems research. Their effectiveness is due to the ability to explicitly integrate domain knowledge in the network structure and to reduce a joint probability distribution to conditional independence relationships. In this paper, we present a general-purpose knowledge integration framework that employs BN in integrating both low-level and semantic features. The efficacy of this framework is demonstrated via three applications involving semantic understanding of pictorial images. The first application aims at detecting main photographic subjects in an image, the second aims at selecting the most appealing image in an event, and the third aims at classifying images into indoor or outdoor scenes. With these diverse examples, we demonstrate that effective inference engines can be built within this powerful and flexible framework according to specific domain knowledge and available training data to solve inherently uncertain vision problems.

References

[1]
M. Szummer, R.W. Picard, Indoor-outdoor image classification, Proceedings of Workshop on Content-based Access of Image and Video Database, 1998.
[2]
A. Vailaya, A. Jain, H.J. Zhang, On image classification: city images vs. landscapes, Pattern Recognition (1998) 1921-1935.
[3]
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A. and Jain, R., Content-based image retrieval at the end of the early years. IEEE Trans. PAMI. v22. 1349-1380.
[4]
Kam, M., Zhu, X. and Kalata, P., Sensor fusion for mobile robot navigation. Proc. IEEE. v85 i1. 108-119.
[5]
Klein, L.A., Sensor and Data Fusion Concepts and Applications. 1999. SPIE Press, Bellingham, WA.
[6]
G.A. Baraghimian, A. Klinger, Preference voting for sensor fusion, Proceedings of the International Society of Optical Engineering, Aerospace Sensing, Sensor Fusion III, 1990, pp. 46-57.
[7]
Pirjanian, P., Christensen, H.I. and Fayman, J.A., Application of voting to fusion of purposive modules: an experimental investigation. . Invited paper in J. Robotics Autonomous Systems. v23 i4. 253-266.
[8]
T. Brotherton, P. Simpson, Fuzzy neural networks for hierarchical fusion with applications, Proceedings of International Conference on Multisensor Fusion and Integration for Intelligent Systems, 1994.
[9]
Jouseau, E. and Dorizzi, B., Neural networks and fuzzy data fusion. Application to an online and real time vehicle detection system. Pattern Recognition Lett. v20. 97-107.
[10]
L. Rong, Z. Wang, Fusion of numerical and linguistic information by the use of fuzzy neural network, Proceedings of Multisource-Multisensor Information Fusion, July 6-9, 1998.
[11]
Lauritzen, S.L., Graphical Models. 1996. Oxford University Press, Oxford.
[12]
Y. Wang, H.-J. Zhang, Content-based image orientation detection with support vector machine, Proceedings of IEEE Workshop on Content-based Access of Image and Video Libraries, 2001.
[13]
N. Serrano, A. Savakis, J. Luo, A computationally efficient approach to indoor/outdoor scene classification, Proceedings of the International Conference on Pattern Recognition, 2002.
[14]
Pearl, J., Probabilistic Reasoning in Intelligent Systems. 1988. Morgan Kaufmann, San Francisco.
[15]
A. Vailaya, M. Figueriredo, A. Jain, H.-J. Zhang, Content-based hierarchical classification of vacation images, Proceedings of IEEE International Conference on Multimedia Computing and Systems, June 1999.
[16]
A. Vailaya, H.-J. Zhang, A.K. Jain, Automatic image orientation detection, Proceedings of IEEE International Conference on Image Processing, October 1999.
[17]
S. Peak, S.-F. Chang, A knowledge engineering approach for image classification based on probabilistic reasoning systems, Proceedings of IEEE International Conference on Multimedia and Exposition, 2000.
[18]
Serrano, N., Savakis, A.E. and Luo, J., Improved scene classification using efficient low-level features and semantic cues. Pattern Recognition. v37 i9. 1773-1784.
[19]
Luo, J., Etz, S., Singhal, A. and Gray, R.T., A computational approach to detecting main subjects in photographic images. Image Vision Comput. v22. 227-241.
[20]
Ballard, D. and Brown, C., Computer Vision. 1982. Prentice-Hall, Englewood Cliffs, NJ.
[21]
Sonka, M., Hlavac, V. and Boyle, R., Image Processing, Analysis, and Machine Vision. 1999. second ed. Brooks & Cole Publishing, Pacific Grove, CA.
[22]
Dudgeon, D.E. and Lacoss, R.T., An overview of automatic target recognition. Lincoln Lab. J. v6 i1. 3-10.
[23]
J. Schurmann, N. Bartneck, T. Bayer, J. Franke, E. Mandler, M. Oberlander, Document analysis-from pixels to contents, Proc. IEEE (1992) 1101-1119.
[24]
Robinson, G.P. and Colchester, A.C.F., Model-based recognition of anatomical objects from medical images. Image Vision Comput. v8. 499-507.
[25]
P. Lipson, Context and configuration-based scene classification, Ph.D. Thesis, MIT, Cambridge, MA, 1996.
[26]
P. Lipson, E. Grimson, P. Sinha, Configuration-based scene classification and image indexing, Proceedings of IEEE CVPR, 1997.
[27]
Smith, J.R. and Li, C.-S., Image classification and querying using composite region templates. Comput. Vision Image Understanding. v75 i1/2. 165-174.
[28]
Pradhan, M., The sensitivity of belief networks to imprecise probabilities: an experimental investigation. . Artif. Intell. v85.
[29]
de Dombal, F.T., Can computer aided teaching packages improve clinical care in patients with acute abdominal pain?. Br. Med. J. v302.
[30]
Kwoh, C.K. and Gillies, D.F., Using hidden nodes in Bayesian Network. Artif. Intell. v88.
[31]
Fung, R.M. and del Favero, B., Applying Bayesian Network to information retrieval. Commun. ACM. v38 i3.
[32]
Sarkar, S. and Chavali, S., Modeling parameter space behavior of vision systems using BN. Comput. Vision Image Understanding. v79.
[33]
D. Heckerman, A tutorial on learning with Bayesian Network, Technical Report MSD-TR-95-06, Microsoft Research, March 1995.
[34]
Buntine, W., Operations for learning with graphical models. Artif. Intell. Res. v2.
[35]
S. Etz, J. Luo, Ground truth for training and evaluation of automatic main subject detection, Proceedings of Human Vision and Electronic Imaging, 2000.
[36]
W. Osberger, A.J. Maeder, Automatic identification of perceptually important regions in an image, Proceedings of the International Conference on Image Processing, 1998.
[37]
J. Luo, A. Singhal, A. Savakis, Efficient mobile imaging using emphasis image selection, Proceedings of the PICS Conference, 2003.
[38]
A.C. Loui, A.E. Savakis, Automated Event Clustering and Quality Screening of Consumer Pictures for Digital Albuming, IEEE Trans. Multimedia, pp. 390-402, September 2003.
[39]
S. Paek, C.L. Sable, V. Hatzivassiloglou, A. Jaimes, B.H. Schiffman, S.-F. Chang, K.R. McKeown, Integration of visual and text based approaches for the content labeling and classification of photographs, Proceedings of ACM SIGIR '99 Workshop on Multimedia Indexing and Retrieval, 1999.
[40]
M.J. Kane, A.E. Savakis, Bayesian Network structure learning and inference in indoor vs. outdoor image classification, Proceedings of International Conference on Pattern Recognition, 2004.
[41]
Ohta, Y.-I., Kanade, T. and Sakai, T., Color information for region segmentation. Comput. Graphics Image Process. v13. 222-241.
[42]
Hogg, R.V. and Tanis, E.A., Probability and Statistical Inference. 1988. Macmillan, New York.
[43]
A. Savakis, S.P. Etz, A.C. Loui, Evaluation of image appeal in consumer photography, Proceedings of Human Vision and Electronic Imaging, 2000.
[44]
S. Etz, et al., Quantitative evaluation of rank-order similarity of images, Proceedings of IEEE International Conference on Image Processing, 2000.
[45]
J. Antonisse, K.S. Keller, Dynamic evaluation of sources in rule-based sensor fusion systems, Proceedings of the Data Fusion Symposium, 1988.

Cited By

View all
  • (2021)Multidimensional continuous time Bayesian network classifiersInternational Journal of Intelligent Systems10.1002/int.2261136:12(7839-7866)Online publication date: 26-Oct-2021
  • (2018)Categorization of Images Using Autoencoder Hashing and Training of Intra Bin Classifiers for Image Classification and AnnotationJournal of Medical Systems10.1007/s10916-018-0986-642:7(1-15)Online publication date: 1-Jul-2018
  • (2017)Machine Vision for UAS Ground OperationsJournal of Intelligent and Robotic Systems10.1007/s10846-017-0542-588:2-4(527-546)Online publication date: 1-Dec-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Pattern Recognition
Pattern Recognition  Volume 38, Issue 6
June, 2005
162 pages

Publisher

Elsevier Science Inc.

United States

Publication History

Published: 01 June 2005

Author Tags

  1. Bayesian networks
  2. Domain knowledge
  3. Low-level features
  4. Semantic features
  5. Semantic image understanding

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Multidimensional continuous time Bayesian network classifiersInternational Journal of Intelligent Systems10.1002/int.2261136:12(7839-7866)Online publication date: 26-Oct-2021
  • (2018)Categorization of Images Using Autoencoder Hashing and Training of Intra Bin Classifiers for Image Classification and AnnotationJournal of Medical Systems10.1007/s10916-018-0986-642:7(1-15)Online publication date: 1-Jul-2018
  • (2017)Machine Vision for UAS Ground OperationsJournal of Intelligent and Robotic Systems10.1007/s10846-017-0542-588:2-4(527-546)Online publication date: 1-Dec-2017
  • (2016)Modelling uncertainty in social-natural interactionsEnvironmental Modelling & Software10.1016/j.envsoft.2014.07.00875:C(362-372)Online publication date: 1-Jan-2016
  • (2016)Cognitive computing and proposed approaches to conceptual organization of case law knowledge basesArtificial Intelligence and Law10.1007/s10506-016-9188-z24:4(347-370)Online publication date: 1-Dec-2016
  • (2015)Bayesian network based semantic image classification with attributed relational graphMultimedia Tools and Applications10.1007/s11042-014-1858-974:13(4965-4986)Online publication date: 1-Jun-2015
  • (2014)Support vector description of clusters for content-based image annotationPattern Recognition10.1016/j.patcog.2013.10.01547:3(1361-1374)Online publication date: 1-Mar-2014
  • (2014)Non-Gaussian Data Clustering via Expectation Propagation Learning of Finite Dirichlet Mixture Models and ApplicationsNeural Processing Letters10.1007/s11063-013-9293-x39:2(115-135)Online publication date: 1-Apr-2014
  • (2013)Applying Bayesian Network Techniques to Prioritize Lean Six Sigma EffortsInternational Journal of Strategic Decision Sciences10.4018/jsds.20130401014:2(1-15)Online publication date: 1-Apr-2013
  • (2013)Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selectionPattern Recognition10.1016/j.patcog.2013.03.02646:10(2754-2769)Online publication date: 1-Oct-2013
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media