[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to main content

Surveillance Audio Attention Model Based on Spatial Audio Cues

  • Conference paper
Advances in Multimedia Information Processing - PCM 2009 (PCM 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5879))

Included in the following conference series:

  • 1097 Accesses

Abstract

For stereo audio surveillance in complex environment, we proposed a bottom-up audio attention model based on spatial audio cues analysis, and an environment adaptive normalization method. The traditional audio attention models are based on mono audio characters, such as energy, energy peak, or pitch. Their performance is limited by neglecting the spatial information. The spatial cues in audio stream provide additional information for attention analysis. And the dynamic updated background sound can help to reduce the environment effect. The preliminary experiment showed that the proposed model is an effective way to analyzing attention events, which is caused by rapid moving sound source, in stereo audio stream.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. James, W.: The Principles of Psychology. Harvard Univ. Press, Cambridge (1890)

    Google Scholar 

  2. Treisman, A., Gelande, G.: A Feature integration theory of attention. Cognitive Psychology 12, 97–136 (1980)

    Article  Google Scholar 

  3. Treisman, A., Gormican, S.: Feature analysis in early vision: evidence from search asymmetries. Psychol. Rev. 95, 15–48 (1988)

    Article  Google Scholar 

  4. Treisman, A.: Perception of features and objects. In: Visual Attention. Oxford Univ. Press, New York (1998)

    Google Scholar 

  5. Posner, M.L.: The Attention System of the Human Brain. Annu. Rev. Neurosci. 13, 25–42 (1990)

    Article  Google Scholar 

  6. Egeth, H.E., Yantis, S.: Visual attention: control, representation, and time course. Annu. Rev. Psychol. 48, 269–297 (1997)

    Article  Google Scholar 

  7. Cui, R., Lu, L., Zhung, H.-J., Cai, L.-H.: Highlight sound effects detection in audio stream. In: ICME (May 2003)

    Google Scholar 

  8. Ma, Y.-F., Hua, X.-S., Lu, L., Zhang, H.-J.: A generic framework of user attention model and its application in video summarization. IEEE Transaction on Multimedia 7, 907–919 (2005)

    Article  Google Scholar 

  9. Huang, Q.-M., Zheng, Y.-J., Jiang, S.-Q., Gao, W.: User Attention Analysis Based Video Summarization and Highlight Ranking. Chinese Journal Of Computers 31(9) (September 2008)

    Google Scholar 

  10. Kalinli, O., Narayanan, S.: A Top-Down Auditory Attention Model for Learning Task Dependent Influences on Prominence Detection in speech. In: ICASSP (March 2008)

    Google Scholar 

  11. Liu, A., Li, J., Zhang, Y., Tang, S., Song, Y., Yang, Z.: Human Attention Model for Action Movie Analysis. In: ICPCA (July 2007)

    Google Scholar 

  12. Evangelopoulos, G., Rapantsikos, K., Potamianos, A., Maragos, P., Zlatintsi, A., Avrithis, Y.: Movie Summarization Based on Audiovisual Saliency Detection. In: ICIP (October 2008)

    Google Scholar 

  13. Faller, C.: Parametric Coding of Spatial Audio. Ph.D Thesis (2004)

    Google Scholar 

  14. Moore, B.C.J.: An Introduction to the Psychology of Hearing, 5th edn. Elsevier Academic Press, Amsterdam (2004)

    Google Scholar 

  15. Roman, N., Wang, D.L.: Binaural Tracking of Multiple Moving Sources. IEEE Transaction on Audio, Speech, and Language Processing 16(4), 728–739 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hang, B., Hu, R., Yang, Y., Ma, Y., Chang, J. (2009). Surveillance Audio Attention Model Based on Spatial Audio Cues. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds) Advances in Multimedia Information Processing - PCM 2009. PCM 2009. Lecture Notes in Computer Science, vol 5879. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10467-1_81

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-10467-1_81

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-10466-4

  • Online ISBN: 978-3-642-10467-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics