[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2461466.2461503acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
poster

An information retrieval approach to identifying infrequent events in surveillance video

Published: 16 April 2013 Publication History

Abstract

This paper presents work on integrating multiple computer vision-based approaches to surveillance video analysis to support user retrieval of video segments showing human activities. Applied computer vision using real-world surveillance video data is an extremely challenging research problem, independently of any information retrieval (IR) issues. Here we describe the issues faced in developing both generic and specific analysis tools and how they were integrated for use in the new TRECVid interactive surveillance event detection task. We present an interaction paradigm and discuss the outcomes from face-to-face end user trials and the resulting feedback on the system from both professionals, who manage surveillance video, and computer vision or machine learning experts. We propose an information retrieval approach to finding events in surveillance video rather than solely relying on traditional annotation using specifically trained classifiers.

References

[1]
J. Aggarwal and M. Ryoo. Human activity analysis: A review. ACM Comput. Surv., 43(3), Apr. 2011.
[2]
A. Bosch, A. Zisserman, and X. Munoz. Representing shape with a spatial pyramid kernel. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pages 401--408, 2007.
[3]
N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pages 886--893, 2005.
[4]
N. Dalal, B. Triggs, and C. Schmid. Human detection using oriented histograms of flow and appearance. In European Conference on Computer Vision (ECCV), pages 428--441, 2006.
[5]
J. Davis. Hierarchical motion history images for recognizing human motion. In Proceedings of the IEEE Workshop on Detection and Recognition of Events in Video, pages 39--46, 2001.
[6]
C. R. del Blanco, F. Jaureguizar, and N. Garcia. An advanced Bayesian model for the visual tracking of multiple interacting objects. EURASIP Journal on Advances in Signal Processing, 130, 2011.
[7]
D. Doermann and D. Mihalcik. Tools and techniques for video performance evaluation. In Proceedings of the 15th International Conference on Pattern Recognition, volume 4, pages 167--170, 2000.
[8]
C. Dogbe. On the modelling of crowd dynamics by generalized kinetic models. Journal of Mathematical Analysis and Applications, 2011.
[9]
A. Doucet, N. Gordon, and V. Krishnamurthy. Particle filters for state estimation of jump markov linear systems. IEEE Transactions on Signal Processing, 49(3):613--624, 2001.
[10]
A. Efros, A. Berg, G. Mori, and J. Malik. Recognizing action at a distance. In Proceedings of the Ninth IEEE International Conference on Computer Vision, pages 726--733, 2003.
[11]
J. Ijsselmuiden and R. Stiefelhagen. Towards high-level human activity recognition through computer vision and temporal logic. In R. Dillmann, J. Beyerer, U. Hanebeck, and T. Schultz, editors, KI 2010: Advances in Artificial Intelligence, volume 6359 of Lecture Notes in Computer Science, pages 426--435. Springer Berlin Heidelberg, 2010.
[12]
P. Kelly, C. Ó Conaire, C. Kim, and N. O'Connor. Automatic camera selection for activity monitoring in a multi-camera system for tennis. In Third ACM/IEEE International Conference on Distributed Smart Cameras, pages 1--8, 2009.
[13]
H. W. Kuhn. The hungarian method for the assigment problem. Naval Research Logistic Quarterly, 2:83--97, 1955.\balancecolumns
[14]
I. Laptev, M. Marszalek, C. Schmid, and B. Rozenfeld. Learning realistic human actions from movies. In IEEE Conference on Computer Vision and Pattern Recognition, pages 1--8, 2008.
[15]
B. Lucas and T. Kanade. An iterative image registration technique with an application to stereo vision. In Proceedings of the 7th International Joint Conference on Artificial intelligence, 1981.
[16]
P. Over, G. Awad, M. Michel, J. Fiscus, G. Sanders, B. Shaw, W. Kraaij, A. F. Smeaton, and G. Quéenot. TRECVid 2012 -- an overview of the goals, tasks, data, evaluation mechanisms and metrics. In Proceedings of TRECVID 2012. NIST, USA, 2012.
[17]
R. Poppe. A survey on vision-based human action recognition. Image and Vision Computing, 28(6):976 -- 990, 2010.
[18]
M. Ryoo and J. Aggarwal. Semantic representation and recognition of continued and recursive human activities. International journal of computer vision, 82(1):1--24, 2009.
[19]
C. Tomasi and J. Shi. Good features to track. CVPR94, pages 593--600, 1994.
[20]
H. Wang, A. Klaser, C. Schmid, and C. Liu. Action recognition by dense trajectories. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 3169--3176, 2011.
[21]
H. Wang, M. Ullah, A. Klaser, I. Laptev, and C. Schmid. Evaluation of local spatio-temporal features for action recognition. In BMVC 2009-British Machine Vision Conference, 2009.
[22]
P. Wilkins, A. F. Smeaton, and P. Ferguson. Properties of optimally weighted data fusion in cbmir. In Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, pages 643--650, 2010.
[23]
X. Yang, Y. Tian, C. Yi, and L. Cao. MediaCCNY at TRECVID 2012: Surveillance event detection. In TRECVid 2012 - TREC Video Retrieval Evaluation Workshop, Gaithersburg, MD, 2012.
[24]
F. Zernike. Physica, volume 1. 1934.

Cited By

View all
  • (2022)The ASU-DCU International Research and Workforce Development Program on Sensors and Machine Learning2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA56318.2022.9904409(1-6)Online publication date: 18-Jul-2022
  • (2020)Image Representation for Cognitive Systems Using SOEKS and DDNA: A Case Study for PPE ComplianceIntelligent Information and Database Systems10.1007/978-3-030-41964-6_19(214-225)Online publication date: 4-Mar-2020
  • (2019)F2ConText: how to extract holistic contexts of persons of interest for enhancing exploratory analysisKnowledge and Information Systems10.1007/s10115-018-1304-961:1(363-396)Online publication date: 1-Oct-2019
  • Show More Cited By

Index Terms

  1. An information retrieval approach to identifying infrequent events in surveillance video

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
      April 2013
      362 pages
      ISBN:9781450320337
      DOI:10.1145/2461466
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 April 2013

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. surveillance event detection
      2. video analysis

      Qualifiers

      • Poster

      Conference

      ICMR'13
      Sponsor:

      Acceptance Rates

      ICMR '13 Paper Acceptance Rate 38 of 96 submissions, 40%;
      Overall Acceptance Rate 254 of 830 submissions, 31%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)8
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 26 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)The ASU-DCU International Research and Workforce Development Program on Sensors and Machine Learning2022 13th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA56318.2022.9904409(1-6)Online publication date: 18-Jul-2022
      • (2020)Image Representation for Cognitive Systems Using SOEKS and DDNA: A Case Study for PPE ComplianceIntelligent Information and Database Systems10.1007/978-3-030-41964-6_19(214-225)Online publication date: 4-Mar-2020
      • (2019)F2ConText: how to extract holistic contexts of persons of interest for enhancing exploratory analysisKnowledge and Information Systems10.1007/s10115-018-1304-961:1(363-396)Online publication date: 1-Oct-2019
      • (2018)Cognition and Decisional Experience to Support Safety Management in WorkplacesInformation Systems Architecture and Technology: Proceedings of 39th International Conference on Information Systems Architecture and Technology – ISAT 201810.1007/978-3-319-99996-8_24(266-275)Online publication date: 28-Aug-2018
      • (2017)Hazard Control in Industrial Environments: A Knowledge-Vision-Based ApproachInformation Systems Architecture and Technology: Proceedings of 38th International Conference on Information Systems Architecture and Technology – ISAT 201710.1007/978-3-319-67223-6_23(243-252)Online publication date: 1-Sep-2017
      • (2015)Optical flow-based representation for video action detectionEmerging Trends in Image Processing, Computer Vision and Pattern Recognition10.1016/B978-0-12-802045-6.00021-1(331-351)Online publication date: 2015
      • (2014)Perspective Multiscale Detection and Tracking of PersonsProceedings of the 20th Anniversary International Conference on MultiMedia Modeling - Volume 832610.1007/978-3-319-04117-9_9(92-103)Online publication date: 6-Jan-2014
      • (2014)An Evaluation of Local Action Descriptors for Human Action Classification in the Presence of OcclusionProceedings of the 20th Anniversary International Conference on MultiMedia Modeling - Volume 832610.1007/978-3-319-04117-9_6(56-67)Online publication date: 6-Jan-2014

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media