research-article

Compressed domain video zoom motion analysis utilizing CURL

Published: 01 April 2022

Abstract

This paper applies the concept of CURL, borrowed from vector calculus, to the problems of zoom motion detection and classification. The input to the proposed method is the set of interframe block motion vectors extracted from the compressed bitstream. The motion vector field is partitioned into 4 representative quadrants, and the block motion vectors are quantized into 3 levels and converted into complex motion vector space. The resultant vector for each of the 4 quadrants is estimated, followed by the velocity vector between the quadrants. The CURL of the velocity field is then estimated; its magnitude essentially gives the area enclosed between the resultant quadrant motion vectors and is used to separate zooming from non-zooming camera types. Zooming camera frames are further classified into zoom-in and zoom-out types using the direction information (anti-clockwise/clockwise) extracted from the CURL of the velocity field. The novelty lies in applying a concept from vector calculus to the zoom motion analysis problem. Although the features extracted from CURL are handcrafted, we demonstrate their superiority over existing methods, including a deep learning architecture, and show that they are robust in the presence of noise. Experimental validation using block motion vectors obtained with the Exhaustive Search Motion Estimation algorithm, as well as H.264 decoded block motion vectors, demonstrates superior performance for the proposed method in both detection accuracy and computational complexity in comparison with existing techniques.
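
The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is one possible reading of the abstract, not the authors' implementation. The following are all assumptions made for illustration: the per-component sign quantization (as the "3 levels"), the placement of the inter-quadrant velocities at the midpoints between adjacent quadrants, the discrete circulation used as a stand-in for the CURL, the `threshold` value, and every function name. Motion vectors are taken as complex numbers `dx + 1j*dy` with `dy` positive upward. Which rotation sense corresponds to zoom-in depends on the codec's motion vector convention, so the sketch reports only the sense.

```python
import numpy as np

def quadrant_resultants(mv):
    """Sum sign-quantized complex motion vectors over the four quadrants.

    mv: (H, W) complex array, one entry per block, row 0 at the top of the frame.
    Returns the resultants in anti-clockwise order: top-right, top-left,
    bottom-left, bottom-right.
    """
    h, w = mv.shape
    # Assumed 3-level quantization: each component becomes -1, 0 or +1.
    q = np.sign(mv.real) + 1j * np.sign(mv.imag)
    top, bottom = q[: h // 2], q[h // 2 :]
    return np.array([
        top[:, w // 2 :].sum(),
        top[:, : w // 2].sum(),
        bottom[:, : w // 2].sum(),
        bottom[:, w // 2 :].sum(),
    ])

def circulation(resultants):
    """Discrete circulation of the inter-quadrant velocity field."""
    # Velocity between quadrant i and the next quadrant in anti-clockwise order.
    v = np.roll(resultants, -1) - resultants
    # Assumed positions of those velocities: top, left, bottom, right midpoints.
    p = np.array([1j, -1, -1j, 1])
    # z-component of the cross product p x v equals Im(conj(p) * v).
    return float(np.sum((np.conj(p) * v).imag))

def classify(mv, threshold=1.0):
    """Label a frame by the magnitude and sign of the circulation (hypothetical threshold)."""
    c = circulation(quadrant_resultants(mv))
    if abs(c) < threshold:
        return "non-zoom"
    return "zoom (anti-clockwise)" if c > 0 else "zoom (clockwise)"

def synthetic_zoom(h, w, scale):
    """Radial test field: vectors point away from the frame centre when scale > 0."""
    ys, xs = np.mgrid[0:h, 0:w]
    x = xs - (w - 1) / 2
    y = (h - 1) / 2 - ys  # flip so that y grows upward
    return scale * (x + 1j * y)
```

On a synthetic radial field, `classify(synthetic_zoom(8, 8, 1.0))` and `classify(synthetic_zoom(8, 8, -1.0))` give opposite rotation senses, while a uniform pan such as `np.full((8, 8), 3 + 0j)` yields identical quadrant resultants, zero inter-quadrant velocity, and therefore the non-zoom label.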

Cited By

  • (2024) A video compression-cum-classification network for classification from compressed video streams. The Visual Computer: International Journal of Computer Graphics 40:11 (7539-7558). doi:10.1007/s00371-023-03242-w. Online publication date: 1-Nov-2024

            Published In

            Multimedia Tools and Applications  Volume 81, Issue 9
            Apr 2022
            1278 pages

            Publisher

            Kluwer Academic Publishers

            United States

            Publication History

            Published: 01 April 2022
            Accepted: 18 January 2022
            Revision received: 10 January 2022
            Received: 26 November 2020

            Author Tags

            1. Zooming camera
            2. CURL
            3. Camera motion
            4. Compressed domain
            5. Block motion vectors
            6. H.264 codec
