Abstract
In this paper, we propose an accurate and effective method for detecting abnormal behavior. We consider the video as a series of frame sequences; in the training phase, our deep learning framework is used to extract appearance features and learn the relationship between historical features and current features in the normal video. In the testing phase, the predicted features that differ from the actual features are considered as abnormal. Our model is designed as a feature prediction framework with a new temporal attention mechanism. In the feature extraction stage, we transform a pre-trained Vgg16 network into a fully convolutional neural network and used the third pooling layer output as the appearance feature extraction to effectively capture static appearance features. Then, a new temporal attention mechanism is introduced to learn the contribution of different historical appearance features at the same position to the current features, thereby solving the problem of representing dynamic motion features. Finally, the LSTM network is used to decode the historical feature sequences with temporal attention to predict the features at the current moment. Those actual features that differ from the predicted features are considered as abnormal features. Using upsampling for the abnormal features locates abnormal behavior on the original frames. Experiments on two benchmark datasets demonstrate the competitive performance of our method with the state-of-the-art methods.
Similar content being viewed by others
References
Mingliang X, Li C, Lv P, Lin N, Hou R, Zhou B (2017) An efficient method of crowd aggregation computation in public areas. IEEE Trans Circuits Syst Video Technol 28(10):2814–2825
Xie S, Zhang X, Cai J (2019) Video crowd detection and abnormal behavior model detection based on machine learning method. Neural Comput Appl 31:175–184
Shen M, Jiang X, Sun T (2018) Anomaly detection based on nearest neighbor search with locality-sensitive B-tree. Neurocomputing 289:55–67
Xing H, Huang Y, Duan Q, Ci W, Dai J, Yang H (2018) Abnormal event detection in crowded scenes using histogram of oriented contextual gradient descriptor. EURASIP J Adv Signal Process 2018(1):54
Ke X, Jiang X, Sun T (2018) Anomaly detection based on stacked sparse coding with intraframe classification strategy. IEEE Trans Multimed 20(5):1062–1074
Sabokrou M, Fayyaz M, Fathy M, Klette R (2017) Deep-cascade: cascading 3D deep neural networks for fast anomaly detection and localization in crowded scenes. IEEE Trans Image Process 26(4):1992–2004
Coşar S, Donatiello G, Bogorny V, Garate C, Alvares LO, Brémond F (2017) Toward abnormal trajectory and event detection in video surveillance. IEEE Trans Circuits Syst Video Technol 27(3):683–695
Ye O, Deng J, Yu Z, Liu T, Dong L (2020) Abnormal event detection via feature expectation subgraph calibrating classification in video surveillance scenes. IEEE Access 8:97564–97575
Bosi Y, Liu Y, Sun Q (2016) A content-adaptively sparse reconstruction method for abnormal events detection with low-rank property. IEEE Trans Syst Man Cybern Syst 47(4):704–716
Mehran R, Oyama A, Shah M (2009) Abnormal crowd behavior detection using social force model. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp 935–942
Fernando T, Denman S, Sridharan S, Fookes C (2018) Soft+ hardwired attention: an lstm framework for human trajectory prediction and abnormal event detection. Neural Netw 108:466–478
Ullah A, Muhammad K, Del Ser J, Baik SW, de Albuquerque VHC (2018) Activity recognition using temporal optical flow convolutional features and multilayer lstm. IEEE Trans Ind Electron 66(12):9692–9702
Martinel N, Micheloni C, Piciarelli C, Foresti GL (2013) Camera selection for adaptive human–computer interface. IEEE Trans Syst Man Cybern Syst 44(5):653–664
Sabokrou M, Fathy M, Hoseini M, Klette R (2015) Real-time anomaly detection and localization in crowded scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 56–62
Liu L, Wang S, Hu B, Qiong Q, Wen J, Rosenblum DS (2018) Learning structures of interval-based bayesian networks in probabilistic generative model for human complex activity recognition. Pattern Recognit 81:545–561
Epaillard E, Bouguila N (2018) Variational Bayesian learning of generalized dirichlet-based hidden Markov models applied to unusual events detection. IEEE Trans Neural Netw Learn Syst 30(4):1034–1047
Ouivirach K, Gharti S, Dailey MN (2013) Incremental behavior modeling and suspicious activity detection. Pattern Recognit 46(3):671–680
Kratz L, Nishino K (2009) Anomaly detection in extremely crowded scenes using spatio-temporal motion pattern models. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, pp 1446–1453
Li J, Gong S, Xiang T (2008) Global behaviour inference using probabilistic latent semantic analysis. In: BMVC, vol 3231, p 3232
Isupova O, Kuzin D, Mihaylova L (2018) Learning methods for dynamic topic modeling in automated behavior analysis. IEEE Trans Neural Netw Learn Syst 29(9):3980–3993
Hospedales T, Gong S, Xiang T (2012) Video behaviour mining using a dynamic topic model. Int J Comput Vis 98(3):303–323
Tang X, Zhang S, Yao H (2013) Sparse coding based motion attention for abnormal event detection. In: 2013 IEEE International Conference on Image Processing. IEEE, pp 3602–3606
Ren H, Moeslund TB (2014) Abnormal event detection using local sparse representation. In: 2014 11th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS). IEEE, pp 125–130
He C, Shao J, Sun J (2018) An anomaly-introduced learning method for abnormal event detection. Multimed Tools Appl 77(22):29573–29588
Sun J, Wang X, Xiong N, Shao J (2018) Learning sparse representation with variational auto-encoder for anomaly detection. IEEE Access 6:33353–33361
Ko K-E, Sim K-B (2018) Deep convolutional framework for abnormal behavior detection in a smart surveillance system. Eng Appl Artif Intell 67:226–234
Sabokrou M, Fayyaz M, Fathy M, Moayed Z, Klette R (2018) Deep-anomaly: fully convolutional neural network for fast anomaly detection in crowded scenes. Comput Vis Image Underst 172:88–97
Chu W, Xue H, Yao C, Cai D (2018) Sparse coding guided spatiotemporal feature learning for abnormal event detection in large videos. IEEE Trans Multimed 21(1):246–255
Fang Z, Fei F, Fang Y, Lee C, Xiong N, Shu L, Chen S (2016) Abnormal event detection in crowded scenes based on deep learning. Multimed Tools Appl 75(22):14617–14639
Zhou JT, Du J, Zhu H, Peng X, Liu Y, Goh RSM (2019) Anomalynet: an anomaly detection network for video surveillance. IEEE Trans Inf Forensics Secur 14(10):2537–2550
Feng Y, Yuan Y, Xiaoqiang L (2017) Learning deep event models for crowd anomaly detection. Neurocomputing 219:548–556
Chan T-H, Jia K, Gao S, Jiwen L, Zeng Z, Ma Y (2015) PCANet: a simple deep learning baseline for image classification? IEEE Trans Image Process 24(12):5017–5032
Yong B, Zhang G, Chen H, Zhou Q (2017) Intelligent monitor system based on cloud and convolutional neural networks. J Supercomput 73(7):3260–3276
Zhao L, Su C, Dai Z, Huang H, Ding S, Huang X, Han Z (2019) Indoor device-free passive localization with DCNN for location-based services. J Supercomput. https://doi.org/10.1007/s11227-019-03110-2
Xiao Y, Yu D, Wang X, Lv Y, Fan T, Wu L (2020) SPCNet: spatial preserve and content-aware network for human pose estimation. arXiv e-prints arXiv:2004.05834
Tay N, Connie T, Ong TS, Teh PS (2019) Abnormal behavior recognition using CNN-LSTM with attention mechanism. In: 2019 1st International Conference on Electrical, Control and Instrumentation Engineering (ICECIE), pp 1–5
Liu W, Luo W, Lian D, Gao S (2018) Future frame prediction for anomaly detection—a new baseline. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 6536–6545
Woo S, Park J, Lee J-Y, So Kweon I (2018) Cbam: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp 3–19
Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A (2016) Learning deep features for discriminative localization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 2921–2929
Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 7132–7141
Mahadevan V, Li W, Bhalodia V, Vasconcelos N (2010) Anomaly detection in crowded scenes. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, pp 1975–1981
Lu C, Shi J, Jia J (2013) Abnormal event detection at 150 fps in matlab. In: Proceedings of the IEEE International Conference on Computer Vision, pp 2720–2727
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xia, L., Li, Z. A new method of abnormal behavior detection using LSTM network with temporal attention mechanism. J Supercomput 77, 3223–3241 (2021). https://doi.org/10.1007/s11227-020-03391-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-020-03391-y