Abstract
Human Action Recognition (HAR) has many applications in surveillance, gaming, animation and Active and Assisted Living (AAL). Several actions performed in daily life are composed of various poses arranged sequentially in time. Recognition of such actions is a difficult and challenging task. The classification approach proposed in this paper considers an analogy between actions and text, where an action is considered as a sentence and a single pose as a word. In the first stage, the poses are grouped based on their similarity and are then assigned labels. These labels are used for constructing label sequences representing motion. We propose Hierarchical Agglomerative Clustering (HAC) for clustering poses. Once the actions are modelled as the spatio-temporal evolution of key poses, we classify the actions using the Hidden Markov Model (HMM) and Hyper-dimensional Computing (HDC) classifiers. The experiments are performed on different datasets using both classifiers and the results are indicative of the effectiveness of the proposed approach in comparison with state-of-the-art methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Barmpoutis, P., Stathaki, T., Camarinopoulos, S.: Skeleton-based human action recognition through third-order tensor representation and spatio-temporal analysis. Inventions 4, 9 (2019)
Chin, Z.H., Ng, H., Yap, T.T.V., Tong, H.L., Ho, C.C., Goh, V.T.: Daily activities classification on human motion primitives detection dataset. Computational Science and Technology. LNEE, vol. 481, pp. 117–125. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-2622-6_12
Cippitelli, E., Gasparrini, S., Gambi, E., Spinsante, S.: A human activity recognition system using skeleton data from RGBD sensors. Comput. Intell. Neurosci. (2016)
Gaglio, S., Re, G.L., Morana, M.: Human activity recognition process using 3-D posture data. IEEE Trans. Hum.-Mach. Syst. 45, 586–597 (2015)
Gupta, R., Chia, A.Y.S., Rajan, D.: Human activities recognition using depth images. In: Proceedings of the 21st ACM International Conference on Multimedia, MM 2013 (2013)
Kanerva, P.: Hyperdimensional computing: an introduction to computing in distributed representation with high-dimensional random vectors. Cogn. Comput. 1, 139–159 (2009)
Kim, T.S., Reiter, A.: Interpretable 3D human action analysis with temporal convolutional networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2017)
Kim, Y., Imani, M., Rosing, T.S.: Efficient human activity recognition using hyperdimensional computing. In: Proceedings of the 8th International Conference on the Internet of Things (2018)
Konstantinidis, D., Dimitropoulos, K., Daras, P.: Skeleton-based action recognition based on deep learning and Grassmannian pyramids. In: 2018 26th European Signal Processing Conference (EUSIPCO) (2018)
Koppula, H.S., Gupta, R., Saxena, A.: Learning human activities and object affordances from RGB-D videos. CoRR (2012)
Lan, R., Sun, H., Zhu, M.: Text-like motion representation for human motion retrieval. In: Yang, J., Fang, F., Sun, C. (eds.) IScIDE 2012. LNCS, vol. 7751, pp. 72–81. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-36669-7_10
Liu, X., He, G., Peng, S., Cheung, Y., Tang, Y.Y.: Efficient human motion retrieval via temporal adjacent bag of words and discriminative neighborhood preserving dictionary learning. IEEE Trans. Hum.-Mach. Syst. 47, 763–776 (2017)
Mahasseni, B., Todorovic, S.: Regularizing long short term memory with 3D human-skeleton sequences for action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3054–3062 (2016)
Mokari, M., Mohammadzade, H., Ghojogh, B.: Recognizing involuntary actions from 3D skeleton data using body states. CoRR (2017)
Pantuwong, N., Takahara, K., Sugimoto, M.: A rapid motion retrieval technique using simple and discrete representation of motion data. In: 2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE) (2015)
Patel, A., Shah, P.: IIITV@INLI-2018: Hyperdimensional Computing for Indian Native Language Identification. INLI track at Forum for Information Retrieval Evaluation DAIICT, Gandhinagar (2018)
Stamp, M.: A revealing introduction to hidden Markov models (2004)
Sung, J., Ponce, C., Selman, B., Saxena, A.: Unstructured human activity detection from RGBD images. In: Proceedings - IEEE International Conference on Robotics and Automation, July 2011
Theodorakopoulos, I., Kastaniotis, D., Economou, G., Fotopoulos, S.: Pose-based human action recognition via sparse representation in dissimilarity space. J. Vis. Commun. Image Represent. 25(1), 12–23 (2014)
Yan, S., Xiong, Y., Lin, D.: Spatial temporal graph convolutional networks for skeleton-based action recognition. In: Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Yang, X., Tian, Y.: Effective 3D action recognition using EigenJoints. J. Vis. Commun. Image Represent. 25, 2–11 (2014)
Zhang, P., Lan, C., Xing, J., Zeng, W., Xue, J., Zheng, N.: View adaptive neural networks for high performance skeleton-based human action recognition. IEEE Trans. Pattern Anal. Mach. Intell. 41, 1963–1978 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Tyagi, A., Patel, A., Shah, P. (2020). Text Like Classification of Skeletal Sequences for Human Action Recognition. In: Palaiahnakote, S., Sanniti di Baja, G., Wang, L., Yan, W. (eds) Pattern Recognition. ACPR 2019. Lecture Notes in Computer Science(), vol 12047. Springer, Cham. https://doi.org/10.1007/978-3-030-41299-9_26
Download citation
DOI: https://doi.org/10.1007/978-3-030-41299-9_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-41298-2
Online ISBN: 978-3-030-41299-9
eBook Packages: Computer ScienceComputer Science (R0)