Human Action Recognition by SOM Considering the Probability of Spatio-temporal Features

Yanli Ji,
Atsushi Shimada &
Rin-ichiro Taniguchi

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6444)

Included in the following conference series:

International Conference on Neural Information Processing


Abstract

This paper presents an action recognition system that uses a compact 3D descriptor to represent action information and a self-organizing map (SOM) to learn and recognize actions. Histogram of Gradient 3D (HOG3D) performs better than the other descriptors currently used for action recognition. However, it is costly to compute, and it describes each interest point with a vector of 960 elements. We therefore propose a compact descriptor that shortens the support region of each interest point and combines symmetric bins after orientation quantization. In addition, only the top-valued bin of the quantized vector is kept, instead of applying an experimentally chosen threshold. Compared with HOG3D, our descriptor uses 80 bins to describe a point, which substantially reduces computational complexity. The compact descriptor is used to learn and recognize actions with an SOM that takes the probability of local features into account, and the results show that our system outperforms other methods on both the KTH and Hollywood datasets.
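The two compaction steps named in the abstract, combining symmetric bins after orientation quantization and keeping only the top-valued bin, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the 20 quantization directions are taken to be the vertices of a regular dodecahedron (10 axes and their opposites), positive projections are used as bin values, and the function names `full_histogram`, `merge_symmetric`, `keep_top_bin`, `cell_histogram` and `describe` are hypothetical. The split of the 80 bins into 8 cells of 10 bins is likewise one layout consistent with the stated length, not a layout the abstract confirms.

```python
import math

PHI = (1 + math.sqrt(5)) / 2  # golden ratio

def half_axes():
    """Ten unit axes (assumed layout); each axis and its opposite are
    two of the 20 vertices of a regular dodecahedron."""
    raw = [(1, 1, 1), (1, 1, -1), (1, -1, 1), (1, -1, -1),
           (0, 1 / PHI, PHI), (0, 1 / PHI, -PHI),
           (1 / PHI, PHI, 0), (1 / PHI, -PHI, 0),
           (PHI, 0, 1 / PHI), (PHI, 0, -1 / PHI)]
    # Every raw vector has length sqrt(3), so one scale normalizes all.
    return [tuple(c / math.sqrt(3) for c in v) for v in raw]

AXES = half_axes()

def full_histogram(g):
    """Quantize one 3D gradient into 20 bins: positive projections on the
    10 axes, followed by positive projections on their 10 opposites."""
    proj = [sum(a * b for a, b in zip(axis, g)) for axis in AXES]
    return [max(p, 0.0) for p in proj] + [max(-p, 0.0) for p in proj]

def merge_symmetric(hist):
    """Combine each bin with the bin of the opposite direction (20 -> 10)."""
    half = len(hist) // 2
    return [hist[i] + hist[i + half] for i in range(half)]

def keep_top_bin(hist):
    """Keep only the largest bin and zero the rest (no tuned threshold)."""
    top = max(range(len(hist)), key=lambda i: hist[i])
    return [v if i == top else 0.0 for i, v in enumerate(hist)]

def cell_histogram(gradients):
    """Accumulate the compacted votes of all gradients in one cell."""
    acc = [0.0] * len(AXES)
    for g in gradients:
        vote = keep_top_bin(merge_symmetric(full_histogram(g)))
        for i, v in enumerate(vote):
            acc[i] += v
    return acc

def describe(cells):
    """Concatenate per-cell histograms into one descriptor vector."""
    return [v for cell in cells for v in cell_histogram(cell)]
```

Under these assumptions, a gradient and its negation vote for the same merged bin, and a support region split into 8 cells yields a vector of 8 × 10 = 80 values, which matches the descriptor length stated in the abstract.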

Author information

Authors and Affiliations

Department of Advanced Information Technology, Kyushu University, Fukuoka, Japan
Yanli Ji, Atsushi Shimada & Rin-ichiro Taniguchi

Editor information

Editors and Affiliations

School of Information Technology, Murdoch University, 6150, Murdoch, WA, Australia
Kok Wai Wong
The Australian National University, 0200, Canberra, ACT, Australia
B. Sumudu U. Mendis
School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, Northfields Avenue, 2522, P.O. Box, Wollongong, NSW, Australia
Abdesselam Bouzerdoum

About this paper

Cite this paper

Ji, Y., Shimada, A., Taniguchi, Ri. (2010). Human Action Recognition by SOM Considering the Probability of Spatio-temporal Features. In: Wong, K.W., Mendis, B.S.U., Bouzerdoum, A. (eds) Neural Information Processing. Models and Applications. ICONIP 2010. Lecture Notes in Computer Science, vol 6444. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17534-3_48

DOI: https://doi.org/10.1007/978-3-642-17534-3_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17533-6
Online ISBN: 978-3-642-17534-3
eBook Packages: Computer Science, Computer Science (R0)
