Abstract
Facial expression recognition plays a significant role in human behavior detection. In this study, we present an efficient and fast facial expression recognition system. We introduce a new feature called W_HOG where W indicates discrete wavelet transform and HOG indicates histogram of oriented gradients feature. The proposed framework comprises of four stages: (i) Face processing, (ii) Domain transformation, (iii) Feature extraction and (iv) Expression recognition. Face processing is composed of face detection, cropping and normalization steps. In domain transformation, spatial domain features are transformed into the frequency domain by applying discrete wavelet transform (DWT). Feature extraction is performed by retrieving Histogram of Oriented Gradients (HOG) feature in DWT domain which is termed as W_HOG feature. For expression recognition, W_HOG feature is supplied to a well-designed tree based multiclass support vector machine (SVM) classifier with one-versus-all architecture. The proposed system is trained and tested with benchmark CK+, JAFFE and Yale facial expression datasets. Experimental results of the proposed method are effective towards facial expression recognition and outperforms existing methods.
Similar content being viewed by others
References
Addison PS (2017) The illustrated wavelet transform handbook: introductory theory and applications in science, engineering, medicine and finance. CRC press, Boca Raton
Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041
Al Chanti D, Caplier A (2017) Spontaneous facial expression recognition using sparse representation. In: 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2017) (vol 5, no. 64–74, p 11), Porto, Portugal
Ali G, Iqbal MA, Choi TS (2016) Boosted NNE collections for multicultural facial expression recognition. Pattern Recogn 55:14–27. https://doi.org/10.1016/j.patcog.2016.01.032
Bousmalis K, Mehu M, Pantic M (2013) Towards the automatic detection of spontaneous agreement and disagreement based on nonverbal behaviour: a survey of related cues, databases, and tools. Image Vis Comput 31(2):203–221
Burges CJ (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2(2):121–167
Burkert, P., Trier, F., Afzal, M. Z., Dengel, A., and Liwicki, M. (2015). Dexpression: deep convolutional neural network for expression recognition. arXiv preprint arXiv:1509.05371.URL http://arxiv.org/abs/1509.05371
Chew, S. W., Lucey, P., Lucey, S., Saragih, J., Cohn, J. F., & Sridharan, S. (2011). Person-independent facial expression detection using constrained local models. In Automatic Face & Gesture Recognition and Workshops (FG 2011), 2011 I.E. International Conference on (pp. 915-920). IEEE
Chiong R (ed) (2009) Intelligent systems for automated learning and adaptation: emerging trends and applications: emerging trends and applications. IGI Global, Hershey
Corneanu CA, Simón MO, Cohn JF, Guerrero SE (2016) Survey on rgb, 3d, thermal, and multimodal approaches for facial expression recognition: history, trends, and affect-related applications. IEEE Trans Pattern Anal Mach Intell 38(8):1548–1568
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Dailey MN, Joyce C, Lyons MJ, Kamachi M, Ishi H, Gyoba J, Cottrell GW (2010) Evidence and a computational explanation of cultural differences in facial expression recognition. Emotion 10(6):874–893
Dalal, N., and Triggs, B. (2005). Histograms of oriented gradients for human detection. In IEEE computer society conference on computer vision and pattern recognition, CVPR 2005. (Vol. 1, pp. 886-893)
Ekman P, Friesen W (1978) Facial action coding system: a technique for the measurement of facial movement. Consulting Psychologists, San Francisco
Eskil MT, Benli KS (2014) Facial expression recognition based on anatomy. Comput Vis Image Underst 119:1–14
Fan X, Tjahjadi T (2015) A spatial-temporal framework based on histogram of gradients and optical flow for facial expression recognition in video sequences. Pattern Recogn 48(11):3407–3416
Fan X, Tjahjadi T (2017) A dynamic framework based on local Zernike moment and motion history image for facial expression recognition. Pattern Recogn 64:399–406
Franco L, Treves A (2001) A neural network facial expression recognition system using unsupervised local processing. In 2nd International Symposium on Image and Signal Processing and Analysis (ISPA’01) (pp. 628-632)
Guo G, Dyer CR (2003) Simultaneous feature selection and classifier training via linear programming: a case study for face expression recognition. In 2003 I.E. Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’03) (Vol. 1, pp. 346-352)
Hegde GP, Seetha M (2017) Subspace based expression recognition using combinational Gabor based feature fusion. International Journal of Image, Graphics and Signal Processing 9(1):50
Huang GB, Zhu QY, Siew CK (2004) Extreme learning machine: a new learning scheme of feedforward neural networks. In proceedings 2004 I.E. International Joint Conference on Neural Networks, (Vol. 2, pp. 985-990)
Huang GB, Zhou H, Ding X, Zhang R (2012) Extreme learning machine for regression and multiclass classification. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 42(2):513–529
Iosifidis A, Tefas A, Pitas I (2017) Approximate kernel extreme learning machine for large scale data classification. Neurocomputing 219:210–220
Jiang R, Ho AT, Cheheb I, Al-Maadeed N, Al-Maadeed S, Bouridane A (2017) Emotion recognition from scrambled facial images via many graph embedding. Pattern Recogn 67:245–251
Jung H, Lee S, Park S, Kim B, Kim J, Lee I, Ahn C (2015) Development of deep learning-based facial expression recognition system. In Frontiers of computer vision (FCV), 2015 21st Korea-Japan joint workshop on (pp. 1-4). IEEE
Jung H, Lee S, Yim J, Park S, Kim J (2015) Joint fine-tuning in deep neural networks for facial expression recognition. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2983-2991)
Khan SA, Hussain A, Usman M (2017) Reliable facial expression recognition for multi-scale images using weber local binary image based cosine transform features. Multimedia Tools and Applications, 1-33
Kumbhar M, Jadhav A, Patil M (2012) Facial expression recognition based on image feature. International Journal of Computer and Communication Engineering 1(2):117–119
Li W, Li M, Su Z, Zhu Z (2015) A deep-learning approach to facial expression recognition with candid images. In Machine Vision Applications (MVA), 2015 14th IAPR International Conference on (pp. 279-282). IEEE
Liu M, Li S, Shan S, Wang R, Chen X (2014) Deeply learning deformable facial action parts model for dynamic expression analysis. In Asian conference on computer vision (pp. 143-157). Springer, Cham
Liu, M., Shan, S., Wang, R., & Chen, X. (2014). Learning expressionlets on spatio-temporal manifold for dynamic facial expression recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1749-1756)
Liu P, Han S, Meng Z, Tong Y (2014) Facial expression recognition via a boosted deep belief network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1805-1812)
Liu M, Li S, Shan S, Chen X (2015) Au-inspired deep networks for facial expression feature learning. Neurocomputing 159:126–136
Liu Y, Nie L, Han L, Zhang L, Rosenblum DS (2015) Action2Activity: recognizing complex activities from sensor data. In IJCAI (pp. 1617-1623)
Liu L, Cheng L, Liu Y, Jia Y, Rosenblum DS (2016) Recognizing complex activities by a probabilistic interval-based model. In AAAI (Vol. 30, pp. 1266-1272)
Liu Y, Nie L, Liu L, Rosenblum DS (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181:108–115
Liu Y, Zhang L, Nie L, Yan Y, Rosenblum DS (2016) Fortune teller: predicting your career path. In AAAI (pp. 201-207)
Liu Y, Zheng Y, Liang Y, Liu S, Rosenblum DS (2016) Urban water quality prediction based on multi-task multi-view learning
Lopes AT, de Aguiar E, De Souza AF, Oliveira-Santos T (2017) Facial expression recognition with convolutional neural networks: coping with few data and the training sample order. Pattern Recogn 61:610–628
Lucey P, Cohn JF, Kanade T, Saragih J, Ambadar Z, Matthews I (2010) The extended cohn-kanade dataset (ck+): a complete dataset for action unit and emotion-specified expression. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2010 I.E. Computer Society Conference on (pp. 94-101). IEEE
Lv, Y., Feng, Z., & Xu, C. (2014). Facial expression recognition via deep learning. In smart computing (SMARTCOMP), 2014 international conference on (pp. 303-308). IEEE
Malisiewicz T, Gupta A, Efros AA (2011). Ensemble of exemplar-SVMs for object detection and beyond. In 2011 I.E. International Conference on Computer Vision (ICCV’11) (pp. 89-96)
Meng Z, Liu P, Cai J, Han S, Tong Y (2017). Identity-aware convolutional neural network for facial expression recognition. In Automatic Face & Gesture Recognition (FG 2017), 2017 12th IEEE international conference on (pp. 558-565). IEEE
Nigam S, Khare A (2015) Multiresolution approach for multiple human detection using moments and local binary patterns. Multimedia Tools and Applications 74(17):7037–7062
Nigam S, Khare A (2015) Multiscale local binary patterns for facial expression-based human emotion recognition, In Computational Vision and Robotics. Springer, New Delhi, pp 71–77
Nigam S, Khare A (2016) Integration of moment invariants and uniform local binary patterns for human activity recognition in video sequences. Multimedia Tools and Applications 75(24):17303–17332
Nigam S, Singh R, Misra AK (2018) Towards intelligent human behavior detection for video surveillance. In Advancements in Computer Vision and Image Processing (pp. 34-73). IGI Global
Nikitidis S, Tefas A, Nikolaidis N, Pitas I (2012) Subclass discriminant nonnegative matrix factorization for facial image analysis. Pattern Recogn 45(12):4080–4091
Pantic, M. (2009). Facial expression recognition. In encyclopedia of biometrics (pp. 400-406). Springer US
Poria S, Cambria E, Bajpai R, Hussain A (2017) A review of affective computing: from unimodal analysis to multimodal fusion. Information Fusion 37:98–125
Preoţiuc-Pietro D, Liu Y, Hopkins D, Ungar L (2017). Beyond binary labels: political ideology prediction of twitter users. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (Vol. 1, pp. 729-740)
Rudovic O, Pantic M, Patras I (2013) Coupled Gaussian processes for pose-invariant facial expression recognition. IEEE Trans Pattern Anal Mach Intell 35(6):1357–1369
Russell JA, Fernández-Dols JM (eds) (2017) The science of facial expression. Oxford University Press, Oxford
Shan C, Gong S, McOwan PW (2009) Facial expression recognition based on local binary patterns: a comprehensive study. Image Vis Comput 27(6):803–816
Shinohara Y, Otsu N (2004) Facial expression recognition using fisher weight maps. In sixth IEEE international conference on automatic face and gesture recognition (pp. 499-504)
Singh AK, Shukla VP, Tiwari S, Biradar SR (2015) Wavelet based histogram of oriented gradients feature descriptors for classification of partially occluded objects. International Journal of Intelligent Systems and Applications 7(3):54–61
Song I, Kim HJ, Jeon PB (2014). Deep learning for real-time robust facial expression recognition on a smartphone. In 2014 I.E. international conference on consumer electronics (ICCE), (pp. 564-567)
Spiers DL (2016) Facial emotion detection using deep learning. Doctoral Dissertation, UPPSALA Universitet
Tian Y, Kanade T, Cohn JF (2011) Facial expression recognition. In handbook of face recognition (pp. 487-519). Springer London
UCSD Computer Vision. Yale facial expression database. http://vision.ucsd.edu/content/yale-facedatabase. Accessed 28 Dec 2017
Uddin MZ, Lee JJ, Kim TS (2009) An enhanced independent component-based human facial expression recognition from video. IEEE Trans Consum Electron 55(4):2216–2224
Valstar M, Pantic M (2006) Fully automatic facial action unit detection and temporal analysis. In IEEE computer society computer vision and pattern recognition workshop (CVPRW'06) (pp. 149-149)
Valstar MF, Patras I, Pantic M (2005) Facial action unit detection using probabilistic actively learned support vector machines on tracked facial point data. In IEEE Computer Society Computer Vision and Pattern Recognition Workshop (CVPRW'05) (Vol. 3, pp. 76-76)
Valstar M, Zafeiriou S, Pantic M (2017) Chapter 11: facial actions as social signals. In: Burgoon JK, Magnenat-Thalmann N, Pantic M, Vinciarelli A (eds) Social signal processing. Cambridge University Press. https://doi.org/10.1017/9781316676202. Available online at: www.cambridge.org/9781107161269
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wang Z, Wang S, Ji Q (2013) Capturing complex spatio-temporal relations among facial muscles for facial expression recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3422-3429)
Yang MH (2002) Kernel Eigenfaces vs. Kernel Fisherfaces: face recognition using kernel methods. In proceedings of the 5th IEEE International Conference on automatic face and Gesture Recognition, pp. 215–220
Yang J, Yang J-Y (2002) Generalized K–L transform based combined feature extraction. Pattern Recogn 35(1):295–297
Zhang Z (1999) Feature based facial expression recognition: sensitivity analysis and experiment with a multi-layer perceptron. Int J Pattern Recognit Artif Intell 13(6):893–911
Zhao X, Zhang S (2016) A review on facial expression recognition: feature extraction and classification. IETE Tech Rev 33(5):505–517
Acknowledgements
This work is supported by Science and Engineering Research Board, Department of Science and Technology, Government of India under grant number PDF/2016/003644.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nigam, S., Singh, R. & Misra, A.K. Efficient facial expression recognition using histogram of oriented gradients in wavelet domain. Multimed Tools Appl 77, 28725–28747 (2018). https://doi.org/10.1007/s11042-018-6040-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6040-3