A Bi-directional Residual Network for Image Expression Recognition

Daihong Jiang¹⁷,
Sanyou Zhang¹⁸,
Cheng Yu¹⁷ &
…
Chuangeng Tian¹⁷

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 369)

Included in the following conference series:

International Conference on Simulation Tools and Techniques

Abstract

This paper proposes an improved model for image expression recognition, the bi-directional residual network, which combines residual blocks with inverted residual blocks. The main objective is to alleviate the feature dispersion caused by very deep networks in traditional expression recognition research. Residual blocks address this problem well. However, a residual network trained on a small dataset overfits easily, and small datasets are common in image expression recognition. Inverted residual blocks are therefore adopted to improve the robustness of the network during training. Three network structures, which differ in the order in which residual blocks and inverted residual blocks are arranged, are proposed and studied. Experiments are conducted on two facial expression datasets, Fer2013 and CK+. The results show that the optimized algorithm improves accuracy on the Fer2013 dataset by 2.79% compared with ResNet-50 models.
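To make the contrast between the two block types concrete, the following is a minimal sketch of their channel-width profiles and convolution weight counts. It is not the configuration used in this paper, which the abstract does not specify. It assumes the bottleneck form of the residual block from ResNet-50 (channel reduction of 4) and the inverted residual block from MobileNetV2 (channel expansion of 6, depthwise 3x3 convolution). The function names and the choice of 64 channels are hypothetical, and biases and batch normalization parameters are ignored.

```python
def conv_params(c_in, c_out, k, groups=1):
    """Weight count of a k x k convolution (no bias, no batch norm)."""
    assert c_in % groups == 0 and c_out % groups == 0
    return (c_in // groups) * c_out * k * k


def residual_bottleneck(channels, reduction=4):
    """ResNet-style bottleneck: wide -> narrow -> wide.

    reduction=4 follows the ResNet-50 bottleneck; it is an assumption here.
    """
    mid = channels // reduction
    layers = [
        ("1x1 reduce", conv_params(channels, mid, 1)),
        ("3x3 conv", conv_params(mid, mid, 3)),
        ("1x1 expand", conv_params(mid, channels, 1)),
    ]
    widths = [channels, mid, mid, channels]
    return widths, layers


def inverted_residual(channels, expansion=6):
    """MobileNetV2-style inverted residual: narrow -> wide -> narrow.

    expansion=6 is the MobileNetV2 default; it is an assumption here.
    The 3x3 convolution is depthwise (one filter per channel).
    """
    mid = channels * expansion
    layers = [
        ("1x1 expand", conv_params(channels, mid, 1)),
        ("3x3 depthwise", conv_params(mid, mid, 3, groups=mid)),
        ("1x1 linear project", conv_params(mid, channels, 1)),
    ]
    widths = [channels, mid, mid, channels]
    return widths, layers


def total_params(layers):
    """Sum the weight counts of all convolutions in a block."""
    return sum(count for _, count in layers)


if __name__ == "__main__":
    for name, block in [("residual bottleneck", residual_bottleneck),
                        ("inverted residual", inverted_residual)]:
        widths, layers = block(64)
        print(name, "widths:", widths, "weights:", total_params(layers))
```

In both blocks the input and output widths match, so the skip connection can add the input to the output directly. The residual bottleneck narrows the representation in the middle, while the inverted residual widens it, which is the sense in which the two blocks run in opposite directions.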

Acknowledgement

The study was supported by the Major Project of Natural Science Research of the Jiangsu Higher Education Institutions of China (18KJA520012), and the Xuzhou Science and Technology Plan Project (KC19197).

Author information

Authors and Affiliations

Xuzhou University of Technology, Xuzhou, 221000, Jiangsu, China
Daihong Jiang, Cheng Yu & Chuangeng Tian
China University of Mining and Technology, Xuzhou, 221000, Jiangsu, China
Sanyou Zhang

Editor information

Editors and Affiliations

Embry-Riddle Aeronautical University, Daytona Beach, FL, USA
Houbing Song
School of Astronautics and Aeronautic, UESTC, Chengdu, China
Dingde Jiang

About this paper

Cite this paper

Jiang, D., Zhang, S., Yu, C., Tian, C. (2021). A Bi-directional Residual Network for Image Expression Recognition. In: Song, H., Jiang, D. (eds) Simulation Tools and Techniques. SIMUtools 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 369. Springer, Cham. https://doi.org/10.1007/978-3-030-72792-5_16

DOI: https://doi.org/10.1007/978-3-030-72792-5_16
Published: 27 April 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-72791-8
Online ISBN: 978-3-030-72792-5
eBook Packages: Computer Science, Computer Science (R0)
