[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/3378184.3378225acmotherconferencesArticle/Chapter ViewAbstractPublication PagesappisConference Proceedingsconference-collections
research-article

A Face Recognition System for Assistive Robots

Published: 17 February 2020 Publication History

Abstract

Assistive robots collaborating with people demand strong Human-Robot interaction capabilities. In this way, recognizing the person the robot has to interact with is paramount to provide a personalized service and reach a satisfactory end-user experience. To this end, face recognition: a non-intrusive, automatic mechanism of identification using biometric identifiers from an user's face, has gained relevance in the recent years, as the advances in machine learning and the creation of huge public datasets have considerably improved the state-of-the-art performance. In this work we study different open-source implementations of the typical components of state-of-the-art face recognition pipelines, including face detection, feature extraction and classification, and propose a recognition system integrating the most suitable methods for their utilization in assistant robots. Concretely, for face detection we have considered MTCNN, OpenCV's DNN, and OpenPose, while for feature extraction we have analyzed InsightFace and Facenet. We have made public an implementation of the proposed recognition framework, ready to be used by any robot running the Robot Operating System (ROS). The methods in the spotlight have been compared in terms of accuracy and performance in common benchmark datasets, namely FDDB and LFW, to aid the choice of the final system implementation, which has been tested in a real robotic platform.

References

[1]
2017. Face alignment in-the-wild: A Survey. Computer Vision and Image Understanding 162 (2017), 1--22.
[2]
G. Bradski. 2000. The OpenCV Library. Dr. Dobb's Journal of Software Tools (2000).
[3]
Qiong Cao, Li Shen, Weidi Xie, Omkar M Parkhi, and Andrew Zisserman. 2018. Vggface2: A dataset for recognising faces across pose and age. In 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition. 67--74.
[4]
Zhe Cao, Gines Hidalgo, Tomas Simon, Shih-En Wei, and Yaser Sheikh. 2018. OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields. CoRR abs/1812.08008 (2018).
[5]
D Chaves, JR Ruiz-Sarmiento, N Petkov, and J Gonzalez-Jimenez. 2019. Integration of CNN into a Robotic Architecture to Build Semantic Maps of Indoor Environments. In International Work-Conference on Artificial Neural Networks. Springer, 313--324.
[6]
Jiankang Deng, Jia Guo, Niannan Xue, and Stefanos Zafeiriou. 2019. Arcface: Additive angular margin loss for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4690--4699.
[7]
Eurostat. [n.d.]. Population structure and ageing. Recovered from: https://ec.europa.eu/eurostat/statistics-explained/index.php/Population_structure_and_ageing. Accessed: 2019-09-04.
[8]
Mercedes Garcia-Salguero, Javier Monroy, Alejandro Solano, and Javier Gonzalez-Jimenez. 2019. Socially Acceptable Approach to Humans by a Mobile Robot. In International Conference on Applications of Intelligent Systems (APPIS).
[9]
J. González-Jiménez, C. Galindo, and J. R. Ruiz-Sarmiento. 2012. Technical improvements of the Giraff telepresence robot based on users' evaluation. In 2012 IEEE RO-MAN: The 21st IEEE International Symposium on Robot and Human Interactive Communication. Paris, France, 827--832.
[10]
Yandong Guo, Lei Zhang, Yuxiao Hu, Xiaodong He, and Jianfeng Gao. 2016. MS-CELEB-1m: A dataset and benchmark for large-scale face recognition. In European Conference on Computer Vision. Springer, 87--102.
[11]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[12]
Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller. 2007. Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments. Technical Report 07-49. University of Massachusetts, Amherst.
[13]
Vidit Jain and Erik Learned-Miller. 2010. FDDB: A Benchmark for Face Detection in Unconstrained Settings. Technical Report UM-CS-2010-009. University of Massachusetts, Amherst.
[14]
Ira Kemelmacher-Shlizerman, Steven M. Seitz, Daniel Miller, and Evan Brossard. 2016. The MegaFace Benchmark: 1 Million Faces for Recognition at Scale. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas.
[15]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. ImageNet Classification with Deep Convolutional Neural Networks. In Advances in Neural Information Processing Systems 25. 1097--1105.
[16]
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In European conference on computer vision. Springer, 740--755.
[17]
Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, and Le Song. 2017. Sphereface: Deep hypersphere embedding for face recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 212--220.
[18]
M. Luperto, J. Monroy, J. R. Ruiz-Sarmiento, F. Moreno, N. Basilico, J. Gonzalez-Jimenez, and N. A. Borghese. 2019. Towards Long-Term Deployment of a Mobile Robot for at-Home Ambient Assisted Living of the Elderly. In 2019 European Conference on Mobile Robots (ECMR). 1--6.
[19]
Pablo Navarrete and Javier Ruiz-del Solar. 2002. Comparative study between different eigenspace-based approaches for face recognition. In AFSS International Conference on Fuzzy Systems. Springer, 178--184.
[20]
Andrea Orlandini, Annica Kristoffersson, Lena Almquist, Patrik Björkman, Amedeo Cesta, Gabriella Cortellessa, Cipriano Galindo, Javier Gonzalez-Jimenez, et al. 2016. ExCITE Project: A review of forty-two months of robotic telepresence technology evolution. Presence: Teleoperators and Virtual Environments 25, 3 (2016), 204--221.
[21]
Kari Pulli, Anatoly Baksheev, Kirill Kornyakov, and Victor Eruhimov. 2012. Real-Time Computer Vision with OpenCV. Commun. ACM 55 (06 2012), 61--69.
[22]
Morgan Quigley, Ken Conley, Brian Gerkey, Josh Faust, Tully Foote, Jeremy Leibs, Rob Wheeler, and Andrew Ng. 2009. ROS: an open-source Robot Operating System. In ICRA Workshop on Open Source Software, Vol. 3. Kobe, Japan.
[23]
JR Ruiz-Sarmiento, Cipriano Galindo, and Javier Gonzalez. 2011. Improving human face detection through ToF cameras for ambient intelligence applications. In Ambient Intelligence-Software and Applications. Springer, 125--132.
[24]
Jose-Raul Ruiz-Sarmiento, Cipriano Galindo, Javier Monroy, Francisco-Angel Moreno, and Javier Gonzalez-Jimenez. 2019. Ontology-based conditional random fields for object recognition. Knowledge-Based Systems 168 (2019), 100--108.
[25]
Florian Schroff, Dmitry Kalenichenko, and James Philbin. 2015. FaceNet: A Unified Embedding for Face Recognition and Clustering. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Boston, MA.
[26]
Karen Simonyan and Andrew Zisserman. 2014. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv:1409.1556 (Sep 2014).
[27]
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going Deeper With Convolutions. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1--9.
[28]
Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato, and Lior Wolf. 2014. Deepface: Closing the gap to human-level performance in face verification. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 1701--1708.
[29]
Paul Viola and Michael J Jones. 2004. Robust real-time face detection. International journal of computer vision 57, 2 (2004), 137--154.
[30]
Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, and Wei Liu. 2018. Cosface: Large margin cosine loss for deep face recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 5265--5274.
[31]
Sijie Yan, Yuanjun Xiong, and Dahua Lin. 2018. Spatial temporal graph convolutional networks for skeleton-based action recognition. In Thirty-Second AAAI Conference on Artificial Intelligence.
[32]
K. Zhang, Z. Zhang, Z. Li, and Y. Qiao. 2016. Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Processing Letters 23, 10 (Oct 2016), 1499--1503.
[33]
David Zuñiga-Noël, Jose-Raul Ruiz-Sarmiento, and Javier Gonzalez-Jimenez. 2019. Intrinsic Calibration of Depth Cameras for Mobile Robots Using a Radial Laser Scanner. In International Conference on Computer Analysis of Images and Patterns. Springer, 659--671.

Cited By

View all
  • (2024)Sistema multimodal para la orientación de robots móviles hacia su interlocutorJornadas de Automática10.17979/ja-cea.2024.45.10939Online publication date: 19-Jul-2024
  • (2024)Deep learning and machine learning techniques for head pose estimation: a surveyArtificial Intelligence Review10.1007/s10462-024-10936-757:10Online publication date: 12-Sep-2024
  • (2023)An Efficient Biometric Identification Privacy Protection Protocol on the CloudComputer Science and Application10.12677/CSA.2023.13916313:09(1641-1654)Online publication date: 2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
APPIS 2020: Proceedings of the 3rd International Conference on Applications of Intelligent Systems
January 2020
214 pages
ISBN:9781450376303
DOI:10.1145/3378184
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 February 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Assistive Robots
  2. CNNs
  3. Face Detection
  4. Face Recognition
  5. Feature Extraction
  6. Mobile Robots
  7. ROS

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Universidad de Málaga
  • European H2020 program
  • Spanish Government

Conference

APPIS 2020

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)15
  • Downloads (Last 6 weeks)4
Reflects downloads up to 09 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Sistema multimodal para la orientación de robots móviles hacia su interlocutorJornadas de Automática10.17979/ja-cea.2024.45.10939Online publication date: 19-Jul-2024
  • (2024)Deep learning and machine learning techniques for head pose estimation: a surveyArtificial Intelligence Review10.1007/s10462-024-10936-757:10Online publication date: 12-Sep-2024
  • (2023)An Efficient Biometric Identification Privacy Protection Protocol on the CloudComputer Science and Application10.12677/CSA.2023.13916313:09(1641-1654)Online publication date: 2023
  • (2022)Evaluation of Various State of the Art Head Pose Estimation Algorithms for Clinical ScenariosSensors10.3390/s2218685022:18(6850)Online publication date: 10-Sep-2022
  • (2022)Efficient Biometric Identification on the Cloud With Privacy Preservation GuaranteeIEEE Access10.1109/ACCESS.2022.321870310(115520-115531)Online publication date: 2022
  • (2021)Framework for Controlling KNX Devices Based on GesturesUniversal Access in Human-Computer Interaction. Access to Media, Learning and Assistive Environments10.1007/978-3-030-78095-1_37(507-518)Online publication date: 3-Jul-2021

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media