Pose-invariant face recognition with multitask cascade networks

Elharrouss, Omar; Almaadeed, Noor; Al-Maadeed, Somaya; Khelifi, Fouad

doi:10.1007/s00521-021-06690-4

Original Article
Published: 09 January 2022

Volume 34, pages 6039–6052 (2022)

Neural Computing and Applications

Omar Elharrouss¹ (ORCID: orcid.org/0000-0002-5341-5440),
Noor Almaadeed¹,
Somaya Al-Maadeed¹ &
Fouad Khelifi²

Abstract

This work proposes a face recognition method for faces under pose variations, based on a multitask convolutional neural network (CNN). A pose estimation method followed by a face identification module is combined in a cascaded structure, and the two can also be used separately. Under varying facial expressions and low illumination, datasets that separate face poses can improve the robustness of face recognition. First, the proposed method relies on a CNN-based pose estimation module trained on three categories of face capture: left side, frontal, and right side. Second, three CNN models are used for face identification according to the estimated pose. The Left-CNN, Front-CNN, and Right-CNN models identify the face for the left, frontal, and right poses, respectively. Because face images may contain irrelevant information (e.g., background content), we also propose a skin-based face segmentation method using structure–texture decomposition and a color-invariant descriptor. The resulting cascade-based face recognition system, which consists of the aforementioned steps (i.e., pose estimation, face segmentation, and face identification), is evaluated on four different datasets and is shown to outperform related state-of-the-art techniques. The results show the contribution of the separate pose representation, skin segmentation, and pose estimation to recognition robustness.
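The cascade described in the abstract can be pictured as a simple dispatch: estimate the pose, segment the face, then hand the result to the identifier trained for that pose. The following is only a minimal sketch of that control flow, not the authors' implementation. The names `Pose`, `CascadeRecognizer`, `toy_pose`, and `toy_segment`, the dictionary-based toy "image", and the yaw thresholds are all hypothetical stand-ins; the step order simply follows the order in which the abstract lists the steps, which is an assumption.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any, Callable, Dict, Tuple


class Pose(Enum):
    """The three pose categories named in the abstract."""
    LEFT = "left"
    FRONT = "front"
    RIGHT = "right"


@dataclass
class CascadeRecognizer:
    """Hypothetical wrapper: pose estimation, segmentation, then per-pose identification."""
    estimate_pose: Callable[[Any], Pose]           # stands in for the pose-estimation CNN
    segment_face: Callable[[Any], Any]             # stands in for skin-based segmentation
    identifiers: Dict[Pose, Callable[[Any], str]]  # stands in for Left-/Front-/Right-CNN

    def recognize(self, image: Any) -> Tuple[Pose, str]:
        pose = self.estimate_pose(image)         # step 1: which pose category?
        face = self.segment_face(image)          # step 2: drop background content
        identity = self.identifiers[pose](face)  # step 3: pose-specific identifier
        return pose, identity


# Toy stand-ins so the sketch runs without any trained model (all assumptions).
def toy_pose(image: dict) -> Pose:
    yaw = image["yaw"]  # hypothetical head yaw in degrees
    if yaw < -15:
        return Pose.LEFT
    if yaw > 15:
        return Pose.RIGHT
    return Pose.FRONT


def toy_segment(image: dict) -> dict:
    # Keep only the "face" part of the toy image, discarding the background.
    return {"person": image["person"]}


recognizer = CascadeRecognizer(
    estimate_pose=toy_pose,
    segment_face=toy_segment,
    identifiers={
        Pose.LEFT: lambda face: "left-model:" + face["person"],
        Pose.FRONT: lambda face: "front-model:" + face["person"],
        Pose.RIGHT: lambda face: "right-model:" + face["person"],
    },
)

result = recognizer.recognize({"yaw": -40, "person": "alice", "background": "street"})
```

With the toy inputs above, the left-pose identifier is the one that gets called. In a real system each callable would wrap a trained network, and only the identifier matching the estimated pose would run for a given image.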

Acknowledgements

This publication was made possible by NPRP Grant # NPRP8-140-2-065 from the Qatar National Research Fund (a member of the Qatar Foundation). The statements made herein are solely the responsibility of the authors.

Funding

This publication was made possible by NPRP Grant # NPRP8-140-2-065 from the Qatar National Research Fund (a member of the Qatar Foundation). The statements made herein are solely the responsibility of the authors.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Qatar University, Doha, Qatar
Omar Elharrouss, Noor Almaadeed & Somaya Al-Maadeed
Department of Computer Science and Digital Technologies, Northumbria University, Newcastle Upon Tyne, Newcastle, UK
Fouad Khelifi

Corresponding author

Correspondence to Omar Elharrouss.

Ethics declarations

Conflict of interest

I certify that there is no actual or potential conflict of interest in relation to this article.

Conflict of interest

The authors declare that they have no competing interests.

Financial interests

I certify that there is no financial interest in relation to this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Elharrouss, O., Almaadeed, N., Al-Maadeed, S. et al. Pose-invariant face recognition with multitask cascade networks. Neural Comput & Applic 34, 6039–6052 (2022). https://doi.org/10.1007/s00521-021-06690-4

Received: 06 October 2020
Accepted: 27 October 2021
Published: 09 January 2022
Issue Date: April 2022
DOI: https://doi.org/10.1007/s00521-021-06690-4
