Abstract
In this work, a face recognition method is proposed for face under pose variations using a multitask convolutional neural network (CNN). Furthermore, a pose estimation method followed by a face identification module is combined in a cascaded structure and used separately. In the presence of various facial expressions as well as low illuminations, datasets that include separated face poses can enhance the robustness of face recognition. The proposed method relies on a pose estimation module using a convolutional neural network model and trained on three categories of face image capture such as the left side, frontal, and right side. Second, three CNN models are used for face identification according to the estimated pose. The Left-CNN model, Front-CNN model, and Right-CNN model are used to identify the face for the left, frontal, and right pose of the face, respectively. Because face images may contain some useless information (e.g., background content), we propose a skin-based face segmentation method using structure–texture decomposition and the color-invariant descriptor. Experimental evaluation has been conducted using the proposed cascade-based face recognition system that consists of the aforementioned steps (i.e., pose estimation, face segmentation, and face identification) and is assessed on four different datasets and its superiority has been shown over related state-of-the-art techniques. Results reveal the contribution of the separate representation, skin segmentation, and pose estimation in the recognition robustness.
Similar content being viewed by others
References
Elharrouss O, Almaadeed N, Al-Maadeed S (2021) A Review of Video Surveillance Systems. J Vis Commun Image Represent p 103116
Akbari Y, Almaadeed N, Al-maadeed S, Elharrouss O (2021) Applications, databases and open computer vision research from drone videos and images: a survey. Artif Intell Rev 1–52
Wang Z et al (2014) Low-resolution face recognition: a review. Vis Comput 30(4):359–386
Yi D, Lei Z, Li SZ (2013) Towards pose robust face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)
Yu X, Xu F (2020) Random inverse packet information and its acquisition. Appl Math Nonlinear Sci 5(2):357–366
Xie T, Liu R, Wei Z (2020) Improvement of the fast clustering algorithm improved by k-means in the big data. Appl Math Nonlinear Sci 5(1):1–10
Li T, Yang W (2020) Solution to chance constrained programming problem in swap trailer transport organisation based on improved simulated annealing algorithm. Appl Math Nonlinear Sci 5(1):47–54
Crosswhite N et al (2018) Template adaptation for face verification and identification. Image Vis Comput 79:35–48
Abbad A et al (2018) Application of MEEMD in post-processing of dimensionality reduction methods for face recognition. IET Biometrics 8(1):59–68
Dadi HS, Pillutla GKM, Makkena ML (2017) Face recognition and human tracking using GMM, HOG and SVM in surveillance videos. Ann Data Sci 1–23
Xu W et al (2018) Sensor-assisted multi-view face recognition system on smart glass. IEEE Trans Mobile Comput 17(1):197–210
Bhowmik MK et al (2019) Enhancement of robustness of face recognition system through reduced gaussianity in Log-ICA. Expert Syst Appl 116:96–107
Li Y et al (2019) Occlusion aware facial expression recognition using cnn with attention mechanism. IEEE Trans Image Process 28(5):2439–2450
Ge S et al (2019) Low-resolution face recognition in the wild via selective knowledge distillation. IEEE Trans Image Process 28(4):2051–2062
Masi I, et al (2016) Pose-aware face recognition in the wild. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)
Pentland A, Moghaddam B, Starner T (1994) View-based and modular eigenspaces for face recognition
Sun Y, Wang X, Tang X (2014) Deep learning face representation from predicting 10,000 classes. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Schroff F, Kalenichenko D, Philbin J (2015) Facenet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Taigman Y, et al (2014) Deepface: closing the gap to human-level performance in face verification. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)
Bashbaghi, S et al (2019) Deep learning architectures for face recognition in video surveillance. Deep Learning in Object Detection and Recognition. Springer, Singapore, pp 133–154
Jourabloo A, et al (2017) Pose-invariant face alignment with a single CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV)
Parkhi OM, Vedaldi A, Zisserman A (2015) Deep face recognition. In: bmvc
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Banerjee S, Das S (2018) Mutual variation of information on transfer-CNN for face recognition with degraded probe samples. Neurocomputing 310:299–315
Zhang MM, Shang K, Wu H (2019) Deep Compact Discriminative representation for unconstrained face recognition. Signal Process Image Commun
He R et al (2018) Wasserstein CNN: learning invariant features for nir-vis face recognition. IEEE Trans Pattern Analysis Mach Intell
Hsu G-S et al (2018) Fast Landmark localization with 3D component reconstruction and CNN for cross-pose recognition. IEEE Trans Circuits Syst Video Technol 28(11):3194–3207
Huang GB, Learned-Miller E (2014) Labeled faces in the wild: updates and new reporting procedures. Dept. Comput. Sci., Univ. Massachusetts Amherst, Amherst, MA, USA, Tech. Rep, : p. 14-003
Littwin E, Wolf L (2016) The multiverse loss for robust transfer learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 3957–3966
Tadmor O, Wexler Y, Rosenwein T, Shalev-Shwartz S, Shashua A (2016) Learning a metric embedding for face recognition using the multibatch method. In: Proceedings of the NIPS, pp 10–11
Wen Y, Zhang K, Li Z, Qiao Y (2016) A discriminative feature learning approach for deep face recognition. In: Proceedings of the European conference on computer vision (ECCV), pp 499–515
Wang D, Otto C, Jain AK (2017) Face search at scale. IEEE Trans Pattern Anal Mach Intell 39(6):1122–1136
Sankaranarayanan, Swami AA, Castillo CD, Chellappa R (2016) Triplet probabilistic embedding for face verification and clustering. In: 2016 IEEE 8th international conference on biometrics theory, applications and systems (BTAS), pp 1-8. IEEE
Tran L, Yin X, Liu X (2017) Disentangled representation learning GAN for pose-invariant face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 1–10
Peng X, Yu X, Sohn K, Metaxas DN, Chandraker M (2017) Reconstruction-based disentanglement for pose-invariant face recognition. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 1–10
Patacchiola M, Cangelosi A (2017) Head pose estimation in the wild using Convolutional Neural Networks and adaptive gradient methods. Pattern Recognit 71:132–143
Sreekanth P, Kulkarni U, Shetty S, S M M (2018) Head Pose Estimation using Transfer Learning, 2018 International Conference on Recent Trends in Advance Computing (ICRTAC), Chennai, India, , pp. 73-79
Zavan FHDB, Bellon OR, Silva L, Medioni GG (2019) Benchmarking parts-based face processing in-the-wild for gender recognition and head pose estimation. Pattern Recognit Lett 123:104–110
Ranjan R, Patel VM, Chellappa R (2017) Hyperface: A deep multi-task learning framework for face detection, landmark localization, pose estimation, and gender recognition. IEEE Trans Pattern Anal Mach Intell 41(1):121–135
Xie W, Zisserman A (2018) Multicolumn networks for face recognition. arXiv preprint arXiv:1807.09192
Kemelmacher-Shlizerman I, Seitz SM, Miller D, Brossard E (2016) The MegaFace benchmark: 1 million faces for recognition at scale. In: IEEE conference on computer vision and pattern recognition (CVPR)
Masi I, Tran AT, Hassner T, Leksut JT, Medioni G (2016) Do we really need to collect millions of faces for effective face recognition? In European Conference on Computer Vision (ECCV)
Huang C, Li Y, Loy CC, Tang X (2016) Learning deep representation for imbalanced classification. In: CVPR
Wen Y, Zhang K, Li Z, Qiao Y (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision, pp 499–515. Springer
Liu W, Wen Y, Yu Z, Yang M (2016) Large-margin softmax loss for convolutional neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 507–516
Wu Y, Liu H, Li J, Fu Y (2017) Deep face recognition with center invariant loss. In: Thematic Workshop of ACM-MM
Liu W, Wen Y, Yu Z, Li M, Raj B, Song L (2017) Sphereface: Deep hypersphere embedding for face recognition. In: IEEE conf Comput Vis Pattern Recogt, volume 1
Liu W, Wen Y, Yu Z, Li M, Raj B, Song L (2017) Sphereface: deep hypersphere embedding for face recognition. In: IEEE conference on computer vision and pattern recognition (CVPR)
Wang H, Wang Y, Zhou Z, Ji X, Gong D, Zhou J, Li Z, Liu W (2018) Cosface: large margin cosine loss for deep face recognition. In: CVPR
Zhang X, Fang Z, Wen Y, Li Z, Qiao Y (2017) Range loss for deep face recognition with long-tailed training data. In: IEEE international conference on computer vision (ICCV)
Yin X, Yu X, Sohn K, Liu X, Chandraker M (2018) Feature transfer learning for deep face recognition with long-tail data, arXiv preprint, arXiv:1803.09014
Zhong Y, Deng W, Wang M, et al (2019) Unequal-training for deep face recognition with long-tailed noisy data. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7812–7821
Huang C, Li Y, Chen CL, et al (2019) Deep imbalanced learning for face recognition and attribute prediction. IEEE Trans Pattern Anal Mach Intell
Zhao K, Xu J, Cheng M-M (2019) Regularface: Deep face recognition via exclusive regularization. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp. 1136–1144
Deng J, Guo J, Xue N, et al (2019) Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 4690–4699
Liu B, Deng W, Zhong Y, et al (2019) Fair Loss: margin-Aware Reinforcement Learning for Deep Face Recognition. In: Proceedings of the IEEE international conference on computer vision (ICCV), pp 10052–10061
Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: a unified embedding for face recognition and clustering. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) , pp 815–82
Elharrouss O, Almaadeed N, Al-Maadeed S (2020) February. LFR face dataset: Left-Front-Right dataset for pose-invariant face recognition in the wild. In: 2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT) (pp 124-130). IEEE
Elharrouss O, Moujahid D, Tairi H (2015) Motion detection based on the combining of the background subtraction and the structure–texture decomposition. Optik-Int JLight Electron Opt 126(24):5992–5997
Geusebroek J-M, van den Boomgaard R, Smeulders AWM, Dev A (2000) Color and scale: the spatial structure of color images. In: Proceeding of the 6th European conference on computer vision (ECCV), vol 1, pp 331–341
Zhou H, Chen Y, Feng R (2013) A novel background subtraction method based on color invariants. Comput Vis Image Understand 117(11):1589–1597
Simonyan K, Zisserman A (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556
Wang L, Ge L, Li R et al (2017) Three-stream CNNs for action recognition. Pattern Recognit Lett 92:33–40
Gourier N, Hall D, James L (2004) Crowley. Estimating face orientation from robust detection of salient facial structures. In: FG Net workshop on visual observation of deictic gestures, vol. 6, p. 7. FGnet (IST-2000-26434) Cambridge, UK
Sengupta S et al (2016) Frontal to profile face verification in the wild. In: 2016 IEEE winter conference on applications of computer vision (WACV). IEEE
Wolf L, Hassner T, Maoz, FI (2011) recognition in unconstrained videos with matched background similarity. In: IEEE conference on computer vision and pattern recognition (CVPR)
Masi I, Rawls S, Medioni G, Natarajan P (2016) Pose-aware face recognition in the wild. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp 4838–4846
Lee H, et al (2017) Structure-texture decomposition of images with interval gradient. In: Computer graphics forum. Wiley Online Library
Yin X, Liu X (2018) Multi-task convolutional neural network for pose-invariant face recognition. IEEE Trans Image Process 27(2):964–975
Acknowledgements
This publication was made by NPRP Grant # NPRP8-140-2-065 from the Qatar National Research Fund (a member of the Qatar Foundation). The statements made herein are solely the responsibility of the authors.
Funding
This publication was made by NPRP Grant # NPRP8-140-2-065 from the Qatar National Research Fund (a member of the Qatar Foundation). The statements made herein are solely the responsibility of the authors.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
I certify that there is no actual or potential conflict of interest in relation to this article.
Conflict of interest
The authors declare that they have no competing interests.
Financial interests
I certify that there is no financial interest in relation to this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Elharrouss, O., Almaadeed, N., Al-Maadeed, S. et al. Pose-invariant face recognition with multitask cascade networks. Neural Comput & Applic 34, 6039–6052 (2022). https://doi.org/10.1007/s00521-021-06690-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-021-06690-4