Unsupervised Machine Learning based rapid Pose-Invariant Three-Dimensional Facial Reconstruction for Biomedical Extended Reality and Phantom Generation in Oncologic Cranio-Maxillofacial Surgery
DOI:
https://doi.org/10.26821/IJSHRE.10.1.2022.91101Keywords:
Cranio-maxillofacial surgery, biomedical extended reality (XR), ), custom patient phantoms, cancerous facial tumors, PET/CT scans, cosmetic surgery, three-dimensional reconstruction, 2D images, 3D meshes, pose invariant face frontalization, Generative Adversarial Network (GAN), Convolutional Neural Network (CNN), morphable 3D model, 2D facial landmarks, printable modelAbstract
Cranio-maxillofacial surgery is a surgical specialty that focuses on the reconstructive surgery of the entire cranio-maxillofacial complex: the anatomical area of the mouth, jaws, face, and skull, head and neck as well as associated structures. A highly precise and complex procedure, it demands a well-structured and comprehensive treatment planning process. Biomedical extended reality (XR) has proven to be beneficial in the pre-surgery visualization step as it enables interaction in the three-dimensional (3D) space relative to the patient, however the tedious nature of the construction of unique, patient-specific facial models makes it difficult to implement XR and commercial physical patient phantoms are far too generic and expensive for the task. In the case of cancerous tumors on the head and face, the numbers of which have increased tremendously over the past few years, a suitable oncologic therapy needs to be delivered in the shortest possible period of time, as they spread rapidly and the possible complications can be fatal to patients. While PET/CT scans do help in accurately visualizing such tumors, and are the universally-accepted approach in surgically examining cases of craniofacial trauma and oral cancer, they are time-consuming and only assist in two-dimensional (2D) visualization. They have to be further mapped by a surgical or radiological expert to the 3D anatomy of the patient, a task that can be eliminated in both oncologic cranio-maxillofacial surgery and cosmetic surgery.
In this study, we explore near-instantaneous external 3D facial reconstruction from landmarks obtained specifically from a single 2D image of the human face, which is unchallenging to sample, making the system deployable as an application even on mobile devices. We also aim to eliminate the limitations encountered due to pose variations, through the frontalization of these images, while maintaining focus on the affected area, with the implementation of an illumination-preserving Generative Adversarial Network (GAN), integrated with a light Convolutional Neural Network (CNN) for feature extraction, trained and tested on the Multi-PIE and LFW (Labelled Faces In-the-Wild) datasets. The output is then processed to obtain 2D landmarks (disregarding external features) that are crucial to project the image on a 3D morphable model (3DMM). The final output is a digital, printable 3D mesh in both the OBJ and STL format, which can be used for generating custom patient phantoms as well as be implemented in biomedical XR for pre-surgical planning and evaluation. It can also be integrated with a CT scan for internal skeletal 3D imaging in other cranio-maxillofacial cases.
We ultimately evaluate the performance of the system on images captured from several angles, to show its generalization and robustness to novel facial configurations and unseen data.
References
Robins JMW, Sheikh AJ, Shastin D, Schramm MWJ, Carter P, Russell JL, Liddington M, Chumas PD. Fronto-orbital advancement and reconstruction using reverse frontal bone graft without the use of orbital bar: a technical note. Childs Nerv Syst. 2020 Jun;36(6):1295-1299. doi: 10.1007/s00381-020-04583-w. Epub 2020 Mar 26. PMID: 32219525; PMCID: PMC7250796.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250796/
Chen WL, Yang ZH, Huang ZQ, Fan S, Zhang DM, Wang YY. Craniofacial Resection and Reconstruction in Patients With Recurrent Cancer Involving the Craniomaxillofacial Region. J Oral Maxillofac Surg. 2017 Mar;75(3):622-631. doi: 10.1016/j.joms.2016.08.044. Epub 2016 Sep 13. PMID: 27717818.
https://pubmed.ncbi.nlm.nih.gov/27717818/
Benmahdjoub M, van Walsum T, van Twisk P, Wolvius EB. Augmented reality in craniomaxillofacial surgery: added value and proposed recommendations through a systematic review of the literature. Int J Oral Maxillofac Surg. 2021 Jul;50(7):969-978. doi: 10.1016/j.ijom.2020.11.015. Epub 2020 Dec 16. PMID: 33339731.
https://pubmed.ncbi.nlm.nih.gov/33339731/
Xiaojun Chen, Lu Xu, Yiping Wang, Huixiang Wang, Fang Wang, Xiangsen Zeng, Qiugen Wang, Jan Egger, Development of a surgical navigation system based on augmented reality using an optical see-through head-mounted display, Journal of Biomedical Informatics, Volume 55, 2015, Pages 124-131, ISSN 1532-0464, https://doi.org/10.1016/j.jbi.2015.04.003.
(https://www.sciencedirect.com/science/article/pii/S1532046415000702)
Hsieh, Chung-Hung, Jiann-Der Lee, and Chieh-Tsai Wu. "A Kinect-based medical augmented reality system for craniofacial applications using image-to-patient registration." Neuropsychiatry 7.6 (2017): 927-939.
Alshomer F, Alazzam A, Alturki A, Almeshal O, Alhusainan H. Smartphone-assisted Augmented Reality in Craniofacial Surgery. Plast Reconstr Surg Glob Open. 2021 Aug 13;9(8):e3743. doi: 10.1097/GOX.0000000000003743. PMID: 34414055; PMCID: PMC8367041.
https://pubmed.ncbi.nlm.nih.gov/34414055/
Giovanni Badiali, Vincenzo Ferrari, Fabrizio Cutolo, Cinzia Freschi, Davide Caramella, Alberto Bianchi, Claudio Marchetti, Augmented reality as an aid in maxillofacial surgery: Validation of a wearable system allowing maxillary repositioning, Journal of Cranio-Maxillofacial Surgery, Volume 42, Issue 8, 2014, Pages 1970-1976, ISSN 1010-5182, https://doi.org/10.1016/j.jcms.2014.09.001.
(https://www.sciencedirect.com/science/article/pii/S1010518214002522)
Carlos G. Landaeta-Quinones, Nicole Hernandez, Najy K. Zarroug, Computer-Assisted Surgery: Applications in Dentistry and Oral and Maxillofacial Surgery, Dental Clinics of North America, Volume 62, Issue 3, 2018, Pages 403-420, ISSN 0011-8532, ISBN 9780323610766, https://doi.org/10.1016/j.cden.2018.03.009. (https://www.sciencedirect.com/science/article/pii/S0011853218300247)
Beyer, T., Townsend, D.W., Brun, T., Kinahan, P.E., Charron, M., Roddy, R., Jerin, J., Young, J., Byars, L. and Nutt, R., 2000. A combined PET/CT scanner for clinical oncology. Journal of nuclear medicine, 41(8), pp.1369-1379.
Modeling and Simulation of 4D PET-CT and PET-MR Images Tsoumpas, Charalampos et al. PET Clinics, Volume 8, Issue 1, 95 - 110
Wong WL, Batty V. Role of PET/CT in maxillo-facial surgery. Br J Oral Maxillofac Surg. 2009 Jun;47(4):259-67. doi: 10.1016/j.bjoms.2008.11.011. Epub 2009 Feb 5. PMID: 19200626.
https://pubmed.ncbi.nlm.nih.gov/19200626/
Bibb, R., Eggbeer, D. & Paterson, A. Medical modelling: the application of advanced design and rapid prototyping techniques in medicine. (Woodhead Publishing, 2014).
Filippou, V. and Tsoumpas, C. (2018), Recent advances on the development of phantoms using 3D printing for imaging with CT, MRI, PET, SPECT, and ultrasound. Med. Phys., 45: e740-e760.
https://doi.org/10.1002/mp.13058
Amorim, P., Moraes, T., Silva, J. & Pedrini, H. Invesalius: An interactive rendering framework for health care support. In Bebis, G. et al. (eds.) Advances in Visual Computing. Lecture Notes in Computer Science, vol. 9474, 45–54 (Springer, Cham, 2015)
Clement, J.G. and Marks, M.K., 2005. Computer-graphic facial reconstruction. Elsevier.
3D Semi-Landmarks Based Statistical Face Reconstruction. Maxime Berar, Michel Desvignes, Gerard Bailly, Yohan Payan
https://doi.org/10.2498/cit.2006.01.04
Elyan, Eyad & Ugail, Hassan. (2007). Reconstruction of 3D Human Facial Images Using Partial Differential Equations. Journal of Computers. 2. 10.4304/jcp.2.8.1-8.
Wang, D. 2005. Face Reconstruction Across Different Poses and Arbitrary Illumination Conditions. In Audio- and Video-Based Biometric Person Authentication (pp. 91–101). Springer Berlin Heidelberg.
https://doi.org/10.1007/11527923_10
Rudomin, Isaac & Cuevas, Hector. (2000). Generating a 3D Facial Model From a Single Image using Principal Components Analysis.
Y. Guan, "Automatic 3D Face Reconstruction based on Single 2D Image," 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07), 2007, pp. 1216-1219, doi: 10.1109/MUE.2007.95.
https://ieeexplore.ieee.org/abstract/document/4197445
Yao Feng, Fan Wu, Xiaohu Shao, Yanfeng Wang, & Xi Zhou. (2018). Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network.
https://arxiv.org/abs/1803.07835
Aaron S. Jackson, Adrian Bulat, Vasileios Argyriou, & Georgios Tzimiropoulos. (2017). Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression.
https://arxiv.org/abs/1703.07834
Yu Deng, Jiaolong Yang, Sicheng Xu, Dong Chen, Yunde Jia, & Xin Tong. (2020). Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set.
https://arxiv.org/abs/1903.08527
Pengfei Dou, Shishir K. Shah, & Ioannis A. Kakadiaris. (2017). End-to-end 3D face reconstruction with deep neural networks.
https://arxiv.org/abs/1704.05020
Alexandros Lattas, Stylianos Moschoglou, Baris Gecer, Stylianos Ploumpis, Vasileios Triantafyllou, Abhijeet Ghosh, & Stefanos Zafeiriou. (2020). AvatarMe: Realistically Renderable 3D Facial Reconstruction "in-the-wild".
https://arxiv.org/abs/2003.13845
Gecer, B., Ploumpis, S., Kotsia, I., & Zafeiriou, S. (2019). GANFIT: Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
https://arxiv.org/abs/1902.05978
https://towardsdatascience.com/getting-started-with-gans-using-pytorch-78e7c22a14a5
Zhang, K., Zhang, Z., Li, Z., & Qiao, Y. (2016). Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks. IEEE Signal Processing Letters, 23(10), 1499–1503.
https://arxiv.org/abs/1604.02878
Huang, Gary & Mattar, Marwan & Berg, Tamara & Learned-Miller, Eric. (2008). Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments. Tech. rep..
R. Gross, I. Matthews, J. Cohn, T. Kanade and S. Baker, "Multi-PIE," 2008 8th IEEE International Conference on Automatic Face & Gesture Recognition, 2008, pp. 1-8, doi: 10.1109/AFGR.2008.4813399.
https://ieeexplore.ieee.org/document/4813399
Rui Huang, Shu Zhang, Tianyu Li, & Ran He. (2017). Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis.
https://arxiv.org/abs/1704.04086
L. Tran, X. Yin and X. Liu, "Disentangled Representation Learning GAN for Pose-Invariant Face Recognition," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 1283-1292, doi: 10.1109/CVPR.2017.141.
https://ieeexplore.ieee.org/document/8099624
Shuang Ma, Jianlong Fu, Chang Wen Chen, & Tao Mei. (2018). DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks (with Supplementary Materials).
https://arxiv.org/abs/1802.06454
Wei, Yuxiang & Liu, Ming & Wang, Haolin & Zhu, Ruifeng & Hu, Guosheng & Zuo, Wangmeng. (2020). Learning Flow-based Feature Warping for Face Frontalization with Illumination Inconsistent Supervision.
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, & Alexei A. Efros. (2018). Image-to-Image Translation with Conditional Adversarial Networks.
https://arxiv.org/abs/1611.07004
Xudong Mao, Qing Li, Haoran Xie, Raymond Y. K. Lau, Zhen Wang, & Stephen Paul Smolley. (2017). Least Squares Generative Adversarial Networks.
https://arxiv.org/abs/1611.04076
P. Paysan, R. Knothe, B. Amberg, S. Romdhani and T. Vetter, "A 3D Face Model for Pose and Illumination Invariant Face Recognition," 2009 Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009, pp. 296-301, doi: 10.1109/AVSS.2009.58.
https://ieeexplore.ieee.org/document/5279762
Li, Tianye & Bolkart, Timo & Black, Michael & Li, Hao & Romero, Javier. (2017). Learning a model of facial shape and expression from 4D scans. ACM Transactions on Graphics. 36. 1-17. 10.1145/3130800.3130813.
Gsaxner, C., Wallner, J., Chen, X. et al. Facial model collection for medical augmented reality in oncologic cranio-maxillofacial surgery. Sci Data 6, 310 (2019). https://doi.org/10.1038/s41597-019-0327-8