Cascade Multi-view Hourglass Model for Robust 3D Face Alignment

Estimating the 3D facial landmarks from a 2D image remains a challenging problem. Even though state-of-the-art 2D alignment methods are able to predict accurate landmarks for semi-frontal faces, the majority of them fail to provide semantically consistent landmarks for profile faces. A de facto solution to this problem is through 3D face alignment that preserves correspondence across different poses. In this paper, we proposed a Cascade Multi-view Hourglass Model for 3D face alignment, where the first Hourglass model is explored to jointly predict semi-frontal and profile 2D facial landmarks, after removing spatial transformations, another Hourglass model is employed to estimate the 3D facial shapes. To improve the capacity without sacrificing the computational complexity, the original residual bottleneck block in the Hourglass model is replaced by a parallel, multi-scale inception-resnet block. Extensive experiments on two challenging 3D face alignment datasets, AFLW2000-3D and Menpo-3D, show the robustness of the proposed method under continuous pose changes.

Deng Jiankang, Zhou Yuxiang, Cheng Shiyang, Zaferiou Stefanos

A4 Article in conference proceedings

13th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2018

J. Deng, Y. Zhou, S. Cheng and S. Zaferiou, "Cascade Multi-View Hourglass Model for Robust 3D Face Alignment," 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi'an, 2018, pp. 399-403. doi: 10.1109/FG.2018.00064