European Conference on
Computer Vision 2022
Landmarks often play a key role in face analysis, but many aspects of identity or expression cannot be represented by sparse landmarks alone. Thus, in order to reconstruct faces more accurately, landmarks are often combined with additional signals like depth images or techniques like differentiable rendering.
Can we keep things simple by just using more landmarks?
In answer, we present the first method that accurately predicts ten times as many landmarks as usual, covering the whole head, including the eyes and teeth. This is accomplished using synthetic training data, which guarantees perfect landmark annotations. By fitting a morphable model to these dense landmarks, we achieve state-of-the-art results for monocular 3D face reconstruction in the wild. We show that dense landmarks are an ideal signal for integrating face shape information across frames by demonstrating accurate and expressive facial performance capture in both monocular and multi-view scenarios. Finally, our method is highly efficient: we can predict dense landmarks and fit our 3D face model at over 150FPS on a single CPU thread.
We first predict probabilistic dense landmarks L, each with position µ and certainty σ.
Then, we fit our 3D face model to L, minimizing an energy E by optimizing model parameters Φ
While a human might consistently label images with 68 landmarks, manually annotating images with dense landmarks would be impossible. Instead, we rendered 100,000 synthetic training images using our Face Synthetics system. Without the perfect annotations provided by synthetic data, dense landmark prediction would not be possible.
Dense landmarks are the ideal signal for markerless multi-view facial performance capture.
Our approach is also highly efficient, running in real time on low-power CPU-only laptops.
@inproceedings{wood2022dense, title={3d face reconstruction with dense landmarks}, author={Wood, Erroll and Baltru{\v{s}}aitis, Tadas and Hewitt, Charlie and Johnson, Matthew and Shen, Jingjing and Milosavljevi{\'c}, Nikola and Wilde, Daniel and Garbin, Stephan and Sharp, Toby and Stojiljkovi{\'c}, Ivan and others}, booktitle={European Conference on Computer Vision}, pages={160--177}, year={2022}, organization={Springer} }