L 2, 1-based regression and prediction accumulation across views for robust facial landmark detection
Martinez, Brais; Valstar, Michel F.
Michel F. Valstar
We propose a new methodology for facial landmark detection. Similar to other state-of-the-art methods, we rely on the use of cascaded regression to perform inference, and we use a feature representation that results from concatenating 66 HOG descriptors, one per landmark. However, we propose a novel regression method that substitutes the commonly used Least Squares regressor. This new method makes use of the L2,1 norm, and it is designed to increase the robust- ness of the regressor to poor initialisations (e.g., due to large out of plane head poses) or partial occlusions. Furthermore, we propose to use multiple initialisations, consisting of both spatial translation and 4 head poses corresponding to different pan rotations. These estimates are aggregated into a single prediction in a robust manner. Both strategies are designed to improve the convergence behaviour of the algorithm, so that it can cope with the challenges of in-the- wild data. We further detail some important experimental details, and show extensive performance comparisons highlighting the performance improvement attained by the method proposed here.
Martinez, B., & Valstar, M. F. (2016). L 2, 1-based regression and prediction accumulation across views for robust facial landmark detection. Image and Vision Computing, https://doi.org/10.1016/j.imavis.2015.09.003
|Journal Article Type||Article|
|Acceptance Date||Sep 28, 2015|
|Online Publication Date||Oct 9, 2015|
|Deposit Date||Jan 21, 2016|
|Publicly Available Date||Jan 21, 2016|
|Journal||Image and Vision Computing|
|Peer Reviewed||Peer Reviewed|
|Keywords||Facial landmark detection, Regression, 300 W challenge|
Copyright information regarding this work can be found at the following address: http://creativecommons.org/licenses/by-nc-nd/4.0