Shashank Jaiswal
Deep learning the dynamic appearance and shape of facial action units
Jaiswal, Shashank; Valstar, Michel F.
Authors
Michel F. Valstar
Abstract
Spontaneous facial expression recognition under uncontrolled conditions is a hard task. It depends on multiple factors including shape, appearance and dynamics of the facial features, all of which are adversely affected by environmental noise and low intensity signals typical of such conditions. In this work, we present a novel approach to Facial Action Unit detection using a combination of Convolutional and Bi-directional Long Short-Term Memory Neural Networks (CNN-BLSTM), which jointly learns shape, appearance and dynamics in a deep learning manner. In addition, we introduce a novel way to encode shape features using binary image masks computed from the locations of facial landmarks. We show that the combination of dynamic CNN features and Bi-directional Long Short-Term Memory excels at modelling the temporal information. We thoroughly evaluate the contributions of each component in our system and show that it achieves state-of-the-art performance on the FERA-2015 Challenge dataset.
Citation
Jaiswal, S., & Valstar, M. F. Deep learning the dynamic appearance and shape of facial action units. Presented at Winter Conference on Applications of Computer Vision (WACV)
Conference Name | Winter Conference on Applications of Computer Vision (WACV) |
---|---|
End Date | Mar 9, 2016 |
Publication Date | Jan 1, 2016 |
Deposit Date | Jan 21, 2016 |
Publicly Available Date | Jan 21, 2016 |
Peer Reviewed | Peer Reviewed |
Public URL | https://nottingham-repository.worktribe.com/output/980044 |
Files
paper.pdf
(457 Kb)
PDF
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: discovery-access-systems@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search