Adrian Bulat
Hierarchical binary CNNs for landmark localization with limited resources
Bulat, Adrian; Tzimiropoulos, Georgios
Authors
Georgios Tzimiropoulos
Abstract
Our goal is to design architectures that retain the groundbreaking performance of Convolutional Neural Networks (CNNs) for landmark localization and at the same time are lightweight, compact and suitable for applications with limited computational resources. To this end, we make the following contributions: (a) we are the first to study the effect of neural network binarization on localization tasks, namely human pose estimation and face alignment. We exhaustively evaluate various design choices, identify performance bottlenecks, and more importantly propose multiple orthogonal ways to boost performance. (b) Based on our analysis, we propose a novel hierarchical, parallel and multi-scale residual architecture that yields large performance improvement over the standard bottleneck block while having the same number of parameters, thus bridging the gap between the original network and its binarized counterpart. (c) We perform a large number of ablation studies that shed light on the properties and the performance of the proposed block. (d) We present results for experiments on the most challenging datasets for human pose estimation and face alignment, reporting in many cases state-of-the-art performance. (e) We further provide additional results for the problem of facial part segmentation. Code can be downloaded from https://www.adrianbulat.com/binary-cnn-landmarks
Citation
Bulat, A., & Tzimiropoulos, G. (2020). Hierarchical binary CNNs for landmark localization with limited resources. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(2), 343 - 356. https://doi.org/10.1109/tpami.2018.2866051
Journal Article Type | Article |
---|---|
Acceptance Date | Aug 5, 2018 |
Online Publication Date | Aug 23, 2018 |
Publication Date | 2020-02 |
Deposit Date | Aug 16, 2018 |
Publicly Available Date | Mar 29, 2024 |
Journal | IEEE Transactions on Pattern Analysis and Machine Intelligence |
Print ISSN | 0162-8828 |
Electronic ISSN | 1939-3539 |
Publisher | Institute of Electrical and Electronics Engineers |
Peer Reviewed | Peer Reviewed |
Volume | 42 |
Issue | 2 |
Pages | 343 - 356 |
DOI | https://doi.org/10.1109/tpami.2018.2866051 |
Keywords | Binary Convolutional Neural Networks; Residual learning; Landmark localization; Human pose estimation; Face alignment. |
Public URL | https://nottingham-repository.worktribe.com/output/1036353 |
Publisher URL | https://ieeexplore.ieee.org/document/8444745/ |
Files
Binary CNNs
(4.8 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by/3.0/
You might also like
Object landmark discovery through unsupervised adaptation
(2019)
Journal Article
Artificial intelligence-enhanced multi-material form measurement for additive materials
(2018)
Presentation / Conference
Deep word embeddings for visual speech recognition
(2018)
Conference Proceeding
End-to-end audiovisual speech recognition
(2018)
Conference Proceeding
Synergy between face alignment and tracking via Discriminative Global Consensus Optimization
(2017)
Conference Proceeding
Downloadable Citations
About Repository@Nottingham
Administrator e-mail: digital-library-support@nottingham.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search