Combining residual networks with LSTMs for lipreading
(2017)
Conference Proceeding
Stafylakis, T., & Tzimiropoulos, G. (2017). Combining residual networks with LSTMs for lipreading. In Proc. Interspeech 2017 (pp. 3652-3656). https://doi.org/10.21437/Interspeech.2017-85
We propose an end-to-end deep learning architecture for word-level visual speech recognition. The system is a combination of spatiotemporal convolutional, residual and bidirectional Long Short-Term Memory networks. We train and evaluate it on the Lip...