Skip to main content

Research Repository

Advanced Search

All Outputs (1)

Combining residual networks with LSTMs for lipreading (2017)
Conference Proceeding
Stafylakis, T., & Tzimiropoulos, G. (in press). Combining residual networks with LSTMs for lipreading. In Proc. Interspeech 2017 (3652-3656). https://doi.org/10.21437/Interspeech.2017-85

We propose an end-to-end deep learning architecture for word-level visual speech recognition. The system is a combination of spatiotemporal convolutional, residual and bidirectional Long Short-Term Memory networks. We train and evaluate it on the Lip... Read More about Combining residual networks with LSTMs for lipreading.