Deep word embeddings for visual speech recognition

In this paper we present a deep learning architecture for extracting word embeddings for visual speech recognition. The embeddings summarize the information of the mouth region that is relevant to the problem of word recognition, while suppressing other types of variability such as speaker, pose and...

Bibliographic Details
Main Authors: Stafylakis, Themos; Tzimiropoulos, Georgios
Format: Conference or Workshop Item
Language: English
Published: 2018
Online Access: https://eprints.nottingham.ac.uk/51133/