Deep word embeddings for visual speech recognition

In this paper we present a deep learning architecture for extracting word embeddings for visual speech recognition. The embeddings summarize the information of the mouth region that is relevant to the problem of word recognition, while suppressing other types of variability such as speaker, pose and...

Bibliographic Details
Main Authors:	Stafylakis, Themos; Tzimiropoulos, Georgios
Format:	Conference or Workshop Item
Language:	English
Published:	2018
Subjects:	Visual Speech Recognition; Lipreading; Word Embeddings; Deep Learning; Low-shot Learning
Online Access:	https://eprints.nottingham.ac.uk/51133/

Similar Items

Combining residual networks with LSTMs for lipreading
by: Stafylakis, Themos, et al.
Published: (2017)

Self-supervised learning for automatic speech recognition in low-resource environments
by: Fatehi, Kavan
Published: (2024)

Figurative language detection using deep and contextual features
by: Razali, Md Saifullah
Published: (2023)

Visual word recognition in bilinguals and monolinguals: behavioural and ERP investigations of the role of word frequency, lexicality and repetition
by: Corona Dzul, B.
Published: (2017)

Speaker discriminability for visual speech modes
by: Kim, J., et al.
Published: (2009)

Mathematical Aspects of Word Embeddings
by: Carrington, Rachel
Published: (2021)

The role of orthography and visual form on word recognition
by: Kelly, Andrew N.
Published: (2016)

Input matters: speed of word recognition in 2-year-olds exposed to multiple accents
by: Buckler, Helen, et al.
Published: (2017)

Motion Path Generation Using A Modified 6th Order Polynomial Function for Visual Speech Synthesis
by: Salleh, Siti Salwa
Published: (2008)

Perceptual plasticity in the peripheral visual field of older adults
by: Blighe, Alan
Published: (2014)

Novel meta-learning approaches for few-shot image classification
by: Song, Heda
Published: (2022)

An investigation of deep learning for image processing applications
by: Hou, Xianxu
Published: (2018)

A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
by: Wong, Y.W., et al.
Published: (2011)

Machine learning for neural coding of sound envelopes: slithering from sinusoids to speech
by: Levy, Alban Hugo
Published: (2018)

Deep learning approach with image noise reduction to determine planting density and defected paddy seedlings
by: Mohamed Anuar, Mohamed Marzhar
Published: (2022)

Multichannel filters for speech recognition using a particle swarm optimization
by: Chan, Kit Yan, et al.
Published: (2012)

Automatic speech recognition predicts speech intelligibility and comprehension for listeners with simulated age-related hearing loss
by: Fontan, Lionel, et al.
Published: (2017)

Development of an Isolated Digit Speech Recognition Based on Multilayer Perceptron Model
by: Mohamad Hussin, Ummu Salmah
Published: (2004)

Deep Learning Using Tiny Domain-Specific Datasets with Sparse Labels
by: Smith, Thomas J
Published: (2021)

Deep tissue analysis: advancing optical techniques with interpretable deep learning and aberration correction
by: Kok, Yong En
Published: (2025)

Micro Neural-Controller for Optical Character Recognition
by: Lim, Ker-chin, et al.
Published: (2006)

Speech recognition enhancement using beamforming and a genetic algorithm
by: Chan, Kit Yan, et al.
Published: (2009)

End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
by: Mfarej, Sumaya Dhari Awad
Published: (2021)

Parts of speech in Bloom's Taxonomy Classification
by: von Konsky, Brian, et al.
Published: (2018)

Deep learning models of biological visual information processing
by: Turcsány, Diána
Published: (2016)

A multi-filter system for speech enhancement under low signal-to-noise ratios
by: Yiu, Ka Fai, et al.
Published: (2009)

Efficient online subspace learning with an indefinite kernel for visual tracking and recognition
by: Liwicki, Stephan, et al.
Published: (2012)

Synthetic data driven deep learning for plant phenotyping
by: Hartley, Zane K.J.
Published: (2024)

Deep machine learning provides state-of-the-art performance in image-based plant phenotyping
by: Pound, Michael P., et al.
Published: (2017)

An investigation into image-based indoor localization using deep learning
by: Li, Qing
Published: (2020)

Large-scale detection, mapping, and initial health assessment of date palm trees using multiplatform remotely-sensed data and deep learning techniques
by: Gibril, Mohamed Barakat Abdelfatah
Published: (2023)

Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
by: Chan, Kit Yan, et al.
Published: (2012)

A new penalty term for the BIC with respect to speaker diarization
by: Stafylakis, Themos, et al.
Published: (2010)

End-to-end audiovisual speech recognition
by: Petridis, Stavros, et al.
Published: (2018)

An investigation into how low achieving secondary students learn fractions through visual representations
by: Barichello, Leonardo
Published: (2019)

Visual search strategies of children with and without autism spectrum disorders during an embedded figures task
by: Horlin, Chiara, et al.
Published: (2014)

Fast automatic translation and morphological decomposition in Chinese-English bilinguals
by: Zhang, Taoli, et al.
Published: (2011)

Sensor fusion of motion-based sign language interpretation with deep learning
by: Lee, Boon Giin, et al.
Published: (2020)

Speech Enhancement Strategy for Speech Recognition Microcontroller under Noisy Environments
by: Chan, Kit Yan, et al.
Published: (2013)

Multitasking deep neural network models for Arabic dialect sentiment analysis
by: Alali, Muath Mohammad Oqlah
Published: (2022)