Deep word embeddings for visual speech recognition
In this paper we present a deep learning architecture for extracting word embeddings for visual speech recognition. The embeddings summarize the information of the mouth region that is relevant to the problem of word recognition, while suppressing other types of variability such as speaker, pose and...
| Main Authors: | Stafylakis, Themos; Tzimiropoulos, Georgios |
|---|---|
| Format: | Conference or Workshop Item |
| Language: | English |
| Published: | 2018 |
| Subjects: | |
| Online Access: | https://eprints.nottingham.ac.uk/51133/ |
Similar Items
Combining residual networks with LSTMs for lipreading
by: Stafylakis, Themos, et al.
Published: (2017)
Self-supervised learning for automatic speech recognition in low-resource environments
by: Fatehi, Kavan
Published: (2024)
Figurative language detection using deep and contextual features
by: Razali, Md Saifullah
Published: (2023)
Visual word recognition in bilinguals and monolinguals: behavioural and ERP investigations of the role of word frequency, lexicality and repetition
by: Corona Dzul, B.
Published: (2017)
Speaker discriminability for visual speech modes
by: Kim, J., et al.
Published: (2009)
Mathematical Aspects of Word Embeddings
by: Carrington, Rachel
Published: (2021)
The role of orthography and visual form on word recognition
by: Kelly, Andrew N.
Published: (2016)
Input matters: speed of word recognition in 2-year-olds exposed to multiple accents
by: Buckler, Helen, et al.
Published: (2017)
Motion Path Generation Using A Modified 6th Order Polynomial Function for Visual Speech Synthesis
by: Salleh, Siti Salwa
Published: (2008)
Perceptual plasticity in the peripheral visual field of older adults
by: Blighe, Alan
Published: (2014)
Novel meta-learning approaches for few-shot image classification
by: Song, Heda
Published: (2022)
An investigation of deep learning for image processing applications
by: Hou, Xianxu
Published: (2018)
A new multi-purpose audio-visual UNMC-VIER database with multiple variabilities
by: Wong, Y.W., et al.
Published: (2011)
Machine learning for neural coding of sound envelopes: slithering from sinusoids to speech
by: Levy, Alban Hugo
Published: (2018)
Deep learning approach with image noise reduction to determine planting density and defected paddy seedlings
by: Mohamed Anuar, Mohamed Marzhar
Published: (2022)
Multichannel filters for speech recognition using a particle swarm optimization
by: Chan, Kit Yan, et al.
Published: (2012)
Automatic speech recognition predicts speech intelligibility and comprehension for listeners with simulated age-related hearing loss
by: Fontan, Lionel, et al.
Published: (2017)
Deep Learning Using Tiny Domain-Specific Datasets with Sparse Labels
by: Smith, Thomas J
Published: (2021)
Development of an Isolated Digit Speech Recognition Based on Multilayer Perceptron Model
by: Mohamad Hussin, Ummu Salmah
Published: (2004)
Deep tissue analysis: advancing optical techniques with interpretable deep learning and aberration correction
by: Kok, Yong En
Published: (2025)
Micro Neural-Controller for Optical Character Recognition
by: Lim, Ker-chin, et al.
Published: (2006)
Speech recognition enhancement using beamforming and a genetic algorithm
by: Chan, Kit Yan, et al.
Published: (2009)
End-to-end DVB-S2X system design with deep learning-based channel estimation over satellite fading channels
by: Mfarej, Sumaya Dhari Awad
Published: (2021)
Parts of speech in Bloom's Taxonomy Classification
by: von Konsky, Brian, et al.
Published: (2018)
Deep learning models of biological visual information processing
by: Turcsány, Diána
Published: (2016)
A multi-filter system for speech enhancement under low signal-to-noise ratios
by: Yiu, Ka Fai, et al.
Published: (2009)
Efficient online subspace learning with an indefinite kernel for visual tracking and recognition
by: Liwicki, Stephan, et al.
Published: (2012)
Synthetic data driven deep learning for plant phenotyping
by: Hartley, Zane K.J.
Published: (2024)
Deep machine learning provides state-of-the-art performance in image-based plant phenotyping
by: Pound, Michael P., et al.
Published: (2017)
An investigation into image-based indoor localization using deep learning
by: Li, Qing
Published: (2020)
Large-scale detection, mapping, and initial health assessment of date palm trees using multiplatform remotely-sensed data and deep learning techniques
by: Gibril, Mohamed Barakat Abdelfatah
Published: (2023)
A new penalty term for the BIC with respect to speaker diarization
by: Stafylakis, Themos, et al.
Published: (2010)
Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
by: Chan, Kit Yan, et al.
Published: (2012)
End-to-end audiovisual speech recognition
by: Petridis, Stavros, et al.
Published: (2018)
An investigation into how low achieving secondary students learn fractions through visual representations
by: Barichello, Leonardo
Published: (2019)
Fast automatic translation and morphological decomposition in Chinese-English bilinguals
by: Zhang, Taoli, et al.
Published: (2011)
Visual search strategies of children with and without autism spectrum disorders during an embedded figures task
by: Horlin, Chiara, et al.
Published: (2014)
Sensor fusion of motion-based sign language interpretation with deep learning
by: Lee, Boon Giin, et al.
Published: (2020)
Speech Enhancement Strategy for Speech Recognition Microcontroller under Noisy Environments
by: Chan, Kit Yan, et al.
Published: (2013)
Multitasking deep neural network models for Arabic dialect sentiment analysis
by: Alali, Muath Mohammad Oqlah
Published: (2022)