Now you see it, now you don’t - frequency distribution of articulatory information reflected in speech face motion

Although the increase in intelligibility of spoken language when watching the speaker’s face is well documented, the characteristics and the distribution of phonetic information over the frequency range of visual speech remain largely unknown. For this study face motion and tongue movements were mea...

Full description

Bibliographic Details
Main Authors: Kroos, Christian, Bundgaard-Nielsen, R., Best, C.
Other Authors: Felicity Cox
Format: Conference Paper
Published: Australasian Speech Science and Technology Association (ASSTA) 2012
Subjects:
Online Access:http://assta.org/sst/SST-12/SST2012/PDF/AUTHOR/ST120096.PDF
http://hdl.handle.net/20.500.11937/17025
Description
Summary:Although the increase in intelligibility of spoken language when watching the speaker’s face is well documented, the characteristics and the distribution of phonetic information over the frequency range of visual speech remain largely unknown. For this study face motion and tongue movements were measured simultaneously for three speakers of American English. The motion signals were subjected to a multiresolution analysis using spline wavelets and partial least squares regression was applied to estimate tongue movements from face motion. The amount of recovered variance was found to be small (below 30%) compared to previous studies and more pronounced in the lower frequencies.