Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization

For over two decades, speech control mechanisms have been widely applied in manufacturing systems such as factory automation, warehouse automation and industrial robotic control for over two decades. To implement speech controls, a commercial speech recognizer is used as the interface between users...

Full description

Bibliographic Details
Main Authors: Chan, Kit Yan, Yiu, Cedric K.F., Dillon, Tharam S., Nordholm, Sven, Ling, S.H.
Format: Journal Article
Published: IEEE 2012
Subjects:
Online Access:http://hdl.handle.net/20.500.11937/16983
_version_ 1848749332524695552
author Chan, Kit Yan
Yiu, Cedric K.F.
Dillon, Tharam S.
Nordholm, Sven
Ling, S.H.
author_facet Chan, Kit Yan
Yiu, Cedric K.F.
Dillon, Tharam S.
Nordholm, Sven
Ling, S.H.
author_sort Chan, Kit Yan
building Curtin Institutional Repository
collection Online Access
description For over two decades, speech control mechanisms have been widely applied in manufacturing systems such as factory automation, warehouse automation and industrial robotic control for over two decades. To implement speech controls, a commercial speech recognizer is used as the interface between users and the automation system. However, users’ commands are often contaminated by environmental noise which degrades the performance of speech recognition for controlling automation systems. This paper presents a multichannel signal enhancement methodology to improve the performance of commercial speech recognizers. The proposed methodology aims to optimize speech recognition accuracy of a commercial speech recognizer in a noisy environment based on a beam former, which is developed by an intelligent particle swarm optimization. It overcomes the limitation of the existing signal enhancement approaches whereby the parameters inside commercial speech recognizers are required to be tuned, which is impossible in a real-world situation. Also, it overcomes the limitation of the existing optimization algorithm including gradient descent methods, genetic algorithms and classical particle swarm optimization that are unlikely to develop optimal beam formers for maximizing speech recognition accuracy. The performance of the proposed methodology was evaluated by developing beam formers for a commercial speech recognizer, which was implemented on warehouse automation. Results indicate a significant improvement regarding speech recognition accuracy.
first_indexed 2025-11-14T07:19:15Z
format Journal Article
id curtin-20.500.11937-16983
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T07:19:15Z
publishDate 2012
publisher IEEE
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-169832018-03-29T09:06:20Z Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization Chan, Kit Yan Yiu, Cedric K.F. Dillon, Tharam S. Nordholm, Sven Ling, S.H. particle swarm optimization speech recognition Speech control speech recognizer multi-channel filter beamformer intelligent fuzzy systems For over two decades, speech control mechanisms have been widely applied in manufacturing systems such as factory automation, warehouse automation and industrial robotic control for over two decades. To implement speech controls, a commercial speech recognizer is used as the interface between users and the automation system. However, users’ commands are often contaminated by environmental noise which degrades the performance of speech recognition for controlling automation systems. This paper presents a multichannel signal enhancement methodology to improve the performance of commercial speech recognizers. The proposed methodology aims to optimize speech recognition accuracy of a commercial speech recognizer in a noisy environment based on a beam former, which is developed by an intelligent particle swarm optimization. It overcomes the limitation of the existing signal enhancement approaches whereby the parameters inside commercial speech recognizers are required to be tuned, which is impossible in a real-world situation. Also, it overcomes the limitation of the existing optimization algorithm including gradient descent methods, genetic algorithms and classical particle swarm optimization that are unlikely to develop optimal beam formers for maximizing speech recognition accuracy. The performance of the proposed methodology was evaluated by developing beam formers for a commercial speech recognizer, which was implemented on warehouse automation. Results indicate a significant improvement regarding speech recognition accuracy. 2012 Journal Article http://hdl.handle.net/20.500.11937/16983 10.1109/TII.2012.2187910 IEEE restricted
spellingShingle particle swarm optimization
speech recognition
Speech control
speech recognizer
multi-channel filter
beamformer
intelligent fuzzy systems
Chan, Kit Yan
Yiu, Cedric K.F.
Dillon, Tharam S.
Nordholm, Sven
Ling, S.H.
Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
title Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
title_full Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
title_fullStr Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
title_full_unstemmed Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
title_short Enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
title_sort enhancement of speech recognitions for control automation using an intelligent particle swarm optimization
topic particle swarm optimization
speech recognition
Speech control
speech recognizer
multi-channel filter
beamformer
intelligent fuzzy systems
url http://hdl.handle.net/20.500.11937/16983