Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking

In a 'conference room scenario', the number of speech sources are not known a priori and the number of speech sources which are active remains unknown as these speech sources appear and disappear throughout the measurement period. Furthermore, the speech sources are moving so their mixing...

Full description

Bibliographic Details
Main Authors: Chong, Nicholas, Nordholm, Sven, Vo, Ba Tuong, Murray, Iain
Format: Conference Paper
Published: 2017
Online Access:http://hdl.handle.net/20.500.11937/50768
_version_ 1848758532598398976
author Chong, Nicholas
Nordholm, Sven
Vo, Ba Tuong
Murray, Iain
author_facet Chong, Nicholas
Nordholm, Sven
Vo, Ba Tuong
Murray, Iain
author_sort Chong, Nicholas
building Curtin Institutional Repository
collection Online Access
description In a 'conference room scenario', the number of speech sources are not known a priori and the number of speech sources which are active remains unknown as these speech sources appear and disappear throughout the measurement period. Furthermore, the speech sources are moving so their mixing parameters change with time. As a result of this, traditional source separation techniques are limited by their capability to properly attribute the correct mixing parameters to the respective sources. The 'conference room scenario' problem is very challenging as it involves the localization, tracking and separation of a time varying number of moving speech sources. An online solution which systematically solves 'conference room scenario' problem by solving the source localization, tracking and separation in stages is proposed in this paper.
first_indexed 2025-11-14T09:45:29Z
format Conference Paper
id curtin-20.500.11937-50768
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T09:45:29Z
publishDate 2017
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-507682017-09-13T15:36:42Z Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking Chong, Nicholas Nordholm, Sven Vo, Ba Tuong Murray, Iain In a 'conference room scenario', the number of speech sources are not known a priori and the number of speech sources which are active remains unknown as these speech sources appear and disappear throughout the measurement period. Furthermore, the speech sources are moving so their mixing parameters change with time. As a result of this, traditional source separation techniques are limited by their capability to properly attribute the correct mixing parameters to the respective sources. The 'conference room scenario' problem is very challenging as it involves the localization, tracking and separation of a time varying number of moving speech sources. An online solution which systematically solves 'conference room scenario' problem by solving the source localization, tracking and separation in stages is proposed in this paper. 2017 Conference Paper http://hdl.handle.net/20.500.11937/50768 10.1109/ICCAIS.2016.7822441 restricted
spellingShingle Chong, Nicholas
Nordholm, Sven
Vo, Ba Tuong
Murray, Iain
Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking
title Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking
title_full Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking
title_fullStr Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking
title_full_unstemmed Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking
title_short Tracking and separation of multiple moving speech sources via cardinality balanced multi-Target multi Bernoulli (CBMeMBer) filter and time frequency masking
title_sort tracking and separation of multiple moving speech sources via cardinality balanced multi-target multi bernoulli (cbmember) filter and time frequency masking
url http://hdl.handle.net/20.500.11937/50768