Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments

This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation algorithm utilizes the steered response power phase transform for obtaining a lo...

Full description

Bibliographic Details
Main Author: Nordholm, Sven
Format: Journal Article
Published: springer 2017
Online Access:http://hdl.handle.net/20.500.11937/65741
_version_ 1848761193624240128
author Nordholm, Sven
author_facet Nordholm, Sven
author_sort Nordholm, Sven
building Curtin Institutional Repository
collection Online Access
description This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation algorithm utilizes the steered response power phase transform for obtaining a localization estimate for each individual speech source in the frequency domain. Based on those estimates each desired speech signal is extracted from the speech mixture using an optimal beamforming technique. To solve the permutation problem, a permutation alignment algorithm based on the mutual output correlation is employed to group the output signals into the correct sources from each frequency bin. Evaluations using real speech recordings in a room environment show that the proposed blind speech separation algorithm offers high interference suppression level whilst maintaining low distortion level for each desired signal.
first_indexed 2025-11-14T10:27:47Z
format Journal Article
id curtin-20.500.11937-65741
institution Curtin University Malaysia
institution_category Local University
last_indexed 2025-11-14T10:27:47Z
publishDate 2017
publisher springer
recordtype eprints
repository_type Digital Repository
spelling curtin-20.500.11937-657412018-02-19T08:06:52Z Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments Nordholm, Sven This paper presents a new blind speech separation algorithm using beamforming technique that is capable of extracting each individual speech signal from a mixture of three speech sources in a room. The speech separation algorithm utilizes the steered response power phase transform for obtaining a localization estimate for each individual speech source in the frequency domain. Based on those estimates each desired speech signal is extracted from the speech mixture using an optimal beamforming technique. To solve the permutation problem, a permutation alignment algorithm based on the mutual output correlation is employed to group the output signals into the correct sources from each frequency bin. Evaluations using real speech recordings in a room environment show that the proposed blind speech separation algorithm offers high interference suppression level whilst maintaining low distortion level for each desired signal. 2017 Journal Article http://hdl.handle.net/20.500.11937/65741 10.1007/s40595-016-0085-x springer unknown
spellingShingle Nordholm, Sven
Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
title Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
title_full Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
title_fullStr Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
title_full_unstemmed Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
title_short Source separation employing beamforming and SRP-PHAT localization in three-speaker room environments
title_sort source separation employing beamforming and srp-phat localization in three-speaker room environments
url http://hdl.handle.net/20.500.11937/65741