Comparative evaluation of CA for subband cocktail-party
نویسنده
چکیده
For speech segregation, a recurrent blind separation model (BSS) is tested together with a Computational Auditory Scene Analysis (CASA) model, which is based on the localisation cue and the evaluation of the Time Delay Of Arrival (TDOA). The test database is composed of 332 binary mixture sentences recorded in stereo with a static set-up. These are truncated at 1 second for the simulations. For applying the two models, we divide the frequency domain into a variable number of subbands, which are processed independently. Then, we evaluate the gain, using reference signals recorded in isolation. After a careful analysis, we find similar gains of about 2-3dB for both methods. The variation of the number of subbands allows an optimisation, and we obtain a significant peak at 4 subbands for the CASA model, as well as a maximum at 2 subbands for the BSS model.
منابع مشابه
Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation
For speech segregation, a blind separation model (BSS) is tested together with a CASA model which is based on the localisation cue and the evaluation of the time delay of arrival (TDOA). The test database is composed of 332 binary mixture sentences recorded in stereo with a static set-up. These are truncated at 1 second for the simulations. For applying the two models, we cut the frequency doma...
متن کاملA Casa Front-end Using the Localisation Cue for Segregation and Then Cocktail-party Speech Recognition
We propose and test a cocktail-party recognition technique based on segregation applied before recognition. This CASA front-end uses the TDOA (Time Delay Of Arrival) evaluated within subbands in order to determine the Relative Level (RL) of two competing speech sources. To perform the evaluation of the model, we have recorded a stereo database ST-NB95 from the mono Numbers95. This is composed o...
متن کاملA CASA-labelling model using the localisation cue for robust cocktail-party speech recognition
We propose a new cocktail-party recognition technique based on the coupling of a CASA-labelling method using the TDOA (Time Delay Of Arrival) with multistream recognition. This is an alternative to the classical "segregate and recognise" architecture. First, we have recorded a stereo database ST-NB95 from the mono Numbers95. This is composed of binary mixtures of sentences at 0dB, placed left a...
متن کاملEvaluation of CASA and BSS models for subband cocktail-party speech separation
For speech segregation, a recurrent blind separation model (BSS) is tested together with a CASA model, which is based on the localisation cue and the evaluation of the time delay of arrival (TDOA). The test database is composed of 332 binary mixture sentences recorded in stereo with a static set-up. These are truncated at 1 second for the simulations. For applying the two models, we cut the fre...
متن کاملMeasured Performance for Real-Time Localization of Cocktail-Party Talkers
Technology improvements, hardware, software and algorithmic, have made the use of a largeaperture microphone array cost effective. In this paper we present real, measured results for our wired, 128microphone array that surrounds a focal area (room) of about 7Mx5M. While it was necessary to evaluate the performance of the array offline using the array’s recording feature, we ensured that all the...
متن کامل