Co-channel speaker identification using usable speech extraction based on multi-pitch tracking
نویسندگان
چکیده
Recently, usable speech criteria [1] are proposed to extract minimally corrupted speech for speaker identification (SID) in co-channel speech. In this paper, we propose a new usable speech extraction method to improve the SID performance under the co-channel situation based on the pitch information obtained from a robust multi-pitch tracking algorithm [2]. The idea is to retain the speech segments that have only one pitch detected and remove the others. The system is evaluated on co-channel speech and results show a significant improvement across various Target to Interferer Ratios (TIR) for speaker identification.
منابع مشابه
Evaluation of a Multi-Resolution Dyadic Wavelet Transform Method for usable Speech Detection
Many applications of speech communication and speaker identification suffer from the problem of co-channel speech. This paper deals with a multi-resolution dyadic wavelet transform method for usable segments of co-channel speech detection that could be processed by a speaker identification system. Evaluation of this method is performed on TIMIT database referring to the Target to Interferer Rat...
متن کاملDeveloping usable speech criteria for speaker identification technology
Recently, a “usable speech” extraction system [1] was proposed to separate co-channel speech into “usable” frames that are minimally corrupted by interfering speech. Studies indicate [2] that a significant amount of cochannel speech can be considered “usable” for speaker identification (SID). Therefore, it is necessary to establish criteria for usable speech frames for SID. Voiced speech, of wh...
متن کاملUsable Speech Assignment for Speaker Identification under Co-Channel Situation
Usable speech criteria are proposed to extract minimally corrupted speech for speaker identification (SID) in co-channel speech. In co-channel speech, either speaker can randomly appear as the stronger speaker or the weaker one at a time. Hence, the extracted usable segments are separated in time and need to be organized into speaker streams for SID. In this paper, we focus to organize extracte...
متن کاملLocal Linear Wavelet Neural Network and RLS for Usable Speech Classification
While operating in a co -channel environment, the accuracy of the speech processing technique degrades. When more than one person is talking at same time, then there occurs the co-channel speech. The objective of usable speech segmentation is identification and extraction of those portions of co-channel speech that are degraded in a negligible range but still needed for various speech processin...
متن کاملUsable speech measures and their fusion
Usable speech is a novel concept related to the co-channel speech problem. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech is to identify and extract those portions of co-channel speech that are still useful for speech processing applications such as speaker identification or speech recognition, which do not work in cochannel environment...
متن کامل