Developing usable speech criteria for speaker identification technology
نویسندگان
چکیده
Recently, a “usable speech” extraction system [1] was proposed to separate co-channel speech into “usable” frames that are minimally corrupted by interfering speech. Studies indicate [2] that a significant amount of cochannel speech can be considered “usable” for speaker identification (SID). Therefore, it is necessary to establish criteria for usable speech frames for SID. Voiced speech, of which usable speech is entirely comprised, is shown to be information rich for SID. In addition, SID accuracy increases as the frame-based Target to Interferer Ratio (TIR) increases when evaluated independently of the amount of available segments. Recent work [3] develops a frame-based Spectral Autocorrelation Ratio (SAR) technique for determining usable frames within co-channel speech. The ability of the SAR method to determine usable frames at various thresholds is examined. This paper investigates the effectiveness of a frame-based usable speech extraction technique for speaker identification.
منابع مشابه
Usable Speech Assignment for Speaker Identification System
Usable speech criteria are proposed to extract minimally corrupted speech for speaker identification in cochannel speech. Extracted usable segments are separated in time and need to be organized into speaker streams for speaker identification system. In this paper, we focus to organize extracted usable speech segment into a single stream for the same speaker by speaker assignment system. We ext...
متن کاملUsable Speech Assignment for Speaker Identification under Co-Channel Situation
Usable speech criteria are proposed to extract minimally corrupted speech for speaker identification (SID) in co-channel speech. In co-channel speech, either speaker can randomly appear as the stronger speaker or the weaker one at a time. Hence, the extracted usable segments are separated in time and need to be organized into speaker streams for SID. In this paper, we focus to organize extracte...
متن کاملCo-channel speaker identification using usable speech extraction based on multi-pitch tracking
Recently, usable speech criteria [1] are proposed to extract minimally corrupted speech for speaker identification (SID) in co-channel speech. In this paper, we propose a new usable speech extraction method to improve the SID performance under the co-channel situation based on the pitch information obtained from a robust multi-pitch tracking algorithm [2]. The idea is to retain the speech segme...
متن کاملEvaluation of a Multi-Resolution Dyadic Wavelet Transform Method for usable Speech Detection
Many applications of speech communication and speaker identification suffer from the problem of co-channel speech. This paper deals with a multi-resolution dyadic wavelet transform method for usable segments of co-channel speech detection that could be processed by a speaker identification system. Evaluation of this method is performed on TIMIT database referring to the Target to Interferer Rat...
متن کاملUsable speech measures and their fusion
Usable speech is a novel concept related to the co-channel speech problem. Co-channel speech occurs when more than one person is talking at the same time. The idea of usable speech is to identify and extract those portions of co-channel speech that are still useful for speech processing applications such as speaker identification or speech recognition, which do not work in cochannel environment...
متن کامل