Speaker Verification based on Single Channel Speech Separation
نویسندگان
چکیده
In multi-speaker scenarios, speech processing tasks like speaker identification and recognition are susceptible to noise overlapped voices. As the voices a complicated mixture of signals, target extraction method from this is good front end solution for further understanding classifying. The quality separation can be assessed by ratio or subjective scoring also accuracy downstream identification. order make model more adapted complex overlapping research investigates incorporate with voiceprint task. This paper proposes feature-scale single channel network connected back verification MFCCT feature, so indicates datasets prepared synthesizing Voxceleb1 data, used training testing. results show that using an objective evaluation effectively improve overall performance, as optimized significantly reduced error rate verification.
منابع مشابه
Speaker separation using visual speech features and single-channel audio
This work proposes a method of single-channel speaker separation that uses visual speech information to extract a target speaker’s speech from a mixture of speakers. The method requires a single audio input and visual features extracted from the mouth region of each speaker in the mixture. The visual information from speakers is used to create a visually-derived Wiener filter. The Wiener filter...
متن کاملCatalog-based single-channel speech-music separation
We propose a new catalog-based speech-music separation method for background music removal. Assuming that we know a catalog of the background music, we develop a generative model for the superposed speech and music spectrograms. We represent the speech spectrogram by a Non-negative Matrix Factorization (NMF) model and the music spectrogram by a conditional Poisson Mixture Model (PMM). By choosi...
متن کاملA Generalized Approach for Model-based Speaker-dependent Single Channel Speech Separation
Abstract– In this paper, we present a new technique for separating two speech signals received from one microphone or one communication channel. In this special case, the separation problem is too ill-conditioned to be handled with common blind source separation techniques. The proposed technique is a generalized approach to model-based speaker-dependent single channel speech separation techniq...
متن کاملWavelet-based speaker change detection in single channel speech data
Speaker segmentation is the task of finding speaker turns in an audio stream. We propose a metric-based algorithm based on Discrete Wavelet Transform (DWT) features. Principal component analysis (PCA) or linear discriminant analysis (LDA) [1] are further used to reduce the dimensionality of the feature space and remove redundant information. In the experiments our methods referred to as DWT-PCA...
متن کاملSingle-speaker/multi-speaker co-channel speech classification
The demand for content-based management and real-time manipulation of audio data is constantly increasing. This paper presents a method to identify temporal regions, in a segment of co-channel speech, as being either single-speaker or multispeaker speech. The state of the art approach for this purpose is the kurtosis. In this paper, a set of complementary time-domain and frequency-domain featur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3287868