Single Channel Speaker Segregation using Sinusoidal Residual Modeling
نویسنده
چکیده
In this paper we address the two speaker segregation problem in a single channel paradigm using sinusoidal residual modeling. An appropriate selection of the number of sine waves, window length and hysteresis threshold, is done so as to model and synthesize the underlying signal corresponding to the speaker with the lower pitch period, using an amplitude only sine wave synthesis. The sinusoidal residual is then computed after restimating the phases with known amplitudes, by minimizing a criterion function. This residual corresponds to the the speaker with the higher pitch period. But such a residual consists of harmonic components of the speaker with the lower pitch period. We therefore estimate a binary mask from the spectrograms of the synthesized signal and the residual using a min-max technique to further improve the quality of the segregated speech. This segregation technique is then integrated into a co-channel speaker identification system, at various target to interference ratios. Reasonable improvements in identification performance are noted from these experiments.
منابع مشابه
Title of Document : MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING
Title of Document: MAXIMUM LIKELIHOOD PITCH ESTIMATION USING SINUSOIDAL MODELING Vijay Mahadevan, Master of Science, 2010 Directed By: Dr. Carol Y. Espy-Wilson Department of Electrical and Computer Engineering The aim of the work presented in this thesis is to automatically extract the fundamental frequency of a periodic signal from noisy observations, a task commonly referred to as pitch estim...
متن کاملSpeaker Independent Single Channel Source Separation using Sinusoidal Features
Model-based approaches to achieve Single Channel Source Separation (SCSS) have been reasonably successful at separating two sources. However, most of the currently used model-based approaches require pre-trained speaker specific models in order to perform the separation. Often, insufficient or no prior training data may be available to develop such speaker specific models, necessitating the use...
متن کاملRestoring the Residual Speaker Information in Total Variability Modeling for Speaker Verification
In this paper, we introduce the residual space into the Total Variability Modeling by assuming that the speaker super-vectors are not totally contained in a linear subspace of low dimension. Thus the feature reduction carried out by Probabilistic Principal Component Analysis(PPCA) leads to information loss including information of speaker as well as channel. We add the residual factor to restor...
متن کاملExtending Spectral Modeling Synthesis with Transient Modeling Synthesis
Sinusoidal modeling has enjoyed a rich history in both speech and music applications, including sound transformations, compression, denoising, and auditory scene analysis. For such applications, the underlying signal model must efficiently capture salient audio features (Goodwin 1998). In this article, we present an accurate, efficient, and flexible three-part model for audio signals consisting...
متن کاملModeling of Upward Subcooled Flow Boiling of Refrigerant-113 in a Vertical Annulus
In this paper, a modified two-fluid model has been adopted to simulate the process of upward vertical subcooled flow boiling of refrigerant R-113 in a vertical annular channel at low pressure. The modified model considers temperature dependent properties and saturation temperature variation and was validated against a number of published low-pressure subcooled boiling experiments. The results s...
متن کامل