A pitch-based spectral enhancement technique for robust speech processing
نویسندگان
چکیده
This paper presents a new pitch-based spectral enhancement algorithm on voiced frames for speech analysis and noiserobust speech processing. The proposed algorithm determines a time-warping function (TWF) and the speaker’s pitch with high precision, simultaneously. This technique reduces the smearing effect in between harmonics when the fundamental frequency is not constant within the analysis window. To do so, we propose a metric called the harmonic residual which measures the difference between the actual spectrum and the resynthesized spectrum derived from the linear model of speech production with various combinations of TWF and high-precision pitch values as parameters. The TWF and pitch pair that yields the minimum harmonic residual is selected and the enhanced spectrum is obtained accordingly. We show how this new representation can be used for automatic speech recognition by proposing a robust spectral representation derived from harmonic amplitude interpolation.
منابع مشابه
Pitch synchronized speech processing (PSSP) for speaker recognition
A method for speech signal enhancement is developed with application to automatic speaker recognition where the signals have different channel conditions. The basis of this technique is a robust pitch detection algorithm that accurately estimates the instantaneous pitch rate, and extracts single pitch period speech segments. This technique of pitch synchronized speech processing (PSSP) provides...
متن کاملImproving the phase vocoder approach to pitch-shifting
A class of methods known as phase vocoders allows for implementing pitch shifting in the spectral domain. We extend the approach of shifting the isolated harmonics of the spectrum by introducing a new technique for separating the sinusoidal components. Keeping together the main lobe and the side lobes, which result from convolution of the harmonics with the spectrum of the analysis window in th...
متن کاملMorphological pre-processing technique and its applications on speech signal
The properties and applications of morphological filters for speech analysis are investigated. We introduce and investigate a novel nonlinear spectral envelope estimation method based on morphological operations, which is found to be very robust against noise. This method is also compared with the spectral envelope estimation vocoder (SEEVOC) method. A simple method for the optimum selection of...
متن کاملClassification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کاملEsophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain
Altough esophageal speech has demonstrated to be the most popular voice recovering method after laryngectomy surgery, it is difficult to master and shows a poor degree of intelligibility. This article proposes a new method for esophageal voice enhancement using speech digital signal processing techniques based on modeling radiated voice pulses in frequency domain. The analysis-transformation-sy...
متن کامل