Speaker Identification using Row Mean of Haar and Kekre’s Transform on Spectrograms of Different Frame Sizes
نویسنده
چکیده
In this paper, we propose Speaker Identification using two transforms, namely Haar Transform and Kekre’s Transform. The speech signal spoken by a particular speaker is converted into a spectrogram by using 25% and 50% overlap between consecutive sample vectors. The two transforms are applied on the spectrogram. The row mean of the transformed matrix forms the feature vector, which is used in the training as well as matching phases. The results of both the transform techniques have been compared. Haar transform gives fairly good results with a maximum accuracy of 69% for both 25% as well as 50% overlap. Kekre’s Transform shows much better performance, with a maximum accuracy of 85.7% for 25% overlap and 88.5% accuracy for 50% overlap. Keywords-Speaker Identification; Spectrogram; Haar Transform; Kekre’s Transform; Row Mean; Euclidean distance
منابع مشابه
Speaker Identification using Spectrograms of Varying Frame Sizes
In this paper, a text dependent speaker recognition algorithm based on spectrogram is proposed. The spectrograms have been generated using Discrete Fourier Transform for varying frame sizes with 25% and 50% overlap between speech frames. Feature vector extraction has been done by using the row mean vector of the spectrograms. For feature matching, two distance measures, namely Euclidean distanc...
متن کاملSpeaker Identification Using 2-d Dct, Walsh and Haar on Full and Block Spectrogram
This paper aims to provide different approaches to text dependent speaker identification using DCT, Walsh and Haar transform along with use of spectrograms. Spectrograms obtained from speech samples are used as image database for the study undertaken. This image database is then subjected to various transforms. Using Euclidean distance as measure of similarity, most appropriate speaker match is...
متن کاملSpeaker Identification using Frequency Dsitribution in the Transform Domain
In this paper, we propose Speaker Identification using the frequency distribution of various transforms like DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), DST (Discrete Sine Transform), Hartley, Walsh, Haar and Kekre transforms. The speech signal spoken by a particular speaker is converted into frequency domain by applying the different transform techniques. The distributio...
متن کاملPerformance Comparison Of 2-D DCT On Full/Block Spectrogram And 1-D DCT On Row Mean Of Spectrogram For Speaker Identification
The goal of this paper is to present a very simple approach to text dependent speaker identification using a combination of spectrograms and well known Discrete Cosine Transform (DCT). This approach is based on use of DCT to find similarities between spectrograms obtained from speech samples. The set of spectrograms forms the database for our experiments rather than raw speech samples. Performa...
متن کاملNovel Approach to Text Independent Speaker Identification
In this paper, we propose Speaker Identification using two transforms, namely Kekre’s Transform and Kekre’s Wavelet Transform. The speech signal is spoken by a particular speaker is converted into a spectrogram by first taking 25 or 50% overlapping frames between consecutive sample vectors and arranged in the form of matrix. In order to improve the performance, intially log of matrix is taken a...
متن کامل