Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
author
Abstract:
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tracking parameters were considered in secondary features. The new architecture was proposed for phoneme classification by a combining classifier using both tracked and energy-based features. Clustered based spectro-temporal features vectors were used for the classification of several subsets of TIMIT database phonemes. The results show that the phoneme classification rate was improved Using tracked spectro-temporal features. The results were improved to 78.9% on voiced plosives classification which was relatively 3.3% higher than the results of non-tracked spectro-temporal feature vectors. The results on other subsets of phonemes showed good improvement in classification rate too.
similar resources
Bioinspired sparse spectro-temporal representation of speech for robust classification
In this work, a first approach to a robust phoneme recognition task by means of a biologically-inspired feature extraction method is presented. The proposed technique provides an approximation to the speech signal representation at the auditory cortical level. It is based on an optimal dictionary of atoms, estimated from auditory spectrograms, and the Matching Pursuit algorithm to approximate t...
full textSpectro-temporal Modulations for Robust Speech Emotion Recognition Spectro-temporal Modulations for Robust Speech Emotion Recognition
full text
Aging and Spectro-Temporal Integration of Speech
The purpose of this study was to determine the effects of age on the spectro-temporal integration of speech. The hypothesis was that the integration of speech fragments distributed over frequency, time, and ear of presentation is reduced in older listeners-even for those with good audiometric hearing. Younger, middle-aged, and older listeners (10 per group) with good audiometric hearing partici...
full textSpectro-temporal analysis of speech using 2-d Gabor filters
We present a 2-D spectro-temporal Gabor filterbank based on the 2-D Fast Fourier Transform, and show how it may be used to analyze localized patches of a spectrogram. We argue that the 2-D Gabor filterbank has the capacity to decompose a patch into its underlying dominant spectro-temporal components, and we illustrate the response of our filterbank to different speech phenomena such as harmonic...
full textHilbert envelope based spectro-temporal features for phoneme recognition in telephone speech
In this paper, we present a spectro-temporal feature extraction technique using sub-band Hilbert envelopes of relatively long segments of speech signal. Hilbert envelopes of the sub-bands are estimated using Frequency Domain Linear Prediction (FDLP). Spectral features are derived by integrating the sub-band Hilbert envelopes in short-term frames and the temporal features are formed by convertin...
full textHierarchical Spectro-Temporal Models for Speech Recognition
We seek to explore computational approaches for audition that are inspired by computational visual neuroscience. In particular, we seek to leverage recent progress over the past few years in building a biologically-faithful hierarchical, feed-forward system for visual object recognition [13,14]. The system, which was designed to closely match the currently known feed-forward path in the ventral...
full textMy Resources
Journal title
volume 33 issue 1
pages 105- 111
publication date 2020-01-01
By following a journal you will be notified via email when a new issue of this journal is published.
Hosted on Doprax cloud platform doprax.com
copyright © 2015-2023