A phoneme recognition framework based on auditory spectro-temporal receptive fields
نویسندگان
چکیده
We propose to incorporate features derived using spectrotemporal receptive fields (STRFs) of neurons in the auditory cortex for phoneme recognition. Each of these STRFs is tuned to different auditory frequencies, scales and modulation rates. We select different sets of STRFs which are specific for phonemes in different broad phonetic classes (BPC) of sounds. These STRFs are then used as spectro-temporal filters on spectrograms of speech to extract features for phoneme recognition. For the phoneme recognition task on the TIMIT database, the proposed features show a relative improvement of about 5% over conventional feature extraction techniques.
منابع مشابه
Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain
This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...
متن کاملRobust phoneme recognition based on biomimetic speech contours
It has been previously suggested that ensembles of central auditory neurons optimize a sustained firing criterion as part of the underlying neural code for representing sound. Moreover, computational studies have shown that optimizing such a criterion yields ensembles of spectro-temporal receptive fields akin to those observed in physiological studies. In this study, we show that these emergent...
متن کاملIdealized Computational Models for Auditory Receptive Fields
We present a theory by which idealized models of auditory receptive fields can be derived in a principled axiomatic manner, from a set of structural properties to (i) enable invariance of receptive field responses under natural sound transformations and (ii) ensure internal consistency between spectro-temporal receptive fields at different temporal and spectral scales. For defining a time-frequ...
متن کاملScale-Space Theory for Auditory Signals
We show how the axiomatic structure of scale-space theory can be applied to the auditory domain and be used for deriving idealized models of auditory receptive fields via scale-space principles. For defining a time-frequency transformation of a purely temporal signal, it is shown that the scale-space framework allows for a new way of deriving the Gabor and Gammatone filters as well as a novel f...
متن کاملPrincipal components of auditory spectro-temporal receptive fields
More than two thousand auditory cortical spectro-temporal receptive fields (STRFs) of the ferret were analysed by Principal Component Analysis (PCA) to reveal their dominant properties. Results show that cortical levels of mammalian auditory processing enhance relatively low modulation spectral components of the signal around 3 Hz, using relatively broad spectral processing channels of the orde...
متن کامل