An Auditory Model of Speaker Size Perception for Voiced Speech Sounds
نویسندگان
چکیده
An auditory model was developed to explain the results of behavioral experiments on perception of speaker size with voiced speech sounds. It is based on the dynamic, compressive gammachirp (dcGC) filterbank and a weighting function (SSI weight) derived from a theory of size-shape segregation in the auditory system. Voiced words with and without highfrequency emphasis (+6 dB/octave) were produced using a speech vocoder (STRAIGHT). The SSI weighting function reduces the effect of glottal pulse excitation in voiced speech, which, in turn, makes it possible for the model to explain the individual subject variability in the data.
منابع مشابه
The Effect of Spectral Tilt on Size Discrimination of Voiced Speech Sounds
A number of studies, with either voiced or unvoiced speech, have demonstrated that a speaker’s geometric mean formant frequency (MFF) has a large effect on the perception of the speaker’s size, as would be expected. One study with unvoiced speech showed that lifting the slope of the speech spectrum by 6 dB/octave also led to a reduction in the perceived size of the speaker. This paper reports a...
متن کاملEpoch Extraction of Voiced Speech
A general theory of epoch extraction of overlapping nonidentical waveforms is presented. The theory is applied to outputs of models of voiced speech production mechanism and to actual speech data. Some typical glottal waveshapes are considered to explain their effect on the speech output. It is shown that the points of excitation of the vocal tract can be precisely identified for continuous spe...
متن کاملConstraints on the Transfer of Perceptual Learning in Accented Speech
The perception of speech sounds can be re-tuned through a mechanism of lexically driven perceptual learning after exposure to instances of atypical speech production. This study asked whether this re-tuning is sensitive to the position of the atypical sound within the word. We investigated perceptual learning using English voiced stop consonants, which are commonly devoiced in word-final positi...
متن کاملSpecialization of left auditory cortex for speech perception in man depends on temporal coding.
Speech perception requires cortical mechanisms capable of analysing and encoding successive spectral (frequency) changes in the acoustic signal. To study temporal speech processing in the human auditory cortex, we recorded intracerebral evoked potentials to syllables in right and left human auditory cortices including Heschl's gyrus (HG), planum temporale (PT) and the posterior part of superior...
متن کاملNasality in speech and its contribution to speaker individuality
The term nasality refers to the timbre of the nasal phonemes. It is also used to express the quality of sound that characterises some speakers. In this paper, we propose to classify nasality in natural speech into four types: phonemic nasality, nasality in assimilation, incidental nasality in the production of voiced plosives, and nasality associated with speaker individuality. Speech sounds re...
متن کامل