Structure to speech conversion - speech generation based on infant-like vocal imitation
نویسندگان
چکیده
This paper proposes a new framework of speech generation by imitating “infants’ vocal imitation”. Most of the speech synthesizers take a phoneme sequence as input and generate speech by converting each of the phonemes into a sound sequentially. In other words, they simulate a human process of reading text out. However, infants usually acquire speech generation ability without text or phoneme sequences. Since their phonemic awareness is very immature, they can hardly decompose a word utterance into a sequence of phones. In this situation, as developmental psychology states, infants acquire the holistic sound pattern of words from the utterances of their parents, called word Gestalt, and they reproduce it with their vocal tubes. This behavior is called vocal imitation. In our previous studies, the word Gestalt was defined physically and a method of extracting it from an utterance was proposed and used successfully for ASR and CALL. In this paper, a method of converting the word Gestalt back to speech is proposed and evaluated. Unlike a reading machine, our proposal simulates infants’ vocal imitation.
منابع مشابه
Optimal event search using a structural cost function - improvement of structure to speech conversion
This paper describes a new and improved method for the framework of structure to speech conversion we previously proposed. Most of the speech synthesizers take a phoneme sequence as input and generate speech by converting each of the phonemes into its corresponding sound. In other words, they simulate a human process of reading text out. However, infants usually acquire speech communication abi...
متن کاملInfant vocalizations in response to speech: vocal imitation and developmental change.
Infants' development of speech begins with a language-universal pattern of production that eventually becomes language specific. One mechanism contributing to this change is vocal imitation. The present study was undertaken to examine developmental change in infants' vocalizations in response to adults' vowels at 12, 16, and 20 weeks of age and test for vocal imitation. Two methodological aspec...
متن کاملProposal of structure-to-speech conversion and its application to implementation of infants’ vocal imitation
Speech acoustics vary due to differences in age, gender, vocal tract length, microphone, and so on. The authors recently proposed a structural and abstract representation of speech, where these variations were effectively removed. This representation captures only dynamics of speech. In our previous study, using this abstract representation, a new framework of speech synthesis was proposed and ...
متن کاملHuman Speech Model Based on Information Separation — Collection or Separation, That is the Question. —
— Collection or Separation, That is the Question. — Nobuaki Minematsu Graduate School of Information Science and Technology, The University of Tokyo [email protected] Abstract This paper points out that no existing technically-implemented speech model is adequate enough to describe one of the most fundamental and unique capacities of human speech processing. Language acquisition of infa...
متن کاملHuman Speech Model Based on Information Separation — Collection
Abstract: This paper points out that no existing technically-implemented speech model is adequate enough to describe one of the most fundamental and unique capacities of human speech processing. Language acquisition of infants is based on vocal imitation [1] but they don’t impersonate their parents and imitate only the linguistic and para-linguistic aspects of the parents’ utterances. The vocal...
متن کامل