Variable pronunciations reveal dynamic intra-speaker variation in speech planning
Similar resources
Learning speaker-specific pronunciations of disordered speech
One of the main clinical applications of speech technology is in voice-enabled assistive technology for people with disordered speech. Progress in this area is hampered by a sparsity of suitable data, and recent research has focused on ways of incorporating knowledge about typical (i.e., unimpaired) speech through the use of, e.g., deep belief neural networks. This paper presents a new way of...
Robust Speech Recognition Usin Intra-speaker Ada
Inter-speaker variation can be handled rather well in speech recognition by speaker adaptation techniques such as MLLR and MAP. However, when dealing with speech other than read speech, such as conversational or emotional speech, current recognition systems cannot achieve satisfactory performance even after speaker adaptation. In view of this situation, two-level adaptation met...
Intra-speaker variation and units in human speech perception and ASR
Research on speech perception and ASR has resulted in several important advances in our understanding of speech variation: one is that speaker-dependent variation is systematic; another is that inter-speaker and intra-speaker variation diverge in their root causes and characteristics. Therefore, a successful approach to one may not always transfer to the other. Intertalker variation, or indexical ...
Minimizing Speaker Variation Effects for Speaker-Independent Speech Recognition
1. INTRODUCTION. For speaker-independent speech recognition, speaker variation is one of the major error sources. As a typical example, the error rate of a well-trained speaker-dependent speech recognition system is three times less than that of a speaker-independent speech recognition system [11]. To minimize speaker variation effects, we can use either speaker-clustered models [28, 1...
Modeling dynamic prosodic variation for speaker verification
Statistics of frame-level pitch have recently been used in speaker recognition systems with good results [1, 2, 3]. Although they convey useful long-term information about a speaker’s distribution of f0 values, such statistics fail to capture information about local dynamics in intonation that characterize an individual’s speaking style. In this work, we take a first step toward capturing such ...
Journal
Journal title: Psychonomic Bulletin & Review
Year: 2021
ISSN: 1069-9384,1531-5320
DOI: 10.3758/s13423-021-01886-0