Porting: SwitchBoard to the VoiceMail task
Authors
Abstract
This paper examines techniques that allow a well-trained source system built on one task to be rapidly adapted, or ported, to a different target task. The two tasks considered are Hub5, or Switchboard, as the source and VoiceMail as the target. The two tasks are acoustically similar, both being telephone-bandwidth speech tasks, but differ in speaking style: Switchboard is conversational speech, whereas VoiceMail is a set of voicemail messages. Various porting schemes for acoustic models are examined, including discriminative MAP and heteroscedastic LDA. Using around 28 hours of data, the error rate on the VoiceMail task was reduced by 42% relative to the baseline Switchboard performance.
Related resources
Recent improvements in speech recognition performance on large vocabulary conversational speech (voicemail and switchboard)
In this paper we report recent improvements in word error performance on a voicemail transcription task. Last year, the speaker independent word error rate (WER) on the dev test set of the Voicemail Transcription task was reported at 35.45% [1]. This year, we report a relative 20% gain over this number. The improvements were obtained using several new algorithms and an increased amount of train...
Transcription of New Speaking Styles - Voicemail
In this paper we describe a new testbed for developing speech recognition algorithms: a VoiceMail transcription task, analogous to other tasks such as the Switchboard, CallHome [1] and the Hub 4 tasks [2], which are currently used by speech recognition researchers. Spontaneous speech occurring in day-to-day life can broadly be classified into two categories: (i) where the speaker does not receive an...
Performance Improvements in Voicemail Transcription
In this paper we report recent improvements in word error performance on a voicemail transcription task. Last year, the speaker independent word error rate (WER) on the dev test set of the Voicemail Transcription task was reported at 35.45% [1]. This year, we report a relative 20% gain over this number. The improvements were obtained using several new algorithms and an increased amount of train...
Speech recognition performance on a new voicemail transcription task
In this paper we describe a new testbed for developing speech recognition algorithms: a VoiceMail transcription task, analogous to other tasks such as the Switchboard, CallHome, and the Hub 4 tasks, which are currently used by speech recognition researchers. We describe the collection and use of a new VoiceMail database (that is available to the research community through the LDC), and also desc...
Recent improvements in voicemail transcription
In this paper we report recent improvements in voicemail transcription. The voicemail transcription task was introduced last year [1] as representing a style of conversational telephone speech that is somewhat different from the Switchboard and CallHome [2] databases. Last year, the speaker independent and speaker adapted word error rates (WER) on this task were reported at 41.94% and 38.18% re...