Cross-Lingual Study of ASR Errors: On the Role of the Context in Human Perception of Near-Homophones

نویسندگان

  • Ioana Vasilescu
  • Dahbia Yahia
  • Natalie D. Snoeren
  • Martine Adda-Decker
  • Lori Lamel
چکیده

It is widely acknowledged that human listeners significantly outperform machines when it comes to transcribing speech. This paper presents a paradigm for perceptual experiments that aims to increase our understanding of human and automatic speech recognition errors. The role of the context length is investigated through perceptual recovery of small homophonic words or near-homophones yielding frequent automatic transcription errors. The same experimental protocol of varied size speech stimuli transcription is applied to both French and English. Our hypothesis is that ambiguity due to homophonic words reduces with context size for both languages, which in turn should entail reduced perception and transcription errors. The results show that context plays a central role as the human word error rate decreases significantly with increasing context. The long-term aim is to improve the modelling of such ambiguous items to reduce automatic errors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A perceptual investigation of speech transcription errors involving frequent near-homophones in French and american English

This article compares the errors made by automatic speech recognizers to those made by humans for near-homophones in American English and French. This exploratory study focuses on the impact of limited word context and the potential resulting ambiguities for automatic speech recognition (ASR) systems and human listeners. Perceptual experiments using 7-gram chunks centered on incorrect or correc...

متن کامل

Assessment of the probability of human error occurring in the process of appendectomy operation using SPAR-H method

1.Ochr('39')Connor PO, Keogh IJ. Addressing human error within the Irish healthcare system. Irish Medical Journal. 2011;104(1):5-6. 2. Jahangiri M, Hoboubi N, Rostamabadi A, Keshavarzi S, Hosseini AA. Human error analysis in a permit to work system: a case study in a chemical plant. Safety and Health  at Work. 2016;7(1):6-11. 3. Edmondson AC. Learning from mistakes is easier said than done: G...

متن کامل

INVESTIGATING THE ROLE OF CAUSATIVIZATION IN OVERPASSIVIZATION OF UN-ACCUSATIVE VERBS BY IRANIAN ENGLISH MAJORS

The current study aims at exploring the role of causativization as one of the causes stated in the literature for overpassivization of English unaccusatives in an Iranian context.The study was conducted using three data collection procedures, an Oxford Placement Test, a Grammaticality Judgment Task, and a Production Task. The results revealed that causativization errors with non-alternating una...

متن کامل

Cross-lingual studies of ASR errors: paradigms for perceptual evaluations

It is well-known that human listeners significantly outperform machines when it comes to transcribing speech. This paper presents a progress report of the joint research in the automatic vs human speech transcription and of the perceptual experiments developed at LIMSI that aims to increase our understanding of automatic speech recognition errors. Two paradigms are described here in which human...

متن کامل

سنجش کیفیت منظر مسیرهای پیاده با استفاده از تکنیک یادداشت‌برداری و تحلیل عناصر بصری منظر (نمونه موردی: بافت تاریخی هارونیه اصفهان)

Abstract Old and historical contexts always consider as the major and historical and cultural heritage of cities. Nowadays the role of urban landscape in historical contexts and pedestrian paths is greatly considered by the researchers in urban landscaping, designing and planning. Urban landscape is cultural capitals of the cities and pedestrians are quite interested in it. The rate of effec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011