A new prosody annotation protocol for live sports commentaries

Authors

  • Sandrine Brognaux
  • Benjamin Picart
  • Thomas Drugman

Abstract

This paper proposes a new prosody annotation protocol specific to live sports commentaries. Two levels of annotation are defined with HMM-based speech synthesis in view. Local labels are assigned to all syllables and refer to accentual phenomena. Global labels classify sequences of words into five distinct subgenres, defined in terms of valence and arousal. The objective of the study is to provide a set of labels that are each related to a specific function and characterized by a distinct acoustic realization. Meeting these constraints should allow the labels to be predicted automatically from either the text or the speech signal. Reasonable inter-annotator agreement scores are achieved for both annotation levels. A prosodic analysis of all labels also shows that they can usually be distinguished by specific acoustic realizations. The integration of this new annotation protocol within HMM-based speech synthesis shows promising results.
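
As an illustration of how an inter-annotator score for the local (syllable-level) labels could be computed, the sketch below uses Cohen's kappa. This is an assumption: the abstract does not say which agreement measure the authors used, and the label names ("none", "accent", "emphasis") and the function name `cohen_kappa` are hypothetical placeholders, not the protocol's actual label set.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labelled the same items.

    Assumption: the abstract does not name its agreement measure;
    kappa is used here only as a common choice for categorical labels.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty item list")
    n = len(labels_a)
    # Proportion of items on which the two annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical syllable-level labels from two annotators (not from the paper).
annotator_1 = ["none", "accent", "none", "emphasis", "none", "accent"]
annotator_2 = ["none", "accent", "none", "accent", "none", "accent"]
kappa = cohen_kappa(annotator_1, annotator_2)
```

The same function could be applied to the global labels by passing one subgenre label per word sequence instead of one label per syllable.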

Similar resources

HMM-based speech synthesis of live sports commentaries: integration of a two-layer prosody annotation

This paper proposes the integration of a two-layer prosody annotation specific to live sports commentaries into HMM-based speech synthesis. Local labels are assigned to all syllables and refer to accentual phenomena. Global labels categorize sequences of words into five distinct speaking styles, defined in terms of valence and arousal. Two stages of the synthesis process are analyzed. First, the...

Synthesizing sports commentaries: One or several emphatic stresses?

Emphatic stresses are known to fulfill essential functions in expressive speech. Their integration in speech synthesis usually relies on a prosodic annotation of the training corpus. Emphasized syllables are then assigned a single label or can receive several labels according to their acoustic realization. While it is more complex to predict those various labels for a new text to synthesize, it...

Semantic Interpretation of Events in Live Soccer Commentaries

English. In the context of semantic interpretation of live soccer commentaries in Italian, we propose an annotation schema for relevant events and their argument structure, on whose basis we annotated a reference evaluation corpus. We investigated automatic event classification and used Active Learning to reduce the cost of acquiring domain-specific training data. Italiano. Nel contesto dell’in...

A Bootstrapping Approach to Automating Prosodic Annotation for Limited-domain Synthesis

Most speech synthesis systems use symbolic prosody labels for marking emphasis and phrase structure, but in corpus-based approaches prosodic annotation of speech is a labor intensive process driving up the cost of development of new voices. This paper explores the potential for reducing that cost by using a bootstrapping approach to automatic prosodic annotation, particularly in a limited domai...

Publication date: 2013