Intonational phrases for speech summarization

نویسندگان

  • Sameer Maskey
  • Andrew Rosenberg
  • Julia Hirschberg
چکیده

Extractive speech summarization approaches select relevant segments of spoken documents and concatenate them to generate a summary. The extraction unit chosen, whether a sentence, syntactic constituent, or other segment, has a significant impact on the overall quality and fluency of the summary. Even though sentences tend to be the choice of most the extractive speech summarizers, in this paper, we present the results of an empirical study indicating that intonational phrases are better units of extraction for summarization. Our study compared four types of input segmentation: sentences, two pause-based segmentation, and intonational phrases (IP). We found that IPs are the best candidates for extractive summarization, improving over the second highest-performing approach, sentence-based summarization, by 8.2% F-measure.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Pitch patterns of intonational phrases and intonational phrase groups in native and non-native speech

We examined pitch patterns within and across intonational phrases of Japanese read aloud by native and non-native (Mandarin Chinese) speakers. Japanese speakers change pitch ranges for each intonational phrase. The relative pitch ranges of neighboring intonational phrases indicate which intonational phrase belongs to which intonational phrase group. Chinese speakers are unable to acoustically c...

متن کامل

Syntactic and prosodic parenthesis

This paper examines the view that parentheticals obligatorily form an intonational phrase and break up the intonational phrase of the matrix sentence into two intonational phrases. The analysis of spontaneous speech data of Hamburg German shows that neither do all parentheticals form a distinct intonational phrase nor do all parentheticals break up the intonational phrase of the matrix sentence...

متن کامل

Modeling spontaneous speech events during recognition

In spontaneous speech, speakers segment their speech into intonational phrases, and make repairs to what they are saying. However, techniques for understanding spontaneous speech tend to treat these events as noise, in the same manner as they handle out-of-grammar constructions and misrecognitions. In our approach, we advocate that these events should be explicitly modeled. We modify the speech...

متن کامل

Prosody in a corpus of French spontaneous speech: perception, annotation and prosody ~ syntax interaction

Our study focuses on the issue of prosodic annotation and of the prosody ~ syntax interface in conversation and is based on a large corpus of conversational speech in French. The results of inter-transcriber agreement tests show that two expert transcribers are consistent in their labeling of prosodic phrasing and the consistency is well above the chance. A qualitative analysis reveals transcri...

متن کامل

Length, ordering preference and intonational phrasing: evidence from pauses

This paper reports a speech production experiment in which the effects of surrounding phrase lengths and head-argument distance on intra-sentential pause duration were tested. While the results confirm an effect of phrase length on pausing, this effect is found to be distinctly stronger for long phrases preceding the pause than for long upcoming phrases. The results are discussed with respect t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008