DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains
نویسندگان
چکیده
In this paper, we present a summa.rization system for spontaneous dialogues which consists of a novel multi-stage architectm'e. It is specifically aimed at addressing issues related to tlle nature of the l;exts being spoken vs. written and being diMogical vs. monologica.l. The system is embedded in a. graphical user interface ~md was developed and tested on transcripts of recorded telephone conversations in English and Spanish (CAI,LHOMI,;).
منابع مشابه
Using Speech-Specific Characteristics for Automatic Speech Summarization
In this thesis we address the challenge of automatically summarizing spontaneous, multi-party spoken dialogues. The experimental hypothesis is that it is advantageous when summarizing such meeting speech to exploit a variety of speech-specific characteristics, rather than simply treating the task as text summarization with a noisy transcript. We begin by investigating which term-weighting metri...
متن کاملDomain adaptation with augmented space method for multi-domain contact center dialogue summarization
In this paper we propose a method to improve the quality of extractive summarization for contact center dialogues in various domains by making use of training samples whose domains are different from that of the test samples. Since preparing sufficient numbers of training samples for each domain is too expensive, we leverage references from many different domains and employ the Augmented Space ...
متن کاملLearning to Model Domain-Specific Utterance Sequences for Extractive Summarization of Contact Center Dialogues
This paper proposes a novel extractive summarization method for contact center dialogues. We use a particular type of hidden Markov model (HMM) called Class Speaker HMM (CSHMM), which processes operator/caller utterance sequences of multiple domains simultaneously to model domain-specific utterance sequences and common (domainwide) sequences at the same time. We applied the CSHMM to call summar...
متن کاملTurn Segmentation into Utterances for Arabic Spontaneous Dialogues and Instance Messages
Text segmentation task is an essential processing task for many of Natural Language Processing (NLP) such as text summarization, text translation, dialogue language understanding, among others. Turns segmentation considered the key player in dialogue understanding task for building automatic HumanComputer systems. In this paper, we introduce a novel approach to turn segmentation into utterances...
متن کاملA flexible formal language for the orthographic transcription of spontaneous spoken dialogues
Orthographic transcriptions of speech are important in most fields of research concerned with spoken language. For spontaneous speech they have to be created manually, resulting potentially in inconsistent or erroneous transcriptions. We propose a new flexible and easy-to-use formal language for the orthographic transcription of spontaneous speech. All relevant phenomena introduced by spontaneo...
متن کامل