A corpus for studying addressing behavior in multi-party dialogues
نویسندگان
چکیده
This paper describes a multi-modal corpus of hand-annotated meeting dialogues that was designed for studying addressing behavior in face-to-face conversations. The corpus contains annotated dialogue acts, addressees, adjacency pairs and gaze direction. First, we describe the corpus design where we present the annotation schema, annotation tools and annotation process itself. Then, we analyze the reproducibility and stability of the annotation schema.
منابع مشابه
A corpus for studying addressing behaviour in multi-party dialogues
1 2 Abstract This paper describes a multi-modal corpus of hand-annotated meeting 3 dialogues that was designed for studying addressing behaviour in face-to-face con4 versations. The corpus contains annotated dialogue acts, addressees, adjacency pairs 5 and gaze direction. First, we describe the corpus design where we present the 6 meetings collection, annotation scheme and annotation tools. The...
متن کاملTowards Automatic Addressee Identification in Multi-party Dialogues
The paper is about the issue of addressing in multi-party dialogues. Analysis of addressing behavior in face to face meetings results in the identification of several addressing mechanisms. From these we extract several utterance features and features of non-verbal communicative behavior of a speaker, like gaze and gesturing, that are relevant for observers to identify the participants the spea...
متن کاملThe Teams Corpus and Entrainment in Multi-Party Spoken Dialogues
When interacting individuals entrain, they begin to speak more like each other. To support research on entrainment in cooperative multi-party dialogues, we have created a corpus where teams of three or four speakers play two rounds of a cooperative board game. We describe the experimental design and technical infrastructure used to collect our corpus, which consists of audio, video, transcripti...
متن کاملThe PIT Corpus of German Multi-Party Dialogues
The PIT corpus is a German multi-media corpus of multi-party dialogues recorded in a Wizard-of-Oz environment at the University of Ulm. The scenario involves two human dialogue partners interacting with a multi-modal dialogue system in the domain of restaurant selection. In this paper we present the characteristics of the data which was recorded in three sessions resulting in a total of 75 dial...
متن کاملDiscourse Structure and Dialogue Acts in Multiparty Dialogue: the STAC Corpus
This paper describes the STAC resource, a corpus of multi-party chats annotated for discourse structure in the style of SDRT (Asher and Lascarides, 2003; Lascarides and Asher, 2009). The main goal of the STAC project is to study the discourse structure of multi-party dialogues in order to understand the linguistic strategies adopted by interlocutors to achieve their conversational goals, especi...
متن کامل