Who Talks to Whom: Modeling Latent Structures in Dialogue Documents

نویسندگان

  • Bailu Ding
  • Jiang-Ming Yang
  • Chong Wang
  • Rui Cai
  • Zhiwei Li
  • Lei Zhang
چکیده

Bailu Ding, Jiang-Ming Yang, Chong Wang, Rui Cai, Zhiwei Li, Lei Zhang Fudan University, [email protected] Microsoft Research Asia, {jmyang, ruicai, zli, leizhang}@microsoft.com Princeton University, [email protected] 1 Latent Structures of Dialogue Documents Various forms of data that consist of sequential messages abound in social networks, such as citations, mail lists, chats, and forum discussions. A sequence of messages can be seen as a dialogue. We call the correlation among messages, namely the ’who-talks-to-whom’ relationship, the dialogue structure. Discovering dialogue structure is important for further studies on user behaviors. In a dialogue, we call the first message the root message, and the member who posts the root message the dialogue starter. People interested in the root message ’talk’ to the dialogue starter by posting new messages. Figure 1 (left) shows a dialogue taken from a discussion forum Slashdot(http://slashdot.org). Once we discover the tree-structured reply relationship in the dialogue, as shown in the upper-right part of Figure 1, we can construct the ’who-talks-to-whom’ network as shown in the lower-right part of Figure 1. This network can be used in various applications, such as community identification, expert ranking and friend recommendation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structural Topic Model for Latent Topical Structure Analysis

Topic models have been successfully applied to many document analysis tasks to discover topics embedded in text. However, existing topic models generally cannot capture the latent topical structures in documents. Since languages are intrinsically cohesive and coherent, modeling and discovering latent topical transition structures within documents would be beneficial for many text analysis tasks...

متن کامل

Locally discriminative topic modeling

Topic modeling is a powerful tool for discovering the underlying or hidden structure in text corpora. Typical algorithms for topic modeling include probabilistic latent semantic analysis (PLSA) and latent Dirichlet allocation (LDA). Despite their different inspirations, both approaches are instances of generative model, whereas the discriminative structure of the documents is ignored. In this p...

متن کامل

"I Object!" Modeling Latent Pragmatic Effects in Courtroom Dialogues

Understanding the actionable outcomes of a dialogue requires effectively modeling situational roles of dialogue participants, the structure of the dialogue and the relevance of each utterance to an eventual action. We develop a latent-variable model that can capture these notions and apply it in the context of courtroom dialogues, in which the objection speech act is used as binary supervision ...

متن کامل

Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation

Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...

متن کامل

An Investigation on The Effectiveness of Employing Topic Modeling Techniques to Provide Topic Awareness For Conversational Agents

The idea behind this proposal is to investigate the possibility of utilizing NLP tools, statistical topic modeling techniques and freely available online resources to propose a system able to provide dialogue contribution suggestions which are relevant to the context, yet out of the main activity of the dialogue (i.e. off-activity talk). The aim is to evaluate the effects of a tool that automat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009