Modeling Text through Gaussian Processes

Authors

  • Daichi Mochihashi
  • Kazuyoshi Yoshii
  • Masataka Goto
Abstract

This paper proposes a continuous-space text model based on Gaussian processes. By introducing latent coordinates of words over which the Gaussian process is defined, we can encode word correlations directly, leading to a model that performs better than mixture models. Our model would serve as a foundation for more complex text models and also as a statistical visualization of texts.


Similar resources

Towards Indefinite Gaussian Processes

Gaussian processes (GPs) enable probabilistic kernel-machines with remarkable modeling efficacy and GPML toolbox facilitates a widespread use by practitioners and researchers. Many modern applications demand non-metric (dis)similarities. As a result, Mercer’s condition for positive semidefiniteness is violated. Through a simple text categorization example that involves a KL-divergence based ker...


Gaussian processes in Bayesian modeling : Manual for Matlab toolbox

(This is an early version of the manual, which is still subject to some modifications. The text still contains errors in some details, but the big picture is correctly described.)


Twitter-Network Topic Model: A Full Bayesian Treatment for Social Network and Text Modeling

Twitter data is extremely noisy – each tweet is short, unstructured and with informal language, a challenge for current topic modeling. On the other hand, tweets are accompanied by extra information such as authorship, hashtags and the user-follower network. Exploiting this additional information, we propose the Twitter-Network (TN) topic model to jointly model the text and the social network i...


Modeling Tweet Arrival Times using Log-Gaussian Cox Processes

Research on modeling time series text corpora has typically focused on predicting what text will come next, but less well studied is predicting when the next text event will occur. In this paper we address the latter case, framed as modeling continuous inter-arrival times under a log-Gaussian Cox process, a form of inhomogeneous Poisson process which captures the varying rate at which the tweets...


Properties of Spatial Cox Process Models

Probabilistic properties of Cox processes of relevance for statistical modeling and inference are studied. Particularly, we study the most important classes of Cox processes, including log Gaussian Cox processes, shot noise Cox processes, and permanent Cox processes. We consider moment properties and point process operations such as thinning, displacements, and superpositioning. We also discuss...




Publication date: 2013