Extracting Topics From Weblogs Through Frequency Segments

نویسندگان

  • Mizuki Oka
  • Hirotake Abe
  • Kazuhiko Kato
چکیده

In this paper, we present an approach to extracting topics from weblogs by using terms that appear in them. We model a term in terms of frequency segments, i.e., sequential occurrences of the term over time, as the unit of characterization. A notable feature of the model is its approximation of changes in the dynamics of term frequencies; it captures the granularity of frequencies from the very beginning of their occurrence. This approximation also makes a comparison of frequency patterns of terms more effective. We report on the results obtained from weblogs that contained an event of global significance i.e., the London bombings of 2005.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Topics and Innovators Using Topic Diffusion Process in Weblogs

The diffusion process on weblogs has attracted great interest since the early days of weblog studies. We propose a ranking technique which extracts topics and innovators by analyzing that process. Our method identifies URLs of topics and the bloggers who trigger topic diffusion. Our assumption is that the strength of propagation of a topic is determined by the influences of topics and bloggers....

متن کامل

بررسی محتوای یادداشت‌های ارسالی و نظرات وبلاگ‌های فردی و گروهی کتابداری و اطلاع‎رسانی فارسی

The present study employed a content analysis method for analyzing the posts and comments in 85 individual and 31 collective weblogs published in Farsi on the subject of Library and information science. Studies showed that the average monthly postings in collective weblog are more than individual weblogs, while regarding the comments posted the reverse is true. The highest numbers of postings i...

متن کامل

Extracting Domain-Dependent Semantic Orientations of Latent Variables for Sentiment Classification

Sentiment analysis of weblogs is a challenging problem. Most previous work utilized semantic orientations of words or phrases to classify sentiments of weblogs. The problem with this approach is that semantic orientations of words or phrases are investigated without considering the domain of weblogs. Weblogs contain the author’s various opinions about multifaceted topics. Therefore, we have to ...

متن کامل

Weblog Recommendation Using Association Rules

Weblogs are web sites where one or several authors publish their opinions about current events. Even in Spain, there are several thousands, and it is often difficult to find a weblog that meets one's interest. Recommendation services thus become, if not a need, at least a convenience. In this paper we propose automatic extraction of association rules from the results of a survey as a means to r...

متن کامل

BLOGRANK: Ranking on the blogosphere

Although, the Blogosphere is part of the World Wide Web, weblogs present several features that differentiate them from traditional websites: the number of different editors, the multitude of topics, the connectivity among weblogs and bloggers, the update rate, and the importance of time in rating are some of them. Traditional search engines perform poorly on blogs since they do not cover these ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006