The Blogosphere at a Glance—Content-Based Structures Made Simple

نویسندگان

  • Olof Görnerup
  • Magnus Boman
چکیده

A network representation based on a basic wordoverlap similarity measure between blogs is introduced. The simplicity of the representation renders it computationally tractable, transparent and insensitive to representation-dependent artifacts. Using Swedish blog data, we demonstrate that the representation, in spite of its simplicity, manages to capture important structural properties of the content in the blogosphere. First, blogs that treat similar subjects are organized in distinct network clusters. Second, the network is hierarchically organized as clusters in turn form higher-order clusters: a compound structure reminiscent of a blog taxonomy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Network Formation in the Political Blogosphere: An Application of Agent Based Simulation and e-Research Tools

Abstract: The political blogosphere has recently been the focus of attention for social network analysis and applications of network and graph theory. In a recent paper, Adamic and Glance (2005) report differences between the linking behavior of politically conservative vs. politically liberal Web bloggers. We construct a simple agent-based network formation model which shows that one such diff...

متن کامل

Leave a Reply: An Analysis of Weblog Comments

Access to weblogs, both through commercial services and in academic studies, is usually limited to the content of the weblog posts. This overlooks an important aspect distinguishing weblogs from other web pages: the ability of weblog readers to respond to posts directly, by posting comments. In this paper we present a large-scale study of weblog comments and their relation to the posts. Using a...

متن کامل

News Detection in the Blogosphere: Two Approaches Based on Structure and Content Analysis

In this paper, we study a subset of the blogosphere created by spinn3r during August and September 2008 containing 20.5 million posts. We propose two approaches to detect and filter important news and events published in blogs. The first involves exploring the structural properties of the post network and the information cascades within it. For the second approach, we use a scalable algorithm t...

متن کامل

Collaborative Sensemaking in the Blogosphere

This paper presents a case study of a class of students coblogging throughout the semester. The students collaboratively made sense of the course material. The class blogosphere became a repository of interpretations, reflections, opinions, monologues and dialogues about the course content. Over the course of the semester there was an aggregation of “sense made” that was “mined” by the students...

متن کامل

Hierarchical Characterization and Generation of Blogosphere Workloads

We present a thorough characterization of the access patterns in blogspace, which comprises a rich interconnected web of blog postings and comments by an increasingly prominent user community that collectively define what has become known as the blogosphere. Our characterization of over 35 million read, write, and management requests spanning a 28-day period is done at three different levels. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011