Sampling Large-scale Social Networks: Insights from Simulated Networks

Authors

  • Peter Ebbes
  • Zan Huang
  • Arvind Rangaswamy
  • Hari P Thadakamalla
Abstract

We conduct a detailed simulation study to assess how well various sampling techniques recover network characteristics, such as the degree, clustering coefficient, and path length distributions, of several simulated population networks that have the high clustering tendency characteristic of social networks but vary in degree distribution and density. We consider several alternative sampling procedures tailored to the context of social network sampling, including random-node and random-edge sampling, egocentric sampling, and several variations of graph-exploration-based sampling methods (random walk, forest fire, and snowball methods). Our main findings are that, for networks with a Poisson degree distribution, the snowball method is the best overall, while for networks with a power-law degree distribution, random walk is the best when the network is sparse and the forest fire method is the best when the network is dense.
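To make three of the sampling procedures named in the abstract concrete, here is a minimal Python sketch of random-node, snowball, and random-walk sampling on a small toy graph. It is an illustration only, not the authors' simulation design: the ring-lattice population, the function names, the sample size, and the `max_steps` cap are all assumptions introduced here, and the random-edge, egocentric, and forest fire methods are not covered.

```python
import random
from collections import deque


def ring_lattice(n, half_width=2):
    """Hypothetical toy population: each node links to its nearest neighbours on a ring."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for d in range(1, half_width + 1):
            j = (i + d) % n
            adj[i].add(j)
            adj[j].add(i)
    return adj


def random_node_sample(adj, k, rng):
    """Pick k distinct nodes uniformly at random."""
    return set(rng.sample(sorted(adj), k))


def snowball_sample(adj, seed, k):
    """Expand breadth-first from a seed node until k nodes are collected."""
    seen = {seed}
    queue = deque([seed])
    while queue and len(seen) < k:
        node = queue.popleft()
        for nbr in adj[node]:
            if nbr not in seen:
                seen.add(nbr)
                queue.append(nbr)
                if len(seen) == k:
                    break
    return seen


def random_walk_sample(adj, seed, k, rng, max_steps=100_000):
    """Step to a uniformly chosen neighbour and keep every node visited."""
    seen = {seed}
    node = seed
    for _ in range(max_steps):
        if len(seen) >= k or not adj[node]:
            break
        node = rng.choice(sorted(adj[node]))
        seen.add(node)
    return seen


def mean_degree(adj, nodes):
    """Average degree inside the subgraph induced by the sampled nodes."""
    return sum(len(adj[n] & nodes) for n in nodes) / len(nodes)


rng = random.Random(0)  # fixed seed so repeated runs give the same samples
population = ring_lattice(30)
samples = {
    "random node": random_node_sample(population, 10, rng),
    "snowball": snowball_sample(population, 0, 10),
    "random walk": random_walk_sample(population, 0, 10, rng),
}
print("population mean degree:", mean_degree(population, set(population)))
for name, nodes in samples.items():
    print(name, len(nodes), round(mean_degree(population, nodes), 2))
```

Comparing each sample's induced mean degree with the population value gives a rough sense of how a sampling method can distort one network characteristic. A study like the one described would compare full degree, clustering coefficient, and path length distributions on much larger networks.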

Similar resources

Additional Insights Into Problem Definition and Positioning From Social Science; Comment on “Four Challenges That Global Health Networks Face”

Commenting on a recent editorial in this journal which presented four challenges global health networks will have to tackle to be effective, this essay discusses why this type of analysis is important for global health scholars and practitioners, and why it is worth understanding and critically engaging with the complexities behind these challenges. Focusing on the topics of problem definition ...

Sequential Sampling Enhanced Composite Likelihood Approach to Estimation of Social Intercorrelations in Large-Scale Networks

The increasing access to large social network data has generated substantial interest in the marketing community. However, due to its large scale, traditional analysis methods often become inadequate. In this paper, we propose a sequential sampling enhanced composite likelihood approach for efficient estimation of social intercorrelations in large-scale networks using the spatial model. The pro...

Using an Evaluator Fixed Structure Learning Automata in Sampling of Social Networks

Social networks are streaming and diverse and include a wide range of edges; they evolve continuously over time and are formed by the activities among users (such as tweets, emails, etc.), where each activity among users adds an edge to the network graph. Despite their popularity, the dynamicity and large size of most social networks make it difficult or impossible to study the entire networ...

The Association between Use of Virtual Social Networks and Social Isolation among High School Girls in Shahrekord

K. Karimian; M. Parsamehr, Ph.D.; S.A.R. Afshani, Ph.D. This study sought to examine the association between use of virtual social networks and social isolation among high school girls in Shahrekord. The research method was a survey. The statistical population ...

Publication date: 2008