Quantifying and Comparing Centrality Measures for Network Individuals as Applied to the Enron Corpus

نویسنده

  • T. KAYE
چکیده

The ever increasing body of social networks creates an opportunity for extensive network analysis and investigations of communications, cliques, and network contributions. In this study, we focus our attention on the Enron email corpus and the corresponding network of employees, attempting to gather information from the email communications. Methods of data reduction on the email corpus were used to create a weighted adjacency matrix in which each i, j-entry corresponds to a weighted count of correspondences from employee i to employee j. While there are many ways to measure importance within a corporate network, of which job title constitutes one such measure, our study focuses on five primary measures: eigenvector centrality, row-sums of a topological overlap matrix, closeness, betweenness, and Opsahl metric. These network analysis metrics were applied to the weighted adjacency matrix to calculate the centrality measures for each individual employee, which were subsequently compiled into ordinally ranked lists of employees for each centrality measure based on decreasing importance. Additionally, the centrality data was visualized using the DataDriven Documents (D3) javascript library, allowing for network visualization in terms of department job title and number of emails sent. In applying the centrality measures to network data, we explore the differences inherent in each measure and work to compare them as well as the corresponding employee importance rankings for each. The metrics in our analysis determined individual importance of employees by applying significant weight to various aspects of the employees’ network roles. By identifying employees that are connected to a large number of individuals and simultaneously have extensive correspondences with those individuals, the Opsahl score combines the other measures, proving to be the most useful metric in exploring Enron’s inner-corporate structure.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Network Analysis with the Enron Email Corpus

We use the Enron email corpus to study relationships in a network by applying six different measures of centrality. Our results came out of an in-semester undergraduate research seminar. The Enron corpus is well suited to statistical analyses at all levels of undergraduate education. Through this article’s focus on centrality, students can explore the dependence of statistical models on initial...

متن کامل

The Influence of Location on Nodes’ Centrality in Location-Based Social Networks

Nowadays, due to the widespread use of social networks, they can be used as a convenient, low-cost, and affordable tool for disseminating all kinds of information and data among the massive users of these networks. Issues such as marketing for new products, informing the public in critical situations, and disseminating medical and technological innovations are topics that have been considered b...

متن کامل

An Email Attachment is Worth a Thousand Words, or Is It?

There is an extensive body of research on Social Network Analysis (SNA) based on the email arhive. The network used in the analysis is generally extracted either by capturing the email communication in From, To, Cc and Bcc email header elds or by the entities contained in the email message. In the latter case, the entities could be, for instance, the bag of words, url’s, names, phones, etc. It ...

متن کامل

Intra-Firm Information Flow: A Content-Structure Perspective

This paper endeavors to bring together two largely disparate areas of research. On one hand, text mining methods treat each document as an independent instance despite the fact that in many text domains, documents are linked and their topics are correlated. For example, web pages of related topics are often connected by hyperlinks and scientific papers from related fields are typically linked b...

متن کامل

Evaluating Seismic Effects on a Water Supply Network and Quantifying Post-Earthquake Recovery

This paper summarises the impact of major earthquakes, 2010–2011, on Christchurch’s water supply network and what recovery measures have been applied, what worked well, what did not and why. A number of issues related to the open nature of the Christchurch water supply network were identified during earthquakes. It was difficult to manage large water supply pressure zones during the post-earthq...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014