Graph-based Analytics for Decentralized Online Social Networks
نویسنده
چکیده
Decentralized Online Social Networks (DOSNs) have been introduced as a privacy preserving alternative to the existing online social networks. DOSNs remove the dependency on a centralized provider and operate as distributed information management platforms. The main objective behind decentralization is to preserve user privacy in both shared content and communication. Current efforts of providing DOSNs are mainly focused on designing the required building blocks for managing the distributed network and supporting the social services (e.g., search for topics or people, content delivery, etc.). However, there is a lack of reliable techniques for enabling complex analytical services (e.g., spam detection, identity validation, recommendation systems, etc.) that comply with the decentralization requirements of DOSNs. In particular, there is a need for decentralized data analytic techniques and machine learning (ML) algorithms that can successfully run on top of DOSNs. In this thesis, we empower decentralized analytics for DOSNs through a set of novel algorithms. Our algorithms allow decentralized analytics to effectively work on top of fully decentralized topology, when the data is fully distributed and nodes have access to their local knowledge only. Additionally, our algorithms follow unsupervised ML paradigm, thus removing the need for collecting labeled training data that potentially puts user privacy at risk. Furthermore, our algorithms and methods are able to extract and exploit the latent patterns in the social user interaction networks and effectively combine them with the shared content, yielding significant improvements for the complex analytic tasks. We argue that, community identification is at the core of the learning and analytical services provided for DOSNs. We show in this thesis that knowledge on community structures and information dissemination patterns, embedded in the topology of social networks has a potential to greatly enhance data analytic insights and improve results. At the heart of this thesis lies a community detection technique that successfully extracts communities in a completely decentralized manner. In particular, we show that multiple complex analytic tasks, like spam detection and identity validation, can be successfully tackled by harvesting the information from the social network structure. This is achieved by using decentralized community detection algorithm which acts as the main building block for the community-aware learning paradigm that we lay out in this thesis. To the best of our knowledge, this thesis represents the first attempt to bring complex analytical services, which require decentralized iterative computation over distributed data, to the domain of DOSNs. The experimental evaluation of our proposed algorithms using real-world datasets confirms the ability of our solutions to generate efficient ML models in massively parallel and highly scalable manner. Furthermore, our algorithms preserve user privacy and achieve better performance compared to the existing centralized and global approaches.
منابع مشابه
Providing a Link Prediction Model based on Structural and Homophily Similarity in Social Networks
In recent years, with the growing number of online social networks, these networks have become one of the best markets for advertising and commerce, so studying these networks is very important. Most online social networks are growing and changing with new communications (new edges). Forecasting new edges in online social networks can give us a better understanding of the growth of these networ...
متن کاملiSocial: Decentralized Online Social Networks
What is the Destiny of Social Networking Sites Data After the Data Owners Pass Away? 12 Competition Between Global and Local Online Social Networks 14 Users Key Locations in Online Social Networks: Identification and Applications 16 Building a Spam Free Social Network 18 Large Scale Topic Detection using Node-Cut Partitioning on Dense Weighted Graphs 20 Individualism and Collectivism in Social ...
متن کاملSocial Networks Data Mining Using Visual Analytics
Internet-based social networks are facilities (typically web sites) where people can form online communities, connect to each other and share information. This paper explores the area of applying visual analytics to represent data and underlying relationships in social networks – more specifically, on the Twitter micro-blogging service. Networks of this kind can be treated as graphs, where each...
متن کاملA Novel Approach for Detecting Relationships in Social Networks Using Cellular Automata Based Graph Coloring
All the social networks can be modeled as a graph, where each roles as vertex and each relationroles as an edge. The graph can be show as G = [V;E], where V is the set of vertices and E is theset of edges. All social networks can be segmented to K groups, where there are members in eachgroup with same features. In each group each person knows other individuals and is in touch ...
متن کاملSocial Network Data Analytics Social Network Data Analytics
The advent of online social networks has been one of the most exciting events in this decade. Many popular online social networks such as Twitter, LinkedIn, and Facebook have become increasingly popular. In addition, a number of multimedia networks such as Flickr have also seen an increasing level of popularity in recent years. Many such social networks are extremely rich in content, and they t...
متن کامل