Dynamic Editing Distance-based Extracting Relevant Information Approach from Social Networks

نویسندگان

چکیده

Online social networks, such as Facebook, Twitter, LinkedIn, etc., have grown exponentially in recent times with a large amount of information. These networks huge volumes data especially structured, textual, and unstructured forms which often led to cyber-crimes like cyber terrorism, bullying, extracting information from these has now become serious challenge order ensure the safety. In this work, we propose new, supervised approach for Information Extraction (IE) Web resources based on remote dynamic editing, called EIDED. Our is part family IE approaches masks extraction articulated around three algorithms: (i) labeling algorithm, (ii) learning inference (iii) an extended edit distance algorithm. proposed able work even presence anomalies tuples missing attributes, multivalued permutation structure web pages. The experimental study, conducted, standard database pages, shows performance our EIDED compared classic distance, respect metrics recall coefficient, precision, F1-measure.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extracting Relevant Information from Samples

The information bottleneck is an information theoretic framework, extending the classical notion of minimal sufficient statistics, that finds concise representations for an ‘input’ random variable that are as relevant as possible for an ‘output’ variable. This framework has been used successfully in various supervised and unsupervised applications. However, its learning theoretic properties and...

متن کامل

Extracting Relevant and Trustworthy Information from Microblogs

Microblogging sites like Twitter have emerged as a popular platform for exchanging real-time information on the Web. Twitter is used by hundreds of millions of users ranging from popular news organizations and celebrities to domain experts in fields like computer science and astrophysics and spammers. As a result, the quality of information posted in Twitter is highly variable and finding the u...

متن کامل

Extracting Information from Multiplex Networks

Multiplex networks are generalized network structures that are able to describe networks in which the same set of nodes are connected by links that have different connotations. Multiplex networks are ubiquitous since they describe social, financial, engineering, and biological networks as well. Extending our ability to analyze complex networks to multiplex network structures increases greatly t...

متن کامل

Extracting User Spatio-Temporal Profiles from Location Based Social Networks

Location Based Social Networks (LBSN) like Twitter or Instagram are a good source for user spatio-temporal behavior. These social network provide a low rate sampling of user’s location information during large intervals of time that can be used to discover complex behaviors, including mobility profiles, points of interest or unusual events. This information is important for different domains li...

متن کامل

Extracting Object Information from Aerial Images: A Map-Based Approach

We have developed a map-based approach that enables us to efficiently extract information about man-made objects, such as buildings, from aerial images. An image is matched with a corresponding map in order to estimate the object information in the image (i.e., presence, location, shape, size, kind, and surroundings). This approach is characterized by using a figure contained in a map as an obj...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computer Network and Information Security

سال: 2022

ISSN: ['2074-9090', '2074-9104']

DOI: https://doi.org/10.5815/ijcnis.2022.06.01