Perform Three Data Mining Tasks with Crowdsourcing Process
Authors
Abstract:
For data mining studies, because of the complexity of doing feature selection process in tasks by hand, we need to send some of labeling to the workers with crowdsourcing activities. The process of outsourcing data mining tasks to users is often handled by software systems without enough knowledge of the age or geography of the users' residence. Uncertainty about the performance of virtual users in crowdsourcing reduces the accuracy of the received information. In this article, we propose to gather a number of people in a university and apply them for the small gathering tasks by using incentive methods. Increased accuracy in announcing results due to physical presence, high speed of obtaining high precision results at designated time, appropriate education of participants in the activity, and native implementation plan are characteristics of this study. In this study, three classifications are used to classify word embedding as well as the process of motivation to provide standard data by the crowd source. In the first task we use convolutional neural network, for doing classification in six classes: USAGE, TOPIC, COMPARE, MODEL-FEATURE, RESULT and PART-WHOLE. This article extracts the data from the abstract of 400 scientific articles and it is a total of 835 relations. One hundred of these abstracts have been selected by the crowdsourcing. Classification results in this article have been done with a slight improvement in accuracy. In the second task, classification is done by crowdsourcing on the result of a combine word embedding system. Glove and Word2vec are two of the most popular word embedding algorithms. These two algorithms have been used separately in different machine learning applications. In ad innovation was created word embedding with using a combination of glove and word2vec. In this study, we computed the classification results on a combination of vocabulary vectors with using of 450 abstract relation data (100 crowd source datasets with 350 standards). The results of the implementation of the classification algorithm give us performance improvement. The third task was to completing the process of installing the mobile applications. One of the problems with data mining is the lack of standard data. The high cost of data preparation on the one hand and the lack of a suitable population for specialized tasks have led to the design of software for doing these processes. But these methods have no incentive to install the software and often fail due to the lack of crowd workers or more precisely, lack of specialist workers. In another process, we have installed the app into this collection of specialist science people. This smart population has reduced costs and time for doing data mining processes in the future. People can make the move towards the regulation of social activity at a given time and place by the capacity created by ICT to work on a large scale. Creating a social campaign will benefit users on mobile social networks. Scientists can benefit from the capacity created by crowdsourcing in ICT and make movement to regularize social activities at specific time and place. Creating social campaign will help to form new society. The smart population created in this article was an innovation or startup in the field of event organizing that used the attention of people to the day issues as well as the importance of watching competition. This paper uses the population power to perform preparing data mining works.
similar resources
Process Mining: Process Management with Data Mining
In this paper we propose new methods for ordering the Web pages returned from search engines. Given a few search keywords, nowadays most search engines could retrieve more than a few thousand Web pages. The problem is how to order the retrieved Web pages and then to present the most relevant Web pages first. We propose new factors to allow relevant Web pages to be ranked higher. The factors inc...
full textMining process models with prime invisible tasks
Process mining is helpful for deploying new business processes as well as auditing, analyzing and improving the already enacted ones. Most of the existing process mining algorithms have problems in dealing with invisible tasks, i.e., such tasks that exist in a process model but not in its event log. This is a problem since invisible tasks are mainly used for routing purpose but must not be igno...
full textUsing Data Mining and Three Decision Tree Algorithms to Optimize the Repair and Maintenance Process
The purpose of this research is to predict the failure of devices using a data mining tool. For this purpose, at the outset, an appropriate database consists of 392 records of ongoing failures in a pharmaceutical company in 1394, in the next step, by analyzing 9 characteristics and type of failure as a database class, analyzes have been used. In this regard, three decision tree algorithms have ...
full textCrowdsourcing Tasks within Linked Data Management
Many aspects of Linked Data management – including exposing legacy data and applications to semantic formats, designing vocabularies to describe RDF data, identifying links between entities, query processing, and data curation – are necessarily tackled through the combination of human effort with algorithmic techniques. In the literature on traditional data management the theoretical and techni...
full textBrief survey of crowdsourcing for data mining
Crowdsourcing allows large-scale and flexible invocation of human input for data gathering and analysis, which introduces a new paradigm of data mining process. Traditional data mining methods often require the experts in analytic domains to annotate the data. However, it is expensive and usually takes a long time. Crowdsourcing enables the use of heterogeneous background knowledge from volunte...
full textSafely Delegating Data Mining Tasks
Data mining is playing an important role in decision making for business activities and governmental administration. Since many organizations or their divisions do not possess the in-house expertise and infrastructure for data mining, it is beneficial to delegate data mining tasks to external service providers. However, the organizations or divisions may lose of private information during the d...
full textMy Resources
Journal title
volume 19 issue 1
pages 0- 0
publication date 2022-05
By following a journal you will be notified via email when a new issue of this journal is published.
No Keywords
Hosted on Doprax cloud platform doprax.com
copyright © 2015-2023