Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining
نویسندگان
چکیده مقاله:
Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consuming for a human to perform, due to the large number of technologies and due to the fact that the technologies are constantly changing. In this paper, a method is provided for the automatic discovery of the network of associations of Industrial IT technologies as a networked graph, using data mining and web-mining algorithms. The proposed process is an approach to form a dynamic weighted graph of technologies. A numeric value is calculated as similarity between technologies. A combination of data mining and web mining techniques have been used to achieve the results. The main objective is to invent a computerized reproducible method so that by the help of it, technological relation can be extracted and updated constantly. This method consists of six phases, of which four phases are performed automatically by novel algorithms introduced in this paper. The analysis of more than 8 million terms suggests that the proposed method provides acceptable results. This paper also provided recommendations to improve the suggested method.
منابع مشابه
automatic discovery of technology networks for industrial-scale r&d it projects via data mining
industrial-scale r&d it projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. when planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. discovery of this network of technologies is time con...
متن کاملData Mining Industrial Applications
Novel, advanced sensors, dynamic development of information technologies as well as modern high-performance computers applied in different fields of human activity result in large amount of data. Consequently, these data, grouped in the data sets are both large and complex. The complexity come from the several mutually excluding factors like acquisition with different sensors at various times, ...
متن کاملThe Challenge of Large-Scale IT Projects
The trend in the world of Information Technology (IT) is getting increasingly large and difficult projects rather than smaller and easier. However, the data on large-scale IT project success rates provide cause for concern. This paper seeks to answer why large-scale IT projects are different from and more difficult than other typical engineering projects. Drawing on the industrial experience, a...
متن کاملKnowledge discOvery And daTa minINg inteGrated (KOATING) Moderators for collaborative projects
A major issue in any multidiscipline collaborative project is how to best share and simultaneously exploit different types of expertise, without duplicating efforts or inadvertently causing conflicts or loss of efficiency through misunderstandings of individual or shared goals. Moderators are knowledge based systems designed to support collaborative teams by raising awareness of potential probl...
متن کاملEnhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
متن کاملWeb Mining Techniques for Automatic Discovery of Medical Knowledge
In this paper, we propose an automatic and autonomous methodology to discover taxonomies of terms from the Web and represent retrieved web documents into a meaningful organization. Moreover, a new method for lexicalizations and synonyms discovery is also introduced. The obtained results can be very useful for easing the access to web resources of any medical domain or creating ontological repre...
متن کاملمنابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ذخیره در منابع من قبلا به منابع من ذحیره شده{@ msg_add @}
عنوان ژورنال
دوره 47 شماره 1
صفحات 17- 22
تاریخ انتشار 2015-09-23
با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.
میزبانی شده توسط پلتفرم ابری doprax.com
copyright © 2015-2023