Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining

Authors

  • H. Veisi Assistant Professor, Department of New Sciences & Technology, Tehran University, Tehran, Iran
  • R. Rahmani Assistant Professor, Department of New Sciences & Technology, Tehran University, Tehran, Iran
  • S. Azimi Faculty Member, Department of New Sciences & Technology, Tehran University, Tehran, Iran
Abstract:

Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consuming for a human to perform, due to the large number of technologies and due to the fact that the technologies are constantly changing. In this paper, a method is provided for the automatic discovery of the network of associations of Industrial IT technologies as a networked graph, using data mining and web-mining algorithms. The proposed process is an approach to form a dynamic weighted graph of technologies. A numeric value is calculated as similarity between technologies.  A combination of data mining and web mining techniques have been used to achieve the results. The main objective is to invent a computerized reproducible method so that by the help of it, technological relation can be extracted and updated constantly. This method consists of six phases, of which four phases are performed automatically by novel algorithms introduced in this paper. The analysis of more than 8 million terms suggests that the proposed method provides acceptable results. This paper also provided recommendations to improve the suggested method.

Download for Free

Sign up for free to access the full text

Already have an account?login

similar resources

automatic discovery of technology networks for industrial-scale r&d it projects via data mining

industrial-scale r&d it projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. when planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. discovery of this network of technologies is time con...

full text

Data Mining Industrial Applications

Novel, advanced sensors, dynamic development of information technologies as well as modern high-performance computers applied in different fields of human activity result in large amount of data. Consequently, these data, grouped in the data sets are both large and complex. The complexity come from the several mutually excluding factors like acquisition with different sensors at various times, ...

full text

The Challenge of Large-Scale IT Projects

The trend in the world of Information Technology (IT) is getting increasingly large and difficult projects rather than smaller and easier. However, the data on large-scale IT project success rates provide cause for concern. This paper seeks to answer why large-scale IT projects are different from and more difficult than other typical engineering projects. Drawing on the industrial experience, a...

full text

Knowledge discOvery And daTa minINg inteGrated (KOATING) Moderators for collaborative projects

A major issue in any multidiscipline collaborative project is how to best share and simultaneously exploit different types of expertise, without duplicating efforts or inadvertently causing conflicts or loss of efficiency through misunderstandings of individual or shared goals. Moderators are knowledge based systems designed to support collaborative teams by raising awareness of potential probl...

full text

Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining

This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...

full text

Web Mining Techniques for Automatic Discovery of Medical Knowledge

In this paper, we propose an automatic and autonomous methodology to discover taxonomies of terms from the Web and represent retrieved web documents into a meaningful organization. Moreover, a new method for lexicalizations and synonyms discovery is also introduced. The obtained results can be very useful for easing the access to web resources of any medical domain or creating ontological repre...

full text

My Resources

Save resource for easier access later

Save to my library Already added to my library

{@ msg_add @}


Journal title

volume 47  issue 1

pages  17- 22

publication date 2015-09-23

By following a journal you will be notified via email when a new issue of this journal is published.

Hosted on Doprax cloud platform doprax.com

copyright © 2015-2023