Knowledge Discovery from Disparate Earth Data Sources
نویسندگان
چکیده
Advances in data collection and data storage technologies have made it possible to acquire massive Earth science data sets. In principle, these data sets could be transformed into great scientific discoveries. However, due to the heterogeneous nature and to the scale of the available Earth science data, traditional analysis methods are challenged and much of these data remain largely unexplored. We have developed a general strategy for transforming machine learning algorithms for learning from a single data source into algorithms for learning from disparate, semantically heterogeneous data sources. We believe that our strategy could be adapted and used for data exploration and knowledge discovery from Earth science data.
منابع مشابه
Context mediation among knowledge discovery components
Acknowledgements I would like to express my gratitude to my supervisors Prof. John Hughes and Prof. David Bell for their guidance. The thesis has benefited from a number of research projects and the input provided by a range of individuals. Initial work was conducted at the University of Abertay, Scotland under the heartening guidance of Dr. Colin Miller and Dr. Louis Nathanson. During my stay ...
متن کاملKD in FM: Knowledge Discovery in Facilities Management Databases
The KD in FM project aims to investigate how Knowledge Discovery in Databases (KDD), and particularly data mining, techniques can be applied to the distributed, heterogeneous and autonomous data sources found in the Facilities Management (FM) environment. The problems associated with multiple disparate databases are examined as is recent research in heterogeneous database mining. Finally, we de...
متن کاملA New Generation of Digital Library to Support Drug Discovery Research
The recent explosion of publicly available biomedical information gave drug discovery researchers unprecedented access to a wide variety of online repositories, but the sheer volume of the available data diminishes its utility. This is compounded by the fact that these repositories suffer from a silo effect: data from one cannot be easily linked to data in another. This is true for both publicl...
متن کاملSTORM - A Novel Information Fusion and Cluster Interpretation Technique
Abstract. Analysis of data without labels is commonly subject to scrutiny by unsupervised machine learning techniques. Such techniques provide more meaningful representations, useful for better understanding of a problem at hand, than by looking only at the data itself. Although abundant expert knowledge exists in many areas where unlabelled data is examined, such knowledge is rarely incorporat...
متن کاملSupporting Drug Discovery Research through Knowledge Modeling and Integration
This paper describes a knowledge platform that is designed to support drug discovery researchers in pharmaceutical companies. The core of this platform is a knowledge model that provides a semantically integrated knowledge space for the researchers to easily learn and explore various aspects of biological data that originate from multiple disparate sources. By using domain-specific functional r...
متن کامل