Handling Tree-Structured Values in RapidMiner
نویسنده
چکیده
Attribute value types play an important role in mostly every datamining task. Most learners, for instance, are restricted to particular value types. The usage of such learners is just possible after special forms of preprocessing. RapidMiner most commonly distinguishes between nominal and numerical values which are well-known to every RapidMineruser. Although, covering a great fraction of attribute types being present in nowadays datamining tasks, nominal and numerical attribute values are not sufficient for every type of feature. In this work we are focusing on attribute values containing a tree-structure. We are presenting the handling and especially the possibilities to use tree-structured data for modelling. Additionally, we are introducing particular tasks which are offering tree-structured data and might benefit from using those structures for modelling. All methods presented in this paper are contained in the Information Extraction Plugin for RapidMiner.
منابع مشابه
RMonto: Ontological Extension to RapidMiner
We present RMonto, an ontological extension to RapidMiner, that provides possibility of machine learning with formal ontologies. RMonto is an easily extendable framework, currently providing support for unsupervised clustering with kernel methods and (frequent) pattern mining in knowledge bases. One important feature of RMonto is that it enables working directly on structured, relational data. ...
متن کاملDefining and Executing Process Mining Workflows with RapidProM
Today’s Information Systems (ISs) record huge amounts of data about the business processes they support. These data can be used for process mining. This way we can analyse the operational processes within an organization based on facts rather than fiction. Examples of these processes are the handling of a loan application within a bank or the treatment of a patient suffering from colorectal can...
متن کاملComparative Analysis of Premises Valuation Models Using KEEL, RapidMiner, and WEKA
The experiments aimed to compare machine learning algorithms to create models for the valuation of residential premises, implemented in popular data mining systems KEEL, RapidMiner and WEKA, were carried out. Six common methods comprising two neural network algorithms, two decision trees for regression, and linear regression and support vector machine were applied to actual data sets derived fr...
متن کاملTechnische Universität Dortmund Subproject A1 Data Mining for Ubiquitous System Software Information Extraction in Rapidminer
This paper describes the Information Extraction Plugin 1 [3] which allows the use of Information Extraction mechanisms in RapidMiner 2 .
متن کاملUtilizing the Open Movie Database API for Predicting the Review Class of Movies
In this paper, we present our contribution to the Linked Data Mining Challenge 2015. Our approach predicts the review class of movies using external data from the Open Movie Database API (OMDb-API). We select specific features, such as movie ratings and box office, that are very likely to describe the quality of a movie. With RapidMiner we utilize these features and apply three basic classifica...
متن کامل