Preliminary Analysis of Hepatitis Data by Beam-wise Graph-Based Induction

Authors

  • Takashi Matsuda
  • Hiroshi Motoda
  • Tetsuya Yoshida
  • Takashi Washio
Abstract

Graph-Based Induction (GBI) extracts typical patterns from graph data by stepwise pair expansion (pairwise chunking). Its greedy search strategy makes it very efficient, but the search is incomplete. We improve its search capability without adding much computational cost by 1) incorporating a beam search, 2) using a different evaluation function to extract patterns that are more discriminatory than those that simply occur frequently, and 3) adopting canonical labeling to enumerate identical patterns accurately. The new algorithm, called Beam-wise GBI (B-GBI for short), was tested on the promoter dataset from the UCI repository and successfully extracted discriminatory substructures. The effect of beam width on the number of discovered attributes and on predictive accuracy was evaluated. The best result obtained by this approach was better than the best previously known result. B-GBI was then applied to a real-world dataset, the Hepatitis dataset provided by Chiba University. Our very preliminary results indicate that B-GBI can handle graphs with a few thousand nodes and extract discriminatory patterns.
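The pairwise chunking and beam search described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names (`count_pairs`, `chunk`, `beam_gbi`), the graph representation (integer node ids, string labels, directed edges), and the toy graph are all assumptions made for illustration. The sketch ranks pairs by raw frequency, whereas B-GBI uses a discriminatory evaluation function, and it omits canonical labeling entirely.

```python
def count_pairs(labels, edges):
    """Count how often each (label_u, label_v) pair appears on an edge.
    Overlapping occurrences are all counted (a simplification)."""
    counts = {}
    for u, v in sorted(edges):
        pair = (labels[u], labels[v])
        counts[pair] = counts.get(pair, 0) + 1
    return counts

def chunk(labels, edges, pair):
    """Rewrite the graph: merge non-overlapping occurrences of `pair`
    into a single node carrying a combined label."""
    new_labels = dict(labels)
    absorbed = {}   # absorbed node id -> surviving node id
    used = set()    # nodes already consumed by a chunk in this pass
    for u, v in sorted(edges):
        if u == v or u in used or v in used:
            continue
        if (labels[u], labels[v]) == pair:
            used.update((u, v))
            absorbed[v] = u
            new_labels[u] = f"({pair[0]}-{pair[1]})"
            del new_labels[v]
    new_edges = set()
    for u, v in edges:
        a, b = absorbed.get(u, u), absorbed.get(v, v)
        if a != b:  # drop the edge that was folded into the chunk
            new_edges.add((a, b))
    return new_labels, new_edges

def beam_gbi(labels, edges, beam_width=2, steps=2):
    """Hypothetical beam-wise chunking loop: each state in the beam is
    expanded by its best pairs, then the best `beam_width` results survive.
    Frequency stands in for the evaluation function (an assumption)."""
    beam = [(labels, edges)]
    found = {}  # pattern label -> best frequency seen
    for _ in range(steps):
        candidates = []
        for lab, edg in beam:
            ranked = sorted(count_pairs(lab, edg).items(),
                            key=lambda kv: (-kv[1], kv[0]))
            for pair, freq in ranked[:beam_width]:
                if freq < 2:  # a pattern seen once is not "typical"
                    continue
                pattern = f"({pair[0]}-{pair[1]})"
                found[pattern] = max(found.get(pattern, 0), freq)
                candidates.append((freq, pattern, chunk(lab, edg, pair)))
        candidates.sort(key=lambda c: (-c[0], c[1]))
        beam = [state for _, _, state in candidates[:beam_width]]
        if not beam:
            break
    return found

# Toy graph (made up for illustration): the chain A->B->C->A->B->C.
toy_labels = {0: "A", 1: "B", 2: "C", 3: "A", 4: "B", 5: "C"}
toy_edges = {(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)}
for pattern, freq in sorted(beam_gbi(toy_labels, toy_edges).items()):
    print(pattern, freq)
```

With a beam width of 1 the loop reduces to plain greedy chunking and follows only the A-B branch. With a beam width of 2 it also explores the B-C branch, and the same three-node chain is then reported under two different labels, `((A-B)-C)` and `(A-(B-C))`. Duplicates of this kind are what canonical labeling is meant to detect; the sketch makes no attempt to do so.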

Similar resources

Mining Patterns from Structured Data by Beam-Wise Graph-Based Induction

Graph-Based Induction (GBI) extracts typical patterns from graph data by stepwise pair expansion (pairwise chunking). It is very efficient because of its greedy search strategy but at the same time it suffers from the incompleteness of search. Improvement is made on its search capability without imposing much computational complexity by 1) incorporating a beam search, 2) using a different evalu...

Analysis of Hepatitis Dataset by Decision Tree Graph-Based Induction

We analyzed the hepatitis data by Decision Tree Graph-Based Induction (DT-GBI), which constructs a decision tree for graph-structured data while simultaneously constructing attributes for classification. An attribute at each node in the decision tree is a discriminative pattern (subgraph) in the input graph, and extracted by Graph-Based Induction (GBI). We conducted four kinds of experiments usin...

Constructing a Decision Tree for Graph-Structured Data and its Applications

A machine learning technique called Graph-Based Induction (GBI) efficiently extracts typical patterns from graph-structured data by stepwise pair expansion (pairwise chunking). It is very efficient because of its greedy search. Meanwhile, a decision tree is an effective means of data classification from which rules that are easy to understand can be obtained. However, a decision tree could not ...

Analysis of Resting-State fMRI Topological Graph Theory Properties in Methamphetamine Drug Users Applying Box-Counting Fractal Dimension

Introduction: Graph theoretical analysis of functional Magnetic Resonance Imaging (fMRI) data has provided new measures of mapping human brain in vivo. Of all methods to measure the functional connectivity between regions, Linear Correlation (LC) calculation of activity time series of the brain regions as a linear measure is considered the most ubiquitous one. The strength of the dependence obl...

Backward Induced Probability Models

This paper describes how to specify probability models for data analysis via a backward induction procedure. The new approach yields coherent, prior-free uncertainty assessment. The backward induction approach is first demonstrated on two familiar models, the Bernoulli distribution and the Gaussian distribution, to compare the resulting specifications to their standard counterparts arising as ...


Publication date: 2002