Ensembles of Classifiers for Morphological Galaxy Classification

نویسندگان

  • D. BAZELL
  • DAVID W. AHA
چکیده

We compare the use of three algorithms for performing automated morphological galaxy classiÐcation using a sample of 800 galaxies. ClassiÐers are created using a single training set as well as bootstrap replicates of the training set, producing an ensemble of classiÐers. We use a Naive Bayes classiÐer, a neural network trained with backpropagation, and a decision-tree induction algorithm with pruning. Previous work in the Ðeld has emphasized backpropagation networks and decision trees. The Naive Bayes classiÐer is easy to understand and implement and often works remarkably well on real-world data. For each of these algorithms, we examine the classiÐcation accuracy of individual classiÐers using 10-fold cross validation and of ensembles of classiÐers trained using 25 bootstrap data sets and tested on the same cross-validation test sets. Our results show that (1) the neural network produced the best individual classiÐers (lowest classiÐcation error) for the majority of cases, (2) the ensemble approach signiÐcantly reduced the classiÐcation error for the neural network and the decision-tree classiÐers but not for the Naive Bayes classiÐer, (3) the ensemble approach worked better for decision trees (typical error reduction of 12%È23%) than for the neural network (typical error reduction of 7%È12%), and (4) the relative improvement when using ensembles decreases as the number of output classes increases. While more extensive comparisons are needed (e.g., a variety of data and classiÐers), our work is the Ðrst demonstration that the ensemble approach can signiÐcantly increase the performance of certain automated classiÐcation methods when applied to the domain of morphological galaxy classiÐcation. Subject headings : galaxies : fundamental parameters È methods : data analysis È methods : numerical

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning and Image Analysis for Morphological Galaxy Classification

In this paper we present an experimental study of machine learning and image analysis for performing automated morphological galaxy classification. We have used a neural network, and a locally weighted regression method, and also we implemented homogeneous ensembles of classifiers. The ensemble of neural networks was created using the bagging ensemble method, and manipulation of input features ...

متن کامل

From static to dynamic ensemble of classifiers selection: Application to Arabic handwritten recognition

Arabic handwriting word recognition is a challenging problem due to Arabic’s connected letter forms, consonantal diacritics and rich morphology. One way to improve the recognition rates classification task is to improve the accuracy of individual classifiers; another, is to apply ensemble of classifiers methods. To select the best classifier set from a pool of classifiers, the classifier divers...

متن کامل

Automated Galaxy Morphology: A Fourier Approach

We use automated surface photometry and pattern classification techniques to morphologically classify galaxies. The two-dimensional light distribution of a galaxy is reconstructed using Fourier series fits to azimuthal profiles computed in concentric elliptical annuli centered on the galaxy. Both the phase and amplitude of each Fourier component have been studied as a function of radial bin num...

متن کامل

ADABOOST ENSEMBLE ALGORITHMS FOR BREAST CANCER CLASSIFICATION

With an advance in technologies, different tumor features have been collected for Breast Cancer (BC) diagnosis, processing of dealing with large data set suffers some challenges which include high storage capacity and time require for accessing and processing. The objective of this paper is to classify BC based on the extracted tumor features. To extract useful information and diagnose the tumo...

متن کامل

A High-Performance Model based on Ensembles for Twitter Sentiment Classification

Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people of the world intensity use social networks like Twitter. It supports users to publish tweets to tell what they are thinking about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001