High-Dimensional Unsupervised Active Learning Method

Authors

  • M. Javadian Department of Computer Engineering, Kermanshah University of Technology. Kermanshah, Iran.
  • S. Bagheri Shouraki Department of Electrical Engineering, Sharif University of Technology, Tehran, Iran.
  • V. Ghasemi Department of Computer Engineering, Kermanshah University of Technology. Kermanshah, Iran.
Abstract:

In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the data points as one-dimensional ink drop patterns, in order to summarize the effects of all data points, and then applies a threshold on the resulting vectors. It is based on an ensemble clustering method which performs one-dimensional density partitioning to produce ensemble of clustering solutions. Then, it assigns a unique prime number to the data points that exist in each partition as their labels. Consequently, a combination is performed by multiplying the labels of every data point in order to produce the absolute labels. The data points with identical absolute labels are fallen into the same cluster. The hierarchical property of the algorithm is intended to cluster complex data by zooming in each already formed cluster to find further sub-clusters. The algorithm is verified using several synthetic and real-world datasets. The results show that the proposed method has a promising performance, compared to some well-known high-dimensional data clustering algorithms.

Upgrade to premium to download articles

Sign up to access the full text

Already have an account?login

similar resources

Active and Unsupervised Learning for A

State-of-the-art speech recognition systems are trained using human transcriptions of speech utterances. In this paper, we describe a method to combine active and unsupervised learning for automatic speech recognition (ASR). The goal is to minimize the human supervision for training acoustic and language models and to maximize the performance given the transcribed and untranscribed data. Active...

full text

Unsupervised Active Learning in Large Domains

Active learning is a powerful approach to an­ alyzing data effectively. We show that the feasibility of active learning depends crucially on the choice of measure with respect to which the query is being optimized. The standard information gain, for example, does not permit an accurate evaluation with a small committee, a representative subset of the model space. We propose a surrogate measure ...

full text

Index-learning of unsupervised low dimensional embeddings

We introduce a simple unsupervised learning method for creating low-dimensional embeddings. Autoencoders work by simultaneously learning how to encode the input to a low dimensional representation and decoding the low dimensional representation to reconstruct the original input—the need to be able to reconstruct the input places a significant limit on the complexity of what can be learnt. The m...

full text

Extended Active Learning Method

Active Learning Method (ALM) is a soft computing method which is used for modeling and control, based on fuzzy logic. Although ALM has shown that it acts well in dynamic environments, its operators cannot support it very well in complex situations due to losing data. Thus ALM can find better membership functions if more appropriate operators be chosen for it. This paper substituted two new oper...

full text

Active Learning

This article has no abstract.

full text

Learning high-dimensional data

Observations from real-world problems are often highdimensional vectors, i.e. made up of many variables. Learning methods, including artificial neural networks, often have difficulties to handle a relatively small number of high-dimensional data. In this paper, we show how concepts gained from our intuition on 2and 3dimensional data can be misleading when used in high-dimensional settings. When...

full text

My Resources

Save resource for easier access later

Save to my library Already added to my library

{@ msg_add @}


Journal title

volume 8  issue 3

pages  391- 407

publication date 2020-07-01

By following a journal you will be notified via email when a new issue of this journal is published.

Hosted on Doprax cloud platform doprax.com

copyright © 2015-2023