Information-Theoretic Representation Learning for Positive-Unlabeled Classification
نویسندگان
چکیده
Recent advances in weakly supervised classification allow us to train a classifier only from positive and unlabeled (PU) data. However, existing PU methods typically require an accurate estimate of the class-prior probability, which is critical bottleneck particularly for high-dimensional This problem has been commonly addressed by applying principal component analysis advance, but such unsupervised dimension reduction can collapse underlying class structure. In this paper, we propose novel representation learning method data based on information-maximization principle. Our does not estimation thus be used as preprocessing classification. Through experiments, demonstrate that our combined with deep neural networks highly improves accuracy estimation, leading state-of-the-art performance.
منابع مشابه
Positive Unlabeled Learning for Data Stream Classification
Learning from positive and unlabeled examples (PU learning) has been investigated in recent years as an alternative learning model for dealing with situations where negative training examples are not available. It has many real world applications, but it has yet to be applied in the data stream environment where it is highly possible that only a small set of positive data and no negative data i...
متن کاملEnsemble Based Positive Unlabeled Learning for Time Series Classification
Many real-world applications in time series classification fall into the class of positive and unlabeled (PU) learning. Furthermore, in many of these applications, not only are the negative examples absent, the positive examples available for learning can also be rather limited. As such, several PU learning algorithms for time series classification have recently been developed to learn from a s...
متن کاملPositive Unlabeled Leaning for Time Series Classification
In many real-world applications of the time series classification problem, not only could the negative training instances be missing, the number of positive instances available for learning may also be rather limited. This has motivated the development of new classification algorithms that can learn from a small set P of labeled seed positive instances augmented with a set U of unlabeled instan...
متن کاملPrototype Based Classification Using Information Theoretic Learning
In this article we extend the (recently published) unsupervised information theoretic vector quantization approach based on the Cauchy–Schwarz-divergence for matching data and prototype densities to supervised learning and classification. In particular, first we generalize the unsupervised method to more general metrics instead of the Euclidean, as it was used in the original algorithm. Thereaf...
متن کاملPositive-Unlabeled Learning for Pupylation Sites Prediction
Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neural Computation
سال: 2021
ISSN: ['0899-7667', '1530-888X']
DOI: https://doi.org/10.1162/neco_a_01337