Beam Search Optimized Batch Bayesian Active Learning
نویسندگان
چکیده
Active Learning is an essential method for label-efficient deep learning. As a Bayesian active learning method, by Disagreement (BALD) successfully selects the most representative samples maximizing mutual information between model prediction and parameters. However, when applied to batch acquisition mode, like construction with greedy search, BALD suffers from poor performance, especially noises of near-duplicate data. To address this shortcoming, we propose diverse beam search optimized which explores graph every expanding highest-scored predetermined number. avoid near duplicate branches (very similar beams generated same root samples), undesirable lacking representations in feature space, design self-adapted constraint within candidate beams. The proposed able acquire data that can better represent distribution unlabeled pool, at time, be significantly different existing We observe achieves higher performance than baseline methods on three benchmark datasets.
منابع مشابه
Discriminative Batch Mode Active Learning
Active learning sequentially selects unlabeled instances to label with the goal of reducing the effort needed to learn a good classifier. Most previous studies in active learning have focused on selecting one unlabeled instance to label at one time while retraining in each iteration. Recently a few batch mode active learning approaches have been proposed that select a set of most informative un...
متن کاملBatch Active Learning via Coordinated Matching
We propose a novel batch active learning method that leverages the availability of high-quality and efficient sequential active-learning policies by approximating their behavior when applied for k steps. Specifically, our algorithm uses MonteCarlo simulation to estimate the distribution of unlabeled examples selected by a sequential policy over k steps. The algorithm then selects k examples tha...
متن کاملActive Batch Learning with Tree Ensembles
In a conventional machine learning approach, one uses labeled data to train the model. However, often we have a data set with few labeled instances, and a large number of unlabeled ones. This is called a semi-supervised learning problem. It is well known that often unlabeled data could be used to improve a model. In real world scenarios, labeled data can usually be obtained dynamically. However...
متن کاملJoint Transfer and Batch-mode Active Learning
Active learning and transfer learning are two different methodologies that address the common problem of insufficient labels. Transfer learning addresses this problem by using the knowledge gained from a related and already labeled data source, whereas active learning focuses on selecting a small set of informative samples for manual annotation. Recently, there has been much interest in develop...
متن کاملRNN Based Batch Mode Active Learning Framework
Active Learning has been applied in many real world classification tasks to reduce the amount of labeled data required for training a classifier. However most of the existing active learning strategies select only a single sample for labeling by the oracle in every iteration. This results in retraining the classifier after each sample is added which is quite computationally expensive. Also many...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i5.25751