Misclassification and Margin based SVM Active Learning Algorithm for Audio Event Detection
نویسندگان
چکیده
Audio event detection has become a hot research due to its wide applications in many fields, such as multimedia retrieval etc., the detection needs large amounts of labeled samples to train the audio event models, but in real life, the labeled samples are expensive to obtain, the shortage of such labeled samples is a big obstacle. Active learning is an efficient way to deal with the problem of insufficient labeled samples. The most popular support vector machines active learning is the margin based sampling (MBS), which is to query the sample closest to the current hyperplane, but when the current hyperplane is far away from the true hyperplane, the sample closest to the current hyperplane is not so informative, querying such samples would have a much slower adjustment of the hyperplane. In order to accelerate the adjustment, this paper proposes the misclassification and margin based sampling (MMBS) active learning algorithm. In order to query more informative samples, MMBS selects samples based on misclassified samples’ KL divergence in the first few iterations, after that, considering the lower misclassification confidence and the outlier problem, it switches to MBS. Experiments show that compared to MBS and representative sampling (RepS), MMBS can get the highest detection performance under the same human annotation workload.
منابع مشابه
A New Formulation for Cost-Sensitive Two Group Support Vector Machine with Multiple Error Rate
Support vector machine (SVM) is a popular classification technique which classifies data using a max-margin separator hyperplane. The normal vector and bias of the mentioned hyperplane is determined by solving a quadratic model implies that SVM training confronts by an optimization problem. Among of the extensions of SVM, cost-sensitive scheme refers to a model with multiple costs which conside...
متن کاملLow-rank Audio Signal Classification Under Soft Margin and Trace Norm Constraints
We propose an algorithm to do speech/non-speech classification based on low-rank matrix representative audio data. Conventionally, the low-rank matrix data can be represented by a vector in high dimensional space. Some learning algorithms are then applied in such a vector space for matrix data classification. Particularly, maximum margin classifiers, such as support vector machine (SVM) etc. ha...
متن کاملSVM Soft Margin Classifiers: Linear Programming versus Quadratic Programming
Support vector machine soft margin classifiers are important learning algorithms for classification problems. They can be stated as convex optimization problems and are suitable for a large data setting. Linear programming SVM classifier is specially efficient for very large size samples. But little is known about its convergence, compared with the well understood quadratic programming SVM clas...
متن کاملIntrusion Detection System based on SVM and Bee Colony
An intrusion detection system (IDS) is an active process or device that analyzes system and network activity for unauthorized entry. Nowadays many intrusion detection systems are developed based on many different machine learning techniques. Some of the models are based on single classifying techniques while some models are based on combining different classifying techniques, such as hybrid or ...
متن کاملRegularized margin-based conditional log-likelihood loss for prototype learning
The classification performance of nearest prototype classifiers largely relies on the prototype learning algorithm. The minimum classification error (MCE) method and the soft nearest prototype classifier (SNPC) method are two important algorithms using misclassification loss. This paper proposes a new prototype learning algorithm based on the conditional log-likelihood loss (CLL), which is base...
متن کامل