test semi

Face recognition based on LDA in manifold subspace

Journal: :EAI Endorsed Trans. Context-aware Syst. & Appl. 2016

Hung Phuoc Truong Tue-Minh Dinh Vo Thai Hoang Le

Although LDA has many successes in dimensionality reduction and data separation, it also has disadvantages, especially the small sample size problem in training data because the "within-class scatter" matrix may not be accurately estimated. Moreover, this algorithm can only operate correctly with labeled data in supervised learning. In practice, data collection is very huge and labeling data re...

متن کامل

Improved Arabic Dialect Classification with Social Media Data

2015

Fei Huang

Arabic dialect classification has been an important and challenging problem for Arabic language processing, especially for social media text analysis and machine translation. In this paper we propose an approach to improving Arabic dialect classification with semi-supervised learning: multiple classifiers are trained with weakly supervised, strongly supervised, and unsupervised data. Their comb...

متن کامل

Semi-supervised Incremental Learning of Manipulative Tasks

2007

Zhe Li Sven Wachsmuth Jannik Fritsch Gerhard Sagerer

For a social robot, the ability of learning tasks via human demonstration is very crucial. But most current approaches suffer from either the demanding of the huge amount of labeled training data, or the limited recognition cabability caused by very domain-specific modeling. This paper puts forward a semi-supervised incremental strategy for the robot to learn the manipulative tasks performed by...

متن کامل

Semi-supervised Learning of Facial Attributes in Video

2010

Neva Cherniavsky Ivan Laptev Josef Sivic Andrew Zisserman

In this work we investigate a weakly-supervised approach to learning facial attributes of humans in video. Given a small set of images labeled with attributes and a much larger unlabeled set of video tracks, we train a classifier to recognize these attributes in video data. We make two contributions. First, we show that training on video data improves classification performance over training on...

متن کامل

GS4: Generating Synthetic Samples for Semi-Supervised Nearest Neighbor Classification

2014

Panagiotis Moutafis Ioannis A. Kakadiaris

In this paper, we propose a method to improve nearest neighbor classification accuracy under a semi-supervised setting. We call our approach GS4 (i.e., Generating Synthetic Samples Semi-Supervised). Existing self-training approaches classify unlabeled samples by exploiting local information. These samples are then incorporated into the training set of labeled data. However, errors are propagate...

متن کامل

Semi-supervised Learning from Unbalanced Labeled Data - An Improvement

2004

Te Ming Huang Vojislav Kecman

We present a great improvement while performing semi-supervised learning tasks from training data sets when only a small fraction of the data pairs is labeled. In particular, we propose a novel decision strategy based on normalized model outputs. We give the explanation why the normalization step helps. The paper compares performances of two popular semi-supervised approaches (Consistency Metho...

متن کامل

Determining Term Subjectivity and Term Orientation for Opinion Mining

2006

Andrea Esuli Fabrizio Sebastiani

Opinion mining is a recent subdiscipline of computational linguistics which is concerned not with the topic a document is about, but with the opinion it expresses. To aid the extraction of opinions from text, recent work has tackled the issue of determining the orientation of “subjective” terms contained in text, i.e. deciding whether a term that carries opinionated content has a positive or a ...

متن کامل

Semi-Supervised Training of Language Model on Spanish Conversational Telephone Speech Data

2016

Ekaterina Egorova Jordi Luque Serrano

This work addresses one of the common issues arising when building a speech recognition system within a low-resourced scenario adapting the language model on unlabeled audio data. The proposed methodology makes use of such data by means of semisupervised learning. Whilst it has been proven that adding system-generated labeled data for acoustic modeling yields good results, the benefits of addin...

متن کامل

Combination of multilingual and semi-supervised training for under-resourced languages

2014

Frantisek Grézl Martin Karafiát

Multilingual training of neural networks for ASR is widely studied these days. It has been shown that languages with little training data can benefit largely from the multilingual resources for training. The use of unlabeled data for the neural network training in semi-supervised manner has also improved the ASR system performance. Here, the combination of both methods is presented. First, mult...

متن کامل

Semi-lagrangian Transport Algorithms for the Shallow Water Equations in Spherical Geometry

2007

P. H. Worley N. M. Nachtigal

Global atmospheric circulation models (GCM) typically have three primary algorithmic components: columnar physics, spectral tran-form, and semi-Lagrangian transport. In this study, several varients of a SLT method are studied in the context of test cases for the shallow water equations in spherical geometry. A grid point formulation is used with implicit, semi-implicit or explicit time integrat...

متن کامل