نتایج جستجو برای: test semi

تعداد نتایج: 943546  

2011
Margaret Mitchell Aaron Dunlop Brian Roark

In this paper, we argue that ordering prenominal modifiers – typically pursued as a supervised modeling task – is particularly wellsuited to semi-supervised approaches. By relying on automatic parses to extract noun phrases, we can scale up the training data by orders of magnitude. This minimizes the predominant issue of data sparsity that has informed most previous approaches. We compare sever...

2010
Shin MATSUSHIMA

We analyze the combination of online learning and semisupervised learning. Online learning is an approach to supervised machine learning, which is effective at processing a large amount of training data. Semi-supervised learning is employed when a large amount of unlabeled data is available. While both setting occurs frequently in real applications of machine learning, thus far, their theoretic...

2007
Rebecca Watson Ted Briscoe John A. Carroll

We compare the accuracy of a statistical parse ranking model trained from a fully-annotated portion of the Susanne treebank with one trained from unlabeled partially-bracketed sentences derived from this treebank and from the Penn Treebank. We demonstrate that confidence-based semi-supervised techniques similar to self-training outperform expectation maximization when both are constrained by pa...

2012
Zhiyao Duan Gautham J. Mysore Paris Smaragdis

Non-negative spectrogram factorization algorithms such as probabilistic latent component analysis (PLCA) have been shown to be quite powerful for source separation. When training data for all of the sources are available, it is trivial to learn their dictionaries beforehand and perform supervised source separation in an online fashion. However, in many real-world scenarios (e.g. speech denoisin...

Journal: :CoRR 2018
Antonia Creswell Alison Pouplin Anil A. Bharath

We propose a novel deep learning model for classifying medical images in the setting where there is a large amount of unlabelled medical data available, but labelled data is in limited supply. We consider the specific case of classifying skin lesions as either malignant or benign. In this setting, the proposed approach – the semi-supervised, denoising adversarial autoencoder – is able to utilis...

2012
Jie-Zhi Cheng Feng-Ju Chang Kuang-Jui Hsu Yen-Yu Lin

In the class based image segmentation problem, one of the major concerns is to provide large training data for learning complex graphical models. To alleviate the labeling effort, a concise annotation approach working on bounding boxes is introduced. The main idea is to leverage the knowledge learned from a few object contours for the inference of unknown contours in bounding boxes. To this end...

Journal: :CoRR 2015
S. Thenmalar Balaji Jagan T. V. Geetha

The aim of Named Entity Recognition (NER) is to identify references of named entities in unstructured documents, and to classify them into pre-defined semantic categories. NER often aids from added background knowledge in the form of gazetteers. However using such a collection does not deal with name variants and cannot resolve ambiguities associated in identifying the entities in context and a...

2017
Minnan Luo Lingling Zhang Feiping Nie Xiaojun Chang Buyue Qian Qinghua Zheng

Semi-supervised learning plays a significant role in multi-class classification, where a small number of labeled data are more deterministic while substantial unlabeled data might cause large uncertainties and potential threats. In this paper, we distinguish the label fitting of labeled and unlabeled training data through a probabilistic vector with an adaptive parameter, which always ensures t...

Journal: :Pattern Recognition Letters 2008
Yuanqing Li Cuntai Guan Huiqi Li Zhengyang Chin

In this paper, we first present a self-training semi-supervised support vector machine (SVM) algorithm and its corresponding model selection method, which are designed to train a classifier with small training data. Next, we prove the convergence of this algorithm. Two examples are presented to demonstrate the validity of our algorithm with model selection. Finally, we apply our algorithm to a ...

Journal: :IEICE Transactions 2002
Tatsuya Asai Kenji Abe Shinji Kawasoe Hiroki Arimura Hiroshi Sakamoto Setsuo Arikawa

By rapid progress of network and storage technologies, a huge amount of electronic data such as Web pages and XML data [23] has been available on intra and internet. These electronic data are heterogeneous collection of ill-structured data that have no rigid structures, and often called semi-structured data [1]. Hence, there have been increasing demands for automatic methods for extracting usef...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید