test words

Search results for: test words

Number of results: 938092

Learning Bigrams from Unigrams

2008

Xiaojin Zhu Andrew B. Goldberg Michael G. Rabbat Robert D. Nowak

Traditional wisdom holds that once documents are turned into bag-of-words (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisingly, is able to learn a bigram language model from a set of bag-of-words documents. At its heart, our approach is an EM algorithm that seeks a model which maximizes the regularized marginal likelihood of the bag-of-wor...

An Autoencoder Approach to Learning Bilingual Word Representations

2014

A. P. Sarath Chandar Stanislas Lauly Hugo Larochelle Mitesh M. Khapra Balaraman Ravindran Vikas C. Raykar Amrita Saha

Cross-language learning allows one to use training data from one language to build models for a different language. Many approaches to bilingual learning require that we have word-level alignment of sentences from parallel corpora. In this work we explore the use of autoencoder-based methods for cross-language learning of vectorial word representations that are coherent between two languages, w...

Concept-Based Feature Generation and Selection for Information Retrieval

2008

Ofer Egozi Evgeniy Gabrilovich Shaul Markovitch

Traditional information retrieval systems use query words to identify relevant documents. In difficult retrieval tasks, however, one needs access to a wealth of background knowledge. We present a method that uses Wikipedia-based feature generation to improve retrieval performance. Intuitively, we expect that using extensive world knowledge is likely to improve recall but may adversely affect pr...

A Study on the Effectiveness of Bilingual Teaching of Cognate Words (Persian-English) on Iranian EFL Learners' Knowledge of Lexical Improvement

Thesis: Rudaki Non-Governmental Institute of Higher Education, Tonekabon - Faculty of Literature and Foreign Languages 1393 SH

Sajedeh Fallah Morteza Khodabandehlou Shahrokh Jahandar

This study aimed at investigating the effect of bilingual teaching of cognate words (Persian-English) on Iranian upper-intermediate EFL learners' knowledge of lexical development. For this purpose, 100 subjects participated in this study, out of which 40 learners were selected and assigned to two groups, control and experimental. Cross-language cognates (wor...

Fusing semantic aspects for image annotation and retrieval

Journal: J. Visual Communication and Image Representation 2010

Zhixin Li Zhiping Shi Xi Liu Zhiqing Li Zhongzhi Shi

In this paper, we present an approach based on probabilistic latent semantic analysis (PLSA) to achieve the task of automatic image annotation and retrieval. In order to model training data precisely, each image is represented as a bag of visual words. Then a probabilistic framework is designed to capture semantic aspects from visual and textual modalities, respectively. Furthermore, an adaptiv...

Optimizing feature set for Chinese Word Sense Disambiguation

2004

Zheng-Yu Niu Dong-Hong Ji Chew Lim Tan

This article describes the implementation of the I2R word sense disambiguation system (I2R-WSD) that participated in one Senseval-3 task: the Chinese lexical sample task. Our core algorithm is a supervised Naive Bayes classifier. This classifier utilizes an optimal feature set, which is determined by maximizing the cross-validated accuracy of the NB classifier on training data. The optimal feature set incl...

Improving a Page Classifier with Anchor Extraction and Link Analysis

2002

William W. Cohen

Most text categorization systems use simple models of documents and document collections. In this paper we describe a technique that improves a simple web page classifier’s performance on pages from a new, unseen web site, by exploiting link structure within a site as well as page structure within hub pages. On real-world test cases, this technique significantly and substantially improves the a...

The Benefit of Salience: Salient Accented, but Not Unaccented Words Reveal Accent Adaptation Effects

2016

Ann-Kathrin Grohe Andrea Weber

In two eye-tracking experiments, the effects of salience in accent training and speech accentedness on spoken-word recognition were investigated. Salience was expected to increase a stimulus' prominence and therefore promote learning. A training-test paradigm was used on native German participants utilizing an artificial German accent. Salience was elicited by two different criteria: production...

The RepEval 2017 Shared Task: Multi-Genre Natural Language Inference with Sentence Representations

2017

Nikita Nangia Adina Williams Angeliki Lazaridou Samuel R. Bowman

This paper presents the results of the RepEval 2017 Shared Task, which evaluated neural network sentence representation learning models on the Multi-Genre Natural Language Inference corpus (MultiNLI) recently introduced by Williams et al. (2017). All of the five participating teams beat the bidirectional LSTM (BiLSTM) and continuous bag of words baselines reported in Williams et al. The best si...

Bootstrapping polarity classifiers with rule-based classification

Journal: Language Resources and Evaluation 2013

Michael Wiegand Manfred Klenner Dietrich Klakow

In this article, we examine the effectiveness of bootstrapping supervised machine-learning polarity classifiers with the help of a domain-independent rule-based classifier that relies on a lexical resource, i.e., a polarity lexicon and a set of linguistic rules. The benefit of this method is that though no labeled training data are required, it allows a classifier to capture in-domain knowledge ...
