Search results for: stop words

Number of results: 178858

2005
Luis M. T. Jesus Christine H. Shadle

In a study of European Portuguese fricatives it was noted that /ɾ, ʀ/ were often realized as [χ, ʁ, ɾ̥], which are, respectively, the voiceless and voiced uvular fricatives and the voiceless tapped alveolar fricative. Although these phones have not previously been recognized as occurring in European Portuguese, they occurred in 115 words out of a corpus of 1304 words, out of which 107 words could be an...

2017
Fanqing Meng Wenpeng Lu Yuteng Zhang Jinyong Cheng Yuehan Du Shuwang Han

This paper reports the details of our submissions to Task 1 of SemEval 2017. This task aims at assessing the semantic textual similarity of two sentences or texts. We submit three unsupervised systems based on word embeddings. The runs differ in the preprocessing applied to the evaluation data. The best performance of these systems on the evaluation of Pearson correlation is...

2013
Prabu Palanisamy Vineet Yadav Harsha Elchuri

This paper describes the system developed by the Serendio team for the SemEval-2013 Task 2 competition (Task A). We use a lexicon-based approach for discovering sentiments. Our lexicon is built from the Serendio taxonomy. The Serendio taxonomy consists of positive, negative, negation, stop words and phrases. A typical tweet contains word variations, emoticons, hashtags, etc. We use preprocessing...
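A lexicon-based approach of the kind this abstract describes can be illustrated with a minimal sketch. The word sets below are hypothetical toy lists, not the Serendio taxonomy, and the negation rule (flip the polarity of the next sentiment word) is an assumption made for illustration rather than the team's actual method.

```python
import re

# Hypothetical toy lexicon; the Serendio taxonomy itself is not reproduced here.
POSITIVE = {"good", "great", "love"}
NEGATIVE = {"bad", "awful", "hate"}
NEGATION = {"not", "never", "no"}
STOP_WORDS = {"the", "a", "is", "this", "i", "it"}

def score_tweet(text):
    """Return a signed score: >0 positive, <0 negative, 0 neutral."""
    # Lowercase and keep alphabetic tokens only (drops hashtag marks, punctuation).
    tokens = re.findall(r"[a-z']+", text.lower())
    # Stop words carry no sentiment, so they are removed before scoring.
    tokens = [t for t in tokens if t not in STOP_WORDS]
    score = 0
    negate = False
    for tok in tokens:
        if tok in NEGATION:
            negate = True  # assumed rule: flip the next sentiment word
            continue
        polarity = (tok in POSITIVE) - (tok in NEGATIVE)
        if polarity:
            score += -polarity if negate else polarity
            negate = False
    return score
```

For example, `score_tweet("I love this phone")` yields a positive score, while `score_tweet("This is not good")` yields a negative one because the negation word flips the polarity of "good".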

Journal: The Journal of the Acoustical Society of America 2011
Chi-Yueh Lin Hsiao-Chuan Wang

The voice onset time (VOT) of a stop consonant is the interval between its burst onset and voicing onset. Among a variety of research topics on VOT, one that has been studied for years is how VOTs are efficiently measured. Manual annotation is a feasible way, but it becomes a time-consuming task when the corpus size is large. This paper proposes an automatic VOT estimation method based on an on...

2015
Tal Linzen Timothy O'Donnell

The phonotactics of a language describes the ways in which the sounds of the language combine to form possible morphemes and words. Humans can learn phonotactic patterns at the level of abstract classes, generalizing across sounds (e.g., “words can end in a voiced stop”). Moreover, they rapidly acquire these generalizations, even before they acquire sound-specific patterns. We present a probabil...

Journal: IJITWE 2011
Bassam Al-Shargabi Fekry Olayah Waseem Al-Romimah

In this paper, an experimental study was conducted on three techniques for Arabic text classification. These techniques are Support Vector Machine (SVM) with Sequential Minimal Optimization (SMO), Naïve Bayesian (NB), and J48. The paper assesses the accuracy for each classifier and determines which classifier is more accurate for Arabic text classification based on stop words elimination. The a...

2014
Swati Joshi Dharmendra Sharma

Data mining is a technique for evaluating data to discover hidden patterns in raw data. Real-world data is often unstructured, so data mining methods and techniques are helpful for extracting meaningful content from it. In the presented study, data mining techniques are used to find valuable content from the tex...

Journal: CoRR 2008
Ibrahim Abu El-Khair

The effectiveness of three stop word lists for Arabic information retrieval (General Stoplist, Corpus-Based Stoplist, and Combined Stoplist) was investigated in this study. Three popular weighting schemes were examined: the inverse document frequency weight, probabilistic weighting, and statistical language modelling. The idea is to combine the statistical approaches with linguistic approaches ...
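The relationship between a corpus-based stoplist and the inverse document frequency weight can be sketched roughly as follows. This is an illustration under stated assumptions, not the study's procedure: the function names, the whitespace tokenizer, the `max_idf` threshold, and the English toy corpus are all hypothetical, and the idf formula used is the plain `log(N / df)` variant.

```python
import math
from collections import Counter

def idf_scores(docs):
    """Compute idf = log(N / df) for every term in a list of documents."""
    n = len(docs)
    df = Counter()
    for doc in docs:
        # Count each term once per document (document frequency).
        df.update(set(doc.lower().split()))
    return {term: math.log(n / count) for term, count in df.items()}

def corpus_stoplist(docs, max_idf=0.0):
    """Terms whose idf is at or below max_idf (assumed threshold)."""
    return {t for t, v in idf_scores(docs).items() if v <= max_idf}

def combined_stoplist(general, docs, max_idf=0.0):
    """Union of a hand-built general list and the corpus-derived list."""
    return set(general) | corpus_stoplist(docs, max_idf)
```

With a toy corpus such as `["the cat sat", "the dog ran", "the bird flew"]`, only "the" appears in every document, so it is the only term with an idf of zero and the only one placed on the corpus-derived list at the default threshold.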
