نتایج جستجو برای: stop words

تعداد نتایج: 178858  

2011
Eugene Charniak Paul McCann

Stylometry is usually concerned with finding an authorial invariant, and attempts at authorship identification often model authors based on topics, putting weight on consciously selected content words and their unconsciously controlled frequency. This paper presents two uses of stylometry that rely on factors beyond the control of the author: document dating to a given historical period by dete...

Journal: :Neurology 2004
Chun-Liang Pan Meng-Fai Kuo Sung-Tsang Hsieh

The authors describe a patient with auditory agnosia caused by a tectal germinoma. Despite having normal audiometric tests, the patient failed to recognize words and musical characters. On head MRI, the inferior colliculi were infiltrated by tumor. Neuropsychological tests revealed severe impairment in recognition of environmental sounds and words, defective musical perception, and stop consona...

2006
Henry S Thompson

Many phonemic units can be readily described as a configuration of segments of acoustic features. Certain stop allophones, for example, can be constructed out of the features silence, hurst release and aspiration. It seems natural to specify this as a rewrite rule that can be interpreted by a parser: stop <silence + burst + aspi ration. Conventional parsers construct grammatical phrases out of ...

1998
Johannes Fürnkranz

In this paper, we study the effect of using -grams (sequences of words of length ) for text categorization. We use an efficient algorithm for generating such -gram features in two benchmark domains, the 20 newsgroups data set and 21,578 REUTERS newswire articles. Our results with the rule learning algorithm RIPPER indicate that, after the removal of stop words, word sequences of length 2 or 3 a...

2017
Ekaterina Chernyak

In this paper we address the problem of filtering obscene lexis in Russian texts. We use string similarity measures to find words similar or identical to words from a stop list and establish both a test collection and a baseline for the task. Our experiments show that a novel string similarity measure based on the notion of an annotated suffix tree outperforms some of the other well known measu...

2016
Jaideepsinh K. Raulji Jatinderkumar R. Saini

In the Information era, optimization of processes for Information Retrieval, Text Summarization, Text and Data Analytic systems becomes utmost important. Therefore in order to achieve accuracy, extraction of redundant words with low or no semantic meaning must be filtered out. Such words are known as stopwords. Stopwords list has been developed for languages like English, Chinese, Arabic, Hindi...

2012
Amira Shoukry Ahmed Rafea

Research done on Arabic sentiment analysis is considered very limited almost in its early steps compared to other languages like English whether at document-level or sentence-level. In this paper, we test the effect of preprocessing (normalization, stemming, and stop words removal) on the performance of an Arabic sentiment analysis system using Arabic tweets from twitter. The sentiment (positiv...

Journal: :Inf. Services and Use 2017
Bernd Pulverer Chris Armbruster

Issues around research ethics and the reproducibility of research are impacting the credibility of science. Firstly, we look at what is to be understand by research ethics, misconduct, and reproducibility. Next, we examine some examples of fraud, beautification, and failed reproducibility. Next, we address the causes and possible resolutions, culminating in the question whether the scholarly li...

Journal: :Journal of Japan Society for Fuzzy Theory and Intelligent Informatics 2022

In this research, we propose a method of labeling topic word for the sections in chat dialogue. First, investigated how humans label words. As result, it was found that words included first sentence which started and associated with appearing evenly section were given as It also non-characteristic appear other selected when is based on frequency appearance section. We obtain high TF-IDF candida...

Journal: :Brain and language 2013
Natasha Bullock-Rest Alissa Cerny Carol Sweeney Carole Palumbo Kathleen Kurowski Sheila E Blumstein

Previous behavioral work has shown that the phonetic realization of words in spoken word production is influenced by sound shape properties of the lexicon. A recent fMRI study (Peramunage, Blumstein, Myers, Goldrick, & Baese-Berk, 2011) showed that this influence of lexical structure on phonetic implementation recruited a network of areas that included the supramarginal gyrus (SMG) extending in...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید