n grams

نتایج جستجو برای: n grams

تعداد نتایج: 982486 فیلتر نتایج به سال:

N-gramas sintácticos no-continuos

Journal: :Polibits 2013

Grigori Sidorov

In this paper, we present the concept of noncontinuous syntactic n-grams. In our previous works we introduced the general concept of syntactic n-grams, i.e., n-grams that are constructed by following paths in syntactic trees. Their great advantage is that they allow introducing of the merely linguistic (syntactic) information into machine learning methods. Certain disadvantage is that previous ...

متن کامل

Unordered N-gram Representation Based on Zero-suppressed BDDs for Text Mining and Classification

2007

Ryutaro Kurai Shin-ichi Minato Thomas Zeugmann

In this paper, we present a new method to analyze unordered n-grams by using ZBDDs (Zero-suppressed BDDs). n-grams have been used not only for text analysis but also for text indexing in some search engines. We newly use a variation of n-grams called unordered n-grams. Unordered n-grams abstract from the position of the characters in each n-gram, i.e., they just deal with the range of ordinary ...

متن کامل

Syntactic Dependency-Based N-grams as Classification Features

2012

Grigori Sidorov Francisco Velasquez Efstathios Stamatatos Alexander F. Gelbukh Liliana Chanona-Hernández

In this paper we introduce a concept of syntactic n-grams (sn-grams). Sn-grams differ from traditional n-grams in the manner of what elements are considered neighbors. In case of sn-grams, the neighbors are taken by following syntactic relations in syntactic trees, and not by taking the words as they appear in the text. Dependency trees fit directly into this idea, while in case of constituency...

متن کامل

Syntactic N-grams as machine learning features for natural language processing

Journal: :Expert Syst. Appl. 2014

Grigori Sidorov Francisco Velasquez Efstathios Stamatatos Alexander F. Gelbukh Liliana Chanona-Hernández

In this paper we introduce and discuss a concept of syntactic n-grams (sn-grams). Sn-grams differ from traditional n-grams in the manner how we construct them, i.e., what elements are considered neighbors. In case of sngrams, the neighbors are taken by following syntactic relations in syntactic trees, and not by taking words as they appear in a text, i.e., sn-grams are constructed by following ...

متن کامل

CIC-FBK Approach to Native Language Identification

2017

Ilia Markov Lingzhen Chen Carlo Strapparava Grigori Sidorov

We present the CIC-FBK system, which took part in the Native Language Identification (NLI) Shared Task 2017. Our approach combines features commonly used in previous NLI research, i.e., word n-grams, lemma n-grams, part-of-speech n-grams, and function words, with recently introduced character n-grams from misspelled words, and features that are novel in this task, such as typed character n-gram...

متن کامل

Signature-Based Malware Detection Using Sequences of N-grams

Journal: :International Journal of Engineering & Technology 2018

متن کامل

Towards Arabic Spell-Checker Based on N-Grams Scores

Journal: :International Journal of Computer Applications 2012

متن کامل

key lexical chunks in applied linguistics article abstracts

Journal: :journal of teaching language skills 2015

hadi farjami

in any discourse domain, certain chunks are particularly frequent and deserve attention by the novice to be initiated and by the expert to maintain a sense of community. to make a relevant contribution to the awareness about applied linguistics texts and discourse, this study attempted to develop lists of lexical chunks frequently used in the abstracts of applied linguistics journals. the abstr...

متن کامل

Comparing word, character, and phoneme n-grams for subjective utterance recognition

2008

Theresa Wilson Stephan Raaijmakers

In this paper, we compare the performance of classifiers trained using word n-grams, character n-grams, and phoneme n-grams for recognizing subjective utterances in multiparty conversation. We show that there is value in using very shallow linguistic representations, such as character n-grams, for recognizing subjective utterances, in particular, gains in the recall of subjective utterances.

متن کامل

Comparing Medline citations using modified N-grams: Table 1

Journal: :Journal of the American Medical Informatics Association 2014

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید