منابع مشابه
Text Classification Using Word-Based PPM Models
Text classification is one of the most actual among the natural language processing problems. In this paper the application of word-based PPM (Prediction by Partial Matching) model for automatic content-based text classification is described. Our main idea is that words and especially word combinations are more relevant features for many text classification tasks. Key-words for a document in mo...
متن کاملRecent results in combined coding for word-based PPM
In this paper it is presented the lossless PPM (Prediction by Partial string Matching) algorithm and it is studied the way the extended alphabet can be used for the PPM encoding so it will allow the use of symbols which are not present in the alphabet at the beginning of the encoding phase. The extended alphabet can contain symbols with the size larger than a byte and at the decoding external w...
متن کاملWord Ordering with Phrase-Based Grammars
We describe an approach to word ordering using modelling techniques from statistical machine translation. The system incorporates a phrase-based model of string generation that aims to take unordered bags of words and produce fluent, grammatical sentences. We describe the generation grammars and introduce parsing procedures that address the computational complexity of generation under permutati...
متن کاملExtending grammars based on similar-word recognition
Pronunciation variation is extremely widespread and one of the reasons for recognition errors. In this paper we explore how similar-recognized-words can be used to construct or expand more accurate grammars in a specific domain. The domain that serves as framework for this research is the assessment of depression. Assessment of depression is done via a system that verbally administers a discret...
متن کاملLocal grammars in word counting
The results of word counting in text depend on the level of its linguistic annotation. If a text can is regarded as a sequence of alphabetic character strings, without any information on their possible linguistic interpretation we are talking about the rough text. Some quantitative characteristics of texts can be obtained by the application of formal operations on a rough text, but these result...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2017
ISSN: 2156-5570,2158-107X
DOI: 10.14569/ijacsa.2017.081037