Role of Text Mining for Lexical Monolingual Punjabi Dictionary
نویسندگان
چکیده
This document gives brief description about various Punjabi dictionaries that have been developed and text mining techniques. The various text mining techniques are briefly studied and are compared according to different parameters. The best technique is then proposed for developing a text mining model for Punjabi data dictionary. This purposed model can be used to create a dynamic ever updating dictionary model. Keywords— Text Mining, Punjabi Dictionary, PLSA, LSA, LDA, CTM
منابع مشابه
The Habanera Lexical Knowledge Base Management System
Habanera is a multipurpose multilingual lexical knowledge base that is developed at CRL to be used as a central repository of multilingual lexical data. The knowledge base contains a set of dictionaries and relations between entries, within a dictionary (e.g., synonymy) as well as between entries of different dictionaries (e.g., translation). The format of monolingual lexical entries is left re...
متن کاملDisregarding the Corpus: Head- word and Sense Treatment in Shona Monolingual Lexicography*
With specific reference to Shona monolingual lexicography, this article discusses how corpus-based lexicographers might, in some instances, decide not strictly to adhere to the corpus when it comes to headword and sense treatment. The writer is a member of the African Languages Research Institute (ALRI), formerly known as the African Languages Lexical (ALLEX) Project. ALRI is a nonfaculty inter...
متن کاملAutomatically Populating Acception Lexical Database through Bilingual Dictionaries and Conceptual Vectors
The NLP team of LIRMM currently works on thematic and lexical disambiguation text analysis [Lafourcade, 2001]. We built a system, with automated learning capabilities, based on conceptual vectors for meaning representation. Vectors are supposed to encode ideas associated to words or expressions. In the framework of Acception Based Lexical Database (instantiated through the Papillon project), we...
متن کاملOn multiword lexical units and their role in maritime dictionaries
Multi-word lexical units are a typical feature of specialized dictionaries, in particular monolingual and bilingual maritime dictionaries. The paper studies the concept of the multi-word lexical unit and considers the similarities and differences of their selection and presentation in monolingual and bilingual maritime dictionaries. The work analyses such issues as the classification of multi-w...
متن کاملLaying Lexical Foundations for NLP: the Case of Basque at the Ixa Research Group
The purpose of this paper is to present the strategy and methodology followed at the Ixa NLP Group of the University of The Basque Country in laying the lexical foundations for language processing. Monolingual and bilingual dictionaries, text corpora, and linguists’ knowledge have been the main information sources from which lexical knowledge currently present in our NLP system has been acquire...
متن کامل