Method of Tibetan Person Knowledge Extraction
Abstract
Person knowledge extraction is the foundation of Tibetan knowledge graph construction. It supports Tibetan question answering systems, information retrieval, information extraction, and other research, and promotes national unity and social stability. This paper proposes an SVM- and template-based approach to Tibetan person knowledge extraction. After constructing a training corpus, we build templates based on shallow parsing analysis of Tibetan syntactic features, semantic features, and verbs. Using the training corpus, we design a hierarchical SVM classifier to perform the entity knowledge extraction. Finally, experimental results show that the method improves Tibetan person knowledge extraction.
Related resources
Hot Topic Extraction and Public Opinion Classification of Tibetan Texts
The increasing amount of Tibetan information has made Tibetan text processing popular and highly significant. In this study, Tibetan hot topic extraction and public opinion classification were investigated to accelerate the development of Tibetan information processing. First, Tibetan word segmentation in Tibetan hot topic extraction was presented. Second, feature selection based on term freque...
Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features
Sentence-level alignment of bilingual parallel corpora plays a significant and indispensable role in machine translation, translation knowledge acquisition, and bilingual lexicography research, and is fundamental work for natural language processing. Given the great deal of work in sentence alignment and the variety of methods that have been developed for bilingual terminology extraction, those are...
Information Extraction and Change Analysis of Major Lakes in Tibetan Plateau Based on Landsat Remote Sensing Images
The water resources of the Tibetan Plateau, particularly the lakes, have been influenced by global climate change and have also reacted to global change. It is important to study the lake changes in the Tibetan Plateau. This paper aimed to analyse the changes detected from remote sensing images for the typical lakes in the Tibetan Plateau, including Qinghai Lake, Nam Co and Selin Co, using different informat...
Glacier Information Extraction Based on Multi-feature Combination Model
As a typical landform class of the Qinghai-Tibetan Plateau, glaciers are widely distributed in alpine terrain. However, field measurement is impossible in those areas because of complex terrain and adverse weather. First, on the basis of analyzing the features of glacier image spectrum, object shape, spatial relations and environment distribution including terrain and climate, this paper combines ...
Finding and Typing New Named Entities in Tibetan from Chinese-Tibetan Parallel Corpora
Currently there is much interest in the automatic acquisition of entities, with the goal of Named Entity Recognition (NER). However, previous work has focused primarily on major languages with large, structured, and semantically rich knowledge bases and large corpora annotated with NER tags. In this paper, we describe a method for Chinese-Tibetan bilingual named entity recognition ...
Journal title: CoRR
Volume: abs/1604.02843
Publication date: 2016