Method of Tibetan Person Knowledge Extraction

Authors

  • Yuan Sun
  • Zhen Zhu
Abstract

Person knowledge extraction is the foundation of Tibetan knowledge graph construction. It supports Tibetan question answering systems, information retrieval, information extraction, and other research, and promotes national unity and social stability. This paper proposes an SVM- and template-based approach to Tibetan person knowledge extraction. After constructing a training corpus, we build templates based on shallow parsing analysis of Tibetan syntactic features, semantic features, and verbs. Using the training corpus, we design a hierarchical SVM classifier to perform the entity knowledge extraction. Finally, experimental results show that the method improves Tibetan person knowledge extraction.
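To make the template idea concrete, here is a minimal sketch of template-based person fact extraction. It is not the authors' implementation: the abstract does not list the actual templates, so the patterns, the relation names (`birth_year`, `birthplace`, `occupation`), the `Fact` type, and the `extract` function are all hypothetical, and English sentences stand in for Tibetan text. The hierarchical SVM stage is not shown.

```python
import re
from typing import NamedTuple, Optional


class Fact(NamedTuple):
    """One extracted (person, relation, value) triple. Hypothetical type."""
    person: str
    relation: str
    value: str


# Hypothetical templates: each pairs a relation name with a pattern that
# captures the person and the attribute value. Order matters: the more
# specific birth-year pattern is tried before the birthplace pattern.
TEMPLATES = [
    ("birth_year", re.compile(r"^(?P<person>.+?) was born in (?P<value>\d{4})\.?$")),
    ("birthplace", re.compile(r"^(?P<person>.+?) was born in (?P<value>[A-Z][\w\s-]*?)\.?$")),
    ("occupation", re.compile(r"^(?P<person>.+?) worked as an? (?P<value>.+?)\.?$")),
]


def extract(sentence: str) -> Optional[Fact]:
    """Return the first template match as a Fact, or None if nothing matches."""
    text = sentence.strip()
    for relation, pattern in TEMPLATES:
        match = pattern.match(text)
        if match:
            return Fact(match.group("person"), relation, match.group("value"))
    return None


if __name__ == "__main__":
    for s in ["Tenzin was born in 1950.", "Dolma worked as a teacher."]:
        print(extract(s))
```

In a pipeline like the one the abstract describes, sentences that no template covers would presumably be passed to the trained classifier; that hand-off is an assumption here, not something the abstract spells out.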

Similar resources

Hot Topic Extraction and Public Opinion Classification of Tibetan Texts

The increasing amount of Tibetan information has made Tibetan text processing popular and highly significant. In this study, Tibetan hot topic extraction and public opinion classification were investigated to accelerate the development of Tibetan information processing. First, Tibetan word segmentation in Tibetan hot topic extraction was presented. Second, feature selection based on term freque...

Tibetan-Chinese Bilingual Sentences Alignment Method based on Multiple Features

Sentence-level alignment of bilingual parallel corpora plays a significant and indispensable role in machine translation, translation knowledge acquisition, and bilingual lexicography research, and is fundamental work for natural language processing. Given the great deal of work in sentence alignment and the variety of methods that have been developed for bilingual terminology extraction, those are...

Information Extraction and Change Analysis of Major Lakes in Tibetan Plateau Based on Landsat Remote Sensing Images

The water resources of the Tibetan Plateau, particularly the lakes, have been influenced by global climate change and have also reacted to global change. It is important to study the lake changes in the Tibetan Plateau. This paper aimed to analyse the changes detected from remote sensing images for the typical lakes in the Tibetan Plateau, including Qinghai Lake, Nam Co and Selin Co, using different informat...

Glacier Information Extraction Based on Multi-feature Combination Model

As a typical landform class of the Qinghai-Tibetan Plateau, glaciers are widely distributed in alpine terrain. However, field measurement is impossible in those areas because of complex terrain and adverse weather. First, on the basis of analyzing the features of glacier image spectrum, object shape, spatial relations and environment distribution including terrain and climate, this paper combines ...

Finding and Typing New Named Entities in Tibetan from Chinese-Tibetan Parallel Corpora

Currently there is much interest in the automatic acquisition of entities, with the goal of Named Entity Recognition (NER). However, previous work has focused primarily on major languages, which have large, structured, and semantically rich knowledge bases and large corpora annotated with NER tags. In this paper, we describe a method for Chinese-Tibetan bilingual named entity recognition ...

Journal title:
  • CoRR

Volume abs/1604.02843

Publication date: 2016