A Large-scale Lexical Semantic Knowledge-base of Chinese

نویسندگان

  • Hui Wang
  • Shiwen Yu
چکیده

The Semantic Knowledge-base of Contemporary Chinese (SKCC) is a large scale Chinese semantic resource developed by the Institute of Computational Linguistics of Peking University. It provides a large amount of semantic information such as semantic hierarchy and collocation features for 66,539 Chinese words and their English counterparts. Its POS and semantic classification represent the latest progress in Chinese linguistics and language engineering. The descriptions of semantic attributes are fairly thorough, comprehensive and authoritative. The main work in this paper is to introduce the outline of SKCC, and establish a multi-level WSD model based on it. The results indicate that the SCK is effective for word sense disambiguation in Chinese and are likely to be important for general NLP.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

XHK: The Grammar-based Lexical Semantic Knowledge base

Although the semantic analysis and the grammatical distribution are treated as separate issues in linguistic theories, there is a close interconnection between the two in that the differences in the lexical meanings are often realized at the levels both of the grammatical function and of the lexical collocation. This is the basic assumption utilized when we design and develop Knowledge Base of ...

متن کامل

现代汉语语义词典多义词词库的校正和再修订(New Editing and Checking Work of the Semantic Knowledge Base of Contemporary Chinese (SKCC))[In Chinese]

This paper is rooted in the two principles and methods that should be followed by sense discrimination for Chinese language processing: Completeness and discreteness. Built on the comparison of Semantic Knowledge-base of Contemporary Chinese (SKCC) and Grammatical Knowledge base of Contemporary Chinese (GKB), supported by large scale corpus, we conducted our new editing and checking works. Firs...

متن کامل

The semantic Knowledge-base of Contemporary Chinese and Its Applications in WSD

The Semantic Knowledge-base of Contemporary Chinese (SKCC) is a large scale Chinese semantic resource developed by the Institute of Computational Linguistics of Peking University. It provides a large amount of semantic information such as semantic hierarchy and collocation features for 66,539 Chinese words and their English counterparts. Its POS and semantic classification represent the latest ...

متن کامل

Some Suggestions on How to Improve the Lexical Semantic Knowledge-Base

Disambiguation, particularly that of lexical meanings is the key problem involved in natural language processing (NLP), and among many means of achieving this end, is to construct a language knowledge base. Nowadays, the knowledge bases established tend to be more and more advanced and fine-grained, with the function of providing the lexical information improved a lot. This paper tries to argue...

متن کامل

Building a Large Scale Knowledge Base from Chinese Wiki Encyclopedia

DBpedia has been proved to be a successful structured knowledge base, and large scale Semantic Web data has been built by using DBpedia as the central interlinking-hubs of the Web of Data in English. But in Chinese, due to the heavily imbalance in size (no more than one tenth) between English and Chinese in Wikipedia, there are few Chinese linked data are published and linked to DBpedia, which ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003