A Large-scale Lexical Semantic Knowledge-base of Chinese
نویسندگان
چکیده
The Semantic Knowledge-base of Contemporary Chinese (SKCC) is a large scale Chinese semantic resource developed by the Institute of Computational Linguistics of Peking University. It provides a large amount of semantic information such as semantic hierarchy and collocation features for 66,539 Chinese words and their English counterparts. Its POS and semantic classification represent the latest progress in Chinese linguistics and language engineering. The descriptions of semantic attributes are fairly thorough, comprehensive and authoritative. The main work in this paper is to introduce the outline of SKCC, and establish a multi-level WSD model based on it. The results indicate that the SCK is effective for word sense disambiguation in Chinese and are likely to be important for general NLP.
منابع مشابه
XHK: The Grammar-based Lexical Semantic Knowledge base
Although the semantic analysis and the grammatical distribution are treated as separate issues in linguistic theories, there is a close interconnection between the two in that the differences in the lexical meanings are often realized at the levels both of the grammatical function and of the lexical collocation. This is the basic assumption utilized when we design and develop Knowledge Base of ...
متن کامل现代汉语语义词典多义词词库的校正和再修订(New Editing and Checking Work of the Semantic Knowledge Base of Contemporary Chinese (SKCC))[In Chinese]
This paper is rooted in the two principles and methods that should be followed by sense discrimination for Chinese language processing: Completeness and discreteness. Built on the comparison of Semantic Knowledge-base of Contemporary Chinese (SKCC) and Grammatical Knowledge base of Contemporary Chinese (GKB), supported by large scale corpus, we conducted our new editing and checking works. Firs...
متن کاملThe semantic Knowledge-base of Contemporary Chinese and Its Applications in WSD
The Semantic Knowledge-base of Contemporary Chinese (SKCC) is a large scale Chinese semantic resource developed by the Institute of Computational Linguistics of Peking University. It provides a large amount of semantic information such as semantic hierarchy and collocation features for 66,539 Chinese words and their English counterparts. Its POS and semantic classification represent the latest ...
متن کاملSome Suggestions on How to Improve the Lexical Semantic Knowledge-Base
Disambiguation, particularly that of lexical meanings is the key problem involved in natural language processing (NLP), and among many means of achieving this end, is to construct a language knowledge base. Nowadays, the knowledge bases established tend to be more and more advanced and fine-grained, with the function of providing the lexical information improved a lot. This paper tries to argue...
متن کاملBuilding a Large Scale Knowledge Base from Chinese Wiki Encyclopedia
DBpedia has been proved to be a successful structured knowledge base, and large scale Semantic Web data has been built by using DBpedia as the central interlinking-hubs of the Web of Data in English. But in Chinese, due to the heavily imbalance in size (no more than one tenth) between English and Chinese in Wikipedia, there are few Chinese linked data are published and linked to DBpedia, which ...
متن کامل