Sinitic Wordnet: Laying the Groundwork with Chinese Varieties Written in Traditional Characters

Authors

  • Chih-Yao Lee
  • Shu-Kai Hsieh
Abstract

The present work seeks to make the logographic nature of Chinese script a relevant research ground in wordnet studies. While wordnets are not so much about words as about the concepts represented in words, synset formation inevitably involves the use of orthographic and/or phonetic representations to serve as headword for a given concept. For wordnets of Chinese languages, if their synsets are mapped with each other, the connection from logographic forms to lexicalized concepts can be explored backwards to, for instance, help trace the development of cognates in different varieties of Chinese. The Sinitic Wordnet project is an attempt to construct such an integrated wordnet that aggregates three Chinese varieties that are widely spoken in Taiwan and all written in traditional Chinese characters.

Similar resources

Salmonella enterica serovar Enteritidis live vaccine strain in the reproductive organs of laying goose after subcutaneous vaccination

Serovar-specific real-time PCR for Salmonella enterica serovar Enteritidis (S. Enteritidis) was conducted to detect the genomic DNA of S. Enteritidis from laying goose after subcutaneous vaccination at different time points. Indirect fluorescent antibody (IFA) technique and immunohistochemical localization were employed to validate the results. The results showed that S. Enteritidis was consistent...

Strategies of Processing Japanese Names and Character Variants in Traditional Chinese Text

This paper proposes an approach to identify word candidates that are not Traditional Chinese, including Japanese names (written in Japanese Kanji or Traditional Chinese characters) and word variants, when doing word segmentation on Traditional Chinese text. When handling personal names, a probability model concerning formats of names is introduced. We also propose a method to map Japanese Kanji...

Procedures and Problems in Korean-Chinese-Japanese Wordnet with Shared Semantic Hierarchy

This paper introduces a Korean-Chinese-Japanese wordnet for nouns, verbs and adjectives. This wordnet is constructed based on a hierarchy of shared semantic categories originated from NTT Goidaikei (Hierarchical Lexical System). The Korean wordnet has been constructed by mapping a semantic category to each Korean word sense in a way that maps the same semantic hierarchy to the meanings of nouns...

Hantology - A Linguistic Resource for Chinese Language Processing and Studying

Hantology, a character-based Chinese language resource, is created to provide an infrastructure for language processing and research on the writing system. Unlike alphabetic or syllabic writing systems, the ideographic writing system of Chinese poses both a challenge and an opportunity. The challenge is that a totally different resource structure must be created to represent and process speaker...

Publication date: 2018