Supporting the Character Sets of Japanese Kanji and Korean Hangul in the ADABAS/NATURAL System

Authors

Masahiro Shimada

Hideki Nishimoto

Takashi Ishizaka

Andreas Schütz

Yong-Soo Kim

Yoshioki Ishii

Abstract

Software AG of Far East (SAGFE) has established various system environments specific to Japanese use since Japanese Kanji was first supported on ADABAS in 1978. Japanese-language support, which began with printing character strings on a special Kanji printer, has recently improved with the development of terminal equipment and controllers. This paper describes the progress of Japanese-language support at SAGFE and Software AG (SAG), West Germany, together with a study of Korean Hangul, as well as Japanese Kanji, carried out by SAG and Penta Computer Korea. We also propose a method called DBCS (double-byte character set) support, discuss the problems involved, and solve them by adapting NATURAL, SAG's fourth-generation language.

Permission to copy without fee all or part of this material is granted provided that the copies are not made or distributed for direct commercial advantage, the DASFAA copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of the Organizing Committee of the International Symposium on Database Systems for Advanced Applications. To copy otherwise, or to republish, requires a fee and/or special permission from the Organizing Committee.

International Symposium on Database Systems for Advanced Applications, Seoul, Korea, April 1989

1. Supporting Japanese Kanji on ADABAS

Around 1,500 years ago, the Japanese borrowed extensively from Chinese Hanzi, by way of Korea, to write their own language. Later, the characters came to represent native Japanese words similar in meaning to the Chinese originals.

Modern Japanese is written as a mixture of ideograms, Kanji, and native phonetic letters, Kana. The Kana phonetic alphabet exists as Hiragana and Katakana, which serve different purposes and differ stylistically. A typical passage of Japanese writing contains Kanji, Hiragana, and perhaps also Katakana. Both Kana syllabaries consist of 46 basic symbols each, and Kanji characters are limited to about 2,000 symbols for official and daily use.

Fig. 1-1 Japanese character and code representation

In the early Japanese computer market, there was intense interest in establishing a system to handle Kanji. However, it took a long time and a vast sum of money to achieve compatibility with existing systems and to develop devices specific to Japanese use. Generally, one byte is used to denote a character, but an eight-bit byte provides for at most 256 characters. Because there are so many different characters, Japanese Kanji must be represented by codes that are usually two bytes long (Fig. 1-1). This requirement is pressing computer manufacturers and vendors in Japan to revise their standard systems substantially. A great number of Kanji typing methods have been developed to date, all of which fall into three main...
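The one-byte versus two-byte arithmetic described in this section can be illustrated with a minimal Python sketch. It is not taken from the paper: Shift JIS and EUC-KR are used only as stand-in double-byte encodings because Python ships codecs for them, and the source does not say which code tables ADABAS/NATURAL used. The helper name `byte_lengths` and the sample characters are likewise hypothetical.

```python
# Illustrative sketch only. Shift JIS and EUC-KR are assumed stand-ins for a
# double-byte character set; the paper does not name the encoding it used.

SINGLE_BYTE_CAPACITY = 2 ** 8    # distinct codes available in one 8-bit byte
DOUBLE_BYTE_CAPACITY = 2 ** 16   # upper bound on codes available in two bytes


def byte_lengths(text, encoding):
    """Return (character, number of encoded bytes) for each character."""
    return [(ch, len(ch.encode(encoding))) for ch in text]


# A Latin letter next to one Kanji, and next to one Hangul syllable.
japanese = byte_lengths("A漢", "shift_jis")
korean = byte_lengths("A한", "euc_kr")

if __name__ == "__main__":
    print("one byte holds at most", SINGLE_BYTE_CAPACITY, "codes")
    print("two bytes hold at most", DOUBLE_BYTE_CAPACITY, "codes")
    for label, pairs in (("shift_jis", japanese), ("euc_kr", korean)):
        for ch, n in pairs:
            print(label, repr(ch), n, "byte(s)")
```

Under these assumed encodings the Latin letter occupies one byte while the Kanji and the Hangul syllable each occupy two, which is why a field that mixes both kinds of character cannot be sized by character count alone.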


Similar resources

Ideographic Alexia without Involvement of the Fusiform Gyrus in a Korean Stroke Patient: A Serial Functional Magnetic Resonance Imaging Study

The Korean orthographic system consists of both phonograms (Hangul) and ideograms (Hanja). Hangul is a phonetic alphabet comprised of consonants and vowels that are grouped together to form syllables that generally exhibit regular correspondences between graphemes and phonemes. On the other hand, Hanja is derived from complex Chinese characters with distinct meanings. In this respect, Hanja and...


Tsukurimashou: a Japanese-language Font Meta-family

METAFONT-based font projects for the Chinese, Japanese, and Korean (CJK) languages have been announced every few years since the early 1980s, even predating the current form of the METAFONT language. Except for a few non-parameterized conversions of fonts that originated in other formats, in 30 years every METAFONT CJK font has been abandoned at or before the 8-bit barrier of 256 kanji, nowhere...


Effects of Related Term Extraction in Transliteration into Chinese

To transliterate foreign technical terms and proper nouns, in Japanese and Korean, phonograms, such as Katakana and Hangul, are used. In Chinese, the pronunciation of a source word is spelled out with Kanji characters. However, because Kanji comprises ideograms, different Kanji are associated with the same pronunciation, but can potentially convey different meanings and impressions. In this pap...


Problems and Approaches for Oriental Document Analysis

Machine understanding of hand-filled documents in China, Japan and Korea requires not only general solutions of document analysis but also the ability to handle peculiarities of the Oriental languages. As expected, handwritten Chinese character recognition is the major task for it. In addition, Japanese Kana, Korean Hangul, Roman alphabet as well as numerals are targets of recognition. The main dif...


upTeX: Unicode Version of pTeX with CJK Extensions

upTeX is a Unicode extension of ASCII’s pTeX (a Japanese-localized TeX). It not only improves Japanese support, but also handles Chinese and Korean characters, i.e., Kanji (Hanzi, Hanja), Kana, CJK symbols, and Hangul with Unicode. Moreover, it can process multilingual typesetting of original LaTeX with inputenc and Babel (Latin, Cyrillic, Greek, etc.) by switching its \kcatcode tables. This pap...


Publication date: 1989
