Information Retrieval and Database Architecture for Conventional Japanese Character Dictionaries
نویسندگان
چکیده
The cycle of abstraction-reconstruction, which occurs as a fundamental principle in the development of culture and in cognitive processes, is described and analyzed. This approach leads to recognition of boundary conditions for and directions of probable development of cognitive tools. It is shown how the transition from a conventional Japanese-English character dictionary to a multi-dimensional language database is an instance of such an abstraction-reconstruction cycle. The individual phases in the design of a multi-dimensional language database based upon diierent computer software technologies are investigated in regard to the underlying cycle. The methods used in the design of a multi-dimensional language database include the use of unix software tools, classical database methods as well as the use of search engines based upon full text search. Several directions of application and extension for multi-dimensional language databases are discussed.
منابع مشابه
Outline of a database model for electronic dictionaries
The growing avaibility of dictionaries in electronic form calls for a model sophisticated enough to represent the richness of entries and enable complex information retrieval. Electronic dictionaries are a special kind of object, intermediary between a text and a database. Textual models are not powerful enough to handle complex information retrieval, and conventional database models are not fl...
متن کاملPapillon Lexical Database Project: Monolingual Dictionaries & Interlingual Links
This paper presents a new research and development project called Papillon. It started as a French-Japanese cooperation between laboratories GETA/CLIPS (Grenoble, France) and NII (Tokyo, Japan). Its goal is to build a multilingual lexical database and to extract from it digital bilingual dictionaries. The database is built with monolingual dictionaries, one for each language of the database, li...
متن کاملBuilding A Graphetic Dictionary For Japanese Kanji - Character Look-Up Based On Brush Strokes Or Stroke Groups, And The Display Of Kanji As Path Data
kanji contains too little information on how Reading and writing Japanese isn’t easy for Japanese and foreigners alike. While Japanese learn these skills at school, foreigners should be helped by good teaching material and dictionaries. Kanji lexica have to be very different from other dictionaries. Unfortunately existing lexica normally expect that the users already have a lot of information o...
متن کاملExtracting Key Terms from Chinese and Japanese texts
Key term extraction is very useful for information retrieval. Most term extraction methods use one of two approaches, namely lexical and grammatical. We argue that due to the diierences in linguistic and character set characteristics of Chinese and Japanese, a lexical approach is more suitable for Chinese whereas a grammatical approach is more suitable for Japanese. In this paper, we present tw...
متن کاملChinese-Japanese Cross Language Information Retrieval: A Han Character Based Approach
In this paper, we investigate cross language information retrieval (CLIR) for Chinese and Japanese texts utilizing the Han characters common ideographs used in writing Chinese, Japanese and Korean (CJK) languages. The Unicode encoding scheme, which encodes the superset of Han characters, is used as a common encoding platform to deal with the mulfilingual collection in a uniform manner. We discu...
متن کامل