Building and Using a Russian Resource Grammar in GF
نویسندگان
چکیده
Grammatical Framework (GF) [5] is a grammar formalism for describing formal and natural languages. An application grammar in GF is usually written for a restricted language domain, e.g. to map a formal language to a natural language. A resource grammar, on the other hand, aims at a complete description of a natural languages. The language-independent grammar API (Application Programmer’s Interface) allows the user of a resource grammar to build application grammars in the same way as a programmer writes programs using a standard library. In an ongoing project, we have developed an API suitable for technical language, and implemented it for English, Finnish, French, German, Italian, Russian, and Swedish. This paper gives an outline of the project using Russian as an example. 1 The GF Resource Grammar Library The Grammatical Framework (GF) is a grammar formalism based on type theory [5]. GF grammars can be considered as programs written in the GF grammar language, which can be compiled by the GF program. Just as with ordinary programming languages, the efficiency of programming labor can be increased by reusing previously written code. For that purpose standard libraries are usually used. To use the library a programmer only needs to know the type signatures of the library functions. Implementation details are hidden from the user. The GF resource grammar library [4] is aimed to serve as a standard library for the GF grammar language. It aims at fairly complete descriptions of different natural languages, starting from the perspective of linguistics structure rather the logical structure of applications. The current coverage is comparable with, but still smaller than, the Core Language Engine (CLE) project [2]. Since GF is a multilingual system the library structure has an additional dimension for different languages. Each language has its own layer, produced by visible to the linguist grammarian. What is visible to the application grammarian is a an API (Application Programmer’s Interface), which abstracts away from linguistic details and is therefore, to a large extent, language-independent. The module structure of a resource grammar layer corresponding to one language is shown in Fig. 1. Arrows indicate the dependencies among the modules.
منابع مشابه
GF Parallel Resource Grammars and Russian
A resource grammar is a standard library for the GF grammar formalism. It raises the abstraction level of writing domainspecific grammars by taking care of the general grammatical rules of a language. GF resource grammars have been built in parallel for eleven languages and share a common interface, which simplifies multilingual applications. We reflect on our experience with the Russian resour...
متن کاملThesis for the Degree of Licentiate of Philosophy
This thesis describes a number of practical experiments rather than theoretical investigations in the area of natural language processing. The basis for the work presented is Grammatical Framework (GF). It is a very complex system, which comprises among other things a grammar formalism based on type theory and its implementation written in Haskell. GF is intended for high-quality machine transl...
متن کاملAn implementation of a Latvian resource grammar in Grammatical Framework
This paper describes an open-source Latvian resource grammar implemented in Grammatical Framework (GF), a programming language for multilingual grammar applications. GF differentiates between concrete grammars and abstract grammars: translation among concrete languages is provided via abstract syntax trees. Thus the same concrete grammar is effectively used for both language analysis and langua...
متن کاملAn Open Source Punjabi Resource Grammar
We describe an open source computational grammar for Punjabi; a resource-poor language. The grammar is developed in GF (Grammatical framework), which is a tool for multilingual grammar formalism. First, we explore different syntactic features of Punjabi and then we implement them in accordance with GF grammar requirements, to make Punjabi the 17th language in the GF resource grammar library.
متن کاملLinguistic Issues in Language Technology – LiLT
The GF Resource Grammar Library is a set of natural language grammars implemented in GF (Grammatical Framework). These grammars are in a strong sense parallel: they are built upon a common abstract syntax, i.e. a common tree structure. Individual languages are obtained via compositional mappings from abstract syntax trees to feature structures specific to each language. The grammar defines, for...
متن کامل