Implementing Hpsg with Modular Tools for Fast Compiling and Parsing

نویسنده

  • Graham Wilcock
چکیده

We describe a modular HPSG implementation , based on a set of tools rather than a single monolithic engine such as ALE. With these tools we can use techniques for much faster compiling and parsing than ALE. We use two-stage grammar compilation with partial execution and a concurrent process implementation of the chart for fast parsing. We compile HPSG lexical rules into Prolog rules used at run-time by the morphological preprocess, thus eliminating lexical rule expansion at compile-time as in ALE. This gives fast lexicon compilation, and also allows further exploitation of lexical rules to eliminate empty categories.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Introduction to Data-Oriented Parsing

We present HPSG–DOP, a method for automatically extracting a Stochas-tic Lexicalized Tree Grammar (SLTG) from a HPSG source grammar and a given corpus. 1 Processing of a SLTG is performed by a specialized fast parser. The approach has been tested on a large English grammar and has been shown to achieve additional performance increase compared to parsing with a highly tuned HPSG parser. Our appr...

متن کامل

Steps towards a GENIA Dependency Treebank

In this paper we describe on-going work aimed at creating a dependency-based annotated treebank for the BioMedical domain. Our starting point is the GENIA corpus [14], which is a corpus of 2000 MEDLINE abstracts, which has been manually annotated for various biological entities, according to the GENIA Ontology.1 There is an exponential growth of published research in this sector, which makes it...

متن کامل

Compiling an HPSG-based grammar into more than one CFG

Recently, the performance of HPSG parsing has been improved so that the parsers can be applied to real-world texts. CFG filtering is one of the techniques which contributed to this progress. It improved parsing speed by filtering impossible parse trees by using the CFG compiled from a given HPSGbased grammar. However, there is a limit in the speed-up. This is because the compiled CFG grows into...

متن کامل

Extremely Lexicalized Models for Accurate and Fast HPSG Parsing

This paper describes an extremely lexicalized probabilistic model for fast and accurate HPSG parsing. In this model, the probabilities of parse trees are defined with only the probabilities of selecting lexical entries. The proposed model is very simple, and experiments revealed that the implemented parser runs around four times faster than the previous model and that the proposed model has a h...

متن کامل

CuteForce - Deep Deterministic HPSG Parsing

We present a deterministic HPSG parser capable of processing text incrementally with very fast parsing times. Our system demonstrates an efficient data-driven approach that achieves a high level of precision. Through a series of experiments in different configurations, we evaluate our system and compare it to current state-of-the-art within the field, and show that high quality deterministic pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005