Bilexical Dependencies as an Intermedium for Data-Driven and HPSG-Based Parsing

نویسنده

  • Angelina Ivanova
چکیده

Bilexical dependencies capturing asymmetrical lexical relations between heads and dependents are viewed as a practical representation of syntax that is well-suited for computation and intelligible for human readers. In the present work we use dependency representations as a bridge between data-driven and grammar-based parsing, both for cross-framework parser comparison and for parser integration. We observe that the state of the art in dependency parsing for English is characterized by broad diversity of dependency representations and seek to systematize properties of various dependency formats pointing out their similarities and differences by carrying out qualitative and quantitative structural analysis and furthermore exploring learnability of four of these representations in automatic syntactic analysis. In addition to comparing syntactic dependencies along several evaluation measures for parsing, we also evaluate the representations in application to the negation resolution task. Using a dependency representation extracted from HPSG structures we contrast three different approaches to parsing—data-driven dependency, phrase structure and a hybrid grammarbased—observe what trade-offs apply along accuracy, efficiency, coverage, and resilience to domain variation and show that explicit, hand-engineered grammatical knowledge helps in both accuracy and cross-domain parsing performance. We complement intrinsic parser evaluation with extrinsic comparison on the negation resolution and semantic dependency parsing tasks discovering that accuracy gains sometimes but not always translate into improved end-to-end performance. A combination of complementary approaches is often a good strategy for achieving improvement. We explore parser integration as a method for advancing the efficiency of a grammarbased parser. Bilexical dependencies serve as an interface for enforcing constraints drawn from the output of the statistical, data-driven systems on the unification-based processing of the grammar-based parser. We experiment with confidence thresholding, filtering and parser ensembles for tackling the problem of selecting high-quality dependencies and propose a technique of static analysis as preliminary evaluation in navigating a large space of various combination setups. We choose configurations optimizing for speed, coverage and balancing the two metrics and carefully evaluate the trade-offs along efficiency, coverage, accuracy and domainresilience.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Different Approaches to Syntactic Analysis Into Bi-Lexical Dependencies An Empirical Comparison of Direct, PCFG-Based, and HPSG-Based Parsers

We compare three different approaches to parsing into syntactic, bi-lexical dependencies for English: a ‘direct’ data-driven dependency parser, a statistical phrase structure parser, and a hybrid, ‘deep’ grammar-driven parser. The analyses from the latter two are post-converted to bilexical dependencies. Through this ‘reduction’ of all three approaches to syntactic dependency parsers, we determ...

متن کامل

Transition-Based Parsing for Deep Dependency Structures

Derivations under different grammar formalisms allow extraction of various dependency structures. Particularly, bilexical deep dependency structures beyond surface tree representation can be derived from linguistic analysis grounded by CCG, LFG, and HPSG. Traditionally, these dependency structures are obtained as a by-product of grammar-guided parsers. In this article, we study the alternative ...

متن کامل

Evaluating Contribution of Deep Syntactic Information to Shallow Semantic Analysis

This paper presents shallow semantic parsing based only on HPSG parses. An HPSG-FrameNet map was constructed from a semantically annotated corpus, and semantic parsing was performed by mapping HPSG dependencies to FrameNet relations. The semantic parsing was evaluated in a Senseval-3 task; the results suggested that there is a high contribution of syntactic information to semantic analysis.

متن کامل

Decomposing Bilexical Dependencies into Semantic and Syntactic Vectors

Bilexical dependencies have been commonly used to help identify the most likely parses of a sentence. The probability of a word occurring as the dependent of a given head within a particular structure provides a measure of semantic plausibility that complements the purely syntactic part of the parsing model. Here, we attempt to use the distributional information within these bilexical dependenc...

متن کامل

HPSG Parsing with Shallow Dependency Constraints

We present a novel framework that combines strengths from surface syntactic parsing and deep syntactic parsing to increase deep parsing accuracy, specifically by combining dependency and HPSG parsing. We show that by using surface dependencies to constrain the application of wide-coverage HPSG rules, we can benefit from a number of parsing techniques designed for highaccuracy dependency parsing...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015