Tree parsing for tree-adjoining machine translation

نویسندگان

  • Matthias Büchse
  • Heiko Vogler
  • Mark-Jan Nederhof
چکیده

Tree parsing is an important problem in statistical machine translation. In this context, one is given (a) a synchronous grammar that describes the translation from one language into another and (b) a recognizable set of trees; the aim is to construct a finite representation of the set of those derivations that derive elements from the given set, either on the source side (input restriction) or on the target side (output restriction). In tree-adjoining machine translation the grammar is a kind of synchronous tree-adjoining grammar. For this case, only partial solutions to the tree parsing problem have been described, some being restricted to the unweighted case, some to the monolingual case. We introduce a class of synchronous tree-adjoining grammars which is effectively closed under input and output restrictions to weighted regular tree languages, i.e. the restricted translations can again be represented by grammars in the same class; this enables, e.g. cascading restrictions. Moreover, we present an algorithm that constructs these grammars for input and output restriction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Non-Projective Parsing for Statistical Machine Translation

We describe a novel approach for syntaxbased statistical MT, which builds on a variant of tree adjoining grammar (TAG). Inspired by work in discriminative dependency parsing, the key idea in our approach is to allow highly flexible reordering operations during parsing, in combination with a discriminative model that can condition on rich features of the sourcelanguage string. Experiments on tra...

متن کامل

Korean Grammar Using TAGs

This paper addresses various issues related to representing the Korean language using Tree Ad joining Grammars Topics covered include Korean grammar using TAGs Machine Translation between Korean and English using Synchronous Tree Adjoining Grammars STAGs handling scrambling using Multi Component TAGs MC TAGs and recovering empty arguments The data for the parsing is from US military telecommuni...

متن کامل

Adjoining Tree-to-String Translation

We introduce synchronous tree adjoining grammars (TAG) into tree-to-string translation, which converts a source tree to a target string. Without reconstructing TAG derivations explicitly, our rule extraction algorithm directly learns tree-to-string rules from aligned Treebank-style trees. As tree-to-string translation casts decoding as a tree parsing problem rather than parsing, the decoder sti...

متن کامل

Tree Parsing with Synchronous Tree-Adjoining Grammars

Restricting the input or the output of a grammar-induced translation to a given set of trees plays an important role in statistical machine translation. The problem for practical systems is to find a compact (and in particular, finite) representation of said restriction. For the class of synchronous treeadjoining grammars, partial solutions to this problem have been described, some being restri...

متن کامل

Optimal $k$-arization of Synchronous Tree-Adjoining Grammar

Synchronous Tree-Adjoining Grammar (STAG) is a promising formalism for syntaxaware machine translation and simultaneous computation of natural-language syntax and semantics. Current research in both of these areas is actively pursuing its incorporation. However, STAG parsing is known to be NP-hard due to the potential for intertwined correspondences between the linked nonterminal symbols in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Log. Comput.

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2014