Comments on Mohri, Pereira and Riley

نویسنده

András Kornai

چکیده

Throughout the history of computational linguistics the rule-based and the statistics-based approaches appeared as competing rather than complementary threads of research. Even today, many view the success of the Xerox rule-based taggers as a threat to the more statistically oriented taggers, just as a few years ago the IBM statistical approach was viewed as a threat to rule-based systems of machine translation. The historical importance of the AT&T work on weighted transducers lies in the fact that for the first time it makes possible a genuine integration of the two approaches. Under the heading of “speech-natural language integration” we usually find the shotgun marriage of two completely disjoined systems, each with its own distinct theoretical apparatus and algorithmic building blocks. In contrast, here we find surprisingly smooth integration, both in terms of underlying theory and in terms of shared algorithms. This is a very significant accomplishment, and the main goal of my comments is to situate it as a particular stage of a constant developmental trend towards greater integration. I will ask how much the good sides of the rule-based and the data-based approaches have been preserved, and what, if anything, has been lost by taking this approach. I will also ask how far the present approach can be pushed, and offer some speculative remarks on future directions. First let me take a clear and unambiguous stance on the rule-based vs. statistics-based debate: rules are better. As a simple illustration, consider Fig. 1 which shows the performance of a bank check OCR system developed by the author [2] under three conditions: using a bigram language model, a finite state grammar, and a combination of the two. As it is evident from Fig. 1, the rule-based system fares much better than the statistics-based, and in fact the latter adds very little to the performance of a system already containing the former. So the question is not so much an overarching philosophical problem of whether rules are better, but rather the more mundane practical problem of finding the rules. To the extent that the rules, constrains and representations constituting the grammar are devised by grammarians like Quirk and Greenbaum [3], computational linguists can get a free ride, and they should avail themselves of the opportunity. But to the extent that Quirk et al. represent the culmination of an extremely sophisticated descriptive tradition of a singularly deeply researched language, it appears very unlikely that more than a handful languages could be handled in the same fashion. Again as an illustration, readers are invited to consider 0 10 20 30 40 50 60 70 80

متن کامل

منابع مشابه

Weighted Automata in Text

Processing Mehryar Mohri, Fernando Pereira and Michael Riley AT&T Research 600 Mountain Avenue Murray Hill, 07974 NJ fmohri,pereira,[email protected] Abstract. Finite-state automata are a very e ective tool in natural language processing. However, in a variety of applications and especially in speech precessing, it is necessary to consider more general machines in which arcs are assigned ...

متن کامل

Transducer composition for context-dependent network expansion

Context-dependent models for language units are essential in high-accuracy speech recognition. However, standard speech recognition frameworks are based on the substitution of lowerlevel models for higher-level units. Since substitution cannot express context-dependency constraints, actual recognizers use restrictive model-structure assumptions and specialized code for context-dependent models,...

متن کامل

The Design Principles of a Weighted Finite-State Transducer Library

We describe the algorithmic and software design principles of an object-oriented library for weighted finite-state transducers. By taking advantage of the theory of rational power series, we were able to achieve high degrees of generality, modularity and irredundancy, while attaining competitive efficiency in demanding speech processing applications involving weighted automata of more than 107 ...

متن کامل

Weighted Automata in Text and Speech Processing

Finite-state automata are a very effective tool in natural language processing. However, in a variety of applications and especially in speech precessing, it is necessary to consider more general machines in which arcs are assigned weights or costs. We briefly describe some of the main theoretical and algorithmic aspects of these machines. In particular, we describe an efficient composition alg...

متن کامل

Speech Recognition with Weighted Finite-state Transducers

This chapter describes a general representation and algorithmic framework for speech recognition based on weighted finite-state transducers. These transducers provide a common and natural representation for major components of speech recognition systems, including hidden Markov models (HMMs), context-dependency models, pronunciation dictionaries, statistical grammars, and word or phone lattices...

متن کامل

Weighted finite-state transducers in speech recognition

We survey the weighted finite-state transducer (WFST) approach to speech recognition developed at AT&T over the last several years. We show that WFSTs provide a common and natural representation for HMM models, context-dependency, pronunciation dictionaries, grammars, and alternative recognition outputs. Furthermore, general finite-state operations combine these representations flexibly and eff...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Comments on Mohri, Pereira and Riley

نویسنده

چکیده

منابع مشابه

Weighted Automata in Text

Transducer composition for context-dependent network expansion

The Design Principles of a Weighted Finite-State Transducer Library

Weighted Automata in Text and Speech Processing

Speech Recognition with Weighted Finite-state Transducers

Weighted finite-state transducers in speech recognition

عنوان ژورنال:

اشتراک گذاری