Memory-Based Shallow Parsing of Spoken Dutch
Similar resources
A Memory-Based Shallow Parser for Spoken Dutch
We describe the development of a Dutch memory-based shallow parser. The availability of large treebanks for Dutch, such as the one provided by the Spoken Dutch Corpus, allows memory-based learners to be trained on examples of shallow parsing taken from the treebank, and act as a shallow parser after training. An overview is given of a modular memory-based learning approach to shallow parsing, c...
Categorial grammars used to partial parsing of spoken language
Spoken language understanding is a challenge for the development of Spoken Dialogue Systems. Recognition errors and speech repairs make it impossible to get complete syntactic analysis. Shallow parsing and chunking seem to be efficient in order to start both a robust and precise analysis. This paper describes experiments made with Logus, a spoken understanding system based on incremental methol...
Syntactic Analysis in the Spoken Dutch Corpus (CGN)
The paper describes the syntactic annotation of the Spoken Dutch Corpus (“Corpus Gesproken Nederlands” or CGN), the Dutch-Flemish project (1998-2003) aiming at the collection, description and annotation of ten million words of spoken Dutch. In the first part, the background of the parsing strategy is discussed, as well as some details concerning the actual implementation of the parsing process....
Semantic and Syntactic Features for Dutch Coreference Resolution
We investigate the effect of encoding additional semantic and syntactic information sources in a classification-based machine learning approach to the task of coreference resolution for Dutch. We experiment both with a memory-based learning approach and a maximum entropy modeling method. As an alternative to using external lexical resources, such as the low-coverage Dutch EuroWordNet, we evaluat...
Parsing Domain Actions with Phrase-level Grammars and Memory-based Learners
In this paper, we describe an approach to analysis for spoken language translation that combines phrase-level grammar-based parsing and automatic domain action classification. The job of the analyzer is to transform utterances into a shallow semantic task-oriented interlingua representation. The goal of our hybrid approach is to provide accurate real-time analyses and to improve robustness and ...