DanProof: Pedagogical Spell and Grammar Checking for Danish
نویسنده
چکیده
This paper presents a Constraint Grammarbased pedagogical proofing tool for Danish. The system recognizes not only spelling errors, but also grammatical errors in otherwise correctly spelled words, and categorizes errors for WORD-integrated pedagogical comments. Possible spelling corrections are prioritized from context, and grammatical corrections generated by a morphological module. The system uses both phonetic similarity measures and traditional Levenshtein-distances, and has a special focus on compounding/splitting errors common in modern Danish. As a classical spell-checker DanProof achieves F-Scores over 95, and F=88 if compounding correction is included. With the maximal set of error types, 2/3 of all errors are found in school essays, and precision is 91.7%.
منابع مشابه
A Constraint Grammar Based Spellchecker for Danish with a Special Focus on Dyslexics
This Paper presents a new, Constraint Grammar based spell and grammar checker for Danish (OrdRet), with a special focus on dyslectic users. The system uses a multi-stage approach, employing both data-driven error lists, phonetic similarity measures and traditional letter matching at the word and chunk level, and CG rules at the contextual level. An ordinary CG parser (DanGram) is used to choose...
متن کاملAn extended spell checker for unknown words
Spell checking is considered a solved problem, but with the rapid development of the natural language processing the new results are slowly extending the means of spell checking towards grammar checking. In this article I review some of the spell checking error classes in a broader sense, the related problems, their state-of-the-art solutions and their different nature on different types of lan...
متن کاملThe Design of a Proofreading Software Service
Web applications have the opportunity to check spelling, style, and grammar using a software service architecture. A software service authoring aid can offer contextual spell checking, detect real word errors, and avoid poor grammar checker suggestions through the use of large language models. Here we present After the Deadline, an open source authoring aid, used in production on WordPress.com,...
متن کاملSyntactic Analysis and Error Correction in the SCARRIE Project
This paper reports on work carried out at CST in Copenhagen to develop the Danish version of the SCARRIE prototype, addressing in particular the issue of how a form of shallow parsing is combined with error detection and correction. The syntactic grammar for Danish has been developed with the aim of dealing with the most frequent context-dependent errors found in a parallel corpus of unedited a...
متن کاملProof Checking for Mathematical English
I am requesting funding for a summer research student. The topic is a software system to automatically verify the correctness of mathematical proofs, and more specifically, to verify proofs written in the stylized English of mathematical textbooks, papers, and monographs. Essentially, the goal is to go beyond spell-checking and grammar-checking to logic-checking. Since this would be the start o...
متن کامل