Annotating descriptively incomplete language phenomena
نویسندگان
چکیده
When annotating non-standard languages, descriptively incomplete language phenomena (EAGLES, 1996) are often encountered. In this paper, we present examples of ambiguous forms taken from a historical corpus and offer a classification of such descriptively incomplete language phenomena and its rationale. We then discuss various approaches to the annotation of these phenomena, arguing that multiple annotations provide the most appropriate encoding strategy for the annotator. Finally, we show how multiple annotations can be encoded in existing standards such as PAULA and GrAF.
منابع مشابه
Challenges For Annotating Images For Sense Disambiguation
We describe an unusual data set of thousands of annotated images with interesting sense phenomena. Natural language image sense annotation involves increased semantic complexities compared to disambiguating word senses when annotating text. These issues are discussed and illustrated, including the distinction between word senses and iconographic senses.
متن کاملSemantic Support for Security-Annotated Business Process Models
Service-Oriented Architectures (SOA) benefit from business processes (BP), which orchestrate web services (WS) and human actors in cross organizational environments. In this setting, handling the security and privacy issues while exchanging and processing personal data is essential. This lacks for secure business processes management. To achieve this, we represent security constraints descripti...
متن کاملAn Annotation Scheme for Quantifier Scope Disambiguation
Annotating natural language sentences with quantifier scoping has proved to be very hard. In order to overcome the challenge, previous work on building scope-annotated corpora has focused on sentences with two explicitly quantified noun phrases (NPs). Furthermore, it does not address the annotation of scopal operators or complex NPs such as plurals and definites. We present the first annotation...
متن کاملHow Dependency Trees and Tectogrammatics Help Annotating Coreference and Bridging Relations in Prague Dependency Treebank
In this paper, we explore the benefits of dependency trees and tectogrammatical structure used in the Prague Dependency Treebank for annotating language phenomena that cross the sentence boundary, namely coreference and bridging relations. We present the benefits of dependency trees such as the detailed processing of ellipses, syntactic decisions for coordination and apposition structures that ...
متن کاملEQuIKa System: Supporting OWL applications with local closed world assumption
One of the major advantages of semantically annotating resources on Web is the facilitation of web services discovery. Languages based on OWL are prune to several problems for web services discovery due to the open-world assumption when handling incomplete information. Thus standard OWL reasoner are usually not suitable for the discovery purposes. The aforementioned problems can easily be fixed...
متن کامل