Discourse Annotation And Semantic Annotation In The GNOME Corpus
نویسنده
چکیده
The GNOME corpus was created to study the discourse and semantic properties of discourse entities that affect their realization and interpretation, and particularly salience. We discuss what information was annotated and the methods we followed.
منابع مشابه
Semantic-Based Image Retrial in the VQ Compressed Domain using Image Annotation Statistical Models
متن کامل
Exploiting Semantic Information For Manual Anaphoric Annotation In Cast3LB Corpus
This paper presents the discourse annotation followed in Cast3LB, a Spanish corpus annotated with several information sources (morphological, syntactic, semantic and coreferential) at syntactic, semantic and discourse level. 3LB annotation scheme has been developed for three languages (Spanish, Catalan and Basque). Human annotators have used a set of tagging techniques and protocols. Several to...
متن کاملSemantic Annotation for Generation: Issues in annotating a corpus to develop and evaluate discourse entity realization algorithms
We are annotating a corpus with information relevant to discourse entity realization, and especially the information needed to decide which type of NP to use. The corpus is being used to study correlations between NP type and certain semantic or discourse features, to evaluate hand-coded algorithms, and to train statistical models. We report on the development of our annotation scheme, the prob...
متن کاملSemantic Relations in Discourse: The Current State of ISO 24617-8
This paper describes some of the research conducted with the aim to develop a proposal for an ISO standard for the annotation of semantic relations in discourse. A range of theoretical approaches and annotation efforts were analysed for their commonalities and their differences, in order to define a clear delineation of the scope of the ISO effort and to give it a solid theoretical and empirica...
متن کاملAn Annotation Tool for Multimodal Dialogue Corpora using Global Document Annotation
This paper reports a tool which assists the user in annotating a video corpus and enables the user to search for a semantic or pragmatic structure in a GDA tagged corpus. An XQL format is allowed for search patterns as well as a plain phrase. This tool is capable of generating a GDA timestamped corpus from a video file manually. It will be publicly available for academic purposes.
متن کامل