A Paraphrase-Based Exploration of Cohesiveness Criteria

نویسندگان

  • Kentaro Inui
  • Masaru Nogami
چکیده

This paper proposes an empirical approach to the development of a computational model for assessing texts according to cohesiveness. We argue that the NLG technologies for the generation of structural paraphrases can be used to efficiently create what we call a cohesion-variant parallel corpus, which would serve as a good resource for empirical acquisition of cohesiveness criteria. We also present our pilot case study, in which we took a particular type of paraphrasing that separates a relative clause from a sentence. We have so far created a cohesion-variant parallel corpus containing 499 cohesive instances and 841 incohesive instances. Based on this corpus, we conducted a preliminary experiment on cohesion evaluation, obtaining encouraging results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Strategic Technology Planning in Science-Based Subsectors of Petroleum Industry: The Case Study of R&D Roadmapping for Geochemical Exploration Technologies

Strategic planning of technology in Iran's oil industry has a long history, however, the knowledge-based sectors of the oil industry, despite their different characteristics, have been less exposed to such experiences, and hence the study of the experience in one of the key sub-sectors of this industry, namely the exploration geochemical sector, can be innovative. This article seeks to answer t...

متن کامل

PEM: A Paraphrase Evaluation Metric Exploiting Parallel Texts

We present PEM, the first fully automatic metric to evaluate the quality of paraphrases, and consequently, that of paraphrase generation systems. Our metric is based on three criteria: adequacy, fluency, and lexical dissimilarity. The key component in our metric is a robust and shallow semantic similarity measure based on pivot language N-grams that allows us to approximate adequacy independent...

متن کامل

Hardness, Cohesiveness, and Adhesiveness of Oral Moisturizers and Denture Adhesives: Selection Criteria for Denture Wearers

The mechanical properties of seven denture adhesives and eight oral moisturizers, all of which are commercially available, were evaluated using a texture profile analysis. A new assessment chart is proposed for the selection criteria of denture adhesive and oral moisturizers using a radar chart with three axes: hardness, cohesiveness, and adhesiveness.

متن کامل

A contrastive review of paraphrase acquisition techniques

This paper addresses the issue of what approach should be used for building a corpus of sentential paraphrases depending on one’s requirements. Six strategies are studied: (1) multiple translations into a single language from another language; (2) multiple translations into a single language from different other languages; (3) multiple descriptions of short videos; (4) multiple subtitles for th...

متن کامل

Sentential Paraphrase Generation for Agglutinative Languages Using SVM with a String Kernel

Paraphrase generation is widely used for various natural language processing (NLP) applications such as question answering, multi-document summarization, and machine translation. In this study, we identify the problems occurring in the process of applying existing probabilistic model-based methods to agglutinative languages, and provide solutions by reflecting the inherent characteristics of ag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001