Automated Paraphrase Generation with Over-Generation and Pruning Services

نویسندگان

چکیده

Conversational services are emerging as a new paradigm for accessing information by simply uttering questions in natural language, posing whole set of challenges to the design and engineering systems. Training conversational deal with nuances language often requires collecting high-quality diverse training samples (i.e., paraphrases). Traditional approaches such hiring an expert or crowdsourcing involve data collection processes that costly time-consuming. Automated paraphrase generation is promising cost-effective scalable approach generating samples. Current automatic techniques, however, tend specialise specific types lexical syntactic variations. As result, generated paraphrases may not perform well relevant quality aspects diversity semantic relatedness. In this paper, we follow inspired integration address these issues generate English semantically diverse. We propose extensible reusable pipeline combines paraphrasing techniques two-step process first focus on i) leveraging strengths multiple most (and possibly noisy) paraphrases, then ii) common separate step. Through empirical evaluations show benefits combining more balancing relevance diversity.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Paraphrase and Textual Entailment Generation

One particular information can be conveyed by many different sentences. This variety concerns the choice of vocabulary and style as well as the level of detail (from laconism or succinctness to total verbosity). Although verbosity in written texts is considered bad style, generated verbosity can help natural language processing (NLP) systems to fill in the implicit knowledge. The paper presents...

متن کامل

Paraphrase Generation with Deep Reinforcement Learning

Automatic generation of paraphrases for a given sentence is an important yet challenging task in natural language processing (NLP), and plays a key role in a number of applications such as question answering, information retrieval and dialogue. In this paper we present a deep reinforcement learning approach to paraphrase generation. Specifically, we propose a new model for the task, which consi...

متن کامل

Neural Clinical Paraphrase Generation with Attention

Paraphrase generation is important in various applications such as search, summarization, and question answering due to its ability to generate textual alternatives while keeping the overall meaning intact. Clinical paraphrase generation is especially vital in building patient-centric clinical decision support (CDS) applications where users are able to understand complex clinical jargons via ea...

متن کامل

Application-driven Statistical Paraphrase Generation

Paraphrase generation (PG) is important in plenty of NLP applications. However, the research of PG is far from enough. In this paper, we propose a novel method for statistical paraphrase generation (SPG), which can (1) achieve various applications based on a uniform statistical model, and (2) naturally combine multiple resources to enhance the PG performance. In our experiments, we use the prop...

متن کامل

Paraphrase and Textual Entailment Recognition and Generation

Paraphrasing methods recognize, generate, or extract phrases, sentences, or longer natural language expressions that convey almost the same information. Textual entailment methods, on the other hand, recognize, generate, or extract pairs of natural language expressions, such that a human who reads (and trusts) the first element of a pair would most likely infer that the other element is also tr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-91431-8_25