Real or Fake Text?: Investigating Human Ability to Detect Boundaries between Human-Written and Machine-Generated Text

نویسندگان

چکیده

As text generated by large language models proliferates, it becomes vital to understand how humans engage with such text, and whether or not they are able detect when the reading did originate a human writer. Prior work on detection of focuses case where an entire passage is either human-written machine-generated. In this paper, we study more realistic setting begins as transitions being state-of-the-art neural models. We show that, while annotators often struggle at task, there substantial variance in annotator skill that given proper incentives, can improve task over time. Furthermore, conduct detailed comparison analyze variety variables (model size, decoding strategy, fine-tuning, prompt genre, etc.) affect performance. Finally, collect error annotations from our participants use them certain textual genres influence make different types errors sentence-level features correlate highly selection. release RoFT dataset: collection 21,000 paired classifications encourage future evaluation text.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text Beautifier: An Affective-Text Tool to Tailor Written Text

We have spelling and grammar checking tools available on today’s word processors. But what they are missing is a tool that can recommend several possibilities of a given written sentence to assist a user to write better sentences. Therefore, we aim to develop a linguistic tool to beautify text by applying our developed lexical resources regarding textual affect sensing. The developed tool will ...

متن کامل

Predicting the Fluency of Text with Shallow Structural Features: Case Studies of Machine Translation and Human-Written Text

Sentence fluency is an important component of overall text readability but few studies in natural language processing have sought to understand the factors that define it. We report the results of an initial study into the predictive power of surface syntactic statistics for the task; we use fluency assessments done for the purpose of evaluating machine translation. We find that these features ...

متن کامل

Predicting the Fluency of Text with Shallow Structural Features: Case Studies of Machine Tanslation and Human-Written Text

Sentence fluency is an important component of overall text readability but few studies in natural language processing have sought to understand the factors that define it. We report the results of an initial study into the predictive power of surface syntactic statistics for the task; we use fluency assessments done for the purpose of evaluating machine translation. We find that these features ...

متن کامل

Text Comparison Using Machine-Generated Nuggets

This paper describes a novel text comparison environment that facilities text comparison administered through assessing and aggregating information nuggets automatically created and extracted from the texts in question. Our goal in designing such a tool is to enable and improve automatic nugget creation and present its application for evaluations of various natural language processing tasks. Du...

متن کامل

Machine-printed and hand-written text lines identification

There are many types of documents where machine-printed and handwritten texts intermixedly appear. Since the optical character recognition (OCR) methodologies for machine-printed and handwritten texts are di€erent, to achieve optimal performance it is necessary to separate these two types of texts before feeding them to their respective OCR systems. In this paper, we present a machine-printed a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i11.26501