Are Three Words All We Need? Recognizing Genre at the Sub-Sentential Level
نویسندگان
چکیده
Genre identification is a critical facet of text comprehension, but very little is known about the process and information constraints of classifying texts by genres. In this study, higherskill and lower-skill participants read 210 sentences from three genres. The words in the sentences were presented sequentially, one at a time. With each new word, participants decided whether the sentences came from a narrative, science, or history text. Both groups were able to correctly identify the genre by the third word of the sentence. Higher-skilled readers made their genre decisions more quickly and more accurately, and were also more precise in their selection of narrative texts. The study includes a computational model that uses text features from only the first three words of the sentences. The model reflects key features of the participants’ genre classifications.
منابع مشابه
Are Seven Words All We Need? Recognizing Genre at the Sub-Sentential Level
Genre recognition is a critical facet of text comprehension. In this study, we assess the minimum number of words in a sentence necessary for genre recognition to occur. Using corpora of Narrative, History, and Science sentences, we found that three experts in discourse psychology (demonstrating high agreement) accurately recognized the genre of over 80% of the sentences. This recognition gener...
متن کاملA Psychological and Computational Study of Sub-Sentential Genre Recognition
Abstract Genre recognition is a critical facet of text comprehension and text classification. In three experiments, we assessed the minimum number of words in a sentence needed for genre recognition to occur, the distribution of genres across text, and the relationship between reading ability and genre recognition. We also propose and demonstrate a computational model for genre recognition. Usi...
متن کاملFirst Language Activation during Second Language Lexical Processing in a Sentential Context
Lexicalization-patterns, the way words are mapped onto concepts, differ from one language to another. This study investigated the influence of first language (L1) lexicalization patterns on the processing of second language (L2) words in sentential contexts by both less proficient and more proficient Persian learners of English. The focus was on cases where two different senses of a polys...
متن کاملPromotion of Self in an Other-Oriented Academic Sub-Genre: The Case of Self-Mention in Acknowledgments
Although sometimes considered to act only as a means of recognizing debts, acknowledgments give the opportunity for writers to display a self-conscious and reflective representation of self. Following this assumption and to reveal some of the ways this is achieved, a corpus of 80 textbook acknowledgments in the field of Linguistics and Applied Linguistics were analyzed in order to show what “se...
متن کاملDiscourse for Machine Translation
Statistical Machine Translation is a modern success: Given a source language sentence, SMT finds the most probable target language sentence, based on (1) properties of the source; (2) probabilistic source--target mappings at the level of words, phrases and/or sub-structures; and (3) properties of the target language. SMT translates individual sentences because the search space even for a single...
متن کامل