Is language evolution grinding to a halt?: Exploring the life and death of words in English fiction

نویسندگان

  • Eitan Adam Pechenick
  • Christopher M. Danforth
  • Peter Sheridan Dodds
چکیده

The Google Books corpus, derived from millions of books in a range of major languages, would seem to offer many possibilities for research into cultural, social, and linguistic evolution. In a previous work, we found that the 2009 and 2012 versions of the unfiltered English data set as well as the 2009 version of the English Fiction data set are all heavily saturated with scientific and medical literature, rendering them unsuitable for rigorous analysis [Pechenick, Danforth and Dodds, PLoS ONE, 10, e0137041, 2015]. By contrast, the 2012 version of English Fiction appeared to be uncompromised, and we use this data set to explore language dynamics for English from 1820– 2000. We critique an earlier method for measuring birth and death rates of words, and provide a robust, principled approach to examining the volume of word flux across various relative frequency usage thresholds. We use the contributions to the Jensen-Shannon divergence of words crossing thresholds between consecutive decades to illuminate the major driving factors behind the flux. We find that while individual word usage may vary greatly, the overall statistical structure of the language appears to remain fairly stable. We also find indications that scholarly works about fiction are strongly represented in the 2012 English Fiction corpus, and suggest that a future revision of the corpus should attempt to separate critical works from fiction itself.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Is language evolution grinding to a halt? The scaling of lexical turbulence in English fiction suggests it is not

Of basic interest is the quantification of the long term growth of a language’s lexicon as it develops to more completely cover both a culture’s communication requirements and knowledge space. Here, we explore the usage dynamics of words in the English language as reflected by the Google Books 2012 English Fiction corpus. We critique an earlier method that found decreasing birth and increasing ...

متن کامل

Immanent Indeterminacy: Tracing Postmodernity in John Banville’s Neo-Realist Novel The Sea

This study aimed at exploring the ontological indeterminacies of The Sea (2005), a novel by John Banville using the postmodern catena put forth by Ihab Hassan. Hassan’s catalogue of the features of postmodern fiction includes indeterminacy, fragmentation, decanonization, selflessness, depthlessness, the unpresentable/unrepresentable, irony, hybridization, carnivalization, performance, participa...

متن کامل

Incorporating E-learning in teaching English language to medical students: exploring its potential contributions

Background: The spread of technology has influenced different aspects of human life, and teaching and learning are not exceptions. This study aimed to examine the potential contribution of the use of technology in teaching English language to medical students.   Methods: This qualitative-action research study was conducted in Birjand University of Medical Sciences (BUMS), with 60 medica...

متن کامل

The Reality of Arabic Fiction Translation into English: A Sociological Approach

English translations of texts associated with Arabic fiction remain largely unexplored from a sociological perspective. Drawing on Pierre Bourdieu’s sociology, this paper aims to examine the genesis of Arabic fiction translation into English as a socially situated activity. Works of Arabic fiction emerged in English translation in the early twentieth century. Since then, this intellectual field...

متن کامل

Exploring the Relationship between Life Quality and Speaking Ability of Iranian Intermediate EFL Learners

Despite its direct relevance to second/foreign language learning, quality of life has been a neglected area within Second Language Acquisition (SLA) research. The present study sought to investigate the relationship between quality of life factors and speaking skill as one of the most challenging parts of L2 learning. To this end, an adapted version of life quality questionnaire originally devi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1503.03512  شماره 

صفحات  -

تاریخ انتشار 2015