A Brief History of Generative Models for Power Law and Lognormal Distributions Draft Manuscript
نویسنده
چکیده
Power law distributions are an increasingly common model for computer science applications; for example, they have been used to describe file size distributions and inand out-degree distributions for the Web and Internet graphs. Recently, the similar lognormal distribution has also been suggested as an appropriate alternative model for file size distributions. In this paper, we briefly survey some of the history of these distributions, focusing on work in other fields. We find that several recently proposed models have antecedents in work from decades ago. We also find that lognormal and power law distributions connect quite naturally, and hence it is not surprising that lognormal distributions arise as a possible alternative to power law distributions.
منابع مشابه
A Brief History of Generative Models for Power Law and Lognormal Distributions
Recently, I became interested in a current debate over whether file size distributions are best modelled by a power law distribution or a lognormal distribution. In trying to learn enough about these distributions to settle the question, I found a rich and long history, spanning many fields. Indeed, several recently proposed models from the computer science community have antecedents in work fr...
متن کاملDynamic Models for File Sizes and Double Pareto Distributions Draft manuscript
In this paper, we introduce and analyze a new generative user model to explain the behavior of file size distributions. Our Recursive Forest File model combines ideas from recent work by Downey with ideas from recent work on random graph models for the Web. Unlike similar previous work, our Recursive Forest File model allows new files to be created and old files to be deleted over time, and our...
متن کاملLong-Tail Distributions and Unsupervised Learning of Morphology
In previous work on unsupervised learning of morphology, the long-tail pattern in the rank-frequency distribution of words, as well as of morphological units, is usually considered as following Zipf’s law (power-law). We argue that these long-tail distributions can also be considered as lognormal. Since we know the conjugate prior distribution for a lognormal likelihood, we propose to generate ...
متن کاملAre there too many uncited articles? Zero inflated variants of the discretised lognormal and hooked power law distributions
Although statistical models fit many citation data sets reasonably well with the best fitting models being the hooked power law and discretised lognormal distribution, the fits are rarely close. One possible reason is that there might be more uncited articles than would be predicted by any model if some articles are inherently uncitable. Using data from 23 different Scopus categories, this arti...
متن کاملA Comparison of the Fallout Mass-size Distributions Calculated by L06n0rmal and Power-law Models
Fallout mass-size distributions presently used at USNRDL are compared vith new distributions suggested by recent investigations. Available data is unable to define the distribution parameters well enough to distinguish between lognormal and power-law distribution models.,
متن کامل