Dynamical Sources in Information Theory : A General Analysis of Trie StructuresJulien Clément

نویسندگان

  • Julien Clément
  • Philippe Flajolet
  • Brigitte Vallée
چکیده

Digital trees, also known as tries, are a general purpose exible data structure that implements dictionaries built on sets of words. An analysis is given of three major representations of tries in the form of array-tries, list tries, and bst-tries ((ternary search triess). The size and the search costs of the corresponding representations are analysed precisely in the average case, while a complete distributional analysis of height of tries is given. The unifying data model used is that of dynamical sources and it encompasses classical models like those of memoryless sources with independent symbols, of nite Markov chains, and of nonuniform densities. The probabilistic behaviour of the main parameters, namely size, path length, or height, appears to be determined by two intrinsic characteristics of the source: the entropy and the probability of letter coincidence. These characteristics are themselves related in a natural way to spectral properties of speciic transfer operators of the Ruelle type. Sources dynamiques en thhorie de l'information: une analyse ggnnrale des arbres digitaux RRsumm : Les arbres digitaux, galement connus sous le nom de triess sont une structure de donne ggnnrique et exible qui permet d'implanter des dictionnaires construits sur des ensembles de mots. Nous donnons une analyse de troies reprrsentations principales de ces arbres, les arbres-tableaux, les arbres-listes, et les arbres ternaires de recherche. La taille et les coots de recherche de ces reprrsentations sont analysss prrcissment en moyenne, tandis qu'une analyse en distribution de la hauteur est obtenue. Le moddle uniicateur d'analyse est celui des sources dynamiquess, lesquelles recouvrent les moddles classiques comme les sources sans mmmoire ((symboles inddpendants), les chaines de Markov nies, et les densitts initiales non uniformes. Les propriitts probabilistes des principaux parammtres de taille, longueur de cheminement et hauteur apparaissent liies deux caracttristiques fondamentales de la source: l'entropie et la probabilitt de coincidence. Ces caracttristiques se trouvent elle-mmmems reliies aux propriitts spectrales d'oprateurs de transfert du type introduit par Ruelle. Abstract. Digital trees, also known as tries, are a general purpose exible data structure that implements dictionaries built on sets of words. An analysis is given of three major representations of tries in the form of array-tries, list tries, and bst-tries ((ternary search triess). The size and the search costs of the corresponding representations are analysed precisely in the average case, while a complete distributional analysis of height of tries is given. The unifying data model used is that of dynamical …

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dynamical Sources in Information Theory: a General Analysis of Trie Structures Dynamical Sources in Information Theory: a General Analysis of Trie Structures Dynamical Sources in Information Theory: a General Analysis of Trie Structures

Digital trees, also known as tries, are a general purpose exible data structure that implements dictionaries built on sets of words. An analysis is given of three major representations of tries in the form of array-tries, list tries, and bst-tries ((ternary search triess). The size and the search costs of the corresponding representations are analysed precisely in the average case, while a comp...

متن کامل

Smoothed Analysis of Trie Height

Tries are very simple general purpose data structures for information retrieval. A crucial parameter of a trie is its height. In the worst case the height is unbounded when the trie is built over a set of n strings. Analytical investigations have shown that the average height under many random sources is logarithmic in n. Experimental studies of trie height suggest that this holds for non-rando...

متن کامل

Designing the Model of Information Anorexia Among Medical Students in the Hamedan University of Medical Sciences Using Grounded Theory

Objective People with information anorexia severely limit the acquisition and use of information and lose the opportunity to receive new information, and often rely on a few limited sources of information. This study aims to investigate the information anorexia of medical students in Hamedan University of Medical Sciences (HUMS). Methods The is qualitative study based on grounded theory conduc...

متن کامل

An Analysis of A Fishing Model with Nonlinear Harvesting Function

In this study, considering the importance of how to exploit renewable natural resources, we analyzea shing model with nonlinear harvesting function in which the players at the equilibrium pointdo a static game with complete information that, according to the calculations, will cause a wasteof energy for both players and so the selection of cooperative strategies along with the...

متن کامل

Dynamical Behavior of a Rigid Body with One Fixed Point (Gyroscope). Basic Concepts and Results. Open Problems: a Review

The study of the dynamic behavior of a rigid body with one fixed point (gyroscope) has a long history. A number of famous mathematicians and mechanical engineers have devoted enormous time and effort to clarify the role of dynamic effects on its movement (behavior) – stable, periodic, quasi-periodic or chaotic. The main objectives of this review are: 1) to outline the characteristic features of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999