The bag model in language statistics

نویسندگان

  • Francisco Criado
  • Tamaz Gachechiladze
  • Hamlet Meladze
  • Guram Tsertsvadze
چکیده

In this paper, fuzzy quantitative models of language statistics are constructed. All suggested models are based on the assumption about a superposition of two kinds of uncertainties: probabilistic and possibilistic. The realization of this superposition in statistical distributions is achieved by the probability measure splitting procedure. In this way, the fuzzy versions of generalized binomial, Fucks and Zipf–Mandelbrot s distributions are constructed describing the probabilistic and possibilistic organization of language at any level: morphological, syntactic or phonological. The main problem when constructing the quantitative model of some fuzzy linear structure is finding the corresponding linguistic spectrum, which is reduced to the solution of algebraic or transcendental equation systems by inverse spline-interpolation. In the final section, the general linear mathematical model of language structures is then described briefly, as well as bag statistics for consonantal structures of languages. 2002 Elsevier Science Inc. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Hidden Markov Topic Model: A Probabilistic Model of Semantic Representation

In this paper, we describe a model that learns semantic representations from the distributional statistics of language. This model, however, goes beyond the common bag-of-words paradigm, and infers semantic representations by taking into account the inherent sequential nature of linguistic data. The model we describe, which we refer to as a Hidden Markov Topics model, is a natural extension of ...

متن کامل

Learning Semantic Representations with Hidden Markov Topics Models

In this paper, we describe a model that learns semantic representations from the distributional statistics of language. This model, however, goes beyond the common bag-of-words paradigm, and infers semantic representations by taking into account the inherent sequential nature of linguistic data. The model we describe, and which we refer to as a Hidden Markov Topics model is a natural extension ...

متن کامل

A Corrective Training Algorithm for Adaptive Learning in Bag Generation

expressions. However, this approach is still very intuitive and simple because it only adjusts the word frequencies in run-time, and does not revise the statistical information in the long-term memory. Multiple language model is based on several corpora of different fields. Basically, a small amount of similar text are imported and interpolated with the original texts, when the context domain i...

متن کامل

An Effective Approach to Verbose Queries Using a Limited Dependencies Language Model

Intuitively, any ‘bag of words’ approach in IR should benefit from taking term dependencies into account. Unfortunately, for years the results of exploiting such dependencies have been mixed or inconclusive. To improve the situation, this paper shows how the natural language properties of the target documents can be used to transform and enrich the term dependencies to more useful statistics. T...

متن کامل

Assessing the Effectiveness of Holistic Multidimensional Treatment Model (Hojjati Model) on Receptive and Expressive Language Skills in Autistic Children

Background: Autism as part of the category called Autism spectrum disorder (ASD), is caused by disorders in brain and nervous network and characterized by defect in social behavior, language and cognition. This study aimed to investigate receptive and expressive language performance and the severity of the disorder in 30 children with autism aged 2-8 years who speak in Persian language. Materi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 147  شماره 

صفحات  -

تاریخ انتشار 2002