Inference of variable-length linguistic and acoustic units by multigrams

نویسندگان

  • Sabine Deligne
  • Frédéric Bimbot
چکیده

The efficiency of pattern recognition algorithms is highly conditioned to a proper definition of the patterns assumed to structure the data. The multigram model provides a statistical tool to retrieve sequential variable-length regularities within streams of data. In this paper, we present a general formulation of the model, applicable to single or multiple parallel strings of data having either discrete or continuous values. The model is first assessed to derive an inventory of variable-length sequences of letters from text data, where all spaces between the words have been removed. It turns out that the sequences of letters inferred during this fully unsupervised procedure clearly relate to the morphological structure of the text. The model is then used to infer a set of variable-length acoustic units, directly from speech data. Speech files containing examples of acoustic units are provided along with this paper in order to illustrate their consistency from an auditory point of view. We also report experiments using these acoustically defined units for continuous speech recognition.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Inference of variable-length acoustic units for continuous speech recognition

In the eld of speech recognition, the patterns assumed to structure the speech material (phonemes, triphones, words...) are de ned a priori according to a linguistic criterion, whereas the recognition criterion is based on an acoustic similarity measure. From this may result a lack of consistency for the recognition units. In this paper, we explore the possibility of a more data-driven approach...

متن کامل

Variable-length acoustic units inference for text-to-speech synthesis

The best voices in text-to-speech synthesis are currently obtained via acoustic units concatenation-based systems. In such systems, the choice of units whose concatenations will produce an acoustic message is a crucial stage. Moreover, it can be observed that current TTS systems use acoustic units which most often correspond to variable-length phonetic descriptions. In this article, an original...

متن کامل

Language modeling by variable length sequences: theoretical formulation and evaluation of multigrams

The multigram model assumes that language can be described as the output of a memoryless source that emits variable-length sequences of words. The estimation of the model parameters can be formulated as a Maximum Likelihood estimation problem from incomplete data. We show that estimates of the model parameters can be computed through an iterative Expectation-Maximization algorithm and we descri...

متن کامل

A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information

Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating  them potentially can play an important role in transmitt...

متن کامل

Speech spectrum representation and coding using multigrams with distance

The multigrams allow us to split a string of symbols into a stream of variable length sequences. The direct application of this method to vector-quantized speech spectra fails, we develop an extension of the method called modiied multi-grams or multigrams with distance. The algorithm for mod-iied multigram dictionary training as well as experimental results are presented. We found a signiicant ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Speech Communication

دوره 23  شماره 

صفحات  -

تاریخ انتشار 1997