Distance measure between Gaussian distributions for discriminating speaking styles

نویسندگان

  • Goshu Nagino
  • Makoto Shozakai
چکیده

Discriminating speaking styles is an important issue in speech recognition, speaker recognition and speaker segmentation. This paper compares distance measures between Gaussian distributions for discriminating speaking styles. The Mahalanobis distance, the Bhattacharyya distance and the Kullback-Leibler divergence, which are in common use for a definition as a distance measure between Gaussian distributions, are evaluated in terms of an accuracy to discriminate speaking styles. In this paper, the accuracy is judged on a visualized map, where speaking style speech corpora are mapped onto twodimensional space by utilizing a multidimensional scaling method. It is shown that speaking style clusters appear clearly grouped on the visualized map obtained by the Bhattacharyya distance and the Kullback-Leibler divergence. In addition, the visualized map corresponds to speech recognition performance, and the Kullback-Leibler shows higher sensitivity to recognition performance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Color Texture Classification Using Rao Distance between Multivariate Copula Based Models

This paper presents a new similarity measure based on Rao distance for color texture classification or retrieval. Textures are characterized by a joint model of complex wavelet coefficients. This model is based on a Gaussian Copula in order to consider the dependency between color components. Then, a closed form of Rao distance is computed to measure the difference between two Gaussian Copula b...

متن کامل

Reconstruction vs. Interaction-based Output Practice: (in relation to EFL learner’s speaking skill and learning styles)

  The belief that output practice is crucial in L2 learning affects foreign language teaching methodology. And researchers have endeavored to find the best ways to encourage learners to produce and practice whatever they hear as an input in the process of learning. Moreover, learning styles and the importance of matching learners’ styles with those of teachers inspired the researchers to inves...

متن کامل

The Relationship between Learning Style and Iranian Intermediate EFL Learners' Speaking Performance

There are a number of factors which influence the success of learning foreign languageincluding students’ learning styles. This study investigated language learning styles of IranianEFL learners and their class achievement. To this end, sixty female intermediate learners ofinstruction and different ages (15-25), studying at a language institute in Rasht city wereasked to take part in the study....

متن کامل

Modeling of various speaking styles and emotions for HMM-based speech synthesis

This paper presents an approach to realizing various emotional expressions and speaking styles in synthetic speech using HMM-based speech synthesis. We show two methods for modeling speaking styles and emotions. In the first method, called “style dependent modeling,” each speaking style and emotion is individually modeled. On the other hand, in the second method, called “style mixed modeling,” ...

متن کامل

Convergence of latent mixing measures in nonparametric and mixture models

We consider Wasserstein distance functionals for assessing the convergence of latent discrete measures, which serve as mixing distributions in hierarchical and nonparametric mixture models. We clarify the relationships between Wasserstein distances of mixing distributions and f -divergence functionals such as Hellinger and Kullback-Leibler distances on the space of mixture distributions using v...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006