gaussian mixed model gmm

An Investigation of Automatic Phonetic-Unit Selection for Forensic Voice Comparison

2012

Chee Cheun Huang Julien Epps Cuiling Zhang

A hybrid Hidden Markov Model (HMM) Gaussian Mixture Model (GMM) system was proposed to automatically select tokens of /iau/, /ai/, /ei/, /m/ and /n/ in a database of recordings of Standard-Chinese speech collected under studio-clean, mobile-landline degraded and mismatched recording conditions. The FVC systems constructed were all MFCC GMM-UBM systems, but based on different portions of the rec...

متن کامل

Approximate Inference for Generic Likelihoods via Density-Preserving GMM Simplification

2016

Lei Yu Tianyu Yang Antoni B. Chan

We consider recursive Bayesian filtering where the posterior is represented as a Gaussian mixture model (GMM), and the likelihood function as a sum of scaled Gaussians (SSG). In each iteration of filtering, the number of components increases. We propose an algorithm for simplifying a GMM into a reduced mixture model with fewer components, which is based on maximizing a variational lower bound o...

متن کامل

Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation

2006

Yamato Ohtani Tomoki Toda Hiroshi Saruwatari Kiyohiro Shikano

The performance of voice conversion has been considerably improved through statistical modeling of spectral sequences. However, the converted speech still contains traces of artificial sounds. To alleviate this, it is necessary to statistically model a source sequence as well as a spectral sequence. In this paper, we introduce STRAIGHT mixed excitation to a framework of the voice conversion bas...

متن کامل

Learning with Information Entropy Method for Transportation Image Retrieval

2015

Liu Xiao-jun Li Qing-ling Li Yong-jian Li Jun-yi

As a new learning framework, Multi-Instance learning is labeled recently and has successfully found application in vision classification. A novel Multi-instance bag generating method is presented in this paper on basis of Gaussian Mixed Model. The generated GMM model composes not only color but also the locally stable unchangeable components. It is frequently named as MI bag by researchers. Bes...

متن کامل

Spectral recovery of outdoor illumination by an extension of the Bayesian inverse approach to the Gaussian mixture model.

Journal: :Journal of the Optical Society of America. A, Optics, image science, and vision 2012

Shahram Peyvandi Seyed Hossein Amirshahi Javier Hernández-Andrés Juan Luis Nieves Javier Romero

The Bayesian inference approach to the inverse problem of spectral signal recovery has been extended to mixtures of Gaussian probability distributions of a training dataset in order to increase the efficiency of estimating the spectral signal from the response of a transformation system. Bayesian (BIC) and Akaike (AIC) information criteria were assessed in order to provide the Gaussian mixture ...

متن کامل

On the Use of Gaussian Mixture Model Framework to Improve Speaker Adaptation of Deep Neural Network Acoustic Models

2016

Natalia A. Tomashenko Yuri Y. Khokhlov Yannick Estève

In this paper we investigate the Gaussian Mixture Model (GMM) framework for adaptation of context-dependent deep neural network HMM (CD-DNN-HMM) acoustic models. In the previous work an initial attempt was introduced for efficient transfer of adaptation algorithms from the GMM framework to DNN models. In this work we present an extension, further detailed exploration and analysis of the method ...

متن کامل

Estimating Mutual Information Using Gaussian Mixture Model for Feature Ranking and Selection [IJCNN2046]

2006

Tian Lan Deniz Erdogmus Umut Ozertem Yonghong Huang

Feature selection is a critical step for pattern recognition and many other applications. Typically, feature selection strategies can be categorized into wrapper and filter approaches. Filter approach has attracted much attention because of its flexibility and computational efficiency. Previously, we have developed an ICA-MI framework for feature selection, in which the Mutual Information (MI) ...

متن کامل

Four-layer categorization scheme of fast GMM computation techniques in large vocabulary continuous speech recognition systems

2004

Arthur Chan Mosur Ravishankar Alexander I. Rudnicky Jahanzeb Sherwani

Large vocabulary continuous speech recognition systems are known to be computationally intensive. A major bottleneck is the Gaussian mixture model (GMM) computation and various techniques have been proposed to address this problem. We present a systematic study of fast GMM computation techniques. As there are a large number of these and it is impractical to exhaustively evaluate all of them, we...

متن کامل

Formant frequency prediction from MFCC vectors in noisy environments

2005

Jonathan Darch Ben P. Milner Saeed Vaseghi

This paper proposes a method of predicting the formant frequencies of a frame of speech from its mel-frequency cepstral coefficient (MFCC) representation. Prediction is achieved through the creation of a Gaussian mixture model (GMM) which models the joint density of formant frequencies and MFCCs. Using this GMM and an input MFCC vector, a maximum a posteriori (MAP) prediction of the formant fre...

متن کامل

Acoustic-to-articulatory inversion mapping with Gaussian mixture model

2004

Tomoki Toda Alan W. Black Keiichi Tokuda

This paper describes the acoustic-to-articulatory inversion mapping using a Gaussian Mixture Model (GMM). Correspondence of an acoustic parameter and an articulatory parameter is modeled by the GMM trained using the parallel acousticarticulatory data. We measure the performance of the GMMbased mapping and investigate the effectiveness of using multiple acoustic frames as an input feature and us...

متن کامل