Approximations to the MMI criterion and their effect on lattice-based MMI

نویسنده

  • Steven Wegmann
چکیده

Although maximum mutual information (MMI) training has been used for hidden Markov model (HMM) parameter estimation for more than twenty years ([2], [8], [5], [9], and [14]), it has recently become an essential part of the acoustic modeling repertoire thanks to the refinements introduced by Woodland and Povey ([16] and [11]). The earliest incarnations of MMI worked well on small vocabulary tasks with small models, for example digit recognition. However, one can expect to gain 10-20% in recognition accuracy over standard maximum likelihood methods regardless of the size of the task or the models when using the current methodology, lattice-based MMI. The machinery of lattice-based MMI consists of a model selection criterion called the MMI criterion and an iterative estimation algorithm called the extended Baum-Welch algorithm. This machinery is analogous to – it is in fact based on – the standard machinery used for maximum likelihood estimation with HMMs, where the model selection criterion is the log-likelihood of the training data and the iterative estimation algorithm is the BaumWelch algorithm ([3]). In both cases the estimation algorithm operates on the space of all possible model parameters by producing a new estimate of model parameters from an original estimate. Also, both of these estimation algorithms have been designed so that the model selection criterion is larger on the new estimate than it was on the original estimate. Finally, in both cases the machinery is operated in the same manner: starting from a choice of initial model parameters, we repeatedly apply the estimation algorithm, first to the initial choice, next to the result of this, etc., thereby creating a

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Effect of a Dietary Innovative Multi-Material on Sex Hormones and Molting Period of Canaries and Laying-Hens

Two experiments were conducted to determine the effect of offering a multi-material innovative (MMI) feed including: Vitex agnus-castus, Thymus vulgaris, Lavandula angustifolia, Marigold (Calendula officinalis) on curtails molting and sex hormone concentrations in canaries and laying hens. In the first study, a total of 120 female molted canaries were allotted in to 12 cages of 10 birds with 4 ...

متن کامل

Margin-space integration of MPE loss via differencing of MMI functionals for generalized error-weighted discriminative training

Using the central observation that margin-based weighted classification error (modeled using Minimum Phone Error (MPE)) corresponds to the derivative with respect to the margin term of margin-based hinge loss (modeled using Maximum Mutual Information (MMI)), this article subsumes and extends margin-based MPE and MMI within a broader framework in which the objective function is an integral of MP...

متن کامل

A multiple-mini interview (MMI) for emergency medicine residency admissions: A brief report and qualitative analysis

Introduction: A multiple-mini interview (MMI) is a type ofstructured interview, which may assess many non-cognitivedomains in residency applicants. There are few studies on MMIduring the emergency medicine (EM) residency admissionsprocess in the United States. We sought to determine the strengths,weaknesses, and acceptability of a pilot MMI for EM residencyadmissions.Methods: We piloted a five-...

متن کامل

Antithyroid Drugs

The thionamide drugs, i.e. carbimazole and its metabolite methimazole (MMI), andpropylthiouracil (PTU) have extensively been used in the management of various forms ofhyperthyroidism over the past eight decades. This review aims to summarize different aspectsof these outstanding medications. Thionamides have shown their own acceptable efficacy andeven safety profiles in ...

متن کامل

Semi-supervised maximum mutual information training of deep neural network acoustic models

Maximum Mutual Information (MMI) is a popular discriminative criterion that has been used in supervised training of acoustic models for automatic speech recognition. However, standard discriminative training is very sensitive to the accuracy of the transcription and hence its implementation in a semisupervised setting requires extensive filtering of data. We will show that if the supervision tr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1002.0773  شماره 

صفحات  -

تاریخ انتشار 2010