Approximations to the MMI criterion and their effect on lattice-based MMI
نویسنده
چکیده
Although maximum mutual information (MMI) training has been used for hidden Markov model (HMM) parameter estimation for more than twenty years ([2], [8], [5], [9], and [14]), it has recently become an essential part of the acoustic modeling repertoire thanks to the refinements introduced by Woodland and Povey ([16] and [11]). The earliest incarnations of MMI worked well on small vocabulary tasks with small models, for example digit recognition. However, one can expect to gain 10-20% in recognition accuracy over standard maximum likelihood methods regardless of the size of the task or the models when using the current methodology, lattice-based MMI. The machinery of lattice-based MMI consists of a model selection criterion called the MMI criterion and an iterative estimation algorithm called the extended Baum-Welch algorithm. This machinery is analogous to – it is in fact based on – the standard machinery used for maximum likelihood estimation with HMMs, where the model selection criterion is the log-likelihood of the training data and the iterative estimation algorithm is the BaumWelch algorithm ([3]). In both cases the estimation algorithm operates on the space of all possible model parameters by producing a new estimate of model parameters from an original estimate. Also, both of these estimation algorithms have been designed so that the model selection criterion is larger on the new estimate than it was on the original estimate. Finally, in both cases the machinery is operated in the same manner: starting from a choice of initial model parameters, we repeatedly apply the estimation algorithm, first to the initial choice, next to the result of this, etc., thereby creating a
منابع مشابه
The Effect of a Dietary Innovative Multi-Material on Sex Hormones and Molting Period of Canaries and Laying-Hens
Two experiments were conducted to determine the effect of offering a multi-material innovative (MMI) feed including: Vitex agnus-castus, Thymus vulgaris, Lavandula angustifolia, Marigold (Calendula officinalis) on curtails molting and sex hormone concentrations in canaries and laying hens. In the first study, a total of 120 female molted canaries were allotted in to 12 cages of 10 birds with 4 ...
متن کاملMargin-space integration of MPE loss via differencing of MMI functionals for generalized error-weighted discriminative training
Using the central observation that margin-based weighted classification error (modeled using Minimum Phone Error (MPE)) corresponds to the derivative with respect to the margin term of margin-based hinge loss (modeled using Maximum Mutual Information (MMI)), this article subsumes and extends margin-based MPE and MMI within a broader framework in which the objective function is an integral of MP...
متن کاملA multiple-mini interview (MMI) for emergency medicine residency admissions: A brief report and qualitative analysis
Introduction: A multiple-mini interview (MMI) is a type ofstructured interview, which may assess many non-cognitivedomains in residency applicants. There are few studies on MMIduring the emergency medicine (EM) residency admissionsprocess in the United States. We sought to determine the strengths,weaknesses, and acceptability of a pilot MMI for EM residencyadmissions.Methods: We piloted a five-...
متن کاملAntithyroid Drugs
The thionamide drugs, i.e. carbimazole and its metabolite methimazole (MMI), andpropylthiouracil (PTU) have extensively been used in the management of various forms ofhyperthyroidism over the past eight decades. This review aims to summarize different aspectsof these outstanding medications. Thionamides have shown their own acceptable efficacy andeven safety profiles in ...
متن کاملSemi-supervised maximum mutual information training of deep neural network acoustic models
Maximum Mutual Information (MMI) is a popular discriminative criterion that has been used in supervised training of acoustic models for automatic speech recognition. However, standard discriminative training is very sensitive to the accuracy of the transcription and hence its implementation in a semisupervised setting requires extensive filtering of data. We will show that if the supervision tr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1002.0773 شماره
صفحات -
تاریخ انتشار 2010