Improved Language Model Adaptation Using Existing and Derived External Resources

نویسندگان

  • Pi-Chuan Chang
  • Lin-Shan Lee
چکیده

Adaptation of language models to obtain better parameters for the topics addressed by the spoken documents to be recognized has been a key issue for speech recognition. In this paper, we propose to collect existing as well as derived external resources for improved language model adaptation. The derived external resources are those retrieved based on the baseline transcriptions for the input spoken documents from the Internet using some search engine. The design of queries for such purposes are also analyzed in this paper, in which the special structure of Chinese language is considered. The obtained existing and derived external resources are then used in the model adaptation under a ClusteringClassification framework. Very encouraging results were obtained in the preliminary experiments with two test sets: broadcast news and interview recording.

منابع مشابه

Improved Estimates of Kinematic Wave Parameters for Circular Channels

The momentum equation in the kinematic wave model is a power-law equation with two parameters. These parameters, which relate the discharge to the flow area, are commonly derived using Manning’s equation. In general, the values of these parameters depend on the flow depth except for some special cross sections. In this paper, improved estimates of the kinematic wave parameters for circular chan...

متن کامل

Persian Adaptation of Enhanced Milieu Teaching for Iranian Children With Expressive Language Delay

Objectives: This study aimed at adapting and examining the applicability of the Teach-Model-Coach-Review model of the enhanced milieu teaching (EMT) approach for improving Iranian mothers’ language strategies while interacting with their toddlers with expressive language delay. Methods: In a single-subject multiple-baseline across-behavior study, the mothers of 3 toddlers with expressive langu...

متن کامل

Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition

This paper demonstrates combinations of various language model (LM) technologies simultaneously, not only modeling techniques but also those for training data expansion based on external language resources and unsupervised adaptation for spontaneous speech recognition. Although forming combinations of various LM technologies has been examined, previous works focused on only modeling techniques....

متن کامل

A Model for Standardization/Adaptation Strategy Selection in the Irans Multinational Companies (MNCs)

Purpose-The research aims at evaluating the standardization/adaptation of international marketing strategy in Iran multinational companies (MNCs) based a model in which the impact of external environmental variables on the marketing mix internal variables (i.e. Product, Promotion, Price and Place) is considered, while in the previous researches no attempt was done to examine the interdepende...

متن کامل

Corpus-Based methods for Short Text Similarity

This paper presents corpus-based methods to find similarity between short text (sentences, paragraphs, ...) which has many applications in the field of NLP. Previous works on this problem have been based on supervised methods or have used external resources such as WordNet, British National Corpus etc. Our methods are focused on unsupervised corpus-based methods. We present a new method, based ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

متن کامل
عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003