Language-identification based on cross-language acoustic models and optimised information combination

نویسندگان

Ove Andersen

Paul Dalsgaard

چکیده

decoding, the second transforms the parameters from This work is concerned with the subject of languagethe decoding module and classifies the language. identification (LID). Two central issues are addressed. The common acoustic signal preprocessor calculates The first is to analyse the trade-off between detailed 12 RASTA filtered MFCC’s, their first derivatives and acoustic modelling and robust estimation of acoustic the delta-log-energy. The phone and language decoding and language models. The second to find the optimal module consists of three parallel branches. In each of combination of acoustic and language scores for languagethese the phone recogniser matches the acoustic identification. parameters to the acoustic models used by that recogniser. Experiments are carried out using the three languages The output from each recogniser is further matched American-English, German and Spanish from the OGI-TS against three language models. database. It is shown that on the average the acoustic The combined output X from all language models modelling is able to recognise 46.3% of the phones correctly and from all recognisers are used as input to the across the three languages. Insertion and deletion rate ‘information combination and the language-classification’ is 35.7% and 6.6%, respectively. Language-identification module (ICLC). This module enforces a transformation performance is 82.6% with the full set of acoustic models. onto the parameters X and estimates the most probable The performance is increased to 83.7% after having language given the acoustic input. conducted 80 iterations of a hierarchical clustering in which phones are merged across the languages.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language identification incorporating lexical information

In this paper we explore the use of lexical information for language identification (LID). Our reference LID system uses language-dependent acoustic phone models and phone-based bigram language models. For each language, lexical information is introduced by augmenting the phone vocabulary with the N most frequent words in the training data. Combined phone and word bigram models are used to prov...

متن کامل

Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles

When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...

متن کامل

Automatic speech recognition of Cantones

This paper describes our recent work on the development of a largevocabulary, speaker-independent, continuous speech recognition system for Cantonese-English code-mixing utterances. The details of both acoustic modeling and language modeling will be discussed. For acoustic modeling, Cantonese accents in English words are handled by applying cross-lingual acoustic units, as well as modifications...

متن کامل

A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information

Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating them potentially can play an important role in transmitt...

متن کامل

The Impact of Structured Input-based Tasks on L2 Learners’ Grammar Learning

Abstract Task-based language teaching has received increased attention in second language research. However, the combination of structured input-based approach and task-based language teaching has not been examined in relation to L2 grammar learning. To address this gap, the present study investigated how the structured input-based tasks with and without explicit information impacted learners’ ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1997

Language-identification based on cross-language acoustic models and optimised information combination

نویسندگان

چکیده

منابع مشابه

Language identification incorporating lexical information

Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles

Automatic speech recognition of Cantones

A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information

The Impact of Structured Input-based Tasks on L2 Learners’ Grammar Learning

عنوان ژورنال:

اشتراک گذاری