Research on Framework of Speech Recognition Combining Text - Speech with Semantic Similarity

نویسنده

  • Xianyi Cheng
چکیده

With the deepening of the speech recognition research, improving the accuracy of the general recognition engine is becoming more and more difficult. For the noise problem of Chinese speech recognition, in this paper we made a brief review and analysis about the current related articles of the semantic similarity and the text-speech similarity. We compared the advantages and disadvantages of various methods in speech recognition and pointed out the difficulties and the common problems existed in the research. At last, we give a speech recognition framework combining the semantic similarity and text-speech similarity. It provides a new way to improve the accuracy of speech recognition. Keywords—Similarity, speech recognition, natural language processing, semanteme.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developing a Semantic Similarity Judgment Test for Persian Action Verbs and Non-action Nouns in Patients With Brain Injury and Determining its Content Validity

Objective: Brain trauma evidences suggest that the two grammatical categories of noun and verb are processed in different regions of the brain due to differences in the complexity of grammatical and semantic information processing. Studies have shown that the verbs belonging to different semantic categories lead to neural activity in different areas of the brain, and action verb processing is r...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...

متن کامل

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....

متن کامل

Performance Evaluation of WordNet-based Semantic Relatedness Measures for Word Prediction in Conversational Speech

The recognition of conversational speech is a hard problem. Semantic relatedness measures can improve speech recognition performance when using contextual information, as Demetriou [5] has shown. The standard n-gram approach in language modeling for speech recognition cannot cope with long distance dependencies [4]. Therefore J. Bellegarda [2] proposed combining n-gram language models, which ar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014