A comparison of distributed and network speech recognition for mobile communication systems
نویسنده
چکیده
In this paper, we compare the conventional Network Speech Recognition (NSR) and the newly established Distributed Speech Recognition (DSR) concepts for mobile communications. These implementation approaches to Automatic Speech Recognition (ASR) are analyzed from three aspects. First, the effect on the speech recognition accuracy of ASR systems with various complexity. Second, usability in different operating environments (environmental noises). Finally, the resilience to erroneous transmission (radio channel). Our experimental results show that DSR reduces the error rate of ASR systems by 1725% on average compared to NSR, depending on the recognition task. The error resilience of DSR provides up to 37% error rate reduction over NSR under severe channel error conditions. In non-stationary environmental noise, DSR may outperform NSR by as much as 41% in terms of error-rate reduction. Based on these results and considering the low complexity implementation of DSR in the mobile terminals, it is concluded that DSR provides a viable, robust and economical alternative to the traditional NSR approach, especially in real-life mobile operating environments.
منابع مشابه
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
متن کاملروشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه
Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملSurvey of the Speech Recognition Techniques for Mobile Devices
This paper presents an overview of different approaches for providing automatic speech recognition (ASR) technology to mobile users. Three principal system architectures in terms of employing wireless communication link are analyzed: Embedded Speech Recognition Systems, Network Speech Recognition (NSR) and Distributed Speech Recognition (DSR). Overview of the solutions which became standards by...
متن کاملNetwork Investigation and Performance Analysis of ZigBee Technology using OPNET
Abstract- Communication has become inevitably part of our day to day activities, in academic, business, banking, and other sectors. It has therefore become so important to implement good and efficient communication system. A reference point according to this research is the wireless sensor networking (WSN) system, and most important thing in communication is to be free from interference, attenu...
متن کامل