Robustness to Transmission Channel – the DSR Approach
نویسنده
چکیده
The desire for improved user interfaces for distributed speech and multimodal services on mobile devices has motivated the need for reliable recognition performance over mobile channels. Performance needs to be robust both to background noise and to any errors introduced by the mobile transmission channel. There has been much work in the telecommunications standards bodies to develop standards to achieve this (ETSI Aurora and 3GPP). The Aurora interest in noise robust frontends is well known but in this paper the emphasis is given to the topic of channel robustness. The general area of channel robustness is very large so this paper takes the perspective of mobile telecommunications standards and the Distributed Speech Recognition (DSR) approach to robustness. As background, the paper first provides an overview of the work in different standards bodies on DSR: the DSR standards created in ETSI Aurora; the work on Speech Enabled Services in 3GPP; the transport protocols in IETF. The different mobile channel types are reviewed next using the particular example of the GSM network. Drawing results from sources in the literature and in the standards bodies, a comparison is made between performance using a voice codec or DSR. Comparison is first made in error-free conditions to separate out the effects of speech compression. Robustness to channel errors is then examined; both with circuit-switched errors and with packet-switched errors. Finally some more advanced error mitigation techniques are cited. These are compatible with the DSR features and can provide even greater robustness with poor channels.
منابع مشابه
طراحی و ارزیابی روش کدگذاری ترکیبی برای کانال پوششی زمانبندیدار در شبکه اینترنت
Covert channel means communicating information through covering of overt and authorized channel in a manner that existence of channel to be hidden. In network covert timing channels that use timing features of transmission packets to modulating covert information, the appropriate encoding schema is very important. In this paper, a hybrid encoding schema proposed through combining "the inter-pac...
متن کاملAutomatic speech recognition over error-prone wireless networks
The past decade has witnessed a growing interest in deploying automatic speech recognition (ASR) in communication networks. The networks such as wireless networks present a number of challenges due to e.g. bandwidth constraints and transmission errors. The introduction of distributed speech recognition (DSR) largely eliminates the bandwidth limitations and the presence of transmission errors be...
متن کاملNonlinearity Transmission of Monetary Policy Through the House Price Channel; MSVAR Approach
In the past decades, the discussions between economists have changed of impact or no impact of monetary policy to effective monetary policy channels. Since, The housing sector has a large share in the household consumption basket, gross domestic product and wealth of the private sector, Then this sector play very decisive role in transfer transfer of monetary policy effects on production. There...
متن کاملA Computationally Efficient Mel-Filter Bank VAD Algorithm for Distributed Speech Recognition Systems
This paper presents a novel computationally efficient voice activity detection (VAD) algorithm and emphasizes the importance of such algorithms in distributed speech recognition (DSR) systems. When using VAD algorithms in telecommunication systems, the required capacity of the speech transmission channel can be reduced if only the speech parts of the signal are transmitted. A similar objective ...
متن کاملA comparison of distributed and network speech recognition for mobile communication systems
In this paper, we compare the conventional Network Speech Recognition (NSR) and the newly established Distributed Speech Recognition (DSR) concepts for mobile communications. These implementation approaches to Automatic Speech Recognition (ASR) are analyzed from three aspects. First, the effect on the speech recognition accuracy of ASR systems with various complexity. Second, usability in diffe...
متن کامل