Telephone Based Voice Pathology Assessment using Automated Speech Analysis and VoiceXML
نویسندگان
چکیده
A system of remotely detecting vocal fold pathologies using telephone quality speech is presented. Using VoiceXML, a database of 631 clean speech files of the sustained phonation of the vowel sound /a/ (58 normal subjects, 573 pathologic) from the Disordered Voice Database Model 4337 was transmitted over telephone channels to produce a test corpus. Pitch perturbation features, amplitude perturbation features and a set of measures of the harmonic-to-noise ratio are extracted from the clean and transmitted speech files. These feature sets are used to test and train automatic classifiers, employing the method of Linear Discriminant Analysis. Cross-fold validation was employed to measure classifier performances. While a sustained phonation can be classified as normal or pathologic with accuracy greater than 90%, results indicate that a telephone quality speech can be classified as normal or pathologic with an accuracy of 74.15%. Amplitude perturbation features proving most robust in channel transmission. This study highlights the real possibility for remote diagnosis of voice pathology.
منابع مشابه
Voice Pathology Assessment based on a Dialogue System and Speech Analysis
A system of remotely detecting vocal fold pathologies using telephone quality speech recorded during a telephone dialogue is presented. This study aims at developing a dialogue system using VoiceXML for remote diagnosis of voice pathology. To assess the accuracy of the system, a database of 631 clean speech files of the sustained phonation of the vowel sound /a/ (58 normal subjects, 573 patholo...
متن کاملVoice User Interface Design for a Telephone Application Using VoiceXML
VoiceXML is a standard language for developing voice based applications. VoiceXML applications have more advantages over traditional Interactive Voice Response (IVR) systems because they can be used through any type of phones and also accessed via a computer. Voice User Interface (VUI) design is an integral part of developing any VoiceXML application. In this paper, the VUI for a VoiceXML ‘Cine...
متن کاملSpeech Recognition of a Voice-Access Automotive Telematics System using VoiceXML
In order to provide a safe way for drivers to retrieve information, a voice-access Telematics system is implemented based on VoiceXML and web architecture. The noise problems during moving affect recognition rate greatly, and make drivers repeat commands once and once again. For the sake of raising the accuracy of recognition, this paper makes several improvements to the major components of Aut...
متن کاملVoiceXML dialog system of the multimodal IP-Telephony - The application for voice ordering service
The development of IP-Telephony in recent years has been substantial. The improvement in voice quality, the integration between voice and data, especially the interaction with multimedia has made the 3G communication more promising. The value added services of Telephony techniques alleviate the dependence on the phone and provide a universal platform for the multimodal telephony applications. F...
متن کاملVoicexml Builder: a Workbench for Investigating Voiced-based Applications
1 Janet D. Hartman, Illinois State University, Applied Computer Science Department 5150, Normal, IL 61790-5150 [email protected] Joaquin A. Vila, Illinois State University, Applied Computer Science Department 5150, Normal, IL 61790-5150 Abstract -Currently, the most common interaction with the Web is visual and accomplished through the use of the keyboard or mouse. While sound files c...
متن کامل