C++ software environment for speech signal processing
Abstract
We present the C++ library SPC (Speech Signal Processing Classes), a development tool for assembling speech processing applications. SPC offers real-time processing, batch processing of large databases, visualization, and analysis of signals between processing steps. In SPC, the data stream that occurs in speech processing is partitioned into three separate information flows: signal data, control information, and visualization data. Because hardware-dependent code is confined to a few special methods, SPC can be adapted easily to different hardware environments. System-specific code is encapsulated in the low-level parts of SPC, so SPC user programs can be compiled on various platforms without any changes to their source code. Up to now, SPC supports Windows 95/98/NT, IBM AIX, and Linux.
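The separation into three information flows can be illustrated with a minimal C++ sketch. It is not SPC's actual interface, which the abstract does not describe: every name here (`Frame`, `Control`, `VisualSink`, `Step`, `GainStep`, `run_chain`) is hypothetical and chosen only for illustration. The sketch assumes that each processing step receives signal data, control information, and a visualization sink as three separate arguments, so a signal can be inspected between steps without touching the signal path.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <string>
#include <vector>

// All names below are hypothetical; none are taken from SPC itself.

// Flow 1: signal data, a frame of samples handed from step to step.
using Frame = std::vector<float>;

// Flow 2: control information, parameters that steer a processing step.
struct Control {
    float gain = 1.0f;
    bool bypass = false;
};

// Flow 3: visualization data, values tapped between steps for inspection.
struct VisualSink {
    std::vector<std::string> labels;
    std::vector<float> values;
    void show(const std::string& label, float value) {
        labels.push_back(label);
        values.push_back(value);
    }
};

// A processing step sees all three flows but keeps them separate.
class Step {
public:
    virtual ~Step() = default;
    virtual Frame process(const Frame& in, const Control& ctl,
                          VisualSink& vis) = 0;
};

// Example step: scales the samples and reports the peak magnitude.
class GainStep : public Step {
public:
    Frame process(const Frame& in, const Control& ctl,
                  VisualSink& vis) override {
        Frame out(in);
        if (!ctl.bypass) {
            for (float& s : out) s *= ctl.gain;
        }
        float peak = 0.0f;
        for (float s : out) peak = std::max(peak, std::fabs(s));
        vis.show("gain.peak", peak);  // visualization flow only
        return out;                   // signal flow only
    }
};

// Runs a frame through a chain of steps in order.
Frame run_chain(const std::vector<Step*>& steps, Frame frame,
                const Control& ctl, VisualSink& vis) {
    for (Step* step : steps) {
        assert(step != nullptr);
        frame = step->process(frame, ctl, vis);
    }
    return frame;
}
```

To use the sketch, a caller would construct one or more steps, pass pointers to them to `run_chain` together with a `Frame`, a `Control`, and a `VisualSink`, and then read the sink to see what each step reported. The same pattern could be extended with an abstract device interface to keep platform-specific code in one place, but that is likewise an assumption rather than something the abstract specifies.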
Similar resources
A Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also used in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
Speech processing in the object-oriented DSP environment QuickSig
Several new software techniques are available that can be used to enhance the productivity and flexibility of speech signal processing, especially in research and exploratory programming of new algorithms. This paper describes an object-oriented signal processing environment, QuickSig, and how it is applied in various speech processing tasks. The notion of signals as objects and operations as ge...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making man-machine interaction more natural. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
APS: an environment for acoustic phonetic research
Investigation into the relationship between phonetic segments and their acoustic values demands complex data analysis. This paper describes a software environment called APS designed to aid this task. APS provides a means of indexing segments of continuous speech in a general way; these segments may be obtained from a hand-produced phonemic transcription of the speech according to a phonemic co...
A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, many of the non-speech signals may be classified as ‘noise’ in human communication. The process of separating conversational speech from noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...