Speech Endpoint Detection Based on High Order Statistics
نویسندگان
چکیده
For automatic speech recognition, endpoint detection is required to isolate the speech of interest so as to be able to create a speech pattern or template. The process of separating the speech segments of an utterance from the nonspeech segments obtained during the recording process is called endpoint detection. In this paper, we present new endpoint detection algorithm based on high order statistical models and empirical rule-based energy detection algorithm. The performance of the proposed algorithm was evaluated for Arabic phonemes including the weak consonants which are difficult to detect using conventional endpoint detection methods. Keywords—voice activity detection (VAD), frequency domain, spectral domain, high order statistics (HOS).
منابع مشابه
Endpoint in plasma etch process using new modified w-multivariate charts and windowed regression
Endpoint detection is very important undertaking on the side of getting a good understanding and figuring out if a plasma etching process is done in the right way, especially if the etched area is very small (0.1%). It truly is a crucial part of supplying repeatable effects in every single wafer. When the film being etched has been completely cleared, the endpoint is reached. To ensure the desi...
متن کاملRobot Arm Performing Writing through Speech Recognition Using Dynamic Time Warping Algorithm
This paper aims to develop a writing robot by recognizing the speech signal from the user. The robot arm constructed mainly for the disabled people who can’t perform writing on their own. Here, dynamic time warping (DTW) algorithm is used to recognize the speech signal from the user. The action performed by the robot arm in the environment is done by reducing the redundancy which frequently fac...
متن کاملSVM-based speech endpoint detection using contextual speech features
Shown is an effective speech endpoint detection algorithm using a trained support vector machine (SVM) and a feature vector including contextual information speech features. With this and other innovations the proposed algorithm yields high discrimination and reports significant improvements over standard methods and algorithms defining the decision rule in terms of averaged subband speech feat...
متن کاملEndpoint detection using weighted finite state transducer
In this paper, we discuss the possibility of applying weighted finite state transducer (WFST) as a unified framework to solve endpoint detection problem. In general, endpoint detection is composed of two cascaded decision processes. The first process is voice activity detection (VAD) which makes framelevel speech/non-speech classification. The second process is utterance-level detection which m...
متن کاملEvaluation of real-time voice activity detection based on high order statistics
We have proposed a method for real-time, unsupervised voice activity detection (VAD). In this paper, problems of feature selection and classification scheme are addressed. The feature is based on High Order Statistics (HOS) to discriminate close and far-field talk, enhanced by a feature derived from the normalized autocorrelation. Comparative effectiveness on several HOS is shown. The classific...
متن کامل