MAKHRAJ ‘AIN PRONUNCIATION ERROR DETECTION USING MEL FREQUENCY CEPSTRAL COEFFICIENT AND MODIFIED VGG-16
نویسندگان
چکیده
Based on research conducted by the Institute of Qur'anic Sciences (IIQ) as many 65% Muslims in Indonesia are illiterate Qur'an. In previous studies, was detection Arabic word pronunciation errors against non-natives using Mel Frequency Cepstral Coefficient (MFCC) and Support Vector Machine (SVM) methods with a test result 54.6%. Due to low accuracy results this study aims design build system that can correct makhraj letter ‘ain method used is combination MFCC Convolutional Neural Network (CNN) vgg-16 structure has been modified. The dataset 1,600 voice recordings divided into two categories incorrect four variations different vowels total data 800 records each category. This several experiments CNN kernel. training model produced best all were kernels 16, 32, 64 final rate 100% for 96% validation. fathah variation, validation 94%. variation dhommah kasrah obtained 97%. Therefore, succeeded distinguishing sound measuring ‘ain. Implementing modified produces high values speech during train process.
منابع مشابه
Modified Mel-frequency Cepstrum Coefficient
This paper describes the principle of MFCC feature extraction and the knowledge of human auditory masking effect in order to introduce a modified-MFCC feature extraction that can improve the robustness of speech recognition systems.
متن کاملMel-frequency cepstral coefficient-based bandwidth extension of narrowband speech
We present a novel MFCC-based scheme for the Bandwidth Extension (BWE) of narrowband speech. BWE is based on the assumption that narrowband speech (0.3–3.4 kHz) correlates closely with the highband signal (3.4–7 kHz), enabling estimation of the highband frequency content given the narrow band. While BWE schemes have traditionally used LP-based parametrizations, our recent work has shown that MF...
متن کاملRobust Speech Recognition Using Perceptual Wavelet Denoising and Mel-frequency Product Spectrum Cepstral Coefficient Features
To improve the performance of Automatic Speech Recognition (ASR) Systems, a new method is proposed to extract features capable of operating at a very low signal-to-noise ratio (SNR). The basic idea introduced in this article is to enhance speech quality as the first stage for Mel-cepstra based recognition systems, since it is well-known that cepstral coefficients provided better performance in ...
متن کاملVoice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques
Digital processing of speech signal and voice recognition algorithm is very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. A direct analysis and synthesizing the complex voice signal is due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching...
متن کاملAutomatic Genre Classification Using Fractional Fourier Transform Based Mel Frequency Cepstral Coefficient and Timbral Features
This paper presents the Automatic Genre Classification of Indian Tamil Music andWestern Music using Timbral and Fractional Fourier Transform (FrFT) based Mel Frequency Cepstral Coefficient (MFCC) features. The classifier model for the proposed system has been built using K-NN (K-Nearest Neighbours) and Support Vector Machine (SVM). In this work, the performance of various features extracted fro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Jurnal Teknik Informatika
سال: 2023
ISSN: ['1979-9160', '2549-7901']
DOI: https://doi.org/10.52436/1.jutif.2023.4.1.419