MAKHRAJ ‘AIN PRONUNCIATION ERROR DETECTION USING MEL FREQUENCY CEPSTRAL COEFFICIENT AND MODIFIED VGG-16

نویسندگان

چکیده

Based on research conducted by the Institute of Qur'anic Sciences (IIQ) as many 65% Muslims in Indonesia are illiterate Qur'an. In previous studies, was detection Arabic word pronunciation errors against non-natives using Mel Frequency Cepstral Coefficient (MFCC) and Support Vector Machine (SVM) methods with a test result 54.6%. Due to low accuracy results this study aims design build system that can correct makhraj letter ‘ain method used is combination MFCC Convolutional Neural Network (CNN) vgg-16 structure has been modified. The dataset 1,600 voice recordings divided into two categories incorrect four variations different vowels total data 800 records each category. This several experiments CNN kernel. training model produced best all were kernels 16, 32, 64 final rate 100% for 96% validation. fathah variation, validation 94%. variation dhommah kasrah obtained 97%. Therefore, succeeded distinguishing sound measuring ‘ain. Implementing modified produces high values speech during train process.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modified Mel-frequency Cepstrum Coefficient

This paper describes the principle of MFCC feature extraction and the knowledge of human auditory masking effect in order to introduce a modified-MFCC feature extraction that can improve the robustness of speech recognition systems.

متن کامل

Mel-frequency cepstral coefficient-based bandwidth extension of narrowband speech

We present a novel MFCC-based scheme for the Bandwidth Extension (BWE) of narrowband speech. BWE is based on the assumption that narrowband speech (0.3–3.4 kHz) correlates closely with the highband signal (3.4–7 kHz), enabling estimation of the highband frequency content given the narrow band. While BWE schemes have traditionally used LP-based parametrizations, our recent work has shown that MF...

متن کامل

Robust Speech Recognition Using Perceptual Wavelet Denoising and Mel-frequency Product Spectrum Cepstral Coefficient Features

To improve the performance of Automatic Speech Recognition (ASR) Systems, a new method is proposed to extract features capable of operating at a very low signal-to-noise ratio (SNR). The basic idea introduced in this article is to enhance speech quality as the first stage for Mel-cepstra based recognition systems, since it is well-known that cepstral coefficients provided better performance in ...

متن کامل

Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques

Digital processing of speech signal and voice recognition algorithm is very important for fast and accurate automatic voice recognition technology. The voice is a signal of infinite information. A direct analysis and synthesizing the complex voice signal is due to too much information contained in the signal. Therefore the digital signal processes such as Feature Extraction and Feature Matching...

متن کامل

Automatic Genre Classification Using Fractional Fourier Transform Based Mel Frequency Cepstral Coefficient and Timbral Features

This paper presents the Automatic Genre Classification of Indian Tamil Music andWestern Music using Timbral and Fractional Fourier Transform (FrFT) based Mel Frequency Cepstral Coefficient (MFCC) features. The classifier model for the proposed system has been built using K-NN (K-Nearest Neighbours) and Support Vector Machine (SVM). In this work, the performance of various features extracted fro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Jurnal Teknik Informatika

سال: 2023

ISSN: ['1979-9160', '2549-7901']

DOI: https://doi.org/10.52436/1.jutif.2023.4.1.419