نتایج جستجو برای: coefficient mfcc
تعداد نتایج: 170818 فیلتر نتایج به سال:
Abstract In this paper, general regression neural network (GRNN) with the input feature of Mel-frequency cepstrum coefficient (MFCC) is employed to automatically recognize calls leopard, ross, and weddell seals widely overlapping living areas. As a feedforward network, GRNN has only one parameter, i.e., spread factor. The recognition performance can be greatly improved by determining factor bas...
Penelitian ini bertujuan untuk membandingkan akurasi pengenalan emosi melalui suara dengan menggunakan beberapa jenis classifier. Emosi dasar yang akan dikenali ada 4, yaitu senang, sedih, neutral dan marah. Metodologi penelitian dimulai memperoleh dataset dari database RAVDESS, terdiri 24 aktor jumlah sebanyak 60 per aktor. Namun, hanya 28 dipilih setiap aktor, sehingga total 672 digunakan dal...
Heart murmurs are sounds made by rapid blood flow in the heart. Abnormal heart can be a sign of serious conditions such as arrhythmia and cardiovascular diseases. Therefore, murmur classification is crucial for early detection conditions. To this end, we study problem training selected convolutional neural network (CNN) models (such VGGNet ResNet) using various signal representations spectrogra...
Vocal and nonvocal segmentation is an important task in singing voice signal processing. Before identifying the singer it is necessary to locate the singer’s voice in a song. Maximum of the songs start with a piece of instrumental accompaniment known as ‘prelude’ in musical terms after which the singing voice comes into play. Therefore, it is necessary to detect the vocal region in the song in ...
This research was conducted to develop a method to identify voice utterance. For voice utterance that encounters change caused by aging factor, with the interval of 10 to 25 years. The change of voice utterance influenced by aging factor might be extracted by MFCC (Mel Frequency Cepstrum Coefficient). However, the level of the compatibility of the feature may be dropped down to 55%. While the o...
As multimedia becomes the dominant form of entertainment through an ever increasing range of digital formats, there has been a growing interest in obtaining information from entertainment media. Speech is one of the core resources in multimedia, providing a foundation for the extraction of semantic information. Thus, detecting speech is a critical first step for speech-based information retriev...
Standard Mel frequency cepstrum coefficient (MFCC) computation technique utilizes discrete cosine transform (DCT) for decorrelating log energies of filter bank output. The use of DCT is reasonable here as the covariance matrix of Mel filter bank log energy (MFLE) can be compared with that of highly correlated Markov-I process. This full-band based MFCC computation technique where each of the fi...
نمودار تعداد نتایج جستجو در هر سال
با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید