Classification-Based Singing Melody Extraction Using Deep Convolutional Neural Networks
نویسندگان
چکیده
Singing melody extraction is the task that identifies the melody pitch contour of singing 1 voice from polyphonic music. Most of the traditional melody extraction algorithms are based on 2 calculating salient pitch candidates or separating the melody source from the mixture. Recently, 3 classification-based approach based on deep learning has drawn much attentions. In this paper, 4 we present a classification-based singing melody extraction model using deep convolutional neural 5 networks. The proposed model consists of a singing pitch extractor (SPE) and a singing voice activity 6 detector (SVAD). The SPE is trained to predict a high-resolution pitch label of singing voice from 7 a short segment of spectrogram. This allows the model to predict highly continuous curves. The 8 melody contour is smoothed further by post-processing the output of the melody extractor. The 9 SVAD is trained to determine if a long segment of mel-spectrogram contains a singing voice. This 10 often produces voice false alarm errors around the boundary of singing segments. We reduced them 11 by exploiting the output of the SPE. Finally, we evaluate the proposed melody extraction model on 12 several public datasets. The results show that the proposed model is comparable to state-of-the-art 13 algorithms. 14
منابع مشابه
Melody Extraction on Vocal Segments Using Multi-Column Deep Neural Networks
Singing melody extraction is a task that tracks pitch contour of singing voice in polyphonic music. While the majority of melody extraction algorithms are based on computing a saliency function of pitch candidates or separating the melody source from the mixture, data-driven approaches based on classification have been rarely explored. In this paper, we present a classification-based approach f...
متن کاملEMG-based wrist gesture recognition using a convolutional neural network
Background: Deep learning has revolutionized artificial intelligence and has transformed many fields. It allows processing high-dimensional data (such as signals or images) without the need for feature engineering. The aim of this research is to develop a deep learning-based system to decode motor intent from electromyogram (EMG) signals. Methods: A myoelectric system based on convolutional ne...
متن کاملCombining pattern recognition and deep-learning-based algorithms to automatically detect commercial quadcopters using audio signals (Research Article)
Commercial quadcopters with many private, commercial, and public sector applications are a rapidly advancing technology. Currently, there is no guarantee to facilitate the safe operation of these devices in the community. Three different automatic commercial quadcopters identification methods are presented in this paper. Among these three techniques, two are based on deep neural networks in whi...
متن کاملProvide a Deep Convolutional Neural Network Optimized with Morphological Filters to Map Trees in Urban Environments Using Aerial Imagery
Today, we cannot ignore the role of trees in the quality of human life, so that the earth is inconceivable for humans without the presence of trees. In addition to their natural role, urban trees are also very important in terms of visual beauty. Aerial imagery using unmanned platforms with very high spatial resolution is available today. Convolutional neural networks based deep learning method...
متن کاملCystoscopy Image Classication Using Deep Convolutional Neural Networks
In the past three decades, the use of smart methods in medical diagnostic systems has attractedthe attention of many researchers. However, no smart activity has been provided in the eld ofmedical image processing for diagnosis of bladder cancer through cystoscopy images despite the highprevalence in the world. In this paper, two well-known convolutional neural networks (CNNs) ...
متن کامل