Global Structure-Aware Drum Transcription Based on Self-Attention Mechanisms

Abstract

This paper describes an automatic drum transcription (ADT) method that directly estimates a tatum-level drum score from a music signal, in contrast to most conventional ADT methods, which estimate the frame-level onset probabilities of drums. To estimate a tatum-level score, we propose a deep transcription model that consists of an encoder for extracting latent features from a music signal and a decoder for estimating a drum score from the latent features pooled at the tatum level. To capture the global repetitive structure of drum scores, which is difficult to learn with a recurrent neural network (RNN), we introduce a self-attention mechanism with tatum-synchronous positional encoding into the decoder. To mitigate the difficulty of training the self-attention-based model from an insufficient amount of paired data and to improve the musical naturalness of the estimated scores, we propose a regularized training method that uses a global structure-aware masked language (score) model pretrained on an extensive collection of drum scores. The experimental results showed that the proposed regularized model outperformed the conventional RNN-based model in terms of error rate and F-measure, even when only a limited amount of paired data was available so that the non-regularized model underperformed the RNN-based model.
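
As a rough illustration of two ideas named in the abstract, the sketch below pools frame-level feature vectors into one vector per tatum and then adds a positional encoding indexed by tatum number rather than by frame. The abstract does not give the actual pooling rule or encoding formula, so mean pooling and the standard sinusoidal encoding are assumptions here, and the function names (pool_frames_to_tatums, sinusoidal_encoding, add_positions) are hypothetical.

```python
import math


def pool_frames_to_tatums(frame_features, tatum_boundaries):
    """Average frame-level vectors inside each tatum interval (assumed mean pooling).

    frame_features: list of equal-length feature vectors, one per frame.
    tatum_boundaries: increasing frame indices [b0, ..., bK]; tatum k
    covers frames b_k up to (but not including) b_{k+1}.
    """
    pooled = []
    for start, end in zip(tatum_boundaries[:-1], tatum_boundaries[1:]):
        segment = frame_features[start:end]
        if not segment:
            raise ValueError("empty tatum interval")
        dim = len(segment[0])
        pooled.append(
            [sum(frame[d] for frame in segment) / len(segment) for d in range(dim)]
        )
    return pooled


def sinusoidal_encoding(num_positions, dim):
    """Standard sinusoidal positional encoding; positions are tatum indices here."""
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(dim):
            angle = pos / (10000 ** (2 * (i // 2) / dim))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table


def add_positions(pooled):
    """Add the tatum-indexed encoding to each pooled vector."""
    encoding = sinusoidal_encoding(len(pooled), len(pooled[0]))
    return [[x + p for x, p in zip(vec, pe)] for vec, pe in zip(pooled, encoding)]


# Four frames with 2-dimensional features, split into two tatums of two frames each.
frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0], [7.0, 8.0]]
pooled = pool_frames_to_tatums(frames, [0, 2, 4])
decoder_input = add_positions(pooled)
```

Because the position index counts tatums, two bars with the same rhythm receive encodings offset by a fixed number of positions regardless of tempo, which is one plausible reason a tatum-synchronous encoding would suit repetitive drum patterns.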


Similar resources

Drum Transcription Based on Independent Subspace Analysis

In automatic music transcription, metadata extraction from recorded audio data, or speaker separation in video conferencing, a significant prerequisite task is to analyze the audio signal and separate it into its original source components. In this report, I study and analyze a set of methods for extracting percussive-instrument metadata from polyphonic music. It mainly focuses on the s...

Automatic Drum Transcription for Polyphonic Recordings Using Soft Attention Mechanisms and Convolutional Neural Networks

Automatic drum transcription is the process of generating symbolic notation for percussion instruments within audio recordings. To date, recurrent neural network (RNN) systems have achieved the highest evaluation accuracies for both drum solo and polyphonic recordings; however, the accuracies within a polyphonic context remain relatively low. To improve accuracy for polyphonic recordings, ...

Separable mechanisms underlying global feature-based attention.

Feature-based attention is known to operate in a spatially global manner, in that the selection of attended features is not bound to the spatial focus of attention. Here we used electromagnetic recordings in human observers to characterize the spatiotemporal signature of such global selection of an orientation feature. Observers performed a simple orientation-discrimination task while ignoring ...

Combining Temporal and Spectral Features in HMM-Based Drum Transcription

To date, several methods for transcribing drums from polyphonic music have been published. The majority of the features used in the transcription systems are “spectral”: they parameterise some property of the signal spectrum in relatively short time frames. It has been shown that utilising narrow-band features describing long-term temporal evolution in conjunction with the more traditional features c...

The Effectiveness of Self-Care Training, Based on the Self-Care Model, on the Global Function of Schizophrenia

Introduction: Schizophrenia has a significant effect on the performance of patients, mostly on global functioning. The aim was to evaluate the effectiveness of a self-care program based on Orem's model on the global function (GF) of patients with schizophrenia. Methods: A self-care training program was developed and evaluated based on the Orem self-care model in two phases. At first, ...


Journal

Journal title: Signals

Year: 2021

ISSN: 2624-6120

DOI: https://doi.org/10.3390/signals2030031