Perceptually-Based Evaluation of the Errors Usually Made When Automatically Transcribing Music
نویسندگان
چکیده
This paper investigates the perceptual importance of typical errors occurring when transcribing polyphonic music excerpts into a symbolic form. The case of the automatic transcription of piano music is taken as the target application and two subjective tests are designed. The main test aims at understanding how human subjects rank typical transcription errors such as note insertion, deletion or replacement, note doubling, incorrect note onset or duration, and so forth. The Bradley-Terry-Luce (BTL) analysis framework is used and the results show that pitch errors are more clearly perceived than incorrect loudness estimations or temporal deviations from the original recording. A second test presents a first attempt to include this information in more perceptually motivated measures for evaluating transcription systems.
منابع مشابه
Transcribing tone - a likelihood-based quantitative evaluation of chao's tone letters
The accuracy of the widely used and International Phonetic Association-sanctioned Chao five-point scale of tonal transcription is examined quantitatively. Perceptually transformed acoustic data are used from two Chinese dialects with complex tone systems, and a measure derived of the conformability of the data using their likelihoods. It is shown that some tones conform well to the model, but o...
متن کاملAutomatic transcription of Turkish microtonal music.
Automatic music transcription, a central topic in music signal analysis, is typically limited to equal-tempered music and evaluated on a quartertone tolerance level. A system is proposed to automatically transcribe microtonal and heterophonic music as applied to the makam music of Turkey. Specific traits of this music that deviate from properties targeted by current transcription tools are disc...
متن کاملSeparating Advertisements and DJ Chatter from Music
Music on the radio is often interrupted by advertisements or DJ chatter that the listener would prefer not to hear. In this paper, we discuss a straightforward application of perceptually-based audio feature extraction and classification using a support vector machine to automatically differentiate between music and “non-music” audio, so that when non-music is detected the radio might automatic...
متن کاملAutomatic rhythm transcription from multiphonic MIDI signals
For automatically transcribing human-performed polyphonic music recorded in the MIDI format, rhythm and tempo are decomposed through probabilistic modeling using Viterbi search in HMM for recognizing the rhythm and EM Algorithm for estimating the tempo. Experimental evaluation are also presented.
متن کاملAutomatic Transcription of Turkish Makam Music
In this paper we propose an automatic system for transcribing makam music of Turkey. We document the specific traits of this music that deviate from properties that were targeted by transcription tools so far and we compile a dataset of makam recordings along with aligned microtonal ground-truth. An existing multi-pitch detection algorithm is adapted for transcribing music in 20 cent resolution...
متن کامل