An Improved Hierarchical Approach for Music-to-symbolic Score Alignment
نویسندگان
چکیده
We present an efficient approach for an off-line alignment of a symbolic score to a recording of the same piece, using a statistical model. A hidden state model is built from the score, which allows for the use of two different kinds of features, namely chroma vectors and an onset detection function (spectral flux) with specific production models, in a simple manner. We propose a hierarchical pruning method for an approximate decoding of this statistical model. This strategy reduces the search space in an adaptive way, yielding a better overall efficiency than the tested state-of-the art method. Experiments run on a large database of 94 pop songs show that the resulting system obtains higher recognition rates than the dynamic programming algorithm (DTW), with a significantly lower complexity, even though the rhythmic information is not used for the alignment.
منابع مشابه
Towards Audio to Score Alignment in the Symbolic Domain
This paper presents a matrix factorization based feature for audio to score alignment. We show that in combination with dynamic time warping it can compete with chroma vectors, which are the probably most frequently used approach within the last years. A great benefit of the factorizationbased feature is its sparseness, which can be used in order to transform it into a symbolic representation. ...
متن کاملStatistical Music Modeling Aimed at Identification and Alignment
This paper describes a methodology for the statistical modeling of music works. Starting from either the representation of the symbolic score or the audio recording of a performance, a hidden Markov model is built to represent the corresponding music work. The model can be used to identify unknown recordings and to align them with the corresponding score. Experimental evaluation using a collect...
متن کاملBridging Printed Music and Audio Through Alignment Using a Mid-level Score Representation
We present a system that utilizes a mid-level score representation for aligning printed music to its audio rendition. The mid-level representation is designed to capture an approximation to the musical events present in the printed score. It consists of a template based note detection frontend that seeks to detect notes without regard to musical duration, accidentals or the key signature. The p...
متن کاملReal-Time Music Tracking Using Multiple Performances as a Reference
In general, algorithms for real-time music tracking directly use a symbolic representation of the score, or a synthesised version thereof, as a reference for the on-line alignment process. In this paper we present an alternative approach. First, different performances of the piece in question are collected and aligned (off-line) to the symbolic score. Then, multiple instances of the on-line tra...
متن کاملAn Approach for Linking Score and Audio Recordings in Makam Music of Turkey
The main information sources to study a particular piece of music are symbolic scores and audio recordings. These are complementary representations of the piece and it is very useful to have a proper linking between the two of the musically meaningful events. For the case of makam music of Turkey, linking the available scores with the corresponding audio recordings requires taking the specifici...
متن کامل