Constant-Q Transform Toolbox for Music Processing
Authors
Abstract
This paper proposes a computationally efficient method for computing the constant-Q transform (CQT) of a time-domain signal. The CQT is a time-frequency representation in which the frequency bins are geometrically spaced and the Q-factors (ratios of the center frequencies to bandwidths) of all bins are equal. An inverse transform is proposed that enables a reasonable-quality (around 55 dB signal-to-noise ratio) reconstruction of the original signal from its CQT coefficients. CQTs with high Q-factors, equivalent to 12–96 bins per octave, are of particular interest here. The proposed method is flexible with regard to the number of bins per octave, the applied window function, and the Q-factor, and is particularly suitable for the analysis of music signals. A reference implementation of the proposed methods is published as a Matlab toolbox. The toolbox includes user-interface tools that facilitate spectral data visualization as well as indexing and working with the data structure produced by the CQT.
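The bin layout described in the abstract can be illustrated with a short sketch. It is not the toolbox's implementation: the function names are hypothetical, the 55 Hz starting frequency is an arbitrary choice, and the Q-factor uses the common textbook convention that each bin's bandwidth equals the spacing to the next bin, which the toolbox may scale differently.

```python
def cqt_center_frequencies(f_min, bins_per_octave, n_bins):
    """Geometrically spaced center frequencies: f_k = f_min * 2^(k / B)."""
    return [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]


def cqt_q_factor(bins_per_octave):
    """Q = f_k / bandwidth_k, assuming the bandwidth equals the distance
    to the next bin (an assumption, not taken from the paper)."""
    return 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)


# Two octaves starting at 55 Hz with 12 bins per octave (illustrative values).
freqs = cqt_center_frequencies(f_min=55.0, bins_per_octave=12, n_bins=25)
q = cqt_q_factor(12)

# Bandwidths grow with frequency, so their ratio to f_k (the Q-factor) stays fixed.
bandwidths = [f / q for f in freqs]
```

With 12 bins per octave, every twelfth bin doubles in frequency, and the same Q applies to all bins. Raising the number of bins per octave toward 96 raises Q accordingly.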
Similar resources
LTFAT: A Matlab/Octave toolbox for sound processing
Time-frequency transforms have been used extensively to visualize and manipulate musical signals. The Large Time Frequency Analysis Toolbox is an Octave/Matlab toolbox for modern signal analysis and synthesis. The toolbox provides a large variety of linear and invertible time-frequency transforms like Gabor, MDCT, constant-Q, filterbank and wavelet transforms, and routines for modifying music...
Tools for Interactive Audio Signal Analysis Based on Sliding DFT
This article describes an application the author developed to compare analysis and synthesis of musical audio signals through the Short-Time Fourier Transform (STFT), the Constant-Q transform and the Sliding Discrete Fourier Transform (SDFT). This software is the basis for applications of SDFT and Constant Q to other consolidated synthesis techniques. By itself, it is a standalone instrument for calculatin...
A Matlab Toolbox for Efficient Perfect Reconstruction Time-Frequency Transforms with Log-Frequency Resolution
In this paper, we propose a time-frequency representation where the frequency bins are distributed uniformly in log-frequency and their Q-factors obey a linear function of the bin center frequencies. The latter allows for time-frequency representations where the bandwidths can be e.g. constant on the log-frequency scale (constant Q) or constant on the auditory critical-band scale (smoothly vary...
Short-Term Memory and Event Memory Classification Systems for Automatic Polyphonic Music Transcription
Music transcription consists in transforming the musical content of audio data into a symbolic representation. The objective of this study is to investigate a transcription system for polyphonic piano. The input to this system consists in piano music recordings stored in WAV files, while the pitch of all the notes in the corresponding score forms the output. The proposed method focuses on tempo...
Audio Pitch Shifting Using the Constant-Q Transform
Pitch shifting of polyphonic music is usually performed by manipulating the time-frequency representation of the input signal. Most approaches proposed in the past are based on the Fourier transform, although its linear frequency bin spacing is known to be inadequate to some degree for analyzing and processing music signals. Recently, invertible constant-Q transforms (CQT) featuring high Q-factor...