Estimating Resemblance of MIDI Documents
نویسندگان
چکیده
Search engines often employ techniques for determining syntactic similarity of Web pages. Such a tool allows them to avoid returning multiple copies of essentially the same page when a user makes a query. Here we describe our experience extending these techniques to MIDI music les. The music domain requires modi cation to cope with problems introduced in the musical setting, such as polyphony. Our experience suggests that when used properly these techniques prove useful for determining duplicates and clustering databases in the musical setting as well.
منابع مشابه
Identifying and Filtering Near-Duplicate Documents
The mathematical concept of document resemblance captures well the informal notion of syntactic similarity. The resemblance can be estimated using a fixed size “sketch” for each document. For a large collection of documents (say hundreds of millions) the size of this sketch is of the order of a few hundred bytes per document. However, for efficient large scale web indexing it is not necessary t...
متن کاملAutomatic rhythm transcription from multiphonic MIDI signals
For automatically transcribing human-performed polyphonic music recorded in the MIDI format, rhythm and tempo are decomposed through probabilistic modeling using Viterbi search in HMM for recognizing the rhythm and EM Algorithm for estimating the tempo. Experimental evaluation are also presented.
متن کاملEstimating Musical Time Information from Performed MIDI Files
Even though originally developed for exchanging control commands between electronic instruments, MIDI has been used as quasi standard for encoding and storing scorerelated parameters. MIDI allows for representing musical time information as specified by sheet music as well as physical time information that reflects performance aspects. However, in many of the available MIDI files the musical be...
متن کاملOn the resemblance and containment of documents
Given two documents A and B we define two mathematical notions: their resemblance r(A,B) and their containment c(A,B) that seem to capture well the informal notions of “roughly the same” and “roughly contained.” The basic idea is to reduce these issues to set intersection problems that can be easily evaluated by a process of random sampling that can be done independently for each document. Furt...
متن کاملDetection of Torque teno midi virus/Small anellovirus (TTMDV/SAV) in the sera of domestic village chickens and its vertical transmission from hen to eggs
Although the infection of different animals and non-human primates with other members of Anelloviridae have already been reported there is no report about infection of animals with Torque teno midi virus/Small anellovirs (TTMDV/SAV). The aim of this study was to detect the virus in domestic village chickens. Blood samples were collected from 79 domestic village chickens in Isfahan. Blood sample...
متن کامل