Automatic Music Transcription using Audio-Visual Fusion for Violin Practice in Home Environment
نویسندگان
چکیده
Violin practice in a home environment, where there is often no teacher available, can benefit from automatic music transcription to provide feedback to the student. This paper describes a high performance violin transcription system with three main contributions. First, as onset detection is an important but challenging task for automatic transcription of pitched non-percussive music, such as from the violin, we propose an effective audio-only onset detection approach based on supervised learning. The proposed approach outperforms the state-of-the-art methods substantially. Second, we introduce the visual modality, i.e., bowing and fingering of the violin playing, to infer onsets, thus enhancing the audio-only onset detection. We devise automatic and realtime video processing algorithms to extract indicative features of onsets from bowing and fingering videos. Third, we evaluate state-of-the-art multimodal fusion techniques to fuse audio and visual modalities and show this improves onset detection and transcription performance significantly. The audio-visual fusion based violin transcription system provides more accurate transcribed results as learning feedback even in acoustically inferior environments. With efficient and fully automatic audio-visual analysis components, the system can be easily deployed in a home environment.
منابع مشابه
Specific Music Transcription for Tutoring
An applicationspecific, musictranscription approach uses a customized human– computer interface to combine the strengths of humans and computers to enhance music transcription through instrument modeling and multimedia fusion. A utomatic music transcription (AMT) refers to the ability of computers to write note information—such as the pitch, onset time, duration, and source of each sound— after...
متن کاملiDVT: A Digital Violin Tutoring System based on Audio-Visual Fusion
iDVT (interactive Digital Violin Tutor) is a violin learning system exploiting physical and virtual resources and interactivity. It aims at providing the user with new effective learning experience. This demonstration paper briefly describes the structure of the system and the underlying audio-visual processing techniques employed in the system.
متن کاملCompositional and Programming Issues Within Lyra, a Fully Interactive Performance Environment For Violin and Kyma System
Meaningful real-time interaction between human performers and computer processing is an important aesthetic issue for many composers. With the advent of computer systems that are actually fast enough to permit real-time algorithmic sound realizations based on the analysis of live performance data the issue now facing composers is how to utilize these tools in a meaningful and artistic way. Lyra...
متن کاملAutomatic Transcription of Polyphonic Music Exploiting Temporal Evolution
Automatic music transcription is the process of converting an audio recording into a symbolic representation using musical notation. It has numerous applications in music information retrieval, computational musicology, and the creation of interactive systems. Even for expert musicians, transcribing polyphonic pieces of music is not a trivial task, and while the problem of automatic pitch estim...
متن کاملClavision: visual automatic piano music transcription
One important problem in Music Information Retrieval is Automatic Music Transcription, which is an automated conversion process from played music to a symbolic notation such as sheet music. Since the accuracy of previous audiobased transcription systems is not satisfactory, we propose an innovative visual-based automatic music transcription system named claVision to perform piano music transcri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009