Using Deep Learning to Annotate Karaoke Songs
نویسندگان
چکیده
Karaoke is a game in which players sing over pre-recorded instrumental backing tracks. To help the singer, lyrics are usually displayed on a video screen. The synchronization between the lyrics display and the song record, often done manually, is a tedious and time-consuming task. Automation of the annotation of karaoke songs can help save time and effort. In this thesis we use the representation of songs as spectrograms to detect singing times. This timing information can be used later to align the lyrics display with a sound track. Convolutional neural networks are trained to detect at any moment in a song whether the artist is singing or not.
منابع مشابه
Efficient karaoke song recommendation via multiple kernel learning approximation
Online karaoke allows users to practice singing and distribute recordings. Different from traditional music recommendation, online karaoke need to consider users’ vocal competence besides their tastes. In this paper, we develop a karaoke recommender system by taking into account vocal competence. Alone this line, we propose a joint modeling method named MKLA by adopting bregman divergence as th...
متن کاملPersonalized Karaoke
In this paper, a personalized Karaoke system, P-Karaoke, is proposed. In the P-Karaoke system, personal home videos and photographs, which are automatically selected from users’ multimedia database according to their content, users’ preferences or music, are utilized as the background videos of the Karaoke. The selected video clips, photographs, and lyrics that obtained from Lyric Service or ma...
متن کاملOn the Effect of Using Games, Songs, and Stories on Young Iranian EFL Learners' Achievement
The objective of the present study was to identify and examine the influence of instructional tools, namely, games, songs and stories on young Iranian EFL learners’ achievement utilizing a quantitative design. To conduct the study 65 Iranian EFL learners, divided into an experimental group and a control group, learning English at Navid English Institute, Shiraz, Iran, participated in the s...
متن کاملSinger Traits Identification using Deep Neural Network
The author investigates automatic recognition of singers’ gender and age through audio features using deep neural network (DNN). Features of each singing voice, fundamental frequency and Mel-Frequency Cepstrum Coefficients (MFCC) are extracted for neural network training. 10,000 singing voice from Smule’s Sing! Karaoke app is used for training and evaluation, and the DNN-based method achieves a...
متن کاملA Music Retrieval System Based on Query-by-Singing for Karaoke Jukebox
This paper investigates the problem of retrieving Karaoke music by singing. The Karaoke music encompasses two audio channels in each track: one is a mix of vocal and background accompaniment, and the other is composed of accompaniment only. The accompaniments in the two channels often resemble each other, but are not identical. This characteristic is exploited to infer the vocal’s background mu...
متن کامل