Song Genre and Artist Classification via Supervised Learning from Lyrics
نویسندگان
چکیده
Motivation The amount of raw data available online has increased dramatically over the past few years; in order for us to maintain the usability of this data we must develop effective ways to efficiently and automatically organize it. For our CS 224N final project, we chose to develop a classifier that classifies songs into genres and/or artists based solely on their lyrics. We primarily focused on developing lyric-specific features that would allow a classifier to easily distinguish between songs from different genres.
منابع مشابه
Artist Attribution via Song Lyrics
Song lyrics, separated from the audio signal of their song, still contain a significant amount of information. Mood and meaning can still be conveyed effectively by a pure textual representation. There has even been somewhat successful previous work on genre classification from song lyrics[7]. Building on previous work, we seek to build an artist attribution system for song lyrics. This task is...
متن کاملPredicting genre labels for artist using FreeDB
This paper explores the value of FreeDB as a source of genre and music similarity information. FreeDB is a public, dynamic, uncurated database for identifying and labeling CDs with album, song, artist and genre information. One quality of FreeDB is that there is high variance in, e.g., the genre labels assigned to a particular disc. We investigate here the ability to use these genre labels to p...
متن کاملLyrics-Based Music Genre Classification Using a Hierarchical Attention Network
Music genre classification, especially using lyrics alone, remains a challenging topic in Music Information Retrieval. In this study we apply recurrent neural network models to classify a large dataset of intact song lyrics. As lyrics exhibit a hierarchical layer structure—in which words combine to form lines, lines form segments, and segments form a complete song—we adapt a hierarchical attent...
متن کاملMulti-modal Analysis of Music: A large-scale Evaluation
Multimedia data by definition comprises several different types of content modalities. Music specifically inherits e.g. audio at its core, text in the form of lyrics, images by means of album covers, or video in the form of music videos. Yet, in many Music Information Retrieval applications, only the audio content is utilised. Recent studies have shown the usefulness of incorporating other moda...
متن کاملI-Vectors for Timbre-Based Music Similarity and Music Artist Classification
In this paper, we present a novel approach to extract songlevel descriptors built from frame-level timbral features such as Mel-frequency cepstral coefficient (MFCC). These descriptors are called identity vectors or i-vectors and are the results of a factor analysis procedure applied on framelevel features. The i-vectors provide a low-dimensional and fixed-length representation for each song an...
متن کامل