Fast Jukebox: Accelerating Music Generation with Knowledge Distillation
نویسندگان
چکیده
The Jukebox model can generate high-diversity music within a single system, which is achieved by using hierarchical VQ-VAE architecture to compress audio in discrete space at different compression levels. Even though the results are impressive, inference stage tremendously slow. To address this issue, we propose Fast Jukebox, uses knowledge distillation strategies reduce number of parameters prior for compressed space. Since has shown highly diverse generation capabilities, used simple compilation songs experimental purposes. Evaluation obtained emotional valence show that proposed approach tendency towards actively pleasant, thus reducing time all levels without compromising quality.
منابع مشابه
Topic Distillation with Knowledge Agents
This is the second year that our group participates in TREC’s Web track. Our experiments focused on the Topic distillation task. Our main goal was to experiment with the Knowledge Agent (KA) technology [1], previously developed at our Lab, for this particular task. The knowledge agent approach was designed to enhance Web search results by utilizing domain knowledge. We first describe the generi...
متن کاملSequence-Level Knowledge Distillation
Neural machine translation (NMT) offers a novel alternative formulation of translation that is potentially simpler than statistical approaches. However to reach competitive performance, NMT models need to be exceedingly large. In this paper we consider applying knowledge distillation approaches (Bucila et al., 2006; Hinton et al., 2015) that have proven successful for reducing the size of neura...
متن کاملFast Generation of Optimal Music Playlists using Local Search
We present an algorithm for use in an interactive music system that automatically generates music playlists that fit the music preferences given by a user. To this end, we introduce a formal model, define the problem of automatic playlist generation (APG) and indicate its NP-hardness. We use a local search (LS) procedure based on simulated annealing (SA) to solve the APG problem. In order to em...
متن کاملMusic Generation with Relation Join
Given a data set taken over a population, the question of how can we construct possible causal explanatory models for the interactions and dependencies in the population is a causal discovery question. Projection and Relation Join is a way of addressing this question in a non-deterministic context with mathematical relations. In this paper, we apply projection and relation join to music harmoni...
متن کاملMusic Generation with Deep Learning
The use of deep learning to solve the problems in literary arts has been a recent trend that gained a lot of attention and automated generation of music has been an active area. This project deals with the generation of music using raw audio files in the frequency domain relying on various LSTM architectures. Fully connected and convolutional layers are used along with LSTM’s to capture rich fe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13095630