Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch.
The approach is essentially a text-conditioned AudioLM, but, surprisingly, the conditioning comes from the embeddings of MuLan, a joint text-audio model trained with contrastive learning. MuLan is what will be built out in this repository; AudioLM is adapted from its own repository to support the music generation needs here.
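To make the idea of a text-audio contrastive model more concrete, here is a minimal sketch of a symmetric contrastive objective over a batch of paired audio and text embeddings. It is written in plain Python for readability and is not this repository's code. The function names, the temperature value, and the choice of a softmax-based (InfoNCE-style) loss are assumptions for illustration; the description above does not say which exact loss MuLan uses.

```python
import math

def l2_normalize(vec):
    # Scale a vector to unit length so dot products become cosine similarities.
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]

def cross_entropy_row(logits, target_index):
    # Negative log-softmax of the target entry, computed stably.
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum - logits[target_index]

def contrastive_loss(audio_embeds, text_embeds, temperature=0.1):
    # Hypothetical helper: audio_embeds[i] and text_embeds[i] are assumed
    # to describe the same clip; all other pairings act as negatives.
    audio = [l2_normalize(a) for a in audio_embeds]
    text = [l2_normalize(t) for t in text_embeds]
    n = len(audio)

    # Pairwise similarity matrix, scaled by the (assumed) temperature.
    sim = [
        [sum(a * t for a, t in zip(audio[i], text[j])) / temperature
         for j in range(n)]
        for i in range(n)
    ]

    # Audio-to-text: each row should pick out its own column.
    a2t = sum(cross_entropy_row(sim[i], i) for i in range(n)) / n
    # Text-to-audio: each column should pick out its own row.
    t2a = sum(
        cross_entropy_row([sim[i][j] for i in range(n)], j) for j in range(n)
    ) / n
    return 0.5 * (a2t + t2a)

# Toy batch: two clips whose audio and text embeddings line up exactly.
audio_batch = [[1.0, 0.0], [0.0, 1.0]]
aligned_text = [[1.0, 0.0], [0.0, 1.0]]
shuffled_text = [[0.0, 1.0], [1.0, 0.0]]

aligned_loss = contrastive_loss(audio_batch, aligned_text)
shuffled_loss = contrastive_loss(audio_batch, shuffled_text)
```

With this kind of objective, correctly paired audio and text end up close together in a shared embedding space, so the loss for the aligned batch is much smaller than for the shuffled one. A shared space of that sort is what allows a text embedding to serve as the conditioning signal for audio generation.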
MusicLM - Pytorch
Implementation of MusicLM music generation model in Pytorch

