A fully featured audio diffusion library, for PyTorch. Includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based, however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are both generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here, this library is meant for research purposes.

Features

  • Unconditional Generator
  • Text-Conditional Generator
  • Diffusion Upsampler
  • Diffusion Vocoder
  • Diffusion Autoencoder
  • Inpainting

Project Samples

Project Activity

See All Activity >