audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch

This is an exact mirror of the audio-diffusion-pytorch project, hosted at https://github.com/archinetai/audio-diffusion-pytorch. SourceForge is not affiliated with audio-diffusion-pytorch.

Last Update: 2023-03-29

A fully featured audio diffusion library for PyTorch. It includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based; however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are both generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here; this library is meant for research purposes.
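
To make the idea of a diffusion method and sampler working directly on a waveform more concrete, here is a minimal sketch in plain Python. It does not use the library's API. It assumes one common formulation, the v-objective, which this page does not confirm as the library's choice. The names alpha_beta, add_noise, sample, and make_oracle are hypothetical, and the "model" is an oracle that already knows the clean signal, standing in for a trained network.

```python
import math
import random


def alpha_beta(sigma):
    """Map a noise level sigma in [0, 1] to signal and noise weights (assumed schedule)."""
    angle = sigma * math.pi / 2
    return math.cos(angle), math.sin(angle)


def add_noise(x, noise, sigma):
    """Diffusion method: mix a clean waveform with noise and build the v target."""
    a, b = alpha_beta(sigma)
    x_noisy = [a * xi + b * ni for xi, ni in zip(x, noise)]
    v_target = [a * ni - b * xi for xi, ni in zip(x, noise)]
    return x_noisy, v_target


def sample(predict_v, x_noisy, num_steps):
    """Sampler: walk the noise level from 1 down to 0 in equal steps."""
    sigmas = [1 - i / num_steps for i in range(num_steps + 1)]
    for sigma, sigma_next in zip(sigmas[:-1], sigmas[1:]):
        a, b = alpha_beta(sigma)
        a_next, b_next = alpha_beta(sigma_next)
        v = predict_v(x_noisy, sigma)
        # Recover estimates of the clean signal and the noise from v.
        x_pred = [a * xn - b * vi for xn, vi in zip(x_noisy, v)]
        noise_pred = [b * xn + a * vi for xn, vi in zip(x_noisy, v)]
        # Re-mix both estimates at the next, lower noise level.
        x_noisy = [a_next * xp + b_next * nz for xp, nz in zip(x_pred, noise_pred)]
    return x_noisy


def make_oracle(clean):
    """Hypothetical stand-in for a trained network: returns the exact v for a known signal."""
    def predict_v(x_noisy, sigma):
        a, b = alpha_beta(sigma)
        noise = [(xn - a * xi) / b for xn, xi in zip(x_noisy, clean)]
        return [a * ni - b * xi for xi, ni in zip(clean, noise)]
    return predict_v


random.seed(0)
clean = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]  # toy mono waveform
start = [random.gauss(0.0, 1.0) for _ in clean]                  # pure noise
restored = sample(make_oracle(clean), start, num_steps=10)
max_error = max(abs(r - c) for r, c in zip(restored, clean))
```

Because the oracle predicts v exactly, the sampler returns the clean waveform up to floating-point error. With a trained network in its place, the same loop would generate new audio instead. Nothing in the loop depends on the data being one-dimensional beyond the element-wise arithmetic, which is the sense in which such samplers can be generic to other formats.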

Features

Unconditional Generator
Text-Conditional Generator
Diffusion Upsampler
Diffusion Vocoder
Diffusion Autoencoder
Inpainting

License

MIT License

Additional Project Details

Programming Language

Python

Related Categories

Python AI Music Generators, Python Generative AI, Python Inpainting Tool

Registered

2023-03-28
