A fully featured audio diffusion library for PyTorch. It includes models for unconditional audio generation, text-conditional audio generation, diffusion autoencoding, upsampling, and vocoding. The provided models are waveform-based; however, the U-Net (built using a-unet), DiffusionModel, diffusion method, and diffusion samplers are both generic to any dimension and highly customizable to work on other formats. Note: no pre-trained models are provided here; this library is meant for research purposes.
Features
- Unconditional Generator
- Text-Conditional Generator
- Diffusion Upsampler
- Diffusion Vocoder
- Diffusion Autoencoder
- Inpainting
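The features above share one basic mechanism: a clean waveform is mixed with Gaussian noise at some noise level, and a network is trained to undo that mixing. The sketch below illustrates this with one common formulation (a velocity, or "v", objective) using only the Python standard library. It is an assumption chosen for illustration, not the library's API and not confirmed by this page as the library's method; the function names `noise_waveform` and `recover_clean` are hypothetical.

```python
import math
import random


def noise_waveform(x0, t, rng):
    """Mix a clean waveform with Gaussian noise at noise level t in [0, 1]."""
    alpha = math.cos(t * math.pi / 2)  # weight on the clean signal
    beta = math.sin(t * math.pi / 2)   # weight on the noise
    noise = [rng.gauss(0.0, 1.0) for _ in x0]
    x_t = [alpha * s + beta * n for s, n in zip(x0, noise)]
    # Velocity target that a denoising network could be trained to predict
    # (assumed objective, for illustration only).
    v = [alpha * n - beta * s for s, n in zip(x0, noise)]
    return x_t, v


def recover_clean(x_t, v, t):
    """Invert the mixing given the exact velocity: x0 = alpha * x_t - beta * v."""
    alpha = math.cos(t * math.pi / 2)
    beta = math.sin(t * math.pi / 2)
    return [alpha * xt - beta * vi for xt, vi in zip(x_t, v)]


rng = random.Random(0)
# Hypothetical 1-D "waveform": one sine cycle sampled at 16 points.
x0 = [math.sin(2 * math.pi * i / 16) for i in range(16)]
x_t, v = noise_waveform(x0, 0.5, rng)
x0_hat = recover_clean(x_t, v, 0.5)
```

With a perfect velocity prediction the clean signal is recovered exactly (up to floating-point error). In practice the velocity comes from a trained network, and a sampler steps from pure noise toward the signal over several noise levels.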
License
MIT License