文本生成音乐,可以通过文本描述生成“在风声中吹口哨”、“警报器和嗡嗡作响的引擎接近后走远”等特殊声音效果。

Zach Evans c61cc65257 Add 'drop_last=True' to DataLoader 8 months ago
audio_diffusion 9b861991a7 Fix imports in training file 1 year ago
dataset 6aa8fdfffa Adding jmann-large and glitch models [v0.9] 1 year ago
viz 27b9e2cdfd Adding audio diffusion code 1 year ago
.gitignore dbc09f6e22 Initial commit 1 year ago
Dance_Diffusion.ipynb 7ae1a7ba70 Update Dance_Diffusion.ipynb 8 months ago
Finetune_Dance_Diffusion.ipynb cb95fa079d Update Finetune_Dance_Diffusion.ipynb 9 months ago
LICENSE dbc09f6e22 Initial commit 1 year ago
README.md fa8e5c8087 Update README.md 1 year ago
defaults.ini 74aa9dff7c set save_wandb to none 1 year ago
setup.py 35c7fb3389 Update setup.py 1 year ago
train_uncond.py c61cc65257 Add 'drop_last=True' to DataLoader 8 months ago

README.md

sample-generator

Tools to train a generative model on arbitrary audio samples

Dance Diffusion notebook: Open In Colab

Dance Diffusion fine-tune notebook: Open In Colab

Prerequisites

Dance Diffusion requires Python 3.7+

You can install the required packages by running pip install . from the root of the repo

Todo

  • Add inference notebook
  • Add interpolations to nobebook
  • Add fine-tune notebook
  • Add guidance to notebook