add stable audio tools as a library + code snippets. #741

Vaibhavs10 · 2024-06-06T15:36:10Z

Adds inference snippets and download count support for stable audio tools.

Vaibhavs10 · 2024-06-06T15:41:50Z

The failing test is unrelated to this PR.

Wauplin

Metadata in model-libraries.ts looks good to me. Added a comment about the snippet. Better to wait for review from others :)

Wauplin · 2024-06-06T16:28:24Z

packages/tasks/src/model-libraries-snippets.ts

@@ -326,6 +326,50 @@ export const sklearn = (model: ModelData): string[] => {
 	}
 };

+export const stable_audio_tools = (model: ModelData): string[] => [


Code snippets are usually much smaller than this (basically from stable_audio_tools import get_pretrained_model + model, model_config = get_pretrained_model("${model.id}")).

Are we sure the whole snippet is valid for all models tagged as stable-audio-tools? (Or is there only one model?) I'm not against a more complete snippet if you think it makes sense though. Happy to get opinions from others as well.
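For illustration, the smaller form described in this comment could be sketched roughly as below. This is only a sketch under assumptions: `ModelData` is a reduced stand-in for the real type in `packages/tasks`, the `export` keyword is left out so the block runs on its own, and the repo id in the usage line is hypothetical. It is not necessarily the snippet that was merged.

```typescript
// Reduced stand-in for the ModelData type (assumption: only `id` is needed here).
interface ModelData {
	id: string;
}

// Loading-only snippet, parameterized on the repo id so it is not tied
// to a single checkpoint and can be rendered for fine-tunes as well.
const stable_audio_tools = (model: ModelData): string[] => [
	`from stable_audio_tools import get_pretrained_model

model, model_config = get_pretrained_model("${model.id}")`,
];

// Usage: render the snippet for a hypothetical fine-tuned repo.
const [snippet] = stable_audio_tools({ id: "some-user/some-finetune" });
console.log(snippet);
```

Interpolating `model.id` rather than hard-coding `stabilityai/stable-audio-open-1.0` is what keeps the rendered snippet meaningful for other repos tagged with the same library.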

This is only one model - but they support other models (although there are no pre-trained models supported)

Let me see if I can reduce the snippet size.

Here's the shortened snippet.

import torch
import torchaudio
from einops import rearrange
from stable_audio_tools import get_pretrained_model
from stable_audio_tools.inference.generation import generate_diffusion_cond

device = "cuda" if torch.cuda.is_available() else "cpu"

# Download model
model, model_config = get_pretrained_model("stabilityai/stable-audio-open-1.0")
sample_rate = model_config["sample_rate"]
sample_size = model_config["sample_size"]

model = model.to(device)

# Set up text and timing conditioning
conditioning = [{
    "prompt": "128 BPM tech house drum loop",
}]

# Generate stereo audio
output = generate_diffusion_cond(
    model,
    conditioning=conditioning,
    sample_size=sample_size,
    device=device
)

# Rearrange audio batch to a single sequence
output = rearrange(output, "b d n -> d (b n)")

# Peak normalize, clip, convert to int16, and save to file
output = output.to(torch.float32).div(torch.max(torch.abs(output))).clamp(-1, 1).mul(32767).to(torch.int16).cpu()
torchaudio.save("output.wav", output, sample_rate)

This is too complex imo. Usually the snippets don't have things such as loading into GPU for example. We usually also don't show the actual inference (and just do the loading), but it might be fine for this case.

Although it's just one model, the library supports fine-tuning, so we need to make sure our snippet would work for fine-tuned models too.

I updated the code snippet to the minified code snippet as mentioned above!

That's a good point, happy to hear other opinions :)

no preference on my side!

No strong opinion either, especially since it can be reassessed later on if we see it breaks on some finetunes.

Cool! Should we merge with the current snippet for now? I'm setting a reminder to revisit this in 20 or so days.

Fine for me!

Wauplin · 2024-06-07T10:17:16Z

Fine for me!

Vaibhavs10 · 2024-06-07T12:23:48Z

Given there is more or less agreement here, I will merge this!

NielsRogge · 2024-06-09T18:30:51Z

@Vaibhavs10 does the model card also require "library_name: stable_audio" to be added? https://huggingface.co/stabilityai/stable-audio-open-1.0

Wauplin · 2024-06-10T07:30:39Z

@NielsRogge Yes it does! Actually, it requires library_name: stable-audio-tools in the model card metadata.
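To illustrate why the exact string matters, here is a hedged sketch of a lookup keyed on `library_name`. The `SNIPPETS` map and `snippetsFor` helper are hypothetical names made up for this example and are not the Hub's actual implementation; the point is only that a key such as `stable_audio` would not match an entry registered as `stable-audio-tools`.

```typescript
// Reduced stand-in for ModelData (assumption: only these two fields matter here).
interface ModelData {
	id: string;
	library_name?: string;
}

type SnippetFn = (model: ModelData) => string[];

// Hypothetical registry keyed by the exact library_name value from the model card.
const SNIPPETS = new Map<string, SnippetFn>([
	[
		"stable-audio-tools",
		(model) => [
			`from stable_audio_tools import get_pretrained_model

model, model_config = get_pretrained_model("${model.id}")`,
		],
	],
]);

// Hypothetical helper: returns the snippets for a model, or an empty list
// when the model card's library_name is missing or does not match any key.
function snippetsFor(model: ModelData): string[] {
	const fn = model.library_name !== undefined ? SNIPPETS.get(model.library_name) : undefined;
	return fn ? fn(model) : [];
}

const repo = "stabilityai/stable-audio-open-1.0";
console.log(snippetsFor({ id: repo, library_name: "stable-audio-tools" }).length);
console.log(snippetsFor({ id: repo, library_name: "stable_audio" }).length);
```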

NielsRogge · 2024-06-10T07:35:13Z

Ok, opened a PR here: https://huggingface.co/stabilityai/stable-audio-open-1.0/discussions/25

Vaibhavs10 · 2024-06-10T14:13:14Z

Thanks for the ping, @NielsRogge, and for taking care of this. I made a note in my internal doc not to forget about this!

add stable audio tools as a library + code snippets.

a5972de

Vaibhavs10 requested review from osanseviero, SBrandeis, gary149, Wauplin, julien-c and pcuenca as code owners June 6, 2024 15:36

Wauplin approved these changes Jun 6, 2024

Vaibhavs10 and others added 4 commits June 6, 2024 21:46

Update packages/tasks/src/model-libraries.ts

9d82d55

Co-authored-by: Lucain <[email protected]>

Chop the code snippet 🔪

a51d109

Merge branch 'add-stable-audio' of https://github.com/huggingface/hug…

670445e

…gingface.js into add-stable-audio

Merge branch 'main' into add-stable-audio

dec6fb0

Wauplin approved these changes Jun 7, 2024

Vaibhavs10 merged commit cf70e66 into main Jun 7, 2024
4 checks passed

Vaibhavs10 deleted the add-stable-audio branch June 7, 2024 12:23
