Creating doc automatically for supported models. #1929
Conversation
Looking very nice!
```python
        "name": "Llama",
        "url": "https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct",
    }
    BAICHUAN = {
```
Note to self: this is not supported in transformers
```python
        "url": "https://huggingface.co/CohereForAI/c4ai-command-r-plus",
    }
    DRBX = {
        "type": "drbx",
```
Suggested change:

```diff
-        "type": "drbx",
+        "type": "dbrx",
```
```python
__all__.append(Mamba)


class ModelType(enum.Enum):
    MAMBA = {
        "type": "ssm",
```
To be consistent with transformers:
https://huggingface.co/state-spaces/mamba-130m-hf/blob/main/config.json

Suggested change:

```diff
-        "type": "ssm",
+        "type": "mamba",
```
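As a hedged illustration of the suggestion, reading the `model_type` field from a checkpoint's config.json (the way transformers resolves model classes) could look like the sketch below. The config dict is abbreviated from the linked mamba-130m-hf file, and the dispatch branch is a hypothetical example, not TGI's actual code path.

```python
import json

# Abbreviated from the linked mamba-130m-hf config.json; the dispatch on
# `model_type` below is a hypothetical sketch, not TGI's real get_model().
config_json = '{"model_type": "mamba", "d_model": 768, "vocab_size": 50280}'
config_dict = json.loads(config_json)

model_type = config_dict.get("model_type")
if model_type == "mamba":
    print("dispatching to the Mamba implementation")
```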
Indeed.
This is the original one: https://huggingface.co/state-spaces/mamba-2.8b-slimpj
```python
        "name": "Mamba",
        "url": "https://huggingface.co/state-spaces/mamba-2.8b-slimpj",
    }
    GALACTICA = {
```
I think this is opt as well, based on the config file?
Probably not. I'm not familiar with the codebase, but we tend to remove a lot of ifs from modeling code, meaning that in TGI some model types which are the same in transformers are different here (like gpt_bigcode, which has models declared as gpt2 but whose modeling is actually quite different).
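To make that point concrete, here is a hypothetical illustration of a serving stack mapping one transformers `model_type` string to different internal implementations based on other config fields. The `pick_implementation` name and the `multi_query` check are assumptions for illustration, not TGI's actual dispatch table.

```python
# Hypothetical illustration: some gpt_bigcode-family checkpoints declare
# model_type "gpt2" in config.json but use multi-query attention, so a server
# may route them to a different implementation than plain gpt2. Names and
# the exact check here are illustrative, not TGI's real dispatch code.
def pick_implementation(config_dict):
    model_type = config_dict["model_type"]
    if model_type == "gpt2" and config_dict.get("multi_query", False):
        # declared as gpt2, but the modeling is quite different
        return "gpt_bigcode"
    return model_type

print(pick_implementation({"model_type": "gpt2", "multi_query": True}))
```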
```python
        "name": "Galactica",
        "url": "https://huggingface.co/facebook/galactica-120b",
    }
    SANTACODER = {
```
Note to self: not supported in transformers
```python
    MAMBA = {
        "type": "ssm",
        "name": "Mamba",
        "url": "https://huggingface.co/state-spaces/mamba-2.8b-slimpj",
```
Maybe `example_repo` or something like that?
It's just to output a link in the doc. It could be the paper page or anything. Here I'd like to keep things minimal.
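Assuming the field stays `url`, turning the enum entries into doc links could be sketched as below. The single-member `ModelType` stub mirrors the metadata shape from the diff, but the generation loop itself is a hypothetical sketch, not the PR's actual doc-generation code.

```python
import enum

# Illustrative stub mirroring the PR's metadata shape; the markdown
# generation below is a hypothetical sketch, not the PR's real code.
class ModelType(enum.Enum):
    MAMBA = {
        "type": "ssm",
        "name": "Mamba",
        "url": "https://huggingface.co/state-spaces/mamba-2.8b-slimpj",
    }

# One markdown bullet per supported model, linking to its repo (or paper page).
doc_lines = [f"- [{m.value['name']}]({m.value['url']})" for m in ModelType]
print("\n".join(doc_lines))
```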
We may want to update the mamba test to the following:

```diff
diff --git a/integration-tests/models/test_mamba.py b/integration-tests/models/test_mamba.py
index bf3701b..6559f8a 100644
--- a/integration-tests/models/test_mamba.py
+++ b/integration-tests/models/test_mamba.py
@@ -3,7 +3,7 @@ import pytest
 @pytest.fixture(scope="module")
 def fused_kernel_mamba_handle(launcher):
-    with launcher("state-spaces/mamba-130m", num_shard=1) as handle:
+    with launcher("state-spaces/mamba-130m-hf", num_shard=1) as handle:
         yield handle
diff --git a/server/text_generation_server/models/__init__.py b/server/text_generation_server/models/__init__.py
index 8878ad1..ad75e3f 100644
--- a/server/text_generation_server/models/__init__.py
+++ b/server/text_generation_server/models/__init__.py
@@ -246,15 +246,6 @@ def get_model(
     if speculate > 0:
         logger.info(f"Using speculation {method} with {speculate} input ids.")
-    if model_type is None:
-        # TODO: fix how we determine model type for Mamba
-        if "ssm_cfg" in config_dict:
-            # *only happens in Mamba case
-            model_type = "ssm"
-        else:
-            raise RuntimeError(
-                f"Could not determine model type for {model_id} revision {revision}"
-            )
     quantization_config = config_dict.get("quantization_config", None)
     if quantization_config is not None and quantize is None:
         method = quantization_config.get("quant_method", None)
```
We never remove that old logic. Some current deployments might depend on it.
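For reference, the fallback in question can be reduced to a small sketch. The function wrapper and its name are mine for illustration; the logic mirrors the lines shown as removed in the diff above.

```python
# Sketch of the fallback being discussed: older state-spaces Mamba
# checkpoints carry no `model_type` in config.json, only an `ssm_cfg`
# section, so the type is inferred from that key. The wrapper function is
# illustrative; the branch logic mirrors the diff above.
def infer_model_type(config_dict, model_type=None):
    if model_type is None:
        if "ssm_cfg" in config_dict:
            # only happens for original state-spaces Mamba checkpoints
            model_type = "ssm"
        else:
            raise RuntimeError("Could not determine model type")
    return model_type

print(infer_model_type({"ssm_cfg": {}, "d_model": 2560}))
```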
I tried enabling the hf format for mamba; every tensor changed around, so we would have to redo the modeling code. Not fun (and not important).
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
What does this PR do?
Fixes # (issue)
Before submitting
- Did you read the contributor guideline, Pull Request section?
- Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
- Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.