Merge Openai api version route to main #1021
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1021
Note: Links to docs will display an error until the docs builds have been completed.
❌ 5 New Failures as of commit 716485c with merge base f384d4f.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
README.md (Outdated)
@@ -200,6 +200,10 @@ streamlit run torchchat.py -- browser llama3.1
<details>
<summary>This mode gives a REST API that matches the OpenAI API spec for interacting with a model</summary>

The server follows the [OpenAI API specification](https://platform.openai.com/docs/api-reference/chat) for chat completions.
Since this feature is under active development, it's possible not every parameter is consumed. See api/api.py for details on
Suggested change:
- Since this feature is under active development, it's possible not every parameter is consumed. See api/api.py for details on
+ Since this feature is under active development, not every parameter is consumed. See api/api.py for details on
@@ -270,7 +271,13 @@ def chunked_completion(self, completion_request: CompletionRequest):
        )
        generator_args = GeneratorArgs(
            completion_request.messages[-1].get("content"),
            max_new_tokens=(
                int(completion_request.max_tokens)
why cast?
The request object was being populated from JSON, which means the value of every field arrives as a str despite the dataclass type hints.
These casts were added to get things working, but yes, this is a bad solution and we should have better type enforcement here (and possibly repo-wide).
Based on what I can find, we can resolve this by casting the types at the object's init time for each field (kinda tedious) or by using something like pydantic (yet another package to manage).
Should we attack this in another issue/PR or try to resolve it here?
Casting at init time makes sense, since downstream users shouldn't need to think about it
We can make it a separate PR or push the casting to init just for max_tokens and temperature
Created #1025 to track this, will merge this PR for now.
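For reference, the "casting at init time" approach discussed above can be sketched with a dataclass `__post_init__` hook. This is a hypothetical illustration, not torchchat's actual `CompletionRequest` definition; the field names mirror the two fields mentioned in this thread, but the real class in api/api.py has more fields.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CompletionRequest:
    # Illustrative subset of fields; values deserialized from JSON
    # may arrive as strings despite these type hints.
    max_tokens: Optional[int] = None
    temperature: Optional[float] = None

    def __post_init__(self) -> None:
        # Coerce string values to the annotated numeric types once,
        # so downstream code never needs to cast.
        if self.max_tokens is not None:
            self.max_tokens = int(self.max_tokens)
        if self.temperature is not None:
            self.temperature = float(self.temperature)


# Fields populated from JSON as strings are normalized at construction:
req = CompletionRequest(max_tokens="256", temperature="0.7")
print(req.max_tokens, req.temperature)  # → 256 0.7
```

With this in place, the `int(...)` and `float(...)` casts in `chunked_completion` would become unnecessary. A library like pydantic would do the same coercion (and validation) automatically, at the cost of an extra dependency.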
            encoded_prompt=encoded,
            temperature=float(completion_request.temperature),
ditto
See previous re: typecasting for dataclasses.
Force-pushed from 01bc092 to d1a0da8.
#1016 mistakenly got merged into this development branch instead of main.