Reverse Chat Template Fallback Order #963

sayanshaw24 · 2025-06-03T22:46:25Z

This change now ensures we try to parse the chat template with Minja first, and then falls back on the native implementation.

hanbitmyths

Do we need a clean-up for some specialized handling in case that minja works, so we keep codes clean?

sayanshaw24 · 2025-06-03T23:51:53Z

Do we need a clean-up for some specialized handling in case that minja works, so we keep codes clean?

You mean like this: https://github.com/microsoft/onnxruntime-extensions/pull/963/files#diff-660d41852d9740562f27b085a10a08b04b635625f45367300c30206c907b3f5aR816, or something more?

If Minja works, we get a chat template root from minja::Parser::parse that we can then call render on. If not, we attempt the native impl and finally return a kOrtxErrorInvalidArgument if that fails as well.

This PR simply switches fallback order plus any respective changes needed and ensures the status is returned correctly, it shouldn't make the code messier.

…sions into sayanshaw/chat-tmpl-fallback-order

hanbitmyths · 2025-06-04T03:19:45Z

Do we need a clean-up for some specialized handling in case that minja works, so we keep codes clean?

You mean like this: https://github.com/microsoft/onnxruntime-extensions/pull/963/files#diff-660d41852d9740562f27b085a10a08b04b635625f45367300c30206c907b3f5aR816, or something more?

If Minja works, we get a chat template root from minja::Parser::parse that we can then call render on. If not, we attempt the native impl and finally return a kOrtxErrorInvalidArgument if that fails as well.

This PR simply switches fallback order plus any respective changes needed and ensures the status is returned correctly, it shouldn't make the code messier.

What I meant is to remove a model specific code in the file like TokenizerImpl::Phi3_5ChatTemplate if minja works fine with Phi3.5 chat template. We don't need to do it in this PR, but I'd like to know this makes sense and we can clean up unnecessary codes.

sayanshaw24 · 2025-06-04T03:30:28Z

Do we need a clean-up for some specialized handling in case that minja works, so we keep codes clean?

You mean like this: https://github.com/microsoft/onnxruntime-extensions/pull/963/files#diff-660d41852d9740562f27b085a10a08b04b635625f45367300c30206c907b3f5aR816, or something more?
If Minja works, we get a chat template root from minja::Parser::parse that we can then call render on. If not, we attempt the native impl and finally return a kOrtxErrorInvalidArgument if that fails as well.
This PR simply switches fallback order plus any respective changes needed and ensures the status is returned correctly, it shouldn't make the code messier.

What I meant is to remove a model specific code in the file like TokenizerImpl::Phi3_5ChatTemplate if minja works fine with Phi3.5 chat template. We don't need to do it in this PR, but I'd like to know this makes sense and we can clean up unnecessary codes.

Oh I see, sorry did not understand what you meant before - yes absolutely we should, but I want to implement multimodal support and run some tests before we do, in case we need some of the TokenizerImpl functions to add on to model-agnostic multimodal support. Like you said, can do that in a separate PR once multimodal work is sorted.

hanbitmyths · 2025-06-04T04:40:54Z

shared/api/chat_template.cc

+    text = root->render(context);
+    output = text;
+  } catch (const std::runtime_error& e) {
+    if (model_to_template_map.count(activated_str)) {


Do we have any test cases to catch exception due to minja failure?

change chat tmpl fallback order

a98c861

sayanshaw24 requested a review from hanbitmyths June 3, 2025 23:19

sayanshaw24 marked this pull request as ready for review June 3, 2025 23:19

sayanshaw24 requested a review from a team as a code owner June 3, 2025 23:19

hanbitmyths reviewed Jun 3, 2025

View reviewed changes

update nullptr root handling

6ef2c02

Merge branch 'main' of https://github.com/microsoft/onnxruntime-exten…

1d9a325

…sions into sayanshaw/chat-tmpl-fallback-order

hanbitmyths reviewed Jun 4, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Reverse Chat Template Fallback Order #963

Reverse Chat Template Fallback Order #963

Uh oh!

sayanshaw24 commented Jun 3, 2025 •

edited

Loading

Uh oh!

hanbitmyths left a comment

Uh oh!

sayanshaw24 commented Jun 3, 2025

Uh oh!

hanbitmyths commented Jun 4, 2025

Uh oh!

sayanshaw24 commented Jun 4, 2025

Uh oh!

hanbitmyths Jun 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reverse Chat Template Fallback Order #963

Are you sure you want to change the base?

Reverse Chat Template Fallback Order #963

Uh oh!

Conversation

sayanshaw24 commented Jun 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hanbitmyths left a comment

Choose a reason for hiding this comment

Uh oh!

sayanshaw24 commented Jun 3, 2025

Uh oh!

hanbitmyths commented Jun 4, 2025

Uh oh!

sayanshaw24 commented Jun 4, 2025

Uh oh!

hanbitmyths Jun 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sayanshaw24 commented Jun 3, 2025 •

edited

Loading

hanbitmyths Jun 4, 2025 •

edited

Loading