enable flash mistral model for HPU device #594

Merged
regisss merged 2 commits into huggingface:main from kaixuanliu:flash-mistral on Apr 21, 2025

Conversation

kaixuanliu (Contributor)

This PR enables op-level optimizations for Mistral-type models. It currently supports the HPU device and improves peak throughput from 124 sentences/s to 133 sentences/s compared with the Optimum Habana modeling code. (We use Salesforce/SFR-Embedding-2_R for the benchmark.)
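For anyone wanting to sanity-check the throughput numbers, below is a minimal client-side benchmark sketch against a running text-embeddings-inference server. The endpoint URL, batch size, batch count, and sample sentence are illustrative assumptions, not values from this PR.

```python
# Minimal sketch of a client-side throughput benchmark against a running
# text-embeddings-inference (TEI) server. URL, batch size, batch count, and
# the sample text below are assumptions for illustration only.
import time

import requests

TEI_URL = "http://localhost:8080/embed"  # assumed local TEI endpoint
BATCH_SIZE = 32                          # assumed batch size
NUM_BATCHES = 50                         # assumed number of timed batches
SENTENCES = ["What is Deep Learning?"] * BATCH_SIZE  # placeholder input

# Warm-up request so first-batch graph compilation on HPU does not skew timing.
requests.post(TEI_URL, json={"inputs": SENTENCES}).raise_for_status()

start = time.perf_counter()
for _ in range(NUM_BATCHES):
    resp = requests.post(TEI_URL, json={"inputs": SENTENCES})
    resp.raise_for_status()
elapsed = time.perf_counter() - start

print(f"Throughput: {NUM_BATCHES * BATCH_SIZE / elapsed:.1f} sentences/s")
```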

kaixuanliu (Contributor, Author)

@regisss @Narsil please help review.

regisss (Collaborator) previously approved these changes on Apr 21, 2025 and left a comment:

LGTM

regisss commented Apr 21, 2025

cc @Narsil

regisss commented Apr 21, 2025

@kaixuanliu It seems there is trailing whitespace in backends/python/server/text_embeddings_server/models/flash_mistral.py, can you remove it please?
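For reference, one quick way to strip trailing whitespace from the file named above is a short standard-library one-off like the sketch below; the path is taken from the comment, everything else is just an illustration.

```python
# One-off sketch: strip trailing whitespace from every line of the file
# mentioned above, rewriting it in place.
from pathlib import Path

path = Path("backends/python/server/text_embeddings_server/models/flash_mistral.py")
lines = path.read_text().splitlines()
path.write_text("\n".join(line.rstrip() for line in lines) + "\n")
```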

kaixuanliu (Contributor, Author)

@regisss Oh sorry, I added unnecessary code by mistake; I have deleted it.

regisss merged commit d8021c3 into huggingface:main on Apr 21, 2025
3 of 13 checks passed
kaixuanliu deleted the flash-mistral branch on April 23, 2025 at 03:08