Add support for batch_input #98


Merged

Conversation

HennerM
Contributor

@HennerM HennerM commented Apr 8, 2023

The backend was missing support for batch_input inputs; specifically, no batch_input was passed to the model.

I added the ability to pass these inputs through to the model, the same way normal inputs are passed through. I followed the implementation in the onnxruntime_backend and kept passing the batch_input data as a tensor on the CPU.
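
As a rough sketch of the idea (not the actual patch; the buffer and shape names are illustrative, and the Triton collector calls that produce them are elided), the batch input is wrapped in a CPU tensor and handed to the model together with the regular inputs:

```cpp
// Sketch only: pass a batch input to the model alongside the regular inputs.
// `batch_input_buffer` and `batch_input_shape` are illustrative; the real code
// obtains them from the Triton backend input collector.
#include <torch/script.h>

#include <vector>

void AppendBatchInput(
    std::vector<torch::jit::IValue>& inputs, float* batch_input_buffer,
    const std::vector<int64_t>& batch_input_shape)
{
  // Wrap the backend-provided CPU buffer in a tensor without copying,
  // mirroring the onnxruntime_backend, which also keeps batch inputs on CPU.
  auto options =
      torch::TensorOptions().dtype(torch::kFloat32).device(torch::kCPU);
  torch::Tensor batch_input = torch::from_blob(
      batch_input_buffer, torch::IntArrayRef(batch_input_shape), options);

  // Batch inputs are appended after the model's regular inputs.
  inputs.emplace_back(batch_input);
}
```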

@dyastremsky
Contributor

Thanks for making this contribution, Markus! This looks like a great change. Before we can start review, can you please submit your Contributor License Agreement (directions here)?

Also, we'll need to add tests to validate this change. If you're able to update the L0_batch_input test to ensure this passes, that would help with getting this in. The models used in those tests are generated here and here, I believe.

Contributor

@GuanLuo GuanLuo left a comment


Left a minor comment; see also @dyastremsky's points on the signed CLA and testing. Otherwise looks good to me.

src/libtorch.cc Outdated
batch_input_count_ = config_batch_inputs.ArraySize();
expected_input_cnt += batch_input_count_;
} else {
batch_input_count_ = 0;
Contributor


You can initialize to 0 in the constructor member init list.

Contributor Author


I added an initialisation at the declaration now, let me know if that's okay.
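
For illustration, the two variants being discussed look roughly like this (the surrounding class is a sketch; only batch_input_count_ comes from the snippet above):

```cpp
// Reviewer's suggestion: initialize in the constructor's member init list.
//   ModelState::ModelState(...) : batch_input_count_(0) { ... }

// What the updated PR does: a default member initializer at the declaration,
// so the member has a defined value even if a constructor doesn't set it.
class ModelState {
  // ...
  size_t batch_input_count_ = 0;
};
```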

@dyastremsky
Contributor

@HennerM Are you able to fill out and send in the CLA?

@HennerM
Contributor Author

HennerM commented May 10, 2023

> @HennerM Are you able to fill out and send in the CLA?

Sorry for the delay, we had the CLA sent through for Speechmatics to [email protected] on 4th of May

@dyastremsky
Contributor

>> @HennerM Are you able to fill out and send in the CLA?
>
> Sorry for the delay, we had the CLA sent through for Speechmatics to [email protected] on 4th of May

Thanks Markus, received! Will start working on the tests to get this merged.

@HennerM HennerM force-pushed the batch-input-support branch from 4a7753d to 5506203 on May 19, 2023 at 16:56
@HennerM
Contributor Author

HennerM commented May 19, 2023

>>> @HennerM Are you able to fill out and send in the CLA?
>>
>> Sorry for the delay, we had the CLA sent through for Speechmatics to [email protected] on 4th of May
>
> Thanks Markus, received! Will start working on the tests to get this merged.

Appreciated. I had a look at the test repo but couldn't quite figure out how it all fits together.

@dyastremsky
Contributor

Hi Henner. Keeping you updated. You can see the updated test here: triton-inference-server/server#5855

You can generate the models by going into server/qa/common and running `CUDA_DEVICE=0 ./gen_qa_model_repository`.

The batch_input support is failing, so I will try to debug it to see what needs changing. When the batch input is created on the CPU, PyTorch complains because it is on a different device than the other inputs. When it is created on the GPU instead, it segfaults.

@dyastremsky
Contributor

dyastremsky commented May 31, 2023

Thank you for this PR, @HennerM! This works and passes testing if you modify the PR to create the batch inputs on the current device rather than the CPU. As an example, please see these changes.

That PR also has some extra differences due to running the auto-formatter on libtorch.cc. The main differences are here, here, and here.

If you're able to make these changes, we can get this pull request merged. Let me know if you have any questions or comments.
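
As a rough sketch of the requested change (not the PR's actual code; `device_` and the buffer/shape arguments are illustrative, and the real code derives the device from the model instance's configuration), the batch input tensor ends up on the instance's assigned device instead of staying on the CPU:

```cpp
// Sketch only: build the batch input tensor, then move it to the device Triton
// assigned to this model instance so it matches the other input tensors.
#include <torch/script.h>

#include <vector>

torch::Tensor MakeBatchInputTensor(
    void* cpu_buffer, const std::vector<int64_t>& shape,
    const torch::Device& device_)
{
  auto cpu_tensor = torch::from_blob(
      cpu_buffer, torch::IntArrayRef(shape),
      torch::TensorOptions().dtype(torch::kFloat32));
  // .to() is a no-op when the instance runs on the CPU and copies to the GPU
  // otherwise, avoiding the "tensors on different devices" error.
  return cpu_tensor.to(device_);
}
```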

@HennerM HennerM force-pushed the batch-input-support branch 2 times, most recently from bc38192 to 625dee4 on June 5, 2023 at 17:04
@HennerM HennerM force-pushed the batch-input-support branch from 625dee4 to 8c2bcc8 on June 5, 2023 at 17:06
@HennerM
Contributor Author

HennerM commented Jun 5, 2023

I changed the batch input to create the tensor on the Triton-assigned device and ran clang-format over the .cc files. I got some formatting changes that are unrelated to my changes, though; I hope that's okay?

@dyastremsky
Contributor

> I changed the batch input to create the tensor on the Triton-assigned device and ran clang-format over the .cc files. I got some formatting changes that are unrelated to my changes, though; I hope that's okay?

Yep, that's perfect! Thank you for making the changes. Let me re-run CI quickly, then should be able to approve and merge once it passes.

@dyastremsky dyastremsky merged commit f405488 into triton-inference-server:main Jun 5, 2023
@dyastremsky
Contributor

Great work! This pull request is now merged.
