Implicit state management #103


Merged
5 commits merged into triton-inference-server:main, Jul 7, 2023

Conversation

jamied157
Contributor

Addresses triton-inference-server/server#5609

I tried to copy the onnxruntime backend as much as possible. To work with how PyTorch treats input and output names, I've tried to only allow the "INPUT__X/OUTPUT__X" naming convention when a state field is defined.
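For context, the implicit state feature pairs the PyTorch backend's "INPUT__X/OUTPUT__X" naming convention with a `sequence_batching { state { ... } }` block in the model config. A hypothetical `config.pbtxt` sketch (dims and dtypes are made up for illustration) might look like:

```protobuf
backend: "pytorch"
max_batch_size: 8
input [
  { name: "INPUT__0", data_type: TYPE_FP32, dims: [ 4 ] }
]
output [
  { name: "OUTPUT__0", data_type: TYPE_FP32, dims: [ 4 ] }
]
sequence_batching {
  state [
    {
      input_name: "INPUT__1"
      output_name: "OUTPUT__1"
      data_type: TYPE_FP32
      dims: [ 4 ]
    }
  ]
}
```

Here the state tensor is fed back to the model as `INPUT__1` on the next request and produced as `OUTPUT__1`, so the indices after the double underscore still line up with the TorchScript model's positional inputs and outputs.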

@Tabrizian
Member

Really appreciate your contribution to the Triton project! Could you please sign the CLA as instructed here?

@Tabrizian Tabrizian self-assigned this Apr 27, 2023
@jamied157
Contributor Author

Sorry for the delay on this; that should be sent over now.

@jamied157
Contributor Author

Just wanted to bump this again: have you received the CLA? I'm aware I need to add some tests from the qa folder in the main server repo; could you point me to the ones I should be looking at?

@Tabrizian
Member

Hi @jamied157, I can confirm that we've received the CLA. You need to add libtorch to the list of BACKENDS here. You also need to add model generation scripts for the models to https://github.com/triton-inference-server/server/blob/main/qa/common/gen_qa_implicit_models.py

It might be a bit difficult to get started with testing infra. Feel free to give it a shot and if you run into issues I can take care of testing.

@jamied157
Contributor Author

Hi again, I haven't been able to have a look at the tests yet (and it might take me a while), so feel free to get started if you can. Otherwise I'll take a look when I've got the time.

Member

@Tabrizian Tabrizian left a comment


The PR mostly looks good. We probably need to add some testing around different naming conventions too to make sure it is working properly.

src/libtorch.cc Outdated
@@ -35,6 +36,7 @@
#include "triton/backend/backend_output_responder.h"
#include "triton/common/nvtx.h"
#include "triton/core/tritonbackend.h"
#include "triton/core/tritonserver.h"
Member


I don't think this import is required.

Contributor Author


This is removed

// can have intersection with the outputs section of the model. If an output
// is specified both in the output section and state section, it indicates
// that the backend must return the output state to the client too.
std::map<std::string, std::pair<int64_t, int64_t>> model_outputs_;
Member


Can you mention in the comment that the first element of the pair is the model output index and the second is the state index? -1 is used if either of them is not required.

Contributor Author


Done

src/libtorch.cc Outdated
input_index_map_[io_name] =
std::distance(allowed_inputs.begin(), itr);
}
return;
Member


I think you should return an error in the else condition, since the input is not valid.


src/libtorch.cc Outdated
.c_str());
}


Member


Remove extra line

Contributor Author


I think this is gone

src/libtorch.cc Outdated
@@ -25,6 +25,7 @@
// OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.

#include <stdint.h>
#include <cstdint>
Member


Where is this import needed?

Contributor Author


This is gone

@jamied157
Contributor Author

Thanks for continuing to work on this!

@Tabrizian Tabrizian merged commit 00b38a9 into triton-inference-server:main Jul 7, 2023