Fix roberta conversion bugs #964
Conversation
@Njuapp Can you sign your commits?
core/util/trt_util.cpp (Outdated)
@@ -238,6 +238,7 @@ const std::unordered_map<at::ScalarType, nvinfer1::DataType>& get_at_trt_type_ma
   {at::kFloat, nvinfer1::DataType::kFLOAT},
   {at::kHalf, nvinfer1::DataType::kHALF},
   {at::kInt, nvinfer1::DataType::kINT32},
+  {at::kLong, nvinfer1::DataType::kINT32},
I don't want to just associate all instances of Long with kINT32. If there are cases of kLong we should explicitly handle those cases and explain what we want to do.
So in this case, we can add a code path for Long in the aten::to evaluator (when truncate is enabled) and print a warning.
Force-pushed eb69bf2 to 4153778
Signed-off-by: Cheng Hang <[email protected]>
I have run cpp_lint with clang-format-9, but it still fails in CI/CD as shown above. I don't understand why. Maybe you can help check cpp_lint too.
I can fix the linting
LGTM
Description
Fix bugs encountered when converting a RoBERTa model.
Fixes #963
Specifically, it fixes three things:
1. In aten::ne.Scalar(Tensor self, Scalar other) -> (Tensor), the datatype of the Scalar is initialized to float by default. It should be int32 in this case.
2. In aten::cumsum(Tensor self, int dim, *, int? dtype=None) -> (Tensor), the zeroValue which stores the runningSum is initialized to float by default. It should be int32 in this case.
3. In aten::to.dtype(Tensor self, int dtype, bool non_blocking=False, bool copy=False, int? memory_format=None) -> (Tensor), the tensor could be cast to the long datatype, which our code cannot process. We simply add an entry in get_at_trt_type_map, where at::kLong is mapped to nvinfer1::DataType::kINT32, since TensorRT does not support long.
Type of change
Please delete options that are not relevant and/or add your own.
Checklist: