
feat: Lazy engine initialization #2997


Merged
merged 1 commit into from
Aug 5, 2024

Conversation

narendasan (Collaborator)

Description

Allows engines to be set up all at once just before the module is returned to the user, rather than immediately after compilation.

Fixes #2673
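The deferred-setup flow described above can be sketched in plain Python. All names here are illustrative stand-ins, not the actual Torch-TensorRT API: modules record their serialized engines at compile time, and a single final pass initializes every engine just before the compiled module is handed back.

```python
class LazyEngineModule:
    """Illustrative stand-in for a TRT runtime module.

    Holds a serialized engine and defers the (expensive)
    deserialization until setup_engine() is called.
    """

    def __init__(self, serialized_engine: bytes) -> None:
        self.serialized_engine = serialized_engine
        self.engine = None  # not built yet

    def setup_engine(self) -> None:
        # Stand-in for deserializing the engine with the TRT runtime.
        self.engine = f"engine({len(self.serialized_engine)} bytes)"


def compile_modules(blobs):
    # Compilation phase: wrap each engine without initializing it.
    modules = [LazyEngineModule(b) for b in blobs]
    assert all(m.engine is None for m in modules)
    # Final pass: set every engine up at once, just before
    # returning the compiled module to the user.
    for m in modules:
        m.setup_engine()
    return modules


mods = compile_modules([b"\x00" * 4, b"\x01" * 8])
```

The benefit of this ordering is that compilation of all subgraphs can finish (and engines can be serialized, cached, or refitted) before any device-side engine state is created.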

Type of change

Please delete options that are not relevant and/or add your own.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

Checklist:

  • My code follows the style guidelines of this project (You can use the linters)
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas and hacks
  • I have made corresponding changes to the documentation
  • I have added tests to verify my fix or my feature
  • New and existing unit tests pass locally with my changes
  • I have added the relevant labels to my PR so that relevant reviewers are notified

@github-actions bot added labels on Jul 10, 2024: component: tests, component: conversion, component: api [Python], component: runtime, component: dynamo
@narendasan requested review from zewenli98 and peri044 on July 10, 2024 23:46
@github-actions bot requested a review from gs-olive on July 10, 2024 23:46
@@ -50,13 +50,16 @@ class TorchTensorRTModule(torch.nn.Module): # type: ignore[misc]
output_binding_names (List[str]): List of output TensorRT engine binding names in the order they should be returned
"""

defer_engine_setup = False
narendasan (Collaborator, Author) commented:
Remove
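For context, a class-level flag like `defer_engine_setup` could gate initialization in the constructor. The following is a hypothetical sketch of that pattern in plain Python (the names and constructor are illustrative, not the final implementation):

```python
class Module:
    # Class-level default: engines are set up eagerly unless
    # a caller opts into deferred setup.
    defer_engine_setup = False

    def __init__(self, serialized_engine: bytes) -> None:
        self.serialized_engine = serialized_engine
        self.engine = None
        if not self.defer_engine_setup:
            self.setup_engine()

    def setup_engine(self) -> None:
        # Stand-in for engine deserialization.
        self.engine = "initialized"


eager = Module(b"...")  # engine built immediately in __init__
Module.defer_engine_setup = True
lazy = Module(b"...")   # engine left unbuilt at construction
lazy.setup_engine()     # caller finishes setup later
```

One drawback of a mutable class-level flag, as opposed to a constructor argument, is that it changes behavior globally for all subsequently constructed instances, which may be why its removal was requested here.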

@narendasan force-pushed the lazy_engine_loading branch from fe2cd6a to f2bf073 on July 11, 2024 00:14
@peri044 (Collaborator) left a comment:
LGTM. Could you add a test case with a model that has fallback ops?

@narendasan force-pushed the lazy_engine_loading branch 3 times, most recently from 366ecc6 to 37adede on July 12, 2024 19:58
@zewenli98 (Collaborator) left a comment:
LGTM

@narendasan force-pushed the lazy_engine_loading branch 2 times, most recently from 7c77ffe to f976de9 on July 31, 2024 15:42
@github-actions bot left a comment:
There are some changes that do not conform to Python style guidelines:

--- /home/runner/work/TensorRT/TensorRT/tests/py/dynamo/conversion/harness.py	2024-07-31 15:42:55.193957+00:00
+++ /home/runner/work/TensorRT/TensorRT/tests/py/dynamo/conversion/harness.py	2024-07-31 15:44:51.304037+00:00
@@ -62,11 +62,11 @@
        interpreter,
        rtol,
        atol,
        check_dtype=True,
        pyt_inputs=None,
-        rt_cls=PythonTorchTensorRTModule
+        rt_cls=PythonTorchTensorRTModule,
    ):
        with torch.no_grad():
            cuda_inputs = []
            for i in inputs:
                cuda_inputs.append(i.cuda())
@@ -132,11 +132,11 @@
        inputs,
        expected_ops,
        interpreter,
        comparators: List[Tuple[Callable, List]],
        fp16_mode=False,
-        rt_cls=PythonTorchTensorRTModule
+        rt_cls=PythonTorchTensorRTModule,
    ):
        """
        Runs the test and compares the result using the provided comparators.
        The size of comparators must be equal to the number of outputs from 'mod'.

@narendasan force-pushed the lazy_engine_loading branch 3 times, most recently from 5cf73b4 to 29b5e72 on July 31, 2024 22:41
@zewenli98 (Collaborator) left a comment:
LGTM, just a few minor comments

@narendasan force-pushed the lazy_engine_loading branch 3 times, most recently from db1cc6a to 9bf1e2b on August 2, 2024 19:04
@narendasan force-pushed the lazy_engine_loading branch 2 times, most recently from 2323fe2 to 05543ec on August 2, 2024 21:29
Allows engines to not be setup immediately after compilation but all at
once before the module is returned back to the user.

Signed-off-by: Naren Dasan <[email protected]>
Signed-off-by: Naren Dasan <[email protected]>
@narendasan force-pushed the lazy_engine_loading branch from 05543ec to ac71002 on August 2, 2024 23:54
@narendasan merged commit 1d5dd56 into main on Aug 5, 2024
32 of 61 checks passed
@narendasan deleted the lazy_engine_loading branch on August 5, 2024 20:36
Labels
cla signed, component: api [Python], component: conversion, component: dynamo, component: runtime, component: tests
Development

Successfully merging this pull request may close these issues.

✨[Feature] Delayed Initialization for TRTModule Classes
4 participants