You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
"""Compile an ExportedProgram module for NVIDIA GPUs using TensorRT
@@ -154,11 +154,11 @@ def compile(
154
154
hardware_compatible (bool): Build the TensorRT engines compatible with GPU architectures other than that of the GPU on which the engine was built (currently works for NVIDIA Ampere and newer)
155
155
timing_cache_path (str): Path to the timing cache if it exists (or) where it will be saved after compilation
156
156
lazy_engine_init (bool): Defer setting up engines until the compilation of all engines is complete. Can allow larger models with multiple graph breaks to compile but can lead to oversubscription of GPU memory at runtime.
157
-
save_engine_cache (bool): Whether to save the compiled TRT engines to hard disk
158
-
load_engine_cache (bool): Whether to load the compiled TRT engines from hard disk
157
+
cache_built_engines (bool): Whether to save the compiled TRT engines to storage
158
+
reuse_cached_engines (bool): Whether to load the compiled TRT engines from storage
159
159
engine_cache_dir (str): Directory to store the cached TRT engines
160
160
engine_cache_size (int): Maximum hard-disk space to use for the engine cache
161
-
engine_cache_instance (Optional[BaseEngineCache]): Engine cache instance to use for saving and loading engines. Users can provide their own engine cache by inheriting from BaseEngineCache
161
+
custom_engine_cache (Optional[BaseEngineCache]): Engine cache instance to use for saving and loading engines. Users can provide their own engine cache by inheriting from BaseEngineCache. If used, engine_cache_dir and engine_cache_size will be ignored.
162
162
**kwargs: Any,
163
163
Returns:
164
164
torch.fx.GraphModule: Compiled FX Module, when run it will execute via TensorRT
0 commit comments