We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent c253d0b commit 6815adbCopy full SHA for 6815adb
src/sagemaker/remote_function/client.py
@@ -278,6 +278,12 @@ def remote(
278
max_wait_time_in_seconds (int): Timeout in seconds waiting for spot training job.
279
After this amount of time Amazon SageMaker will stop waiting for managed spot training
280
job to complete. Defaults to ``None``.
281
+
282
+ use_torchrun (bool): Specifies whether to use torchrun for distributed training.
283
+ Defaults to ``False``.
284
285
+ nproc_per_node (int): Specifies the number of processes per node for distributed training.
286
+ Defaults to ``1``.
287
"""
288
289
def _remote(func):
0 commit comments