doc/api/training/smd_model_parallel_release_notes

@@ -28,7 +28,8 @@ SageMaker Distributed Model Parallel 1.15.0 Release Notes
``smp.save_checkpoint`` with ``partial=False``.
Before, full checkpoints needed to be created by merging partial checkpoint
files after training finishes.
- * ``DistributedTransformer`` now supports the ALiBi position embeddings.
+ * `DistributedTransformer <https://sagemaker.readthedocs.io/en/stable/api/training/smp_versions/latest/smd_model_parallel_pytorch_tensor_parallel.html#smdistributed.modelparallel.torch.nn.DistributedTransformerLayer>`_
+ now supports the ALiBi position embeddings.
When using DistributedTransformer, you can set the ``use_alibi`` parameter
to ``True`` to use the Triton-based flash attention kernels. This helps
evaluate sequences longer than those used for training.
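The sketch below is a minimal illustration of how the new option might be used; only ``use_alibi=True`` and the ``DistributedTransformerLayer`` class come from this change. The other constructor arguments (``hidden_size``, ``num_attention_heads``) are assumed example values, not a verified signature.

.. code-block:: python

    # Minimal sketch: enable ALiBi position embeddings on a tensor-parallel
    # transformer layer. Only use_alibi is documented in the release notes;
    # the remaining arguments are assumed example values for illustration.
    import smdistributed.modelparallel.torch as smp

    smp.init()  # initialize the model-parallel runtime

    layer = smp.nn.DistributedTransformerLayer(
        hidden_size=1024,        # assumed example value
        num_attention_heads=16,  # assumed example value
        use_alibi=True,          # use the Triton-based flash attention kernels with ALiBi
    )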