Skip to content

Update vista3d tutorial readme #1859

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 5 commits into from
Oct 10, 2024
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
17 changes: 17 additions & 0 deletions vista_3d/README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,21 @@
# MONAI **V**ersatile **I**maging **S**egmen**T**ation and **A**nnotation
[[`Paper`](https://arxiv.org/pdf/2406.05285)] [[`Demo`](https://build.nvidia.com/nvidia/vista-3d)] [[`Checkpoint`](https://developer.download.nvidia.com/assets/Clara/monai/tutorials/model_zoo/model_vista3d.pt)]
## Overview

The **VISTA3D** is a foundation model trained systematically on 11,454 volumes encompassing 127 types of human anatomical structures and various lesions. It provides accurate out-of-the-box segmentation that matches state-of-the-art supervised models which are trained on each dataset. The model also achieves state-of-the-art zero-shot interactive segmentation in 3D, representing a promising step toward developing a versatile medical image foundation model.

The tutorial demonstrates how to finetune the VISTA3D model on user data, where we use the MSD Task09 Spleen as the example.

In Summary the tutorial covers the following:
- Creation of datasets and data transforms for training and validation
- Create and VISTA3D model and load the pretrained checkpoint
- Implementation of the finetuning loop
- Mixed precision training with GradScaler
- Visualization of training loss and validation accuracy
- Inference on a single validation image
- Visualization of input image, ground truth, and model prediction

For more advanced use, please refer to [VISTA3D research codebase](https://github.com/Project-MONAI/VISTA/tree/main/vista3d) and [VISTA3D bundle](https://github.com/Project-MONAI/model-zoo/tree/dev/models/vista3d)
## License

The codebase is under Apache 2.0 Licence. The model weight is released under [NVIDIA OneWay Noncommercial License](./NVIDIA%20OneWay%20Noncommercial%20License.txt).
Expand Down
Loading