-
Hi there! I wonder if it is possible to use WDS data stored in a GCS bucket when training on TPU. I know the recommended format is TFDS when using a GCS bucket + TPU, but I already have a dataset in WDS format. And I have noticed the What do I need to do to read WDS files in a GCS bucket? Like replacing every Thanks a lot!
Replies: 2 comments
-
@zeyuwang615 yeah, in theory it should work, but I ran into issues with the gs:// support (gsutil pipes locking up or erroring out frequently, etc.) when I was trying to get it working in the past, so I never had it working properly. I've found tf.data (TFDS) works really well on gs://, and I've had webdataset working very nicely with S3, go figure.
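For reference, the gsutil pipe approach mentioned above relies on webdataset's `pipe:` URL scheme, where the shard is read from the stdout of a shell command. Below is a minimal sketch of building such a URL. The helper name `gcs_pipe_url` and the bucket and shard names are hypothetical, and it assumes `gsutil` is installed and authenticated on the training host. It does not address the lockups described above.

```python
def gcs_pipe_url(gs_pattern: str) -> str:
    """Wrap a gs:// shard pattern in a webdataset-style `pipe:` URL.

    Hypothetical helper for illustration; not part of webdataset.
    """
    if not gs_pattern.startswith("gs://"):
        raise ValueError(f"expected a gs:// URL, got {gs_pattern!r}")
    # webdataset runs the text after "pipe:" as a shell command and
    # reads the tar stream from that command's stdout.
    return f"pipe:gsutil cat {gs_pattern}"


# Hypothetical bucket and shard names, for illustration only.
url = gcs_pipe_url("gs://my-bucket/train-{000000..000099}.tar")
print(url)

# With webdataset installed, the URL would then be passed to the dataset:
#   import webdataset as wds
#   dataset = wds.WebDataset(url)
```

The brace pattern is left unexpanded in the URL; webdataset expands it into one `pipe:` command per shard when the dataset is constructed.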
-
Got it. In this case, it seems the problem really lies in the GCP support for WDS, not in the data format itself. I'll see what I can do. Thanks Ross!