datasets hackathon
Albert Villanova del Moral edited this page Nov 24, 2021 · 23 revisions
Thank you for participating in the BigScience🌸 Datasets hackathon!
By default, collections are added as private community raw datasets in the 🤗 Hub, under the bigscience namespace.
- Take an unassigned open issue from the Collections.
  The issues are sorted by priority, which depends on their license and size, among other criteria.
  On each issue page, you can find detailed information about the collection, such as its identifier (UID) and location.
- Create a 🤗 Dataset repository: https://huggingface.co/new-dataset
  - Set Owner: bigscience
  - Set Dataset name: the collection identifier (UID)
  - Select Private
  - Create dataset
- Clone the 🤗 Dataset repository:
  git clone https://huggingface.co/datasets/bigscience/<collection UID>
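
The create and clone steps above can also be scripted. The following is only a minimal sketch, not part of the hackathon instructions: it assumes the `huggingface_hub` Python package is installed, that you are already logged in with an account that has write access to the bigscience organization, and that `git` is on your PATH. The helper names (`repo_id`, `clone_url`, `create_private_dataset_repo`, `clone_dataset_repo`) and the UID `example_uid` are hypothetical and used for illustration only.

```python
import subprocess

HUB_URL = "https://huggingface.co"
NAMESPACE = "bigscience"  # owner given in the steps above


def repo_id(collection_uid: str) -> str:
    """Return the Hub repository id '<owner>/<collection UID>'."""
    uid = collection_uid.strip()
    if not uid or "/" in uid:
        raise ValueError(f"invalid collection UID: {collection_uid!r}")
    return f"{NAMESPACE}/{uid}"


def clone_url(collection_uid: str) -> str:
    """Return the URL passed to `git clone` for this collection."""
    return f"{HUB_URL}/datasets/{repo_id(collection_uid)}"


def create_private_dataset_repo(collection_uid: str) -> None:
    """Create the private dataset repository (requires a prior login)."""
    # Imported lazily so the URL helpers work without huggingface_hub installed.
    from huggingface_hub import create_repo

    create_repo(repo_id(collection_uid), repo_type="dataset", private=True)


def clone_dataset_repo(collection_uid: str) -> None:
    """Clone the dataset repository into the current directory."""
    subprocess.run(["git", "clone", clone_url(collection_uid)], check=True)


if __name__ == "__main__":
    # "example_uid" is a placeholder; use the UID from your issue page.
    print(clone_url("example_uid"))
```

Calling `create_private_dataset_repo(uid)` followed by `clone_dataset_repo(uid)` would mirror the web form and the `git clone` command; creating the repository through the web page, as described above, remains the documented route.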