Skip to main content

Hugging Face Datasets Support

· One min read

You can now populate datasets from the Hugging Face Hub using the same Hugging Face third-party key you may already use for checkpoints from Hugging Face models.

How It Works

Add or reuse a Hugging Face credential under Third-Party Keys: create a token on Hugging Face (read is enough for downloads), then add Hugging Face in the proxiML key configuration with your account name and token.

When creating a dataset, choose the source type huggingface. Specify the dataset identifier in the form the UI or SDK expects—typically the Hub id in namespace/dataset-name form, analogous to checkpoint source URIs for model repos.