Checkpoints can now be created directly from public or private Hugging Face models.
How It Works
To integrate with Hugging Face, first create a User Access Token with these instructions. If you plan to only download data, create a read
token. If you plan to upload results back to huggingface, create a write
token. Once you have the token, go back to the trainML third-party key configuration page, and select Hugging Face
from the Add
menu under Third-Party Keys. Enter the your Hugging Face account name as the Key ID
and the generated token as the Key Secret
and click the check button.
Once the third party key is added, select Hugging Face
as the Checkpoint source type. Use the repo name as the source uri in the format <namespace>/<repo>
. By default, checkpoint creation will automatically download the default branch and remove all other branches and git history to save space. To create a checkpoint from a different branch, specify that in the Branch
field.
Using the SDK
To create a Hugging Face checkpoint using the SDK, use the following syntax.
checkpoint = await trainml.checkpoints.create(
name="System Tests - Hugging Face Checkpoint",
source_type="huggingface",
source_uri="<namespace>/<repo>",
source_options=dict(branch="branch_name")
)