Skip to content

Commit

Permalink
Remove dataset download twice
Browse files Browse the repository at this point in the history
Signed-off-by: Andrey Velichkevich <[email protected]>
  • Loading branch information
andreyvelich committed Mar 15, 2024
1 parent f3fb861 commit 6717f93
Showing 1 changed file with 1 addition and 6 deletions.
7 changes: 1 addition & 6 deletions sdk/python/kubeflow/storage_initializer/hugging_face.py
Original file line number Diff line number Diff line change
Expand Up @@ -102,11 +102,6 @@ def download_dataset(self):
if self.config.access_token:
huggingface_hub.login(self.config.access_token)

load_dataset(self.config.repo_id, cache_dir=VOLUME_PATH_DATASET)

# Load dataset and save to disk.
dataset = load_dataset(
self.config.repo_id,
split=self.config.split,
)
dataset = load_dataset(self.config.repo_id, split=self.config.split)
dataset.save_to_disk(VOLUME_PATH_DATASET)

0 comments on commit 6717f93

Please sign in to comment.