Skip to content

Actions: andreyvelich/training-operator

Publish Training Operator Core Images

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
92 workflow run results
92 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Support arm64 for Hugging Face trainer (#2028)
Publish Training Operator Core Images #97: Commit 8433edc pushed by andreyvelich
March 13, 2024 20:38 25m 10s master
March 13, 2024 20:38 25m 10s
Add 3 GPUs in Notebook requirements
Publish Training Operator Core Images #96: Commit 813cb07 pushed by andreyvelich
March 11, 2024 17:21 25m 14s add-example-fine-tune-llm
March 11, 2024 17:21 25m 14s
Add Fine-Tune BERT LLM Example
Publish Training Operator Core Images #95: Commit 1a48a0c pushed by andreyvelich
March 10, 2024 01:51 27m 58s add-example-fine-tune-llm
March 10, 2024 01:51 27m 58s
Add Fine-Tune BERT LLM Example
Publish Training Operator Core Images #94: Commit 9c14096 pushed by andreyvelich
March 10, 2024 01:46 25m 25s add-example-fine-tune-llm
March 10, 2024 01:46 25m 25s
Add Fine-Tune BERT LLM Example
Publish Training Operator Core Images #93: Commit 2789de0 pushed by andreyvelich
March 10, 2024 01:39 25m 37s add-example-fine-tune-llm
March 10, 2024 01:39 25m 37s
Add Fine-Tune BERT LLM Example
Publish Training Operator Core Images #92: Commit b1bb7ba pushed by andreyvelich
March 10, 2024 01:33 23m 46s add-example-fine-tune-llm
March 10, 2024 01:33 23m 46s
Add Fine-Tune BERT LLM Example
Publish Training Operator Core Images #91: Commit a5794a4 pushed by andreyvelich
March 10, 2024 01:23 26m 2s add-example-fine-tune-llm
March 10, 2024 01:23 26m 2s
Fix build workflow config for pytorch-torchrun-example (#2020)
Publish Training Operator Core Images #90: Commit 14eeaeb pushed by andreyvelich
March 9, 2024 23:53 27m 22s master
March 9, 2024 23:53 27m 22s
Fix Distributed Data Samplers in PyTorch Examples
Publish Training Operator Core Images #89: Commit 537ce7e pushed by andreyvelich
March 5, 2024 22:09 24m 22s fix-pytorch-ddp
March 5, 2024 22:09 24m 22s
Fix Distributed Data Samplers in PyTorch Examples
Publish Training Operator Core Images #88: Commit bc251a1 pushed by andreyvelich
March 5, 2024 22:06 26m 44s fix-pytorch-ddp
March 5, 2024 22:06 26m 44s
Fix URL in python SDK setup.py (#2011)
Publish Training Operator Core Images #87: Commit 5b2c6c8 pushed by andreyvelich
March 4, 2024 16:19 28m 44s master
March 4, 2024 16:19 28m 44s
Modify check test conditions
Publish Training Operator Core Images #86: Commit 174b050 pushed by andreyvelich
January 18, 2024 14:55 25m 3s sdk-resource-per-worker
January 18, 2024 14:55 25m 3s
Fix condition
Publish Training Operator Core Images #85: Commit 3dd7ab7 pushed by andreyvelich
January 17, 2024 14:03 28m 15s sdk-resource-per-worker
January 17, 2024 14:03 28m 15s
Fix e2e to create from image
Publish Training Operator Core Images #84: Commit f8dfdc1 pushed by andreyvelich
January 16, 2024 21:53 26m 35s sdk-resource-per-worker
January 16, 2024 21:53 26m 35s
Test to create PyTorchJob from Image
Publish Training Operator Core Images #83: Commit 5031fee pushed by andreyvelich
January 16, 2024 20:47 26m 53s sdk-resource-per-worker
January 16, 2024 20:47 26m 53s
Add torchrun issue
Publish Training Operator Core Images #82: Commit 5ffb32a pushed by andreyvelich
January 16, 2024 20:30 27m 11s sdk-resource-per-worker
January 16, 2024 20:30 27m 11s
Assign values in get pod template
Publish Training Operator Core Images #81: Commit 64039fc pushed by andreyvelich
January 16, 2024 20:11 26m 26s sdk-resource-per-worker
January 16, 2024 20:11 26m 26s
Assign values in get pod template
Publish Training Operator Core Images #80: Commit aef4735 pushed by andreyvelich
January 16, 2024 13:33 25m 32s sdk-resource-per-worker
January 16, 2024 13:33 25m 32s
Fix unbound var
Publish Training Operator Core Images #79: Commit 0bd07c5 pushed by andreyvelich
January 16, 2024 13:10 24m 42s sdk-resource-per-worker
January 16, 2024 13:10 24m 42s
Add Kubeflow Website links to README (#1983)
Publish Training Operator Core Images #78: Commit 018901f pushed by andreyvelich
January 15, 2024 21:23 25m 59s master
January 15, 2024 21:23 25m 59s
[SDK] Fix Worker and Master templates for PyTorchJob
Publish Training Operator Core Images #77: Commit de55a09 pushed by andreyvelich
January 12, 2024 16:56 25m 0s sdk-fix-create-replicas
January 12, 2024 16:56 25m 0s
publish trainer hugging face image (#1985)
Publish Training Operator Core Images #76: Commit 521cbed pushed by andreyvelich
January 12, 2024 15:57 26m 8s master
January 12, 2024 15:57 26m 8s
Update README.md
Publish Training Operator Core Images #75: Commit b767841 pushed by andreyvelich
January 12, 2024 15:49 Failure update-readme
January 12, 2024 15:49 Failure
Update README.md
Publish Training Operator Core Images #74: Commit b848493 pushed by andreyvelich
January 12, 2024 15:48 Failure update-readme
January 12, 2024 15:48 Failure
Fix PyTorchJob guide
Publish Training Operator Core Images #73: Commit b176f83 pushed by andreyvelich
January 11, 2024 18:18 Failure update-readme
January 11, 2024 18:18 Failure