
Update ORT training to be compatible with transformers 4.31 #1227

Merged — 4 commits from update-ort-trainer-431 into main, Aug 1, 2023

Conversation

@JingyaHuang (Collaborator) commented Jul 25, 2023

What does this PR do?

As per title.

  • ORTTrainer
  • ORTTrainingArguments
  • ORTSeq2SeqTrainer
  • ORTSeq2SeqTrainingArguments

Possible follow-up PRs

These follow-ups concern the inference (evaluation / prediction) path of the trainer APIs. Since the Trainer APIs are primarily used for training, and ORT inference can always be done with the ORTModel classes in the Optimum library, there is no priority or ETA for them. If anyone is interested in contributing, please feel free to open a PR and tag me for review. I am willing to handle them, but I can't say when I will have the bandwidth...

  • Improve the export of models
  • Support merged decoder
  • Register more tasks supported by Optimum right now

@HuggingFaceDocBuilderDev commented Jul 26, 2023

The documentation is not available anymore as the PR was closed or merged.

@prathikr (Contributor)

Related Issue: #1133

@JingyaHuang (Collaborator, Author)

Gently pinging @pacman100 for the context.

The current issue I'm hitting with the optimizer when using DeepSpeed ZeRO stage 2:

----------------------------------------------------------------------
Traceback (most recent call last):
  File "/workspace/optimum/test_onnxruntime_train.py", line 135, in test_ort_trainer_encoder
    train_result = trainer.train()
  File "/workspace/optimum/optimum/onnxruntime/trainer.py", line 455, in train
    return inner_training_loop(
  File "/workspace/optimum/optimum/onnxruntime/trainer.py", line 815, in _inner_training_loop
    self.optimizer.step()
AttributeError: 'DummyOptim' object has no attribute 'step'

----------------------------------------------------------------------
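For context on the traceback: under DeepSpeed ZeRO, accelerate hands the trainer a placeholder optimizer (its `DummyOptim`) that only carries configuration for the DeepSpeed engine, which is supposed to perform the actual stepping. Calling `.step()` on the placeholder directly therefore raises exactly this `AttributeError`. Below is a minimal, self-contained sketch of the failure mode — the `DummyOptim` and `optimizer_step` here are hypothetical stand-ins for illustration, not accelerate's or the trainer's actual code:

```python
class DummyOptim:
    """Hypothetical stand-in for accelerate's DummyOptim placeholder.

    It only records optimizer config for the DeepSpeed engine and
    deliberately implements no step() / zero_grad().
    """

    def __init__(self, params, lr=1e-3):
        self.params = list(params)
        self.lr = lr


def optimizer_step(optimizer):
    """Illustrative guard around the trainer's self.optimizer.step() call."""
    if not hasattr(optimizer, "step"):
        # Placeholder optimizer: stepping must be deferred to the
        # DeepSpeed engine rather than invoked here.
        return "deferred to DeepSpeed engine"
    optimizer.step()
    return "stepped directly"


print(optimizer_step(DummyOptim(params=[])))  # → deferred to DeepSpeed engine
```

In the actual fix, no such manual guard is needed in user code: once accelerate prepares the model under DeepSpeed, the engine owns the optimizer step, so the trainer must avoid calling `.step()` on the placeholder itself.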

@prathikr (Contributor)

@JingyaHuang any updates?

@kshama-msft

@JingyaHuang @pacman100 we are waiting on this issue to unblock integrations within our team. It would be great if it could be fast-tracked. Thanks!

@JingyaHuang (Collaborator, Author)

Thanks @pacman100 for helping out!

So @prathikr, @kshama-msft, with the help of @pacman100 from the accelerate team, ORTTrainer is now compatible with transformers 4.31 and accelerate 0.10. Could you review and try out my branch to confirm the fix? Thanks!

@prathikr (Contributor)

prathikr commented Aug 1, 2023

Thank you @JingyaHuang this PR resolved my issue. Please merge ASAP.

@JingyaHuang JingyaHuang merged commit 5730bd2 into main Aug 1, 2023
62 of 66 checks passed
@JingyaHuang JingyaHuang deleted the update-ort-trainer-431 branch August 1, 2023 20:49
@kshama-msft

Thanks Jingya for the prompt fix!

4 participants