
Fine-Tuning process #9

Open
ghost opened this issue Jun 8, 2020 · 3 comments

ghost commented Jun 8, 2020

Hi!
I would like to know the process used to fine-tune UniLM on inverted SQuAD (hardware, training time, number of steps, hyperparameters, etc.).
Would that be possible?
Thanks in advance!

@thusithaC

Yes, I have the same question. The repo is extremely useful, produces good-quality results, and is easy to set up and use compared to some purely research-oriented GitHub repos.

This might be a naive question, but does this repo even include the code needed to train the .bin file? I would love to recreate this in other languages, so it would be extremely helpful if a re-training guide could be included in the README, with links to the source datasets.


artitw (Owner) commented Aug 22, 2020

@ugmSorcero please see the fine-tuning parameters below:

--max_seq_length 512 \
--max_position_embeddings 512 \
--mask_prob 0.7 \
--max_pred 48 \
--train_batch_size 32 \
--gradient_accumulation_steps 2 \
--learning_rate 0.00002 \
--warmup_proportion 0.1 \
--label_smoothing 0.1 \
--num_train_epochs 10
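
For context, here is a sketch of how these flags might be passed to a UniLM seq2seq fine-tuning run. The script path, base model, checkpoint name, and data paths below are assumptions modeled on the upstream microsoft/unilm fine-tuning script and are not confirmed details from this thread:

# Sketch only: run_seq2seq.py, the checkpoint, and the data layout are assumptions.
python biunilm/run_seq2seq.py \
  --do_train \
  --bert_model bert-large-cased \
  --model_recover_path ${MODEL_DIR}/unilm1-large-cased.bin \
  --data_dir ${DATA_DIR} \
  --src_file train.src \
  --tgt_file train.tgt \
  --output_dir ${OUTPUT_DIR} \
  --max_seq_length 512 \
  --max_position_embeddings 512 \
  --mask_prob 0.7 \
  --max_pred 48 \
  --train_batch_size 32 \
  --gradient_accumulation_steps 2 \
  --learning_rate 0.00002 \
  --warmup_proportion 0.1 \
  --label_smoothing 0.1 \
  --num_train_epochs 10

For an inverted-SQuAD setup, the source file would hold the answer/context inputs and the target file the questions to generate. Gradient accumulation over 2 steps lets the batch of 32 be processed as two smaller forward/backward passes, which lowers peak GPU memory at the same effective batch size.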


artitw (Owner) commented Aug 22, 2020

@thusithaC I will have to play around and think about how best to incorporate training the model from scratch, and I will get back to you on this. If you have any ideas about that, feel free to let us know.
