Question about fine tunning foundations model #329

mahpe · 2024-02-26T10:03:08Z

mahpe
Feb 26, 2024

Dear all

I am trying to fine tune the "large" foundation model for my data. However, when I run "run_train.py" the loss function for the model is not computed.

I use the "foundation" branch and the below arguments:

log_dir = "logs"
model_dir = "."
checkpoints_dir = "checkpoints"
results_dir = "results"
name="MACE_0"
foundation_model="large"
train_file="train.xyz"
valid_fraction=0.05
test_file="test.xyz"
energy_weight=1.0
forces_weight=10.0
E0s="average"
lr=0.01
scaling="rms_forces_scaling"
batch_size=2
max_num_epochs = 150#1500
ema=true
ema_decay=0.99
amsgrad=true
default_dtype="float32"
device="cuda"
seed=3

Do you know why the loss is not calculated and evaluated for each batch?

ilyes319 · 2024-02-26T10:10:18Z

ilyes319
Feb 26, 2024
Maintainer

Can you share the log file and training file please? What do you mean exactly by "the loss is not computed".

1 reply

mahpe Feb 26, 2024
Author

Yes the log file is attached
Test.log

basillicus · 2024-05-23T09:47:22Z

basillicus
May 23, 2024

I think it may be related to this issue.

You may need to rename your labels in your train and test files to REF_energies and REF_forces, and use the flags --energy_key and --forces_key accordingly

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about fine tunning foundations model #329

{{title}}

Replies: 2 comments 1 reply

{{title}}

{{title}}

{{title}}

Select a reply

Question about fine tunning foundations model #329

mahpe Feb 26, 2024

Replies: 2 comments · 1 reply

ilyes319 Feb 26, 2024 Maintainer

mahpe Feb 26, 2024 Author

basillicus May 23, 2024

mahpe
Feb 26, 2024

Replies: 2 comments 1 reply

ilyes319
Feb 26, 2024
Maintainer

mahpe Feb 26, 2024
Author

basillicus
May 23, 2024