-
-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How can you continue training a model from a file? #188
Comments
Which GPU do you have and could you provide your |
What's the required path? Is the above output something you'd expect? I'd be very happy to help! |
My GPU is a NVIDIA GeForce GTX 1650 Ti Mobile. |
Long story short, it's a bug. Some things became private and the current automated testing is done without such an example, #190 addresses the principal issue of lacking API to manage to do it at all. On the other handside: That API is very rough and requires understanding of knowledge around |
The example seems to store and retrieve the config, but not the actual network parameters(eg the ILayer object) itself. What would I need to change to also save and load a Layer object alongside the config? |
There is a |
I already use the |
I'll dig deeper into this, on first glance |
#190 does implement a unit test now, but the |
I wrote a simple xor example from the project linked at the top. In the main function, first only run the |
I think the correct test would be:
and compare the output on the two I didn't get around to dig deeper yet |
I revised the main function so that it does that, the issue still persists. |
I think it may have to do with the loading of the bias weights. I added a third example which is just a single linear layer, and it learned to just be the identity function. When I load it from file, it has the same slope, but all the outputs are shifted. I'm guessing that there is something that is not being saved or loaded properly from the weights.
|
I'll try to make some time for investigating further, personal life events just consume a lot of my spare time lately. |
** Question: **
I am not sure if this is a bug or just a something I am doing wrong, but when I save the model as a file and then reload it as a layer, it does not have a similar loss to the saved version of the model, and it also trains much slower. Is there a setting needed to tell it how to continue? Do I need to save additional data to resume properly?
Project can be found here
Lines 231-238, 296-299 are the significant parts to this question. If I train it from scratch the loss will go down into the 80s after 20,000 iterations, but when I reload it it will start in the 130s and not decrease significantly in 20,000 iterations.
The text was updated successfully, but these errors were encountered: