
Free memory allocated for the LM after evaluation finishes #2277

Closed · wants to merge 2 commits

Conversation

ahmedamrelhefnawy commented Sep 3, 2024

Deletes the lm object after the simple_evaluate function finishes.

Then, memory can easily be freed as follows (see the sketch after this list):

  1. Free the framework's cache (depending on the model):
  • tf.keras.backend.clear_session() for TensorFlow
  • torch.cuda.empty_cache() for PyTorch
  2. Run the garbage collector:
  • import gc
  • gc.collect()

This is especially important when multiple models are evaluated sequentially.
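A minimal sketch of this sequence, assuming a run where the harness builds the model itself from a name string (the model and task names are illustrative):

import gc

import torch
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=EleutherAI/pythia-160m",
    tasks=["lambada_openai"],
)

# with this PR, the internal lm object is dropped once simple_evaluate
# returns, so its memory can then be reclaimed:
gc.collect()
torch.cuda.empty_cache()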

CLAassistant commented Sep 3, 2024

CLA assistant check
All committers have signed the CLA.

ahmedamrelhefnawy marked this pull request as draft September 4, 2024 07:41
ahmedamrelhefnawy marked this pull request as ready for review September 4, 2024 07:43
haileyschoelkopf (Collaborator) left a comment

Hi! Thanks for this PR! We're aiming to put out v0.4.4 on PyPI this week but will try to review this ASAP.

The sole blocker that comes to mind is that this may disrupt a usage pattern in which one starts with an already-initialized model, wraps that existing model in an HFLM class, and passes it to simple_evaluate. We'd want to make sure this does not delete the original initialized model, for those running lm_eval within their training loops. The code would look something like this:

import lm_eval
from lm_eval.models.huggingface import HFLM
from transformers import AutoModelForCausalLM

my_hf_model = AutoModelForCausalLM.from_pretrained(...)

lm_obj = HFLM(pretrained=my_hf_model)

results = lm_eval.simple_evaluate(
    model=lm_obj,
    tasks=["taskname1", "taskname2"],
    ...
)

# we should now still be able to use my_hf_model and not free it
my_hf_model.generate(...)

(see https://github.com/EleutherAI/lm-evaluation-harness/blob/main/docs/interface.md#external-library-usage)

If you have the bandwidth to test this case (and perhaps add a test, though that wouldn't be required), that would make it a lot easier to approve and merge this right away! Otherwise, we'll look at testing that setting ourselves.

ahmedamrelhefnawy (Author) commented Sep 4, 2024

> If you have the bandwidth to test this case (and perhaps add a test, though that wouldn't be required), that would make it a lot easier to approve and merge this right away! Otherwise, we'll look at testing that setting ourselves.

Thanks for your attention. I tested it and it worked.
However, I can put del lm inside an if condition that checks whether the model argument was an instance of lm_eval.api.model.LM; if it was, the lm object won't be deleted. A sketch of that guard follows.
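A minimal sketch of that guard, assuming it runs inside simple_evaluate after evaluation completes (free_lm_if_owned is a hypothetical helper, not part of the codebase; model and lm follow the names used in the snippet above):

import gc
from typing import Union

from lm_eval.api.model import LM


def free_lm_if_owned(model: Union[str, LM], lm: LM) -> None:
    """Hypothetical helper: drop the LM wrapper only when the harness
    built it itself from a model name string."""
    if not isinstance(model, LM):
        # deleting this binding drops a reference, so the wrapper can be
        # collected once no other references remain; a pre-built LM passed
        # in by the caller (e.g. an HFLM wrapping their model) is left alone
        del lm
        gc.collect()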


ahmedamrelhefnawy (Author) commented Sep 4, 2024

Note: there's a problem with deleting already-initialized models after evaluation.
Despite deleting my_hf_model, my_hf_tokenizer, and lm_obj, and clearing the cache multiple times, the memory remains occupied.
I also tested this with the main repository (yours) to check if the issue existed there as well, and found the same behavior.
It appears to be a longstanding problem, so I'm documenting it here in case it can be addressed or resolved in future updates.

This applies at fb23b3f, where del lm ran without any if conditions. A reproduction sketch follows.
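A minimal sketch of the reproduction described above, assuming a CUDA device (the model and task names are illustrative, and the memory_allocated check is one way to observe the behavior, not part of the original report):

import gc

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

import lm_eval
from lm_eval.models.huggingface import HFLM

my_hf_model = AutoModelForCausalLM.from_pretrained("EleutherAI/pythia-160m").cuda()
my_hf_tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pythia-160m")

lm_obj = HFLM(pretrained=my_hf_model, tokenizer=my_hf_tokenizer)
results = lm_eval.simple_evaluate(model=lm_obj, tasks=["lambada_openai"])

del my_hf_model, my_hf_tokenizer, lm_obj
gc.collect()
torch.cuda.empty_cache()

# expected: allocated CUDA memory drops back toward zero;
# observed (per the note above): it remains occupied
print(torch.cuda.memory_allocated())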
