Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add expandable segments var #775

Merged
merged 6 commits into from
Jul 22, 2024
Merged

add expandable segments var #775

merged 6 commits into from
Jul 22, 2024

Conversation

rayg1234
Copy link
Collaborator

@rayg1234 rayg1234 commented Jul 18, 2024

Expandable segments is a new cuda feature that helps with memory fragmentation during frequent allocations (ie: in the case of variable batch sizes). see https://pytorch.org/docs/stable/notes/cuda.html.
This report shows we can get a memory savings of 4GB for a 0.5B model for example: https://fairwandb.org/fairchem/fm_testing/reports/Memory-Savings-for-EQV2--Vmlldzo0MzEwMA

brookwander and others added 2 commits July 16, 2024 17:21
…unami enumeration (#764)

* adding new notebook for using fairchem models with NEBs

* adding md tutorials

* blocking code cells that arent needed or take too long
Copy link

codecov bot commented Jul 18, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Files Coverage Δ
src/fairchem/core/common/utils.py 63.11% <100.00%> (+0.34%) ⬆️

@rayg1234 rayg1234 marked this pull request as ready for review July 19, 2024 16:28
Copy link
Collaborator

@anuroopsriram anuroopsriram left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.
Can you confirm that there is no increase in memory usage for training smaller models? If that is the case, feel free to merge.

@rayg1234
Copy link
Collaborator Author

LGTM. Can you confirm that there is no increase in memory usage for training smaller models? If that is the case, feel free to merge.

yep: checked here https://fairwandb.org/fairchem/fm_testing/reports/Expandable-segments-on-31M-model--Vmlldzo0MzMxMg

@rayg1234 rayg1234 added this pull request to the merge queue Jul 22, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to no response for status checks Jul 22, 2024
@rayg1234 rayg1234 added this pull request to the merge queue Jul 22, 2024
@rayg1234 rayg1234 removed this pull request from the merge queue due to a manual request Jul 22, 2024
@rayg1234 rayg1234 added this pull request to the merge queue Jul 22, 2024
Merged via the queue into main with commit bceb1a5 Jul 22, 2024
7 checks passed
@rayg1234 rayg1234 deleted the rgao_expandable_segments branch July 22, 2024 19:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants