Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature: Partial function heterogeneous parallel acceleration of force/stress_cc. #4555

Merged
merged 27 commits into from
Jul 17, 2024

Conversation

grysgreat
Copy link

The GPU heterogeneous version of drhoc and non_linear_core_correction in force/stress_cc is implemented to speed up the overall pw operation.

Linked Issue

Fix #4554

@grysgreat grysgreat changed the title Partial function heterogeneous parallel acceleration of force/stress_cc. Feature: Partial function heterogeneous parallel acceleration of force/stress_cc. Jul 3, 2024
@mohanchen
Copy link
Collaborator

close to resolve some clang-format issues, please resubmit later.

@mohanchen mohanchen closed this Jul 3, 2024
@grysgreat grysgreat reopened this Jul 4, 2024
@grysgreat
Copy link
Author

Redo all bot commits. And delete old function.

@mohanchen mohanchen added the GPU & DCU & HPC GPU and DCU and HPC related any issues label Jul 13, 2024
@mohanchen mohanchen merged commit 336702a into deepmodeling:develop Jul 17, 2024
14 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
GPU & DCU & HPC GPU and DCU and HPC related any issues
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Performance optimization for module_pw.
3 participants