MPI-parallelization of sktwocnt #94
base: main
Conversation
Force-pushed from 432b601 to 51cbe34
Rebased on #90 to incorporate shelf search bug fix.
Once this is rebased after the merge of #90 I'll have another look.
- name: Compile and Install libXC
  run: |
    git clone https://gitlab.com/libxc/libxc.git
    cd libxc/
    git checkout 6.2.2
    cmake -DCMAKE_INSTALL_PREFIX=${PWD}/${BUILD_DIR}/${INSTALL_DIR} -DENABLE_FORTRAN=True -B ${BUILD_DIR} .
    cd ${BUILD_DIR}
-   make -j 2
+   make -j2
Any reason not to just use all available resources? (likewise for the ctest commands)
Suggested change:
- make -j2
+ make -j
Not sure if there is hyper-threading on the VMs. If yes, I would prefer to keep the -j2 version to only utilize physical cores.
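For context, a hypothetical sketch of the trade-off being discussed, assuming a Linux runner where `nproc` is available (the `make` invocations are shown as comments, not taken from the workflow):

```shell
# Query the logical core count of the runner (includes hyper-threads, if any)
ncores=$(nproc)
echo "logical cores: ${ncores}"
# A fixed job count pins the build to two jobs regardless of runner size:
#   make -j2
# while an adaptive count would use every logical core the VM reports:
#   make -j"${ncores}"
```

Note that `nproc` reports logical cores, so on a hyper-threaded VM `-j"$(nproc)"` would oversubscribe the physical cores, which is exactly the concern raised above.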
if: contains(matrix.mpi, 'openmpi') || contains(matrix.mpi, 'mpich')
run: |
  pushd ${BUILD_DIR}
  ctest -j1 --output-on-failure
Why -j1 ?
Suggested change:
- ctest -j1 --output-on-failure
+ ctest --output-on-failure
It is the -DTEST_MPI_PROCS=2 run. Is there any difference between ctest and ctest -j1? The latter might be a bit more verbose but maybe a bit more transparent too. I don't really care.
sktwocnt/prog/input.f90 (Outdated)
write(stdOut, "(A,A)") "!!! Parsing error: ", txt
write(stdOut, "(2X,A,A)") "File: ", trim(fname)
write(stdOut, "(2X,A,I0)") "Line number: ", iLine
write(stdOut, "(2X,A,A,A)") "Line: '", trim(line), "'"

stop
Shouldn't there be an MPI abort first?
Should be fixed by the latest commit.
@@ -17,8 +18,19 @@ subroutine write_sktables(skham, skover)

  !> Hamiltonian and overlap matrix
  real(dp), intent(in) :: skham(:,:), skover(:,:)

  call write_sktable_("at1-at2.ham.dat", skham)
  call write_sktable_("at1-at2.over.dat", skover)
+ if (size(skham, dim=2) > 0) then
Not sure if I've followed the logic fully, but what happens if non-lead processes get here?
They don't/shouldn't call this routine. If they did, all ranks would write exactly the same file, with the same name and content (I don't know how the filesystem handles that), since all the data is present on all ranks.
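The lead-rank guard being discussed could be sketched as follows, with plain Python standing in for the Fortran/MPI code; the `rank` parameter and the return value are illustrative stand-ins, not the actual sktwocnt interface:

```python
def write_sktables(rank, n_ham_cols):
    """Simulated lead-rank guard: only rank 0 writes the SK-tables.

    In the real code every rank holds the full tables, so an unguarded
    call would make all ranks write identical files concurrently.
    Returns the list of file names that would be written.
    """
    written = []
    if rank == 0:             # lead process only
        if n_ham_cols > 0:    # skip writing an empty table
            written.append("at1-at2.ham.dat")
            written.append("at1-at2.over.dat")
    return written

# Lead rank writes both tables; non-lead ranks write nothing.
assert write_sktables(0, 5) == ["at1-at2.ham.dat", "at1-at2.over.dat"]
assert write_sktables(1, 5) == []
```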
Co-authored-by: Ben Hourahine <[email protected]>
Straightforward (non-distributed) and optional MPI parallelization of the sktwocnt binary, employing the following strategy:

1. Fixed maximum distance (static tabulation)
Creates a single batch of dimer distances that are computed by available MPI ranks. Nothing fancy, MPI ranks exceeding the total number of distances are idling. We might want to print a warning or even block such calculations.
2. Dynamic batches with maximum distance (two cases: converged, or maximum distance reached)
If the number of MPI ranks is smaller than the number of dimer distances contained in the default 1 Bohr batch length, the distances are distributed over the ranks. If the number of ranks exceeds the number of distances within the default 1 Bohr batch length, the batch size is automatically increased to accommodate as many dimer distances as there are MPI ranks. This mostly avoids idling ranks, but an increased batch size later requires the Hamiltonian and overlap data to be cropped to the size one would have obtained with the default batch length of 1 Bohr. Otherwise the length of the converged SK-tables would depend on the number of ranks.
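The batch-enlarging and cropping logic described above can be sketched as follows; this is a minimal illustration in Python, and the names `make_batch`, `crop_to_default`, and `BATCH_LEN` are hypothetical, not taken from the sktwocnt source:

```python
BATCH_LEN = 1.0  # default batch length in Bohr, as described above

def make_batch(start, step, n_ranks, batch_len=BATCH_LEN):
    """Return the dimer distances of one batch, enlarging the batch
    whenever it holds fewer distances than there are MPI ranks."""
    n_dist = max(int(round(batch_len / step)), n_ranks)
    return [start + i * step for i in range(n_dist)]

def crop_to_default(table, step, batch_len=BATCH_LEN):
    """Crop an (enlarged) batch result back to the default batch length,
    so the converged SK-table does not depend on the number of ranks."""
    n_keep = int(round(batch_len / step))
    return table[:n_keep]

# With a 0.1 Bohr grid a default batch holds 10 distances; 16 ranks
# enlarge the batch to 16 distances, which is cropped back afterwards.
batch = make_batch(2.0, 0.1, n_ranks=16)
assert len(batch) == 16
assert len(crop_to_default(batch, 0.1)) == 10
```

With 4 ranks the same call would return the default 10 distances unchanged, so the enlargement only kicks in when ranks would otherwise idle.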
To be merged after #90 (needs to be rebased).