-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HDF error in a simple parallel run #690
Comments
@andreapiacentini places the error at line 496 in ParaGridIO.cpp (ParaGridIO::close). See here. |
A "dummy" question: in this test there is only one output file and all the timesteps are output to the same |
@andreapiacentini Not a dumb question at all! Would it be possible to re-run using the branch |
My hint is wrong: here the console output where "Closing" appears only once.
The job log is neat: |
I am still trying to track down what's going on inside NetCDF.
therefore the use of the iso format for dates with colons for separating the hours from the minutes and the seconds make the file creation fail. Unfortunately this is inside a silent Yet, closing the diagnostic file causes a crash: my hypothesis is that the |
For the moment I can run the parallel test cleanly to the end with this "a bit less dirty" patch in main.cpp
The standard says that atexit functions are called in reverse order and, indeed, I've checked that the diagnostic.nc file is closed before triggering MPI_Finalize. |
That looks very nice! Would it work with just
so that you don't have to define the additional function Otherwise, can you create an issue branch and make a pull request? $ git checkout develop |
Thanks. |
I'm trying to run the model in a very simple parallel configuration, but this ends in a non-zero exit.
The setup consists of the single column
config_column.cfg
configuration run on a 30 x 30 grid. This should result in a spatially uniform solution, but time varying.I created the init file using
make_init_column.py
, changingnfirst
andnsecond
to 30 (lines 3 and 4 in the file). I also changed the name of the output file toinit_column_30x30.nc
for clarity).Inside the development docker image, I run (from a build directory)
cmake .. -DENABLE_MPI=ON
, and then (from the run directory) I runThe run completes and returns reasonable-looking diagnostics and restart files but gives the following error:
The text was updated successfully, but these errors were encountered: