Add script to check fortran vs metadata without a host model #558

peverwhee · 2024-04-24T02:06:12Z

Summary

Adds a new script that will compare all fortran and metadata files within a directory you provide (recursively).

Description

scripts/fortran_tools/offline_check_fortran_vs_metadata.py: the script in question. Surprisingly simple script that finds the metadata files in the directory provided and calls existing capgen functions to parse and compare to the relevant fortran files.
scripts/ccpp_capgen.py: updates to (1) also return an error if the fortran specifies that a variable is optional, but there is no corresponding "optional = true" metadata field for that variable and (2) allow a user to provide known_ddts to parse_scheme_files() so I don't have to worry about the constituent DDT in my offline script
test/capgen_test/temp_adjust.F90: above change broke test, removed optional attribute from errflg variable

User interface changes?: No

BUT: here's how you run the new script:

./offline_check_fortran_vs_metadata.py --directory <path_to_scheme_file_directory> (--debug)

Caveat(s)/Notes

One thing is that, as it stands, the fortran/metadata parsing and comparison functions raise exceptions within them, so you'll only get errors for one file (and often one issue) at a time if you're running this on a large swath of files. It's already on my to-do list to improve/streamline error handling during parsing/analysis, so that will help this in the future!

The script currently returns "All checks passed" if it gets through the parsing/comparison. This can be changed if needed!

gold2718

Is this test supposed to pass on our test files? If I try it in capgen_test or advection_test, I get something like:

Unknown DDT type, physics_state, at test/advection_test/test_host_mod.meta:52

This seems suboptimal since that DDT is defined in the tested metadata files.

scripts/ccpp_capgen.py

scripts/fortran_tools/offline_check_fortran_vs_metadata.py

climbfuji · 2024-04-24T13:22:32Z

Is this test supposed to pass on our test files? If I try it in capgen_test or advection_test, I get something like:
Unknown DDT type, physics_state, at test/advection_test/test_host_mod.meta:52
This seems suboptimal since that DDT is defined in the tested metadata files.

The idea was to be able to run this on any ccpp scheme without relying on host model information. Simply compare scheme metadata with Fortran code. Therefore it's not surprising to me that the DDT isn't found and this complains. As long as these expected "errors" don't halt the execution of the script I think this is fine (maybe we can suppress that - I haven't tried the script myself yet, but I will today because I need it for the ccpp_prebuild optional argument implementation PRs).

gold2718 · 2024-04-24T15:08:24Z

Is this test supposed to pass on our test files? If I try it in capgen_test or advection_test, I get something like:
Unknown DDT type, physics_state, at test/advection_test/test_host_mod.meta:52
This seems suboptimal since that DDT is defined in the tested metadata files.
The idea was to be able to run this on any ccpp scheme without relying on host model information. Simply compare scheme metadata with Fortran code. Therefore it's not surprising to me that the DDT isn't found and this complains. As long as these expected "errors" don't halt the execution of the script I think this is fine (maybe we can suppress that - I haven't tried the script myself yet, but I will today because I need it for the ccpp_prebuild optional argument implementation PRs).

I agree with the sentiment, however, in this case, the DDT is defined in the metadata that is being scanned, it was just not scanned before a file that used it. And after all, isn't host model metadata something that can be scanned with this tool?
Given that this is not part of the mission of this script, I think it would be great to suppress warnings that are not relevant to the checks being performed.

dustinswales · 2024-04-24T15:34:40Z

@peverwhee This is great. It goes beyond comparing just the attributes in .Fortran/.meta files and does all of the "necessary Capgen compliance testing" (e.g. argument order checking, routines defined in both .F90/.meta, etc...). You could say this script is Capgen-lite, using just the scheme files. This would be a great thing to add to the ccpp-physics repo(s).

I ran this on the files in climbfuji/ccpp-physics#19, and couldn't get to the part comparing the attributes w/o commenting out a couple lines in check_fortran_against_metadata and _compare_fheader_to_mheader, which is just because these scheme files still need some cleanup for Capgen.

peverwhee · 2024-04-24T15:54:55Z

Is this test supposed to pass on our test files? If I try it in capgen_test or advection_test, I get something like:
Unknown DDT type, physics_state, at test/advection_test/test_host_mod.meta:52
This seems suboptimal since that DDT is defined in the tested metadata files.
The idea was to be able to run this on any ccpp scheme without relying on host model information. Simply compare scheme metadata with Fortran code. Therefore it's not surprising to me that the DDT isn't found and this complains. As long as these expected "errors" don't halt the execution of the script I think this is fine (maybe we can suppress that - I haven't tried the script myself yet, but I will today because I need it for the ccpp_prebuild optional argument implementation PRs).
I agree with the sentiment, however, in this case, the DDT is defined in the metadata that is being scanned, it was just not scanned before a file that used it. And after all, isn't host model metadata something that can be scanned with this tool? Given that this is not part of the mission of this script, I think it would be great to suppress warnings that are not relevant to the checks being performed.

Ooh an interesting question. I think there's some complexity here because the script is meant to only parse "scheme" files and assumes there aren't "host" files present (as would be the case in a physics directory like ccpp-physics or atmospheric_physics). But there's no way I can think of to tell which files are host files (in order to skip or parse them). I could add an argument so the user could specify which files are host files and then deal with them somehow that way?

peverwhee · 2024-04-24T16:05:30Z

Is this test supposed to pass on our test files? If I try it in capgen_test or advection_test, I get something like:
Unknown DDT type, physics_state, at test/advection_test/test_host_mod.meta:52
This seems suboptimal since that DDT is defined in the tested metadata files.
The idea was to be able to run this on any ccpp scheme without relying on host model information. Simply compare scheme metadata with Fortran code. Therefore it's not surprising to me that the DDT isn't found and this complains. As long as these expected "errors" don't halt the execution of the script I think this is fine (maybe we can suppress that - I haven't tried the script myself yet, but I will today because I need it for the ccpp_prebuild optional argument implementation PRs).
I agree with the sentiment, however, in this case, the DDT is defined in the metadata that is being scanned, it was just not scanned before a file that used it. And after all, isn't host model metadata something that can be scanned with this tool? Given that this is not part of the mission of this script, I think it would be great to suppress warnings that are not relevant to the checks being performed.
Ooh an interesting question. I think there's some complexity here because the script is meant to only parse "scheme" files and assumes there aren't "host" files present (as would be the case in a physics directory like ccpp-physics or atmospheric_physics). But there's no way I can think of to tell which files are host files (in order to skip or parse them). I could add an argument so the user could specify which files are host files and then deal with them somehow that way?

I have now read the most recent reply from @gold2718 more thoroughly! I can indeed update the script/capgen to allow for ignoring the ddt check since those can come from the host (or I guess a scheme that has yet to be parsed).

gold2718 · 2024-04-24T16:59:54Z

I agree with the sentiment, however, in this case, the DDT is defined in the metadata that is being scanned, it was just not scanned before a file that used it. And after all, isn't host model metadata something that can be scanned with this tool? Given that this is not part of the mission of this script, I think it would be great to suppress warnings that are not relevant to the checks being performed.

Ooh an interesting question. I think there's some complexity here because the script is meant to only parse "scheme" files and assumes there aren't "host" files present (as would be the case in a physics directory like ccpp-physics or atmospheric_physics). But there's no way I can think of to tell which files are host files (in order to skip or parse them). I could add an argument so the user could specify which files are host files and then deal with them somehow that way?

I have now read the most recent reply from @gold2718 more thoroughly! I can indeed update the script/capgen to allow for ignoring the ddt check since those can come from the host (or I guess a scheme that has yet to be parsed).

Right. This has nothing to do with host vs. scheme. For instance, in capgen_test, there is the vmr_type ddt which is part of ddt_suite.xml (so a DDT in a scheme file).

peverwhee · 2024-04-24T17:42:02Z

@gold2718 Updated to skip the ddt checking from the offline script

peverwhee · 2024-04-24T19:03:12Z

@peverwhee This is great. It goes beyond comparing just the attributes in .Fortran/.meta files and does all of the "necessary Capgen compliance testing" (e.g. argument order checking, routines defined in both .F90/.meta, etc...). You could say this script is Capgen-lite, using just the scheme files. This would be a great thing to add to the ccpp-physics repo(s).

I ran this on the files in climbfuji/ccpp-physics#19, and couldn't get to the part comparing the attributes w/o commenting out a couple lines in check_fortran_against_metadata and _compare_fheader_to_mheader, which is just because these scheme files still need some cleanup for Capgen.

Yeah, I was worried about that. I can add a flag to skip order-checking?

mkavulich · 2024-05-14T15:43:47Z

@gold2718 Did you still have unresolved concerns on this PR?

climbfuji · 2024-05-14T15:47:37Z

@gold2718 Did you still have unresolved concerns on this PR?

@mkavulich I would also like to review this - please give me a few more days. Apologies for the delay!

gold2718

Looks good now, thanks!

gold2718 · 2024-05-15T12:51:19Z

scripts/metadata_table.py

Thanks for threading this new input all the way down. It might come in handy elsewhere as that requirement and check have caused problems in other situations.

climbfuji

Thanks @peverwhee !

dustinswales

@peverwhee I have a few itsy-bitsy housekeeping requests, but nothing major. Good work!
Thanks again for creating this script, it already provided great value when working through the physics cleanup associated with #552.
In the future it would be great if CCPP compliant physics repositories ran this script as part of their CI. (I mean this script tells you if your schemes are CCPP compliant!). Maybe this should even be a requirement if a repository wants to advertise that they are "CCPP compliant"
@bluefinweiwei @dudhia FYI, but this would be great to test if your "ccpp-compliant" shared-physics for WRF/MPAS are indeed ccpp compliant.

dustinswales · 2024-04-24T14:50:26Z

scripts/fortran_tools/offline_check_fortran_vs_metadata.py

+from fortran_tools import parse_fortran_file
+from metadata_table import parse_metadata_file
+from ccpp_capgen import find_associated_fortran_file
+from ccpp_capgen import check_fortran_against_metadata, parse_scheme_files


You aren't using check_fortran_against_metadata.
parse_scheme_files calls check_fortran_against_metadata.

dustinswales · 2024-05-15T15:11:55Z

scripts/ccpp_capgen.py

+            # now check: if fortran says the variable is optional, does the metadata match?
+            if fvar:
+                fopt = fvar.get_prop_value('optional')
+                mopt = mvar.get_prop_value('optional')


Harmless, but the line is not needed with line 322.

good catch - removed!

dustinswales · 2024-05-15T15:12:23Z

scripts/ccpp_capgen.py

@@ -520,11 +533,13 @@ def parse_scheme_files(scheme_filenames, run_env):
    table_dict = {} # Duplicate check and for dependencies processing
    header_dict = {} # To check for duplicates
    known_ddts = list()
+    # end if


orphan put out of his misery.

(removed)

peverwhee · 2024-05-20T21:37:50Z

Merged into #549

add script to check fortran vs metadata without a host model

2906220

peverwhee requested review from climbfuji, gold2718, dustinswales, mwaxmonsky, grantfirl and mkavulich as code owners April 24, 2024 02:06

gold2718 requested changes Apr 24, 2024

View reviewed changes

code cleanup; code review

9ce41ee

allow for skipping of ddt check; skip ddt check from offline script

eee30cb

peverwhee requested a review from gold2718 April 24, 2024 17:42

gold2718 approved these changes May 15, 2024

View reviewed changes

climbfuji approved these changes May 15, 2024

View reviewed changes

dustinswales approved these changes May 15, 2024

View reviewed changes

dustinswales mentioned this pull request May 15, 2024

Introduce optional arguments in ccpp schemes and metadata ufs-community/ccpp-physics#189

Merged

code cleanup; review requests

9e43b78

peverwhee closed this May 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add script to check fortran vs metadata without a host model #558

Add script to check fortran vs metadata without a host model #558

peverwhee commented Apr 24, 2024 •

edited

Loading

gold2718 left a comment

climbfuji commented Apr 24, 2024

gold2718 commented Apr 24, 2024

dustinswales commented Apr 24, 2024

peverwhee commented Apr 24, 2024

peverwhee commented Apr 24, 2024

gold2718 commented Apr 24, 2024

peverwhee commented Apr 24, 2024

peverwhee commented Apr 24, 2024

mkavulich commented May 14, 2024

climbfuji commented May 14, 2024

gold2718 left a comment

gold2718 May 15, 2024

climbfuji left a comment

dustinswales left a comment

dustinswales Apr 24, 2024

peverwhee May 16, 2024

dustinswales May 15, 2024

peverwhee May 16, 2024

dustinswales May 15, 2024

peverwhee May 16, 2024

peverwhee commented May 20, 2024

Add script to check fortran vs metadata without a host model #558

Add script to check fortran vs metadata without a host model #558

Conversation

peverwhee commented Apr 24, 2024 • edited Loading

Summary

Description

Caveat(s)/Notes

gold2718 left a comment

Choose a reason for hiding this comment

climbfuji commented Apr 24, 2024

gold2718 commented Apr 24, 2024

dustinswales commented Apr 24, 2024

peverwhee commented Apr 24, 2024

peverwhee commented Apr 24, 2024

gold2718 commented Apr 24, 2024

peverwhee commented Apr 24, 2024

peverwhee commented Apr 24, 2024

mkavulich commented May 14, 2024

climbfuji commented May 14, 2024

gold2718 left a comment

Choose a reason for hiding this comment

gold2718 May 15, 2024

Choose a reason for hiding this comment

climbfuji left a comment

Choose a reason for hiding this comment

dustinswales left a comment

Choose a reason for hiding this comment

dustinswales Apr 24, 2024

Choose a reason for hiding this comment

peverwhee May 16, 2024

Choose a reason for hiding this comment

dustinswales May 15, 2024

Choose a reason for hiding this comment

peverwhee May 16, 2024

Choose a reason for hiding this comment

dustinswales May 15, 2024

Choose a reason for hiding this comment

peverwhee May 16, 2024

Choose a reason for hiding this comment

peverwhee commented May 20, 2024

peverwhee commented Apr 24, 2024 •

edited

Loading