Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ICD11 Ingest #434

Merged
merged 3 commits into from
Apr 12, 2024
Merged

ICD11 Ingest #434

merged 3 commits into from
Apr 12, 2024

Conversation

joeflack4
Copy link
Contributor

@joeflack4 joeflack4 commented Feb 8, 2024

Partially addresses:

Overview

Adds source ICD11 to Mondo.
This will be deemed a success if build-mondo-ingest is run and ICD11 artefacts are present and look good (e.g. lexmatch with >1k rows).

Updates

ICD11 Ingest
- Update: mondo-ingest.Makefile
  - Add: $(COMPONENTSDIR)/icd11.owl
- Add: config/icd11foundation-property-map.sssom.tsv
- Update: ICD11 docs
- Update: config/properties.txt
- Update: config/context.json

Pre-merge checklist

  • Docs
    • docs/ have been added/updated OR
    • No updates to the docs necessary after careful consideration.
  • QC
    • sh run.sh make build-mondo-ingest has been run on this branch (after `docker pull obolibrary/odkfull:dev), and no errors occurred OR ICD11 Ingest - Data #471
    • No functional (code-related) changes to the pipeline are suggested, so no re-run is necessary.
  • Account for any new packages
  • Reviewed
    • Has been sufficiently reviewed by at least one review from a different team member of the Mondo Technical team.

Primary sub-tasks

@joeflack4 joeflack4 marked this pull request as draft February 8, 2024 23:25
@joeflack4 joeflack4 added the icd11 label Feb 8, 2024
@joeflack4 joeflack4 self-assigned this Feb 8, 2024
@joeflack4 joeflack4 added the enhancement New feature or request label Feb 8, 2024
@joeflack4 joeflack4 linked an issue Feb 8, 2024 that may be closed by this pull request
4 tasks
Copy link
Member

@matentzn matentzn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shaping up!

src/ontology/metadata/icd11.yml Outdated Show resolved Hide resolved
src/ontology/metadata/icd11.yml Outdated Show resolved Hide resolved
src/ontology/mondo-ingest.Makefile Outdated Show resolved Hide resolved
@joeflack4 joeflack4 mentioned this pull request Feb 10, 2024
11 tasks
@joeflack4 joeflack4 changed the base branch from main to develop February 12, 2024 17:50
@joeflack4 joeflack4 mentioned this pull request Mar 3, 2024
9 tasks
@joeflack4 joeflack4 force-pushed the icd11-ingest branch 3 times, most recently from 0755d7a to eb23113 Compare March 3, 2024 23:38
@joeflack4 joeflack4 force-pushed the icd11-ingest branch 2 times, most recently from 78f6330 to c2d6bef Compare March 21, 2024 00:05
@joeflack4 joeflack4 changed the title ICD11 ingest ICD11 Ingest Mar 21, 2024
@joeflack4 joeflack4 marked this pull request as ready for review March 21, 2024 19:47
@joeflack4 joeflack4 linked an issue Mar 21, 2024 that may be closed by this pull request
@joeflack4
Copy link
Contributor Author

CC @souzadevinicius This PR handles most of the setup for integrating ICD11 with Mondo. You may want to review this to get an idea of how such a thing is handled.

docs/sources/icd11foundation.md Outdated Show resolved Hide resolved
src/ontology/metadata/icd11.yml Outdated Show resolved Hide resolved
src/ontology/metadata/mondo.sssom.config.yml Show resolved Hide resolved
src/ontology/metadata/mondo.sssom.config.yml Show resolved Hide resolved
@@ -163,6 +163,18 @@ $(COMPONENTSDIR)/icd10who.owl: $(TMPDIR)/icd10who_relevant_signature.txt | compo
remove -T config/properties.txt --select complement --select properties --trim true \
annotate --ontology-iri $(URIBASE)/mondo/sources/icd10who.owl --version-iri $(URIBASE)/mondo/sources/$(TODAY)/icd10who.owl -o $@; fi

# TODO: complete
$(COMPONENTSDIR)/icd11foundation.owl: $(TMPDIR)/icd11foundation_relevant_signature.txt | component-download-icd11foundation.owl

This comment was marked as outdated.

src/ontology/mondo-ingest.Makefile Show resolved Hide resolved
src/scripts/lexmatch-sssom-compare.py Show resolved Hide resolved
- Rename: ICD11 -> ICD11Foundation
- Rename prefix: icd11 -> icd11.foundation
- Add prefixes: icd11.schema, icd11.z
- Add: intensional exclusions TSV (currently empty)
- Add: SPARQL query for selecting all diseases
- Update: mondo-ingest-odk.yaml: New entry for ICD11 in 'components'
- Updates from running 'make update_repo':
  - modified: docs/odk-workflows/ManageDocumentation.md
  - modified: docs/odk-workflows/RepositoryFileStructure.md
  - modified: src/ontology/Makefile
  - modified: src/ontology/run.sh
  - new file: src/scripts/run-command.sh
  - modified: src/scripts/update_repo.sh
- Add: metadata/icd11.yml
- Update: prefixes.csv
- Update: mondo-ingest.Makefile
  - Add: $(COMPONENTSDIR)/icd11.owl
- Add: config/icd11foundation-property-map.sssom.tsv
- Update: ICD11 docs
- Update: config/properties.txt
- Update: config/context.json
- Update: metadata/mondo.sssom.config.yml: added icd11.foundation to subject_prefixes
- Update: lexmatch-sssom-compare.py: entry for icd11
- Update: add-new-source.md: Instructions for additional configuration necessities.

General
- Bugfix: Slurp files were sometimes getting removed because they were considered intermediates and not .PRECIOUS.
- Bugfix: reports/*_exclusion_reasons.robot.template.tsv files were getting removed by the build for same reason as above.
@@ -476,6 +488,7 @@ slurp/%.tsv: $(COMPONENTSDIR)/%.owl $(TMPDIR)/mondo.sssom.tsv $(REPORTDIR)/%_map
--mondo-terms-path $(REPORTDIR)/mirror_signature-mondo.tsv \
--slurp-dir-path slurp/ \
--outpath $@
.PRECIOUS: slurp/%.tsv
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

More rm issues

I checked the build log from last week, and at the end it showed rm reports/icd11foundation_exclusion_reasons.robot.template.tsv slurp/icd11foundation.tsv imports/ro_terms_combined.txt. After adding that 1 .PRECIOUS, now the slurp file is no longer being removed, but the other two still are: rm reports/icd11foundation_exclusion_reasons.robot.template.tsv imports/ro_terms_combined.txt. I think we want these, so I'm going to apply the same fix for both of those as well. I pasted more of the log from build-mondo-ingest below just in case it's of any value.

Log related to (3)

I have the full logs saved as .txt files also if interested

From the build last week:

Release files are now in ../.. - now you should commit, push and make a release         on your git hosting site such as GitHub or GitLab
rm reports/icd11foundation_exclusion_reasons.robot.template.tsv slurp/icd11foundation.tsv imports/ro_terms_combined.txt
make[1]: Leaving directory '/work/src/ontology'
Mondo Ingest has been fully completed

From the build today:

Release files are now in ../.. - now you should commit, push and make a release         on your git hosting site such as GitHub or GitLab
rm reports/icd11foundation_exclusion_reasons.robot.template.tsv imports/ro_terms_combined.txt
make[1]: Leaving directory '/work/src/ontology'
Mondo Ingest has been fully completed

I've followed up on these two remaining issues here:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request icd11
Projects
None yet
Development

Successfully merging this pull request may close these issues.

OMIM import: parents missing
3 participants