You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Support for keywords was added in #688 and additional manual curation was added in #699. A test for checking that all Bioregistry entries have keywords is also implemented, but disabled, in the following block:
msg="manually curated keywords should be sorted and exclusively lowercase",
)
keywords=resource.get_keywords()
self.assertIsNotNone(keywords)
self.assertLess(0, len(keywords), msg=f"{resource.prefix} is missing keywords")
It would be great to go as far as requiring all manually added entries to the Bioregistry have keywords. Unfortunately, this is not part of all of the external data schemata and therefore can't be a global requirement, otherwise it would make import possible. So because of this, we can sporadically do keyword curation campaigns.
Alternatively, it might be interesting to try using LLMs to convert title + description of resources into keyword lists.
The text was updated successfully, but these errors were encountered:
Support for keywords was added in #688 and additional manual curation was added in #699. A test for checking that all Bioregistry entries have keywords is also implemented, but disabled, in the following block:
bioregistry/tests/test_data.py
Lines 917 to 936 in be93f70
It would be great to go as far as requiring all manually added entries to the Bioregistry have keywords. Unfortunately, this is not part of all of the external data schemata and therefore can't be a global requirement, otherwise it would make import possible. So because of this, we can sporadically do keyword curation campaigns.
Alternatively, it might be interesting to try using LLMs to convert title + description of resources into keyword lists.
The text was updated successfully, but these errors were encountered: