Create report for invalid metadata values #67

Closed
bgajdero opened this issue Nov 2, 2023 · 0 comments
Assignees
Labels
data data integrity related issues

Comments

Contributor

bgajdero commented Nov 2, 2023

Create a report covering the entire catalogue database to find invalid metadata values. Curators will use this report to sort, prioritize, and fix the issues it finds.
Exclude metadata values that come from drop-down lists. These are handled in #66.

  • find badly formatted values (e.g., dates)
  • find outlier values for fields, for example by scoring each value on how its count compares with the counts of the other values in the same metadata field, then sorting on this score
  • identify invalid tag values, i.e. tags that are not in the taxonomy
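
The three checks above could be sketched roughly as follows. This is only an illustration under stated assumptions: the record layout, the field names (`date_created`, `language`, `tags`), the expected date format, and the `TAXONOMY` set are all hypothetical, since the catalogue schema is not described in this issue. The outlier score is one possible reading of the idea above: each value's count divided by the total count for that field, so rare values sort first.

```python
from collections import Counter
from datetime import datetime

# Hypothetical sample records; the real catalogue schema is not given in this issue.
RECORDS = [
    {"id": 1, "date_created": "2021-03-04", "language": "English", "tags": ["genomics"]},
    {"id": 2, "date_created": "04/03/2021", "language": "English", "tags": ["genomics", "genomcs"]},
    {"id": 3, "date_created": "2022-11-30", "language": "Englsh", "tags": ["imaging"]},
    {"id": 4, "date_created": "2020-01-15", "language": "English", "tags": ["imaging"]},
]
TAXONOMY = {"genomics", "imaging"}  # assumed set of valid tag values


def bad_dates(records, field="date_created", fmt="%Y-%m-%d"):
    """Return (id, value) pairs whose value does not parse with the assumed format."""
    bad = []
    for rec in records:
        try:
            datetime.strptime(rec[field], fmt)
        except (ValueError, TypeError):
            bad.append((rec["id"], rec[field]))
    return bad


def rarity_scores(records, field):
    """Score each distinct value as its count divided by the field's total count.

    The result is sorted ascending, so the rarest values come first.
    """
    counts = Counter(rec[field] for rec in records)
    total = sum(counts.values())
    scored = ((value, n / total) for value, n in counts.items())
    return sorted(scored, key=lambda pair: pair[1])


def invalid_tags(records, taxonomy):
    """Return (id, tag) pairs for tags that are not in the taxonomy."""
    return [
        (rec["id"], tag)
        for rec in records
        for tag in rec["tags"]
        if tag not in taxonomy
    ]


if __name__ == "__main__":
    print("Bad dates:", bad_dates(RECORDS))
    print("Rarest language values:", rarity_scores(RECORDS, "language"))
    print("Invalid tags:", invalid_tags(RECORDS, TAXONOMY))
```

A low score does not prove a value is wrong, only that it is rare, so curators would still need to review the top of the sorted list by hand.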
@bgajdero bgajdero added the data data integrity related issues label Nov 2, 2023
@bgajdero bgajdero added this to the Database Integrity milestone Nov 2, 2023
2 participants