
Dedupe script is broken #4412

Closed
nickumia-reisys opened this issue Aug 2, 2023 · 2 comments
Labels: bug (Software defect or bug), component/solr-service (Related to Solr-as-a-Service, a brokered Solr offering), harvest-duplicates (Issues related to Duplicated Datasets)

Comments

@nickumia-reisys
Contributor

How to reproduce

  1. Run the Check org duplicates action.
  2. Run the Run De-dupe action on any org with the results.

Background:

  • See O+M 2023-08-04 #4409: fema-gov had "8000" duplicates of "1" dataset. After running the de-dupe, all of the data was deleted and had to be re-harvested.

Expected behavior

Only the duplicate datasets are deleted.
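
To make the expected behavior concrete, here is a minimal sketch of the invariant a de-dupe run should satisfy. It is not the actual dedupe script. It assumes duplicates share an `identifier` and that the oldest record in each group is the one to keep. The field names (`id`, `identifier`, `metadata_created`) and the helper `plan_dedupe` are hypothetical and used only for illustration.

```python
from collections import defaultdict


def plan_dedupe(datasets):
    """Split datasets into ids to keep and ids to delete.

    Assumption (hypothetical rule): datasets with the same "identifier"
    are duplicates, and the oldest one (by "metadata_created") survives.
    """
    groups = defaultdict(list)
    for ds in datasets:
        groups[ds["identifier"]].append(ds)

    keep, delete = [], []
    for members in groups.values():
        # ISO 8601 timestamps sort correctly as plain strings.
        members = sorted(members, key=lambda d: d["metadata_created"])
        keep.append(members[0]["id"])
        delete.extend(d["id"] for d in members[1:])
    return keep, delete


# Example: three copies of one dataset plus one unrelated dataset.
datasets = [
    {"id": "x2", "identifier": "a", "metadata_created": "2023-02-01T00:00:00"},
    {"id": "x1", "identifier": "a", "metadata_created": "2023-01-01T00:00:00"},
    {"id": "x3", "identifier": "a", "metadata_created": "2023-03-01T00:00:00"},
    {"id": "y1", "identifier": "b", "metadata_created": "2023-01-15T00:00:00"},
]
keep, delete = plan_dedupe(datasets)

# Safety check before deleting anything: every identifier keeps exactly one
# survivor, and nothing is both kept and deleted.
assert len(keep) == len({ds["identifier"] for ds in datasets})
assert not set(keep) & set(delete)
```

A check like the last two assertions, run before any deletion, would flag the case reported here, where every copy of a dataset ends up in the delete list.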

Actual behavior

Good datasets are deleted as well.

Sketch

TBD

@nickumia-reisys nickumia-reisys added the bug Software defect or bug label Aug 2, 2023
@nickumia-reisys nickumia-reisys mentioned this issue Aug 3, 2023
10 tasks
@hkdctol hkdctol moved this to 📔 Product Backlog in data.gov team board Aug 3, 2023
@btylerburton btylerburton added component/solr-service Related to Solr-as-a-Service, a brokered Solr offering harvest-duplicates Issues related to Duplicated Datasets labels Dec 21, 2023
@btylerburton
Contributor

@FuhuXia is this still valid?

@gujral-rei gujral-rei moved this from 📔 Product Backlog to 📟 Sprint Backlog [7] in data.gov team board Jan 18, 2024
@FuhuXia FuhuXia moved this from 📟 Sprint Backlog [7] to 🏗 In Progress [8] in data.gov team board Jan 23, 2024
@FuhuXia
Member

FuhuXia commented Jan 24, 2024

Can't replicate it. The dedupe action is working fine.

@FuhuXia FuhuXia closed this as completed Jan 24, 2024
@github-project-automation github-project-automation bot moved this from 🏗 In Progress [8] to ✔ Done in data.gov team board Jan 24, 2024
@btylerburton btylerburton moved this from ✔ Done to 🗄 Closed in data.gov team board Feb 5, 2024