Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove duplicate subjects #1942

Closed
acka47 opened this issue Nov 15, 2023 · 1 comment
Closed

Remove duplicate subjects #1942

acka47 opened this issue Nov 15, 2023 · 1 comment

Comments

@acka47
Copy link
Contributor

acka47 commented Nov 15, 2023

On 2023-11-15, 10:07 I.N. wrote:

bei der folgenden Titelaufnahme: https://nwbib.de/990157191480206441
https://nwbib.de/990157191480206441 wurden die Schlagworteinträge
„Zeitschriften, fortlaufende Sammelwerke“ durch Anreichung automatisch
erzeugt.

Die Informationen für diese textlichen Einträge stammt nicht aus dem
Schlagwortfeld 689 sondern aus dem Feld 084 (050 = DDC-Sachgruppe der
ZDB und Sachgruppen der DNB).

Da das Feld 084 mit dem Eintrag 050 zweimal in der Titelaufnahme
vorkommt, wurden auch zwei textliche Einträge erzeugt. Vielleicht könnte
man noch eine Dublettenprüfung programmieren, um den doppelten Eintrag
zu verhindern?

Here is the example resource's JSON in lobid: http://lobid.org/resources/990157191480206441.json

I suggest, to check all single subjects (that are not of type ComplexSubject) by notation (and maybe id) and to remove duplicates.

@acka47
Copy link
Contributor Author

acka47 commented Jan 23, 2024

There are no duplicate subjects as one is from source "Sachgruppen der DNB" and the other from "DDC-Sachgruppen der ZDB". Thus, we won't fix the data but will have to think about how to adjust the NWBib UI to just show the subject from one source. Opened hbz/nwbib#643 for this and closing this one.

@acka47 acka47 closed this as completed Jan 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

No branches or pull requests

1 participant