Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Word/glyph segmentation coordinates are broken #57

Open
mikegerber opened this issue Feb 5, 2021 · 6 comments
Open

Word/glyph segmentation coordinates are broken #57

mikegerber opened this issue Feb 5, 2021 · 6 comments
Assignees
Labels
bug Something isn't working

Comments

@mikegerber
Copy link
Collaborator

image

There is a cluster of bounding boxes elsewhere on the page, so I think the coordinates are broken probably.

@mikegerber mikegerber added the bug Something isn't working label Feb 5, 2021
@mikegerber mikegerber self-assigned this Feb 5, 2021
@mikegerber
Copy link
Collaborator Author

👀 @VChristlein

The glyphs are there, coordinates seem fine too (casual inspection), but not displayed in Page Viewer. I'll investigate more thoroughly next week.

@mikegerber
Copy link
Collaborator Author

EXTRACT-PAGES_00000024 pseg

ocrd-segment-extract-pages -I OCR-D-OCR-CALAMARI -O EXTRACT-PAGES -P plot_segmasks '["page", "region", "line", "word", "glyph"]' -P plot_overlay true

This produces the same problem as Page Viewer, so I guess the problem is with ocrd_calamari ;-) (Image scaled for GitHub upload)

@mikegerber
Copy link
Collaborator Author

The code has been broken since we try to process all lines of a region at once. We are always using the y coordinates of the last line in the text region 😅

@mikegerber
Copy link
Collaborator Author

This is fixed in the latest release.

@mikegerber
Copy link
Collaborator Author

Reopening: I'd like a simple test for this as I don't routinely check word coordinates. I imagine a simple document and a rough check if the word coordinates are plausible.

@kba
Copy link
Member

kba commented Mar 4, 2021

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants