-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Word/glyph segmentation coordinates are broken #57
Comments
The glyphs are there, coordinates seem fine too (casual inspection), but not displayed in Page Viewer. I'll investigate more thoroughly next week. |
The code has been broken since we try to process all lines of a region at once. We are always using the y coordinates of the last line in the text region 😅 |
This is fixed in the latest release. |
Reopening: I'd like a simple test for this as I don't routinely check word coordinates. I imagine a simple document and a rough check if the word coordinates are plausible. |
You could try https://github.com/OCR-D/assets/tree/master/data/kant_aufklaerung_1784-page-region-line-word_glyph/data as test data. |
There is a cluster of bounding boxes elsewhere on the page, so I think the coordinates are broken probably.
The text was updated successfully, but these errors were encountered: