Skip to content

Some python functions to put text lines in LAREX PageXML files

License

Notifications You must be signed in to change notification settings

andbue/pagexmllineseg

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

pagexmllineseg

Some python functions to write text lines into LAREX PageXML files

Opens PageXML file, converts to newest pagexml version 2017, segments the text regions and writes xml to file. Output is written to input file if no output file is given.

How to use:

from pagexmllineseg import pagexmllineseg
pagexmllineseg("larexoutput.xml")

Most lines in the file are taken from https://github.com/mittagessen/kraken/blob/master/kraken/pageseg.py

About

Some python functions to put text lines in LAREX PageXML files

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages