Skip to content

Commit

Permalink
Merge pull request #205 from stweil/update
Browse files Browse the repository at this point in the history
Replace Tesseract model name frk by the new name deu_latf
  • Loading branch information
bertsky committed Apr 29, 2024
2 parents bf93020 + 0923049 commit ef3762e
Show file tree
Hide file tree
Showing 2 changed files with 10 additions and 3 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -79,7 +79,7 @@ We strongly recommend setting up a

## Models

Tesseract comes with synthetically trained models for languages (`tesseract-ocr-{eng,deu,frk,...}`
Tesseract comes with synthetically trained models for languages (`tesseract-ocr-{eng,deu,deu_latf,...}`
or scripts (`tesseract-ocr-script-{latn,frak,...}`). In addition, various models
[trained](https://github.com/tesseract-ocr/tesstrain) on scan data are available from the community.

Expand Down
11 changes: 9 additions & 2 deletions ocrd_tesserocr/ocrd-tool.json
Original file line number Diff line number Diff line change
Expand Up @@ -259,12 +259,19 @@
"size": 1525436
},
{
"url": "https://github.com/tesseract-ocr/tessdata_fast/raw/main/frk.traineddata",
"name": "frk.traineddata",
"url": "https://github.com/tesseract-ocr/tessdata_fast/raw/main/deu_latf.traineddata",
"name": "deu_latf.traineddata",
"parameter_usage": "without-extension",
"description": "Tesseract LSTM model for historical (Fraktur typesetting and letterpress printing) German",
"size": 6423052
},
{
"url": "https://github.com/tesseract-ocr/tessdata_fast/raw/main/deu_latf.traineddata",
"name": "frk.traineddata",
"parameter_usage": "without-extension",
"description": "Tesseract LSTM model for historical German (deprecated, replaced by deu_latf)",
"size": 6423052
},
{
"url": "https://github.com/tesseract-ocr/tessdata_fast/raw/main/script/Fraktur.traineddata",
"name": "Fraktur.traineddata",
Expand Down

0 comments on commit ef3762e

Please sign in to comment.