A graphical user interface for the Tesseract OCR engine. The program has been introduced in the Master’s thesis “Analyses and Heuristics for the Improvement of Optical Character Recognition Results for Fraktur Texts” by Paul Vorbach.
Glyph overview for easier detection of errors
Comparison view to compare the original document with the perceived result
Evaluation view with a transcription field
Batch export functionality to handle large projects
You can download and build the source code using Gradle. I will soon publish more detailed information on how to do this as well as a binary release.
- This software uses the Tesseract OCR engine.
- This software uses ocrevalUAtion by Rafael C. Carrasco for providing accuracy measures of the OCR results.
- This software uses the Silk icon set by Mark James (famfamfam.com).
GPLv3
Tesseract GUI - a graphical user interface for the Tesseract OCR engine
Copyright (C) 2014 Paul Vorbach
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program. If not, see <http://www.gnu.org/licenses/>.