Skip to content

nguyenq/VietOCRfx

Repository files navigation

VietOCRfx

A JavaFX GUI frontend prototype for Tesseract OCR engine. Supports optical character recognition for Vietnamese and other languages supported by Tesseract.

VietOCR is released and distributed under the Apache License, v2.0.

Features

  • Multi-platform
  • PDF, TIFF, JPEG, GIF, PNG, BMP image formats
  • Multi-page TIFF images
  • Screenshots
  • Selection box
  • File drag-and-drop
  • Paste image from clipboard
  • Postprocessing for Vietnamese to boost accuracy rate
  • Vietnamese input methods
  • Localized user interface for many languages (Localization project)
  • Integrated scanning support
  • Watch folder monitor for support of batch processing
  • Custom text replacement in postprocessing
  • Spellcheck with Hunspell
  • Support for downloading and installing language data packs and appropriate spell dictionaries

About

JavaFX GUI frontend for Tesseract OCR engine

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages