The SSH Conversion Hub provides an inventory of solutions for conversions between data and file formats.

Use the search bar to discover solutions or use the facets to identify your area of interest.

 

Tesseract OCR

Solution icon
Solution
Is a recipe
No
Description

Tesseract is an open source text recognition (OCR) Engine.  

Tesseract can be used directly via command line, or (for programmers) by using an API to extract printed text from images. It supports a wide variety of languages.

Tesseract doesn’t have a built-in GUI, but there are several available which can be found here.  External tools, wrappers and training projects for Tesseract are listed under the AddOns page on the Home Page.

It has a fully featured API, and can be compiled for a variety of targets including Android and the iPhone.

Input format
Output format
Invocation type
Community
Expected Level of Knowledge
Terms of Use / License
Apache-2.0
Condition of use: Local use
free
Condition of use: Operate service
free
Condition of use: Further development
free
Application category
data processing
Application sub-category
conversion
Status
maintained
Contact
Tesseract User Group: http://groups.google.com/group/tesseract-ocr
Publication date
Last Modification date