OCR

Difference between version 8 and 9 - Previous - Next
Optical Character Recognition -- typically the art of teaching a computer to read printed text (provided as scanned images).

[[...]]

Three principal open-source engines:
   * [http://jocr.sf.net/%|%GOCR] (appears to have a Tcl/Tk frontend)
   * [http://www.gnu.org/software/ocrad/ocrad.html%|%Ocrad] (GNU)
   * [http://sf.net/projects/tesseract-ocr%|%Tesseract OCR] (originally Hewlett-Packard, but now released as open source)

Recommended proprietary packages:
   * [http://www.vividata.com/index.html%|%Vividata OCR Shop XTR Lite]
   * [http://www.hamrick.com/%|%VueScan ]

Examples:
   * [Tcl does OCR with TWAPI and Microsoft Office]
-----
http://www.cs.berkeley.edu/~fateman/kathey/ocrchie.html%|%OCRchie%|%: Modular Optical Character Recognition Software in C++ with Tcl/Tk interface.

<<categories>> Glossary | Handwriting Recognition| Human Language| Image Processing| Word and Text Processing