This script is for Slackware 13.0 only and may be outdated.

SlackBuilds Repository

tesseract (2.03), Graphics category, Slackware 13.0

Tesseract is a commercial-quality OCR engine originally developed at HP
between 1985 and 1995. In 1995, it was among the top three engines
evaluated by UNLV. It was open-sourced by HP and UNLV in 2005.

You will need at least one language pack to do anything useful with
tesseract, and the language pack tarball must be present in the same
directory as the SlackBuild script when the package is created. See the
upstream Tesseract project for a list of all available language packs.
You can install more than one (or even all) of the language packs, as
they do not conflict with each other. The build script defaults to
English, but this is easily changed by setting TESSLANG on the command
line.

Here is the relevant code from the build script:
# Language pack(s) to use
# We'll install English by default, but you can pass another one (or all)
# of them on the command line (space delimited). If you pass more than one
# (again, space delimited), you must enclose the string in quotes. Examples:
# TESSLANG=fra ./tesseract.SlackBuild
# TESSLANG="deu deu-f eng fra ita nld por spa vie" ./tesseract.SlackBuild
TESSLANG=${TESSLANG:-eng} # Default to English
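
The snippet above relies on the shell's ${VAR:-default} expansion and on a
space-delimited list. The following is a minimal sketch of how such a list
might be resolved and checked before building; it is not taken from the
SlackBuild itself. The function names and the "langpack-<code>.tar.gz" file
naming are hypothetical, so substitute the real tarball names from upstream.

```shell
# Sketch only: not the actual SlackBuild code.

# Resolve the language list the same way the build script does:
# use $TESSLANG if it is set and non-empty, otherwise fall back to "eng".
resolve_langs() {
  echo "${TESSLANG:-eng}"
}

# Print the language codes whose tarball is missing from directory $1.
# ASSUMPTION: the "langpack-<code>.tar.gz" naming is hypothetical.
missing_packs() {
  dir=$1
  # Unquoted on purpose: word splitting turns the list into separate codes.
  for lang in $(resolve_langs); do
    [ -f "$dir/langpack-$lang.tar.gz" ] || echo "$lang"
  done
}

# Example: list the packs still missing from the current directory.
# TESSLANG="deu fra" missing_packs .
```

Quoting matters here: TESSLANG="deu fra" passes one variable holding two
codes, which the unquoted expansion in the loop then splits on whitespace.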

Maintained by: Pierre Cazenave
Keywords: optical character recognition,ocr,language,google,scan
ChangeLog: tesseract


Source Downloads:
tesseract-2.03.tar.gz (5777b70b11df16c1ac9aa155d7cfc553)
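
The 32-character hexadecimal value listed next to the source tarball looks
like an MD5 checksum. Assuming that is the case, a download can be checked
before building with something like the sketch below. The verify_md5 helper
is a hypothetical name introduced for illustration; md5sum is the standard
GNU coreutils tool.

```shell
# Sketch only: verify_md5 is a hypothetical helper, not part of the SlackBuild.

# verify_md5 FILE EXPECTED
# Succeeds (exit status 0) only if FILE's MD5 digest equals EXPECTED.
verify_md5() {
  # md5sum prints "<digest>  <file>"; keep only the digest field.
  actual=$(md5sum "$1" | cut -d ' ' -f 1)
  [ "$actual" = "$2" ]
}

# Example, using the checksum listed above:
# verify_md5 tesseract-2.03.tar.gz 5777b70b11df16c1ac9aa155d7cfc553 \
#   && echo "checksum OK" || echo "checksum MISMATCH"
```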

Download SlackBuild:
tesseract.tar.gz.asc (FAQ)

(the SlackBuild does not include the source)

Validated for Slackware 13.0

See our HOWTO for instructions on how to use the contents of this repository.

Access to the repository is available via ftp, git, cgit, http, and rsync.

© 2006-2024 Project. All rights reserved.
Slackware® is a registered trademark of Patrick Volkerding
Linux® is a registered trademark of Linus Torvalds