OCRopus

Not Rated
Tags: ocr

Description
OCRopus(tm) is a state-of-the-art document analysis and optical character recognition (OCR) system featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multilingual capabilities.

The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the U.S. Census Bureau, and novel high-performance layout analysis methods.

OCRopus development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. It will also be an excellent OCR system for many other applications.
Interface: Command Line
Associated Programs
Tesseract OCR: Command-line OCR tool
Available deb Repositories (how to add a repository)
Debian         32-bit        64-bit
stable         0.3.1-3+b1    0.3.1-3+b1
testing        0.3.1-4       0.3.1-4
sid            0.3.1-4       0.3.1-4
experimental   0.4.4-1       0.4.4-1

Ubuntu         32-bit        64-bit
lucid          0.3.1-2       0.3.1-2
oneiric        0.3.1-3       0.3.1-3
