Top "Tesseract" questions

Tesseract is an OCR (Optical Character Recognition) engine originally developed at HP Labs and now available as an open-source library, with development sponsored by Google.

Tesseract receipt scanning advice needed

I have struggled on and off with Tesseract for various OCR projects, and I found a use case today …

ocr tesseract receipt
Convert scanned PDF to .txt files using Tesseract

I have to convert a .pdf file containing scanned images into .txt files. Tesseract OCR converts only images to .…

tesseract
Looking for ANPR/LPR (Automatic Number Plate Recognition/License Plate Recognition) SDK for Android

I am looking for an ANPR/LPR (Automatic Number Plate Recognition/License Plate Recognition) SDK for Android. Is there any SDK …

android ocr tesseract anpr
Image preprocessing with OpenCV before doing character recognition (tesseract)

I'm trying to develop a simple PC application for license plate recognition (Java + OpenCV + Tess4J). The images aren't really good (in …

java opencv tesseract tess4j anpr
Tesseract OCR Engine Cube Mode - Training Tesseract

Can you explain to me what cube mode and Cube data files are in the Tesseract OCR engine, and what is the …

ocr tesseract cube
Tesseract OCR on AWS Lambda via virtualenv

I have spent all week attempting this, so this is a bit of a Hail Mary. I am attempting to …

python amazon-web-services virtualenv tesseract aws-lambda
"Adding" new fonts to Tesseract eng.traineddata

As far as I know, Tesseract 3.x comes with 6 English fonts (correct me if I'm wrong). I need to train …

python ocr tesseract
Does the Google Cloud Vision OCR API have better accuracy and performance than the Tesseract OCR API?

I have integrated the Google Cloud Vision API into my Java application for text recognition from complex formatted documents. One of …

tesseract google-cloud-vision
Alternative to Tesseract OCR Training?

For the past 3 months I've been trying to train Tesseract to identify a collection of images I have, due …

ocr tesseract
Does Tesseract's hOCR output really contain bounding boxes and confidence levels for each character?

The Tesseract FAQ says you can: "How can I get the coordinates and confidence of each character? There …

ocr tesseract hocr