Is there an OCR library that outputs coordinates of words found within an image?

ocr
Adam Paynter picture Adam Paynter · Feb 18, 2011 · Viewed 24k times · Source

In my experience, OCR libraries tend to merely output the text found within an image but not where the text was found. Is there an OCR library that outputs both the words found within an image as well as the coordinates (x, y, width, height) where those words were found?

Answer

Andrew Cash picture Andrew Cash · Feb 20, 2011

Most commercial OCR engines will return word and character coordinate positions but you have to work with their SDK's to extract the information. Even Tesseract OCR will return position information but it has been not easy to get to. Version 3.01 will make easier but a DLL interface is still being worked on.

Unfortunately, most free OCR programs use Tesseract OCR in its basic form and they only report the raw ASCII results.

www.transym.com - Transym OCR - outputs coordinates. www.rerecognition.com - KADMOS engine returns coordinates.

Also Caere Omnipage, Mitek, Abbyy, Charactell return character positions.