Windows 7 OCR API

slyi picture slyi · May 23, 2011 · Viewed 15.5k times · Source

I have been reviewing replacements for the Office 2007 MODI OCR (OneNote's 2010 solution has lesser quality/results than 2007 :-( ). I notice that Windows 7 contains an OCR library once you install the optional tiff filter

The OCR component gets installed to

%programfiles%\Common Files\microsoft shared\OCR\7.0\xocr3.psp.dll 

but I don't see any API for it?

Does anyone see how this can be interfaced preferably in C#?

ANSWER: Found the soluation, once the optional tiff ifilter win7 feature is installed, i can then get a textoutput of a screenshot using the code/exe on http://www.codeproject.com/KB/cs/IFilter.aspx. Also if add the same [HKEY_CLASSES_ROOT.tiff\PersistentHandler] for .png and .jpg then OCR also works for jpg and png's.

Answer

Piotr Szmyd picture Piotr Szmyd · Jun 13, 2011
  1. Tessnet OCR is a good solution, but pretty old (last release from 2009). There are couple of very good free OCR solutions available for .NET:
  2. Asprise C# OCR SDK. Very good and fast one.
  3. Microsoft Research Project Hawaii Web-based (cloud) OCR solution with full docs and samples (discontinued 2013)
  4. Bing OCR Web based (cloud) OCR replacement for above. (discontinued March 2014)