pytesseract error Windows Error [Error 2]

Sourajit Roy Chowdhury picture Sourajit Roy Chowdhury · Jan 14, 2017 · Viewed 7.8k times · Source

Hi I am trying the python library pytesseract to extract text from image. Please find the code:

from PIL import Image
from pytesseract import image_to_string
print image_to_string(Image.open(r'D:\new_folder\img.png'))

But the following error came:

Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "C:\Python27\lib\site-packages\pytesseract\pytesseract.py", line 161, in image_to_string
config=config)
File "C:\Python27\lib\site-packages\pytesseract\pytesseract.py", line 94, in run_tesseract
stderr=subprocess.PIPE)
File "C:\Python27\lib\subprocess.py", line 710, in __init__
errread, errwrite)
File "C:\Python27\lib\subprocess.py", line 958, in _execute_child
startupinfo)
WindowsError: [Error 2] The system cannot find the file specified

I did not found a specific solution to this. Can anyone help me what to do. Anything more to be downloaded or from where i can download it etc..

Thanks in advance :)

Answer

Fabien picture Fabien · Jan 18, 2017

I had the same trouble and quickly found the solution after reading this post:

OSError: [Errno 2] No such file or directory using pytesser

Just need to adapt it to Windows, replace the following code:

tesseract_cmd = 'tesseract'

with:

tesseract_cmd = 'C:\\Program Files (x86)\\Tesseract-OCR\\tesseract'

(need double \\ to escape first \ in the string)