Any one tried to get numbers only calling the latest version of tesseract 4.0 in python?
The below worked in 3.05 but still returns characters in 4.0, I tried removing all config files but the digits file and still didn't work; any help would be great:
im is an image of a date, black text white background:
import pytesseract
im = imageOfDate
im = pytesseract.image_to_string(im, config='outputbase digits')
print(im)
You can specify the numbers in the tessedit_char_whitelist
as below as a config option
.
ocr_result = pytesseract.image_to_string(image, lang='eng', boxes=False, \
config='--psm 10 --oem 3 -c tessedit_char_whitelist=0123456789')
Hope this help.