I'm using Tesseract from Python (pytesseract) and getting inconsistent OCR results: it converts a whole line of digits correctly, but it cannot convert any of the digits on its own.
For example, for image 1 (https://i.stack.imgur.com/hAn9q.png), Tesseract gives the correct result:
pytesseract.image_to_string(p0, config=options) #p0 is image https://i.stack.imgur.com/hAn9q.png
'431659\n'
But for each of the sub-cells, which contain just one digit (image 2, https://i.stack.imgur.com/MxBaK.png), it returns an empty result:
pytesseract.image_to_string(p1, config=options) #p1 is image 2 with first digit 4
''
I'm not sure how to get the correct result. It is a very clear and simple image of a single digit, yet Tesseract cannot OCR it, even though it easily handles an image of the whole row of digits.
I tried rescaling the individual cell, but Tesseract still returns an empty string for single digits.
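
One setting that may be worth checking is the page segmentation mode, since Tesseract's default mode expects a block of text rather than one isolated character. Below is a minimal sketch of a single-cell call, assuming the cell is a PIL image with a dark digit on a light background. The helper names, the scale factor, and the border width are hypothetical choices for illustration, and whether this resolves the empty result for these particular images is not confirmed.

```python
def build_single_char_config(whitelist="0123456789"):
    """Build a Tesseract config string for one isolated character."""
    # --psm 10 asks Tesseract to treat the image as a single character.
    # tessedit_char_whitelist restricts output to the listed characters
    # (it may be ignored by some Tesseract versions and engines).
    return f"--psm 10 -c tessedit_char_whitelist={whitelist}"


def ocr_single_digit(cell, scale=3, border=20):
    """OCR one cell image (PIL.Image) assumed to contain a single digit.

    `scale` and `border` are illustrative guesses, not tuned values.
    """
    # Imported here so the config helper above can be used on its own.
    from PIL import Image, ImageOps
    import pytesseract

    # Work in grayscale so the border fill value below is a plain intensity.
    gray = cell.convert("L")
    # Upscale the small cell; tiny glyphs are often harder to recognise.
    gray = gray.resize((gray.width * scale, gray.height * scale), Image.LANCZOS)
    # Add a white margin so the digit does not touch the image edge.
    padded = ImageOps.expand(gray, border=border, fill=255)

    text = pytesseract.image_to_string(padded, config=build_single_char_config())
    # Drop the trailing newline / form feed that Tesseract appends.
    return text.strip()
```

With the variables from the question, this would be called as `ocr_single_digit(p1)`. For the full row, `--psm 7` (treat the image as a single text line) is the corresponding mode.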