I'm developing an Android app in which the user uploads a picture of their shop (it's from a game), showing all the items they are selling and have already sold. I'm on the last part now: I need to detect which items have been sold.
Unfortunately, the picture is low quality because it is taken with a mobile phone. However, I think that if I clean the image using cv2's grayscale, threshold and Canny methods, I can then read the information with Tesseract OCR.
Here is the structure of the images I'm working with in this final part:
If I apply, for example, a simple grayscale conversion to the second image, I get the following:
And finally, if I run all of the following code, I get this result:
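import cv2

# get_grayscale, thresholding, opening and canny are helper functions defined elsewhere in my script
img = cv2.imread('recorte.jpg')
gray = get_grayscale(img)
thresh = thresholding(gray)
# Result names differ from the helper names so the functions are not overwritten
opened = opening(gray)
edges = canny(gray)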
I know the result is far from professional, but I really need help. If I run tesseract on it, I get the following result:
reels Ae coe vel boise)
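For context, a tesseract call with an explicit configuration could look like the sketch below. It assumes the pytesseract wrapper is installed; `build_tesseract_config` and `read_text` are hypothetical helper names, and the chosen values (page segmentation mode 6, engine mode 3, an optional character whitelist) are illustrative guesses, not the settings that produced the output above.

```python
def build_tesseract_config(psm=6, oem=3, whitelist=None):
    # --psm: page segmentation mode (6 = assume a single uniform block of text).
    # --oem: OCR engine mode (3 = default, based on what is available).
    parts = [f"--oem {oem}", f"--psm {psm}"]
    if whitelist:
        # Restrict recognition to the given characters; whether this is
        # honoured depends on the installed Tesseract version.
        parts.append(f"-c tessedit_char_whitelist={whitelist}")
    return " ".join(parts)


def read_text(image, config):
    # Imported lazily so the config can be built without pytesseract installed.
    import pytesseract
    return pytesseract.image_to_string(image, config=config)
```

For example, `read_text(thresh, build_tesseract_config(psm=7))` would treat the cleaned image as a single line of text, which may suit a cropped item row better than the default layout analysis.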
In summary, I think I'm heading in the right direction: all I need to do is clean the image correctly and run tesseract with a proper configuration to finally get a good result. I hope you can help me. Thank you!