I have hundreds of images of handwritten notes. They were written from different people but they are in sequence so you know that for example person1
wrote img1.jpg
-> img100.jpg
. The style of handwriting varies a lot from person to person but there are parts of the notes which are always fixed, I imagine that could help an algorithm (it helps me!).
I tried tesseract
and it failed pretty bad at recognizing the text. I'm thinking since each person has like 100 images is there an algorithm I can train by feeding it a small number of examples, like 5 or less and it can learn from that? Or would it not be enough data? From searching around it seems looks like I need to implement a CNN
(e.g. this paper).
My knowledge of ai
is limited though, is this something that I could still do using a library and some studying? If so, what should I do going forward?