I'd like to know if there are any techniques/APIs that can be used to do fast screen fonts OCR?
The following is taken for granted:
- the text to OCR shall come from a screenshot and shall be rendered using screen fonts
- the text to OCR may or may not be anti-aliased
- anti-aliasing may or not use RGB decimation (aka sub-pixel AA aka ClearType etc.)
- the screenshot may be in RGB or RBG order
- the baseline is trivial to find (just look at all the screen fonts: baseline appear very clearly and are easy to find algorithmically)
- a lot of errors are allowed (characters recognition doesn't need to be 100% correct at all)
- fonts are basically known in advance but how exactly the fonts are rendered is not (the size is unknow, the color is unknown, the type of anti-aliasing is unknown). Basically what is known is that it's going to be very common fonts
So I suppose it's not anywhere near as complicated as doing "real" OCR: finding the baseline and "cutting" each character is quite easy to do (I've already done it).
Does anyone know about specific techniques or paper(s) or even APIs allowing to do such a feat?
Note that: this question is not about screen-scraping. This question is not about breaking CAPTCHAs. This question is not about regular OCR (as in OCRing a scanned text). This question is not about GUI-automation (altough some may use it that way).