This SO answer suggests that training Tesseract with .tif files has an advantage over .png files, because a .tif file can contain multiple pages and thus provide a larger training sample. Yet this SO question discusses procedures for training with multiple images at once. Moreover, the man page for, e.g., mftraining suggests that it can accept multiple training files.
Is there any reason then not to train with multiple separate image files?