I have looked at existing questions, i.e.:
but have failed when attempting to implement the suggested answers, getting incorrect or unexpected responses.
My constant issue (which seems to be a common one) is that Tesseract constantly assumes that the letter I should be read as a pipe character in literally every image I scan in.
I will almost never have a pipe character in what I'm reading -- virtually 100% of those have been letter Is.
I have tried the tessedit_char_blacklist variable to exclude both pipes and exclamation marks - if I omit pipes, it goes to exclamation marks. If both are excluded, the character is just omitted.
I am on Tesseract v5.0.1.20220118 on Windows 10.
Any help would be appreciated; I imagine I can't be the only person who has this issue.