I have searched but i can't find an algorithm to do despeckling in a scanned document. I have already used Median filter to remove some smaller speckles.
I need an algorithm that removes large speckles from a document, I have tried using Connected Component Labeling (using Aforge) but there is no option to color an object white (remove it) if it is larger than X pixels.
Is there any way to delete objects on my picture that are larger than X pixels (specks, blobs, noise)?
Edit:
Here is the document which I am trying to prepare for OCR. Original Document
As we can se there is a lot of noise on the left of the text. So then I use Blur to blur the noise on the left, and then I Binarized the document and got something like this After Binarization
Now I need to remove the large black area from the left. I just dont know how to do that...