I have to detect all text zone of a newspaper basing on edges automatically (new ideas are welcome).
The result that i need is many TXT file containing each one an article. Take a look to this demonstration.
I have to detect all text zone of a newspaper basing on edges automatically (new ideas are welcome).
The result that i need is many TXT file containing each one an article. Take a look to this demonstration.
This question is way too broad.
If you want a proper answer then you need to ask very specific questions and show us what you have tried. We don't even know what formats you have to work with so we cannot offer any real help other than guessing.
Having said that:
You probably want to look into using image recognition software.
A good API to look into is OpenCV: http://opencv.org/
Here is a tutorial on how to use OpenCV with Java: http://docs.opencv.org/3.0-last-rst/doc/tutorials/introduction/desktop_java/java_dev_intro.html
And here are two similar questions that may help you: