Some web pages that I process in Jsoup are heavy. By "heavy" I mean the page either contains lots of HTML (let's assume the page has already been downloaded), or it requires several iterations on the same document (created only once via Jsoup.parse()).
For that reason, I would like to present to the user a progress bar with a guesstimate of how much time is left.
One approach is to just measure the volume of HTML (in KB or MB) and come up with a speed factor (unfortunately, totally dependent on speed of the system this code runs on).
Another approach is to count the number of nodes?
Due to the obvious in-deterministic nature of this, am I calling for trouble?
Ideas of better ways to handles this?