I am currently using crawler4j to crawl a website and return each page's URL along with the URL of its parent page. I am using the basic crawler, which works fine except that it does not return the PDFs. I know it is crawling the PDFs, because I checked what it crawls before the filter is added and the PDFs show up. The PDFs seem to disappear (get skipped) when it enters
public void visit(Page page) {
I have no clue why it is doing this. Can anyone help me with this? It would be greatly appreciated! Thanks.
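
For context, here is a minimal, self-contained sketch of the kind of URL filter mentioned above, assuming it is a regular expression on file extensions like the ones commonly used with crawler4j's basic crawler. The class name `FilterSketch`, the extension list, and the example URLs are hypothetical and are not taken from the actual crawler code.

```java
import java.util.regex.Pattern;

public class FilterSketch {
    // Hypothetical extension filter. Because "pdf" is in the list,
    // any URL ending in .pdf is treated as filtered out.
    private static final Pattern FILTERS =
            Pattern.compile(".*\\.(css|js|gif|jpe?g|png|mp3|mp4|zip|gz|pdf)$");

    // Returns true when the URL matches the filter (i.e. would be rejected).
    static boolean isFiltered(String url) {
        return FILTERS.matcher(url.toLowerCase()).matches();
    }

    public static void main(String[] args) {
        System.out.println(isFiltered("http://example.com/docs/report.pdf"));
        System.out.println(isFiltered("http://example.com/index.html"));
    }
}
```

With a filter like this, a PDF URL would be rejected before `visit` is ever reached, while an HTML page would pass through. Whether the real filter contains `pdf` is an assumption worth checking.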