I am looking to read a pdf and extract the text from it. The pdf is present in one of the url's and I don't wish to download it. I wish to read it on-the-go from the internet. Is this even possible?
I tried using 'Tika' but it doesn't really work. It gave me error:
2019-08-29 15:39:15,416 [MainThread ] [WARNI] Tika server returned status: 500 {'status': 500}
from tika import parser
URL_path = "http://www.---path to .pdf"
raw = parser.from_file(URL_path)
print(raw)