I am using tika
for extracting text from pdf in python
. But, it downloads the .jar on every run. which is time consuming.
[MainThread ] [INFO ] Retrieving http://search.maven.org/remotecontent?filepath=org/apache/tika/tika-server/1.19/tika-server-1.19.jar to /tmp/tika-server.jar.
This happens every time I run the code. Is there a way to manually do it once and stop tika
to do it everytime?