I run a Python script that loads Keras, TensorFlow, and my Keras model. After that I can start making predictions, but loading everything takes a few seconds.
I can loop inside the Python script and get good performance when predicting in batches, but I also want good performance for independent prediction requests coming from PHP.
Has anyone had success with an approach like this? How can I make the Python script run as a service?