I have installed PredictionIO locally, trained the engine using the Universal Recommendation template that I modified for my needs, everything looks fine.
Now that I know that this could fit my needs, I desire to deploy it to production, unfortunately, there is not much documentation about it.
Ideally, I would like to have everything deployed on AWS, there is a part of the documentation describing it, but useless since the CloudFormation template is disabled.
I was thinking about using maybe using Docker to achieve it, but I lack of knowledge about the whole stack and would like to understand the following:
where should the data be stored?
hbase
seems to be the "database", isn't it dangerous to have it on the same server as the rest (event server, prediction server)?how does it scale? do I need multiple instances of PredictionIO running behind a load balancer or is one enough? if so, how to achieve that?
what is a good distributed architecture? in order to scale, I'm pretty sure we will need to separate the EventServer from the PredictionServer, what is the good way to do this?
Hope someone can help. Thanks. Cyril