We can see how to view heroku logs, as well as how to write the last n
lines as a text file.
Is there any established pattern for sensible and easy log storage, (potentially ETL), and analysis?
At least, this would involve:
- storing logs
- moving logs (e.g. via an ETL) to somewhere they can be analysed en mass (e.g. AWS S3 or GCP GCS)
Is there any established pattern to achieve this?
Background
Why would anyone want logs en mass? In case it's relevant, a specific task I'm trying to achieve is to use bayesian inference on web logs to answer questions like: "if a person clicked on A, B and C, then they're x% likely to click on D" (so as to better understand which other pages a user may be interested in, and therefore suggest more relevant pages to the user). This is all pretty straight forward in python or R. But obviously one needs access to the logs (all the logs) before such data science can be carried out.
What I know so far
- Heroku provides several logging addons