I would like to know what are the best practices for building Predictive Modeling solutions organically ? Some of the questions I have are :-
- If I have multiple R model files, what are efficient ways of storing them ?
- Save as .Rdata files on file system
- Serialize to a DB as binary objects
- Since data is processed to create an interim model specific format, is it helpful to use such paradigms as PMML ?
- Also, should one consider such practices as MVC (I'm not a trained software developer, so any insights into such development practices would be very helpful)
I apologize for the open-ended nature of this question. I wish to understand even simple things as recommended folder structure for data staging, model store, scripts collection and such other elements of a data mining solution.
I would be very grateful to members of the community for sharing their experiences and recommendations. Thank you for your time.