Let's say I fit IsolationForest()
algorithm from scikit-learn on time-series based Dataset1 or dataframe1 df1
and save the model using the methods mentioned here & here. Now I want to update my model for new dataset2 or df2
.
My findings:
- this workaround about Incremental learning from sklearn:
...learn incrementally from a mini-batch of instances (sometimes called “online learning”) is key to out-of-core learning as it guarantees that at any given time, there will be only a small amount of instances in the main memory. Choosing a good size for the mini-batch that balances relevancy and memory footprint could involve tuning.
but Sadly IF algorithm doesn't support estimator.partial_fit(newdf)
How I can update the trained on Dataset1 and saved IF model with a new Dataset2?