Apache Spark MLlib algorithms (e.g., Decision Trees) save the model in a location (e.g., myModelPath
) where it creates two directories, viz. myModelPath/data
and myModelPath/metadata
. There are multiple files in these paths and those are not text files. There are some files of format *.parquet
.
I have couple of questions:
- What are the format of these files?
- Which file/files contain actual model?
- Can I save the model to somewhere else, for example in a DB?