I have a dataframe which looks like this:
[ID_number,cust_number,feature1,feature2,feature3,....]
Now I want to write a query that groups by ID_number and applies a User Defined Function on the subsets
[cust_number,feature1,feature2,feature3,......]
grouped by each ID_number I need to apply Machine Learning algorithms on the features and store the weights somehow.
How do I do this using Apache Spark DataFrames(Using Scala) ?