I am working on a project: financial analysis to predict stock outcomes. And I am using pyspark to do it.
df2=df.rdd.map(lambda x:(x[0], DenseVector(x[1:])))
I ran this command to create a dataframe from another dataframe that I have to do scaling and, later to create a regression model. I get an enormous bunch of errors saying
cannot convert String to Float
Please help !