I have a large Data set which i want to import into databricks to do some analytics using scala. The data set is available in this link : https://drive.google.com/open?id=1g4YYALk3nArN8bX2uFS70IpbdSf_Efqj
I want to import this data set such that , the document ID is in the first column and the other test data in the other column.
But when i import the data using following code , it looks like this
val df = spark.read.text("FileStore/tables/plot_summaries.txt")
df.select("value").show()
Can anyone help me to import this in the proper way ? Any help would be highly appreciated. Thank you