I'm trying to test a part of my program that performs transformations on dataframes. I want to test several different variations of these dataframes, which rules out reading a specific dataframe from a file.
So my questions are:
- Is there any good tutorial on how to perform unit testing with Spark and dataframes, especially regarding the dataframes creation?
- How can I create these various multi-row dataframes without a lot of boilerplate and without reading them from a file?
- Are there any utility classes for checking for specific values inside a dataframe?
I did search for this beforehand but could not find anything very useful. Among the more useful links I found were:
It would be great if the examples/tutorials were in Scala, but I'll take whatever language you've got.
Thanks in advance