What is strongly-typed API and an untyped API with respect to Spark Datasets ?
How Datasets are similar/dissimilar to DataFrames?
What is strongly-typed API and an untyped API with respect to Spark Datasets ?
How Datasets are similar/dissimilar to DataFrames?
Dataframe API's are untyped API's since the type will only be known during the runtime. Whereas dataset API's are typed API's for which the type will be known during the compile time.
df.select("device").where("signal > 10") // using untyped APIs
ds.filter(_.signal > 10).map(_.device) // using typed APIs