
I have a DataFrame df with the following schema:

ts:double
key:long
val:long
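
For reference, a minimal sketch of how such a DataFrame could be built (the values here are made up; ts is assumed to be seconds since the epoch):

import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.types.TimestampType

val spark = SparkSession.builder().master("local[*]").getOrCreate()
import spark.implicits._

// Made-up sample rows matching the schema above.
val df = Seq(
  (1472135639.0, 1L, 100L),
  (1472135699.5, 2L, 200L)
).toDF("ts", "key", "val")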

I want to convert the ts field to a Spark TimestampType, but without dropping any of the other columns. I know how to do a select, something like:

val new_df = df.select($"ts".cast(TimestampType))

However, new_df then has only one column (as expected). I could do a join and be done, but that's probably not a good approach. I would like to do something like

val new_df = df.map(udf(col("ts"))) // pseudocode

that would produce a new_df with columns ts (correctly cast), key and val.


1 Answer


All you need is either withColumn:

df.withColumn("ts", $"ts".cast(TimestampType))

or a simple select that re-lists the columns (a select($"*", ...) would keep the original double ts alongside the cast copy):

df.select($"ts".cast(TimestampType), $"key", $"val")