I followed the example in this post to write out a DataFrame
as a csv to an AWS S3 bucket. The result was not a single file but rather a folder with many .csv files. I'm now having trouble reading in this folder as a DataFrame
in SparkR. Below is what I've tried but they do not result in the same DataFrame
that I wrote out.
write.df(df, 's3a://bucket/df', source="csv") #Creates a folder named df in S3 bucket
df_in1 <- read.df("s3a://bucket/df", source="csv")
df_in2 <- read.df("s3a://bucket/df/*.csv", source="csv")
#Neither df_in1 or df_in2 result in DataFrames that are the same as df