Write PySpark Dataframe to GCS with Overwrite

Asked Jan 08 '20 at 14:34

Active Jan 08 '20 at 14:34

Viewed 141 times

When writing a DataFrame as CSV from PySpark to GCS, how do I overwrite an existing file? If I run this code twice, the second run fails with the error below.

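from pyspark.sql import Row

# assumes an existing SparkSession named `spark`
test_row = Row(id=1)
test_df = spark.createDataFrame([test_row])

# save the Spark DataFrame to GCS
p = "test_file"
bucket = "test_bucket"
table_id = "test_folder"
q = "gs://{}/{}/{}".format(bucket, table_id, p)
test_df.write.csv(q)  # the second run raises the error below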

'path gs://test_bucket/test_folder/test_file already exists.;'


user147529

Did you get any solution for this? @user147529 – Shadab Hussain Jun 24 '20 at 01:03
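For the question above, one approach that may work is to set the writer's save mode to "overwrite" before calling csv(). The default save mode raises an error when the target path exists, which matches the message in the question. This is a minimal sketch, not a confirmed fix for this setup: the helper names gcs_path and write_csv_overwrite are hypothetical, and it assumes the GCS connector is configured and the job has write access to the bucket.

```python
def gcs_path(bucket, folder, name):
    """Build a gs:// URI from its parts (hypothetical helper)."""
    return "gs://{}/{}/{}".format(bucket, folder, name)


def write_csv_overwrite(df, path):
    """Write df as CSV to path, replacing anything already at that path.

    df is expected to be a pyspark.sql.DataFrame. mode("overwrite") is a
    standard DataFrameWriter save mode; the default mode is the one that
    fails with "path ... already exists".
    """
    df.write.mode("overwrite").csv(path)
```

With the variables from the question, this would be called as write_csv_overwrite(test_df, gcs_path(bucket, table_id, p)), and it should be safe to run repeatedly. Note that Spark writes test_file as a directory of part files, so overwrite replaces that whole directory rather than a single CSV file.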


0 Answers