I want to read parquet files from an S3 bucket.
Here's my code:
for obj in bucket.objects.filter(Prefix=f'some_prefix/'):
response = obj.get()
df = pd.read_parquet(response['Body'],columns=relevant_columns)
#some data processing
df.to_csv('some_path',
storage_options = {'key': key, 'secret': secret},index=False)
I get this error:
ArrowInvalid: Called Open() on an uninitialized FileSource