I am trying to access line delimited JSON data on S3. From my understanding of the docs I should be able to do something like
print data(S3(Chunks(JSONLines))('s3://KEY:SECRET@bucket/dir/part-*.json').peek()
which throws
BotoClientError: BotoClientError: Bucket names cannot contain upper-case characters when using either the sub-domain or virtual hosting calling format.
I have tried variations on this leading to different errors.
I can get the following to work with local files:
print data(chunks(JSONLines)(map(JSONLines, glob("/home/me/data/*")))).peek()
I am not really sure why the (map(JSONLines, glob(
is needed, though.
I do not really understand how to work with type-modofiers