This answer got me as far as reading a file on S3, but I have some huge files (CSV/txt) there that I need to randomly sample down to a manageable size for local processing. Since this approach reads the full object, it won't work for files this size. I would also like to be able to read sequentially, so I can select certain records by field content.
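To make the goal concrete, here is a rough sketch of the two operations I have in mind, written against a plain iterator of text lines. It assumes I already have some way to stream the object line by line, which is exactly the part I'm missing, so nothing here talks to S3. The names `reservoir_sample` and `select_by_field` are made up for illustration, and reservoir sampling is just one way to get a fixed-size random sample in a single sequential pass.

```python
import csv
import random
from typing import Iterable, Iterator, List, Optional


def reservoir_sample(lines: Iterable[str], k: int, seed: Optional[int] = None) -> List[str]:
    """Keep a uniform random sample of k lines using one pass and O(k) memory."""
    rng = random.Random(seed)
    sample: List[str] = []
    for i, line in enumerate(lines):
        if i < k:
            # Fill the reservoir with the first k lines.
            sample.append(line)
        else:
            # Replace an existing entry with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                sample[j] = line
    return sample


def select_by_field(lines: Iterable[str], field_index: int, wanted: str,
                    delimiter: str = ",") -> Iterator[List[str]]:
    """Yield parsed rows whose field at field_index equals wanted."""
    for row in csv.reader(lines, delimiter=delimiter):
        if len(row) > field_index and row[field_index] == wanted:
            yield row


# Stand-in for a streamed object: a generator, so nothing is held in memory at once.
fake_stream = (f"{n},{'x' if n % 2 else 'y'}" for n in range(1000))
picked = reservoir_sample(fake_stream, k=5, seed=0)
matches = list(select_by_field(["a,x", "b,y", "c,x"], field_index=1, wanted="x"))
```

Both functions only ever look at one line at a time, so they would work on any source that can hand over lines sequentially without loading the whole object.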
Any ideas?