I would like to capture certain information about each file that is in HDFS, such as: name, creation date, modification and last access. I thought about doing it using the Python OS module, but I'm not sure if it would be possible and also how to do it. Another alternative I thought would be to use the HDFS module itself, but the information about it on the internet is scarce and made it even more difficult.
Does anyone have any idea how I might be doing this?