I am trying to build a python script to parse a huge amount of data. I will be generating data from an existing tool which will be parsed by python tool and put into an excel sheet. I haven't yet figured out how the input data has to be. Is there any particular format or patterns anyone would suggest to make parsing easier? The approach in my mind is to use regular expressions and find places in junk data to identify blocks and such.
Is there any standard or format - anything of that sort which will improve the parsing as regular expressions can only be relied on assuming the format of inut data wont change
I believe regex is a bad idea as its error prone. This is why i am seeking other options. Here, i have the option to format or modify raw data also unlike usual scenarios. So, i would like to know all possible ways to make the report generation easier.