i do use python 3.7 to automate some processes which include using dataframe
problem i got is as follow.
using this code:
data=pd.io.parsers.read_csv(basepath + files[0],sep='|',header=None,index_col=None,dtype={'2': 'str'},skiprows=2,usecols=[2,3,10,18,17,1])
The file is so huge its impossible to track every mistake with 00 , and not all number out there are 10 char long some are 9 char long it depends.
i expect result as follows:
4 12345 abcd P1234 A1234
but some lines in column 2 are with 00 at start dataframe automaticaly thinks it's integer and get rid of it to be efficient so sometimes it should be:
4 00123 abcd P1234 A1234
but i end up with
4 123 abcd P1234 A1234
so i chcek documentation to pandas and tried adding dtype it doesn't work for me. Any suggestions how to make it work?