I am trying to read a 4GB CSV file using pandas using the code below:
tp = pd.read_csv('train.csv', sep='\t', iterator=True, chunksize=10000)
train = pd.concat(tp, ignore_index=True)
After using this I am able to read the csv but when I used :
(train.shape)
it shows number of columns to be "1" but there are 24 columns.
I also tried to use sep as ',' but doing that returns me the optput on console as killed. I am using GC instance with 8GB RAM so no issues from that side. Also, if I try reading the CSV using :
pandas.read_csv("train.csv")
this fails for that I have recommended various other questions on stackoverflow which recommended me to read data in chunks.