I am using python 2.7 and Pandas to load a bit large csv file (~10G) using Pandas 'read_csv'
method. This action used to take 3-4 minutes until today, and suddenly it started taking hours without completing. The machine has 30G RAM and multiple CPUs, I checked the usage and nearly all of the memory and CPUs are free. Also the process's status is 'D' for most of the time (linux machine) which I read that usually happens during a wait for an I/O?
How can I debug this to find what's causing the problem?
Thank you