Im trying to learn the basics in data science, so I'm attempting to mine the following dataset (csv) using python.
https://www.kaggle.com/jessevent/all-crypto-currencies
70mg file
My setup
6 Core bulldoser AMD
120 GB SSD
2GB data drive
Ubuntu 12.04 with libvirt KVM/QEMU
Python 3
Python 2.7
It has transpired that my setup can't crunch the numbers as I need to save in execl format .xlsx and i cant save the file as the system hangs.
I'm wondering if its possible to use the native csv file and run numpy, matplotlib and pandas etc, and not call the data using .excel.
I need to be able to use the the dataset, for doing basic data exploration, data cleaning and model construction, validation etc