I am new to data science and data analytics, so I hope my question is not too naive. I am currently trying to open a file with pandas in Python for machine learning purposes, and it would be ideal for me to have all of the data in a single DataFrame. The file is 18 GB and my machine has 32 GB of RAM, but I keep getting memory errors.
- In your experience, is this possible?
- If not, do you know of a better way to go about this? (A Hive table? Increasing my RAM to 64 GB? Creating a database and accessing it from Python?) Any input is welcome!
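
For context, one more option besides those listed is to read the file in pieces with narrower column types. The sketch below assumes the file is a CSV with numeric columns, which the description above does not actually state. The helper `load_in_chunks`, the column names, and the chunk size are all made up for illustration. The idea rests on the fact that pandas defaults to 64-bit numbers and Python string objects, so the in-memory DataFrame can end up larger than the file on disk.

```python
import io

import pandas as pd


def load_in_chunks(source, dtypes, chunksize=1_000_000):
    """Read a CSV in fixed-size pieces and concatenate them.

    `dtypes` maps column names to narrower types (e.g. float32 instead
    of the default float64), which can reduce memory for numeric data.
    """
    reader = pd.read_csv(source, dtype=dtypes, chunksize=chunksize)
    pieces = [chunk for chunk in reader]  # each piece is a small DataFrame
    return pd.concat(pieces, ignore_index=True)


# Tiny in-memory stand-in for the real file (hypothetical columns).
sample = io.StringIO("id,value\n1,0.5\n2,1.5\n3,2.5\n")
dtypes = {"id": "int32", "value": "float32"}

df = load_in_chunks(sample, dtypes, chunksize=2)
print(df.dtypes)
print(df.memory_usage(deep=True).sum(), "bytes in memory")
```

With the real file, `source` would be its path. If the full data still does not fit, each chunk could be filtered or aggregated inside the loop instead of being kept whole.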
Thanks in advance.