I am trying to read this dataset from Kaggle: Amazon sales rank data for print and kindle books
The file amazon_com_extras.csv
has a column named "Title" that sometimes contains a comma ',' so all the fields in this .csv are enclosed by quotation marks:
"ASIN","GROUP","FORMAT","TITLE","AUTHOR","PUBLISHER"
"022640014X","book","hardcover","The Diversity Bargain: And Other Dilemmas of Race, Admissions, and Meritocracy at Elite Universities","Natasha K. Warikoo","University Of Chicago Press"
I have read other questions related to this problem but none of them solve it. For example, I have tried:
df = pd.read_csv("amazon_com_extras.csv",engine="python",sep=',')
df = pd.read_csv("amazon_com_extras.csv",engine="python",sep=',',quotechar='"')
But nothing seems to work. I am using Python 3.7.2 and pandas 0.24.1.