I have a large dataset(50k rows) and I wanted to create a histogram from the data with density along the Y axis and values log scaled on the x axis, with a KDE plot superimposed.
This is very small subset of the data being used:
A B C
1 1 4200
1 4 94000
1 4 81000
1 3 30000
1 3 29000
1 1 20400
Current code:
columns= ['A','B','C']
df=pd.read_csv('data.csv', skipinitialspace=True, usecols=columns)
data=df[['C']].dropna().values
data=np.logspace(data)
plt.hist(data, bins='auto')
I currently get the following error logspace() missing 1 required positional argument: 'stop' When I don't use logspace I am able to get a histogram, but not the one I am looking for. I am very new to python so the help is appreciated.