Hi, I built a correlation matrix and want to delete all labels and values at or below a threshold XX. I am converting it to a Series so I can iterate over the values, but I don't know how to delete the rows. The next step is to convert it back to a DataFrame. Maybe you know a better way.
Here is a mini-example:
import numpy as np
import pandas as pd
data = np.random.rand(4,4)
df = pd.DataFrame(data, index=['varname1', 'varname2', 'varname3', 'varname4'],
                  columns=['longname1', 'longname2', 'longname3', 'longname4'])
corr = abs(df.corr().stack())
corr = corr[corr.index.get_level_values(0) != corr.index.get_level_values(1)]  # drop the diagonal (each variable paired with itself)
for i in range(len(corr.keys())):
    if corr[i] <= 0.2:
        corr = corr.drop(corr[i])  # how can I delete the rows?
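
For what it's worth, here is a minimal sketch of one way this could be done without the loop, assuming the goal is to keep only pairs whose absolute correlation is above the cutoff: filter the stacked Series with a boolean mask, then call `unstack()` to get a DataFrame back. The names `threshold`, `strong` and `result` are made up for illustration, 0.2 is the cutoff from the example above, and the seeded generator is only there so the sketch is reproducible.

```python
import numpy as np
import pandas as pd

# Seeded generator: an assumption, only to make the sketch reproducible.
rng = np.random.default_rng(0)
data = rng.random((4, 4))
df = pd.DataFrame(data, index=['varname1', 'varname2', 'varname3', 'varname4'],
                  columns=['longname1', 'longname2', 'longname3', 'longname4'])

threshold = 0.2  # hypothetical name for the cutoff "XX"

# Absolute correlations as a Series indexed by (row label, column label).
corr = df.corr().abs().stack()

# Drop the diagonal, i.e. each variable paired with itself.
corr = corr[corr.index.get_level_values(0) != corr.index.get_level_values(1)]

# Boolean mask instead of a loop: keep only the rows above the cutoff.
strong = corr[corr > threshold]

# Back to a DataFrame; pairs that were removed show up as NaN.
result = strong.unstack()
print(result)
```

If only the matrix form is needed, applying `DataFrame.where` to `df.corr().abs()` with the same condition would probably avoid the stack/unstack round trip, at the cost of keeping the diagonal unless it is masked separately.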