I have a set of columns (col1,col2,col3) in dataframe df1 I have another set of columns (col4,col5,col6) in dataframe df2 Assume this two dataframes has the same number of rows.
How do I generate a correlation table that do pairwise correlation between df1 and df2?
the table will look like
col1 col2 col3
col4 .. .. ..
col5 .. .. ..
col6 .. .. ..
I use df1.corrwith(df2)
, it does not seem to generate the table as required.
I have a asked a similar question here: How to perform Correlation between two dataframes with different column names but now I am dealing with categorical columns.
If it is not comparable directly, is there a standard way to make it comparable (like using get_dummies)? and is that a faster way to automatically process all fields (assume all are categorical) and calculate their correlation?