I am using pandas to compare two large data frames that have been pulled from two Excel workbooks, and after merging and comparing differences, I noticed that the cells that didn't merge have unusual accents and trademark symbols that they did not have in the Excel workbooks. I surmise that these symbols are why the merges didn't occur, but I am not sure why they are showing up in the pandas dataframes or how to get rid of them/get them to be recognized.
I am not sure if this an encoding issue within Python, or an importing issue from Excel --> Python.
The symbols only occur in one column of the dataframe, called 'com_it'. I've tried to replace the symbols using the following, but it did not work:
com['com_it'].replace(u"\u00AE", '', inplace=True)
pano2['com_it'].replace(u"\u00AE", '', inplace=True)
Any help with how to get the symbols to disappear or otherwise get these cells merged would be greatly appreciated!