I'm trying to pull a list of unique values out of a data frame, but I keep getting one value that I can't find anywhere in the original data. Has anyone run into something like this before?
I read in a text file:
import pandas as pd
tmpPandaObj = pd.read_csv(fn, sep='\t', header=None)
tmpPandaObj.columns = ['stockId','dt','hhmm','seq','ecalls']
Pull out the unique values:
uniqueStockIdVec = tmpPandaObj.stockId.unique()
Yet '\ufeff19049' keeps showing up in the unique vector. I've searched both the text files and the data frame for '\ufeff19049' and can't find it anywhere. The only unique values should be '19049', '24937', and '139677'.
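One possible way to check where the value comes from: '\ufeff' is the Unicode byte order mark (BOM), which some editors write at the very start of a UTF-8 file. It is invisible in most viewers, which may be why a text search does not find it. The sketch below uses only the standard library; the sample row and the helper names `starts_with_utf8_bom` and `first_field` are made up for illustration and are not taken from the real files.

```python
import codecs
import os
import tempfile


def starts_with_utf8_bom(path):
    """Return True if the file begins with the three UTF-8 BOM bytes."""
    with open(path, "rb") as fh:
        return fh.read(3) == codecs.BOM_UTF8


def first_field(path, encoding):
    """Decode the first line with the given encoding and return its first tab-separated field."""
    with open(path, encoding=encoding) as fh:
        return fh.readline().rstrip("\n").split("\t")[0]


# Hypothetical sample file: a BOM followed by one made-up tab-separated row.
with tempfile.NamedTemporaryFile("wb", suffix=".txt", delete=False) as tmp:
    tmp.write(codecs.BOM_UTF8 + b"19049\t20150101\t0930\t1\t5\n")
    sample_path = tmp.name

has_bom = starts_with_utf8_bom(sample_path)
plain = first_field(sample_path, "utf-8")         # BOM stays attached to the first value
stripped = first_field(sample_path, "utf-8-sig")  # the codec drops a leading BOM

os.remove(sample_path)
print(has_bom, repr(plain), repr(stripped))
```

If `starts_with_utf8_bom(fn)` is true for the real file, passing `encoding='utf-8-sig'` to `pd.read_csv` may be enough to get rid of the stray value, assuming the file is otherwise plain UTF-8.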