I have a column in a dask dataframe that contains comma-separated lists of different categories. I'm looking to replicate the functionality of sklearn's MultiLabelBinarizer or the pandas method Series.str.get_dummies(','), exactly as this thread describes: Create dummies from column with multiple values in dask
Is there really no way to do this, as the one answer there states? Is there a way to implement it if I already have a list of all of the possible values?