I am having a dataframe with following values:
sentence_id words labels
3822445 ['a', 'b', 'c', ''] ['B-PER', 'I-PER', 'I-PER', 'I-PER']
3822446 ['d', 'e', ''] ['B-PER', 'I-PER', 'I-PER']
3822447 ['f', 'g', 'h'] ['B-PER', 'I-PER', 'I-PER']
Exepcting output as:
sentence_id words labels
3822445 'a' 'B-PER'
3822445 'b' 'I-PER'
3822445 'c' 'I-PER'
3822445 '' 'I-PER'
3822446 'd' 'B-PER'
3822446 'e' 'I-PER'
3822446 '' 'I-PER'
3822447 'f' 'B-PER'
3822447 'g' 'I-PER'
3822447 'h' 'I-PER'
I have tried:
dataframe.set_index(['sentence_id']).apply(pd.Series.explode).reset_index()
but giving same output as input. Don't know what's going wrong.